database
Milvus is an open-source vector database built to power embedding similarity search and AI applications. Milvus makes unstructured data search more accessible, and provides a consistent user experience regardless of the deployment environment.
Pgpool-II is a middleware that works between PostgreSQL servers and a PostgreSQL database client.
Pgpool-II saves connections to the PostgreSQL servers and reuses them whenever a new connection with the same properties (i.e. username, database, protocol version) comes in. This reduces connection overhead and improves the system's overall throughput.
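The reuse rule described here can be illustrated with a small toy pool. This is only a sketch of the idea, not Pgpool-II's implementation: `ConnKey`, `ToyPool`, and the dictionary standing in for a backend connection are all hypothetical names invented for the example.

```python
from collections import defaultdict
from dataclasses import dataclass
from itertools import count


@dataclass(frozen=True)
class ConnKey:
    """Properties that must match for a backend connection to be reused."""
    username: str
    database: str
    protocol_version: int


class ToyPool:
    """Hypothetical pool; stands in for the middleware, not Pgpool-II's code."""

    def __init__(self):
        self._idle = defaultdict(list)  # ConnKey -> idle backend connections
        self._ids = count(1)
        self.opened = 0                 # number of backend connections created

    def acquire(self, key):
        # Reuse an idle connection with identical properties when one exists.
        if self._idle[key]:
            return self._idle[key].pop()
        # Otherwise pay the cost of opening a new backend connection.
        self.opened += 1
        return {"id": next(self._ids), "key": key}

    def release(self, conn):
        # Keep the connection open and park it for the next matching client.
        self._idle[conn["key"]].append(conn)


pool = ToyPool()
app = ConnKey("app", "orders", 3)
first = pool.acquire(app)
pool.release(first)
second = pool.acquire(app)                            # same properties: reused
other = pool.acquire(ConnKey("report", "orders", 3))  # different user: new one
```

Three clients were served, but only two backend connections were opened, which is where the saving in connection overhead comes from.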
Allows your company to implement disaster recovery solutions for PostgreSQL databases with high requirements of business continuity.
Taking an online hot backup of PostgreSQL is now as easy as ordering a good espresso coffee.
Bundle service integration of official mongodb/mongo-php-library.
This bundle integrates the official mongodb/mongo-php-library driver library (mongodb/mongodb on packagist) with your Symfony app.
PostgREST is a standalone web server that turns your PostgreSQL database directly into a RESTful API. The structural constraints and permissions in the database determine the API endpoints and operations.
Kubegres is a Kubernetes operator that deploys one or many clusters of PostgreSql instances with data replication and failover enabled out of the box. It brings simplicity to using PostgreSql, considering how complex managing a StatefulSet's life-cycle and data replication can be with Kubernetes.
Fast Open-Source OLAP DBMS.
ClickHouse® is an open-source column-oriented database management system that allows generating analytical data reports in real-time.
FerretDB is an open-source proxy that converts MongoDB 6.0+ wire protocol queries to SQL, using PostgreSQL as the database engine.
Scriptable database and system performance benchmark.
sysbench is a scriptable multi-threaded benchmark tool based on LuaJIT. It is most frequently used for database benchmarks, but can also be used to create arbitrarily complex workloads that do not involve a database server.
postgresql_anonymizer is an extension to mask or replace personally identifiable information (PII) or commercially sensitive data from a PostgreSQL database.
The project takes a declarative approach to anonymization. This means it extends the PostgreSQL Data Definition Language (DDL) so that the anonymization strategy can be specified inside the table definition itself.
Hypothetical Indexes for PostgreSQL. HypoPG is a PostgreSQL extension adding support for hypothetical indexes.
A hypothetical -- or virtual -- index is an index that doesn't really exist, and thus doesn't cost CPU, disk or any other resource to create. Hypothetical indexes are useful for finding out whether specific indexes can increase performance for problematic queries, since you can see whether PostgreSQL will use them without having to spend resources creating them.
Open source UI for Postgres databases.
Mathesar is a straightforward open source tool that provides a spreadsheet-like interface to a PostgreSQL database. Our web-based interface helps you and your collaborators work with data more independently and comfortably – no technical skills needed.
You can use Mathesar to build data models, enter data, and even build reports. You host your own Mathesar installation, which gives you ownership, privacy, and control of your data.
Provides utilities for Postgres database schema versioning. Pyrseas provides utilities to describe a PostgreSQL database schema as YAML, to verify the schema against the same or a different database and to generate SQL that will modify the schema to match the YAML description.
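The "generate SQL that will modify the schema to match the description" step can be sketched as a diff between two schema descriptions. The snippet below is an illustration under stated assumptions, not Pyrseas's API: `diff_to_sql` is a hypothetical helper, and the plain dictionaries stand in for what a parsed YAML description might look like in a much simplified form (table name mapped to column names and types).

```python
def diff_to_sql(current, desired):
    """Return SQL statements that move `current` toward `desired`.

    Both arguments map table name -> {column name: column type}. This layout
    is a hypothetical simplification, not Pyrseas's actual YAML structure.
    """
    statements = []
    for table, columns in sorted(desired.items()):
        if table not in current:
            # Table only exists in the description: create it.
            cols = ", ".join(f"{name} {ctype}" for name, ctype in columns.items())
            statements.append(f"CREATE TABLE {table} ({cols});")
            continue
        for name, ctype in columns.items():
            if name not in current[table]:
                statements.append(f"ALTER TABLE {table} ADD COLUMN {name} {ctype};")
        for name in current[table]:
            if name not in columns:
                statements.append(f"ALTER TABLE {table} DROP COLUMN {name};")
    for table in sorted(current):
        if table not in desired:
            # Table exists in the database but not in the description.
            statements.append(f"DROP TABLE {table};")
    return statements


current = {"film": {"id": "integer", "title": "text"}}
desired = {
    "film": {"id": "integer", "title": "text", "release_year": "integer"},
    "genre": {"id": "integer", "name": "text"},
}
migration = diff_to_sql(current, desired)
```

Running the same function with identical inputs yields an empty list, which corresponds to the "verify the schema against a database" use: no statements means no drift.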
Open source no-code database and Airtable alternative. Create your own online database without technical experience. Our user-friendly no-code tool gives you the powers of a developer without leaving your browser.
A Metadata Platform for the Modern Data Stack. The #1 Open Source Data Catalog.
DataHub's extensible metadata platform enables data discovery, data observability and federated governance that helps tame the complexity of your data ecosystem.
A distributed, reliable key-value store for the most critical data of a distributed system.
etcd is a strongly consistent, distributed key-value store that provides a reliable way to store data that needs to be accessed by a distributed system or cluster of machines. It gracefully handles leader elections during network partitions and can tolerate machine failure, even in the leader node.
The Open Source Firebase Alternative.
Supabase is an open source Firebase alternative. Start your project with a Postgres database, Authentication, instant APIs, Edge Functions, Realtime subscriptions, and Storage.
MongoDB Easy Change Management.
mongeez allows you to manage changes of your mongo documents and propagate these changes in sync with your code changes when you perform deployments.
A free, community-sourced, machine-readable knowledge base of digital forensic artifacts that the world can use both as an information source and within other tools.
If you'd like to use the artifacts in your own tools, all you need to be able to do is read YAML. That is it, no other dependencies. The Python code in this project is just used to validate all the artifacts to make sure they follow the specification.
Pipelined Relational Query Language, pronounced "Prequel". PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement. Like SQL, it's readable, explicit and declarative. Unlike SQL, it forms a logical pipeline of transformations, and supports abstractions such as variables and functions. It can be used with any database that uses SQL, since it compiles to SQL.
The SQL Editor and Database Manager Of Your Dreams. A modern, easy to use, and good looking SQL client for MySQL, Postgres, SQLite, SQL Server, and more.
A distributed vulnerability database for Open Source. An open, precise, and distributed approach to producing and consuming vulnerability information for open source.
DuckDB is an in-process SQL OLAP database management system.
DuckDB is a high-performance analytical database system. It is designed to be fast, reliable, portable, and easy to use. DuckDB provides a rich SQL dialect, with support far beyond basic SQL. DuckDB supports arbitrary and nested correlated subqueries, window functions, collations, complex types (arrays, structs, maps), and several extensions designed to make SQL easier to use.
Related contents:
- DuckDB - Le moteur SQL qui transforme vos données @ Korben :fr:.
- Why DuckDB is my first choice for data processing @ robinlinacre.
- DuckDB is Probably the Most Important Geospatial Software of the Last Decade @ dbreunig.com.
- Why Semantic Layers Matter — and How to Build One with DuckDB @ MotherDuck.
- Querying Billions of GitHub Events Using Modal and DuckDB (Part 1: Ingesting Data) @ noreasontopanic.
A truly Open Source MongoDB alternative. FerretDB (previously MangoDB) was founded to become the de facto open-source substitute for MongoDB. FerretDB is an open-source proxy that converts MongoDB 6.0+ wire protocol queries to SQL, using PostgreSQL as the database engine.
In-Database Machine Learning. MindsDB ML-SQL Server enables machine learning workflows for the most powerful databases and data warehouses using SQL.
Graph Database Management System. Neo4j Graph Data Platform. Blazing-Fast Graph, Petabyte Scale. With proven trillion+ entity performance, developers, data scientists, and enterprises rely on Neo4j as the top choice for high-performance, scalable analytics, intelligent app development, and advanced AI/ML pipelines.
ClickHouse® is a free analytics DBMS for big data. ClickHouse® is an open-source column-oriented database management system that allows generating analytical data reports in real-time.
Native GraphQL Database with graph backend. Dgraph is a horizontally scalable and distributed GraphQL database with a graph backend. It provides ACID transactions, consistent replication, and linearizable reads. It's built from the ground up to perform for a rich set of queries. Being a native GraphQL database, it tightly controls how the data is arranged on disk to optimize for query performance and throughput, reducing disk seeks and network calls in a cluster.
WiredTiger is a high-performance, scalable, production-quality, NoSQL, Open Source extensible platform for data management.
A JavaScript library for structured state. Database engine for JavaScript. The reactive data store for local‑first apps.
TinyBase lets you listen to changes made to any part of your data. This means your app will be fast, since you only spend rendering cycles on things that change. The optional bindings to React and pre-built components let you easily build fully reactive UIs on top of TinyBase. You even get a built-in undo stack, and developer tools!
MongoDB GUI Admin for the 21st century. The best GUI for MongoDB. Get your MongoDB data under control.
Free multi-platform database tool for developers, database administrators, analysts and all people who need to work with databases. Supports all popular databases: MySQL, PostgreSQL, SQLite, Oracle, DB2, SQL Server, Sybase, MS Access, Teradata, Firebird, Apache Hive, Phoenix, Presto, etc.
Migrate to PostgreSQL in a single command!
pgloader loads data into PostgreSQL and allows you to implement Continuous Migration from your current database to PostgreSQL. Read the White Paper to learn how to limit risks and control your budget, and start your PostgreSQL migration today!
NMIG is a database migration tool, written in Node.js and highly inspired by FromMySqlToPostgreSql. NMIG is an app intended to make the process of migrating from MySQL to PostgreSQL as easy and smooth as possible.
DBML (Database Markup Language) is an open-source DSL designed to define and document database schemas and structures. It is designed to be simple, consistent and highly readable.
Fast key-value DB in Go. BadgerDB is an embeddable, persistent and fast key-value (KV) database written in pure Go. It is the underlying database for Dgraph, a fast, distributed graph database. It's meant to be a performant alternative to non-Go-based key-value stores like RocksDB.
Version control for your database
Robust schema evolution across all your environments. With ease, pleasure, and plain SQL.
Next-generation Node.js and TypeScript ORM for Databases. Prisma helps app developers build faster and make fewer errors with an open source database toolkit for PostgreSQL, MySQL, SQL Server, and SQLite.
TiDB is a distributed NewSQL database compatible with the MySQL protocol. TiDB (pronounced /'taɪdiːbi:/ tai-D-B, etymology: titanium) is a distributed SQL database. Inspired by the design of Google F1, TiDB supports the best features of both traditional RDBMS and NoSQL.
Beringei is a high performance, in-memory storage engine for time series data.
Consus is a geo-replicated transactional key-value store that upholds strong consistency and fault tolerance guarantees across multiple data centers. By geo-replicating data, Consus can withstand correlated failures up to and including entire data centers, and reduce latency for clients by directing them to nearby replicas.
The JavaScript Database that Syncs! PouchDB is an open-source JavaScript database inspired by Apache CouchDB that is designed to run well within the browser.
PouchDB was created to help web developers build applications that work as well offline as they do online.
A fast and simple NoSQL OLAP. Sybil is an append-only analytics datastore with no up-front table schema requirements; just log JSON records to a table and run queries. Written in Go, sybil is designed for fast full table scans of multi-dimensional data on a single machine.
Sophia is a modern transactional key-value / row storage library. How does it differ from other storages?
Sophia is a RAM-Disk hybrid storage. It is designed to provide the best possible on-disk performance without degradation over time. It has guaranteed O(1) worst case complexity for read, write and range scan operations.
BuntDB is a fast, embeddable, in-memory key/value database for Go with geospatial support. BuntDB is a low-level, in-memory, key/value store in pure Go. It persists to disk, is ACID compliant, and uses locking for multiple readers and a single writer. It supports custom indexes and geospatial data. It's ideal for projects that need a dependable database and favor speed over data size.
Monitoring system & time series database. Power your metrics and alerting with a leading open-source monitoring solution.
FlatDB is a simple flat file database designed to persist data using just PHP and flat files. Perfect solution when no other database is available.
Replicating SQLite using the Raft consensus protocol. rqlite is a distributed system that provides a replicated SQLite database. rqlite is written in Go and uses Raft to achieve consensus across all the instances of the SQLite databases. rqlite ensures that every change made to the database is made to a quorum of SQLite files, or none at all.
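The "quorum of SQLite files, or none at all" rule can be illustrated with a toy that uses only Python's standard `sqlite3` module. This is not Raft and not rqlite's code: there is no leader election, no log, and no catch-up for nodes that missed a write. `Node` and `quorum_write` are hypothetical names, and the `up` flag simply simulates an unreachable replica.

```python
import sqlite3


class Node:
    """One replica: an in-memory SQLite database that may be unreachable."""

    def __init__(self, up=True):
        self.db = sqlite3.connect(":memory:")
        self.db.execute("CREATE TABLE kv (k TEXT PRIMARY KEY, v TEXT)")
        self.db.commit()
        self.up = up  # simulated reachability, an assumption of this sketch


def quorum_write(nodes, sql, params=()):
    """Apply `sql` on every reachable node; keep it only if a majority accepted."""
    accepted = []
    for node in nodes:
        if not node.up:
            continue
        try:
            node.db.execute(sql, params)  # runs inside an implicit transaction
            accepted.append(node)
        except sqlite3.Error:
            node.db.rollback()
    if len(accepted) > len(nodes) // 2:
        for node in accepted:
            node.db.commit()              # majority reached: make it durable
        return True
    for node in accepted:
        node.db.rollback()                # no majority: undo everywhere
    return False


cluster = [Node(), Node(), Node(up=False)]
ok = quorum_write(cluster, "INSERT INTO kv VALUES (?, ?)", ("color", "blue"))
cluster[1].up = False
rejected = quorum_write(cluster, "INSERT INTO kv VALUES (?, ?)", ("size", "xl"))
rows = cluster[0].db.execute("SELECT k FROM kv ORDER BY k").fetchall()
```

The first write reaches two of three nodes and is committed; the second reaches only one, so it is rolled back and leaves no trace on the surviving node.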
High performance, high scalability and high reliability database for big data. GridDB has a KVS (Key-Value Store)-type data model that is suitable for sensor data stored as time series. It is a database that can be easily scaled out according to the number of sensors.
Open source analytics toolbox
Analyze your data without writing a line of SQL.
Kinto is a lightweight JSON storage service with synchronisation and sharing abilities. It is meant to be easy to use and easy to self-host.
Kinto is used at Mozilla and released under the Apache v2 licence.
Automatic SQL injection and database takeover tool.
sqlmap is an open source penetration testing tool that automates the process of detecting and exploiting SQL injection flaws and taking over database servers.
Apache HBase™ is the Hadoop database, a distributed, scalable, big data store. Use Apache HBase when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. Apache HBase is an open-source, distributed, versioned, non-relational database modeled after Google's Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, Apache HBase provides Bigtable-like capabilities on top of Hadoop and HDFS.
Allows you to easily execute SQL against structured text like CSV or TSV.
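The general idea of running SQL over delimited text can be sketched with Python's standard library alone: load the rows into an in-memory SQLite table, then query it. This is an illustration of the technique, not the listed tool's implementation or command-line interface; `query_csv` and the default table name `data` are assumptions made for the example.

```python
import csv
import io
import sqlite3


def query_csv(text, sql, table="data", delimiter=","):
    """Load delimited text into an in-memory SQLite table and run `sql` on it."""
    rows = list(csv.reader(io.StringIO(text), delimiter=delimiter))
    header, body = rows[0], rows[1:]      # first row supplies the column names
    db = sqlite3.connect(":memory:")
    columns = ", ".join(f'"{name}"' for name in header)
    placeholders = ", ".join("?" for _ in header)
    db.execute(f'CREATE TABLE "{table}" ({columns})')
    db.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', body)
    return db.execute(sql).fetchall()


sample = "city,population\nLyon,522000\nNice,342000\nBrest,139000\n"
result = query_csv(
    sample,
    "SELECT city FROM data WHERE CAST(population AS INTEGER) > 300000 ORDER BY city",
)
```

Values are loaded as text, which is why the query casts `population` before comparing. Passing `delimiter="\t"` handles TSV input the same way.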
SchemaSpy is a Java-based tool (requires Java 5 or higher) that analyzes the metadata of a schema in a database and generates a visual representation of it in a browser-displayable format. It lets you click through the hierarchy of database tables via child and parent table relationships as represented by both HTML links and entity-relationship diagrams. It's also designed to help resolve the obtuse errors that a database sometimes gives related to failures due to constraints.