Top 23 distributed-database Open-Source Projects

etcd

63 46,614 9.9 Go

Distributed reliable key-value store for the most critical data of a distributed system

Project mention: Is Kubernetes a database? CRDs explained in five minutes | dev.to | 2024-05-24

Each time we create or update a K8s resource, the Kubernetes API stores it in its database — etcd. etcd is a distributed key-value store used to store all of your resource configurations, such as deployments, services, and so on. A neat feature of etcd is that you can subscribe to changes in some keys in the database, which is used by other Kubernetes mechanisms.
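The subscribe-to-changes idea mentioned above can be illustrated with a toy in-memory store. This is only a sketch of the general pattern, not etcd's client API: the `WatchableStore` class, its `watch`/`put`/`delete` methods, and the key names are all hypothetical, and a real etcd watch streams events over the network rather than calling local callbacks.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical callback signature: (key, old_value, new_value).
Watcher = Callable[[str, Optional[str], Optional[str]], None]


class WatchableStore:
    """Toy in-memory key-value store with prefix watches (not etcd's API)."""

    def __init__(self) -> None:
        self._data: Dict[str, str] = {}
        self._watchers: List[Tuple[str, Watcher]] = []

    def watch(self, prefix: str, callback: Watcher) -> None:
        """Register a callback for changes to keys starting with prefix."""
        self._watchers.append((prefix, callback))

    def put(self, key: str, value: str) -> None:
        """Create or update a key, then notify matching watchers."""
        old = self._data.get(key)
        self._data[key] = value
        self._notify(key, old, value)

    def delete(self, key: str) -> None:
        """Remove a key if present; new_value is None in the notification."""
        if key in self._data:
            old = self._data.pop(key)
            self._notify(key, old, None)

    def _notify(self, key: str, old: Optional[str], new: Optional[str]) -> None:
        for prefix, callback in self._watchers:
            if key.startswith(prefix):
                callback(key, old, new)


events: List[Tuple[str, Optional[str], Optional[str]]] = []
store = WatchableStore()
store.watch("/deployments/", lambda k, old, new: events.append((k, old, new)))

store.put("/deployments/web", "replicas=2")
store.put("/services/web", "port=80")  # no watcher matches this prefix
store.put("/deployments/web", "replicas=3")
store.delete("/deployments/web")
```

After these calls, `events` holds three entries, all for `/deployments/web`; the `/services/web` write is ignored because no watcher is registered for that prefix.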

tidb

27 36,358 10.0 Go

TiDB is an open-source, cloud-native, distributed, MySQL-compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at: https://www.pingcap.com/tidb-serverless/

Project mention: A MySQL compatible database engine written in pure Go | news.ycombinator.com | 2024-04-09

tidb has been around for a while, it is distributed, written in Go and Rust, and MySQL compatible. https://github.com/pingcap/tidb
Somewhat relatedly, StarRocks is also MySQL compatible, written in Java and C++, but it's tackling OLAP use-cases. https://github.com/StarRocks/starrocks

ClickHouse

211 34,961 10.0 C++

ClickHouse® is a real-time analytics DBMS

Project mention: Universal Data Migration: Using Slingdata to Transfer Data Between Databases | dev.to | 2024-05-24

ClickHouse installed and running.

cockroach

101 29,329 10.0 Go

CockroachDB - the open source, cloud-native distributed SQL database.

Project mention: Show HN: Restate, low-latency durable workflows for JavaScript/Java, in Rust | news.ycombinator.com | 2024-06-12

Restate is built as a sharded replicated state machine similar to how TiKV (https://tikv.org/), Kudu (https://kudu.apache.org/kudu.pdf) or CockroachDB (https://github.com/cockroachdb/cockroach) are designed. Instead of relying on a specific consensus implementation, we have decided to encapsulate this part into a virtual log (inspired by Delos https://www.usenix.org/system/files/osdi20-balakrishnan.pdf) since it makes it possible to tune the system more easily for different deployment scenarios (on-prem, cloud, cost-effective blob storage). Moreover, it allows for some other cool things like seamlessly moving from one log implementation to another. Apart from that the whole system design has been influenced by ideas from stream processing systems such as Apache Flink (https://flink.apache.org/), log storage systems such as LogDevice (https://logdevice.io/) and others.
We plan to publish a more detailed follow-up blog post where we explain why we developed a new stateful system, how we implemented it, and what the benefits are. Stay tuned!
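The virtual-log idea described in the mention above (hiding the concrete log implementation so it can be swapped later) can be sketched roughly as follows. This is an illustrative toy, not Restate's or Delos's actual design: `Loglet`, `InMemoryLoglet`, `VirtualLog`, and `switch_to` are hypothetical names, and the sketch assumes a single process and that each new loglet starts empty.

```python
from typing import List, Protocol, Tuple


class Loglet(Protocol):
    """Minimal interface that any underlying log implementation must offer."""

    def append(self, entry: str) -> int: ...
    def read(self, index: int) -> str: ...
    def __len__(self) -> int: ...


class InMemoryLoglet:
    """Stand-in for one concrete log implementation."""

    def __init__(self) -> None:
        self._entries: List[str] = []

    def append(self, entry: str) -> int:
        self._entries.append(entry)
        return len(self._entries) - 1  # local index within this loglet

    def read(self, index: int) -> str:
        return self._entries[index]

    def __len__(self) -> int:
        return len(self._entries)


class VirtualLog:
    """Presents one continuous log on top of a chain of loglets."""

    def __init__(self, first: Loglet) -> None:
        # Each segment is (global start offset, loglet that owns it).
        self._segments: List[Tuple[int, Loglet]] = [(0, first)]

    def append(self, entry: str) -> int:
        """Append to the active loglet and return the global offset."""
        start, loglet = self._segments[-1]
        return start + loglet.append(entry)

    def switch_to(self, new_loglet: Loglet) -> None:
        """Stop writing to the active loglet; later appends go to new_loglet."""
        start, loglet = self._segments[-1]
        self._segments.append((start + len(loglet), new_loglet))

    def read(self, offset: int) -> str:
        """Find the segment that owns the global offset and read from it."""
        for start, loglet in reversed(self._segments):
            if offset >= start:
                return loglet.read(offset - start)
        raise IndexError(offset)


log = VirtualLog(InMemoryLoglet())
a = log.append("x")
b = log.append("y")
log.switch_to(InMemoryLoglet())  # e.g. move to a different backend
c = log.append("z")
```

Callers only ever see global offsets, so the switch between backends is invisible to them: `log.read(1)` still returns `"y"` from the first loglet, and `log.read(2)` returns `"z"` from the second.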

surrealdb

95 25,834 9.8 Rust

A scalable, distributed, collaborative, document-graph database, for the realtime web

Project mention: Event Driven services using Kafka, SurrealDB, Rust, and Go. | dev.to | 2024-06-04

Also, to store the data of the products and their available units, we will be using a database called SurrealDB. I have chosen SurrealDB for a specific reason, which we will explore later in this article. Now that we have produced a message from the inventory, we need a consumer to consume this message by connecting to the Kafka broker. For this, we will create a shipment service using Go to simulate the shipping process when the products are released from the inventory, but to keep this project short and concise, we are not going to build the whole shipment system.

shardingsphere

23 19,555 10.0 Java

Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.

rqlite

113 15,098 9.9 Go

The lightweight, distributed relational database built on SQLite.

Project mention: Why SQLite Is Taking over with Brian Holt and Marco Bambini | news.ycombinator.com | 2024-06-12

SQLite is not competing with RDBMSes. SQLite is competing with fopen().
There are of course solutions which wrap this fopen() replacement in network/cluster-aware tools, e.g. https://github.com/rqlite/rqlite - these are competing with postgres.

foundationdb

21 14,128 9.8 C++

FoundationDB - the open source, distributed, transactional key-value store

Project mention: Figma's Databases team lived to tell the scale | news.ycombinator.com | 2024-03-14

Actually, Apple does this for iCloud! They use FoundationDB[1] to store billions of databases, one for each user (plus shared or global databases).
See: https://read.engineerscodex.com/p/how-apple-built-icloud-to-...
Discussed on HN at the time: https://news.ycombinator.com/item?id=39028672
[1]: https://github.com/apple/foundationdb https://en.wikipedia.org/wiki/FoundationDB

ArangoDB

18 13,404 9.9 C++

🥑 ArangoDB is a native multi-model database with flexible data models for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.

Project mention: System Design: Databases and DBMS | dev.to | 2024-05-01

ArangoDB

awesome-bigdata

4 12,890 0.0

A curated list of awesome big data frameworks, resources and other awesomeness.

Project mention: Top 20 Awesome on Github | dev.to | 2024-06-12

12. Awesome Big Data

Apache ZooKeeper

36 11,997 8.0 Java

Apache ZooKeeper

Project mention: On Implementation of Distributed Protocols | dev.to | 2024-04-05

Apache ZooKeeper — a distributed coordination, synchronization, and configuration service (written in Java);

citus

63 9,991 9.3 C

Distributed PostgreSQL as an extension

Project mention: Show HN: Serverless Postgres | news.ycombinator.com | 2024-05-29

talent-plan

23 9,890 0.0 Rust

Open source training courses about distributed databases and distributed systems

Project mention: Need insights to build a distributed key value store from scratch. | /r/DistributedComputing | 2023-12-08

Please check this course: https://github.com/pingcap/talent-plan. It covers how to implement SQL and a key-value store. It’s an awesome course.

Trino

45 9,735 10.0 Java

Official repository of Trino, the distributed SQL query engine for big data, former

Project mention: Trino & Iceberg Made Easy: A Ready-to-Use Playground | dev.to | 2024-05-19

By the way, I wanted to continue to use the previous experiment with Flink SQL and Iceberg, but I found out Trino doesn't support Iceberg's DynamoDB catalog. Therefore, I had to create a new one.

yugabyte-db

91 8,580 10.0 C

YugabyteDB - the cloud native distributed SQL database for mission-critical applications.

Project mention: Crash on clock skew: performance vs availability | dev.to | 2024-05-11

In a distributed database, NTP synchronization is essential and should be carefully monitored and fixed in case of any failures. To allow some time drift, a maximum clock skew is set. This skew should be kept low enough for performance to avoid too many read retries and high enough for availability to avoid any node evictions caused by network errors. It is a good idea to check the NTP synchronization when starting a YugabyteDB node. This will be implemented by 22255.
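A startup check of the kind described above could look roughly like the following sketch. It is not YugabyteDB's implementation: the function name, the use of the median as the reference clock, the node names, and the 500 ms threshold are all assumptions made for illustration, and the clock offsets are supplied directly rather than measured via NTP.

```python
from statistics import median
from typing import Dict, List


def nodes_exceeding_skew(offsets_ms: Dict[str, float], max_skew_ms: float) -> List[str]:
    """Return the nodes whose clock offset is too far from the cluster median.

    offsets_ms maps a node name to its measured clock offset in milliseconds
    (hypothetical input; a real system would obtain it from NTP or chrony).
    """
    reference = median(offsets_ms.values())
    return sorted(
        node
        for node, offset in offsets_ms.items()
        if abs(offset - reference) > max_skew_ms
    )


# Illustrative measurements only.
offsets = {"node-a": 0.0, "node-b": 12.5, "node-c": 640.0}

strict = nodes_exceeding_skew(offsets, max_skew_ms=500.0)
lenient = nodes_exceeding_skew(offsets, max_skew_ms=1000.0)
```

With the 500 ms bound only `node-c` is flagged, while the 1000 ms bound accepts every node. This mirrors the trade-off in the text: a tighter bound rejects more nodes, and a looser one tolerates more drift at the cost of more read retries.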

starrocks

12 8,092 10.0 Java

StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.

Project mention: A MySQL compatible database engine written in pure Go | news.ycombinator.com | 2024-04-09

tidb has been around for a while, it is distributed, written in Go and Rust, and MySQL compatible. https://github.com/pingcap/tidb
Somewhat relatedly, StarRocks is also MySQL compatible, written in Java and C++, but it's tackling OLAP use-cases. https://github.com/StarRocks/starrocks

oceanbase

8 7,534 10.0 C++

OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.

risingwave

27 6,456 10.0 Rust

SQL stream processing, analytics, and management. We decouple storage and compute to offer instant failover, dynamic scaling, speedy bootstrapping, and efficient joins.

Project mention: Proton, a fast and lightweight alternative to Apache Flink | news.ycombinator.com | 2024-01-30

How does this compare to RisingWave and Materialize?
https://github.com/risingwavelabs/risingwave

dynomite

3 4,164 0.0 C

A generic dynamo implementation for different k-v storage engines

Crate

6 3,982 9.9 Java

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

Project mention: FLaNK AI - 01 April 2024 | dev.to | 2024-04-01

awesome-blockchains

2 3,712 0.0 Ruby

A collection about awesome blockchains - open distributed public databases w/ crypto hashes incl. git ;-). Blockchains are the new tulips :tulip::tulip::tulip:. Distributed is the new centralized.

ydb

10 3,485 10.0 C++

YDB is an open source Distributed SQL Database that combines high availability and scalability with strong consistency and ACID transactions

Project mention: Erasure Coding versus Tail Latency | news.ycombinator.com | 2024-03-28

There is https://ydb.tech/, an open source DB that uses erasure coding for replication in a single zone/region.

Olric

28 3,028 6.3 Go

Distributed in-memory object store. It can be used as an embedded Go library and a language-independent service.

Project mention: Olric: Distributed, embeddable in-memory data structures in Go | news.ycombinator.com | 2024-02-05

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

distributed-database related posts

Event Driven services using Kafka, SurrealDB, Rust, and Go.

2 projects | dev.to | 4 Jun 2024

Apache HoraeDB is a high-performance, distributed, time-series database in Rust

1 project | news.ycombinator.com | 25 May 2024

Crash on clock skew: performance vs availability

1 project | dev.to | 11 May 2024

Why SurrealDB is the Future of Database Technology - An In-Depth Look

3 projects | dev.to | 9 May 2024

Advisory/Custom/Application Lock with YugabyteDB

1 project | dev.to | 9 May 2024

A lightweight YugabyteDB docker image for CI/CD

1 project | dev.to | 7 May 2024

Multi-region YugabyteDB deployment on AWS EKS with Istio

3 projects | dev.to | 2 May 2024

Index

What are some of the best open-source distributed-database projects? This list will help you:

#	Project	Stars
1	etcd	46,614
2	tidb	36,358
3	ClickHouse	34,961
4	cockroach	29,329
5	surrealdb	25,834
6	shardingsphere	19,555
7	rqlite	15,098
8	foundationdb	14,128
9	ArangoDB	13,404
10	awesome-bigdata	12,890
11	Apache ZooKeeper	11,997
12	citus	9,991
13	talent-plan	9,890
14	Trino	9,735
15	yugabyte-db	8,580
16	starrocks	8,092
17	oceanbase	7,534
18	risingwave	6,456
19	dynomite	4,164
20	Crate	3,982
21	awesome-blockchains	3,712
22	ydb	3,485
23	Olric	3,028
