SaaSHub helps you find the best software and product alternatives Learn more ā
Top 23 Bigdata Open-Source Projects
-
TDengine
TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, Industrial IoT and DevOps.
-
shardingsphere
Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second š
-
OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
griddb
GridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.
-
spark
.NET for ApacheĀ® Sparkā¢ makes Apache Sparkā¢ easily accessible to .NET developers. (by dotnet)
-
Optimus
:truck: Agile Data Preparation Workflows madeĀ easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark (by ironmussa)
-
odd-platform
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
-
incubator-livy
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
-
visualpython
GUI-based Python code generator for data science, extension to Jupyter Lab, Jupyter Notebook and Google Colab.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Opposite to what the documentation tells, the full prefix is jdbc:shardingsphere:absolutepath. I've opened a PR to fix the documentation.
Project mention: Data Sync in JuiceFS 1.2: Enhanced Selective Sync and Performance Optimizations | dev.to | 2024-05-17In JuiceFS 1.2, we introduced several new features for juicefs sync. We also optimized performance for multiple scenarios to improve users' data synchronization efficiency when dealing with large directories and complex migrations.
Project mention: Getting Started with Flink SQL, Apache Iceberg and DynamoDB Catalog | dev.to | 2023-12-18Apache Iceberg is one of the three types of lakehouse, the other two are Apache Hudi and Delta Lake.
Project mention: How to Dynamically Adjust the Height of a Textarea in ReactJS | dev.to | 2023-10-25In this blog post, I have demonstrated how I addressed the challenge of dynamically adjusting the height of a textarea element based on its content, preventing the need for vertical scrolling in the title section of the OpenMetadata Knowledge article page.
Project mention: Open Table Formats Such as Apache Iceberg Are Inevitable for Analytical Data | news.ycombinator.com | 2024-01-18Apache AVRO [1] is one but it has been largely replaced by Parquet [2] which is a hybrid row/columnar format
[1] https://avro.apache.org/
Project mention: OpenDataDiscovery 0.15 with Data Deprecation and Metadata Stale | news.ycombinator.com | 2023-08-04
Bigdata related posts
-
TDengine: NEW Data - star count:22190.0
-
TDengine: NEW Data - star count:22190.0
-
TDengine: NEW Data - star count:22190.0
-
TDengine: NEW Data - star count:21816.0
-
TDengine: NEW Data - star count:21816.0
-
How to Dynamically Adjust the Height of a Textarea in ReactJS
-
TDengine: NEW Data - star count:21816.0
-
A note from our sponsor - SaaSHub
www.saashub.com | 21 May 2024
Index
What are some of the best open-source Bigdata projects? This list will help you:
Project | Stars | |
---|---|---|
1 | TDengine | 22,870 |
2 | shardingsphere | 19,475 |
3 | awesome-bigdata | 12,845 |
4 | juicefs | 9,881 |
5 | vaex | 8,180 |
6 | hudi | 5,114 |
7 | OpenMetadata | 4,271 |
8 | volcano | 3,805 |
9 | Apache Avro | 2,780 |
10 | dpark | 2,691 |
11 | griddb | 2,324 |
12 | spark | 2,000 |
13 | Optimus | 1,447 |
14 | tensorbase | 1,429 |
15 | odd-platform | 1,124 |
16 | cds | 956 |
17 | Mobius: C# API for Spark | 939 |
18 | tispark | 877 |
19 | incubator-livy | 857 |
20 | visualpython | 811 |
21 | Gearpump | 765 |
22 | WeDataSphere | 640 |
23 | spline | 583 |
Sponsored