✨ 5 Open Source Data Engineering Projects 🔥

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • HashtagCashtag

    My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on ​lambda architecture​, that aggregates Twitter and US stock market data for user sentiment analysis using open source tools - ​Apache Kafka ​for data ingestions, Apache Spark ​& ​Spark Streaming ​for batch & real-time processing, ​Apache Cassandra f​ or storage, ​Flask​, ​Bootstrap and ​HighCharts f​ or frontend.

  • 1️⃣ HashtagCashtag

  • PANDAS-TUTORIAL

    Jupyter Notebooks and Data Sets for Pandas Library (by TirendazAcademy)

  • ✨ Follow me Twitter | Instagram | YouTube| Tiktok | Medium for more content on data engineering.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • practical-data-engineering

    Practical Data Engineering: A Hands-On Real-Estate Project Guide

  • 2️⃣ Building a Data Engineering Project in 20 Minutes

  • WebCrawlerForOnlineInflation

    Price Crawler - Tracking Price Inflation

  • 4️⃣ Web Crawler For Online Inflation

  • Data-Engineering-Projects

    Personal Data Engineering Projects

  • 5️⃣ Data Engineering Projects

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Write production grade pandas (and other libraries!) with Hamilton

    2 projects | /r/Python | 27 Feb 2023
  • The Machine Learning Project Lifecycle

    1 project | /r/learnmachinelearning | 5 Nov 2022
  • I'm in my 30's and never had a "real job" - I have depression and anxiety, how do I get my life in order?

    1 project | /r/findapath | 2 Oct 2022
  • Learn data science step by step in 6 months😉

    1 project | /r/Datewithdata | 3 May 2022
  • 2022 Goals

    1 project | /r/pythontips | 14 Apr 2022