Top 3 Python data-lake Projects
- Udacity-Data-Engineering-Projects
A few projects related to Data Engineering, including Data Modeling, Infrastructure setup on cloud, Data Warehousing, and Data Lake development.
- amazon-s3-find-and-forget
Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR).
Project mention: Show HN: Automatically extract data from APIs with dlt and OpenAPI | news.ycombinator.com | 2024-05-29
- You always have the last say. The generated code is declarative and ready to hack in case we pick the wrong paginator or response entity.
The tool and dlt are open source; find the code here: https://github.com/dlt-hub/dlt-init-openapi and here: https://github.com/dlt-hub/dlt
Python data-lake related posts
- Show HN: Automatically extract data from APIs with dlt and OpenAPI
- Show HN: Data load tool (dlt) - Python library to automate the creation of datasets
- Data load tool (dlt) – open-source Python library that makes data loading easy
- [Discussion] How to implement Data Contracts generically? Seeking advice from data contract users.
- Deleting particular data from S3 External Tables
- Update S3 Files
-
A note from our sponsor - Scout Monitoring
www.scoutapm.com | 1 Jun 2024
Index
What are some of the best open-source data-lake projects in Python? This list will help you:
Rank | Project | Stars |
---|---|---|
1 | dlt | 1,837 |
2 | Udacity-Data-Engineering-Projects | 1,363 |
3 | amazon-s3-find-and-forget | 233 |