Top 4 Python data-testing Projects
- soda-core
  ⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
- soda-spark
  Soda Spark is a PySpark library that helps you test your data in Spark DataFrames.
Project mention: Show HN: PipeRider – open-source Data Impact Analysis for dbt changes | news.ycombinator.com | 2023-09-06
NOTE:
The open source projects on this list are ordered by number of GitHub stars.
The number of mentions indicates repo mentions in the last 12 months or
since we started tracking (Dec 2020).
Python data-testing related posts
- Data profiling tools / approaches?
- Data QC? Great Expectations?
- Show HN: Soda Core is now GA – Test data like you would test your code
- Data Quality - Great Expectations for Data Engineers
- dbt vs R/Python for transformation
- SodaCL - preview of a new "data reliability as code" language
- How do you test your pipelines?
Index
What are some of the best open-source data-testing projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | soda-core | 1,786 |
2 | piperider | 471 |
3 | soda-spark | 60 |
4 | data_check | 4 |