Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today. Learn more →
Top 21 Python Datascience Projects
-
Scout Monitoring
Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
-
Mimesis
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Fast-F1
FastF1 is a python package for accessing and analyzing Formula 1 results, schedules, timing data and telemetry
-
CleverCSV
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
-
socios-brasil
Captura os dados de sócios das empresas brasileiras na Receita Federal e exporta para um formato legível por humanos
-
scrape-google-play-store-app
Single script to scrape Google Play Store App info without browser automation
-
OLX-Analytics
🔍 This project allows easy and efficient browsing of classifieds on the OLX portal. The user has the option to register for a subscription and receive the latest information from the category that interests him every 4 hours.
-
Machine-Learning-Cyrillic-Classifier
This is a web app where you can draw a letter in the russian alphabet and the ML algorithm will predict the letter that you drew.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Show HN: Toolkit for LLM Fine-Tuning, Ablating and Testing | news.ycombinator.com | 2024-04-07This is a great project, little bit similar to https://github.com/ludwig-ai/ludwig, but it includes testing capabilities and ablation.
questions regarding the LLM testing aspect: How extensive is the test coverage for LLM use cases, and what is the current state of this project area? Do you offer any guarantees, or is it considered an open-ended problem?
Would love to see more progress toward this area!
Project mention: Python Day 9: Building Interactive Web Apps without HTML/CSS and JavaScript | dev.to | 2024-04-26Taipy is an open-source Python library that enables data scientists and developers to build robust end-to-end data pipelines.
Metaflow is an open source Python library that allows engineers to build and manage ML projects. It focuses on rapid prototyping and reducing time from development to production. It makes the job of ML data scientists easier by taking care of the low-level infrastructure: data, compute, orchestration, and versioning.
panel – data exploration & web app framework for Python
Project mention: Python: Uncovering the Overlooked Core Functionalities | news.ycombinator.com | 2023-07-24If you actually think this code is better there's a real library that does this: https://github.com/EntilZha/PyFunctional.
Stack: Python, Flask, HTML, CSS, Bootstrap, Docker, SQLite, APScheduler Source code
Python Datascience related posts
-
Python Day 9: Building Interactive Web Apps without HTML/CSS and JavaScript
-
+10 Resources to Empower Women in Technology
-
Show HN: Building data and AI apps, an alternative to Streamlit
-
Our open-source project for building AI / Data full-stack apps got funded! 🎉 🎉
-
Plotting 1,000,000 points on a webpage using only Python
-
Python: Uncovering the Overlooked Core Functionalities
-
Consume Live Timing/Telemetry From API During Race
-
A note from our sponsor - Scout Monitoring
www.scoutapm.com | 10 Jun 2024
Index
What are some of the best open-source Datascience projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | ludwig | 10,905 |
2 | modin | 9,539 |
3 | Taipy | 9,471 |
4 | metaflow | 7,721 |
5 | Mimesis | 4,319 |
6 | panel | 4,329 |
7 | PyFunctional | 2,359 |
8 | Fast-F1 | 2,240 |
9 | openllmetry | 1,420 |
10 | CleverCSV | 1,229 |
11 | streamlit-geospatial | 816 |
12 | DGFraud | 655 |
13 | socios-brasil | 549 |
14 | Mobile-Phone-Dataset-GSMArena | 58 |
15 | gretel-python-client | 44 |
16 | PathDict | 24 |
17 | linkedin-connections-analyzer | 12 |
18 | TagMaps | 6 |
19 | scrape-google-play-store-app | 2 |
20 | OLX-Analytics | 1 |
21 | Machine-Learning-Cyrillic-Classifier | 1 |