Top 23 Python Panda Projects
-
Pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
-
30-Days-Of-Python
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than 100 days, so follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw
-
data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
-
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
-
ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
-
pandas-ai
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
-
pandas-ta
Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 150+ Indicators
-
Mimesis
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
-
AWS Data Wrangler
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
It's also possible for you to give a package an alias by using the as keyword. For instance, you could use the pandas package as pd like this:
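A minimal sketch of that alias, assuming pandas is installed; the column names and values are made up for illustration:

```python
# Bind the pandas module to the conventional short name `pd`.
import pandas as pd

# The alias refers to the same module, so every pandas name is reachable via `pd`.
# The data below is invented purely for this example.
df = pd.DataFrame({"ticker": ["AAA", "BBB"], "price": [10.0, 20.0]})

mean_price = df["price"].mean()
print(mean_price)
```

The alias only changes the local name; `pd.DataFrame` and `pandas.DataFrame` are the same class.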
4. Asabeneh/30-Days-Of-Python - This repository presents a 30-day challenge for beginners to learn Python from the ground up. The course covers everything from the basics to more advanced topics like statistics, data analysis, and web development. https://github.com/Asabeneh/30-Days-Of-Python
Project mention: 🐍🐍 23 issues to grow yourself as an exceptional open-source Python expert 🧑💻 🥇 | dev.to | 2023-10-19
If you check the file here - https://github.com/ranaroussi/yfinance/blob/main/yfinance/base.py - you can see this is communicated via the "raise Exception('%s: %s' % (self.ticker, err_msg))" line. I'm trying to use the following to catch the exception but no luck.
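One way to catch an error raised as a plain `Exception`, sketched with a hypothetical stand-in function rather than yfinance itself. `fetch_history`, `safe_fetch`, the ticker, and the message are made-up names for illustration; only the `raise` line mirrors the one quoted above, and whether the library still raises this way is not confirmed here.

```python
def fetch_history(ticker, err_msg=None):
    """Hypothetical stand-in that fails the same way as the quoted raise line."""
    if err_msg is not None:
        raise Exception('%s: %s' % (ticker, err_msg))
    return []


def safe_fetch(ticker, err_msg=None):
    """Return (data, error_text); error_text is None when nothing was raised."""
    try:
        return fetch_history(ticker, err_msg), None
    except Exception as exc:
        # The error is a bare Exception, so there is no narrower class to catch;
        # the message text is the only detail available.
        return None, str(exc)


data, error = safe_fetch("FAKE", "No data found")
print(error)
```

Because the base `Exception` class is raised, a narrower `except` clause such as `except ValueError` would not match it in this sketch.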
Project mention: PandasAI is great but is there a more general library? | news.ycombinator.com | 2023-08-23
Project mention: Show HN: Use an "eraser" to clean data on flight without breaking your workflow | news.ycombinator.com | 2024-03-15
Project mention: Data Science at the Command Line, 2nd Edition (2021) | news.ycombinator.com | 2024-05-06
I'd like to call out one of my favorite pieces of software from the past 10 years: VisiData [1] has completely changed the way I do ad-hoc data processing, and is now my go-to for pretty much all use cases that I previously used spreadsheets for, and about half of those I previously used databases for.
It's a TUI application, not strictly CLI, but scriptable, and I figure anyone building pipelines using tools like jq, q, awk, grep, etc. to process tabular data will find it extremely useful.
----
[1]: https://visidata.org
I know I've tooted its horn before, but Orange3 is a pretty neat Python-based GUI platform that makes this and a metric buttload of other statistical/ML techniques available to non-programmer types.
Just watch out for the null character `\x00` in the corpus. That always seems to kill it stone dead.
https://orangedatamining.com/
https://orange3.readthedocs.io/projects/orange-visual-progra...
Project mention: Show HN: Hashquery, a Python library for defining reusable analysis | news.ycombinator.com | 2024-04-23
I really don't understand the appeal of dbt vs a proper programming language. The templating approach leads to massive spaghetti. I look forward to trying out something like Ibis [0]
0: https://ibis-project.org/
Project mention: Read files from s3 using Pandas/s3fs or AWS Data Wrangler? | /r/dataengineering | 2023-12-06
I had no problem with awswrangler (https://github.com/aws/aws-sdk-pandas). It supports reading and writing partitions, which was really helpful, plus a few other optimizations that made it a great tool.
Python Pandas related posts
-
The Birth of Parquet
-
PDEP-13: The Pandas Logical Type System
-
"No" is not an actionable error message
-
Pandas reset_index(): How To Reset Indexes in Pandas
-
Show HN: Hashquery, a Python library for defining reusable analysis
-
The Design Philosophy of Great Tables (Software Package)
-
Show HN: Use an "eraser" to clean data on flight without breaking your workflow
Index
What are some of the best open-source Panda projects in Python? This list will help you:
Rank | Project | Stars |
---|---|---|
1 | Pandas | 42,159 |
2 | 30-Days-Of-Python | 32,404 |
3 | tqdm | 27,570 |
4 | data-science-ipython-notebooks | 26,545 |
5 | datasets | 18,523 |
6 | ydata-profiling | 12,101 |
7 | Dask | 12,055 |
8 | seaborn | 12,003 |
9 | yfinance | 12,000 |
10 | pandas-ai | 11,214 |
11 | pygwalker | 10,213 |
12 | modin | 9,498 |
13 | mlcourse.ai | 9,470 |
14 | visidata | 7,454 |
15 | lux | 5,029 |
16 | pandas-ta | 4,823 |
17 | orange | 4,633 |
18 | Mimesis | 4,310 |
19 | ibis | 4,304 |
20 | geopandas | 4,234 |
21 | alpha_vantage | 4,170 |
22 | AWS Data Wrangler | 3,816 |
23 | missingno | 3,771 |