Top 23 Python Text Projects
-
aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
-
pytorch-widedeep
A flexible package for multimodal deep learning that combines tabular data with text and images using Wide and Deep models in PyTorch
-
1filellm
Specify a GitHub or local repo, GitHub pull request, arXiv or Sci-Hub paper, YouTube transcript, or documentation URL on the web, and scrape it into a text file and the clipboard for easier LLM ingestion
-
py_midicsv
A Python port and library-fication of the midicsv tool by John Walker. If you need to convert MIDI files to human-readable text files and back, this is the library for you.
-
semchunk
A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.
-
pytextcodifier
Turn your text files into codified images or your codified images into text files.
Project mention: Exploring Open-Source Alternatives to Landing AI for Robust MLOps | dev.to | 2023-12-13
Numerous tools exist for detecting anomalies in time series data, but Alibi Detect stood out to me, particularly for its capabilities and its compatibility with both TensorFlow and PyTorch backends.
Project mention: ART 6.0 released: ASCII and Non-ASCII art library for Python (+ Space support) | /r/coolgithubprojects | 2023-06-14
Evennia - MUD server (text-based MMORPG). Python
Project mention: Show HN: FileKitty – Combine and label text files for LLM prompt contexts | news.ycombinator.com | 2024-05-01
I created something similar, https://github.com/jimmc414/1filellm
It converts papers, repositories, PRs, and web docs into one text file for LLM ingestion.
Project mention: Ask HN: What Underrated Open Source Project Deserves More Recognition? | news.ycombinator.com | 2024-03-07
See https://github.com/pszemraj/textsum. He's the guy who trained most of the popular fine-tuned long models. He created a pip package to make life easier (it uses Hugging Face under the hood, pre-selects good models, and hides the boilerplate).
Project mention: semchunk alternatives - text-splitter and langchain | libhunt.com/r/semchunk | 2023-11-09
Python Text related posts
-
Building a Multi-Tenant App with FastAPI, SQLModel, and PropelAuth
-
On why Markdown is not a good, or even a half-decent, markup language
-
ART 6.0 released: ASCII and Non-ASCII art library for Python (+ Space support)
-
Show HN: I turned my microeconomics textbook into a chatbot with GPT-3
-
Modern Polars: an extensive side-by-side comparison of Polars and Pandas
-
Show HN: Pygame's Text Input Module
-
How to create diagrams via code?
Index
What are some of the best open-source Text projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | TextRecognitionDataGenerator | 3,043 |
2 | aeneas | 2,379 |
3 | alibi-detect | 2,085 |
4 | art | 1,996 |
5 | evennia | 1,717 |
6 | pytorch-widedeep | 1,238 |
7 | pygame-menu | 503 |
8 | pangu.py | 233 |
9 | 1filellm | 224 |
10 | pygame-text-input | 138 |
11 | orange3-text | 124 |
12 | textsum | 110 |
13 | py_midicsv | 72 |
14 | zeroshot_topics | 60 |
15 | Quote2Image | 58 |
16 | To-ASCII | 57 |
17 | namekrea | 49 |
18 | semchunk | 23 |
19 | Oz-Engine | 14 |
20 | pytextcodifier | 14 |
21 | litemark | 13 |
22 | pythontextnow | 10 |
23 | linesieve | 7 |