Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 15 Python optical-character-recognition Projects
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
-
paperless-ngx
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
-
J.A.R.V.I.S
Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
signature_extractor
A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.
-
edenai-apis
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
-
OS-Bot-COLOR
A lightweight desktop client & toolkit for writing, controlling and monitoring color-based automation scripts.
-
Orchestra
Orchestra is a sheet music reader (optical music recognition (OMR) system) that converts sheet music to a machine-readable version.
-
image-to-sound-python-
A python project for converting an Image into audible sound using OCR and speech synthesis
-
Typewriter-OCR-TintypeText
This typewriter OCR code can convert JPEG typewritten text images into RTF documents, while removing typos for you!
-
Braille-OCR-e-Braille-Tales
This braille OCR code can convert JPEG braille text images into RTF documents, while removing typos for you!
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: I built an online PDF management platform using open-source software | news.ycombinator.com | 2024-05-12Ok on cleaned aligned data, but there are a few newer ones like EasyOCR [0] that can deal with much less organized text (albeit more slowly)
[0] https://github.com/JaidedAI/EasyOCR
Project mention: Ask HN: I have many PDFs – what is the best local way to leverage AI for search? | news.ycombinator.com | 2024-05-30Paperless supports OCR + full text indexing: https://docs.paperless-ngx.com/
As far as AI goes, not sure.
Project mention: Show HN: How do you OCR on a Mac using the CLI or just Python for free | news.ycombinator.com | 2024-01-02https://github.com/mindee/doctr/issues/1049
I am looking for something this polished and reliable for handwriting, does anyone have any pointers? I want to integrate it in a workflow with my eink tablet I take notes on. A few years ago, I tried various models, but they performed poorly (around 80% accuracy) on my handwriting, which I can read almost 90% of the time.
I really recommend the usage of scene text recognition models. They are perfect for these type of usecases: https://github.com/baudm/parseq or check https://paperswithcode.com/task/scene-text-recognition make sure to check the licenses and good luck 👍🏻
Project mention: We're Building an Open-Source LLM/AI API Wrapper: Here's Why | news.ycombinator.com | 2023-08-28HackerNoon featured our latest article in the "Future of AI" category
We explain how Eden AI contributes to the AI ecosystem in structuring AI and LLM APIs by creating the most accomplished Open-Source wrapper possible.
You can support us in reaching 1000 stars on Github here: https://github.com/edenai/edenai-apis
Python optical-character-recognition related posts
-
OCR at Edge on Cloudflare Constellation
-
Tesserocr
-
New Eco-Friendly Indigo Typewriter Ink (Recipe Included!)
-
Digitalizing typewritten text
-
Python Testing 1
-
How to make Brilliant Blue FCF (blue food dye)-glycerine erasable typewriter ink
-
Make Your Own Gamebook
-
A note from our sponsor - InfluxDB
www.influxdata.com | 31 May 2024
Index
What are some of the best open-source optical-character-recognition projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | EasyOCR | 22,312 |
2 | paperless-ngx | 17,416 |
3 | doctr | 3,166 |
4 | tesserocr | 1,945 |
5 | J.A.R.V.I.S | 797 |
6 | kraken | 661 |
7 | parseq | 511 |
8 | signature_extractor | 426 |
9 | edenai-apis | 373 |
10 | OS-Bot-COLOR | 232 |
11 | handprint | 157 |
12 | Orchestra | 97 |
13 | image-to-sound-python- | 57 |
14 | Typewriter-OCR-TintypeText | 10 |
15 | Braille-OCR-e-Braille-Tales | 2 |
Sponsored