Top 4 HTML OCR Projects
- unstructured: Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
- documentation: Documentation for Papermerge DMS - Installation, Help, User Manual, REST API (by papermerge)
- Warframe-OCR: A relic inventory recognition system for Warframe, based on experimental Rust bindings to Tesseract OCR. Supports detection in real-time. Very much WIP.
Be careful with unstructured:
https://github.com/Unstructured-IO/unstructured/blob/d11c70c...
from: https://github.com/open-webui/open-webui/issues/687
Project mention: Show HN: Kimchi Reader – Immersive Korean Learning with a Popup Dictionary | news.ycombinator.com | 2023-10-29
HTML OCR related posts
- Show HN: Kimchi Reader – Immersive Korean Learning with a Popup Dictionary
- Unstructured – OSS libraries and APIs to build custom preprocessing pipelines
- More intelligent Pdf parsers
- Help extracting data from multiple PDF's
- Any way to convert my handwritten diary to searchable PDFs?
- Pre-processing text documents such as PDFs, HTML and Word Documents for LLMs
- Sites for anime or series sub japanese? or other forms of immersion.
Index
What are some of the best open-source OCR projects in HTML? This list will help you:
# | Project | Stars |
---|---|---|
1 | unstructured | 6,682 |
2 | mokuro | 728 |
3 | documentation | 13 |
4 | Warframe-OCR | 1 |