Running OCR against PDFs and images directly in the browser

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

ocrs

2 910 9.2 Rust

Rust library and CLI tool for OCR (extracting text from images)

Out of curiosity have you tried ocrs by Robert Knight? https://github.com/robertknight/ocrs

pdfplumber

29 5,639 8.2 Python

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
open-parse

3 1,834 9.2 Python

Improved file parsing for LLM’s

I recently built a similar tool except it’s configured to use some deep learning libraries for the table extraction. I’m excited to integrate unitable which has state of the art performance later this week.
I built this because most of the basic layout detection libraries have terrible performance on anything non trivial. Deep learning is really the long term solution here.
https://github.com/Filimoa/open-parse

s3-ocr

1 108 - Python

Tools for running OCR against files stored in S3

My s3-ocr tool can do that with quite a bit of extra configuration.
https://github.com/simonw/s3-ocr

textract-ai

1 9 6.7 Python

TextractAI: Extract and process text from PDFs using Python, OpenAI API, and OCR techniques.

This is cool! I built something similar but it's CLI based. [1] https://github.com/lifeiswilde/textract-ai

mitta-community

12 12 9.7 Python

Community repository for MittaAI users.

Here's an EasyOCR service: https://github.com/MittaAI/mitta-community/tree/main/service.... A PDF to image processor is being built and should be out in a few weeks.
No docs, but happy to help anyone wanting to use it. Email is kord @ the company I'm working on.

unstract

5 141 9.7 Python

No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Voxel51 Is Hiring AI Researchers and Scientists — What the New Open Science Positions Mean

1 project | dev.to | 26 Apr 2024
Show HN: I made a ROS package for realtime semantic segmentation

1 project | news.ycombinator.com | 26 Apr 2024
The Nimble File Format by Meta

2 projects | news.ycombinator.com | 25 Apr 2024
How to Estimate Depth from a Single Image

8 projects | dev.to | 25 Apr 2024
How to Cluster Images

5 projects | dev.to | 9 Apr 2024

Running OCR against PDFs and images directly in the browser

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
PDF Computer Vision etl-pipeline pdf-parsing Machine Learning
Post date: 30 Mar 2024

ocrs

pdfplumber

InfluxDB

open-parse

s3-ocr

textract-ai

mitta-community

unstract

SaaSHub

Related posts

Voxel51 Is Hiring AI Researchers and Scientists — What the New Open Science Positions Mean

Show HN: I made a ROS package for realtime semantic segmentation

The Nimble File Format by Meta

How to Estimate Depth from a Single Image

How to Cluster Images

Running OCR against PDFs and images directly in the browser

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com PDF Computer Vision etl-pipeline pdf-parsing Machine Learning Post date: 30 Mar 2024

Related posts

Voxel51 Is Hiring AI Researchers and Scientists — What the New Open Science Positions Mean

Show HN: I made a ROS package for realtime semantic segmentation

The Nimble File Format by Meta

How to Estimate Depth from a Single Image

How to Cluster Images

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
PDF Computer Vision etl-pipeline pdf-parsing Machine Learning
Post date: 30 Mar 2024