Running OCR against PDFs and images directly in the browser

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  • ocrs

    Rust library and CLI tool for OCR (extracting text from images)

  • Out of curiosity, have you tried ocrs by Robert Knight? https://github.com/robertknight/ocrs

  • pdfplumber

    Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

  • open-parse

    Improved file parsing for LLMs

  • I recently built a similar tool, except it’s configured to use deep learning libraries for table extraction. I’m excited to integrate unitable, which has state-of-the-art performance, later this week.

    I built this because most of the basic layout detection libraries perform terribly on anything nontrivial. Deep learning is really the long-term solution here.

    https://github.com/Filimoa/open-parse

  • s3-ocr

    Tools for running OCR against files stored in S3

  • My s3-ocr tool can do that with quite a bit of extra configuration.

    https://github.com/simonw/s3-ocr

  • textract-ai

    TextractAI: Extract and process text from PDFs using Python, OpenAI API, and OCR techniques.

  • This is cool! I built something similar, but it's CLI-based: https://github.com/lifeiswilde/textract-ai

  • mitta-community

    Community repository for MittaAI users.

  • Here's an EasyOCR service: https://github.com/MittaAI/mitta-community/tree/main/service.... A PDF-to-image processor is being built and should be out in a few weeks.

    No docs, but happy to help anyone wanting to use it. Email is kord @ the company I'm working on.

  • unstract

    No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

