Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide

This page summarizes the projects mentioned and recommended in the original post on dev.to.

  • tesseract-ocr

    Tesseract Open Source OCR Engine (main repository)

  • PyTesseract Module [GitHub], EasyOCR Module [GitHub], PaddlePaddle OCR [GitHub]

  • EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.

  • PaddleOCR

    Multilingual OCR toolkit based on PaddlePaddle: a practical, ultra-lightweight OCR system that recognizes 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices.

  • unilm

    Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

  • LayoutLM v1, v2 and v3 models [GitHub], DocBERT [GitHub]

  • hedwig

    PyTorch deep learning models for document classification (by castorini)

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives, so a higher number indicates a more popular project.

Related posts

  • I built an online PDF management platform using open-source software

    4 projects | news.ycombinator.com | 12 May 2024
  • Finding an dictionary key value in an image or on an screen.

    2 projects | /r/learnpython | 2 Apr 2021
  • Multimodal AI: Bridging the Gap Between Human and Machine Understanding

    1 project | dev.to | 14 May 2024
  • Highlighting Image Text

    1 project | dev.to | 30 Apr 2024
  • one of the Codia AI Design technologies: OCR Technology

    1 project | dev.to | 14 Feb 2024