Python Parser

Open-source Python projects categorized as Parser

Top 23 Python Parser Projects

  • pydantic

    Data validation using Python type hints

  • Project mention: Advanced RAG with guided generation | dev.to | 2024-04-18

    First, note the method prefix_allowed_tokens_fn. This method applies a Pydantic model to constrain/guide how the LLM generates tokens. Next, see how that constrain can be applied to txtai's LLM pipeline.

  • sqlglot

    Python SQL Parser and Transpiler

  • Project mention: The Future of MySQL is PostgreSQL: an extension for the MySQL wire protocol | news.ycombinator.com | 2024-04-26

    This is probably referring to "zero changes to your driver code" and not "zero changes to the SQL you send over this driver".

    Translating between SQL dialects is notoriously hard and attempts to translate [1] are working in 95% of cases. But the last 5% would require 5x amount of work. That's because "SQL dialect" also includes weird edge cases of type inference of things like COALESCE(5, FALSE) and emulation of system catalogs (pg_catalog, information_schema).

    [1] https://github.com/tobymao/sqlglot

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • pdfminer.six

    Community maintained fork of pdfminer - we fathom PDF

  • Project mention: Code to extract text from pdf to excel | /r/Python | 2023-06-02

    I love to use PDFMiner and PDFQuery for this https://github.com/pdfminer/pdfminer.six https://towardsdatascience.com/scrape-data-from-pdf-files-using-python-and-pdfquery-d033721c3b28

  • Lark

    Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

  • Project mention: Show HN: I wrote a RDBMS (SQLite clone) from scratch in pure Python | news.ycombinator.com | 2023-08-13

    Lark supports, and recommends, writing and storing the grammar in a .lark file. We have syntax highlighting support in all major IDEs, and even in github itself. For example, here is Lark's built-in grammar for Python: https://github.com/lark-parser/lark/blob/master/lark/grammar...

    You can also test grammars "live" in our online IDE: https://www.lark-parser.org/ide/

    The rationale is that it's more terse and has less visual clutter than a DSL over Python, which makes it easier to read and write.

  • sqlparse

    A non-validating SQL parser module for Python

  • Project mention: Show HN: Databasediagram.com – Private, Text to Entity-Relationship Diagram Tool | news.ycombinator.com | 2023-06-08

    Suggest checking out the sqlparse library for a way to do the different flavours without needing to address each case directly: https://github.com/andialbrecht/sqlparse

  • phonenumbers

    Python port of Google's libphonenumber

  • oletools

    oletools - python tools to analyze MS OLE2 files (Structured Storage, Compound File Binary Format) and MS Office documents, for malware analysis, forensics and debugging.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • PLY

    Python Lex-Yacc

  • snoop

    Snoop — инструмент разведки на основе открытых данных (OSINT world)

  • Project mention: Osint update of the Snoop Project tool search for user by nickname | news.ycombinator.com | 2024-01-02
  • rdflib

    RDFLib is a Python library for working with RDF, a simple yet powerful language for representing information.

  • m3u8

    Python m3u8 Parser for HTTP Live Streaming (HLS) Transmissions

  • msgspec

    A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML

  • Project mention: Htmx, Rust and Shuttle: A New Rapid Prototyping Stack | news.ycombinator.com | 2023-11-01
  • typeguard

    Run-time type checker for Python

  • strictyaml

    Type-safe YAML parser and validator.

  • Project mention: StrictYAML | news.ycombinator.com | 2023-09-29
  • python-user-agents

    A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.

  • cinemagoer

    Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb (to which we are not affiliated in any way) movie database about movies, people, characters and companies

  • ViperMonkey

    A VBA parser and emulation engine to analyze malicious macros.

  • Construct

    Construct: Declarative data structures for python that allow symmetric parsing and building

  • guessit

    GuessIt is a python library that extracts as much information as possible from a video filename.

  • textX

    Domain-Specific Languages and parsers in Python made easy http://textx.github.io/textX/

  • wiktextract

    Wiktionary dump file parser and multilingual data extractor

  • mwparserfromhell

    A Python parser for MediaWiki wikicode

  • Project mention: FLaNK AI Weekly for 29 April 2024 | dev.to | 2024-04-29
  • crossplane

    Quick and reliable way to convert NGINX configurations into JSON and back. (by nginxinc)

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Parser related posts

Index

What are some of the best open-source Parser projects in Python? This list will help you:

Project Stars
1 pydantic 19,167
2 sqlglot 5,778
3 pdfminer.six 5,534
4 Lark 4,534
5 sqlparse 3,605
6 phonenumbers 3,423
7 oletools 2,770
8 PLY 2,714
9 snoop 2,739
10 rdflib 2,091
11 m3u8 1,950
12 msgspec 1,939
13 typeguard 1,458
14 strictyaml 1,424
15 python-user-agents 1,418
16 cinemagoer 1,200
17 ViperMonkey 1,024
18 Construct 890
19 guessit 808
20 textX 756
21 wiktextract 724
22 mwparserfromhell 710
23 crossplane 684

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com