SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python Parser Projects
-
Scout Monitoring
Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
-
Lark
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
-
oletools
oletools - python tools to analyze MS OLE2 files (Structured Storage, Compound File Binary Format) and MS Office documents, for malware analysis, forensics and debugging.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
rdflib
RDFLib is a Python library for working with RDF, a simple yet powerful language for representing information.
-
msgspec
A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
-
python-user-agents
A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.
-
cinemagoer
Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb (to which we are not affiliated in any way) movie database about movies, people, characters and companies
-
Construct
Construct: Declarative data structures for python that allow symmetric parsing and building
-
guessit
GuessIt is a python library that extracts as much information as possible from a video filename.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
First, note the method prefix_allowed_tokens_fn. This method applies a Pydantic model to constrain/guide how the LLM generates tokens. Next, see how that constrain can be applied to txtai's LLM pipeline.
Project mention: The Future of MySQL is PostgreSQL: an extension for the MySQL wire protocol | news.ycombinator.com | 2024-04-26This is probably referring to "zero changes to your driver code" and not "zero changes to the SQL you send over this driver".
Translating between SQL dialects is notoriously hard and attempts to translate [1] are working in 95% of cases. But the last 5% would require 5x amount of work. That's because "SQL dialect" also includes weird edge cases of type inference of things like COALESCE(5, FALSE) and emulation of system catalogs (pg_catalog, information_schema).
[1] https://github.com/tobymao/sqlglot
I love to use PDFMiner and PDFQuery for this https://github.com/pdfminer/pdfminer.six https://towardsdatascience.com/scrape-data-from-pdf-files-using-python-and-pdfquery-d033721c3b28
Project mention: Show HN: I wrote a RDBMS (SQLite clone) from scratch in pure Python | news.ycombinator.com | 2023-08-13Lark supports, and recommends, writing and storing the grammar in a .lark file. We have syntax highlighting support in all major IDEs, and even in github itself. For example, here is Lark's built-in grammar for Python: https://github.com/lark-parser/lark/blob/master/lark/grammar...
You can also test grammars "live" in our online IDE: https://www.lark-parser.org/ide/
The rationale is that it's more terse and has less visual clutter than a DSL over Python, which makes it easier to read and write.
Project mention: Show HN: Databasediagram.com – Private, Text to Entity-Relationship Diagram Tool | news.ycombinator.com | 2023-06-08Suggest checking out the sqlparse library for a way to do the different flavours without needing to address each case directly: https://github.com/andialbrecht/sqlparse
Project mention: Osint update of the Snoop Project tool search for user by nickname | news.ycombinator.com | 2024-01-02
Project mention: Htmx, Rust and Shuttle: A New Rapid Prototyping Stack | news.ycombinator.com | 2023-11-01
Python Parser related posts
-
The Future of MySQL is PostgreSQL: an extension for the MySQL wire protocol
-
Advanced RAG with guided generation
-
SQLGlot: No-dependency SQL parser, transpiler, optimizer for 21 SQL dialects
-
Ship Faster by Organising Less
-
SQLglot: Python SQL Parser and Transpiler
-
utype VS pydantic - a user suggested alternative
2 projects | 15 Feb 2024 -
100% User-Supported
-
A note from our sponsor - SaaSHub
www.saashub.com | 2 Jun 2024
Index
What are some of the best open-source Parser projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | pydantic | 19,167 |
2 | sqlglot | 5,778 |
3 | pdfminer.six | 5,534 |
4 | Lark | 4,534 |
5 | sqlparse | 3,605 |
6 | phonenumbers | 3,423 |
7 | oletools | 2,770 |
8 | PLY | 2,714 |
9 | snoop | 2,739 |
10 | rdflib | 2,091 |
11 | m3u8 | 1,950 |
12 | msgspec | 1,939 |
13 | typeguard | 1,458 |
14 | strictyaml | 1,424 |
15 | python-user-agents | 1,418 |
16 | cinemagoer | 1,200 |
17 | ViperMonkey | 1,024 |
18 | Construct | 890 |
19 | guessit | 808 |
20 | textX | 756 |
21 | wiktextract | 724 |
22 | mwparserfromhell | 710 |
23 | crossplane | 684 |
Sponsored