Python duckdb

Open-source Python projects categorized as duckdb

Top 14 Python duckdb Projects

  • sqlglot

    Python SQL Parser and Transpiler

  • Project mention: The Future of MySQL is PostgreSQL: an extension for the MySQL wire protocol | news.ycombinator.com | 2024-04-26

    This is probably referring to "zero changes to your driver code" and not "zero changes to the SQL you send over this driver".

    Translating between SQL dialects is notoriously hard: attempts to translate [1] work in 95% of cases, but the last 5% would require 5x the amount of work. That's because "SQL dialect" also includes weird edge cases, such as type inference for expressions like COALESCE(5, FALSE) and emulation of system catalogs (pg_catalog, information_schema).

    [1] https://github.com/tobymao/sqlglot
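
    The COALESCE(5, FALSE) edge case mentioned in the comment can be probed directly. The following is a minimal sketch that uses Python's built-in sqlite3 module, not sqlglot or DuckDB, to show how one engine resolves the mixed integer/boolean arguments; the helper name coalesce_result is hypothetical. SQLite treats FALSE as the integer 0 and returns the first non-NULL argument, whereas stricter engines such as PostgreSQL are understood to reject the expression with a type-mismatch error, which is the kind of difference a transpiler has to account for.

```python
import sqlite3


def coalesce_result(expr="COALESCE(5, FALSE)"):
    """Evaluate an expression in an in-memory SQLite database.

    Returns the value and the storage type SQLite assigned to it.
    Assumes the bundled SQLite is 3.23 or newer, where TRUE/FALSE
    are recognized as keywords.
    """
    conn = sqlite3.connect(":memory:")
    try:
        # typeof() reports SQLite's storage class for the result.
        row = conn.execute(f"SELECT {expr}, typeof({expr})").fetchone()
    finally:
        conn.close()
    return row


value, type_name = coalesce_result()
print(value, type_name)
```

    Running the same expression against another engine and comparing the result (or the error) is a quick way to see where dialects diverge.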

  • ibis

    the portable Python dataframe library

  • Project mention: Show HN: Hashquery, a Python library for defining reusable analysis | news.ycombinator.com | 2024-04-23

    I really don't understand the appeal of dbt vs a proper programming language. The templating approach leads to massive spaghetti. I look forward to trying out something like Ibis [0]

    0: https://ibis-project.org/

  • ingestr

    ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

  • Project mention: FLaNK 04 March 2024 | dev.to | 2024-03-04
  • splink

    Fast, accurate, scalable probabilistic data linkage

  • Project mention: Splink: Fast, accurate, scalable probabilistic data linkage | news.ycombinator.com | 2024-03-13
  • dbt-duckdb

    dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)

  • jupysql

    Better SQL in Jupyter. 📊

  • Project mention: Show HN: JupySQL – a SQL client for Jupyter (ipython-SQL successor) | news.ycombinator.com | 2023-12-06

    Hey, HN community!

    We're stoked to launch JupySQL today! JupySQL is an open-source library that brings a modern SQL experience to Jupyter. JupySQL is compatible with all major databases, such as Snowflake, Redshift, PostgreSQL, MySQL, MariaDB, DuckDB, SQL Server, Clickhouse, Trino, and more!

    To get started, check out our tutorial: https://jupysql.ploomber.io/en/latest/quick-start.html

    SQL is the de facto language for data analysis; however, analysis often requires a mix of SQL and Python. JupySQL bridges this gap, allowing users to execute SQL queries seamlessly in Jupyter and continue their analysis in Python. Add %%sql to the top of your cell and start writing SQL.

    Here are some of JupySQL's main features:

    - Syntax highlighting
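
    The SQL-then-Python workflow described in the post can be sketched outside a notebook. This is a minimal illustration that uses the standard library's sqlite3 module instead of JupySQL; in Jupyter, a %%sql cell would take the place of the explicit connection calls. The trips table, its rows, and both helper names are hypothetical.

```python
import sqlite3
from statistics import mean


def load_and_query():
    """Run the SQL part: create sample data and select it back."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE trips (city TEXT, distance_km REAL)")
    conn.executemany(
        "INSERT INTO trips VALUES (?, ?)",
        [("Oslo", 3.5), ("Oslo", 6.5), ("Lima", 10.0)],
    )
    rows = conn.execute(
        "SELECT city, distance_km FROM trips ORDER BY city, distance_km"
    ).fetchall()
    conn.close()
    return rows


def average_by_city(rows):
    """Continue in Python: group the query result and average per city."""
    grouped = {}
    for city, distance in rows:
        grouped.setdefault(city, []).append(distance)
    return {city: mean(values) for city, values in grouped.items()}


rows = load_and_query()
averages = average_by_city(rows)
print(averages)
```

    The split mirrors the notebook pattern: SQL retrieves the rows, and ordinary Python takes over for the follow-up analysis.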

  • inline-sql

    🪄 Inline SQL in any Python program

  • quack-reduce

    A playground for running duckdb as a stateless query engine over a data lake

  • Project mention: quack-reduce: duckdb as a stateless query engine over a data lake | news.ycombinator.com | 2024-01-27
  • portable-data-stack-dagster

    A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset

  • Project mention: Portable Data Stack | news.ycombinator.com | 2023-09-20
  • cuallee

    Possibly the fastest DataFrame-agnostic quality check library in town.

  • Project mention: Show HN: Snowflake Data Quality Checks in Python | news.ycombinator.com | 2024-02-11
  • talksheet

    A GPT powered CLI tool that answers questions about your data

  • icedb

    An in-process Parquet merge engine for better data warehousing in S3

  • Project mention: IceDB v2 – An in-process Parquet merge engine to build dirt-cheap OLAP | news.ycombinator.com | 2023-06-17
  • airflow-elt-blueprint

    A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.

  • bigdataeng

    A number of sample data analytics and coding tasks

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python duckdb related posts

  • quack-reduce: duckdb as a stateless query engine over a data lake

    1 project | news.ycombinator.com | 27 Jan 2024
  • JupySQL: Connecting to a SQL database from Jupyter

    1 project | /r/SQL | 9 Sep 2023
  • GitHub - ploomber/jupysql: Better SQL in Jupyter. 📊

    1 project | /r/coolgithubprojects | 6 Sep 2023
  • SQL CTE's in Jupyter notebooks, DuckDB integration and more

    1 project | /r/Jupyter | 2 Aug 2023
  • TL;DR incorporate SQL functionality within Jupyter, access to modern data processing DBs (like DuckDB), polars and data exploration through plotting easier with JupySQL.

    1 project | /r/coolgithubprojects | 2 Aug 2023
  • IceDB v2 – An in-process Parquet merge engine to build dirt-cheap OLAP

    1 project | news.ycombinator.com | 17 Jun 2023
  • A full-featured SQL client for Jupyter

    1 project | news.ycombinator.com | 7 Apr 2023

Index

What are some of the best open-source duckdb projects in Python? This list will help you:

#   Project                        Stars
1   sqlglot                        5,679
2   ibis                           4,304
3   ingestr                        2,341
4   splink                         1,104
5   dbt-duckdb                       744
6   jupysql                          611
7   inline-sql                       415
8   quack-reduce                     134
9   portable-data-stack-dagster      119
10  cuallee                          111
11  talksheet                         95
12  icedb                             91
13  airflow-elt-blueprint             44
14  bigdataeng                         0
