PeerDB Streams – Simple, Native Postgres Change Data Capture

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • peerdb

    A fast, simple, and cost-effective tool to replicate data from Postgres to data warehouses, queues, and storage

  • Hello HN, I am Sai Srirampur, one of the Co-founders of PeerDB. (https://github.com/PeerDB-io/peerdb)

    We spent the past 7 months building a solid experience to replicate data from Postgres to Data Warehouses. Now, we want to expand to Queues. We are excited to announce PeerDB Streams to provide a simple and native way to replicate changes as they happen in Postgres to Queues (Kafka, Redpanda, Google PubSub, etc). We use Postgres logical decoding to enable Change Data Capture (CDC).

    We chose queues because many users found existing tools complex. Debezium is the most widely used tool for this use case and runs in many production deployments. However, a common pain point among our users is its steep learning curve: it can take months to productionize.

    A few issues are: a) Interacting through a command-line interface, understanding the various settings, and learning best practices for running it in production are not trivial. Debezium UI, released to address usability concerns [1], is still in an incubating state [2]. Additionally, reading Debezium resources to get started can be overwhelming [3].
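
To make the idea of replicating changes "as they happen" to a queue more concrete, here is a minimal Python sketch. It is not PeerDB's implementation: the `ChangeEvent` shape, the `publish_changes` helper, and the in-memory `queue.Queue` standing in for a Kafka or Redpanda topic are all assumptions made for illustration. It only shows the general pattern of ordering row-level changes by their log position and publishing them as serialized messages.

```python
import json
import queue
from dataclasses import dataclass, asdict


@dataclass
class ChangeEvent:
    """One row-level change (hypothetical shape, not PeerDB's wire format)."""
    table: str
    op: str    # "insert", "update", or "delete"
    lsn: int   # position in the write-ahead log, used here for ordering
    row: dict


def publish_changes(events, topic):
    """Serialize each change to JSON and put it on the topic in LSN order."""
    for event in sorted(events, key=lambda e: e.lsn):
        topic.put(json.dumps(asdict(event), sort_keys=True))


# In-memory stand-in for a real queue such as a Kafka or Redpanda topic.
topic = queue.Queue()

publish_changes(
    [
        ChangeEvent("orders", "update", 102, {"id": 1, "status": "paid"}),
        ChangeEvent("orders", "insert", 101, {"id": 1, "status": "new"}),
    ],
    topic,
)

# Consumers see the insert before the update, matching the log order.
first = json.loads(topic.get())
```

A real pipeline would read these events from a Postgres replication slot and publish them with a queue client library; both steps are omitted here so the sketch stays self-contained.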

  • examples

    Example scripts for PeerDB (by PeerDB-io)

  • Yes, that is doable through a very simple Lua-based row-level transformation. :) Here is an example transformation that adds an additional field: https://github.com/PeerDB-io/examples/blob/main/usdcad.lua . More details on row-level transformations are planned to be released later this week. :)
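
As a rough illustration of what a row-level transformation that adds a field does, here is a small Python sketch. It is not the linked Lua script and does not use PeerDB's transformation API: the function name `add_cad_price`, the `price_usd` and `price_cad` field names, and the fixed exchange rate are hypothetical and chosen only for the example.

```python
def add_cad_price(row, usd_to_cad=1.25):
    """Return a copy of the row with an extra, derived field.

    The field names and the fixed rate are assumptions for illustration.
    """
    out = dict(row)  # copy so the source row is left untouched
    out["price_cad"] = round(row["price_usd"] * usd_to_cad, 2)
    return out


source_row = {"id": 7, "price_usd": 10.0}
transformed = add_cad_price(source_row)
```

Returning a new row instead of mutating the input keeps the transformation free of side effects, which makes it easy to test in isolation.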

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Benchmarking Postgres Replication: PeerDB vs. Airbyte

    1 project | news.ycombinator.com | 10 Oct 2023
  • Pgwire: a Rust library for PostgreSQL compatible application

    2 projects | news.ycombinator.com | 20 Mar 2024
  • Launch HN: PeerDB (YC S23) – Fast, Native ETL/ELT for Postgres

    2 projects | news.ycombinator.com | 27 Jul 2023
  • Data Engineering Tools in Go

    2 projects | /r/dataengineering | 18 May 2023
  • Test On 4 Concurrent Jobs Using Python-Polars 0.17.11 to GroupBy Billion Rows

    3 projects | /r/Python | 7 May 2023