-
neosync
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I did a similar tool a few years ago: https://github.com/ClickHouse/ClickHouse/tree/master/program...
Its idea is to avoid custom models for various data (email, first name, address), but to create models on the fly. It requires some balance between quality and speed.
Overall, it was an interesting project for me, and we use it in practice for testing till today. I will take a look at Neosync...
Related posts
-
We Built a 19 PiB Logging Platform with ClickHouse and Saved Millions
-
1 billion rows challenge in PostgreSQL and ClickHouse
-
We Executed a Critical Supply Chain Attack on PyTorch
-
Tell HN: Hacker News dataset on BigQuery hasn't been updated since Nov 2022
-
Real-Time Data Enrichment and Analytics With RisingWave and ClickHouse