Do you *really* need to store all that telemetry?

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

rkvdns

8 6 7.8 Python

DNS Proxy Server for Redis

Agree with the article enough that I did something about it which I call "Poor Fred's SIEM". The heart of it is a DNS proxy for Redis (https://github.com/m3047/rkvdns). However it's not targeted at environments where everything is in a "bubble" such that there are no ingress / egress costs. (Lookin' at you, Cloud.) Furthermore "control plane" is an important concept, and it's well understood in the industrial control world as the Purdue Model.
From a systems standpoint do you need to have all resources stored centrally in order to do centralized reporting? No, of course not. Admittedly it's handy if bandwidth and storage are free. The alternative is distributed storage, with or without summarization at the edge (and aggregating from distributed storage for reporting).
Having it distributed does raise access issues: access needs to be controlled, and management of access needs to be managed. Philosophically the Cloud solutions sell centralized management, but federation is a perfectly viable option. The choice is largely dictated by organizational structure not technology.
There is also a difference between diagnostic and evaluative indicators. Trying to evaluate from diagnostics causes fatigue because humans aren't built that way; evaluatives can and should be built from diagnostics. Diagnostics can't be built from evaluatives.
The logging/telemetry stack that I propose is:
1) Ephemeral logging at the limits of whatever observability you can build. E.g.: systemd journal with a small backing store, similar to a ring buffer.
2) Your compliance framework may require shipping some classes of events off of the local host, but I don't think any of them require shipping it to the cloud.
3) Build evaluatives locally in Redis.
4) Use DNS to query those evaluatives from elsewhere for ad hoc as well as historical purposes. This could be a centralized location or it could be true federation where each site accesses all other site's evaluatives.
I wouldn't put Redis on the internet, but I don't worry too much about DNS; and there are well-understood ways of securing DNS from tampering, unauthorized access, and even observation. By the way, DNS will handle hundreds or thousands of queries per second you just have to build for it.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Ask HN: Who has a smaller Redis DB with lots of reads and writes?

1 project | news.ycombinator.com | 9 Jun 2023
Story: Redis and Its Creator Antirez

1 project | news.ycombinator.com | 10 May 2023
rkvdns: DNS Reverse / Caching Proxy for Redis

1 project | /r/CKsTechNews | 11 Jul 2022
Reverse-engineered Shazam audio signature generator

1 project | news.ycombinator.com | 20 May 2024
Pydantic-resolve, a hierarchical solution for data fetching and processing

3 projects | news.ycombinator.com | 25 Feb 2024

Do you really need to store all that telemetry?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Asyncio dns-server proxy-server Python3 Redis
Post date: 15 Apr 2024

rkvdns

InfluxDB

Related posts

Ask HN: Who has a smaller Redis DB with lots of reads and writes?

Story: Redis and Its Creator Antirez

rkvdns: DNS Reverse / Caching Proxy for Redis

Reverse-engineered Shazam audio signature generator

Pydantic-resolve, a hierarchical solution for data fetching and processing