Deepmind Alphadev: Faster sorting algorithms discovered using deep RL

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

abseil-cpp

54 14,078 9.5 C++

Abseil Common Libraries (C++)

You can see hashing optimizations as well https://www.deepmind.com/blog/alphadev-discovers-faster-sort..., https://github.com/abseil/abseil-cpp/commit/74eee2aff683cc7d...
I was one of the members who reviewed expertly what has been done both in sorting and hashing. Overall it's more about assembly, finding missed compiler optimizations and balancing between correctness and distribution (in hashing in particular).
It was not revolutionary in a sense it hasn't found completely new approaches but converged to something incomprehensible for humans but relatively good for performance which proves the point that optimal programs are very inhuman.
Note that for instructions in sorting, removing them does not always lead to better performance, for example, instructions can run in parallel and the effect can be less profound. Benchmarks can lie and compiler could do something differently when recompiling the sort3 function which was changed. There was some evidence that the effect can come from the other side.
For hashing it was even funnier, very small strings up to 64 bit already used 3 instructions like add some constant -> multiply 64x64 -> xor upper/lower. For bigger ones the question becomes more complicated, that's why 9-16 was a better spot and it simplified from 2 multiplications to just one and a rotation. Distribution on real workloads was good, it almost passed smhasher and we decided it was good enough to try out in prod. We did not rollback as you can see from abseil :)
But even given all that, it was fascinating to watch how this system was searching and was able to find particular programs can be further simplified. Kudos to everyone involved, it's a great incremental change that can bring more results in the future.

Halide

43 5,733 9.4 C++

a language for fast, portable data-parallel computation

It is not the sorting per-se which was improved here, but sorting (particularly short sequences) on modern CPUs with really the complexity being on the difficulty of predicting what will work quickly on these modern CPUs.
Doing an empirical algorithm search to find which algorithms fit well on modern CPUs/memory systems is pretty common, see e.g. FFTW, ATLAS, https://halide-lang.org/

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Show HN: Flash Attention in ~100 lines of CUDA

2 projects | news.ycombinator.com | 16 Mar 2024
Halide v17.0.0

1 project | news.ycombinator.com | 1 Feb 2024
Implementing Mario's Stack Blur 15 times in C++ (with tests and benchmarks)

1 project | news.ycombinator.com | 10 Nov 2023
Blog Post: Can You Trust a Compiler to Optimize Your Code?

1 project | /r/rust | 9 Apr 2023
Halide – a language for fast, portable computation on images and tensors

1 project | news.ycombinator.com | 16 Jan 2023

Deepmind Alphadev: Faster sorting algorithms discovered using deep RL

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Frameworks halide hexagon Compiler DSL
Post date: 7 Jun 2023

abseil-cpp

Halide

InfluxDB

Related posts

Show HN: Flash Attention in ~100 lines of CUDA

Halide v17.0.0

Implementing Mario's Stack Blur 15 times in C++ (with tests and benchmarks)

Blog Post: Can You Trust a Compiler to Optimize Your Code?

Halide – a language for fast, portable computation on images and tensors

Deepmind Alphadev: Faster sorting algorithms discovered using deep RL

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Frameworks halide hexagon Compiler DSL Post date: 7 Jun 2023

abseil-cpp

Halide

InfluxDB

Related posts

Show HN: Flash Attention in ~100 lines of CUDA

Halide v17.0.0

Implementing Mario's Stack Blur 15 times in C++ (with tests and benchmarks)

Blog Post: Can You Trust a Compiler to Optimize Your Code?

Halide – a language for fast, portable computation on images and tensors

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Frameworks halide hexagon Compiler DSL
Post date: 7 Jun 2023