Top 23 C++ GPU Projects

taichi

38 24,876 8.6 C++

Productive, portable, and performant GPU programming in Python.

Project mention: CERN Root | news.ycombinator.com | 2024-06-01

The haughtiness is not for nothing. Since Dec 2023, they made a lame excuse that Pytorch didn't support 3.12: https://github.com/taichi-dev/taichi/issues/8365#issuecommen...
Later, even when Pytorch added support for 3.12, nothing changed (so far) in Taichi.

Open3D

11 10,631 8.6 C++

Open3D: A Modern Library for 3D Data Processing

Project mention: Does anyone else agree that the links to the latest development version of Open3D don't work? | /r/cscareerquestions | 2023-07-10

I was going to file a bug about another issue, but I have to download the development version. This is why I want this solved quickly. None of the links seem to work: https://github.com/isl-org/Open3D/issues/6259

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
cudf

25 7,476 9.9 C++

cuDF - GPU DataFrame Library

Project mention: cuDF – GPU DataFrame Library | news.ycombinator.com | 2024-06-02

Halide

43 5,733 9.4 C++

a language for fast, portable data-parallel computation

Project mention: Show HN: Flash Attention in ~100 lines of CUDA | news.ycombinator.com | 2024-03-16

If CPU/GPU execution speed is the goal while simultaneously code golfing the source size, https://halide-lang.org/ might have come in handy.

meshoptimizer

12 5,194 9.2 C++

Mesh optimization library that makes meshes smaller and faster to render
DALI

5 4,942 9.6 C++

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Project mention: [D] Will data augmentations work faster on TPUs? | /r/MachineLearning | 2023-12-07

Another option is DALI https://github.com/NVIDIA/DALI For my project while training EfficientNet2, it was a game changer. But it a way harder to implement in code than TorchVision or Kornia.

MegEngine

5 4,731 8.7 C++

MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
cutlass

16 4,681 8.7 C++

CUDA Templates for Linear Algebra Subroutines

Project mention: Optimization Techniques for GPU Programming [pdf] | news.ycombinator.com | 2023-08-09

I would recommend the course from Oxford (https://people.maths.ox.ac.uk/gilesm/cuda/). Also explore the tutorial section of cutlass (https://github.com/NVIDIA/cutlass/blob/main/media/docs/cute/...) if you want to learn more about high performance gemm.

ArrayFire

6 4,438 7.1 C++

ArrayFire: a general purpose GPU library.
cuml

10 3,971 9.3 C++

cuML - RAPIDS Machine Learning Library

Project mention: FLaNK Stack Weekly for 13 November 2023 | dev.to | 2023-11-13

tiny-cuda-nn

9 3,473 5.6 C++

Lightning fast C++/CUDA neural network framework
FluidX3D

53 3,334 8.7 C++

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL.

Project mention: FluidX3D | news.ycombinator.com | 2024-03-24

heavydb

1 2,911 7.6 C++

HeavyDB (formerly OmniSciDB)
deepdetect

4 2,500 6.7 C++

Deep Learning API and Server in C++14 support for Caffe, PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE

Project mention: Exploring Open-Source Alternatives to Landing AI for Robust MLOps | dev.to | 2023-12-13

For those seeking a lightweight solution for setting up deep learning REST APIs across platforms without the complexity of Kubernetes, Deepdetect is worth considering.

CV-CUDA

1 2,216 5.5 C++

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
GLSL-PathTracer

1 1,749 3.1 C++

A toy physically based GPU path tracer (C++/OpenGL/GLSL)
Boost.Compute

0 1,508 0.0 C++

A C++ GPU Computing Library for OpenCL
rpi-vk-driver

3 1,219 0.0 C++

VK driver for the Raspberry Pi (Broadcom Videocore IV)
marian

3 1,184 0.0 C++

Fast Neural Machine Translation in C++
executorch

2 1,335 10.0 C++

On-device AI across mobile, embedded and edge for PyTorch

Project mention: ExecuTorch: Enabling On-Device interference for embedded devices | news.ycombinator.com | 2023-10-17

Yes ExecuTorch is currently targeted at Edge devices. The runtime is written in C++ with 50KB binary size (without kernels) and should run in most of platforms. You are right that we have not integrated to Nvidia backend yet. Have you tried torch.compile() in PyTorch 2.0? It would do the Nvidia optimization for you without Torchscript. If you have specific binary size or edge specific request, feel free to file issues in https://github.com/pytorch/executorch/issues

MatX

7 1,127 9.2 C++

An efficient C++17 GPU numerical computing library with Python-like syntax

Project mention: An efficient C++17 GPU numerical computing library with Python-like syntax | /r/programming | 2023-10-05

stdgpu

0 1,099 7.1 C++

stdgpu: Efficient STL-like Data Structures on the GPU
compute-runtime

58 1,086 10.0 C++

Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver

Project mention: Intel Graphics Compute Runtime for OneAPI Level Zero and OpenCL | news.ycombinator.com | 2023-08-02

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

C++ GPU related posts

cuDF – GPU DataFrame Library

1 project | news.ycombinator.com | 2 Jun 2024
CuDF – GPU DataFrame Library

1 project | news.ycombinator.com | 1 Jun 2024
FluidX3D

1 project | news.ycombinator.com | 24 Mar 2024
Show HN: Flash Attention in ~100 lines of CUDA

2 projects | news.ycombinator.com | 16 Mar 2024
Taichi: Accessible GPU programming, embedded in Python

1 project | news.ycombinator.com | 11 Mar 2024
Halide v17.0.0

1 project | news.ycombinator.com | 1 Feb 2024
Earthquake in Japan yesterday may have shifted land 1.3 meters

1 project | news.ycombinator.com | 2 Jan 2024
A note from our sponsor - InfluxDB
www.influxdata.com | 3 Jun 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source GPU projects in C++? This list will help you:

	Project	Stars
1	taichi	24,876
2	Open3D	10,631
3	cudf	7,476
4	Halide	5,733
5	meshoptimizer	5,194
6	DALI	4,942
7	MegEngine	4,731
8	cutlass	4,681
9	ArrayFire	4,438
10	cuml	3,971
11	tiny-cuda-nn	3,473
12	FluidX3D	3,334
13	heavydb	2,911
14	deepdetect	2,500
15	CV-CUDA	2,216
16	GLSL-PathTracer	1,749
17	Boost.Compute	1,508
18	rpi-vk-driver	1,219
19	marian	1,184
20	executorch	1,335
21	MatX	1,127
22	stdgpu	1,099
23	compute-runtime	1,086

C++ GPU

Top 23 C++ GPU Projects

C++ GPU related posts

cuDF – GPU DataFrame Library

CuDF – GPU DataFrame Library

FluidX3D

Show HN: Flash Attention in ~100 lines of CUDA

Taichi: Accessible GPU programming, embedded in Python

Halide v17.0.0

Earthquake in Japan yesterday may have shifted land 1.3 meters

Index