Top 23 C++ GPU Projects
-
DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
-
FluidX3D
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL.
-
deepdetect
Deep Learning API and Server in C++14 with support for Caffe, PyTorch, TensorRT, Dlib, NCNN, TensorFlow, XGBoost and TSNE
-
CV-CUDA
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
The haughtiness is not for nothing. Since Dec 2023, they have made the lame excuse that PyTorch didn't support Python 3.12: https://github.com/taichi-dev/taichi/issues/8365#issuecommen...
Later, even when PyTorch added support for 3.12, nothing changed (so far) in Taichi.
Project mention: Does anyone else agree that the links to the latest development version of Open3D don't work? | /r/cscareerquestions | 2023-07-10
I was going to file a bug about another issue, but I have to download the development version. This is why I want this solved quickly. None of the links seem to work: https://github.com/isl-org/Open3D/issues/6259
If CPU/GPU execution speed is the goal while simultaneously code golfing the source size, https://halide-lang.org/ might have come in handy.
Another option is DALI (https://github.com/NVIDIA/DALI). For my project, while training EfficientNet2, it was a game changer. But it is way harder to implement in code than TorchVision or Kornia.
Project mention: Optimization Techniques for GPU Programming [pdf] | news.ycombinator.com | 2023-08-09
I would recommend the course from Oxford (https://people.maths.ox.ac.uk/gilesm/cuda/). Also explore the tutorial section of cutlass (https://github.com/NVIDIA/cutlass/blob/main/media/docs/cute/...) if you want to learn more about high-performance GEMM.
Project mention: Exploring Open-Source Alternatives to Landing AI for Robust MLOps | dev.to | 2023-12-13
For those seeking a lightweight solution for setting up deep learning REST APIs across platforms without the complexity of Kubernetes, Deepdetect is worth considering.
Project mention: ExecuTorch: Enabling On-Device inference for embedded devices | news.ycombinator.com | 2023-10-17
Yes, ExecuTorch is currently targeted at edge devices. The runtime is written in C++ with a 50KB binary size (without kernels) and should run on most platforms. You are right that we have not integrated with the Nvidia backend yet. Have you tried torch.compile() in PyTorch 2.0? It would do the Nvidia optimization for you without TorchScript. If you have specific binary-size or edge-specific requests, feel free to file issues at https://github.com/pytorch/executorch/issues
Project mention: An efficient C++17 GPU numerical computing library with Python-like syntax | /r/programming | 2023-10-05
Project mention: Intel Graphics Compute Runtime for OneAPI Level Zero and OpenCL | news.ycombinator.com | 2023-08-02
C++ GPU related posts
-
cuDF – GPU DataFrame Library
-
FluidX3D
-
Show HN: Flash Attention in ~100 lines of CUDA
-
Taichi: Accessible GPU programming, embedded in Python
-
Halide v17.0.0
-
Earthquake in Japan yesterday may have shifted land 1.3 meters
Index
What are some of the best open-source GPU projects in C++? This list will help you:
# | Project | Stars |
---|---|---|
1 | taichi | 24,876 |
2 | Open3D | 10,631 |
3 | cudf | 7,476 |
4 | Halide | 5,733 |
5 | meshoptimizer | 5,194 |
6 | DALI | 4,942 |
7 | MegEngine | 4,731 |
8 | cutlass | 4,681 |
9 | ArrayFire | 4,438 |
10 | cuml | 3,971 |
11 | tiny-cuda-nn | 3,473 |
12 | FluidX3D | 3,334 |
13 | heavydb | 2,911 |
14 | deepdetect | 2,500 |
15 | CV-CUDA | 2,216 |
16 | GLSL-PathTracer | 1,749 |
17 | Boost.Compute | 1,508 |
18 | rpi-vk-driver | 1,219 |
19 | marian | 1,184 |
20 | executorch | 1,335 |
21 | MatX | 1,127 |
22 | stdgpu | 1,099 |
23 | compute-runtime | 1,086 |