nerfstudio-project / gsplat
CUDA accelerated rasterization of gaussian splatting
See what the GitHub community is most excited about this month.
CUDA accelerated rasterization of gaussian splatting
CUDA Kernel Benchmarking Library
Sample codes for my CUDA programming book
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Causal depthwise conv1d in CUDA, with a PyTorch interface
NCCL Tests
A massively parallel, optimal functional runtime in Rust
cuGraph - RAPIDS Graph Analytics Library