C++ and Python support for the CUDA Quantum programming model
Performance meets Productivity
Accelerated libraries for quantum-classical computing built on CUDA-Q
CV-CUDA™ is an open-source, GPU accelerated library
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
CUDA programming in Julia
The CUDA target for Numba
Thin, unified, C++-flavored wrappers for the CUDA APIs
CUDA Core Compute Libraries
Build an automated pipeline that converts CUDA APIs into Numba
How to optimize some algorithm in cuda
Lightning fast C++/CUDA neural network framework
A NumPy-compatible array library accelerated by CUDA
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
The best AI Aimbot for Fortnite, Valorant, CS2, R6, COD, Apex, & more
CUDA Templates for Linear Algebra Subroutines
Solve puzzles. Learn CUDA
ONNX-TensorRT: TensorRT backend for ONNX
Solving the Satoshi Puzzle
A Python framework for accelerated simulation, data generation
RandomX, KawPow, CryptoNight, AstroBWT and GhostRider unified miner
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Distributed parallelization of stencil-based GPU and CPU applications
Development repository for the Triton language and compiler
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code