Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
RandomX, KawPow, CryptoNight, AstroBWT and GhostRider unified miner
Geometric deep learning extension library for PyTorch
Clean and efficient FP8 GEMM kernels with fine-grained scaling
An experimental version of DeepSeek model
Face recognition with deep neural networks
BCI: Breast Cancer Immunohistochemical Image Generation
Tiny Face Detector, CVPR 2017
CUDA library for continuous optimization and light field analysis
CUDA-Quicksort: A GPU-based implementation of the quicksort algorithm
CUDA SVM training benchmark