cuda-z free download - SourceForge

Showing 1087 open source projects for "cuda-z"

View related business solutions

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

CUDA-Q

C++ and Python support for the CUDA Quantum programming model

CUDA-Q is an open-source platform for developing hybrid quantum-classical applications using a unified programming model across CPUs, GPUs, and quantum processing units. It provides a full toolchain that includes compilers, runtimes, and libraries for writing quantum programs in both C++ and Python. The platform is designed to be hardware-agnostic, allowing developers to run applications on different quantum backends or simulate them efficiently using GPU acceleration when physical quantum hardware is unavailable. ...

Downloads: 3 This Week

Last Update: 2026-03-18
See Project
2

CUDA Python

Performance meets Productivity

CUDA Python is a unified Python interface for accessing and working with the NVIDIA CUDA platform, enabling developers to build GPU-accelerated applications entirely in Python. It acts as a metapackage composed of multiple submodules that provide both high-level and low-level access to CUDA functionality, including runtime APIs, driver APIs, and JIT compilation tools.

Downloads: 2 This Week

Last Update: 3 days ago
See Project
3

Z-Image

Image generation model with single-stream diffusion transformer

...The project includes several variants: Z-Image-Turbo, a distilled version optimized for speed and low resource consumption; Z-Image-Base, the full-capacity foundation model; and Z-Image-Edit, fine-tuned for image editing tasks. Despite its compact size, Z-Image produces outputs that closely rival those from much larger models — including strong rendering of bilingual (English and Chinese) text inside images, accurate prompt adherence, and good layout and composition.

Downloads: 32 This Week

Last Update: 2026-02-09
See Project
4

CUDA-QX

Accelerated libraries for quantum-classical computing built on CUDA-Q

CUDA-QX is a collection of accelerated libraries built on top of the CUDA-Q platform, designed to enable rapid development of hybrid quantum-classical applications. It extends the CUDA-Q programming model by providing optimized implementations of domain-specific quantum computing primitives and workflows. The libraries are intended to help researchers and developers leverage GPUs, CPUs, and quantum processing units together in a unified computational model.

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

CV-CUDA

CV-CUDA™ is an open-source, GPU accelerated library

CV-CUDA is an open-source project that enables building efficient cloud-scale Artificial Intelligence (AI) imaging and computer vision (CV) applications. It uses graphics processing unit (GPU) acceleration to help developers build highly efficient pre- and post-processing pipelines. CV-CUDA originated as a collaborative effort between NVIDIA and ByteDance.

Downloads: 1 This Week

Last Update: 2025-11-15
See Project
6

CUDA Agent

Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

CUDA Agent is a research-driven agentic reinforcement learning system designed to automatically generate and optimize high-performance CUDA kernels for GPU workloads. The project addresses the long-standing challenge that efficient CUDA programming typically requires deep hardware expertise by training an autonomous coding agent capable of iterative improvement through execution feedback.

Downloads: 0 This Week

Last Update: 2026-03-03
See Project
7

Z-BlogPHP

Z-BlogPHP blog program

Z-BlogPHP is a blog program provided by the Z-Blog community and has been committed to providing excellent blog writing experience to domestic users. The first edition has been released since 2005 and has a history of 18 years. It is one of the few open-source CMS systems that continue to provide updates in China. Our goal is to immerse users in writing and record life, without paying attention to cumbersome settings, etc., and let users focus on creation.

Downloads: 3 This Week

Last Update: 2026-02-28
See Project
8

CUDA.jl

CUDA programming in Julia

High-performance GPU programming in a high-level language. JuliaGPU is a GitHub organization created to unify the many packages for programming GPUs in Julia. With its high-level syntax and flexible compiler, Julia is well-positioned to productively program hardware accelerators like GPUs without sacrificing performance. The latest development version of CUDA.jl requires Julia 1.8 or higher. If you are using an older version of Julia, you need to use a previous version of CUDA.jl. This will...

Downloads: 1 This Week

Last Update: 2026-03-13
See Project
9

z.lua

A new cd command that helps you navigate faster

z.lua is a faster way to navigate your filesystem. It tracks your most used directories, based on 'frecency'. After a short learning phase, z will take you to the most 'recent' directory that matches ALL of the regexes given on the command line, in order. Available for posix shells, bash, zsh, dash, sh, ash, ksh, busybox and etc. Available for Fish Shell, Power Shell and Windows cmd. An enhanced matching algorithm takes you to where ever you want precisely. Allow updating database only if $PWD changed with "$_ZL_ADD_ONCE" set to 1. ...

Downloads: 1 This Week

Last Update: 2026-03-09
See Project
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
10

Numba CUDA Target

The CUDA target for Numba

Numba CUDA Target is NVIDIA’s maintained CUDA backend for the Numba JIT compiler, enabling developers to write GPU-accelerated code directly in Python. It allows users to define CUDA kernels using Python syntax, which are then compiled into efficient GPU code at runtime using LLVM-based toolchains. This approach significantly lowers the barrier to entry for GPU programming by eliminating the need to write CUDA C++ while still delivering high performance. ...

Downloads: 2 This Week

Last Update: 2026-03-18
See Project
11

CUDA API Wrappers

Thin, unified, C++-flavored wrappers for the CUDA APIs

...In a nutshell - making CUDA API work more fun.

Downloads: 0 This Week

Last Update: 2026-02-09
See Project
12

A to Z of Networking for DevOps

Learn Networking from A to Z at one place with realtime examples

The A to Z of Networking for DevOps repository is a comprehensive, beginner-friendly guide designed to help DevOps practitioners and anyone new to networking understand core concepts through practical explanations and real-world examples. It structures networking fundamentals from basic IP addressing and subnetting all the way through modern container and cloud networking topics like Docker and Kubernetes, making it valuable for learners at different stages of their DevOps journey. ...

Downloads: 0 This Week

Last Update: 2026-01-27
See Project
13

Numbast

Build an automated pipeline that converts CUDA APIs into Numba

Numbast is an automated toolchain that bridges CUDA C++ and Python by generating Numba-compatible bindings directly from CUDA header files. Its primary goal is to eliminate the manual effort required to expose CUDA libraries to Python, enabling developers to use GPU-accelerated functionality in Python environments more easily. The system parses CUDA C++ declarations and converts them into Python bindings that can be used within Numba, allowing seamless integration with Python-based GPU workflows. ...

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
14

CUDA Core Compute Libraries (CCCL)

CUDA Core Compute Libraries

CCCL, or CUDA Core Compute Libraries, is a unified repository that consolidates several foundational CUDA C++ libraries into a single, cohesive development platform. It brings together Thrust, CUB, and libcudacxx, which collectively provide high-level abstractions, low-level performance primitives, and a CUDA-compatible standard library for GPU programming.

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
15

stable-diffusion.cpp

Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference

...It enables text-to-image and image-to-image generation, supports a growing set of models like SD1.x, SD2.x, SDXL, SD-Turbo, Qwen Image, and more, and is continually updated with support for cutting-edge model variants including video and image editing models. The project is built on the ggml backend, which allows efficient execution on CPUs and GPUs via backends like CUDA, Vulkan, Metal, OpenCL, and SYCL, making it suitable for everything from desktops to mobile devices. It includes options for ControlNet, LoRA models, upscaling via ESRGAN, and advanced sampling techniques, giving developers and users a rich toolkit for creative workflows.

Downloads: 40 This Week

Last Update: 2026-03-16
See Project
16

how-to-optim-algorithm-in-cuda

How to optimize some algorithm in cuda

how-to-optim-algorithm-in-cuda is an open educational repository focused on teaching developers how to optimize algorithms for high-performance execution on GPUs using CUDA. The project combines technical notes, code examples, and practical experiments that demonstrate how common computational kernels can be optimized to improve speed and memory efficiency. Instead of presenting only theoretical explanations, the repository includes hand-written CUDA implementations of fundamental operations such as reductions, element-wise computations, softmax, and attention mechanisms. ...

Downloads: 0 This Week

Last Update: 4 days ago
See Project
17

Tiny CUDA Neural Networks

Lightning fast C++/CUDA neural network framework

...It will likely only work on an RTX 3090, an RTX 2080 Ti, or high-end enterprise GPUs. Lower-end cards must reduce the n_neurons parameter or use the CutlassMLP (better compatibility but slower) instead. tiny-cuda-nn comes with a PyTorch extension that allows using the fast MLPs and input encodings from within a Python context. These bindings can be significantly faster than full Python implementations; in particular for the multiresolution hash encoding.

Downloads: 0 This Week

Last Update: 2025-07-08
See Project
18

CUDA Containers for Edge AI & Robotics

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

CUDA Containers for Edge AI & Robotics is an open-source project that provides a modular container build system designed for running machine learning and AI workloads on NVIDIA Jetson devices. The repository contains container configurations that package the latest AI frameworks and dependencies optimized for Jetson hardware. These containers simplify the deployment of complex machine learning environments by bundling libraries such as CUDA, TensorRT, and deep learning frameworks into reproducible container images. ...

Downloads: 0 This Week

Last Update: 3 days ago
See Project
19

CuPy

A NumPy-compatible array library accelerated by CUDA

CuPy is an open source implementation of NumPy-compatible multi-dimensional array accelerated with NVIDIA CUDA. It consists of cupy.ndarray, a core multi-dimensional array class and many functions on it. CuPy offers GPU accelerated computing with Python, using CUDA-related libraries to fully utilize the GPU architecture. According to benchmarks, it can even speed up some operations by more than 100X. CuPy is highly compatible with NumPy, serving as a drop-in replacement in most cases. ...

Downloads: 0 This Week

Last Update: 2026-02-20
See Project
20

WSJT-Z

WSJT-X fork by SQ9FVE

WSJT-Z is a modified version of the WSJT-X software by Joe Taylor K1JT (https://sourceforge.net/projects/wsjt/). VISIT OUR GROUPS.IO PAGE FOR ANY COMMENTS / QUESTIONS / BUG REPORTS: https://groups.io/g/WSJT-Z/topics The CHANGELOG is available on the Files page: https://sourceforge.net/projects/wsjt-z/files/. Initially developed as an automation project, WSJT-Z now focuses on enhancing the functionality of the original software.

44 Reviews

Downloads: 517 This Week

Last Update: 2025-01-10
See Project
21

AIMr

The best AI Aimbot for Fortnite, Valorant, CS2, R6, COD, Apex, & more

...AIMr also provides visual customization options like field-of-view displays and detection indicators, allowing players to tailor their interface. The system is compatible with games that use human-shaped models, and although it functions effectively out of the box, optimizing it with CUDA-accelerated OpenCV is recommended for maximum performance.

1 Review

Downloads: 233 This Week

Last Update: 2025-08-31
See Project
22

CPU-Z

Utility for detailed CPU and system hardware information.

CPU-Z is fully supported on Windows® 11.

1 Review

Downloads: 253 This Week

Last Update: 2025-10-10
See Project
23

CUTLASS

CUDA Templates for Linear Algebra Subroutines

CUTLASS is a collection of CUDA C++ template abstractions for implementing high-performance matrix-multiplication (GEMM) and related computations at all levels and scales within CUDA. It incorporates strategies for hierarchical decomposition and data movement similar to those used to implement cuBLAS and cuDNN. CUTLASS decomposes these "moving parts" into reusable, modular software components abstracted by C++ template classes.

Downloads: 2 This Week

Last Update: 2026-03-17
See Project
24

KeyKiller-Cuda

Solving the Satoshi Puzzle

KeyKiller is a GPU-accelerated version of the KeyKiller project, designed to achieve extreme performance in solving Satoshi Nakamoto's puzzles using modern NVIDIA GPUs. KeyKiller CUDA pushes the limits of cryptographic key search performance by leveraging CUDA, thread-beam parallelism, and batch EC operations. The command-line version is open-source and free to use. For the paid advanced graphics version, please visit: https://gitlab.com/8891689/KeyKiller-Cuda/

Downloads: 8 This Week

Last Update: 2025-12-13
See Project
25

GPU Puzzles

Solve puzzles. Learn CUDA

...Instead of presenting traditional lecture-style explanations, the project immerses learners directly in hands-on programming tasks that demonstrate how GPU computation works. The exercises are implemented using Python with the Numba CUDA interface, which allows Python code to compile into GPU kernels that run on CUDA-enabled hardware. By solving progressively more complex puzzles, learners gain a practical understanding of how parallel algorithms operate on graphics processing units. The project emphasizes experimentation and problem solving, encouraging learners to discover GPU programming techniques through trial and exploration. ...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project