A library for accelerating Transformer models on NVIDIA GPUs
Learn How LLM Transformer Models Work with Interactive Visualization
Tool for exploring and debugging transformer model behaviors
Implementation of Vision Transformer, a simple way to achieve SOTA
Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Fast inference engine for Transformer models
Julia Implementation of Transformer models
Ongoing research training transformer models at scale
RF-DETR is a real-time object detection and segmentation
Image generation model with single-stream diffusion transformer
MoBA: Mixture of Block Attention for Long-Context LLMs
Build your chatbot within minutes on your favorite device
Repo for SeedVR2 & SeedVR
Fast State-of-the-Art Static Embeddings
Plugin for IntelliJ IDEA that gives special support for Minecraft mods
The most powerful local music generation model
ReFT: Representation Finetuning for Language Models
NeurIPS2025 Spotlight] Quantized Attention
Hackable and optimized Transformers building blocks
PyTorch library of curated Transformer models and their components
Diffusion Transformer with Fine-Grained Chinese Understanding
Unified Multimodal Understanding and Generation Models
Ongoing research training transformer models at scale
A CSS parser, transformer, and minifier written in Rust