Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Open-Source Financial Large Language Models!
Qwen (通义千问) chat/pretrained large language model Alibaba Cloud
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Open-source, high-performance Mixture-of-Experts large language model
Janus-Series: Unified Multimodal Understanding and Generation Models
Open Multilingual Multimodal Chat LMs
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Blazeface is a lightweight model that detects faces in images
A CNN model that predicts human joints from RGB images of a person
A Conversational Speech Generation Model
An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM
Detect faces in an image
Encoder of greater-than-word length text trained on a variety of data
Text-to-image diffusion model for high-quality image generation
Custom BLEURT model for evaluating text similarity using PyTorch
ClinicalBERT model trained on MIMIC notes for clinical NLP tasks
Inference framework for 1-bit LLMs
CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B
Extension for Stable Diffusion using edge, depth, pose, and more
ControlNet-1 enables precise image generation via input conditioning
State-of-the-art RL-trained coding agent for complex SWE tasks
DeepSeek-R1-0528 is a powerful reasoning-focused LLM with 64K context
Advanced multilingual LLM with enhanced reasoning and code generation
Agentic 24B LLM optimized for coding tasks with 128k context support