Ship AI Agents to Google Cloud in minutes, not months
Allow LLMs to control a browser with Browserbase and Stagehand
CLI tool for multi-agent workflows and automated code generation
An MCP server for interacting with Google Colab
Framework to prove inference of ML models blazingly fast
A TTS model capable of generating ultra-realistic dialogue
AI Agent Builder and Runtime by Docker Engineering
An orchestration framework for agentic AI and LLM applications
C++-based high-performance parallel environment execution engine
Transformer related optimization, including BERT, GPT
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
This repository contains code released by Google Research
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Access large language models from the command-line
Local long-term memory engine for AI apps with persistent storage
Qwen3-ASR is an open-source series of ASR models
Provides code for running inference with the SegmentAnything Model
All-in-one AI productivity platform with agents, workflows, and IM
A lightweight, lightning-fast, in-process vector database
Open platform for sharing and discovering Stable Diffusion models
Chinese-language edition of Dive into Deep Learning
Fast ML inference & training for ONNX models in Rust
Python-free Rust inference server
Open-source, high-performance Mixture-of-Experts large language model