Kimi K2 Instruct is a high-performance Mixture-of-Experts (MoE) language model developed by Moonshot AI, activating 32B parameters per forward pass out of 1 trillion total. Designed for agentic reasoning, tool use, and advanced coding tasks, it achieves SOTA-level results on benchmarks such as SWE-Bench, AIME, and MMLU. Trained on 15.5T tokens with the Muon optimizer, it incorporates novel techniques for training stability at scale. Kimi K2 supports a 128K context window, enabling detailed multi-turn conversations and long-input handling, and it includes native support for tool-calling, making it well suited for autonomous agents and real-world task execution. The Instruct variant is fine-tuned for chat-style interaction and general-purpose deployment, while the Base variant targets research and customization. Kimi K2 is released under a modified MIT license and can be deployed through engines such as vLLM, SGLang, KTransformers, and TensorRT-LLM.
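Because the model speaks the common OpenAI-compatible chat/tool-calling protocol once served by engines such as vLLM or SGLang, a minimal client-side sketch looks like the following. This is an illustration, not an official example: the endpoint URL, the `api_key` placeholder, the served model name `moonshotai/Kimi-K2-Instruct`, and the `get_weather` tool are all assumptions to adapt to your own deployment.

```python
# Minimal tool-calling sketch against an assumed OpenAI-compatible endpoint
# (e.g. a local vLLM or SGLang server). Adjust base_url and model name to
# match your deployment.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

# Hypothetical tool definition: a simple weather lookup the model may call.
tools = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

response = client.chat.completions.create(
    model="moonshotai/Kimi-K2-Instruct",
    messages=[{"role": "user", "content": "What's the weather in Beijing right now?"}],
    tools=tools,
    tool_choice="auto",
)

message = response.choices[0].message
if message.tool_calls:
    # The model chose to call a tool; each entry carries the function name
    # and JSON-encoded arguments for the client to execute.
    for call in message.tool_calls:
        print(call.function.name, call.function.arguments)
else:
    print(message.content)
```

In an agent loop, the tool result would be appended back to `messages` as a `tool` role message and the request repeated until the model returns a final answer.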
Features
- 1T parameter MoE with 32B active parameters per inference
- 128K context length for long-form tasks and reasoning
- Exceptional agentic performance and tool-calling capabilities
- Top-tier results on SWE-Bench, AIME, MMLU, and coding tasks
- Uses Muon optimizer for stable, scalable training
- Available in Instruct and Base variants
- Released under a modified MIT license
- Supports deployment via vLLM, SGLang, TensorRT-LLM, and more
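As a rough illustration of the deployment path mentioned above, here is a sketch of offline inference with vLLM's Python API. It is not a tested recipe: the model ID, `tensor_parallel_size`, and sampling settings are assumptions, and a 1T-parameter MoE checkpoint requires a multi-GPU node sized to your hardware.

```python
# Sketch of offline inference with vLLM; all sizes below are placeholders.
from vllm import LLM, SamplingParams

llm = LLM(
    model="moonshotai/Kimi-K2-Instruct",
    trust_remote_code=True,      # model repo ships custom code
    tensor_parallel_size=16,     # placeholder; match your GPU count
    max_model_len=131072,        # 128K context window
)

params = SamplingParams(temperature=0.6, max_tokens=512)
messages = [
    {"role": "user", "content": "Write a Python function that merges two sorted lists."}
]

# llm.chat() applies the model's chat template before generation.
outputs = llm.chat(messages, params)
print(outputs[0].outputs[0].text)
```

For production serving, the same engines can instead expose an OpenAI-compatible HTTP endpoint, which is what the tool-calling sketch earlier in this document targets.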