Qwen is a series of large language models developed by Alibaba Cloud, consisting of various pretrained versions like Qwen-1.8B, Qwen-7B, Qwen-14B, and Qwen-72B. These models, which range from smaller to larger configurations, are designed for a wide range of natural language processing tasks. They are openly available for research and commercial use, with Qwen's code and model weights shared on GitHub. Qwen's capabilities include text generation, comprehension, and conversation, making it a versatile tool for developers looking to integrate advanced AI functionalities into their applications.
Features
- Family of models of different sizes: Qwen-1.8B, 7B, 14B, 72B for both base and chat versions
- Quantization support: Int8, Int4 quantized versions to reduce memory/inference cost with little performance loss
- Long context support: large context windows (e.g. up to 32K tokens for some variants)
- Finetuning capabilities: full-parameter finetuning, LoRA, Q-LoRA, including distributed / multi-GPU setups
- Inference deployment via Transformers, ModelScope, Docker; also examples, demos, WebUI, CLI usage
- Strong benchmark performance across many tasks: multilingual understanding, math, code, reasoning, etc., often matching or exceeding other large models at similar sizes
License
Apache License V2.0Follow Qwen
Other Useful Business Software
Our Free Plans just got better! | Auth0
You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
Rate This Project
Login To Rate This Project
User Reviews
-
Really great open source models