Llama 2Meta
|
Solar MiniUpstage AI
|
|||||
Related Products
|
||||||
About
The next generation of our open source large language model. This release includes model weights and starting code for pretrained and fine-tuned Llama language models — ranging from 7B to 70B parameters.
Llama 2 pretrained models are trained on 2 trillion tokens, and have double the context length than Llama 1. Its fine-tuned models have been trained on over 1 million human annotations.
Llama 2 outperforms other open source language models on many external benchmarks, including reasoning, coding, proficiency, and knowledge tests.
Llama 2 was pretrained on publicly available online data sources. The fine-tuned model, Llama-2-chat, leverages publicly available instruction datasets and over 1 million human annotations.
We have a broad range of supporters around the world who believe in our open approach to today’s AI — companies that have given early feedback and are excited to build with Llama 2.
|
About
Solar Mini is a pre‑trained large language model that delivers GPT‑3.5‑comparable responses with 2.5× faster inference while staying under 30 billion parameters. It achieved first place on the Hugging Face Open LLM Leaderboard in December 2023 by combining a 32‑layer Llama 2 architecture, initialized with high‑quality Mistral 7B weights, with an innovative “depth up‑scaling” (DUS) approach that deepens the model efficiently without adding complex modules. After DUS, continued pretraining restores and enhances performance, and instruction tuning in a QA format, especially for Korean, refines its ability to follow user prompts, while alignment tuning ensures its outputs meet human or advanced AI preferences. Solar Mini outperforms competitors such as Llama 2, Mistral 7B, Ko‑Alpaca, and KULLM across a variety of benchmarks, proving that compact size need not sacrifice capability.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Developers interested in a powerful free and open source large language model
|
Audience
Developers and AI teams in need of a solution with robust instruction following and RAG‑enabled document processing for real‑time applications
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
Free
Free Version
Free Trial
|
Pricing
$0.1 per 1M tokens
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationMeta
Founded: 2004
United States
ai.meta.com/llama/
|
Company InformationUpstage AI
Founded: 2020
United States
www.upstage.ai/blog/en/introducing-solar-mini-compact-yet-powerful
|
|||||
Alternatives |
Alternatives |
|||||
|
|
|||||
|
|
|||||
|
|
|||||
|
|
|||||
Categories |
Categories |
|||||
Integrations
AiAssistWorks
Automi
BlueFlame AI
Browser Use
DuckDuckGoose AI Text Detection
Firecrawl
Graydient AI
Groq
Lakera
Odyssey
|
Integrations
AiAssistWorks
Automi
BlueFlame AI
Browser Use
DuckDuckGoose AI Text Detection
Firecrawl
Graydient AI
Groq
Lakera
Odyssey
|
|||||
|
|