Sky-T1

Sky-T1

NovaSky
Smaug-72B

Smaug-72B

Abacus
+
+

Related Products

  • Vertex AI
    944 Ratings
    Visit Website
  • Google AI Studio
    11 Ratings
    Visit Website
  • LM-Kit.NET
    25 Ratings
    Visit Website
  • imgproxy
    15 Ratings
    Visit Website
  • Source Defense
    7 Ratings
    Visit Website
  • Teradata VantageCloud
    1,105 Ratings
    Visit Website
  • TrustInSoft Analyzer
    6 Ratings
    Visit Website
  • Windsurf Editor
    161 Ratings
    Visit Website
  • Reflectiz
    18 Ratings
    Visit Website
  • wp2print
    23 Ratings
    Visit Website

About

Sky-T1-32B-Preview is an open source reasoning model developed by the NovaSky team at UC Berkeley's Sky Computing Lab. It matches the performance of proprietary models like o1-preview on reasoning and coding benchmarks, yet was trained for under $450, showcasing the feasibility of cost-effective, high-level reasoning capabilities. The model was fine-tuned from Qwen2.5-32B-Instruct using a curated dataset of 17,000 examples across diverse domains, including math and coding. The training was completed in 19 hours on eight H100 GPUs with DeepSpeed Zero-3 offloading. All aspects of the project, including data, code, and model weights, are fully open-source, empowering the academic and open-source communities to replicate and enhance the model's performance.

About

Smaug-72B is a powerful open-source large language model (LLM) known for several key features: High Performance: It currently holds the top spot on the Hugging Face Open LLM leaderboard, surpassing models like GPT-3.5 in various benchmarks. This means it excels at tasks like understanding, responding to, and generating human-like text. Open Source: Unlike many other advanced LLMs, Smaug-72B is freely available for anyone to use and modify, fostering collaboration and innovation in the AI community. Focus on Reasoning and Math: It specifically shines in handling reasoning and mathematical tasks, attributing this strength to unique fine-tuning techniques developed by Abacus AI, the creators of Smaug-72B. Based on Qwen-72B: It's technically a fine-tuned version of another powerful LLM called Qwen-72B, released by Alibaba, further improving upon its capabilities. Overall, Smaug-72B represents a significant step forward in open-source AI.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Users that want to train their own powerful AI Model like o1

Audience

AI developers interested in a powerful large language model

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

No images available

Pricing

Free
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

NovaSky
United States
novasky-ai.github.io/posts/sky-t1/

Company Information

Abacus
Founded: 2019
United States
huggingface.co/abacusai/Smaug-72B-v0.1

Alternatives

Alternatives

Qwen

Qwen

Alibaba
DeepSeek R1

DeepSeek R1

DeepSeek
Sky-T1

Sky-T1

NovaSky
Qwen2

Qwen2

Alibaba
Qwen2

Qwen2

Alibaba
GLM-5

GLM-5

Zhipu AI
DeepScaleR

DeepScaleR

Agentica Project

Categories

Categories

Integrations

ChatLLM

Integrations

ChatLLM
Claim Sky-T1 and update features and information
Claim Sky-T1 and update features and information
Claim Smaug-72B and update features and information
Claim Smaug-72B and update features and information