MPT-7B vs. TinyLlama

About MPT-7B

Introducing MPT-7B, the latest entry in our MosaicML Foundation Series. MPT-7B is a transformer trained from scratch on 1T tokens of text and code. It is open source, available for commercial use, and matches the quality of LLaMA-7B. MPT-7B was trained on the MosaicML platform in 9.5 days with zero human intervention at a cost of ~$200k. Now you can train, finetune, and deploy your own private MPT models, either starting from one of our checkpoints or training from scratch. For inspiration, we are also releasing three finetuned models in addition to the base MPT-7B: MPT-7B-Instruct, MPT-7B-Chat, and MPT-7B-StoryWriter-65k+, the last of which uses a context length of 65k tokens!
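
As a minimal sketch of that workflow, the snippet below loads the base model for text generation through Hugging Face transformers. The checkpoint ID mosaicml/mpt-7b and the trust_remote_code flag reflect how MPT is distributed on the Hugging Face Hub; verify the exact names against the model card before relying on them.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed Hub checkpoint; the finetuned variants (mpt-7b-instruct,
# mpt-7b-chat, mpt-7b-storywriter) load the same way.
model_id = "mosaicml/mpt-7b"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # ~13 GB of weights in bf16
    trust_remote_code=True,      # MPT ships custom modeling code on the Hub
)

inputs = tokenizer("MosaicML's MPT-7B is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))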

About TinyLlama

The TinyLlama project aims to pretrain a 1.1B-parameter Llama model on 3 trillion tokens. With some proper optimization, we can achieve this in "just" 90 days using 16 A100-40G GPUs. We adopted exactly the same architecture and tokenizer as Llama 2, so TinyLlama can be dropped into many open-source projects built upon Llama. TinyLlama is also compact, at only 1.1B parameters, allowing it to cater to applications that demand a restricted computation and memory footprint.
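
Because the architecture and tokenizer match Llama 2 exactly, standard Llama tooling works unchanged. The sketch below loads TinyLlama through Hugging Face transformers with no custom code path; the checkpoint name TinyLlama/TinyLlama-1.1B-Chat-v1.0 is an assumption taken from the project's published releases.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed Hub checkpoint; AutoModelForCausalLM resolves to the stock
# LlamaForCausalLM class because TinyLlama reuses the Llama 2 architecture.
model_id = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16)

# 1.1B parameters in fp16 is roughly 2.2 GB of weights, small enough for
# consumer GPUs and memory-constrained deployments.
inputs = tokenizer("TinyLlama is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))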

Platforms Supported (MPT-7B)

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported (TinyLlama)

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience (MPT-7B)

AI and LLM developers and engineers

Audience (TinyLlama)

Developers interested in a small language model

Support (MPT-7B)

Phone Support
24/7 Live Support
Online

Support (TinyLlama)

Phone Support
24/7 Live Support
Online

API (MPT-7B)

Offers API

API (TinyLlama)

Offers API

Pricing (MPT-7B)

Free
Free Version
Free Trial

Pricing (TinyLlama)

Free
Free Version
Free Trial

Reviews/Ratings (MPT-7B)

No reviews yet.

Reviews/Ratings (TinyLlama)

No reviews yet.

Training (MPT-7B)

Documentation
Webinars
Live Online
In Person

Training (TinyLlama)

Documentation
Webinars
Live Online
In Person

Company Information (MPT-7B)

MosaicML
Founded: 2021
United States
www.mosaicml.com/blog/mpt-7b

Company Information (TinyLlama)

TinyLlama
github.com/jzhang38/TinyLlama

Alternatives (MPT-7B)

Alpaca (Stanford Center for Research on Foundation Models)

Alternatives (TinyLlama)

Llama 2 (Meta)
Dolly (Databricks)
Falcon-40B (Technology Innovation Institute)
Qwen2.5-1M (Alibaba)
Llama (Meta)

Integrations (MPT-7B)

Axolotl
MosaicML
RunPod

Integrations (TinyLlama)

Axolotl
MosaicML
RunPod