Chinchilla

Chinchilla

Google DeepMind
+
+

Related Products

  • LM-Kit.NET
    16 Ratings
    Visit Website
  • Vertex AI
    726 Ratings
    Visit Website
  • Google AI Studio
    9 Ratings
    Visit Website
  • Stack AI
    18 Ratings
    Visit Website
  • CredentialStream
    161 Ratings
    Visit Website
  • PESTBOSS
    2 Ratings
    Visit Website
  • Pipedrive
    8,713 Ratings
    Visit Website
  • CLEAR
    1 Rating
    Visit Website
  • Amp
    86 Ratings
    Visit Website
  • Quaeris
    6 Ratings
    Visit Website

About

Chinchilla is a large language model. Chinchilla uses the same compute budget as Gopher but with 70B parameters and 4× more more data. Chinchilla uniformly and significantly outperforms Gopher (280B), GPT-3 (175B), Jurassic-1 (178B), and Megatron-Turing NLG (530B) on a large range of downstream evaluation tasks. This also means that Chinchilla uses substantially less compute for fine-tuning and inference, greatly facilitating downstream usage. As a highlight, Chinchilla reaches a state-of-the-art average accuracy of 67.5% on the MMLU benchmark, greater than a 7% improvement over Gopher.

About

This repository contains the research preview of LongLLaMA, a large language model capable of handling long contexts of 256k tokens or even more. LongLLaMA is built upon the foundation of OpenLLaMA and fine-tuned using the Focused Transformer (FoT) method. LongLLaMA code is built upon the foundation of Code Llama. We release a smaller 3B base variant (not instruction tuned) of the LongLLaMA model on a permissive license (Apache 2.0) and inference code supporting longer contexts on hugging face. Our model weights can serve as the drop-in replacement of LLaMA in existing implementations (for short context up to 2048 tokens). Additionally, we provide evaluation results and comparisons against the original OpenLLaMA models.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Developers interested in a powerful large language model (LLM)

Audience

Users interested in a powerful Large Language Model solution

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

No images available

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Google DeepMind
United States
arxiv.org/abs/2203.15556

Company Information

LongLLaMA
github.com/CStanKonrad/long_llama

Alternatives

Qwen2.5-Max

Qwen2.5-Max

Alibaba

Alternatives

Llama 2

Llama 2

Meta
Mistral 7B

Mistral 7B

Mistral AI
Mistral NeMo

Mistral NeMo

Mistral AI
Llama 2

Llama 2

Meta
Phi-2

Phi-2

Microsoft

Categories

Categories

Integrations

MusicFX
Stitch
WeatherNext

Integrations

MusicFX
Stitch
WeatherNext
Claim Chinchilla and update features and information
Claim Chinchilla and update features and information
Claim LongLLaMA and update features and information
Claim LongLLaMA and update features and information