DBRX

DBRX

Databricks
OLMo 2

OLMo 2

Ai2
+
+

Related Products

  • LM-Kit.NET
    17 Ratings
    Visit Website
  • Vertex AI
    726 Ratings
    Visit Website
  • Google AI Studio
    9 Ratings
    Visit Website
  • Seedance
    6 Ratings
    Visit Website
  • OpenVPN
    198,256 Ratings
    Visit Website
  • KrakenD
    71 Ratings
    Visit Website
  • PackageX OCR Scanning
    46 Ratings
    Visit Website
  • RunPod
    152 Ratings
    Visit Website
  • CLEAR
    1 Rating
    Visit Website
  • Stigg
    25 Ratings
    Visit Website

About

Today, we are excited to introduce DBRX, an open, general-purpose LLM created by Databricks. Across a range of standard benchmarks, DBRX sets a new state-of-the-art for established open LLMs. Moreover, it provides the open community and enterprises building their own LLMs with capabilities that were previously limited to closed model APIs; according to our measurements, it surpasses GPT-3.5, and it is competitive with Gemini 1.0 Pro. It is an especially capable code model, surpassing specialized models like CodeLLaMA-70B in programming, in addition to its strength as a general-purpose LLM. This state-of-the-art quality comes with marked improvements in training and inference performance. DBRX advances the state-of-the-art in efficiency among open models thanks to its fine-grained mixture-of-experts (MoE) architecture. Inference is up to 2x faster than LLaMA2-70B, and DBRX is about 40% of the size of Grok-1 in terms of both total and active parameter counts.

About

OLMo 2 is a family of fully open language models developed by the Allen Institute for AI (AI2), designed to provide researchers and developers with transparent access to training data, open-source code, reproducible training recipes, and comprehensive evaluations. These models are trained on up to 5 trillion tokens and are competitive with leading open-weight models like Llama 3.1 on English academic benchmarks. OLMo 2 emphasizes training stability, implementing techniques to prevent loss spikes during long training runs, and utilizes staged training interventions during late pretraining to address capability deficiencies. The models incorporate state-of-the-art post-training methodologies from AI2's Tülu 3, resulting in the creation of OLMo 2-Instruct models. An actionable evaluation framework, the Open Language Modeling Evaluation System (OLMES), was established to guide improvements through development stages, consisting of 20 evaluation benchmarks assessing core capabilities.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Organizations looking for an advanced Large Language Model solution

Audience

Developers and researchers searching for a tool to streamline their AI research and operations

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Databricks
United States
www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm

Company Information

Ai2
Founded: 2014
United States
allenai.org/blog/olmo2

Alternatives

FLIP

FLIP

Kanerika

Alternatives

Molmo

Molmo

Ai2
DeepSeek-V2

DeepSeek-V2

DeepSeek
Ai2 OLMoE

Ai2 OLMoE

The Allen Institute for Artificial Intelligence
Llama 2

Llama 2

Meta
Qwen2

Qwen2

Alibaba
Baichuan-13B

Baichuan-13B

Baichuan Intelligent Technology
Gemma

Gemma

Google

Categories

Categories

Integrations

Double
GPT-3.5
GPT-4
Rayven
ZenML

Integrations

Double
GPT-3.5
GPT-4
Rayven
ZenML
Claim DBRX and update features and information
Claim DBRX and update features and information
Claim OLMo 2 and update features and information
Claim OLMo 2 and update features and information