About LongLLaMA

This repository contains the research preview of LongLLaMA, a large language model capable of handling long contexts of 256k tokens or more. LongLLaMA is built on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method; LongLLaMA Code is built on Code Llama. We release a smaller 3B base variant (not instruction-tuned) of the LongLLaMA model under a permissive license (Apache 2.0), along with inference code supporting longer contexts on Hugging Face. Our model weights can serve as a drop-in replacement for LLaMA in existing implementations (for short contexts of up to 2048 tokens). Additionally, we provide evaluation results and comparisons against the original OpenLLaMA models.

About Tülu 3

Tülu 3 is an instruction-following language model developed by the Allen Institute for AI (Ai2), designed to improve capabilities in knowledge, reasoning, mathematics, coding, and safety. Built on Llama 3.1 base models, Tülu 3 uses a four-stage post-training process: prompt curation and synthesis, supervised fine-tuning on a diverse set of prompts and completions, preference tuning using both off-policy and on-policy data, and reinforcement learning with verifiable rewards to strengthen specific skills. The model is released with full transparency, including access to training data, code, and evaluation tools, with the aim of closing the performance gap between open and proprietary fine-tuning methods. Evaluations indicate that Tülu 3 outperforms other open-weight models of similar size, such as Llama 3.1-Instruct and Qwen2.5-Instruct, across various benchmarks.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Users interested in a powerful Large Language Model solution

Audience

Tülu 3 is designed for AI researchers, developers, and organizations seeking a high-performance, open-source language model for advanced reasoning, coding, and instruction-following tasks.

Support

Phone Support
24/7 Live Support
Online

API

Offers API

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet.

Training

Documentation
Webinars
Live Online
In Person

Company Information

LongLLaMA
github.com/CStanKonrad/long_llama

Company Information

Ai2
Founded: 2014
United States
allenai.org/tulu

Alternatives

Llama 2 (Meta)

Alternatives

Molmo (Ai2)
Olmo 3 (Ai2)
Llama 2 (Meta)
Mistral 7B (Mistral AI)
Hermes 3 (Nous Research)
Alpaca (Stanford Center for Research on Foundation Models, CRFM)

Integrations

BuildThatIdea
C
C#
C++
CSS
Clojure
F#
HTML
Java
JavaScript
Julia
Kotlin
Python
R
Ruby
Rust
SQL
Scala
TypeScript
Visual Basic
