Falcon-40B is a 40-billion-parameter, causal decoder-only language model developed by the Technology Innovation Institute (TII) and trained on 1 trillion tokens from the RefinedWeb dataset enhanced with curated corpora. Designed for inference efficiency, its architecture incorporates FlashAttention and multiquery attention. At its release, Falcon-40B outperformed open models such as LLaMA and MPT on the Hugging Face Open LLM Leaderboard, making it one of the strongest publicly available LLMs of its size. It supports English, German, Spanish, and French, with limited capabilities in several other European languages. As a raw pretrained model, it is best used after fine-tuning for specific applications such as summarization, chatbots, or content generation. It is released under the permissive Apache 2.0 license, which allows commercial use. The model requires significant hardware (roughly 85–100 GB of GPU memory for inference) but offers state-of-the-art performance for large-scale NLP research and development.
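For reference, a minimal generation sketch in the style of the Hugging Face model card is shown below. The prompt and sampling parameters are illustrative assumptions, and older transformers releases that predate native Falcon support may additionally need `trust_remote_code=True`.

```python
# Minimal inference sketch for Falcon-40B via Hugging Face transformers.
# The prompt and sampling parameters here are illustrative, not prescriptive.
import torch
import transformers
from transformers import AutoTokenizer

model_name = "tiiuae/falcon-40b"
tokenizer = AutoTokenizer.from_pretrained(model_name)

pipeline = transformers.pipeline(
    "text-generation",
    model=model_name,
    tokenizer=tokenizer,
    torch_dtype=torch.bfloat16,  # recommended precision for Falcon
    device_map="auto",           # shard the model across available GPUs
)

sequences = pipeline(
    "Write a short summary of the Falcon family of language models.",
    max_new_tokens=200,
    do_sample=True,
    top_k=10,
    eos_token_id=tokenizer.eos_token_id,
)
for seq in sequences:
    print(seq["generated_text"])
```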
Features
- 40B parameter decoder-only transformer architecture
- Trained on 1T tokens from high-quality web and curated datasets
- FlashAttention and multiquery attention for optimized inference
- Apache 2.0 license permits royalty-free commercial use
- Supports multiple European languages with strong English performance
- Compatible with Hugging Face transformers and text-generation-inference
- Requires PyTorch 2.0+ for use with transformers; bfloat16 is the recommended precision
- Can be fine-tuned for chatbots, summarization, and other NLP tasks (see the LoRA sketch below)
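Because full fine-tuning at 40B scale is expensive, a parameter-efficient method such as LoRA is a common starting point. The sketch below uses the peft library; the rank, dropout, and target-module choices are illustrative assumptions, not TII's training recipe.

```python
# Hedged sketch: attaching LoRA adapters to Falcon-40B with peft.
# Hyperparameter values below are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained(
    "tiiuae/falcon-40b",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("tiiuae/falcon-40b")

lora_config = LoraConfig(
    r=16,                                # adapter rank (assumption)
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["query_key_value"],  # Falcon's fused attention projection
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, lora_config)
model.print_trainable_parameters()  # only the small adapter weights train

# From here, the wrapped model can be trained with the standard
# transformers Trainer or TRL's SFTTrainer on a task-specific dataset.
```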