Falcon 3: Making Advanced AI Accessible and Available to Everyone, Everywhere Experience unmatched performance and scalability on lightweight devices such as laptop and energy constraint infrastructure
About Falcon 3

Revolutionizing
AI for All

Falcon 3 now offers new multimodal capabilities, processing not only text, but also images, and for the first time in the Falcon series, video and audio. These enhancements open exciting new possibilities for media analysis and interactive user experiences.

Falcon 3 has been meticulously designed to address this gap with multimodal capabilities.

The addition of multimodal capabilities – image, video, and audio – further elevates the Falcon 3 family, pushing the limits of open-source AI with unprecedented performance and usability. As an opensource large language model (LLM), Falcon 3 is designed to democratize advanced AI by combining outstanding performance with the ability to run on lightweight devices, including laptops.

Released under TII’s Falcon License 2.0, Falcon 3 is a pioneering step toward making advanced AI tools available to all.

Performance
Benchmark

Falcon3 - Vision

Falcon3 - Video

Falcon3 - Audio

New multimodal functionalities: Falcon 3 Vision, Video, and Audio

Falcon 3 now offers new multimodal capabilities, processing not only text, but also images, and for the first time in the Falcon series, video and audio. These enhancements open exciting new possibilities for media analysis and interactive user experiences. With image processing, Falcon 3 excels in object recognition, scene description and visual charts interpretation. Falcon 3’s image processing capabilities outperform other open-source models on most standard vision benchmarks, delivering remarkable accuracy. Its video processing capabilities empower users to analyze and extract insights from dynamic content, offering features such as video content summarization and question answering on video streams or video recordings up to one hour long. Audio capabilities enable content analysis and understanding for speech, sound and music and allow speech transcription, summarization and acoustic patterns recognition. All outputs from Falcon 3 Vision, Video, and Audio are provided in text format, ensuring clarity and usability across various scenarios and enabling seamless ecosystem integration and potential model cooperation. The multimodal version of Falcon 3 currently supports English for processing audio, video, and image data. By redefining multimodality, Falcon 3 broadens the horizon of what large language models can achieve, setting a new benchmark for versatility and innovation across industries.
Vision: Falcon 3’s image base models have a vocabulary size of 131K, 32K context, GQA, Llama compatible architecture, fast inference speed (30 tokens/s), low latency (3.3s), and low memory consumption (30.23GB) all with exceptional zero-shot and few-shot performance on the open leaderboards Video: Our video-language based model with visual and language decoder works on 131K vocab size, 32K context, GQA and has Llama compatible architecture. With our state-of-the-art 10B model and competitive 7B model, all working exceptionally when compared with similar-sized open models. Audio: Our Falcon3 Audio models have displayed exceptional performance across speech, music, and mixed audio. The 7B model ranks second overall, outperforming larger models like SALMONN (13B) in key metrics. Even our lightweight models (3B and 1B) deliver competitive results, making Falcon3 Audio an outstanding choice for diverse audio applications, from sound analysis to speech recognition, while maintaining efficiency in the small model category.
Our Ambitions for Falcon 3
Democratized AI Access Falcon 3 by TII offers models that are small, efficient, and capable of running on lightweight infrastructures. It ensures high performance without requiring extensive computational resources.
High Accessibility & Performance Designed for developers, researchers, and businesses, Falcon 3 empowers users to leverage cutting-edge AI tools while maintaining ease of use and accessibility.
Access to State of the Art Multimodal Capabilities in AI Falcon 3 models now feature image, video, and audio analysis and understanding with exceptional performance, offering advanced AI capabilities for the open-source AI community.
Improved Efficiency & Fine-Tuning Falcon 3 builds on the success of Falcon 2, delivering enhanced reasoning, fine-tuning capabilities, and improved efficiency across a wide range of use cases.
Commitment to Innovation Reinforcing Technology Innovation Institutes’s (TII) mission, Falcon 3 fosters inclusive, open-source innovation, providing the global community with state-of-the-art AI models.
Model Architecture
Optimized Decoder-Only Design

Falcon 3’s architecture is based on a decoder-only design using flash attention 2 to grouped query attention. It integrates Grouped Query Attention (GQA) to share parameters, minimizing memory for Key-Value (KV) cache during inference, ensuring faster and more efficient operations.

Advanced Tokenization

With a tokenizer supporting a high vocabulary of 131K tokens—double that of Falcon 2—Falcon 3 offers superior compression and improved downstream performance, enhancing its ability to handle diverse tasks.

Enhanced Long-Context Training

Trained natively with a 32K context size, Falcon 3 demonstrates exceptional long-context capabilities, delivering enhanced performance for extended input data compared to its predecessors.

High-Performing Multimodal Models

Falcon 3 Vision, Video, and Audio all provide modality-to-text capabilities, enabling seamless eco-system integration and potential model cooperation. The multimodal version of Falcon 3 supports English for processing audio, video, and image data seamlessly.

The Falcon 3 series represents a huge leap forward in AI technology. Trained on an impressive 14 Trillions tokens, Falcon 3 more than doubles the capacity of its predecessor, Falcon 180B, ensuring a significant boost in performance and capability. The initial training was followed by multiple stages to improve reasoning and math performance with high-quality data and context extension with natively long context data. Falcon 3 was trained on 4 main languages (English, Spanish, Portuguese and French) to ensure a much higher, earning capability and quality for those languages. The inclusion of multimodal capabilities advances Falcon 3, offering enhanced support to the open-source community.
Our Approach to Responsible AI
Falcon 3 is released under the TII Falcon License. This framework promotes the responsible development and deployment of AI while empowering the global community to innovate freely. By emphasizing ethical AI practices, Falcon 3 balances openness with accountability, ensuring technology is used for the benefit of society.
Building with Falcon 3

Advanced AI for Everyone, Everywhere

Falcon’s quantized versions, such as GGUF, AWQ, and GPTQ (in int4, int8, and 1.58 Bitnet), make it highly efficient, even for resource-constrained environments. Optimized for lightweight systems, Falcon 3 is a game-changer. The latest update to the Falcon family comprises four multimodal models—1B, 3B, 7B, and 10B— now featuring text, image, video, and audio analysis tailored for different needs. Falcon 3 models can be further customized through tools like vLLM, Llama.cpp, and MLX, ensuring seamless adoption for developers.

These innovations reflect our commitment to ensuring AI is accessible and efficient for a wide range of users.

Falcon 3 is versatile, designed for both general-purpose and specialized tasks, providing immense flexibility to users. 

Its Base model is perfect for generative applications, while the Instruct model excels in conversational tasks like customer service or virtual assistants.

Falcon 3 is straight forward to implement, whether you’re a startup seeking to enhance user experience or a researcher exploring innovative AI applications. For organizations and individuals with limited computational resources, Falcon 3’s quantized versions offer rapid deployment and optimized efficiency without compromising performance.

Try Falcon 3

By redefining multimodality, Falcon 3 broadens the horizon of what large language models can achieve,
setting a new benchmark for versatility and innovation across industries.