Xgen-small

Xgen-small

Salesforce
+
+

Related Products

  • RaimaDB
    5 Ratings
    Visit Website
  • Dragonfly
    16 Ratings
    Visit Website
  • TrustInSoft Analyzer
    6 Ratings
    Visit Website
  • RunPod
    167 Ratings
    Visit Website
  • Qloo
    23 Ratings
    Visit Website
  • Vehicle Acquisition Network (VAN)
    3 Ratings
    Visit Website
  • RetailEdge
    194 Ratings
    Visit Website
  • Buildium
    2,426 Ratings
    Visit Website
  • Adobe PDF Library SDK
    35 Ratings
    Visit Website
  • Ango Hub
    15 Ratings
    Visit Website

About

Phi-4-mini-flash-reasoning is a 3.8 billion‑parameter open model in Microsoft’s Phi family, purpose‑built for edge, mobile, and other resource‑constrained environments where compute, memory, and latency are tightly limited. It introduces the SambaY decoder‑hybrid‑decoder architecture with Gated Memory Units (GMUs) interleaved alongside Mamba state‑space and sliding‑window attention layers, delivering up to 10× higher throughput and a 2–3× reduction in latency compared to its predecessor without sacrificing advanced math and logic reasoning performance. Supporting a 64 K‑token context length and fine‑tuned on high‑quality synthetic data, it excels at long‑context retrieval, reasoning tasks, and real‑time inference, all deployable on a single GPU. Phi-4-mini-flash-reasoning is available today via Azure AI Foundry, NVIDIA API Catalog, and Hugging Face, enabling developers to build fast, scalable, logic‑intensive applications.

About

Xgen-small is an enterprise-ready compact language model developed by Salesforce AI Research, designed to deliver long-context performance at a predictable, low cost. It combines domain-focused data curation, scalable pre-training, length extension, instruction fine-tuning, and reinforcement learning to meet the complex, high-volume inference demands of modern enterprises. Unlike traditional large models, Xgen-small offers efficient processing of extensive contexts, enabling the synthesis of information from internal documentation, code repositories, research reports, and real-time data streams. With sizes optimized at 4B and 9B parameters, it provides a strategic advantage by balancing cost efficiency, privacy safeguards, and long-context understanding, making it a sustainable and predictable solution for deploying Enterprise AI at scale.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

AI professionals and developers searching for a tool to power advanced inference on edge and mobile platforms

Audience

IT leaders and AI practitioners seeking a compact, efficient language model capable of processing long-context information while ensuring cost-effectiveness and data privacy

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Microsoft
Founded: 1975
United States
azure.microsoft.com/en-us/blog/reasoning-reimagined-introducing-phi-4-mini-flash-reasoning/

Company Information

Salesforce
Founded: 1999
United States
www.salesforce.com/blog/xgen-small-enterprise-ready-small-language-models/

Alternatives

Alternatives

Phi-4-reasoning

Phi-4-reasoning

Microsoft
Mistral NeMo

Mistral NeMo

Mistral AI
Ministral 3B

Ministral 3B

Mistral AI
Kimi K2

Kimi K2

Moonshot AI

Categories

Categories

Integrations

Agentforce Vibes
Azure AI Foundry
Azure AI Foundry Agent Service
Hugging Face
Microsoft 365 Copilot
NVIDIA DRIVE

Integrations

Agentforce Vibes
Azure AI Foundry
Azure AI Foundry Agent Service
Hugging Face
Microsoft 365 Copilot
NVIDIA DRIVE
Claim Phi-4-mini-flash-reasoning and update features and information
Claim Phi-4-mini-flash-reasoning and update features and information
Claim Xgen-small and update features and information
Claim Xgen-small and update features and information