+
+

Related Products

  • LM-Kit.NET
    16 Ratings
    Visit Website
  • Vertex AI
    714 Ratings
    Visit Website
  • Amazon Bedrock
    72 Ratings
    Visit Website
  • Google AI Studio
    5 Ratings
    Visit Website
  • Boomi
    839 Ratings
    Visit Website
  • Stack AI
    16 Ratings
    Visit Website
  • Jotform
    6,380 Ratings
    Visit Website
  • LTX Studio
    130 Ratings
    Visit Website
  • Sendbird
    126 Ratings
    Visit Website
  • RunPod
    141 Ratings
    Visit Website

About

​Transformers is a library of pretrained natural language processing, computer vision, audio, and multimodal models for inference and training. Use Transformers to train models on your data, build inference applications, and generate text with large language models. Explore the Hugging Face Hub today to find a model and use Transformers to help you get started right away.​ Simple and optimized inference class for many machine learning tasks like text generation, image segmentation, automatic speech recognition, document question answering, and more. A comprehensive trainer that supports features such as mixed precision, torch.compile, and FlashAttention for training and distributed training for PyTorch models.​ Fast text generation with large language models and vision language models. Every model is implemented from only three main classes (configuration, model, and preprocessor) and can be quickly used for inference or training.

About

VLLM is a high-performance library designed to facilitate efficient inference and serving of Large Language Models (LLMs). Originally developed in the Sky Computing Lab at UC Berkeley, vLLM has evolved into a community-driven project with contributions from both academia and industry. It offers state-of-the-art serving throughput by efficiently managing attention key and value memory through its PagedAttention mechanism. It supports continuous batching of incoming requests and utilizes optimized CUDA kernels, including integration with FlashAttention and FlashInfer, to enhance model execution speed. Additionally, vLLM provides quantization support for GPTQ, AWQ, INT4, INT8, and FP8, as well as speculative decoding capabilities. Users benefit from seamless integration with popular Hugging Face models, support for various decoding algorithms such as parallel sampling and beam search, and compatibility with NVIDIA GPUs, AMD CPUs and GPUs, Intel CPUs, and more.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Machine learning practitioners looking for a tool to train and deploy state-of-the-art models across NLP, vision, and audio tasks

Audience

AI infrastructure engineers looking for a solution to optimize the deployment and serving of large-scale language models in production environments

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

$9 per month
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Hugging Face
Founded: 2016
United States
huggingface.co/docs/transformers/en/index

Company Information

VLLM
United States
docs.vllm.ai/en/latest/

Alternatives

LM-Kit.NET

LM-Kit.NET

LM-Kit

Alternatives

OpenVINO

OpenVINO

Intel
Contextual.ai

Contextual.ai

Contextual AI
Cohere

Cohere

Cohere AI

Categories

Categories

Integrations

Hugging Face
PyTorch
Database Mart
Docker
KServe
Kubernetes
NGINX
NVIDIA DRIVE
OpenAI

Integrations

Hugging Face
PyTorch
Database Mart
Docker
KServe
Kubernetes
NGINX
NVIDIA DRIVE
OpenAI
Claim Hugging Face Transformers and update features and information
Claim Hugging Face Transformers and update features and information
Claim VLLM and update features and information
Claim VLLM and update features and information