KServe provides a Kubernetes Custom Resource Definition for serving machine learning (ML) models on arbitrary frameworks. It aims to solve production model serving use cases by providing performant, high abstraction interfaces for common ML frameworks like Tensorflow, XGBoost, ScikitLearn, PyTorch, and ONNX. It encapsulates the complexity of autoscaling, networking, health checking, and server configuration to bring cutting edge serving features like GPU Autoscaling, Scale to Zero, and Canary Rollouts to your ML deployments. It enables a simple, pluggable, and complete story for Production ML Serving including prediction, pre-processing, post-processing and explainability. KServe is being used across various organizations.

Features

  • KServe is a standard, cloud agnostic Model Inference Platform on Kubernetes, built for highly scalable use cases
  • Provides performant, standardized inference protocol across ML frameworks
  • Support modern serverless inference workload with request based autoscaling including scale-to-zero on CPU and GPU
  • Provides high scalability, density packing and intelligent routing using ModelMesh
  • Simple and pluggable production serving for inference, pre/post processing, monitoring and explainability
  • Advanced deployments for canary rollout, pipeline, ensembles with InferenceGraph

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow KServe

KServe Web Site

Other Useful Business Software
Build on Google Cloud with $300 in Free Credit Icon
Build on Google Cloud with $300 in Free Credit

New to Google Cloud? Get $300 in free credit to explore Compute Engine, BigQuery, Cloud Run, Vertex AI, and 150+ other products.

Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query exabytes in BigQuery, or build AI apps with Vertex AI and Gemini. Once your credits are used, keep building with 20+ products with free monthly usage, including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. Sign up to start building right away.
Start Free Trial
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of KServe!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Container Management Software, Python LLM Inference Tool

Registered

2024-03-08