LLamaSharp

The C#/.NET binding of llama.cpp. It provides APIs to infer the LLaMa Models and deploy it on the local environment. It works on both Windows, Linux and MAC without the requirement for compiling llama.cpp yourself. Its performance is close to llama.cpp. Furthermore, it provides integrations with other projects such as BotSharp to provide higher-level applications and UI.

Features

Model Inference and Chat Session
LLamaSharp provides two ways to run inference: LLamaExecutor and ChatSession
With LLamaSharp you needn't to compile c++ project and run scripts to quantize the model, instead, just run it in C#
We provide the integration of ASP.NET core
Embeddings generation, tokenization and detokenization
LLaMa model inference

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow LLamaSharp

LLamaSharp Web Site

Other Useful Business Software

Easily Host LLMs and Web Apps on Cloud Run

Run everything from popular models with on-demand NVIDIA L4 GPUs to web apps without infrastructure management.

Run frontend and backend services, batch jobs, host LLMs, and queue processing workloads without the need to manage infrastructure. Cloud Run gives you on-demand GPU access for hosting LLMs and running real-time AI—with 5-second cold starts and automatic scale-to-zero so you only pay for actual usage. New customers get $300 in free credit to start.

Try Cloud Run Free

Rate This Project

User Reviews

Be the first to post a review of LLamaSharp!

Additional Project Details

Programming Language

Related Categories

C# Large Language Models (LLM), C# AI Models, C# LLM Inference Tool

Registered

2023-08-25

Similar Business Software

LM-Kit.NET

LM-Kit.NET is a cutting-edge, high-level inference SDK designed specifically to bring the advanced capabilities of Large Language Models (LLM) into the C# ecosystem. Tailored for developers working within .NET, LM-Kit.NET provides a comprehensive suite of powerful Generative AI tools, making...

See Software
Vertex AI

Build, deploy, and scale machine learning (ML) models faster, with fully managed ML tools for any use case. Through Vertex AI Workbench, Vertex AI is natively integrated with BigQuery, Dataproc, and Spark. You can use BigQuery ML to create and execute machine learning models in BigQuery...

See Software
Google AI Studio

Google AI Studio is a unified development platform that helps teams explore, build, and deploy applications using Google’s most advanced AI models, including Gemini 3. It brings text, image, audio, and video models together in one interactive playground. With vibe coding, developers can use...

See Software
RunPod

RunPod offers a cloud-based platform designed for running AI workloads, focusing on providing scalable, on-demand GPU resources to accelerate machine learning (ML) model training and inference. With its diverse selection of powerful GPUs like the NVIDIA A100, RTX 3090, and H100, RunPod supports...

See Software
EXAONE Deep

EXAONE Deep is a series of reasoning-enhanced language models developed by LG AI Research, featuring parameter sizes of 2.4 billion, 7.8 billion, and 32 billion. These models demonstrate superior capabilities in various reasoning tasks, including math and coding benchmarks. Notably, EXAONE Deep...

See Software
LFM2.5

Liquid AI’s LFM2.5 is the next generation of on-device AI foundation models designed to deliver high-performance, efficient AI inference on edge devices such as phones, laptops, vehicles, IoT systems, and embedded hardware without relying on cloud compute. It extends the previous LFM2...

See Software

Report inappropriate content

LLamaSharp

C#/.NET binding of llama.cpp, including LLaMa/GPT model inference

Get an email when there's a new version of LLamaSharp

Features

Project Samples

Project Activity

Categories

License

Follow LLamaSharp

User Reviews

Additional Project Details

Programming Language

Related Categories

Registered