LLamaSharp is a C#/.NET binding of llama.cpp. It provides APIs for running inference on LLaMa models and deploying them in a local environment. It works on Windows, Linux, and macOS without requiring you to compile llama.cpp yourself, and its performance is close to that of llama.cpp. It also integrates with other projects, such as BotSharp, to support higher-level applications and UI.

Features

  • Model Inference and Chat Session
  • LLamaSharp provides two ways to run inference: LLamaExecutor and ChatSession
  • With LLamaSharp, you don't need to compile a C++ project or run scripts to quantize the model; you can do it directly from C#
  • Integration with ASP.NET Core
  • Embeddings generation, tokenization and detokenization
  • LLaMa model inference

License

MIT License

Additional Project Details

Programming Language

C#

Related Categories

C# Large Language Models (LLM), C# AI Models, C# LLM Inference Tool

Registered

2023-08-25