The C#/.NET binding of llama.cpp. It provides APIs to infer the LLaMa Models and deploy it on the local environment. It works on both Windows, Linux and MAC without the requirement for compiling llama.cpp yourself. Its performance is close to llama.cpp. Furthermore, it provides integrations with other projects such as BotSharp to provide higher-level applications and UI.
Features
- Model Inference and Chat Session
- LLamaSharp provides two ways to run inference: LLamaExecutor and ChatSession
- With LLamaSharp you needn't to compile c++ project and run scripts to quantize the model, instead, just run it in C#
- We provide the integration of ASP.NET core
- Embeddings generation, tokenization and detokenization
- LLaMa model inference
License
MIT LicenseFollow LLamaSharp
Other Useful Business Software
Stay in Flow. Let Zenflow Handle the Heavy Lifting.
Zenflow is your engineering control center, turning specs into shipped features. Parallel agents handle coding, testing, and refactoring with real repo context. Multi-agent workflows remove bottlenecks and automate routine work so developers stay focused and in flow.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of LLamaSharp!