Port of Facebook's LLaMA model in C/C++
Python bindings for llama.cpp
C#/.NET binding of llama.cpp, including LLaMA/GPT model inference
Maid is a cross-platform Flutter app for interfacing with GGUF
Run Local LLMs on Any Device. Open-source
React and Electron-based app that executes the FreedomGPT LLM locally
Interface for OuteTTS models
The simplest way to run Alpaca on your own computer
Inference Llama 2 in one file of pure C
Distribute and run LLMs with a single file
Amica is an open source interface for interactive communication
Qwen3 is the large language model series developed by Qwen team
DevoxxGenie is a plugin for IntelliJ IDEA that uses local LLMs
An easy-to-understand framework for LLM samplers
A Gradio web UI for running Large Language Models like LLaMA
Towards Human-Sounding Speech
GLM-4 series: Open Multilingual Multimodal Chat LMs
Run any Llama 2 locally with Gradio UI on GPU or CPU from anywhere
Run GGUF models easily with a UI or API. One File. Zero Install.
Open source large-language-model based code completion engine
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Chat with your favourite LLaMA models in a native macOS app
llama.go is like llama.cpp in pure Golang
Locally run an Instruction-Tuned Chat-Style LLM