Port of Facebook's LLaMA model in C/C++
Python bindings for llama.cpp
Run Local LLMs on Any Device. Open-source
Distribute and run LLMs with a single file
A gradio web UI for running Large Language Models like LLaMA
The simplest way to run Alpaca on your own computer
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Maid is a cross-platform Flutter app for interfacing with GGUF
React and Electron-based app that executes the FreedomGPT LLM locally
DevoxxGenie is a plugin for IntelliJ IDEA that uses local LLM's
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
An easy-to-understand framework for LLM samplers
Open source large-language-model based code completion engine
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
llama.go is like llama.cpp in pure Golang
Chat with your favourite LLaMA models in a native macOS app
Locally run an Instruction-Tuned Chat-Style LLM
Run GGUF models easily with a UI or API. One File. Zero Install.
Powerful large language model (LLM) from Alibaba Cloud
Agentic 24B LLM optimized for coding tasks with 128k context support
Multilingual 3B LLM optimized for reasoning, math, and long contexts
Llama-3.3-70B-Instruct is a multilingual AI optimized for helpful chat