# Guidance 0.2.3
This release is primarily a performance hotfix, with a few extras snuck in along the way.
## Added
- Added the Llama 3.2 chat template
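
For context, here is a minimal sketch of how a chat template comes into play; the model id `meta-llama/Llama-3.2-1B-Instruct` and the role-block usage are assumptions for illustration, not taken from these notes:

```python
# Illustrative sketch only: the model id and usage pattern are
# assumptions, not something stated in these release notes.
from guidance import models, gen, system, user, assistant

# Loading a Llama 3.2 checkpoint; guidance resolves the matching
# chat template so role blocks render the right special tokens.
lm = models.Transformers("meta-llama/Llama-3.2-1B-Instruct")

with system():
    lm += "You are a concise assistant."
with user():
    lm += "Name the capital of France."
with assistant():
    lm += gen("answer", max_tokens=16)

print(lm["answer"])
```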
## Removed
- Deleted some dead code, in particular `sample_with_temperature` from the Engine classes
## Changed
- Switched the widget's top-k implementation from a full sort to a priority queue, saving a few milliseconds per token when the widget/visualization is enabled (see the sketch below)
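
As an illustration of the technique (a minimal sketch, not the library's actual code), a size-k min-heap finds the k largest logits in O(n log k) rather than the O(n log n) of a full sort:

```python
import heapq

def top_k_heap(logits, k):
    """Return the k largest (logit, token_id) pairs, largest first.

    A size-k min-heap does O(n log k) work instead of the O(n log n)
    of sorting the whole logits vector; for a ~100k-token vocabulary
    and small k this shaves time off every decoded token.
    """
    heap = []  # min-heap; heap[0] is the smallest logit currently kept
    for token_id, logit in enumerate(logits):
        if len(heap) < k:
            heapq.heappush(heap, (logit, token_id))
        elif logit > heap[0][0]:
            # New candidate beats the current k-th best: swap it in.
            heapq.heapreplace(heap, (logit, token_id))
    return sorted(heap, reverse=True)  # order the k survivors for display

print(top_k_heap([0.1, 2.3, -1.0, 5.5, 0.7], k=2))  # [(5.5, 3), (2.3, 1)]
```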
## Fixed
- Fixed the performance regression introduced in [#1261]: the full logits history is no longer cached. As a consequence, fast-forwarded token probabilities are only available in the widget the first time those tokens are added to the KV cache, and will be missing otherwise.
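
To see why caching the full logits history is costly, here is a back-of-the-envelope estimate; the vocabulary size (Llama-3-sized, 128,256 tokens) and float32 logits are illustrative assumptions, not figures from these notes:

```python
# Rough memory cost of caching a full logits history.
# VOCAB_SIZE and dtype are assumptions for illustration only.
VOCAB_SIZE = 128_256   # e.g. the Llama 3 tokenizer vocabulary
BYTES_PER_LOGIT = 4    # float32

per_token_mb = VOCAB_SIZE * BYTES_PER_LOGIT / 1e6
history_gb = VOCAB_SIZE * BYTES_PER_LOGIT * 1000 / 1e9

print(f"per token:            {per_token_mb:.2f} MB")  # ~0.51 MB
print(f"1,000-token history:  {history_gb:.2f} GB")    # ~0.51 GB
```

Under these assumptions, dropping the cache saves roughly half a gigabyte per thousand generated tokens, at the cost of the widget feature described above.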