orsonpdf-1.6-eval free download

Showing 395 open source projects for "orsonpdf-1.6-eval"

View related business solutions

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

Prometheus-Eval

Evaluate your LLM's response with Prometheus and GPT4

Prometheus-Eval is an open-source framework designed to evaluate the outputs of large language models using specialized evaluator models known as Prometheus. The project provides tools, datasets, and scripts that allow developers and researchers to measure the quality of LLM responses through automated scoring rather than relying solely on human evaluators.

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
2

The LLM Evaluation guidebook

Sharing both practical insights and theoretical knowledge about LLM

The Evaluation Guidebook is an open educational resource created by Hugging Face that explains how to evaluate machine learning and large language models effectively. It compiles practical insights and theoretical knowledge gathered from real-world evaluation work, including experience managing the Open LLM Leaderboard and designing evaluation tools. The guidebook teaches developers how to design evaluation pipelines, select appropriate metrics, and interpret model performance results. It...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
3

web-eval-agent MCP Server

An MCP server that autonomously evaluates web applications

web-eval-agent is a Model Context Protocol (MCP) server that spins up a browser-use–capable debugging agent to autonomously run and evaluate web apps straight from your editor. It’s positioned as a “let the coding agent debug itself” companion: the agent launches the app, navigates flows, captures evidence, and iterates on failures without manual copy-pasting of logs.

Downloads: 0 This Week

Last Update: 2025-11-22
See Project
4

InsightFace

State-of-the-art 2D and 3D Face Analysis Project

State-of-the-art deep face analysis library. InsightFace is an open-source 2D&3D deep face analysis library. InsightFace is an integrated Python library for 2D&3D face analysis. InsightFace efficiently implements a wide variety of state-of-the-art algorithms for face recognition, face detection, and face alignment, which are optimized for both training and deployment. Research institutes and industrial organizations can get benefits from InsightFace library.

Downloads: 406 This Week

Last Update: 2026-03-12
See Project
Fully Managed MySQL, PostgreSQL, and SQL Server
Automatic backups, patching, replication, and failover. Focus on your app, not your database.

Cloud SQL handles your database ops end to end, so you can focus on your app.

Try Free
5

RuntimeGeneratedFunctions.jl

Functions generated at runtime without world-age issues or overhead

RuntimeGeneratedFunctions are functions generated at runtime without world-age issues and with the full performance of a standard Julia anonymous function. This builds functions in a way that avoids eval. For technical reasons, RuntimeGeneratedFunctions needs to cache the function expression in a global variable within some module. This is normally transparent to the user, but if the RuntimeGeneratedFunction is evaluated during module precompilation, the cache module must be explicitly set to the module currently being precompiled. ...

Downloads: 0 This Week

Last Update: 2026-02-03
See Project
6

HumanEval

Code for the paper "Evaluating Large Language Models Trained on Code"

...Researchers can use the dataset to run reproducible comparisons across models and track improvements in functional code synthesis. By focusing on correctness through execution, human-eval provides a rigorous and practical way to evaluate programming capabilities in AI systems.

Downloads: 3 This Week

Last Update: 6 days ago
See Project
7

Oscar.jl

A comprehensive open source computer algebra system for computations

Welcome to the OSCAR project, a visionary new computer algebra system that combines the capabilities of four cornerstone systems: GAP, Polymake, Antic and Singular. OSCAR requires Julia 1.6 or newer. In principle it can be installed and used like any other Julia package; doing so will take a couple of minutes. A comprehensive open source computer algebra system for computations in algebra, geometry, and number theory.

Downloads: 2 This Week

Last Update: 2026-03-26
See Project
8

Yaegi

Yaegi is Another Elegant Go Interpreter

...Note that you can use rlwrap (install with your favorite package manager), and alias the yaegi command in alias yaegi='rlwrap yaegi' in your ~/.bashrc, to have history and command line edition. Complete support of Go specification. Written in pure Go, using only the standard library. Simple interpreter API: New(), Eval(), Use(). Works everywhere Go works.

Downloads: 0 This Week

Last Update: 2024-04-03
See Project
9

Tencent-Hunyuan-Large

Open-source large language model family from Tencent Hunyuan

...It aims to provide competitive capability with efficient deployment and inference. FP8 quantization support to reduce memory usage (~50%) while maintaining precision. High benchmarking performance on tasks like MMLU, MATH, CMMLU, C-Eval, etc.

Downloads: 2 This Week

Last Update: 2025-09-24
See Project
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
10

openbench

Provider-agnostic, open-source evaluation infrastructure

...It bundles dozens of evaluation suites — covering knowledge, reasoning, math, code, science, reading comprehension, long-context recall, graph reasoning, and more — so users don’t need to assemble disparate datasets themselves. With a simple CLI interface (e.g. bench eval <benchmark> --model <model-id>), you can quickly evaluate any model supported by Groq or other providers (OpenAI, Anthropic, HuggingFace, local models, etc.). openbench also supports private/local evaluations: you can integrate your own custom benchmarks or data (e.g. internal test suites, domain-specific tasks) to evaluate models in a privacy-preserving way.

Downloads: 0 This Week

Last Update: 2025-12-09
See Project
11

nixd

Nix language server, based on nix libraries

This is a feature-rich nix language server interoperating with C++ nix.

Downloads: 0 This Week

Last Update: 2026-02-07
See Project
12

jlrs

Julia bindings for Rust

jlrs is a crate that provides access to most of the Julia C API, it can be used to embed Julia in Rust applications and to use functionality it provides when writing ccallable functions in Rust. Currently, this crate is only tested in combination with Julia 1.6 and 1.9, but also supports Julia 1.7, 1.8, and 1.10. Using the current stable version is highly recommended. The minimum supported Rust version is currently 1.65. Julia must be installed before jlrs can be used, jlrs is compatible with Julia 1.6 up to and including Julia 1.10. The JlrsCore package must also have been installed, if this is not the case it will automatically be added when jlrs is initialized by default. jlrs has not been tested with juliaup yet on Linux and macOS.

Downloads: 0 This Week

Last Update: 2025-10-10
See Project
13

OhMyREPL.jl

Syntax highlighting and other enhancements for the Julia REPL

OhMyREPL.jl is a Julia package that enhances the Julia REPL (Read-Eval-Print Loop) experience with syntax highlighting, bracket matching, prompt customization, and automatic indentation. It is designed to make the command-line interface more visually appealing and user-friendly, especially during interactive development and debugging. It runs entirely in the terminal and does not require external dependencies or GUI.

Downloads: 0 This Week

Last Update: 2025-10-16
See Project
14

PyTorch Image Models

The largest collection of PyTorch image encoders / backbones

timm (PyTorch Image Models) is a premier library hosting a vast collection of state-of-the-art image classification models and backbones such as ResNet, EfficientNet, NFNet, Vision Transformer, ConvNeXt, and more. Created by Ross Wightman and now maintained by Hugging Face, it includes pretrained weights, data loaders, augmentations, optimizers, schedulers, and reference scripts for training, evaluation, inference, and model export. It's an essential toolkit for vision research and...

Downloads: 2 This Week

Last Update: 2026-03-23
See Project
15

burg-eval-plus

Downloads: 0 This Week

Last Update: 2024-05-31
See Project
16

Dialyxir

Mix tasks to simplify use of Dialyzer in Elixir projects

Mix tasks to simplify use of Dialyzer in Elixir projects. Elixir 1.6 is required, to support the new pretty printing feature. If your project is not yet on 1.6, continue to specify 0.5 in your mix deps. Warning messages have been greatly improved, but are filtered through the legacy formatter to support your existing ignore files. You can optionally use the new Elixir term format for ignore files.

Downloads: 0 This Week

Last Update: 2025-11-06
See Project
17

SCI

Configurable Clojure/Script interpreter suitable for scripting

SCI is a lightweight Clojure interpreter designed for embedding and scripting in JVM or native apps (e.g., Babashka). It supports stateful contexts, multiple evaluations, and a configurable environment without full JVM startup overhead. Packed with optional extensions (like reagent, js-interop), it enables REPL-like interactivity in minimal environments.

Downloads: 0 This Week

Last Update: 2026-02-07
See Project
18

DeepEval

DeepEval is a simple-to-use, open-source LLM evaluation framework, for evaluating and testing large-language model systems. It is similar to Pytest but specialized for unit testing LLM outputs. DeepEval incorporates the latest research to evaluate LLM outputs based on metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., which uses LLMs and various other NLP models that run locally on your machine for evaluation. Whether your application is implemented via RAG or fine-tuning, LangChain, or LlamaIndex, DeepEval has you covered. With it, you can easily determine the optimal hyperparameters to improve your RAG pipeline, prevent prompt drifting, or even transition from OpenAI to hosting your own Llama2 with confidence.

Downloads: 8 This Week

Last Update: 2 days ago
See Project
19

Clapeyron

Framework for the development and use of fluid-thermodynamic models

Welcome to Clapeyron! This module provides both a large library of thermodynamic models and a framework for one to easily implement their own models. Clapeyron provides a framework for the development and use of fluid-thermodynamic models, including SAFT, cubic, activity, multi-parameter, and COSMO-SAC.

Downloads: 1 This Week

Last Update: 2 days ago
See Project
20

Lmod

An Environment Module System based on Lua, Reads TCL Modules

Lmod is a program to manage the user environment under Unix: (Linux, Mac OS X, ...). It is a new implementation of environment modules. Lmod is a Lua-based module system that easily handles the MODULEPATH Hierarchical problem. Environment Modules provide a convenient way to dynamically change the users’ environment through modulefiles. This includes easily adding or removing directories to the PATH environment variable. Module files for Library packages provide environment variables that...

Downloads: 0 This Week

Last Update: 2026-02-23
See Project
21

InMemoryDatasets.jl

Multithreaded package for working with tabular data in Julia

InMemoryDatasets.jl is a multithreaded package for data manipulation and is designed for Julia 1.6+ (64-bit OS). The core computation engine of the package is a set of customized algorithms developed specifically for columnar tables.

Downloads: 0 This Week

Last Update: 2025-11-10
See Project
22

The Tengo Language

A fast script language for Go

...Executable as a standalone language / REPL. Use cases, rules engine, state machine, data pipeline, transpiler. If you need to evaluate a simple expression, you can use Eval function instead.

Downloads: 0 This Week

Last Update: 2025-05-24
See Project
23

ExplainableAI.jl

Explainable AI in Julia

This package implements interpretability methods for black box models, with a focus on local explanations and attribution maps in input space. It is similar to Captum and Zennit for PyTorch and iNNvestigate for Keras models. Most of the implemented methods only require the model to be differentiable with Zygote. Layerwise Relevance Propagation (LRP) is implemented for use with Flux.jl models.

Downloads: 0 This Week

Last Update: 2025-06-17
See Project
24

SLIME

The Superior Lisp Interaction Mode for Emacs

...While lisp-mode supports editing Lisp source files, slime-mode adds support for interacting with a running Common Lisp process for compilation, debugging, documentation lookup, and so on. The Read-Eval-Print Loop ("top-level") is written in Emacs Lisp for tighter integration with Emacs. The REPL also has builtin "shortcut" commands similar to those of the McCLIM listener. SLIME is able to take compiler messages and annotate them directly into source buffers.

Downloads: 1 This Week

Last Update: 2025-12-08
See Project
25

DBreeze Database

C# .NET NOSQL ( key value store embedded ) ACID database

DBreeze Database is a professional, open-source, multi-paradigm (embedded Key-Value store, objects, NoSql, text search, multi-parameter search, embedding vector database, vector similarity search/clustering, etc.), multi-threaded, transactional and ACID-compliant data management system for .NET5> / .NET Framework 3.5> / Xamarin MONO Android iOS / .NET Core 1.0> / .NET Standard 1.6> / Universal Windows Platform / .NET Portable / UNITY / CoreRT.

Downloads: 1 This Week

Last Update: 2026-03-13
See Project