Showing 395 open source projects for "orsonpdf-1.6-eval"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    Prometheus-Eval

    Prometheus-Eval

    Evaluate your LLM's response with Prometheus and GPT4

    Prometheus-Eval is an open-source framework designed to evaluate the outputs of large language models using specialized evaluator models known as Prometheus. The project provides tools, datasets, and scripts that allow developers and researchers to measure the quality of LLM responses through automated scoring rather than relying solely on human evaluators.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    The LLM Evaluation guidebook

    The LLM Evaluation guidebook

    Sharing both practical insights and theoretical knowledge about LLM

    The Evaluation Guidebook is an open educational resource created by Hugging Face that explains how to evaluate machine learning and large language models effectively. It compiles practical insights and theoretical knowledge gathered from real-world evaluation work, including experience managing the Open LLM Leaderboard and designing evaluation tools. The guidebook teaches developers how to design evaluation pipelines, select appropriate metrics, and interpret model performance results. It...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    web-eval-agent MCP Server

    web-eval-agent MCP Server

    An MCP server that autonomously evaluates web applications

    web-eval-agent is a Model Context Protocol (MCP) server that spins up a browser-use–capable debugging agent to autonomously run and evaluate web apps straight from your editor. It’s positioned as a “let the coding agent debug itself” companion: the agent launches the app, navigates flows, captures evidence, and iterates on failures without manual copy-pasting of logs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    InsightFace

    InsightFace

    State-of-the-art 2D and 3D Face Analysis Project

    State-of-the-art deep face analysis library. InsightFace is an open-source 2D&3D deep face analysis library. InsightFace is an integrated Python library for 2D&3D face analysis. InsightFace efficiently implements a wide variety of state-of-the-art algorithms for face recognition, face detection, and face alignment, which are optimized for both training and deployment. Research institutes and industrial organizations can get benefits from InsightFace library.
    Downloads: 406 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    RuntimeGeneratedFunctions.jl

    RuntimeGeneratedFunctions.jl

    Functions generated at runtime without world-age issues or overhead

    RuntimeGeneratedFunctions are functions generated at runtime without world-age issues and with the full performance of a standard Julia anonymous function. This builds functions in a way that avoids eval. For technical reasons, RuntimeGeneratedFunctions needs to cache the function expression in a global variable within some module. This is normally transparent to the user, but if the RuntimeGeneratedFunction is evaluated during module precompilation, the cache module must be explicitly set to the module currently being precompiled. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    HumanEval

    HumanEval

    Code for the paper "Evaluating Large Language Models Trained on Code"

    ...Researchers can use the dataset to run reproducible comparisons across models and track improvements in functional code synthesis. By focusing on correctness through execution, human-eval provides a rigorous and practical way to evaluate programming capabilities in AI systems.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Oscar.jl

    Oscar.jl

    A comprehensive open source computer algebra system for computations

    Welcome to the OSCAR project, a visionary new computer algebra system that combines the capabilities of four cornerstone systems: GAP, Polymake, Antic and Singular. OSCAR requires Julia 1.6 or newer. In principle it can be installed and used like any other Julia package; doing so will take a couple of minutes. A comprehensive open source computer algebra system for computations in algebra, geometry, and number theory.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    Yaegi

    Yaegi

    Yaegi is Another Elegant Go Interpreter

    ...Note that you can use rlwrap (install with your favorite package manager), and alias the yaegi command in alias yaegi='rlwrap yaegi' in your ~/.bashrc, to have history and command line edition. Complete support of Go specification. Written in pure Go, using only the standard library. Simple interpreter API: New(), Eval(), Use(). Works everywhere Go works.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Tencent-Hunyuan-Large

    Tencent-Hunyuan-Large

    Open-source large language model family from Tencent Hunyuan

    ...It aims to provide competitive capability with efficient deployment and inference. FP8 quantization support to reduce memory usage (~50%) while maintaining precision. High benchmarking performance on tasks like MMLU, MATH, CMMLU, C-Eval, etc.
    Downloads: 2 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    openbench

    openbench

    Provider-agnostic, open-source evaluation infrastructure

    ...It bundles dozens of evaluation suites — covering knowledge, reasoning, math, code, science, reading comprehension, long-context recall, graph reasoning, and more — so users don’t need to assemble disparate datasets themselves. With a simple CLI interface (e.g. bench eval <benchmark> --model <model-id>), you can quickly evaluate any model supported by Groq or other providers (OpenAI, Anthropic, HuggingFace, local models, etc.). openbench also supports private/local evaluations: you can integrate your own custom benchmarks or data (e.g. internal test suites, domain-specific tasks) to evaluate models in a privacy-preserving way.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    nixd

    nixd

    Nix language server, based on nix libraries

    This is a feature-rich nix language server interoperating with C++ nix.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    jlrs

    jlrs

    Julia bindings for Rust

    jlrs is a crate that provides access to most of the Julia C API, it can be used to embed Julia in Rust applications and to use functionality it provides when writing ccallable functions in Rust. Currently, this crate is only tested in combination with Julia 1.6 and 1.9, but also supports Julia 1.7, 1.8, and 1.10. Using the current stable version is highly recommended. The minimum supported Rust version is currently 1.65. Julia must be installed before jlrs can be used, jlrs is compatible with Julia 1.6 up to and including Julia 1.10. The JlrsCore package must also have been installed, if this is not the case it will automatically be added when jlrs is initialized by default. jlrs has not been tested with juliaup yet on Linux and macOS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    OhMyREPL.jl

    OhMyREPL.jl

    Syntax highlighting and other enhancements for the Julia REPL

    OhMyREPL.jl is a Julia package that enhances the Julia REPL (Read-Eval-Print Loop) experience with syntax highlighting, bracket matching, prompt customization, and automatic indentation. It is designed to make the command-line interface more visually appealing and user-friendly, especially during interactive development and debugging. It runs entirely in the terminal and does not require external dependencies or GUI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    PyTorch Image Models

    PyTorch Image Models

    The largest collection of PyTorch image encoders / backbones

    timm (PyTorch Image Models) is a premier library hosting a vast collection of state-of-the-art image classification models and backbones such as ResNet, EfficientNet, NFNet, Vision Transformer, ConvNeXt, and more. Created by Ross Wightman and now maintained by Hugging Face, it includes pretrained weights, data loaders, augmentations, optimizers, schedulers, and reference scripts for training, evaluation, inference, and model export. It's an essential toolkit for vision research and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Dialyxir

    Dialyxir

    Mix tasks to simplify use of Dialyzer in Elixir projects

    Mix tasks to simplify use of Dialyzer in Elixir projects. Elixir 1.6 is required, to support the new pretty printing feature. If your project is not yet on 1.6, continue to specify 0.5 in your mix deps. Warning messages have been greatly improved, but are filtered through the legacy formatter to support your existing ignore files. You can optionally use the new Elixir term format for ignore files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    SCI

    SCI

    Configurable Clojure/Script interpreter suitable for scripting

    SCI is a lightweight Clojure interpreter designed for embedding and scripting in JVM or native apps (e.g., Babashka). It supports stateful contexts, multiple evaluations, and a configurable environment without full JVM startup overhead. Packed with optional extensions (like reagent, js-interop), it enables REPL-like interactivity in minimal environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    DeepEval
    DeepEval is a simple-to-use, open-source LLM evaluation framework, for evaluating and testing large-language model systems. It is similar to Pytest but specialized for unit testing LLM outputs. DeepEval incorporates the latest research to evaluate LLM outputs based on metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., which uses LLMs and various other NLP models that run locally on your machine for evaluation. Whether your application is implemented via RAG or fine-tuning, LangChain, or LlamaIndex, DeepEval has you covered. With it, you can easily determine the optimal hyperparameters to improve your RAG pipeline, prevent prompt drifting, or even transition from OpenAI to hosting your own Llama2 with confidence.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 19
    Clapeyron

    Clapeyron

    Framework for the development and use of fluid-thermodynamic models

    Welcome to Clapeyron! This module provides both a large library of thermodynamic models and a framework for one to easily implement their own models. Clapeyron provides a framework for the development and use of fluid-thermodynamic models, including SAFT, cubic, activity, multi-parameter, and COSMO-SAC.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Lmod

    Lmod

    An Environment Module System based on Lua, Reads TCL Modules

    Lmod is a program to manage the user environment under Unix: (Linux, Mac OS X, ...). It is a new implementation of environment modules. Lmod is a Lua-based module system that easily handles the MODULEPATH Hierarchical problem. Environment Modules provide a convenient way to dynamically change the users’ environment through modulefiles. This includes easily adding or removing directories to the PATH environment variable. Module files for Library packages provide environment variables that...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    InMemoryDatasets.jl

    InMemoryDatasets.jl

    Multithreaded package for working with tabular data in Julia

    InMemoryDatasets.jl is a multithreaded package for data manipulation and is designed for Julia 1.6+ (64-bit OS). The core computation engine of the package is a set of customized algorithms developed specifically for columnar tables.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    The Tengo Language

    The Tengo Language

    A fast script language for Go

    ...Executable as a standalone language / REPL. Use cases, rules engine, state machine, data pipeline, transpiler. If you need to evaluate a simple expression, you can use Eval function instead.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    ExplainableAI.jl

    ExplainableAI.jl

    Explainable AI in Julia

    This package implements interpretability methods for black box models, with a focus on local explanations and attribution maps in input space. It is similar to Captum and Zennit for PyTorch and iNNvestigate for Keras models. Most of the implemented methods only require the model to be differentiable with Zygote. Layerwise Relevance Propagation (LRP) is implemented for use with Flux.jl models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    SLIME

    SLIME

    The Superior Lisp Interaction Mode for Emacs

    ...While lisp-mode supports editing Lisp source files, slime-mode adds support for interacting with a running Common Lisp process for compilation, debugging, documentation lookup, and so on. The Read-Eval-Print Loop ("top-level") is written in Emacs Lisp for tighter integration with Emacs. The REPL also has builtin "shortcut" commands similar to those of the McCLIM listener. SLIME is able to take compiler messages and annotate them directly into source buffers.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    DBreeze Database

    DBreeze Database

    C# .NET NOSQL ( key value store embedded ) ACID database

    DBreeze Database is a professional, open-source, multi-paradigm (embedded Key-Value store, objects, NoSql, text search, multi-parameter search, embedding vector database, vector similarity search/clustering, etc.), multi-threaded, transactional and ACID-compliant data management system for .NET5> / .NET Framework 3.5> / Xamarin MONO Android iOS / .NET Core 1.0> / .NET Standard 1.6> / Universal Windows Platform / .NET Portable / UNITY / CoreRT.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB