Tianji is a comprehensive evaluation suite designed to assess the performance of large language models (LLMs) across multiple dimensions. It focuses on measuring general capabilities such as reasoning, knowledge, commonsense, and language understanding. Tianji provides a curated set of benchmarks and a unified framework for systematically comparing LLMs, making it useful for research and model selection.
Features
- Provides a wide variety of benchmarks for evaluating LLM performance
- Supports testing across reasoning, commonsense, and language understanding tasks
- Includes automatic and human evaluation pipelines
- Offers standardized metrics for fair model comparison
- Easy to extend with custom datasets and evaluation criteria (see the generic sketch after this list)
- Compatible with leading LLM architectures and APIs
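To illustrate the general pattern such a suite standardizes, the sketch below shows a minimal evaluation loop: run a model callable over a task's examples, score each output, and aggregate a metric. This is a generic, self-contained illustration only; the function and dataset names are assumptions and do not reflect Tianji's actual API, which is not documented on this page.

```python
# Generic evaluation-loop sketch (illustrative only; not Tianji's actual API).
# It shows the pattern a benchmark suite standardizes: run a model callable
# over a task's examples, score each output, and aggregate a metric.

from typing import Callable, Dict, List


def exact_match(prediction: str, reference: str) -> float:
    """Score 1.0 if the normalized prediction equals the reference, else 0.0."""
    return float(prediction.strip().lower() == reference.strip().lower())


def run_benchmark(
    model_fn: Callable[[str], str],
    examples: List[Dict[str, str]],
) -> Dict[str, float]:
    """Evaluate a model callable on a list of {'prompt', 'answer'} examples."""
    scores = [exact_match(model_fn(ex["prompt"]), ex["answer"]) for ex in examples]
    return {"accuracy": sum(scores) / len(scores), "count": float(len(scores))}


if __name__ == "__main__":
    # Toy commonsense-style task; a real suite would load curated benchmark data.
    dataset = [
        {"prompt": "What color is the sky on a clear day?", "answer": "blue"},
        {"prompt": "How many legs does a spider have?", "answer": "8"},
    ]

    # Stand-in for a call to the LLM under evaluation (e.g., an API client).
    def dummy_model(prompt: str) -> str:
        return "blue" if "sky" in prompt else "8"

    print(run_benchmark(dummy_model, dataset))
```

A real suite adds what this sketch omits: curated datasets per capability, multiple metrics beyond exact match, and reporting that makes scores comparable across models.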
Categories
Artificial Intelligence

License
Apache License V2.0