Autolabel is a Python library to label, clean and enrich datasets with Large Language Models (LLMs). Autolabel data for NLP tasks such as classification, question-answering and named entity recognition, entity matching and more. Seamlessly use commercial and open-source LLMs from providers such as OpenAI, Anthropic, HuggingFace, Google and more.
Features
- Autolabel data for NLP tasks such as classification, question-answering and named entity-recognition, entity matching and more
- Seamlessly use commercial and open source LLMs from providers such as OpenAI, Anthropic, HuggingFace, Google and more
- Leverage research-proven LLM techniques to boost label quality, such as few-shot learning and chain-of-thought prompting
- Confidence estimation and explanations out of the box for every single output label
- Caching and state management to minimize costs and experimentation time
- You can get started with Autolabel by simpling bringing the dataset you want to label, picking your favorite LLM and writing a few lines of code
Categories
Large Language Models (LLM)License
MIT LicenseFollow Autolabel
Other Useful Business Software
Go from Code to Production URL in Seconds
Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of Autolabel!