Various tools for creating annotated parallel corpora including pre-trained tagging and parsing models for various languages, sentence alignment tools and word alignment tools.

Uplug also includes a web-based interface for interactive sentence and word alignment and scripts for indexing and querying parallel corpora using the Corpus Work Bench CWB.

Download 'uplug-main' first and then add other packages.

Project Samples

Project Activity

See All Activity >

License

GNU General Public License version 3.0 (GPLv3)

Follow Uplug corpus tools

Uplug corpus tools Web Site

nel_h2
Gen AI apps are built with MongoDB Atlas Icon
Gen AI apps are built with MongoDB Atlas

The database for AI-powered applications.

MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Uplug corpus tools!

Additional Project Details

Operating Systems

BSD, Linux, Mac

Intended Audience

Science/Research

User Interface

Command-line, Web-based

Programming Language

Perl, Unix Shell

Related Categories

Unix Shell Machine Translation Software, Unix Shell Research Software, Perl Machine Translation Software, Perl Research Software

Registered

2004-04-17