aeneas is a Python/C library and a set of tools to automagically synchronize audio and text. Given a list of text fragments and an audio file containing the narration of that text, aeneas generates a synchronization map that assigns each fragment the time interval in which it is spoken. In computer science this task is known as forced alignment.
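To make the idea of a synchronization map concrete, here is a minimal sketch of one as a plain data structure: a list of fragments, each carrying its text and its begin/end times in seconds. The `Fragment` type, the `sync_map_to_json` helper, the sample timings, and the JSON layout are illustrative assumptions, not the aeneas API or its exact output format.

```python
import json
from collections import namedtuple

# Hypothetical record type: one text fragment plus its audio interval (seconds).
Fragment = namedtuple("Fragment", ["fragment_id", "begin", "end", "text"])


def sync_map_to_json(fragments):
    """Serialize a list of Fragment records to a JSON string.

    The key names are an illustrative layout chosen for this sketch.
    """
    payload = {
        "fragments": [
            {
                "id": f.fragment_id,
                "begin": "{:.3f}".format(f.begin),
                "end": "{:.3f}".format(f.end),
                "lines": [f.text],
            }
            for f in fragments
        ]
    }
    return json.dumps(payload, indent=1, sort_keys=True)


# Made-up timings: each fragment starts where the previous one ends.
sync_map = [
    Fragment("f000001", 0.000, 2.680, "From fairest creatures we desire increase,"),
    Fragment("f000002", 2.680, 5.880, "That thereby beauty's rose might never die,"),
]

print(sync_map_to_json(sync_map))
```

A consumer such as a subtitle renderer or a read-along e-book player would only need the identifier and the two timestamps to highlight the right fragment during playback.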
Features
- aeneas has been developed and tested on 64-bit Debian with Python 2.7 and Python 3.5, which are the only supported platforms at the moment
- Documentation available
- All-in-one installers are available for Mac OS X and Windows
- Input text files in parsed, plain, subtitles, or unparsed (XML) format
- Multilevel input text files in mplain and munparsed (XML) format
- Text extraction from XML (e.g., XHTML) files using id and class attributes
- Arbitrary text fragment granularity (single word, subphrase, phrase, paragraph, etc.)
- Input audio file formats: all those readable by ffmpeg
- MFCC and DTW computed via Python C extensions to reduce the processing time
- Robust against misspelled/mispronounced words, local rearrangements of words, background noise/sporadic spikes
- Adjustable splitting times, including a max character/second constraint for closed captioning (CC) applications
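The feature list mentions dynamic time warping (DTW) over MFCC features. As a rough illustration of what DTW does, the following pure-Python sketch aligns two short numeric sequences and returns the total cost together with the warping path. It is a textbook quadratic-time version written for this page, not the C extension shipped with aeneas; the function name `dtw_path` and the toy inputs are assumptions, and the scalar values stand in for MFCC frame vectors.

```python
def dtw_path(a, b, dist=lambda x, y: abs(x - y)):
    """Classic O(len(a) * len(b)) dynamic time warping.

    Returns (total_cost, path), where path is a list of (i, j) index pairs
    matching a[i] with b[j], in increasing order.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j - 1],  # advance both sequences
                                 cost[i - 1][j],      # advance a only
                                 cost[i][j - 1])      # advance b only

    # Walk back from the end, always taking the cheapest predecessor cell.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min((cost[i - 1][j - 1], i - 1, j - 1),
                      (cost[i - 1][j], i - 1, j),
                      (cost[i][j - 1], i, j - 1))
    path.reverse()
    return cost[n][m], path


# Toy example: b is a "slower" rendition of a (some values are held longer).
reference = [0, 1, 2, 3]
stretched = [0, 0, 1, 2, 2, 3]
total, path = dtw_path(reference, stretched)
print(total)  # 0.0
print(path)   # [(0, 0), (0, 1), (1, 2), (2, 3), (2, 4), (3, 5)]
```

The path shows how one element of the shorter sequence can map to several elements of the longer one. That elasticity is what lets an alignment tolerate differences in speaking rate between the two signals being compared.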
Categories
- Libraries
License
- GNU Affero General Public License (AGPL)