Offline speech recognition API for Android, iOS, Raspberry Pi
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
A PyTorch-based Speech Toolkit
OpenVINO™ Toolkit repository
Training data (data labeling, annotation, workflow) for all data types
Interactive Machine Learning experiments
Multilingual Automatic Speech Recognition with word-level timestamps
A ranked list of awesome machine learning Python libraries
Toolkit for conversational AI
Port of OpenAI's Whisper model in C/C++
Python Audio Analysis Library: Feature Extraction, Classification
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
A Lightweight Face Recognition and Facial Attribute Analysis
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Statistical machine intelligence and learning engine
Models for the spaCy Natural Language Processing (NLP) library
Open Source Computer Vision Library
Data manipulation and transformation for audio signal processing
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Translate the video from one language to another and embed dubbing
High-Performance Face Recognition Library on PaddlePaddle & PyTorch
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Library for OCR-related tasks powered by Deep Learning
Han Language Processing
Apache OpenNLP