DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.

Features

Uses a model trained by machine learning techniques
Based on Baidu's Deep Speech research paper
Uses Google's TensorFlow to make the implementation easier
A pre-trained English model is available for use
Download important inference material from the DeepSpeech releases page
Run in real time on all devices

Project Samples

Project Activity

See All Activity >

License

Mozilla Public License 2.0 (MPL 2.0)

Follow DeepSpeech

DeepSpeech Web Site

Other Useful Business Software

Try Google Cloud Risk-Free With $300 in Credit

No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free

Rate This Project

User Reviews

Be the first to post a review of DeepSpeech!

Additional Project Details

Programming Language

C++, Python

Related Categories

Python Word Processors, Python Speech Software, Python Speech to Text Software, Python Raspberry Pi Software, C++ Word Processors, C++ Speech Software, C++ Speech to Text Software, C++ Raspberry Pi Software

Registered

2021-03-16

Similar Business Software

Google Cloud Speech-to-Text

Google Cloud’s Speech API processes more than 1 billion voice minutes per month with close to human levels of understanding for many commonly spoken languages. Powered by the best of Google's AI research and technology, Google Cloud's Speech-to-Text API helps you accurately transcribe speech...

See Software
talvala surveillance

Talvala is a speech analytics company. We use Baidu’s Deep Speech technology and machine learning for compliance surveillance and human/machine interfaces. We develop speech-based monitoring applications and human machine interfaces (“HMI”) for a wide variety of clients. We believe that the...

See Software
Comet Backup

Start running backups and restores in less than 15 minutes! Fast, secure backup software for businesses and IT providers. Comet is a flexible, all-in-one backup platform available in 13 languages. You choose your backup destination, server location, configuration and setup. Backup to your...

See Software
Kasm Workspaces

Kasm Workspaces streams your workplace environment directly to your web browser…on any device and from any location. Kasm uses our high-performance streaming and secure isolation technology to provide web-native Desktop as a Service (DaaS), application streaming, and secure/private web...

See Software
Wave Browser

Wave Browser is an efficient, eco-conscious browser that creates a cleaner, more organized, and more meaningful online experience while helping remove ocean plastic through its partnership with 4ocean. Built on the trusted Chromium foundation, Wave Browser brings essential tools directly into...

See Software
Canva

Design anything. Publish anywhere. Use Canva’s drag-and-drop feature and professional layouts to design consistently stunning graphics. Design presentations, social media graphics with thousands of beautiful forms, over 100 million stock photos, video & audio, and all the tools you need. Design...

See Software

Report inappropriate content

DeepSpeech

Open source embedded speech-to-text engine

Get an email when there's a new version of DeepSpeech

Features

Project Samples

Project Activity

Categories

License

Follow DeepSpeech

User Reviews

Additional Project Details

Programming Language

Related Categories

Registered