Showing 23 open source projects for "pdf to speech"

  • 1
    abogen

    Generate audiobooks from EPUBs, PDFs and text with captions

    abogen is a tool designed to generate audiobooks (or speech narrations) from textual sources such as EPUBs, PDFs, or plain text, with synchronized captions. In other words, it automates the pipeline of reading a digital book (or document), converting its text into speech via a TTS engine, and packaging the result into an audiobook format — likely along with timestamped captions or subtitles that align with the spoken audio. This can be very useful for accessibility, content consumption on...
    Downloads: 7 This Week
  • 2
    pdf-to-podcast

    PDF to Podcast transforms any PDF document into a podcast-ready audio

    PDF to Podcast transforms any PDF document into a podcast-ready audio episode using advanced AI text-to-speech (TTS) providers. Upload a PDF, select your preferred voice and provider, and receive an MP3 and a ready-to-use RSS feed for your podcast app.
    Downloads: 2 This Week
  • 3
    Docling

    Get your documents ready for gen AI

    Docling is an open-source document processing toolkit built to prepare diverse content types for modern generative AI and data workflows. The project focuses on converting and parsing many document formats into a unified structured representation that downstream systems can easily consume. It supports advanced PDF understanding, including layout detection, table extraction, and reading order analysis, enabling high-fidelity document intelligence pipelines. Docling is designed to run...
    Downloads: 1 This Week
  • 4
    shuyuan

    Reading book source

    shuyuan is a project oriented around reading and knowledge consumption, especially targeting large-scale text content such as books, articles, or educational material. The name suggests “academy” or “study hall,” and the tool aims to help users ingest, organize, and manage reading content — possibly offering features like text parsing, annotation, metadata generation, translation, or storage for later reference. The repository is set up to support document ingestion, indexing, and maybe some...
    Downloads: 0 This Week
  • 5
    Ailice

    AIlice is a fully autonomous, general-purpose AI agent

    AIlice is an open-source autonomous AI agent framework built to function as a general-purpose assistant that can plan, decompose, and execute complex tasks through a structured multi-agent architecture. The project presents itself as a standalone assistant powered by open-source language models, with an internal design that treats user requests almost like executable programs rather than simple chat prompts. Its core IACT architecture allows the system to break large goals into smaller...
    Downloads: 0 This Week
  • 6
    Accessible-Coconut

    A GNU/Linux operating system accessible to the visually impaired.

    Accessible-Coconut (AC) is a community-driven GNU/Linux operating system that is completely accessible to persons with visual impairment. AC is derived from Ubuntu MATE. The goal is to make a free and open-source, eyes-free desktop environment. Forum: https://groups.google.com/forum/#!forum/accessible-coconut Telegram forum: https://telegram.me/accessible_coconut Project home: https://zendalona.com/
    Downloads: 73 This Week
  • 7
    ChatGPT Desktop Application

    🔮 ChatGPT Desktop Application (Mac, Windows and Linux)

    ChatGPT Desktop Application (Mac, Windows and Linux)
    Downloads: 91 This Week
  • 8
    Light.Web NovelReader

    A light or web novel reader app for desktop (Windows)

    You can read your favourite light, web, or wuxia novels in this application.
    Downloads: 13 This Week
  • 9
    NASH OS

    Nash Operating System for Modern Ecommerce

    The all-built-in-one, automatic, ready-to-go, out-of-the-box, easy-to-use, state-of-the-art, and really awesome NASH OS! More than 25,000 flexible features and controls, all scalable! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.
    Downloads: 2 This Week
  • 10
    Idealtake

    Search engine using MySQL and VB.NET

    This software was designed to simplify research on personal or business documents; it is akin to a search engine. You can imagine databases on different topics: a photo album, personal documents, a company catalog, and many others. The initial project began in June 2006; I developed this application under the GPL licence for the company where I work. This software was used internally by employees and by their clients. This project was used until the end of 2009. The initial project was...
    Downloads: 0 This Week
  • 11

    C# Speech Recognition Tutorial

    ...The PDF file in the zip file explains how to link the voice recognition to a database.
    Downloads: 0 This Week
  • 12
    Word Doctor

    Nextgen word app. Word Docs made easy!

    Word Doctor is a word editor and writer's aid designed to analyze writing "Content" and "Style". Inspire your creative process and get to work fast using dictation (Speech to Text), or use the Ink-Blot test to inspire creativity. Analyze what you already have and identify imagery, weak writing structures, and more. Content is king, and Word Doctor can certainly help with that!
    Downloads: 0 This Week
  • 13
    Text2MP3

    PDF/Text to MP3 - Text Processing to speech

    This project is deprecated. We apologize. Windows application that strips PDFs into text and converts the text to speech. You can also save the extracted text as text files, Word docs, CSVs, and RTF files. Browse for PDFs from the web, save them, and strip them. Good for students, lecturers, theses, and educational purposes. Some bugs yet to fix in the coming weeks, although these do not affect the functionality...
    Downloads: 0 This Week
  • 14
    ...Teager energy has been used to amplify the difference between the noisy and clean signal coefficients. The link to the paper in pdf: https://www.researchgate.net/profile/Md_Tauhidul_Islam2/publications or Matlab code in file exchange: http://www.mathworks.com/matlabcentral/fileexchange/55030-speech-enhancement-based-on-student-t-modeling-of-te-operated-pwp-coefficients Please do not forget to cite our paper when you use our code.
    Downloads: 0 This Week
  • 15

    "MedicalRecords"

    MedicalRecords is an integrated medical information system.

    ...Data that are downloadable in machine-readable format can be transferred electronically to the database. Alternatively, the data can be transferred from USB flash drives, CD-ROMs, or other removable storage media. Documents can be entered by scanning to PDF files or other formats. Finally, information may be entered through speech recognition or typing. “MedicalRecords” gives one or more patients access to an integrated medical record whose data may come from a variety of sources. It also provides an easy means of presenting the integrated data to a specialist or other new care provider, emergency room staff, or admitting physicians.
    Downloads: 0 This Week
  • 16
    WikiDict English Sinhala Dictionary

    English Sinhala Dictionary

    English To Sinhala Translate; Sinhala To English Translate; Full Text Translate; Results To Speech; PDF Reader; Mini Dictionary Mode; Search Bookmark; Real Time Search Suggestion; Search Mode; Sinhala Unicode Keyboard; Single Word Search; Results To Mail; Over 200,000 word database
    Downloads: 0 This Week
  • 17
    Speaking Words

    Now with Speaking Words you can convert all documents into audio files

    I always wondered whether it was possible to learn from my books without reading them. It is possible now, thanks to the Speaking Words software. With Speaking Words you can convert any document into an audio file. You can listen to web articles, PowerPoint slides, PDF files, and any other file. Now you have the opportunity to listen to every document on your mobile phone or computer without any hassle or worry. Some features of the Speaking Words software: Totally free of cost :) Nice and user friendly...
    Downloads: 1 This Week
  • 18

    Fountain Libre Office Tools

    Fountain Tools for LibreOffice easily format professional screenplays.

    Fountain Tools for LibreOffice is a freeware toolset to easily create professionally formatted screenplays "from scratch," sourced from platform-independent and future-proof fountain text files (or, a subset of fountain), including proper handling of elements that span pages (no widows or orphans--it's a layout thing), and formatting which so closely mimics the better "bang for your buck" layout of Movie Magic Screenwriter that no reader will be the wiser. In other words, it may shorten your...
    Downloads: 1 This Week
  • 19

    pdf2mp3

    Simply convert your PDF files into audio books

    Summary: Are your eyes tired of looking at a tablet or cell-phone screen while reading ebooks? Do you have difficulty reading from an LCD screen, especially in a moving vehicle? This software is for you! It converts your PDF files to MP3 audio books. Special features (compared to similar projects): Each page is in a separate MP3 file. Created MP3 files have ID3v2 tags showing book name and page number. Conversion is multi-threaded, so all CPU cores are used and conversion is several times faster.
    Downloads: 5 This Week
  • 20
    Tested for Ubuntu Maverick - Create Audiobooks from eBooks, text or pictures. - Read eBooks or text aloud while scrolling through pages
    Downloads: 0 This Week
  • 21
    MyDSReader is a homebrew application for the Nintendo DS that helps visually impaired users: 1. Read documents in digital format (text, Word, PDF, DAISY) 2. Take voice annotations 3. Read e-mails and reply/write using recorded voice clips
    Downloads: 0 This Week
  • 22
    uListen is a TTS (Text To Speech) application. It can read aloud web pages, CHM files, PDF files, Word files, and plain text files.
    Downloads: 1 This Week
  • 23
    PDF Annot is a piece of software that enables you to add audio and text annotations to a PDF. It uses the JPedal SimpleViewer and the iText library. Annotations are supported by Adobe's official PDF Reader. Report any bug here: krakosia[at]gmail.com
    Downloads: 0 This Week