Search Results for "convert word documents to html"

Showing 123 open source projects for "convert word documents to html"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    DocStrange

    DocStrange

    Extract and convert data from any document, images, pdfs, word doc

    DocStrange is an open-source document understanding and extraction library designed to convert complex files into structured, LLM-ready outputs such as Markdown, JSON, CSV, and HTML. Developed by Nanonets, the project combines OCR, layout detection, table understanding, and structured extraction into one end-to-end pipeline, which reduces the need to stitch together multiple separate services. It is built for developers who need high-quality parsing from scans, photos, PDFs, office files, and other document sources while preserving privacy and control over the processing flow. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    rmarkdown

    rmarkdown

    Dynamic Documents for R

    R Markdown is an R package for creating dynamic, reproducible documents that combine code (R, Python, SQL, etc.), results (figures, tables), and narrative text. Built on Knitr and Pandoc, it supports generating HTML, PDF, Word, slideshows, dashboards, and more. It’s widely used in data science and reproducible reporting workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    PasteMD

    PasteMD

    Paste Markdown and AI responses into Word Excel instantly fast

    PasteMD is a lightweight desktop utility designed to streamline the process of transferring formatted content from the clipboard into office applications such as Word, WPS, and Excel. It primarily targets users who frequently copy content from AI chat tools or web pages and encounter formatting issues, especially with Markdown, tables, and LaTeX formulas. PasteMD operates from the system tray and monitors clipboard content, automatically converting Markdown or HTML into properly formatted documents using Pandoc. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    PyMuPDF

    PyMuPDF

    Python bindings for MuPDF's rendering library.

    ...It renders text with metrics and spacing accurate to within fractions of a pixel for the highest fidelity in reproducing the look of a printed page on the screen. The viewer is small, fast, yet complete. It supports many document formats, such as PDF, XPS, OpenXPS, CBZ, EPUB, and FictionBook 2. You can annotate PDF documents and fill out forms with the mobile viewers (this feature is coming soon to the desktop viewer as well). The command line tools allow you to annotate, edit, and convert documents to other formats such as HTML, SVG, PDF, and CBZ. You can also write scripts to manipulate documents using Javascript. The library is written modularly in portable C, so features can be added and removed by integrators if they so desire.
    Downloads: 7 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Pandoc

    Pandoc

    The universal markup converter

    Pandoc is a universal document converter able to convert files from a multitude of markup formats into another. With Pandoc, you have a swiss-army knife of a converter, able to convert practically any markup format into any other. Pandoc contains a Haskell library for conversions as well as a command-line tool that uses this library. It can convert to and from just about anything-- lightweight markup formats, HTML formats, documentation formats, ebooks, TeX formats, word processor formats and so much more. ...
    Downloads: 195 This Week
    Last Update:
    See Project
  • 6
    WeChatMsg

    WeChatMsg

    Project aimed at extracting, exporting, and analyzing chat records

    WeChatMsg repository hosts an open-source project aimed at extracting, exporting, and analyzing chat records from the WeChat messaging platform. It provides tools that read local WeChat database files and allow users to convert chat data into readable formats such as HTML, Word, and CSV, making it possible to inspect conversations outside the mobile app environment. Beyond simple export, the project includes mechanisms for analyzing chat histories and generating annual reports or visual summaries about messaging trends, interaction patterns, and more. ...
    Downloads: 146 This Week
    Last Update:
    See Project
  • 7
    bookdown

    bookdown

    Authoring Books and Technical Documents with R Markdown

    A open-source (GPL-3) R package to facilitate writing books and long-form articles/reports with R Markdown. Generate printer-ready books and ebooks from R Markdown documents. A markup language easier to learn than LaTeX, and to write elements such as section headers, lists, quotes, figures, tables, and citations. Multiple choices of output formats: PDF, LaTeX, HTML, EPUB, and Word. Possibility of including dynamic graphics and interactive applications (HTML widgets and Shiny apps) Support for languages other than R, including C/C++, Python, and SQL, etc. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    iLovePDF Api

    iLovePDF Api

    iLovePDF Rest Api - PHP Library

    ...We offer a simple and concise API Reference and Guide as well as API Libraries with their own docs too. Our infrastructure uses the best PDF technology for processing PDF files. Merge and split documents with a variety of custom options. Remove, extract or organize PDF pages as you need. Reduce the size of your PDF while maintaining its original quality and formatting. Easily convert Images, MS Word, PowerPoint and Excel files into non-editable PDF documents. Convert PDF documents to JPG images or to PDF/A format.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    PDF to Word Converter
    Coolmuster PDF to Word Converter turns any PDF into a fully editable Microsoft Word document — with perfect preservation of: Text styles & fonts Images & graphics Tables & columns Hyperlinks Overall layout & formatting No more re-formatting after conversion.
    Downloads: 13 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    DOCX Document Converter

    DOCX Document Converter

    Convert .docx to .md/.txt and .html. Free, unlimited, fast.

    A simple, free, unlimited, secure web-based tool that converts Microsoft Word documents (.docx) into Markdown (.md/.txt) and HTML files. Perfect for developers, writers, and anyone who needs to transform .docx MS Office Word documents into web-friendly or AI context friendly formats. Unlike those other jerks on the web that charge many dollars per month for this, I made it free, unlimited and open source.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 11
    Huxtable

    Huxtable

    An R package to create styled tables in multiple output formats

    Huxtable is an R package to create LaTeX and HTML tables, with a friendly, modern interface. Features include control over text styling, number format, background color, borders, padding, and alignment. Cells can span multiple rows and/or columns. Tables can be manipulated with standard R subsetting or dplyr functions.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    LandPPT

    LandPPT

    An LLM-based presentation generation platform

    LandPPT is an open-source AI platform that automatically generates professional presentation slides using large language models. The system allows users to create complete PowerPoint presentations simply by entering a topic or uploading source documents such as PDFs, Word files, or Markdown notes. Using natural language processing and structured content generation, the platform produces presentation outlines and converts them into fully formatted slide decks. The application integrates...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 13
    Unstructured.IO

    Unstructured.IO

    Open source libraries and APIs to build custom preprocessing pipelines

    The unstructured library provides open-source components for ingesting and pre-processing images and text documents, such as PDFs, HTML, Word docs, and many more. The use cases of unstructured revolve around streamlining and optimizing the data processing workflow for LLMs. unstructured modular bricks and connectors form a cohesive system that simplifies data ingestion and pre-processing, making it adaptable to different platforms and is efficient in transforming unstructured data into structured outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Extractous

    Extractous

    Fast and efficient unstructured data extraction

    Extractous is a Rust-based unstructured data extraction library focused on fast local parsing of documents and other content-heavy files. Its purpose is to extract text and metadata efficiently from formats such as PDF, Word, HTML, email archives, images, and more, without depending on external APIs or separate parsing servers. The project emphasizes performance and low memory usage, and its maintainers describe it as a local-first alternative to heavier extraction stacks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Gotenberg

    Gotenberg

    A Docker-powered stateless API for PDF files

    Gotenberg provides a developer-friendly API to interact with powerful tools like Chromium and LibreOffice for converting numerous document formats (HTML, Markdown, Word, Excel, etc.) into PDF files, and more! Thanks to Docker, you don't have to install each tool in your environments; drop the Docker image in your stack, and you're good to go! The webhook feature allows you to upload the output file to the destination of your choice. There are many options to fit your requirements, from the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    OpenDataLoader PDF

    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    OpenDataLoader PDF is an open-source document processing system designed to convert complex PDF files into structured, AI-ready formats such as Markdown, JSON, and HTML while preserving layout, hierarchy, and semantic meaning. It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 17
    OpenAPI Generator

    OpenAPI Generator

    OpenAPI Generator allows generation of API client libraries

    With 50+ client generators, you can easily generate code to interact with any server which exposes an OpenAPI document. Maintainers of APIs may also automatically generate and distribute clients as part of official SDKs. Each client supports different options and features, but all templates can be replaced with your own Mustache-based templates. Getting started with server development can be tough, especially if you're evaluating technologies. We can reduce the burden when you bring your own...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    PandaWiki

    PandaWiki

    AI-powered open source platform for building intelligent wiki bases

    PandaWiki is an open source knowledge base system designed to help users build intelligent documentation platforms powered by large language models. It combines traditional wiki functionality with modern AI capabilities, allowing teams and individuals to create and manage product documentation, technical manuals, FAQs, and blog-style knowledge resources. PandaWiki provides tools for managing knowledge bases through an administrative interface while also generating public-facing wiki sites...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    poi-tl

    poi-tl

    Generate awesome word(docx) with template

    A better way to generate word(docx) with the template, based on Apache POI. FreeMarker or Velocity generates new html pages or configuration files based on text templates and data. poi-tl is a Word template engine that generates new documents based on Word templates and data. The Word template has rich styles. Poi-tl will perfectly retain the styles in the template in the generated documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    MSWord2CHM is a VB.NET application for converting MS Office Word documents (doc) into the compiled help (chm) document format, using the Primary Interop Assemblies (PIA) and the 'HTML Tidy' and 'TidyForNet' libraries
    Downloads: 7 This Week
    Last Update:
    See Project
  • 21
    csv2odf

    csv2odf

    csv2odf can convert csv data to formatted spreadsheets and documents.

    ...It can be used to create spreadsheets and documents for LibreOffice, OpenOffice, Microsoft Office Excel and Word. It is open source GPL v3 and crossplatform, it can run on most operating systems that can run Python (Python is required). More details, example files, and online manual at http://csv2odf.sf.net.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    LinWriter

    LinWriter

    small efficient text editor for Windows and Linux

    LinWriter is a small robust utility that can write quick documents and notes for your every day use. Providing a friendly and intuitive interface, it offers a distraction free environment to enhance productivity for your documents and notes. Key features include a minimalistic interface and support for common formats such as TXT, Markdown, HTML and its custom format 'rtdoc', which allows rich text editing. Including are also basic formatting options, inserting graphics, bullets, numbered...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Govel

    Govel

    Lightweight and lightning fast novel writing and organizing tool

    Govel is a lightweight novel writing and organizing software designed with the intention to leave your hands where they belong: on the keyboard. There are no fancy menus, submenus, dialogues, buttons, tabs, you name it. You don't even need to use the mouse, which makes your overall workflow faster than in any other program. You can still customize the visual appearance while beeing maximum productive. And the best of all: Govel is completly free.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    pdf combiner merger converter splitter

    pdf combiner merger converter splitter

    PDF Combiner is a user-friendly, GUI-based tool built in

    PDF Combiner is a user-friendly open source free to use, GUI-based tool for combining, pdf to excel, pdf to word, image to pdf, zip, unzip annotate and splitting PDF files. It is easy to use, supports multiple file insert and delete and process, and allows you to adjust the order of files before combining.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    DownSmith Markdown Editor

    DownSmith Markdown Editor

    A powerful, feature-rich Markdown editor with real-time HTML preview.

    DownSmith provides an intuitive editing experience with comprehensive formatting tools, syntax highlighting, live preview, table creation, spell checking, footnotes, HTML export, and intelligent image handling. Runs without Java being installed on Windows. On macOS and Linux requires Java 11 or better installed. A Java 8 version is provided that has all the functionality of the Java 11 version except footnotes.
    Downloads: 12 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB