Showing 31 open source projects for "docx to xml"

View related business solutions
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 8 Monitoring Tools in One APM. Install in 5 Minutes. Icon
    8 Monitoring Tools in One APM. Install in 5 Minutes.

    Errors, performance, logs, uptime, hosts, anomalies, dashboards, and check-ins. One interface.

    AppSignal works out of the box for Ruby, Elixir, Node.js, Python, and more. 30-day free trial, no credit card required.
    Start Free
  • 1
    carbone

    carbone

    Fast and simple report generator, from JSON to pdf, xslx, docx, odt

    Turn your JSON into PDF, DOCX, XLSX, PPTX, ODS and many more. Fast, Simple and Powerful report generator in any format PDF, DOCX, XLSX, ODT, PPTX, ODS, XML, CSV using templates and your JSON data as input.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    docx2tex

    docx2tex

    Converts Microsoft Word docx to LaTeX

    Converts Microsoft Word's DOCX to LaTeX.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Savvy DOCX Recovery

    Savvy DOCX Recovery

    Open corrupt Word DOCX files and possibly recover formatting too.

    XML was designed from the beginning to be intolerant of errors. This decision adversely affects MS Word's corruption recovery. With one error in the document.xml subfile where all the DOCX file's text is stored, instead of a partial recovery, Word will stop and throw an error. Savvy DOCX Recovery attempts to do precise surgery on corrupt Word documents to reorder or excise bad XML tags.
    Leader badge
    Downloads: 150 This Week
    Last Update:
    See Project
  • 4
    ONLYOFFICE Desktop Editors

    ONLYOFFICE Desktop Editors

    Office suite that combines text, spreadsheet and presentation editors

    ONLYOFFICE Desktop Editors is a free and open-source office suite that combines text, spreadsheet, and presentation editors, allowing users to create, view, and edit documents on Windows, Linux, and macOS platforms.
    Downloads: 25 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    unioffice

    unioffice

    Pure go library for creating and processing Office Word documents

    unioffice is a library for creation of Office Open XML documents (.docx, .xlsx and .pptx). Its goal is to be the most compatible and highest-performance Go library for the creation and editing of docx/xlsx/pptx files. Every release of our libraries is automatically tested against known vulnerabilities and do not pass unless everything is remediated. All changes are carefully reviewed by our team.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    bridgex

    bridgex

    Convert files like docx, xlsx, pptx, html, and more to MarkDown

    ... - Support for multiple input formats. - Lightweight editing prior to saving. Supported Formats 📂 Bridgex supports conversion of the following file formats: - PDF (.pdf) - Word (.docx) - PowerPoint (.pptx) - Excel (.xlsx, .xls, .csv) - Outlook Messages (.msg) - Text (.txt, .text) - Markdown (.md, .markdown) - JSON (.json, .jsonl) - XML (.xml) - RSS/Atom (.rss, .atom) - HTML/MHTML (.html, .htm, .mhtml) - ePub (.epub) - Compressed files (.zip) - Jupyter Notebooks (.ipynb) - Other formats supported by Markitdown Bridgex is not an IDE, text editor, Markdown editor, or document viewer
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7

    DocJGenerator

    Wiki generator and Java Help System

    Allows to generate a wiki (interlinked HTML files) from a bunch of XML formatted files. It also allows to add a Help-system to a Swing or JavaFX application. Also it is also possible to generate a PDF, Word (docx), or epub document rather than a wiki. The tool also provides a visual editor to edit the wiki. The project also support both the Mediawiki and Markdown syntax.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 8
    Swiss File Knife

    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file,...
    Leader badge
    Downloads: 398 This Week
    Last Update:
    See Project
  • 9
    docconv

    docconv

    Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text

    A Go wrapper library to convert PDF, DOC, DOCX, XML, HTML, RTF, ODT, Pages documents and images (see optional dependencies below) to plain text. See go help install for details on the installation location of the installed docd executable. Make sure that the full path to the executable is in your PATH environment variable. To add image support to the docconv library you first need to install and build gosseract.
    Downloads: 2 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    Quiz/Survey/Test - QST

    Quiz/Survey/Test - QST

    A Free, complete, enterprise grade, open source exam management system

    QST, the worlds unparalleled open source online/lan assessment software. From a quick quiz on your phone to very large scale, high stakes, proctored desktop testing, we make it easy/secure/economical. Our intuitive design contains features (Immediate detailed results, Create/Export/Import/Convert Questions, WYSIWYG/Math-Chemistry/Basic Editors, Question/Item Bank, Multiple Question Types, Multiple Delivery Styles, Multiple Delivery/Results Options, Adaptive/Branching Questions, Randomly...
    Leader badge
    Downloads: 45 This Week
    Last Update:
    See Project
  • 12
    Delphi : VRCalc++ and more Binary Exec

    Delphi : VRCalc++ and more Binary Exec

    Delphi Java - VRCalc++ OOSL (Script) and + (Binary Exec Distro)

    Vincent Radio {Adrix.NT} Embarcadero : Delphi : Executable Binaries Delphi : VRCalc++ Object Oriented Scripting Language : Engine + Ext Libraries VRCalc++ OOSL Visual Stage Project : VCL & FMX (FireMonkey) VRCalc++ Script Executor: - VCL Console - Terminal Console - FMX Console + VRCalc++ OOSL : VR System Scripted Standard Runtime Library Delphi Applics - VR Multi Editor : Smart Text Editor - VR Lazy Code Editor : Smart RTF Multi Lang Code Text Editor - VR Astro Vision...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Delphi : VRCalc++ OOSL (Script) and more

    Delphi : VRCalc++ OOSL (Script) and more

    Delphi : VRCalc++ OOSL & + (Paged List, TextEditor, VRAstroVision ...)

    Vincent Radio {Adrix.NT} Sources Library & Applications : Delphi C++ Java VRCalc++ C# VRCalc++ Object Oriented Scripting Language - Engine Source Pascal Code - Delphi Packages Build Prjs - VRCalc++ Scripted System Std RT Library - Guides & Docs (CHM, PDF, DOCX) - VCL & FMX (FireMonkey) Support - Script Test Code (Lang RTL VCL FMX) - Visual Stage Project : VCL & FMX Paged Lists & Iterators : Delphi C++ Java C# Multi-Dim Arrays & Direct Graph Classes : Delphi C++ Java VRCalc++...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14

    cpp_xml_parser

    Simple xml parser for C++

    ...Parser was tested on docx underlying xml file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    XL-Parser

    XL-Parser

    XL-Parser is a tool for data extraction and analysis.

    XL-Parser provides a bunch of functions for data extraction and analysis. It also provides web log analysis features like a tool for detection of suspicious activities. More details and screenshots on http://le-tools.com.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16

    AnyCount 3D

    AnyCount counts characters, words, lines, and pages in 37 file formats

    AnyCount—the most accurate word-count software and the industry standard, compatible with 37 file formats: Microsoft Office: DOC, DOCX, RTF, XLS, XSLX, PPT, PPS, PPTX, PPSX, PUB, VSD, VSDX Images and PDF: GIF, PNG, BMP, JPG, PDF Open Office and Text files: ODT, SXW, SDW, ODS, SXC, SDC, ODP, SXI, SDD, TXT, CSV HTML+, Help and other: HTML, XML, HLP, CHM, WPD, SLP, MIF, ZIP, RAR The tool counts words not only in saved documents but also directly from websites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    MS Word documents are one of the most commonly used file types. Therefore, extorting textual data from the MS Word document (docx file) can be achieved by decompressing the file. MS-Office Forensics is a tool developed so that it allows investigators to automatically decompress (unzip) the investigated MS Word document. Additionally, it automatically parses and analyzes the contents of the resulted XML files and subfolders. The user can extract various parts such as the document's textual data by paragraphs or by textual data blocks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Savvy Repair for Microsoft Office

    Savvy Repair for Microsoft Office

    Repairs/Recovers Corrupt MS Office 2007 - 2013 Format Files

    DOCX, XLSX and PPTX files are collections of conventionally zipped mostly XML sub-files. This program first attempts repair any zip based corruption. From there it engages in four different algorithmic attempts at repair or recovery. With the first method using validation, the location of the first XML error is sought. At that point the sub-files are truncated and repaired.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    Projet sumtec

    Nettoyage et préparation de corpus de transcriptions d'entretiens

    Scripts réalisés dans le cadre du projet SUMTEC pour la préparation des corpus de transcription en vue d'une exploitation sur RQDA et IRAMUTEQ. http://www.msh-lorraine.fr/index.php?id=623 Le projet contient 3 programmes PERL. L'objectif consiste à récupérer des transcriptions d'entretien non structurées afin de les structurer sous la forme d'un arbre xml. L'intérêt consiste à pouvoir, in fine, identifier les tours de parole et séparer les discours des interviewés et des intervieweurs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Corrupt DOCX Salvager

    Corrupt DOCX Salvager

    Extract text from corrupt DOCX files where Word itself fails.

    Previously known as Damaged DOCX2TXT, this GUI program will extract text from damaged/corrupted Word 2007 - 2013 DOCX format documents. DOCX files are actually zipped collections of mostly XML files. The main text in docx files is found in document.xml file in the collection. Corrupt DOCX Salvager uses 7Zip, an unzipper that sometimes unzips partially corrupt document.xml files despite reporting an error. XML as a format is unforgiving of data corruption but Corrupt DOCX Salvager uses a Perl routine used to extract the text from the document.xml file which doesn't care about well-formed XML, a recovery stumbling block for Microsoft Word. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    Corrupt Extractor for Microsoft Office

    Corrupt Extractor for Microsoft Office

    Extracts text/data from corrupt MS Office 2007-13 format files.

    Corrupt Office 2007 Extractor will extract the text/data from corrupt docx, xlsx, and pptx files where the respective MS Office files error out and refuse to open. In advanced mode the program can fix the zip structure of "Office Open XML" format files, a step which I now recommend despite our dissuasive blurb which comes up when you start that function. Advanced mode also allows recovering images and includes is a basic editor for editing the corrupt XML subfiles. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 22

    JDocAndDocxSearch

    Doc ve docx dosyalarında içerik araması yapar.

    Ms Doc (Word Microsoft Office Document) ve Docx (Word Microsoft Office Open XML Format Document) dosyalarında içerik araması yapar. Aranılan metnin bulunduğu dosyalar listelenir, isterseniz bunları açmanız sağlanır. Eng: Makes text search in the doc (docx) file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    j2docx

    j2docx

    JAVA API to write MS Word 2007/2010 docx documents

    J2DOCX-“Java to docx” is a java approach to write the contents programmatically over the Microsoft docx documents and to manipulate the document. This API comprises set of useful funtions to work with docx. The version 2.1 supports writing all sort of of text,images,hyperlink and custom xml structures. About the Developer: This API was created by Vinothkumar P T, A Java Developer, Software Architect with over 10 years of experience in IT industry and currently work for Cognizant technology solutions US Corporation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    DOTX to DOCX Converter

    DOTX to DOCX Converter

    Converts DOTX/DOTM files to DOCX/DOCM files

    DOTX to DOCX Converter converts Office Open XML templates (DOTX/DOTM) to Office Open XML documents (DOCX/DOCM). The program is an effective supplement to the Microsoft Office Compatibility Pack, which cannot convert these files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Moodle Xml Toolkit

    Moodle Xml Toolkit

    Convert from doc to moodle xml.

    Microsoft Office 2010 addon that alowing you to convert glossary from regular doc and docx text to moodle xml. Надстройка Microsoft Office 2010, которое позволяет конвертировать глоссарий из обычного doc-документа в формат moodle xml.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB