Showing 31 open source projects for "xml to docx"

View related business solutions
  • Level Up Your Cyber Defense with External Threat Management Icon
    Level Up Your Cyber Defense with External Threat Management

    See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

    Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.
    Try for Free
  • The All-in-One Commerce Platform for Businesses - Shopify Icon
    The All-in-One Commerce Platform for Businesses - Shopify

    Shopify offers plans for anyone that wants to sell products online and build an ecommerce store, small to mid-sized businesses as well as enterprise

    Shopify is a leading all-in-one commerce platform that enables businesses to start, build, and grow their online and physical stores. It offers tools to create customized websites, manage inventory, process payments, and sell across multiple channels including online, in-person, wholesale, and global markets. The platform includes integrated marketing tools, analytics, and customer engagement features to help merchants reach and retain customers. Shopify supports thousands of third-party apps and offers developer-friendly APIs for custom solutions. With world-class checkout technology, Shopify powers over 150 million high-intent shoppers worldwide. Its reliable, scalable infrastructure ensures fast performance and seamless operations at any business size.
    Learn More
  • 1
    ONLYOFFICE Desktop Editors

    ONLYOFFICE Desktop Editors

    Office suite that combines text, spreadsheet and presentation editors

    ONLYOFFICE Desktop Editors is a free and open-source office suite that combines text, spreadsheet, and presentation editors, allowing users to create, view, and edit documents on Windows, Linux, and macOS platforms.
    Downloads: 24 This Week
    Last Update:
    See Project
  • 2
    docx2tex

    docx2tex

    Converts Microsoft Word docx to LaTeX

    Converts Microsoft Word's DOCX to LaTeX.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    carbone

    carbone

    Fast and simple report generator, from JSON to pdf, xslx, docx, odt

    Turn your JSON into PDF, DOCX, XLSX, PPTX, ODS and many more. Fast, Simple and Powerful report generator in any format PDF, DOCX, XLSX, ODT, PPTX, ODS, XML, CSV using templates and your JSON data as input.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    unioffice

    unioffice

    Pure go library for creating and processing Office Word documents

    unioffice is a library for creation of Office Open XML documents (.docx, .xlsx and .pptx). Its goal is to be the most compatible and highest-performance Go library for the creation and editing of docx/xlsx/pptx files. Every release of our libraries is automatically tested against known vulnerabilities and do not pass unless everything is remediated. All changes are carefully reviewed by our team. Our SDKs libraries are flexible and developed to solve common problems. We provide high-level...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • 5
    docconv

    docconv

    Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text

    A Go wrapper library to convert PDF, DOC, DOCX, XML, HTML, RTF, ODT, Pages documents and images (see optional dependencies below) to plain text. See go help install for details on the installation location of the installed docd executable. Make sure that the full path to the executable is in your PATH environment variable. To add image support to the docconv library you first need to install and build gosseract. Now you can add -tags ocr to any go command when building/fetching/testing docconv...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    Swiss File Knife

    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file,...
    Leader badge
    Downloads: 471 This Week
    Last Update:
    See Project
  • 7
    Quiz/Survey/Test - QST

    Quiz/Survey/Test - QST

    A Free, complete, enterprise grade, open source exam management system

    QST, the worlds unparalleled open source online/lan assessment software. From a quick quiz on your phone to very large scale, high stakes, proctored desktop testing, we make it easy/secure/economical. Our intuitive design contains features (Immediate detailed results, Create/Export/Import/Convert Questions, WYSIWYG/Math-Chemistry/Basic Editors, Question/Item Bank, Multiple Question Types, Multiple Delivery Styles, Multiple Delivery/Results Options, Adaptive/Branching Questions, Randomly...
    Leader badge
    Downloads: 67 This Week
    Last Update:
    See Project
  • 8
    bridgex

    bridgex

    Convert files like docx, xlsx, pptx, html, and more to MarkDown

    ... supports conversion of the following file formats: - PDF (.pdf) - Word (.docx) - PowerPoint (.pptx) - Excel (.xlsx, .xls, .csv) - Outlook Messages (.msg) - Text (.txt, .text) - Markdown (.md, .markdown) - JSON (.json, .jsonl) - XML (.xml) - RSS/Atom (.rss, .atom) - HTML/MHTML (.html, .htm, .mhtml) - ePub (.epub) - Compressed files (.zip) - Jupyter Notebooks (.ipynb) - Other formats supported by Markitdown Bridgex is not an IDE, text editor, Markdown editor, or document viewer
    Downloads: 12 This Week
    Last Update:
    See Project
  • 9
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Leader badge
    Downloads: 9 This Week
    Last Update:
    See Project
  • Simple, Secure Domain Registration Icon
    Simple, Secure Domain Registration

    Get your domain at wholesale price. Cloudflare offers simple, secure registration with no markups, plus free DNS, CDN, and SSL integration.

    Register or renew your domain and pay only what we pay. No markups, hidden fees, or surprise add-ons. Choose from over 400 TLDs (.com, .ai, .dev). Every domain is integrated with Cloudflare's industry-leading DNS, CDN, and free SSL to make your site faster and more secure. Simple, secure, at-cost domain registration.
    Sign up for free
  • 10
    Delphi : VRCalc++ OOSL (Script) and more

    Delphi : VRCalc++ OOSL (Script) and more

    Delphi : VRCalc++ OOSL & + (Paged List, TextEditor, VRAstroVision ...)

    Vincent Radio {Adrix.NT} Sources Library & Applications : Delphi C++ Java VRCalc++ C# VRCalc++ Object Oriented Scripting Language - Engine Source Pascal Code - Delphi Packages Build Prjs - VRCalc++ Scripted System Std RT Library - Guides & Docs (CHM, PDF, DOCX) - VCL & FMX (FireMonkey) Support - Script Test Code (Lang RTL VCL FMX) - Visual Stage Project : VCL & FMX Paged Lists & Iterators : Delphi C++ Java C# Multi-Dim Arrays & Direct Graph Classes : Delphi C++ Java VRCalc++ C...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11

    DocJGenerator

    Wiki generator and Java Help System

    Allows to generate a wiki (interlinked HTML files) from a bunch of XML formatted files. It also allows to add a Help-system to a Swing or JavaFX application. Also it is also possible to generate a PDF, Word (docx), or epub document rather than a wiki. The tool also provides a visual editor to edit the wiki. The project also support both the Mediawiki and Markdown syntax.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Delphi : VRCalc++ and more Binary Exec

    Delphi : VRCalc++ and more Binary Exec

    Delphi Java - VRCalc++ OOSL (Script) and + (Binary Exec Distro)

    Vincent Radio {Adrix.NT} Embarcadero : Delphi : Executable Binaries Delphi : VRCalc++ Object Oriented Scripting Language : Engine + Ext Libraries VRCalc++ OOSL Visual Stage Project : VCL & FMX (FireMonkey) VRCalc++ Script Executor: - VCL Console - Terminal Console - FMX Console + VRCalc++ OOSL : VR System Scripted Standard Runtime Library Delphi Applics - VR Multi Editor : Smart Text Editor - VR Lazy Code Editor : Smart RTF Multi Lang Code Text Editor - VR Astro Vision...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    cpp_xml_parser

    Simple xml parser for C++

    XML Parser is provided. It is very simple and straight forward. It allows to create and parse xml files. In fact it will parse any document which is formed in xml way. That is as far as document is formed using <tag></tag> or <tag/> structure it will parse it. Parser will handle attributes as well. Parser will preserve the order of elements when perform output functionality. Parser was tested on docx underlying xml file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    XL-Parser

    XL-Parser

    XL-Parser is a tool for data extraction and analysis.

    XL-Parser provides a bunch of functions for data extraction and analysis. It also provides web log analysis features like a tool for detection of suspicious activities. More details and screenshots on http://le-tools.com.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 15

    AnyCount 3D

    AnyCount counts characters, words, lines, and pages in 37 file formats

    AnyCount—the most accurate word-count software and the industry standard, compatible with 37 file formats: Microsoft Office: DOC, DOCX, RTF, XLS, XSLX, PPT, PPS, PPTX, PPSX, PUB, VSD, VSDX Images and PDF: GIF, PNG, BMP, JPG, PDF Open Office and Text files: ODT, SXW, SDW, ODS, SXC, SDC, ODP, SXI, SDD, TXT, CSV HTML+, Help and other: HTML, XML, HLP, CHM, WPD, SLP, MIF, ZIP, RAR The tool counts words not only in saved documents but also directly from websites. The 3D version allows users...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Savvy DOCX Recovery

    Savvy DOCX Recovery

    Open corrupt Word DOCX files and possibly recover formatting too.

    XML was designed from the beginning to be intolerant of errors. This decision adversely affects MS Word's corruption recovery. With one error in the document.xml subfile where all the DOCX file's text is stored, instead of a partial recovery, Word will stop and throw an error. Savvy DOCX Recovery attempts to do precise surgery on corrupt Word documents to reorder or excise bad XML tags. If this doesn't work, it uses the command line app xmllint first to attempt to repair corrupt XML...
    Leader badge
    Downloads: 188 This Week
    Last Update:
    See Project
  • 17
    MS Word documents are one of the most commonly used file types. Therefore, extorting textual data from the MS Word document (docx file) can be achieved by decompressing the file. MS-Office Forensics is a tool developed so that it allows investigators to automatically decompress (unzip) the investigated MS Word document. Additionally, it automatically parses and analyzes the contents of the resulted XML files and subfolders. The user can extract various parts such as the document's textual data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Savvy Repair for Microsoft Office

    Savvy Repair for Microsoft Office

    Repairs/Recovers Corrupt MS Office 2007 - 2013 Format Files

    DOCX, XLSX and PPTX files are collections of conventionally zipped mostly XML sub-files. This program first attempts repair any zip based corruption. From there it engages in four different algorithmic attempts at repair or recovery. With the first method using validation, the location of the first XML error is sought. At that point the sub-files are truncated and repaired. The second method works the same way except the XML validation is more lax. With the third, the strict XML validation...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19

    Projet sumtec

    Nettoyage et préparation de corpus de transcriptions d'entretiens

    Scripts réalisés dans le cadre du projet SUMTEC pour la préparation des corpus de transcription en vue d'une exploitation sur RQDA et IRAMUTEQ. http://www.msh-lorraine.fr/index.php?id=623 Le projet contient 3 programmes PERL. L'objectif consiste à récupérer des transcriptions d'entretien non structurées afin de les structurer sous la forme d'un arbre xml. L'intérêt consiste à pouvoir, in fine, identifier les tours de parole et séparer les discours des interviewés et des intervieweurs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Corrupt DOCX Salvager

    Corrupt DOCX Salvager

    Extract text from corrupt DOCX files where Word itself fails.

    Previously known as Damaged DOCX2TXT, this GUI program will extract text from damaged/corrupted Word 2007 - 2013 DOCX format documents. DOCX files are actually zipped collections of mostly XML files. The main text in docx files is found in document.xml file in the collection. Corrupt DOCX Salvager uses 7Zip, an unzipper that sometimes unzips partially corrupt document.xml files despite reporting an error. XML as a format is unforgiving of data corruption but Corrupt DOCX Salvager uses...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Corrupt Extractor for Microsoft Office

    Corrupt Extractor for Microsoft Office

    Extracts text/data from corrupt MS Office 2007-13 format files.

    Corrupt Office 2007 Extractor will extract the text/data from corrupt docx, xlsx, and pptx files where the respective MS Office files error out and refuse to open. In advanced mode the program can fix the zip structure of "Office Open XML" format files, a step which I now recommend despite our dissuasive blurb which comes up when you start that function. Advanced mode also allows recovering images and includes is a basic editor for editing the corrupt XML subfiles. Additionally I...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22

    JDocAndDocxSearch

    Doc ve docx dosyalarında içerik araması yapar.

    Ms Doc (Word Microsoft Office Document) ve Docx (Word Microsoft Office Open XML Format Document) dosyalarında içerik araması yapar. Aranılan metnin bulunduğu dosyalar listelenir, isterseniz bunları açmanız sağlanır. Eng: Makes text search in the doc (docx) file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    j2docx

    j2docx

    JAVA API to write MS Word 2007/2010 docx documents

    J2DOCX-“Java to docx” is a java approach to write the contents programmatically over the Microsoft docx documents and to manipulate the document. This API comprises set of useful funtions to work with docx. The version 2.1 supports writing all sort of of text,images,hyperlink and custom xml structures. About the Developer: This API was created by Vinothkumar P T, A Java Developer, Software Architect with over 10 years of experience in IT industry and currently work for Cognizant technology...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    DOTX to DOCX Converter

    DOTX to DOCX Converter

    Converts DOTX/DOTM files to DOCX/DOCM files

    DOTX to DOCX Converter converts Office Open XML templates (DOTX/DOTM) to Office Open XML documents (DOCX/DOCM). The program is an effective supplement to the Microsoft Office Compatibility Pack, which cannot convert these files.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25
    Moodle Xml Toolkit

    Moodle Xml Toolkit

    Convert from doc to moodle xml.

    Microsoft Office 2010 addon that alowing you to convert glossary from regular doc and docx text to moodle xml. Надстройка Microsoft Office 2010, которое позволяет конвертировать глоссарий из обычного doc-документа в формат moodle xml.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.