Open Source Python Text Processing Software

Browse free open source Python Text Processing Software and projects below.

  • 1
    Scribus

    Powerful desktop publishing software

    Scribus is an Open Source program that brings professional page layout to Linux, BSD UNIX, Solaris, OpenIndiana, GNU/Hurd, Mac OS X, OS/2 Warp 4, eComStation, and Windows desktops with a combination of press-ready output and new approaches to page design. Underneath a modern and user-friendly interface, Scribus supports professional publishing features, such as color separations, CMYK and spot colors, ICC color management, and versatile PDF creation.
    Downloads: 22,239 This Week
  • 2
    Notepad++ Python Script

    A Python Scripting plugin for Notepad++

    A Python Scripting plugin for Notepad++. It gives scripts complete, easy access to all of the editor's features (including absolutely everything in Scintilla). Menus and toolbar options are configurable, and shortcuts can be assigned to scripts.
    Downloads: 8,385 This Week
  • 3
    Utilities for general- and special-purpose documentation. Includes reStructuredText, the easy to read, easy to use, what-you-see-is-what-you-get plaintext markup language.
    Downloads: 194 This Week
  • 4
    Diffuse
    Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.
    Downloads: 164 This Week
  • 5
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application that helps the user merge or split PDF documents and rotate, crop, and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Downloads: 47 This Week
  • 6
    meld-installer

    Meld Installer for Windows

    Bundles Portable Python (with PyGTK) and Meld together in an easy-to-use installer. This spares you from setting up Python or PyGTK yourself and keeps Meld's Python separate from other Python installations on your machine. ** NOTE ** Meld 3.11 and later have official installers, so this project is no longer supported. You can download the new installer here: https://download.gnome.org/binaries/win32/meld/. You should uninstall the old 1.8 version before upgrading.
    Downloads: 42 This Week
  • 7
    tika-python

    Python binding to the Apache Tika™ REST services

    A Python port of the Apache Tika library that makes Tika available using the Tika REST Server. This makes Apache Tika available as a Python library that is easy to install via Setuptools or Pip. To use this library, you need Java 7+ installed on your system, because tika-python starts the Tika REST server in the background. To get this working in a disconnected environment, download a Tika server file (both tika-server.jar and tika-server.jar.md5) and set the TIKA_SERVER_JAR environment variable to TIKA_SERVER_JAR="file:////tika-server.jar". This tells python-tika to "download" the file, move it to /tmp/tika-server.jar, and run it as a background process. This is the only way to run python-tika without internet access. Without this variable set, the default is to check the Tika version and pull the latest from Apache every time.
    Downloads: 1 This Week
  • 8
    DrPython is a highly customizable cross-platform IDE to aid programming in Python. It was developed with teaching in mind and has a clean, simple interface. It is written in Python, using wxPython for the GUI.
    Downloads: 4 This Week
  • 9
    PyRtfLib is a Python library that provides a parser and a few translators, such as RTF to HTML and RTF to plain text.
    Downloads: 13 This Week
  • 10
    Tomoe is a handwriting character recognition engine.
    Downloads: 12 This Week
  • 11
    sPOP stands for Small Python Office Package. The sPOP suite includes an editor, a file manager, a timetable, and other programs. The sPOP programs are small, and you can learn from them.
    Downloads: 4 This Week
  • 12
    xml2txt is a text formatter for XML in the same way that FO is a PDF formatter. It uses Python to convert an XML document to well-formatted text, with borders, indents, and tables.
    Downloads: 5 This Week
  • 13
    PyRTF is a pure Python module for the efficient creation of RTF documents.
    Downloads: 4 This Week
  • 14
    Buzz is a fast graphical editor for XML files with special support for OPML. Using the OPML convergence tools it will edit about any outline and many forms of indented text, including Python. In fact, Buzz was written with Buzz! It is written in P
    Downloads: 3 This Week
  • 15
    The converter automatically performs the full process of converting the files of a C project into the equivalent C++ files. Classes are created, variables and functions become attributes and methods, and the changes are propagated to all files.
    Downloads: 3 This Week
  • 16
    DocLite is a simple documentation authoring system. It produces multi-page HTML output (like this document) in a style similar to that found in the Linux HOWTOs or other DocBook-created documents.
    Downloads: 3 This Week
  • 17
    EpiDoc: Epigraphic Documents in TEI XML

    XML text markup for ancient documents

    The EpiDoc Collaborative is developing specifications and tools for standards-based, digital publication and interchange of scholarly and educational editions of documentary and literary texts like inscriptions and papyri.
    Downloads: 1 This Week
  • 18
    TeXML is an XML vocabulary for TeX. The processor transforms the TeXML markup into the TeX markup, escaping special and out-of-encoding characters. The intended audience is developers who automatically generate [La]TeX or ConTeXt files.
    Downloads: 2 This Week
  • 19

    arCHMage

    A reader and decompiler for files in the CHM format

    arCHMage is a reader and decompiler for files in the CHM format. This is the format used by Microsoft HTML Help, and is also known as Compiled HTML.
    Downloads: 2 This Week
  • 20
    wordaxe (formerly deco-cow): A hyphenation library for Python. It offers several hyphenation algorithms: a pattern-based one from TeX/OOO, and one based on decomposition of compound words for the German language. Includes support for paragraph line-breaking with ReportLab.
    Downloads: 2 This Week
  • 21
    Boarded - on-screen keyboard for X11
    Boarded is a virtual on-screen keyboard for the X11 window system. It is intended for use on mobile devices without a hardware keyboard. It allows you to design your own keyboard layouts, specifying key positioning and grouping and setting your own colors.
    Downloads: 1 This Week
  • 22
    Gyrfalcon is a note / thought / task management system. Take your notes and other bits of information and arrange them in hierarchical trees, tag them, search them, hyperlink them, and so on. Gyrfalcon is also designed with a clean interface that avoids modal interactions.
    Downloads: 1 This Week
  • 23

    Indexmeister

    Automatic indexing for large LaTeX documents

    Indexmeister reads a variety of formats (.tex, .docx, .epub, and others) and suggests keywords for indexing. The included program Imbrowse provides a semi-automatic interface to rapidly add index tags to multi-file LaTeX documents.
    Downloads: 1 This Week
  • 24

    LaTeX Helper

    GUI to help create a LaTeX document

    This GUI lets the user navigate a wide selection of the LaTeX structure and formatting tags used to write a document. It also has a raw text input box with a LaTeX preview for testing out markup. Each markup item has associated help text and is automatically selected, ready to paste into the user's text editor. The GUI also lets the user choose which text editor and supporting programs are used to create their LaTeX document, instead of forcing them into the interface of a predefined IDE.
    Downloads: 1 This Week
  • 25
    Multiplayer Notepad is a collaboration tool that allows multiple users to edit a single document with real-time synchronization between them.
    Downloads: 1 This Week
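
The offline setup described in the tika-python entry above can be sketched in Python. This is a minimal sketch, not the project's own code: the helper name `point_tika_at_local_jar` is hypothetical, and it assumes the variable has to be in place before the `tika` package is imported. It checks that both downloaded files exist, builds a `file://` URL from the jar's absolute path, and stores it in `TIKA_SERVER_JAR`.

```python
import os
from pathlib import Path


def point_tika_at_local_jar(jar_path):
    """Set TIKA_SERVER_JAR to a file:// URL for a locally downloaded jar.

    Hypothetical helper for illustration; not part of tika-python.
    Returns the URL that was stored in the environment.
    """
    jar = Path(jar_path).resolve()
    md5 = Path(str(jar) + ".md5")

    # The entry says both the jar and its .md5 companion must be downloaded.
    for required in (jar, md5):
        if not required.is_file():
            raise FileNotFoundError(f"missing required file: {required}")

    # as_uri() turns an absolute path into a file:// URL.
    url = jar.as_uri()
    os.environ["TIKA_SERVER_JAR"] = url
    return url
```

With the variable set, importing `tika` and calling something like `parser.from_file(...)` should then use the local server jar rather than fetching one. The path passed to the helper is wherever you saved the two files.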