Open Source Linux Text Processing Software - Page 3

Text Processing Software for Linux

View 9 business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • Stay in Flow. Let Zenflow Handle the Heavy Lifting. Icon
    Stay in Flow. Let Zenflow Handle the Heavy Lifting.

    Your AI engineering control center. Zenflow turns specs into shipped features using parallel agents and multi-repo intelligence.

    Zenflow is your engineering control center, turning specs into shipped features. Parallel agents handle coding, testing, and refactoring with real repo context. Multi-agent workflows remove bottlenecks and automate routine work so developers stay focused and in flow.
    Try free now
  • 1

    EncTool

    Command line tool to detect and convert files encoding.

    Command line tool to detect and convert files encoding. Works with files or directories. Can be used to add or remove UTF-8 BOM. Multi-platform. EncTool requires Java 1.5 or highter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Prototype for a framework and user interface for combining various structured search and document clustering techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Fen2eps is a small console program that converts FEN (Forsyth Edwards Notation) strings to EPS (Encapsulated Postscript) files containing the chess board diagram.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    FigTeX manages images and their easy inclusion in LaTeX documents. Similar to BibTex, the image information is stored in an external file and is imported into the document as needed. It comes with a comfortable GUI for managing the image library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Auth0 for AI Agents now in GA Icon
    Auth0 for AI Agents now in GA

    Ready to implement AI with confidence (without sacrificing security)?

    Connect your AI agents to apps and data more securely, give users control over the actions AI agents can perform and the data they can access, and enable human confirmation for critical agent actions.
    Start building today
  • 5
    Simple Java delimited and fixed width file parser. Handles CSV, Excel CSV, Tab, Pipe delimiters, just to name a few. Maps column positions in the file to user friendly names via XML. See "FlatPack Feature List" under News for complete feature list.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Java library for reading and writing of flat files. CSV, FLR (fixed length record) or mixed structures. Tree-style processing API. Adapters for SAX, Stax and XStream for transformation, data binding or serialization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    GOSH takes an Usenet-styled ASCII text as input and generates Latex-code, HTML, or Man-pages. Its transformation is easily customizable, which makes GOSH suited for creating web content, papers, letters, and technical documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    HTML Header Tree is a Mediawiki extension which organizes a page with a container tree according to its headers. Containers with specific headers are given specific styles.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Strip out useless tags and other junk from HTML files. Shrink files, enhance readability of HTML source, promote privacy, and clean HTML exported from Microsoft Word (MS-Word). Run HTMLStrip as-is or customize it with your own regular expressions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Your Live Streaming Platform Icon
    Build Your Live Streaming Platform

    Scalable, Ultra Low Latency and Adaptive WebRTC Streaming

    Ant Media Server is a streaming engine software that provides adaptive, ultra low latency streaming by using WebRTC technology with ~0.5 seconds latency.
    Learn More
  • 10
    HTMLtools includes several Java HTML tools for preparing Web pages. The HTMLtools program automates batch conversion of tab-delimited spreadsheet text files to HTML Web-page files, file & table editing, keyword mapping, templates, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Html Optimizer is an optimizer for optimize html files by shrink them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    JFileReader allows users to view the contents of any file in a bytewise manner using a simple GUI interface. Functions include byte view changes to different primitive datatypes, endianess changes, and file content exports to new files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    JLoom is a JSP like template language for text generation - e.g. source code, HTML, XML. JLoom templates are modular encapsulated. Parameters can be any Java type, even Generics or Varargs. There is a plugin for Eclipse and a command line tool.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A JavaScript library for parsing Creole 1.0 wiki markup.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    KefirBB

    The Java text processor.

    Project was moved to https://github.com/kefirfromperm/kefirbb
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    LaTeX Reference Card Creator

    LaTeX Reference Card Creator

    A Makefile based build system for creating LaTeX reference cards

    LaTeX Reference Card Creator is a Makefile based build system for creating reference cards. LaTeX Reference Card Creator compiles content into PDF, DjVu, TEX DVI, HTML and PostScript output formats. A three column reference card will be created. Features include batch image format conversions, spell checking, broken link checking, automatic backups and .zip and .tar.gz distribution building. LaTeX Reference Card Creator provides many LaTeX examples which can be used to make a reference card.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    LaTeX Web Publisher

    LaTeX Web Publisher

    LaTeX Web Publisher is a Makefile based Web publishing system

    LaTeX Web Publisher is a Makefile based Web publishing system featuring content creation into HTML, non-split HTML, HTML Zip, PDF, DjVu, PostScript, DVI and Plain text formats. All LaTeX Web Publisher output formats are from a single LaTeX source and have indices. LaTeX Web Publisher can be used for website creation and has FTP deployment capabilities. A website created with LaTeX Web Publisher will have HTML, non-split HTML and PDF content formats. The website will have complete HTML contents available as downloadible zip file and the websites source available as gzipped tar file. A 'configure' file is provided to help compile the gzipped tar file sources. Other features include spell checking, broken link checking, diction and style analysis on LaTeX sources. LaTeX Web Publisher is a wrapper for the programs latex, makeindex, latex2html, htlatex, dvips, dvi2ps, ps2pdf, pdflatex, pdf2djvu, dvi2tty, dvipdfm, epstopdf, convert, aspell, diction, style and linkchecker.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    LiMa means Lightweight Markup Language. It is a parser for an easy to use ASCII/Text-based markup - comparable to Markdown or the Wikipedia-Markup language with special configurable extensions in defining Links and image-resources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    NOTE: unsupported - do you want to maintain this project? contact me! Markdownify is a HTML to Markdown converter written in PHP. See it as the successor to `html2text.php` since it has better design, better performance and less corner cases.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Markout is a pure-Java lightweight wiki markup parser based on John Gruber's Markdown.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Mediawiki-PDF is a mediawiki extension to convert wiki articles into PDF Documents. The extension uses HTMLDOC to convert the wiki pages from plain HTML into PDF.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    NRtfTree library is a set of classes written entirely in C# which may be used to manage (read and write) RTF documents in your own applications. A java port of the library can be found in http://www.sgoliver.net/blog/?page_id=92
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    An extensible tool to validate Open Office Text documents for accessability standards.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    OpenNabu is a standalone personal wiki, featuring fast and simple content creation combined with highly customizable HTML export. All information is stored in plain text files and no additional server or database is needed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    PDML is an informal markup language written in PHP that is similar to HTML. It allows for the creation of complex PDF documents and can also be used in conjunction with PHP, to define templates which can generate dynamic PDF documents.
    Downloads: 0 This Week
    Last Update:
    See Project