TEXminer

Text Mining Classification for Texts in ASCII, Unicode and PDF Format.

Brought to you by: gearwheelsoft2

Downloads: 2 This Week

Last Update: 2025-03-25

Get an email when there's a new version of TEXminer

TEXminer uses generic Text Mining Methods to analyze Unicode Files as plain Text or PDF.
The Text Database can be saved in XML where the orginal Text, the Sentence and Word Lists and additional Parameters (e.g. Abbreviations) are stored.
TEXminer allows Language Detection by Letter Frequency Analysis, finding important Words by
Cooccurrence Analysis, Determination of Central Expressions, Thematic Text Classification (also Semantic Groups) Fingerprint Comparison and Word Frequency.
Because TEXminer is not disigned to have a Reference Corpus, Thematic Model Statistics uses Language Models (lexicons) to have Background Knowledge about certain Languages (English, German, French, Spanish, Italian, Russian), which are derived from Decaleon Project.
The Thematic Models for Standard Vocabulary have been extended (spring 2015).
The Thematic Models for Technical Terms have been extended (2015).
The Thematic Models for additional Standard Vocabularies have been extended (2015-2023).

Features

Text Mining for Unicode Files and PDF
Letter Frequency Analysis
Cooccurrence Analysis
Central Expressions
Thematic Model Statistics
Similarity Analysis
Word Frequency Ratio

Project Samples

Cooccurrence Analysis

Thematic Model Analysis

Analysis of Technical Text about electric Battery

Word Frequency Ratio

Project Activity

See All Activity >

Follow TEXminer

TEXminer Web Site

Other Useful Business Software

Go From Idea to Deployed AI App Fast Icon

Go From Idea to Deployed AI App Fast

One platform to build, fine-tune, and deploy. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free

Rate This Project

Login To Rate This Project

User Reviews

Be the first to post a review of TEXminer!

Additional Project Details

Registered

2012-11-20

Report inappropriate content

Recommended Projects

Japanese Text Analysis Tool
Generate frequency and readability reports from Japanese texts.
HanLP
Han Language Processing
WordCount
Count frequency of single, 2-word and 3-word clusters in a text
earthengine-py-notebooks
A collection of 360+ Jupyter Python notebook examples
MarkItDown
Python tool for converting files and office documents to Markdown