Enhanced version of the standard Unix strings(1) program which uses language models for automatic language identification and character-set identification, supporting over 1400 languages, dozens of character encodings, and 4800+ language/encoding pairs.
Features
- text extraction
- language identification
- character-set identification
Categories
SearchLicense
Creative Commons Attribution Non-Commercial License V2.0, GNU General Public License version 3.0 (GPLv3)Follow Language-Aware String Extractor
Other Useful Business Software
Train ML Models With SQL You Already Know
Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
Rate This Project
Login To Rate This Project
User Reviews
-
Thanks for software and updates.