Linguistics Software

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    HanNanum - Korean POS Tagger
    HanNanum is a Korean Morphological Analyzer and POS Tagger. A plug-in component-based architecture is adapted to the new Java version for flexible use. You can find the work flow for morphological analysis, POS tagging, noun extraction, etc. Contact: kschoi@kaist.ac.kr hjjeong@world.kaist.ac.kr
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    Hebrew Deflector

    A proram to de-inflect modern Hebrew words

    Hebrew Deflector tries to guess the root, the pattern and the form of a modern Hebrew word provided by the user. It uses the existing rules of the language to do that, and displays the list of possible answers. It is not a dictionary and it doesn't know whether the word (and the listed forms of it) exists or not. It also doesn't know anything about exception to the rules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    HermeneutiX

    HermeneutiX

    Your graphical tool for Syntactic/Semantic Structure Analysis of texts

    HermeneutiX is a tool for diagramming syntactic and semantic structures of complex (not necessarily foreign-language) texts (e.g. bible or other historical excerpts). HermeneutiX is now part of SciToS (the scientific tool set). Starting with version 2.0.0, HermeneutiX can be found on GitHub. Please check out the release summary: https://github.com/scientific-tool-set/scitos/releases For an introduction, check out this video: https://youtu.be/uQjewyG0Ad8 PS: To run a Java application such as HermeneutiX (i.e. SciToS) you need a Java Runtime Environment (JRE). HermeneutiX is currently built to be compatible down to JRE version 6. You may download the current JRE here: http://www.java.com/en/download
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    Hermes Natural Language Processing

    A repository of software, documentation and data for NLP

    Hermes is a repository of software, documentation and data for NLP. I am currently adding corpora extracted from Wikipedia (mostrly in Romance languages).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5

    ISO GrAF

    Experimental Java library for reading and writing GrAF/XML files.

    The Graph Annotation Framework (GrAF) models linguistic annotations using a data model based on Graph theory and algorithms. The GrAF standard is a work product of ISO TC37SC4 Working Group 1. This Java library is NOT part of the GrAF standard and standoff annotation files produced by the library may not be GrAF compliant.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    The "ITALIAN Translations" is a project dedicated to the translation of any kind (Open Source, Creative Commons, Freeware) application, documentation, tutorials, etc, in the Italian Language. Anyone can request translations in the Italian Language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    InGen is a java-based tool that automatically extracts keywords from a given LaTeX-Document and creates an index for those keywords.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    JAVA Arabic Stemmer

    A JAVA class with a small functionality that is stemming Arabic words

    A JAVA Arabic stemmer that is based on Shereen Khoja algorithm. This java class offers a function called stemWrod which takes an arabic word and return the stem of it.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    JInsect
    The JINSECT toolkit is a Java-based toolkit and library that supports and demonstrates the use of n-gram graphs within Natural Language Processing applications, ranging from summarization and summary evaluation to text classification and indexing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • 10
    KAF2Tiger2 is a KAF (KYOTO annotation format) to <tiger2/> (Tiger2 XML) converter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Kurdi Nus
    A portable, platform-independent, open source tool for converting different Kurdish scripts
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Language Constructor

    Complete tool for constructing/manipulating languages in digital form

    With this tool you can easily design a new language, digitize an existing one or incrementally reconstruct an ancient language. It allows for free experimentation of all aspects of the language, so it does not have to be made consistent on paper first. You can edit script, syntax, grammar, morphology, lexicon and phonology, as well as write documents in the language, as it might be too complex to be handled by current font technology. The information is stored in xml format for easy integration with other software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    LanguageTool

    Proofreading Software for 20+ Languages

    LanguageTool is an Open Source language/grammar checker. *** THIS REPOSITORY IS OUT OF DATE, see https://github.com/languagetool-org INSTEAD ***
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A system to perform analysis of large documents for the purpose of cataloging similar documents. Similarity is based upon contextual analysis of these documents done by identifying common words and proper nouns.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Leseratte is a Java parser for German written language. Currently, it contains a German lexicon (based on the Wiktionary), inflexion rules, a grammar and a parser. (Semantics component planned.) Usable as a Java library, also provides a graphical UI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    LinqYedict

    LinqYedict

    Translate Chinese to English

    Translate Chinese to English using CEDICT (cantonese dictionary). Demonstrate the speed of C# and Linq. Copy the chinese text from any browser/application to Windows clipboard and see the translation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Live Transcribe Speech Engine

    Live Transcribe Speech Engine

    Live Transcribe is an Android application

    Live Transcribe Speech Engine provides on-device speech recognition components that power real-time transcription for accessibility and everyday voice-first experiences. Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models. Partial hypotheses stream as words are recognized, then stabilize with minimal jitter as confidence increases, which is crucial for usability. The code emphasizes efficient use of CPU and neural accelerators to balance battery life with responsiveness. Deployed in accessibility contexts, it aims for dependable behavior across accents, environments, and intermittent connectivity, with graceful degradation when resources are constrained.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Lyreword
    Lyreword is a flexible word generator for writers, role players, conlangers and everybody who seeks some inspiration for inventing words and names.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Maskouk : Arabic Collocations
    Maskouk: Arabic Collocations Dictionary المسكوكات اللفظية العربيو، المتلازمات المتواردات
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Maya Hieroglyphs Fonts and Input Methods
    The aim of this project is to develop the basics for the usage of Maya Hieroglyphs on computers. There is quite a lot of stuff to do: Developing encodings for Unicode, fonts and input methods.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A collection of Metasyntaxes like EBNF for .Net including a definition file parser and an expression tree.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    MinGen is a Minimalist generator, the logical opposite to a parser. MinGen generates syntactically valid sentences by following the rules of Minimalism.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    ATTENTION! Morfologik is now at GitHub: https://github.com/morfologik/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Nekoshka is a cross-platform open-source shell for Japanese dictionaries like edict and yarxi. It supports radical lookup, handwriting and direct keyboard input.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB