Open Source Windows Linguistics Software - Page 4

Linguistics Software for Windows

View 1216 business solutions
  • Simplify IT and security with a single endpoint management platform Icon
    Simplify IT and security with a single endpoint management platform

    Automate the hardest parts of IT

    NinjaOne automates the hardest parts of IT, delivering visibility, security, and control over all endpoints for more than 20,000 customers. The NinjaOne automated endpoint management platform is proven to increase productivity, reduce security risk, and lower costs for IT teams and managed service providers. The company seamlessly integrates with a wide range of IT and security technologies. NinjaOne is obsessed with customer success and provides free and unlimited onboarding, training, and support.
    Learn More
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Google Translate
    This Program helps to translate words and text via Google Translate.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Al-Mintiq: Arabic eSpeak

    Al-Mintiq: Arabic eSpeak

    Arabic voice files for eSpeak system

    Arabic files and voices for eSpeak Text to speech system, المنطيق : ملفات اللغة العربية لبرنامج توليد الكلام من النص إسبيك
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3

    Automatic Compound Processing (AuCoPro)

    Automatic compound splitting and semantic analysis of compounds

    The central problem to be addressed in this project concerns a multidisciplinary (linguistics and computational linguistics) investigation into sharing of knowledge and resources between closely-related languages, specifically relating to the automatic processing of compounds. Specifically, we will explore the possibility to create new knowledge about closely-related languages, and efficiently develop additional, more advanced resources for (a) compound segmentation; and (b) the semantic analysis of compounds; as such, the project will be divided into two interrelated subprojects, to be executed simultaneously. The focus in this project will be on Afrikaans (with Dutch as the closely-related, well-sourced language), which will lay grounds for future work on other closely-related language pairs.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Align parallel data at sentence level and also automatic creation of .tmx files for use with Autshumato ITE
    Downloads: 2 This Week
    Last Update:
    See Project
  • MongoDB Atlas | Run databases anywhere Icon
    MongoDB Atlas | Run databases anywhere

    Ensure the availability of your data with coverage across AWS, Azure, and GCP on MongoDB Atlas—the multi-cloud database for every enterprise.

    MongoDB Atlas allows you to build and run modern applications across 125+ cloud regions, spanning AWS, Azure, and Google Cloud. Its multi-cloud clusters enable seamless data distribution and automated failover between cloud providers, ensuring high availability and flexibility without added complexity.
    Learn More
  • 5
    Redundancy due to cut-paste operations in text creates bias in machine learning for NLP. This module takes a directory and produces a subset of the files in that directory (in a list) with an upper bound on similarity between two files.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Entity recognition and normalization software for biomedical text
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7

    PADIC

    A multilingual Parallel Arabic DIalectal Corpus

    PADIC (Parallel Arabic DIalectal Corpus) is a multi-dialectal corpus built in the framework of the National Research Project "TORJMAN", led by Scientific and Technical Research Center for the Development of Arabic Language and funded by the Algerian Ministry of Higher Education and Scientific Research. PADIC is composed of 6 dialects: two Algerian dialects (Algiers and Annaba cities), Palestinian, Syrian, Tunisian, Moroccan) and MSA. Mourad Abbas Computational Linguistics Department, crstdla https://sites.google.com/site/mouradabbas9 Publications ----------------- K. Meftouh, S. Harrat, S. Jamoussi, M. Abbas, K. Smaïli, Machine Translation Experiments on PADIC: A Parallel Arabic DIalect Corpus, The 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015, Shanghai, 2015. TORJMAN website: ------------------------- https://sites.google.com/site/torjmanepnr/6-corpus
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    PronunDict_French

    PronunDict_French

    a pronunciation dictionary of French

    PronunDict for French is both a reverse phonetic dictionary (searching by pronunciation) and a standard one to search by spelling. Pronunciation is transcribed with IPA symbols. It runs on Windows, and should also work with Wine on Linux and macOS. Acknowledgement: This app uses one external French dictionary (bundled with it) by Universite du Maine, France. Warning: There might be some errors in the dictionary entries. You can follow PronunDict on Twitter. https://twitter.com/PronunDict
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    A lyrical analysis and classification tool focused specifically on rhyming style in rap lyrics. Functions include phonetic transcription, rhyme visualization, and rapper classification.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Build apps or websites quickly on a fully managed platform Icon
    Build apps or websites quickly on a fully managed platform

    Get two million requests free per month.

    Run frontend and backend services, batch jobs, host LLMs, and queue processing workloads without the need to manage infrastructure.
    Try it for free
  • 10
    A windows form application to convert Bangla accented English text files into unicode Bangla text files. •Development Tools: Visual C#,MS Dot(.) Net. FrameWork 2.0.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11

    The Magda language

    Magda language resource site

    Magda is a programming language introduced in Jarek Kusmierek's PhD (http://www.mimuw.edu.pl/~jdk/mixiny.pdf) and continued in Mauro Mulatero's thesis (http://www.tesionline.it/default/tesi.asp?idt=45612). Magda's goal is to allow a programmer to write well-modularized, reusable code. Magda is based upon the core notion of mixin as the only unit of reuse. The power of modularization of mixins is enhanced by two unique features. The first feature is the modularization of constructors. In Magda, many mixins with independent definitions of constructors can be combined without the need to copy any code and without the risk of clashes. The second distinctive feature modifies the way declarations of new methods, overriding methods, and method calls are done, in order to rule out accidental name clashes, even with respect to future modifications of the code. Samples written in Magda language, together with an IDE (by Marco Naddeo), are available in the download.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12

    Aelius Brazilian Portuguese POS-Tagger

    Python, NLTK-based package for shallow parsing of Brazilian Portuguese

    Aelius is an ongoing open source project aiming at developing a suite of Python, NLTK-based modules and interfaces to external freely available tools for shallow parsing of Brazilian Portuguese. It also includes language resources such as language models, sample texts, and gold standards. Presently, Aelius already offers facilities for POS-tagging and chunking corpora and outputting annotations in different formats, such as in XML in the TEI P5 encoding scheme.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    CORPSE (CORPus SEarch) is a powerful search engine written in Java. The aim is to provide an efficient implementation of a word level inverted index search with various cool functions that can be used on very large corpora.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Dictionary Additions Management System (DAMS), a collection of open source translation dictionaries. These files are compatible with the Open Translation Engine (OTE). For more info, see http://sourceforge.net/projects/ote/
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Dutch sentiment analysis engine

    Dutch sentiment analysis engine

    Een module om de sentiment van een stuk Nederlandse tekst to bepalen

    This application was developed by Incentro to satisfy requests by clients for a sentiment analyser for the Dutch language. It is currently in it's alpha stage and we expect to have a beta release by November 2012. If you would like to help with the development or testing of this product please contact us at +31[0]15 76 40 750 - of info {at} incentro.com. Deze applicatie is ontwikkeld door Incentro om te voldoen aan klantaanvragen voor een sentimentanalyse module voor de Nederlandse taal. Momenteel is de module in alpha versie beschikbaar en een beta versie wordt verwacht in november 2012. Als u ons wilt helpen bij het ontwikkelen of testen van deze module, neem dan contact op met Incentro via +31[0]15 76 40 750 - of info {at} incentro.com.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    ELIA(Eyegaze Language Integration Analysis) supports the analysis of eye-tracking data for studies in language processing. ELIA eases early analysis of data to enable iterative development of experiments in response to spoken language.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Fast Unsupervised Sentence Aligner described in "Improved unsupervised sentence alignment for symmetrical and asymmetrical parallel corpora", COLING 2010. NEWS : release 1.0b : bug fixed (release1.0a deprecated).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    IceNLP is an open source Natural Language Processing (NLP) toolkit for analyzing and processing Icelandic text. The toolkit is implemented in Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19

    JAVA Arabic Stemmer

    A JAVA class with a small functionality that is stemming Arabic words

    A JAVA Arabic stemmer that is based on Shereen Khoja algorithm. This java class offers a function called stemWrod which takes an arabic word and return the stem of it.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Kurdi Nus
    A portable, platform-independent, open source tool for converting different Kurdish scripts
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Mani
    Coptic - English and Coptic - Czech dictionary related to Crum's coptic dictionary, written in C++, based on MySql, with Qt GUI. Is developed as part of project Marcion, containing only coptic data without study environment.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22

    Mansour

    Arabic text analyzer

    Mansour is a simple application for analyzing digital text written in Arabic.منصور هو تطبيق مكتبي بسيط لتحليل النصوص الرقمية المكتوبة باللغة العربية.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    A unique natural-language processing software, called Discovery, created on the CA Visual Objects/Vulcan.NET environment, which also has potential for effective "shallow approach" machine translation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    The Open Translation Engine (OTE) is a web-based translation dictionary manager. The OTE allows a community of users to create and manage one or many translation dictionaries. The OTE is written in PHP and uses a MySQL database.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    SuperAlign was fully updated as of 15 July 2013 and is now released under the name eAlign as well. A parallel corpora (bitext) aligning tool. Create TMX databases and align translations for Translation Memory databases. Use multiple files in multiple formats to align them with their translations. The full workflow is built in with a GUI interface. SuperAlign-eAlign uses the hunalign algorithm.
    Downloads: 1 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.