Open Source Windows Linguistics Software - Page 5

Sort By:

Linguistics Software for Windows

View 34 business solutions

Linguistics Windows Clear Filters

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
1

CTexT Alignment Interface Pro

Align parallel data at sentence level and also automatic creation of .tmx files for use with Autshumato ITE

Downloads: 0 This Week

Last Update: 2015-02-24
See Project
2

Chaski

Distributed phrase-based machine translation training tool based on Hadoop.

Downloads: 0 This Week

Last Update: 2013-04-26
See Project
3

CoSyne Integrated Prototype

Multilingual Content Synchronization with Wikis: CoSyne is a Research and Technological Development project co-funded by the European Union. Details: http://cosyne.eu

Downloads: 0 This Week

Last Update: 2013-04-29
See Project
4

Colloquium QDA

A free and open source qualitative ethnographic interview coding tool.

Colloquium QDA is a tool for custom coding and analyzing qualitative ethnographic interviews. To run, make sure you first have JRE 8 or later installed (http://www.oracle.com/technetwork/java/javase/downloads/). Colloquium QDA is an open source cross-platform Java Swing app utilizing an embedded Java DB with Lucene integrated search.

Downloads: 0 This Week

Last Update: 2017-01-23
See Project
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
5

Communication Supporting System

Downloads: 0 This Week

Last Update: 2015-03-26
See Project
6

Communication Supporting System

Downloads: 0 This Week

Last Update: 2013-05-29
See Project
7

CompE Toolkit

Data Type Converter

CompE Toolkit allows the user to seamlessly convert between binary, decimal, hexadecimal, and 32-bit floating point representation. It uses a simple, user-friendly interface designed for maximum efficiency and minimal clutter.

Downloads: 0 This Week

Last Update: 2015-02-16
See Project
8

Comparing String

Practicing C's built-in string.h functions

This programme simpl takes two string (Like two tweets) and compare them and adds a hashtag to end of them if user desired to. Very simple used built-in C string functions.

Downloads: 0 This Week

Last Update: 2025-11-17
See Project
9

ConTextKit

ConTextKit is a Java-based implementation of Wendy Chapman's ConText algorithm for annotating the context of medical documents, specifically the negation, temporality, and experiencer.

Downloads: 0 This Week

Last Update: 2014-06-24
See Project
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

CoocViewer

Viewer for co-occurrences and positional co-occurrences

A Demo is available at: http://coocviewer.sourceforge.net/coocviewer/index.php

Downloads: 0 This Week

Last Update: 2013-11-08
See Project
11

CorpSe

CORPSE (CORPus SEarch) is a powerful search engine written in Java. The aim is to provide an efficient implementation of a word level inverted index search with various cool functions that can be used on very large corpora.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-26
See Project
12

Corpus redundancy manager

Redundancy due to cut-paste operations in text creates bias in machine learning for NLP. This module takes a directory and produces a subset of the files in that directory (in a list) with an upper bound on similarity between two files.

Downloads: 0 This Week

Last Update: 2014-06-30
See Project
13

Cross-Language Computational Linguistics

cross-languages resources

AFEWC corpus is a multilingual comparable text articles in Arabic, French, and English languages. Each triple article is related to the same topic (aligned at article level). AFEWC corpus is collected from Wikipedia. The corpus is available for free for research purposes only. It is composed of 40K aligned articles, 91.3M English words, 57.8M French words, 22M Arabic words, 2.8M English unique words, 1.9M French unique words, and 1.5M Arabic unique words. Wikipedia text is available under Creative Commons Attribution-ShareAlike 3.0 License. https://en.wikipedia.org/wiki/Wikipedia:About To cite the corpora: M. Saad, D. Langlois, and K. Smaïli. Extracting Comparable Articles from Wikipedia and Measuring their Comparabilities. Procedia - Social and Behavioral Sciences, 95(0):40 – 47, 2013. ISSN 1877-0428.

Downloads: 0 This Week

Last Update: 2015-09-11
See Project
14

Cunei Machine Translation Platform

Cunei is a data-driven machine translation system that builds dynamic, statistical models based on instances of known translations found in a corpus.

1 Review

Downloads: 0 This Week

Last Update: 2013-06-05
See Project
15

DArtikel!

Learn the articles of German words.

Learn words in german that you know at your own pace. With this system you can add the words you knew in a day and then do exercises with them. Written by: Jovanny Pablo Cruz Gómez. Computer Engineering Student. IPN, ESIME Culhuacan, Mexico City.

Downloads: 0 This Week

Last Update: 2013-11-07
See Project
16

DCTFinder

Extract title and creation time from web page.

Web pages do not offer reliable metadata concerning their creation date and time. However, getting the document creation time is a necessary step for allowing to apply temporal normalization systems to web pages. DCTFinder is a system that parses a web page and extracts from its content the title and the creation date of this web page. DCTFinder combines heuristic title detection, supervised learning with Conditional Random Fields (CRFs) for document date extraction, and rule-based creation time recognition. DCTFinder is released under CeCILL free software license agreement. The system is described in the following paper (see 'Files' section): Xavier Tannier. "Extracting News Web Page Creation Time with DCTFinder". Proceedings of the 9th Language Resources and Evaluation Conference. Reykjavik, Iceland.

Downloads: 0 This Week

Last Update: 2016-10-21
See Project
17

DawNLITE

DawNLITE is a Natural-Language-based Image Transmoding Engine. The software transforms an image to a video as recorded by a virtual camera panning and zooming over the image, following a natural language text description of the image.

Downloads: 0 This Week

Last Update: 2013-04-18
See Project
18

Dendrarium

System do pielęgnacji składnikowych drzew składniowych

Dendrarium służy do wybierania i weryfikacji składnikowych drzew składniowych generowanych przez parser Świgra. System jest użytkowany w Instytucie Podstaw Informatyki PAN do tworzenia banku drzew składniowych dla języka polskiego Składnica.

Downloads: 0 This Week

Last Update: 2014-02-18
See Project
19

Dex to Wiktionary

Linguistic analysis

Scopul proiectului este conversia definițiilor din DEX '98, luate de pe dexonline.ro, din text simplu în formatul structurat folosite de Wiktionary. Programul convertește definițiile în 3 pași: - descarcă și extrage definițiile de pe dexonline.ro - parsează definițiile și le împarte în părți mai mici care sunt clasificate - asamblează bucățile într-un format arborescent, pe care apoi îl parsează și generează formatul cerut de Wiktionary

Downloads: 0 This Week

Last Update: 2015-10-10
See Project
20

Dictionary Additions Management System

Dictionary Additions Management System (DAMS), a collection of open source translation dictionaries. These files are compatible with the Open Translation Engine (OTE). For more info, see http://sourceforge.net/projects/ote/

1 Review

Downloads: 0 This Week

Last Update: 2013-04-17
See Project
21

DisMo

A POS, disfluency and multi-word unit annotator for spoken language

DisMo is a part-of-speech, disfluency and multi-word unit automatic annotator. It is designed to manage the complexities and phenomena specific to spoken language. It currently supports English and French, with support for more languages coming soon. It is developed and maintained by George Christodoulides (Centre Valibel, IL&C, University of Louvain, Louvain-la-Neuve, Belgium). Visit www.corpusannotation.org to find out more about DisMo and other annotation tools for language corpora. If you are using DisMo to annotate your corpus, please cite the following paper: Christodoulides, George; Avanzi, Mathieu; Goldman, Jean-Philippe. DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC) 2014, Reykjavik, Iceland, 26-31 May 2014, pp. 3902-3907.

Downloads: 0 This Week

Last Update: 2014-10-23
See Project
22

Discriminative Language Editor

Discriminative language editor based on ontologies

Text editor in Java that is able to detect discriminative expressions while the user is typing. When the internal ontology-based analyzer detects a potential discriminative expression the user is advised by underscoring the related words in the text. A descriptive message about the issue is also shown to the user when the cursor is placed over the potential discriminative expression.

Downloads: 0 This Week

Last Update: 2016-10-30
See Project
23

Donatus Parsing Tools for Portuguese

Donatus is an on-going project consisting of Python, NLTK-based tools and grammars for deep parsing and syntactical annotation of Brazilian Portuguese corpora. It includes a user-friendly graphical user interface for building syntactic parsers with the NLTK, providing some additional functionalities.

Downloads: 0 This Week

Last Update: 2016-08-28
See Project
24

Drug Extraction

Drug name extraction

Drug name recognition and normalisation/grounding to DrugBank ids and standard names. Package provides 2 taggers: 1. DrugTagger - CRF-based with DrugBank presence feature (see feature set for details). 2. DrugnameGazetteer - gazetteer/dictionary-based. Dictionary created from DrugBank.ca database. Both taggers include grounding/normalisation to DrugBank ids and standard names. Feature set: Word, Word-1, Word+1, Word-1_Word, Word_Word+1, DrugBankPresence, POS DrugBankPresence feature indicates the presence of the drug name in the DrugBank. Using CONLL-Evaluation: processed 32065 tokens with 3656 phrases; found: 3251 phrases; correct: 2786. accuracy: 95.25%; precision: 85.70%; recall: 76.20%; FB1: 80.67 Using GATE Corpus Benchmark: Strict: P: 0.65 R: 0.73 F1: 0.69 Lenient: P: 0.74 R: 0.84 F1: 0.78 The details of how to reproduce evaluation, see README. To use standalone version for tagging download DrugExtractionStandalone.tar.gz from Files.

Downloads: 0 This Week

Last Update: 2015-06-12
See Project
25

Dualword-PMC

PMC browser

PubMed Central browser. Source code: http://github.com/dualword/dualword-pmc/

Downloads: 0 This Week

Last Update: 2021-11-08
See Project