Showing 46 open source projects for "word frequency"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    WordCount

    WordCount

    Count frequency of single, 2-word and 3-word clusters in a text

    The program can read a text file and count the occurrences of single words and clusters of 2 and 3 words. The resulting list will be sorted in descending order (highest frequency on top).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    TEXminer

    TEXminer

    Text Mining Classification for Texts in ASCII, Unicode and PDF Format.

    TEXminer uses generic Text Mining Methods to analyze Unicode Files as plain Text or PDF. The Text Database can be saved in XML where the orginal Text, the Sentence and Word Lists and additional Parameters (e.g. Abbreviations) are stored. TEXminer allows Language Detection by Letter Frequency Analysis, finding important Words by Cooccurrence Analysis, Determination of Central Expressions, Thematic Text Classification (also Semantic Groups) Fingerprint Comparison and Word Frequency. Because TEXminer is not disigned to have a Reference Corpus, Thematic Model Statistics uses Language Models (lexicons) to have Background Knowledge about certain Languages (English, German, French, Spanish, Italian, Russian), which are derived from Decaleon Project. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Kindle Mate(KMate)

    Kindle Mate(KMate)

    Kindle clippings and Kindle Vocabulary Builder manager

    KMate is the ultimate reading companion for Kindle users — and the all-new, cross-platform successor to Kindle Mate, the classic Kindle notes manager trusted by readers worldwide for over a decade. It is the only Kindle assistant that unifies cross-device import, cloud sync, vocabulary & dictionary management, flexible export, reading analytics, and AI-powered definitions — all in one app. ## KMate 3 for Windows latest (Store...
    Downloads: 28 This Week
    Last Update:
    See Project
  • 4
    Onda Sfasata

    Onda Sfasata

    An authentic Italian learning app.

    ...GitHub repository: https://github.com/Northstrix/onda-sfasata Check it out at: https://onda-sfasata.netlify.app/ This app is fully localized into English, Hebrew, and two dialects of German — Hochdeutsch and a mixture of Zurich and Basel dialects (approximately 64%–36%), labeled as “Schwiizerdütsch” I picked the words for this app not based on predefined categories, usage frequency, or the fluency level to which the word might correspond, but on which words could be cleanly cut from the audio tracks. As a result, the word set turned out to be a bit odd, yet unique. Every single sound used in the app, except for success.wav, error.wav, and completed.wav, was extracted from public domain recordings. The success and error sounds are covered by Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/), the completed sound is available under Creative Commons 0 License (http://creativecommons.org/publicdomain/zero/1.0/)
    Downloads: 1 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5

    pyLogos

    Qualitative content analysis software.

    ...Documents (imported from txt and docx files) are stored in a database, and may have marked text segments associated with codes. It is possible to retrieve these segments in various ways, generate word clouds, tabulate frequency of codes and words, among other outputs. pyLogos é um programa de apoio à análise de conteúdo de textos. Documentos (importados de arquivos txt e docx) são armazenados numa base de dados, podendo ter segmentos de textos marcados a associados a códigos. É possível recuperar esses segmentos de várias formas, gerar nuvens de palavras, tabular frequência de códigos e palavras, entre outras saídas.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    spectrograph

    spectrograph

    The program analyzes sound when you talk into a Headset microphone

    ...The complete source code is included as an ASM file in 2 copies, one copy is ready for assembly with the Qeditor of free MASM32 package, and the other copy is ready for assembly with the free FASMW assembler. Using the ASM file one can try to make improvements to the sound-analyzing. I have removed from it one word which was triggering a false positive in Avast!
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    OCR Manga Reader for Android

    OCR Manga Reader for Android

    Android Manga reader with Japanese OCR and dictionary capabilities

    OCR Manga Reader is a free and open source Android app that allows you to quickly OCR and lookup Japanese words in real-time. It does not have ads or telemetry/spyware and does not require an Internet connection. Supports both EDICT and EPWING dictionaries. Requires Android 4.0 (Ice Cream Sandwich) or higher. See http://ocrmangareaderforandroid.sourceforge.net/ for details.
    Leader badge
    Downloads: 32 This Week
    Last Update:
    See Project
  • 8
    TXM

    TXM

    Unicode XML TEI text analysis platform

    TXM is a free and open-source cross-platform Unicode & XML based text analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. DOWNLOAD LATEST VERSION OF TXM : http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP...
    Leader badge
    Downloads: 9 This Week
    Last Update:
    See Project
  • 9

    pipZuseZ3

    An emulator for the Zuse Z3 computer, invented in 1941

    An emulator for the Zuse Z3 computer, invented in 1941 by Konrad Zuse. It was the world's first working programmable, fully automatic digital computer. The Z3 was built with 2,600 relays, implementing a 22-bit word length that operated at a clock frequency of about 5–10 Hz. Program code was stored on punched film. Initial values were entered manually.
    Downloads: 12 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10

    Linguistic Analyzer

    The Linguistic Analyzer is a tool for corpus analysis and comparison

    The Linguistic Analyzer (Almuhalil Alloghawy) is a free tool designed by a team from Al-Imam Muhammad bin Saud islamic university that can be used for corpus analysis and comparison in terms of the several linguistic characteristics, such as frequency lists generation, concordances, collocation extraction, the difference between two words, and keyword identification.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Texthero

    Texthero

    Text preprocessing, representation and visualization from zero to hero

    Texthero is a python package to work with text data efficiently. It empowers NLP developers with a tool to quickly understand any text-based dataset and it provides a solid pipeline to clean and represent text data, from zero to hero.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    HSKinter

    HSKinter

    Chinese Words Study (HSK 1–5) on Desktop and Phone

    ...Flashcards, practice of hanzi meaning, pinyin and tones, stats of accuracy. Optional pronunciation via gTTS (Google). Compatible with Pydroid 3 (runs on Android). The frequency of a word showing up depends on its retaining level and time since the last answer (age).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    PSWordCloud

    PSWordCloud

    Create pretty word clouds with PowerShell!

    Create pretty word clouds with PowerShell!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    autonomous-response

    # autonomous-re.sh generates a random Word Frequency per list of 10 ;

    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Word frequency and diversity (distribution) across hundreds of corpora. You'll see both the lemma and the various forms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    jieba

    jieba

    Stuttering Chinese word segmentation

    "Jaba" Chinese word segmentation, do the best Python Chinese word segmentation component. Four word segmentation modes are supported. Precise mode, which tries to cut the sentence most precisely, suitable for text analysis. Full mode, scans all the words that can be formed into words in the sentence, the speed is very fast, but the ambiguity cannot be resolved. The search engine mode, on the basis of the precise mode, divides the long words again to improve the recall rate, which is suitable...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17

    Meaning Explorer

    A tool for analyzing the words of the Quran

    The main purpose of this tool is to help users in extracting syntagmatic relations between words, lemmas and roots available in the Quran; these relations include identifying significant collocates and words’ co-occurrences. In addition, the tool also provides other helpful functionalities that complement the primary purpose, which include a Key Word In Context (KWIC) concordance, in addition to frequency lists of all words, lemmas and roots in the holy Quran. The main intended users of this tool are Arabic Quranic scholars and linguists. The Meaning Explorer applies a new distributional semantic model to extract words’ significant co-occurrences from the Quran. This model is based on the Refined MI association measure applied to all words within a symmetric sliding window of five words surrounding the node word. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Ghawwas_V4

    An open source system for Arabic corpora processing

    Ghawwas (previously known as Khawas) is an open source system for Arabic corpora processing. Ghawwas V4.0 provides the following main functions: a. Frequency list for single word and N-Grams b. Concordance c. Collocation (MI, CHI Squared, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient) d. Lexical patterns search e. Two corpora frequency profile comparison based on MI, CHI, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient f. Accept Windows and UTF-8 character encoding g. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    mzitu

    mzitu

    Python crawler that downloads image galleries and analyzes titles

    ...It focuses on automating the collection of large sets of images by programmatically parsing page content and iterating through gallery entries. mzitu also includes a simple analysis script that processes downloaded folder names to generate statistics and visualizations. Using text segmentation and frequency analysis, the project can create a word cloud representing common keywords found in the dataset. This makes the repository both a scraping example and a small data analysis experiment built around the collected content. Overall, mzitu serves as a learning-oriented implementation of Python web scraping, data processing, and visualization techniques.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    tinfoleak

    tinfoleak

    OSINT tool for extracting and analyzing Twitter intelligence data

    tinfoleak is an open source intelligence (OSINT) and social media intelligence (SOCMINT) tool designed to automate the collection and analysis of data from Twitter. It focuses on helping analysts extract large volumes of information from Twitter timelines using identifiers such as usernames, geographic coordinates, or keywords. Once the data is gathered, tinfoleak organizes it into structured information that can support intelligence analysis and investigative research. tinfoleak is capable...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    pydictor

    pydictor

    powerful and useful hacker dictionary builder for a brute-force attack

    ...You can use pydictor to generate a general blast wordlist, a custom wordlist based on Web content, a social engineering wordlist, and so on; You can use the pydictor built-in tool to safe delete, merge, unique, merge and unique, count word frequency to filter the wordlist, besides, you also can specify your wordlist and use '-tool handler' to filter your wordlist. You can generate highly customized and complex wordlists by modifying multiple configuration files, adding your own dictionary, using leet mode, filter by length, char occur times, types of different char, regex, and even add customized encode scripts in /lib/encode/ folder, add your own plugin script in /plugins/ folder, add your own tool script in /tools/ folder.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22

    rubycollect4j

    Ruby Collections for Java

    The rubycollect4j implements all methods refer to Ruby Array, Hash and Enumerable. For further info, please visit http://ruby-doc.org website. For more documentation, please read JavaDoc. With Java 8: // Sort the characters by its frequency based on the word 'Mississippi' case-insensitively RubyArray<String> word = ra("Mississippi".split("(?!^)")); String result = word.map((c) -> { return c.toLowerCase(); }).sortBy((c) -> { return word.count(c); }).uniq().join(); p(result); // "mpis" With Java 6: // Sort the characters by its frequency based on the word 'Mississippi' case-insensitively final RubyArray<String> word = newRubyArray("Mississippi".split("(?!...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Word Doctor

    Word Doctor

    Nextgen word app. Word Docs made easy!

    Word Doctor is a word editor/ writers aid, designed to analyze writing "Content" and "Style". Inspire your creative process and get to work fast using dictation (Speech to Text), or the Ink-Blot test to inspire creativity. Analyze what you already have and Identify imagery, weak writing structures, and more. Content is king, and Word Doctor can certainly help with that!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    Rikaisama (Legacy)

    Modification of Rikaichan with more features and customization options

    *** THIS ADD-ON IS NO LONGER SUPPORTED AND WILL NOT WORK WITH FIREFOX 57+ (however, it still works in Waterfox using a non-e10s window: "File > New Non-e10s Window") *** Rikaisama is a modification of the rikaichan Japanese-English popup dictionary that adds many features and customization options such as audio pronunciation, EPWING dictionary support, sanseido web dictionary support, word frequency, pitch accent, enhanced clipboard & save options, ability to create and add cards directly to an open Anki deck, "Super Sticky" mode, ability to remap shortcut keys, more fine-tuned startup options, and more. See http://rikaisama.sourceforge.net/ for more information. Supports Windows, Ubuntu, and newer versions of OSX.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    JGlossator

    JGlossator

    Creates glosses for Japanese text

    JGlossator can create a gloss for Japanese text complete with de-inflected expressions, readings, audio pronunciation, example sentences, pitch accent, word frequency, kanji information, and grammar analysis. See http://jglossator.sourceforge.net/ for more information and screenshots. Inspired by Translation Aggregator, but aimed primarily at people learning Japanese.
    Leader badge
    Downloads: 7 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB