Search Results for "data mining research papers"

Showing 155 open source projects for "data mining research papers"

View related business solutions
  • The All-in-One Commerce Platform for Businesses - Shopify Icon
    The All-in-One Commerce Platform for Businesses - Shopify

    Shopify offers plans for anyone that wants to sell products online and build an ecommerce store, small to mid-sized businesses as well as enterprise

    Shopify is a leading all-in-one commerce platform that enables businesses to start, build, and grow their online and physical stores. It offers tools to create customized websites, manage inventory, process payments, and sell across multiple channels including online, in-person, wholesale, and global markets. The platform includes integrated marketing tools, analytics, and customer engagement features to help merchants reach and retain customers. Shopify supports thousands of third-party apps and offers developer-friendly APIs for custom solutions. With world-class checkout technology, Shopify powers over 150 million high-intent shoppers worldwide. Its reliable, scalable infrastructure ensures fast performance and seamless operations at any business size.
    Learn More
  • Level Up Your Cyber Defense with External Threat Management Icon
    Level Up Your Cyber Defense with External Threat Management

    See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

    Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.
    Try for Free
  • 1
    Apache Sedona

    Apache Sedona

    Cluster computing framework for processing large-scale geospatial data

    ... query workloads. According to our benchmark and third-party research papers, Sedona has 50% less peak memory consumption than other Spark-based geospatial data systems for large-scale in-memory query processing. Sedona offers Scala, Java, Spatial SQL, Python, and R APIs and integrates them into underlying system kernels with care. You can simply create spatial analytics and data mining applications and run them in any cloud environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Qlib

    Qlib

    Qlib is an AI-oriented quantitative investment platform

    Qlib is an AI-oriented quantitative investment platform, which aims to realize the potential, empower the research, and create the value of AI technologies in quantitative investment. With Qlib, you can easily try your ideas to create better Quant investment strategies. An increasing number of SOTA Quant research works/papers are released in Qlib. With Qlib, users can easily try their ideas to create better Quant investment strategies. At the module level, Qlib is a platform that consists...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    PyVista

    PyVista

    3D plotting and mesh analysis through a streamlined interface

    ... for scientific plotting for presentations and research papers as well as a supporting module for other mesh-dependent Python modules. Easily integrate with NumPy and create a variety of geometries and plot them. You could use any geometry to create your glyphs, or even plot the points directly. Direct access to mesh analysis and transformation routines. Intuitive plotting routines with matplotlib similar syntax.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Google DeepMind GraphCast and GenCast

    Google DeepMind GraphCast and GenCast

    Global weather forecasting model using graph neural networks and JAX

    GraphCast, developed by Google DeepMind, is a research-grade weather forecasting framework that employs graph neural networks (GNNs) to generate medium-range global weather predictions. The repository provides complete example code for running and training both GraphCast and GenCast, two models introduced in DeepMind’s research papers. GraphCast is designed to perform high-resolution atmospheric simulations using the ERA5 dataset from ECMWF, while GenCast extends the approach with diffusion...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Get the most trusted enterprise browser Icon
    Get the most trusted enterprise browser

    Advanced built-in security helps IT prevent breaches before they happen

    Defend against security incidents with Chrome Enterprise. Create customizable controls, manage extensions and set proactive alerts to keep your data and employees protected without slowing down productivity.
    Download Chrome
  • 5
    Awesome-Quant

    Awesome-Quant

    A curated list of insanely awesome libraries, packages and resources

    awesome-quant is a curated list (“awesome list”) of libraries, packages, articles, and resources for quantitative finance (“quants”). It includes tools, frameworks, research papers, blogs, datasets, etc. It aims to help people working in algorithmic trading, quant investing, financial engineering, etc., find useful open source or educational resources. Licensed under typical “awesome” list standards.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    ArchiveBox

    ArchiveBox

    Open source self-hosted web archiving

    .... It can be used to save copies of bookmarks, preserve evidence for legal cases, backup photos from FB/Insta/Flickr or media from YT/Soundcloud/etc., save research papers, and more. ArchiveBox is an open-source, self-hosted web archiving tool for saving websites offline. It helps organizations and individuals preserve bookmarks, research papers, and social media content, among others.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    ThuThesis

    ThuThesis

    LaTeX Thesis Template for Tsinghua University

    ThuThesis is an abbreviation of Tsinghua University Thesis LaTeX Template. This package establishes a simple and easy-to-use LaTeX template for Tsinghua dissertations, including general undergraduate research papers, masters theses, doctoral dissertations, and postdoctoral reports. This template is subject to frequent changes. Please make sure you have read the usage documentation and example code completely and carefully before using and asking questions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    LangExtract

    LangExtract

    A Python library for extracting structured information

    LangExtract is a Python library developed by Google that leverages large language models (LLMs) to extract structured information from unstructured text—such as clinical notes, research papers, or literary works—based on user-defined instructions. It is designed to transform free-form text into reliable, schema-constrained data while maintaining traceability back to the source material. Each extracted entity is precisely grounded in its original context, allowing visual inspection...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    MoCo (Momentum Contrast)

    MoCo (Momentum Contrast)

    Self-supervised visual learning using momentum contrast in PyTorch

    MoCo is an open source PyTorch implementation developed by Facebook AI Research (FAIR) for the papers “Momentum Contrast for Unsupervised Visual Representation Learning” (He et al., 2019) and “Improved Baselines with Momentum Contrastive Learning” (Chen et al., 2020). It introduces Momentum Contrast (MoCo), a scalable approach to self-supervised learning that enables visual representation learning without labeled data. The core idea of MoCo is to maintain a dynamic dictionary with a momentum...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Simple, Secure Domain Registration Icon
    Simple, Secure Domain Registration

    Get your domain at wholesale price. Cloudflare offers simple, secure registration with no markups, plus free DNS, CDN, and SSL integration.

    Register or renew your domain and pay only what we pay. No markups, hidden fees, or surprise add-ons. Choose from over 400 TLDs (.com, .ai, .dev). Every domain is integrated with Cloudflare's industry-leading DNS, CDN, and free SSL to make your site faster and more secure. Simple, secure, at-cost domain registration.
    Sign up for free
  • 10
    funNLP

    funNLP

    Resources, corpora, and tools for Chinese natural language processing

    ... dictionaries, stopwords). It also includes links to academic papers, open-source model implementations, and practical utilities like word segmentation or text cleaning scripts. The project is highly community-oriented, frequently updated with contributions and new resources, and it’s widely used in both academic and applied NLP research. Its value lies in providing not just tools but also curated, domain-specific data, which can be hard to find elsewhere.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Skim

    Skim

    A PDF Reader and Note-taker for OS X

    Skim is a PDF reader and note-taker for OS X. It is designed to help you read and annotate scientific papers in PDF, but is also great for viewing any PDF file. Skim requires OS X 10.10 or higher.
    Leader badge
    Downloads: 2,661 This Week
    Last Update:
    See Project
  • 12
    Auditory Modeling Toolbox
    The auditory modeling toolbox (AMT) is a Matlab/Octave toolbox for the development and application of auditory computational models. Over 50 auditory models implemented in Matlab, Octave, C, C++, and Python can be run from Matlab and Octave, on Windows and Linux. The AMT provides a well-structured in-code documentation, includes auditory data required to run the models. It integrates functionality to reproduce the model predictions. Model implementations can be evaluated in two stages...
    Leader badge
    Downloads: 70 This Week
    Last Update:
    See Project
  • 13
    QUAST

    QUAST

    Quality Assessment Tool for Genome Assemblies

    ... analysis results in summary tables (in plain text, tab-separated, and LaTeX formats) and colorful plots. The tool also produces web-based reports condensing all information in one easy-to-navigate file. QUAST and its three follow-up papers (MetaQUAST, Icarus, QUAST-LG) papers were published in Bioinformatics; the last paper (WebQUAST) is out in Nucl Acid Research.
    Leader badge
    Downloads: 55 This Week
    Last Update:
    See Project
  • 14
    UnBBayes

    UnBBayes

    Framework & GUI for Bayes Nets and other probabilistic models.

    UnBBayes is a probabilistic network framework written in Java. It has both a GUI and an API with inference, sampling, learning and evaluation. It supports Bayesian networks, influence diagrams, MSBN, OOBN, HBN, MEBN/PR-OWL, PRM, structure, parameter and incremental learning. Please, visit our wiki (https://sourceforge.net/p/unbbayes/wiki/Home/) for more information. Check out the license section (https://sourceforge.net/p/unbbayes/wiki/License/) for our licensing policy.
    Leader badge
    Downloads: 16 This Week
    Last Update:
    See Project
  • 15
    Euler Pole Calculator (EPC)

    Euler Pole Calculator (EPC)

    A Matlab software to estimate Euler pole parameters

    The EPC software calculates velocities for any points located on the earth’s surface given the Euler pole parameters and estimates the Euler pole parameters given the observed velocities of a set of sites located on a tectonic plate. Since version 2, the software is able to estimate or apply the Plate Translation Rates, as well. Please cite one of the following papers when you use the software: - Goudarzi, M A, Cocard, M, and Santerre, R. 2014."EPC: Matlab Software to Estimate Euler Pole...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16

    Hirtius

    A Historical Research Assistant

    Hirtius is a web-based historical research assistant created by Les Ateliers du Héron. Its goal is to replace the historian's notebook and box of cards, while providing a collaborative work environment for a whole team of researchers, that can be thought of as a purpose-built CMS cross-bred with a digital pictures management system, a genealogy application and a Wiki. Eventually, based on the information you stored in the application, and the links you created between the various data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Habfuzz

    Habfuzz

    A command-line tool for data-driven fuzzy modelling

    ... suitability in various discharge scenarios in a simulated river reach. It comes with no graphical user interface but it's a one-click tool. Just provide your input and let HABFUZZ provide you the output. The HABFUZZ manual https://github.com/chtheodoro/habfuzz/blob/master/HABFUZZ_v2.5_manual.pdf HABFUZZ in the Journal of Open Source Software http://joss.theoj.org/papers/1ad27db8f0976c28a75e20d34eba5ee2 HABFUZZ website https://chtheodoro.wixsite.com/habfuzz
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    stkpp

    stkpp

    C++ Statistical ToolKit

    ... classes in C++ and is suitable for projects ranging from small one-off projects to complete data mining application suites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    General Knowledge Machine Project

    General Knowledge Machine Project

    Intellect Modeling Kit: assisting research, diagnostics, consulting

    ...; * Memorizing - new knowledge item creation; * Abstraction – building objects representing typical signs of similar objects groups, data mining. KM is not intended to replace human experts, it is built to multiply abilities. Machine should not be responsible for decisions. The IMK is designed to create ready-to-use software applications using simple text files. Any human knowledge can be uploaded to KM by expert not familiar with software coding. Demos present in kit. Good luck!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Awesome Fraud Detection Research Papers

    Awesome Fraud Detection Research Papers

    A curated list of data mining papers about fraud detection

    A curated list of data mining papers about fraud detection from several conferences.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Karate Club

    Karate Club

    An API Oriented Open-source Python Framework for Unsupervised Learning

    Karate Club is an unsupervised machine learning extension library for NetworkX. Karate Club consists of state-of-the-art methods to do unsupervised learning on graph-structured data. To put it simply it is a Swiss Army knife for small-scale graph mining research. First, it provides network embedding techniques at the node and graph level. Second, it includes a variety of overlapping and non-overlapping community detection methods. Implemented methods cover a wide range of network science...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    BitMagic Library

    Compressed bit-sets, sparse bit matrices and algorithms

    BitMagic - C and C++ library implementing dynamic bitvectors and bit-set algorithms with several types of on-the-fly, adaptive compression. Designed for use in databases, search systems, data-mining algorithms, scientific projects. The core of the library is C++, but it provides C-compatibility wrappers and can be compiled without C++ runtime. Optimizations for Intel SSE2, SSE4.2 and AVX2.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    Text Analysis Markup System
    Text Analysis Markup System (TAMS) is both a system of marking documents for qualitative analysis and a series of tools for mining information based on that syntax.
    Leader badge
    Downloads: 17 This Week
    Last Update:
    See Project
  • 24
    Brain Tokyo Workshop

    Brain Tokyo Workshop

    Experiments and code from Google Brain’s Tokyo research workshop

    .... The repository includes implementations, experimental data, and supporting research papers that accompany published studies. Notable works such as Weight Agnostic Neural Networks and Neuroevolution of Self-Interpretable Agents highlight the team’s exploration of how AI can learn more efficiently and transparently. Overall, this repository serves as an open research hub for sharing ideas and advancing the understanding of intelligent systems.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Fairseq

    Fairseq

    Facebook AI Research Sequence-to-Sequence Toolkit written in Python

    Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. These ideas are encapsulated...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.