Showing 23056 open source projects for "open data assimilation"

  • 1
    Orange Data Mining

    Orange: Interactive data analysis

    Open source machine learning and data visualization. Build data analysis workflows visually, with a large, diverse toolbox. Perform simple data analysis with clever data visualization. Explore statistical distributions, box plots and scatter plots, or dive deeper with decision trees, hierarchical clustering, heatmaps, MDS and linear projections. Even your multidimensional data can become sensible in 2D, especially with clever attribute ranking and selections. Interactive data exploration...
    Downloads: 93 This Week
  • 2
    Data Formulator

    Create rich visualizations with AI

    To create rich visualizations, data analysts often need to iterate back and forth among data processing and chart specification to achieve their goals. To achieve this, analysts need not only proficiency in data transformation and visualization tools but also efforts to manage the branching history consisting of many different versions of data and charts. Recent LLM-powered AI systems have greatly improved visualization authoring experiences, for example by mitigating manual data transformation...
    Downloads: 3 This Week
  • 3
    Open Source Point of Sale

    Open Source Point of Sale is a web based point of sale application

    Open Source Point of Sale (OSPOS) is a free and open-source point-of-sale system for small and medium-sized businesses. It helps manage sales, inventory, and customer data through a user-friendly web interface. OSPOS is highly customizable and supports barcode scanning, multi-language support, and detailed reporting, making it ideal for retail stores, restaurants, and small businesses seeking an affordable POS solution.
    Downloads: 44 This Week
  • 4
    Explorer

    Series (one-dimensional) and dataframes (two-dimensional)

    Explorer brings series (one-dimensional) and data frames (two-dimensional) to Elixir for fast data exploration.
    Downloads: 1 This Week
  • 5
    data-diff

    Efficiently diff rows across two different databases

    We're excited to announce the launch of a new open-source product, data-diff, that makes comparing datasets across databases fast at any scale. data-diff automates data quality checks for data replication and migration. In modern data platforms, data is constantly moving between systems, and at modern data volumes and complexity, systems go out of sync all the time. Until now, there has not been any tooling to ensure that the data is correctly copied. Replicating data at scale, across...
    Downloads: 1 This Week
  • 6
    Azure Data Studio

    A data management tool that enables working with other SQL tools

    Azure Data Studio is a cross-platform database tool for data professionals who use on-premises and cloud data platforms on Windows, macOS, and Linux. Azure Data Studio offers a modern editor experience with IntelliSense, code snippets, source control integration, and an integrated terminal. It's engineered with the data platform user in mind, with the built-in charting of query result sets and customizable dashboards. Use Azure Data Studio to query, design, and manage your databases and data...
    Downloads: 18 This Week
  • 7
    data.table

    Extends base R’s data.frame for high-performance data manipulation

    data.table is an R package that extends base R’s data.frame for high-performance data manipulation. It offers concise syntax, blazing speed, and memory-efficient operations. It supports fast file reading/writing, joins, grouping, reshaping, and updates by reference. It is heavily used in large data workflows, big data in R, production pipelines, etc. Extremely efficient grouping/aggregation/summarization; can handle very large datasets (hundreds of millions to billions of rows) in memory...
    Downloads: 0 This Week
  • 8
    Open Interpreter

    A natural language interface for computers

    Open Interpreter is an open-source tool that provides a natural-language interface for interacting with your computer. It lets large language models (LLMs) run code locally (Python, JavaScript, shell, etc.), enabling you to ask your computer to do tasks like data analysis, file manipulation, browsing, etc. in human terms (“chat with your computer”), with safeguards. Runs locally or via configured remote LLM servers/inference backends, giving flexibility to use models you trust or have locally...
    Downloads: 1 This Week
  • 9
    Open MCT

    A web based mission control framework

    Open MCT is a next-generation mission operations data visualization framework. Web-based, for desktop and mobile. Software based on Open MCT is in use as a data visualization tool in support of multiple missions at the Jet Propulsion Laboratory, and at NASA's Ames Research Center to support the development of lunar rover mission concepts. Open MCT can be adapted for planning and operations of any system that produces telemetry. While Open MCT is developed to support space missions, its core...
    Downloads: 0 This Week
  • 10
    Open Asset Import Library

    The open-asset-importer-library repository

    The Open Asset Import Library (short name: Assimp) is a portable open-source library to import various well-known 3D model formats in a uniform manner. The most recent version also knows how to export 3D files and is therefore suitable as a general-purpose 3D model converter. open3mod is a Windows-based model viewer. It loads all file formats that Assimp supports and is well suited to quickly inspecting 3D assets. Assimp aims to provide a full asset conversion pipeline...
    Downloads: 8 This Week
  • 11
    Spring Data JPA

    Simplifies the development of creating a JPA-based data access layer

    Spring Data JPA, part of the larger Spring Data family, makes it easy to implement JPA-based repositories. This module deals with enhanced support for JPA-based data access layers. It makes it easier to build Spring-powered applications that use data access technologies. Implementing a data access layer of an application has been cumbersome for quite a while. Too much boilerplate code has to be written to execute simple queries as well as perform pagination and auditing. Spring Data JPA...
    Downloads: 2 This Week
  • 12
    Open Web Analytics (OWA)

    Official repository for Open Web Analytics

    Open Web Analytics (OWA) is an open-source web analytics framework that tracks and analyzes visitor behavior on websites and applications. It provides insights into page views, user demographics, and engagement metrics. OWA can be self-hosted, giving users full control over their data. It is an alternative to commercial analytics platforms and supports integration with WordPress and other CMS platforms.
    Downloads: 4 This Week
  • 13
    Open Policy Agent

    An open source, general-purpose policy engine

    Policy-based control for cloud-native environments. Flexible, fine-grained control for administrators across the stack. Stop using a different policy language, policy model, and policy API for every product and service you use. Use OPA for a unified toolset and framework for policy across the cloud-native stack. Whether for one service or for all your services, use OPA to decouple policy from the service's code so you can release, analyze, and review policies (which security and compliance...
    Downloads: 4 This Week
  • 14
    AWESOME DATA SCIENCE

    Awesome Data Science repository to learn and apply for real world

    An open source Data Science repository to learn and apply towards solving real world problems. This is a shortcut path to start studying Data Science. Just follow the steps to answer the questions, "What is Data Science and what should I study to learn Data Science?" Data Science is one of the hottest topics in computing and on the Internet nowadays. People have been gathering data from applications and systems for years, and now is the time to analyze it. The next steps are producing...
    Downloads: 1 This Week
  • 15
    Open X-Embodiment

    Unified open dataset enabling cross-embodiment learning for robotics

    Open X-Embodiment is a large-scale collaborative initiative led by Google DeepMind to unify robotic learning datasets into a consistent and standardized format, simplifying access and usage across the robotics research community. Its primary goal is to make all available open-source robotic data interoperable by representing them using the RLDS (Reinforcement Learning Dataset Structure) episode format. This enables seamless integration for training, evaluation, and model development across...
    Downloads: 2 This Week
  • 16
    Open-Source auth provider

    Open source alternative to Auth0 / Firebase Auth, etc.

    Add secure login and session management to your apps. SDKs available for popular languages and front-end frameworks e.g. Node.js, Go, Python, React.js, React Native, Vanilla JS, etc. Supertokens architecture is optimized to add secure authentication for your users without compromising on user and developer experience. Frontend SDK: Manages session tokens and renders login UI widgets. Backend SDK: Provides APIs for sign-up, sign-in, signout, session refreshing etc. Your Frontend will talk to...
    Downloads: 2 This Week
  • 17
    Open-IM-Server

    Open source Instant Messaging Server

    ... and quickly, and ensure the security and privacy of business data. Open-IM relies on five open source high-performance components: ETCD, MySQL, MongoDB, Redis, and Kafka. Private deployment: before deploying Open-IM-Server, make sure that these five components have been installed. If your server does not have them, you must first install the missing components. If you already have them, it is recommended to use them directly.
    Downloads: 1 This Week
  • 18
    Open Source Routing Machine

    Open Source Routing Machine - C++ backend

    High-performance routing engine written in C++14 and designed to run on OpenStreetMap data. Several services are available via an HTTP API, a C++ library interface, and a Node.js wrapper. Nearest snaps coordinates to the street network and returns the nearest matches. Route finds the fastest route between coordinates. Table computes the duration or distances of the fastest route between all pairs of supplied coordinates. Match snaps noisy GPS traces to the road network in the most plausible way. Trip...
    Downloads: 1 This Week
  • 19
    Data-Juicer

    Data processing for and with foundation models

    Data-Juicer is an open-source data processing and augmentation framework designed to enhance the quality and diversity of datasets for machine learning tasks. It includes a modular pipeline for scalable data transformation.
    Downloads: 0 This Week
  • 20
    AWS Data Wrangler

    Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.

    An AWS Professional Service open-source Python initiative that extends the power of the Pandas library to AWS, connecting DataFrames and AWS data-related services. Easy integration with Athena, Glue, Redshift, Timestream, OpenSearch, Neptune, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON, and EXCEL). Built on top of other open-source projects like Pandas, Apache Arrow and Boto3, it offers abstracted functions to execute usual...
    Downloads: 0 This Week
  • 21
    The CUE Data Constraint Language

    The home of the CUE language. Validate and define text-based config

    CUE is an open source data constraint language which aims to simplify tasks involving defining and using data. CUE merges the notion of schema and data. The same CUE definition can simultaneously be used for validating data and act as a template to reduce boilerplate. Schema definition is enriched with fine-grained value definitions and default values. At the same time, data can be simplified by removing values implied by such detailed definitions. The merging of these two concepts enables many...
    Downloads: 0 This Week
  • 22
    Rust Data Analysis

    Rust for data analysis encyclopedia (WIP)

    Welcome to the Rust Data Analysis repository! This collection of Jupyter notebooks provides a comprehensive exploration of data analysis using Rust. Powered by a Rust kernel, these notebooks allow you to dive deep into the realm of data analysis, leveraging the capabilities of the Rust programming language. With the help of various Rust libraries, such as ndarray, plotters, and more, you'll be able to extract valuable insights from different datasets with ease.
    Downloads: 0 This Week
  • 23
    Data Annotator for Machine Learning

    Data annotator for machine learning

    Data Annotator for Machine Learning (DAML) is an application that lets machine learning teams centrally create, manage, and administer annotation projects. Active learning with uncertainty sampling to query unlabeled data. Project tracking with real-time data aggregation and review process. User management panel with role-based access control.
    Downloads: 0 This Week
  • 24
    Azure Data SQL Samples

    Official Microsoft repository containing code samples for SQL Server

    This GitHub repository contains code samples that demonstrate how to use Microsoft's Azure Data products including SQL Server, Azure SQL Database, Azure Synapse, and Azure SQL Edge. Each sample includes a README file that explains how to run and use the sample. Note that certain features, such as In-Memory OLTP, are edition-specific for SQL Server, so a sample that relies on them works only when it is run on an edition that supports the feature. Releases allow you to conveniently download...
    Downloads: 0 This Week
  • 25
    Synthetic Data Kit

    Tool for generating high quality Synthetic datasets

    Synthetic Data Kit is a CLI-centric toolkit for generating high-quality synthetic datasets to fine-tune Llama models, with an emphasis on producing reasoning traces and QA pairs that line up with modern instruction-tuning formats. It ships an opinionated, modular workflow that covers ingesting heterogeneous sources (documents, transcripts), prompting models to create labeled examples, and exporting to fine-tuning schemas with minimal glue code. The kit’s design goal is to shorten the “data prep...
    Downloads: 0 This Week