Showing 134 open source projects for "partitioning"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    pg_partman

    pg_partman

    Partition management extension for PostgreSQL

    pg_partman is a PostgreSQL extension for managing time-based and serial-based table partitioning. It automates the creation, maintenance, and retention of partitions, making it easier to manage large datasets with better performance and query efficiency. Designed for both native and trigger-based partitioning, pg_partman is ideal for applications handling time-series or log data.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    yabai

    yabai

    A tiling window manager for macOS based on binary space partitioning

    yabai is a tiling window manager for macOS that extends the native windowing system with fully scriptable command-line control. It uses a binary space partitioning algorithm to auto-tile windows, supports extensive keyboard shortcut mapping (via skhd), and enhances productivity for power users.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 3
    Easy GPU PV

    Easy GPU PV

    A Project dedicated to making GPU Partitioning on Windows easier

    Easy-GPU-PV is an open-source project that automates the relatively complex steps required to set up GPU Paravirtualization (GPU-PV) on Windows Hyper-V, letting you partition a physical GPU and assign slices to virtual machines for low-latency remote use. It creates and installs a Windows VM, copies the necessary GPU-PV driver files into the VM, and helps install Parsec (an ultra-low-latency remote-desktop client) so you can connect to the VM at high framerates. The repo is practical and...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 4
    Olric

    Olric

    Distributed, in-memory key/value store and cache

    A lightweight, distributed in-memory data store designed for key-value caching and ephemeral storage.
    Downloads: 3 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Infinispan

    Infinispan

    Infinispan is an open source data grid platform

    Infinispan is a distributed in-memory data grid and caching system designed for high-performance computing. It allows applications to scale dynamically by distributing data across multiple nodes, reducing latency and improving resilience.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 6
    DLRM

    DLRM

    An implementation of a deep learning recommendation model (DLRM)

    ...The architecture combines dense (MLP) and sparse (embedding) branches, then interacts features via dot product or feature interactions before passing through further dense layers to predict click-through, ranking scores, or conversion probabilities. The implementation is optimized for performance at scale, supporting multi-GPU and multi-node execution, quantization, embedding partitioning, and pipelined I/O to feed huge embeddings efficiently. It includes data loaders for standard benchmarks (like Criteo), training scripts, evaluation tools, and capabilities like mixed precision, gradient compression, and memory fusion to maximize throughput.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Arch Installer

    Arch Installer

    Arch Linux installer - guided, templates etc.

    Just another guided/automated Arch Linux installer with a twist. The installer also doubles as a python library to install Arch Linux and manage services, packages, and other things inside the installed system (Usually from a live medium).
    Downloads: 14 This Week
    Last Update:
    See Project
  • 8
    Resurrecting Project in LinUtil

    Resurrecting Project in LinUtil

    Automated Arch Linux Install

    Resurrecting Project in LinUtil is an automated Arch Linux installation project created by Chris Titus that wraps Arch’s traditionally manual install process in a guided script. It aims to make Arch more approachable by handling tasks like disk partitioning, base system installation, and desktop environment selection through a menu-driven interface. The repository includes configuration presets and scripts that not only install the system but also set up a curated software stack and sensible defaults. Over time, the project evolved and was eventually archived in favor of a newer, streamlined Arch install script, but ArchTitus remains available as a reference for its original approach. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    exo

    exo

    Run your own AI cluster at home with everyday devices

    Run your own AI cluster at home with everyday devices. Maintained by exo labs. Forget expensive NVIDIA GPUs, unify your existing devices into one powerful GPU, iPhone, iPad, Android, Mac, Linux, or pretty much any device. Now the default models, run 8B, 70B, and 405B parameter models on your own devices.
    Downloads: 5 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    Redis

    Redis

    An in-memory database that persists on disk

    ...Redis has built-in replication, Lua scripting, LRU eviction, transactions, and different levels of on-disk persistence, and provides high availability via Redis Sentinel and automatic partitioning with Redis Cluster. You can run atomic operations on these types, like appending to a string; incrementing the value in a hash; pushing an element to a list; computing set intersection, union and difference; or getting the member with highest ranking in a sorted set. To achieve top performance, Redis works with an in-memory dataset. ...
    Downloads: 36 This Week
    Last Update:
    See Project
  • 11
    rav1e

    rav1e

    The fastest and safest AV1 encoder

    ...Despite aiming for simplicity, rav1e supports a wide range of AV1 features: different bit depths, chroma subsampling formats, prediction and transform modes, and block partitioning options, which means it can produce reasonably efficient compressed video.
    Downloads: 31 This Week
    Last Update:
    See Project
  • 12
    Liftbridge

    Liftbridge

    Lightweight, fault-tolerant message streams

    ...Use Liftbridge as a simpler and lighter alternative to systems like Kafka and Pulsar or to add streaming semantics to an existing NATS deployment. Stream replication provides high availability and durability of messages. Clustering and partitioning provide horizontal scalability for streams and their consumers. No heavy or unwieldy dependencies like ZooKeeper or the JVM. Liftbridge is a single static binary roughly 16MB in size. It has a simple gRPC-based API which makes it quick to implement client libraries. Create streams that match wildcard topics, such as stock.nyse. or stock.nasdaq. in addition to topic literals like stock.nasdaq.msft. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    pgstream

    pgstream

    PostgreSQL replication with DDL changes

    pgstream is an open source CDC command-line tool and library that offers Postgres replication support with DDL changes to any provided output. One of the main differentiators of pgstream is the fact that it tracks and replicates schema changes automatically. It relies on SQL triggers that will populate a Postgres table (pgstream.schema_log) containing a history log of all DDL changes for a given schema. Whenever a schema change occurs, this trigger creates a new row in the schema log table...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14

    Magnetic tape partitioning tool

    command line tool to partition LTO-5+ tapes

    This little shell script can create partitions on an LTO-5+ tape. LTO-5 tapes can hold 2 partitions, LTO-6+ tapes can hold up to 4 partitions. It can work as batch and interactive (menu driven) on console/terminal.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Spring Batch

    Spring Batch

    Spring Batch is a framework for writing batch applications using Java

    ...Spring Batch provides reusable functions that are essential in processing large volumes of records, including logging/tracing, transaction management, job processing statistics, job restart, skip, and resource management. It also provides more advanced technical services and features that will enable extremely high-volume and high performance batch jobs through optimization and partitioning techniques. Simple as well as complex, high-volume batch jobs can leverage the framework in a highly scalable manner to process significant volumes of information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    GenStage

    GenStage

    Producer and consumer actors with back-pressure for Elixir

    ...Developers implement callbacks like handle_demand and handle_events to control how items are emitted, transformed, and consumed across asynchronous boundaries. Because stages are OTP processes, you gain fault tolerance, supervised restarts, and concurrency tuned via configurable demand and partitioning. GenStage underpins higher-level libraries like Flow and Broadway, but it can also be used directly for custom pipelines where timing and throughput matter. Its clear separation of concerns encourages testable, composable stages that can be rearranged as requirements evolve. In production, this leads to predictable, resilient dataflows for event ingestion, batching, and parallel processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    YugabyteDB

    YugabyteDB

    The high-performance distributed SQL database for global apps

    ...Achieve continuous availability during infrastructure failures and on maintenance tasks such as software upgrades and distributed backups. Use powerful replication and data geo-partitioning capabilities to achieve the latency, resilience, and compliance your applications need. Deploy YugabyteDB in public, private, and hybrid cloud environments, on VMs, containers, or bare metal. Lower TCO for dev, staging, and production deployments. Powering a wide range of cloud-native workloads. Risk-free migration from on-prem to the cloud.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    AWS SDK for pandas

    AWS SDK for pandas

    Easy integration with Athena, Glue, Redshift, Timestream, Neptune

    ...With a few lines of code, you can read from and write to Amazon S3 in Parquet/CSV/JSON/ORC, register tables in the AWS Glue Data Catalog, and query with Amazon Athena directly into pandas. The library abstracts efficient patterns like partitioning, compression, and vectorized I/O so you get performant data lake operations without hand-rolling boilerplate. It also supports Redshift, OpenSearch, and other services, enabling ETL tasks that blend SQL engines and Python transformations. Operational helpers handle IAM, sessions, and concurrency while exposing knobs for encryption, versioning, and catalog consistency. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    TurboGears

    TurboGears

    Python web framework with full-stack layer

    TurboGears is a hybrid web framework able to act both as a Full Stack framework or as a Microframework. TurboGears helps you get going fast and gets out of your way when you want it! TurboGears can be used both as a full stack framework or as a microframework in single-file mode. TurboGears 2 is built on top of the experience of several next-generation web frameworks including TurboGears 1 (of course), Django, and Rails. All of these frameworks had limitations that frustrated us, and TG2 was...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Petastorm

    Petastorm

    Petastorm library enables single machine or distributed training

    Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code. Petastorm is an open-source data access library developed at Uber ATG. This library enables single machine or distributed training and evaluation of deep learning models directly from datasets in Apache Parquet format. Petastorm supports popular...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    GPT fdisk is a disk partitioning tool loosely modeled on Linux fdisk, but used for modifying GUID Partition Table (GPT) disks. The related FixParts utility fixes some common problems on Master Boot Record (MBR) disks.
    Leader badge
    Downloads: 24,843 This Week
    Last Update:
    See Project
  • 22
    Streamline Analyst

    Streamline Analyst

    AI agent that streamlines the entire process of data analysis

    ...This Data Analysis Agent effortlessly automates all the tasks such as data cleaning, preprocessing, and even complex operations like identifying target objects, partitioning test sets, and selecting the best-fit models based on your data. With Streamline Analyst, results visualization and evaluation become seamless.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    The Recursive Zonal Equal Area (EQ) Sphere Partitioning Toolbox is a suite of Matlab functions for partitioning and point distribution on multidimensional spheres.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    ...IBEIP is also applicable to analyzing high-mutant antigens and complex epitope structures that span adjacent monomers. Citation: Yun-Fei Ma, Ye Liu. Intelligent Batch Epitope Identification and Partitioning: an intelligent tool for the identification of B cell dominant epitopes. Brief. Bioinform. 2025, 26(4):bbaf310.
    Downloads: 839 This Week
    Last Update:
    See Project
  • 25
    CXXGraph

    CXXGraph

    Header-Only C++ Library for Graph Manipulation and Algorithms

    CXXGraph is a small library, header only, that manages the Graph and it's algorithms in C++. In other words a "Comprehensive C++ Graph Library".
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB