Compare the Top Distributed Databases for Cloud as of August 2025 - Page 2

  • 1
    GaussDB

    Huawei Cloud

    GaussDB(for MySQL) is a next-generation, MySQL-compatible, enterprise-class distributed database service. It uses a decoupled compute and storage architecture with data functions virtualization (DFV) storage that auto-scales up to 128 TB per DB instance. There is virtually no risk of data loss. It supports millions of queries per second (QPS) and cross-AZ deployment, combining the performance and reliability of commercial databases with the flexibility of open-source databases. By decoupling compute and storage, connecting them through RDMA, and using a "log as database" architecture, it delivers seven times the performance of open-source databases. To scale read capacity and performance, you can add up to 15 read replicas for a primary node within minutes. GaussDB(for MySQL) is fully compatible with MySQL, so you can migrate your MySQL databases to it without rebuilding existing applications and without sharding.
    Starting Price: $2,586.04 per month
  • 2
    CrateDB

    CrateDB

    The enterprise database for time series, documents, and vectors. Store any type of data and combine the simplicity of SQL with the scalability of NoSQL. CrateDB is an open source distributed database running queries in milliseconds, whatever the complexity, volume and velocity of data.
  • 3
    GigaSpaces

    GigaSpaces

    Smart DIH is an operational data hub that powers real-time modern applications. It unleashes the power of customers’ data by transforming data silos into assets, turning organizations into data-driven enterprises. Smart DIH consolidates data from multiple heterogeneous systems into a highly performant data layer. Low-code tools empower data professionals to deliver data microservices in hours, shortening development cycles and ensuring data consistency across all digital channels. XAP Skyline is a cloud-native, in-memory data grid (IMDG) and developer framework designed for mission-critical, cloud-native apps. XAP Skyline delivers maximal throughput, microsecond latency, and scale while maintaining transactional consistency. It provides extreme performance, significantly reducing data access time, which is crucial for real-time decisioning and transactional applications. XAP Skyline is used in financial services, retail, and other industries where speed and scalability are critical.
  • 4
    HCL OneDB

    HCL Software

    Build and run distributed, database-driven, fully cloud-native enterprise applications with the highest levels of availability, scalability, and performance. For enterprises just starting their cloud-native journey or those already executing a multi-cloud strategy, OneDB offers the flexibility, reliability, and ease of use needed to meet your application needs. Capturing the value of data for insight and actionable intelligence is made easier through fully automated database administration. You can drastically reduce the need for deep technical expertise to launch new ideas and still stay ahead of the competition. OneDB is great for application development. From broad support of interfaces and APIs to extensive programming language support, developers will find everything they need with OneDB. HCL offers the most versatile cloud-native database in the market.
  • 5
    PolarDB

    Alibaba Cloud

    PolarDB is designed for business-critical database applications that require fast performance, high concurrency, and automatic scaling. You can scale up to millions of queries per second and 100 TB per database cluster with 15 low latency read replicas. PolarDB is six times faster than standard MySQL databases, and delivers the security, reliability, and availability of traditional commercial databases at 1/10 the cost. PolarDB embodies the proven database technology and best practices honed over the last decade that supported hyper-scale events such as the Alibaba Double 11 Global Shopping Festival. To support the developer community, we are introducing Always Free ApsaraDB for PolarDB (all three variations) when you use no more than 1 instance (2-core and 8GB of memory), and up to 50GB of storage. Register now and renew each month to continue this benefit. Regional resource availability is subject to change.
  • 6
    Citus

    Citus Data

    Citus gives you the Postgres you love, plus the superpower of distributed tables. 100% open source. Now with schema-based and row-based sharding, plus Postgres 16 support. Scale Postgres by distributing data & queries. You can start with a single Citus node, then add nodes & rebalance shards when you need to grow. Speed up queries by 20x to 300x (or more) through parallelism, keeping more data in memory, higher I/O bandwidth, and columnar compression. Citus is an extension (not a fork) to the latest Postgres versions, so you can use your familiar SQL toolset & leverage your Postgres expertise. Reduce your infrastructure headaches by using a single database for both your transactional and analytical workloads. Download and use Citus open source for free. You can manage Citus yourself, embrace open source, and help us improve Citus via GitHub. Focus on your application & forget about your database. Run your app on Citus in the cloud with Azure Cosmos DB for PostgreSQL.
    Starting Price: $0.27 per hour
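
    The "add nodes and rebalance shards" idea in the Citus description can be sketched in a few lines. This is a minimal illustration of hash-based row distribution, not Citus's actual hash function or rebalancer: the shard count of 32, the CRC32 hash, and the names `shard_for`, `place_shards`, and `rebalance` are all assumptions made for the example.

```python
import zlib

SHARD_COUNT = 32  # assumed fixed number of shards for one distributed table


def shard_for(key: str, shard_count: int = SHARD_COUNT) -> int:
    """Map a distribution-column value to a shard using a stable hash."""
    return zlib.crc32(key.encode("utf-8")) % shard_count


def place_shards(nodes: list[str], shard_count: int = SHARD_COUNT) -> dict[int, str]:
    """Initial placement: assign shards to nodes round-robin."""
    return {shard: nodes[shard % len(nodes)] for shard in range(shard_count)}


def rebalance(placement: dict[int, str], nodes: list[str]) -> dict[int, str]:
    """Even out shard counts across `nodes`, moving only surplus shards.

    Assumes every node already present in `placement` is also in `nodes`.
    """
    new_placement = dict(placement)
    held = {node: [] for node in nodes}
    for shard, node in sorted(placement.items()):
        held[node].append(shard)

    base, extra = divmod(len(placement), len(nodes))
    # Nodes that already hold the most shards keep the slightly larger quota.
    ranked = sorted(nodes, key=lambda n: -len(held[n]))
    quota = {n: base + (1 if i < extra else 0) for i, n in enumerate(ranked)}

    # Shards beyond a node's quota are free to move.
    surplus = [s for n in nodes for s in held[n][quota[n]:]]
    for n in nodes:
        while len(held[n]) < quota[n]:
            shard = surplus.pop()
            held[n].append(shard)
            new_placement[shard] = n
    return new_placement
```

    Because a row's shard depends only on its key, adding a node never changes which shard a row belongs to. Only whole shards move, and only as many as are needed to even out the load.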
  • 7
    Tarantool

    Tarantool

    Corporations need a way to ensure uninterrupted operation of their systems, high speed of data processing, and reliability of storage. The in-memory technologies have proven themselves well in solving these problems. For more than 10 years, Tarantool has been helping companies all over the world build smart caches, data marts, and golden client profiles while saving server capacity. Reduce the cost of storing credentials compared to siloed solutions and improve the service and security of client applications. Reduce data management costs of maintaining a large number of disparate systems that store customer identities. Increase sales by improving the speed and quality of customer recommendations for goods or services through the analysis of user behavior and user data. Improve mobile and web channel service by accelerating frontends to reduce user outflow. IT systems of large organizations operate in a closed loop of a local network, where data circulates unprotected.
  • 8
    Couchbase

    Couchbase

    Unlike other NoSQL databases, Couchbase provides an enterprise-class, multicloud-to-edge database that offers the robust capabilities required for business-critical applications on a highly scalable and available platform. As a distributed cloud-native database, Couchbase runs in modern dynamic environments and on any cloud, either customer-managed or fully managed as-a-service. Couchbase is built on open standards, combining the best of NoSQL with the power and familiarity of SQL, to simplify the transition from mainframe and relational databases. Couchbase Server is a multipurpose, distributed database that fuses the strengths of relational databases, such as SQL and ACID transactions, with JSON’s versatility, on a foundation that is extremely fast and scalable. It’s used across industries for things like user profiles, dynamic product catalogs, GenAI apps, vector search, high-speed caching, and much more.
  • 9
    MarkLogic

    Progress Software

    Unlock data value, accelerate insightful decisions, and securely achieve data agility with the MarkLogic data platform. Combine your data with everything known about it (metadata) in a single service and reveal smarter decisions—faster. Get a faster, trusted way to securely connect data and metadata, create and interpret meaning, and consume high-quality contextualized data across the enterprise with the MarkLogic data platform. Know your customers in-the-moment and provide relevant and seamless experiences, reveal new insights to accelerate innovation, and easily enable governed access and compliance with a single data platform. MarkLogic provides a proven foundation to help you achieve your key business and technical outcomes—now and in the future.
  • 10
    VoltDB

    VoltDB

    Volt Active Data is a data platform built to make your entire tech stack leaner, faster, and less expensive, so that your applications (and your company) can scale seamlessly to meet the ultra-low latency SLAs of 5G, IoT, edge computing, and whatever comes next. Designed to augment your existing big data investments, such as NoSQL, Hadoop, Kubernetes, Kafka, and traditional databases or data warehouses, Volt Active Data replaces the various layers typically required to make contextual decisions on streaming data with a single, unified layer that can handle ingest to action in less than 10 milliseconds. The world is full of data that’s generated, stored, forgotten, and then deleted. “Active Data” is data that needs to be acted on immediately to gain business value from it. There are lots of traditional and NoSQL data storage products that you can use to keep such data. There’s also data that you can make money from, if only you can act on it fast enough to ‘influence the moment’.
  • 11
    Neo4j

    Neo4j

    Neo4j’s graph data platform is purpose-built to leverage not only data but also data relationships. Using Neo4j, developers build intelligent applications that traverse today's large, interconnected datasets in real time. Powered by a native graph storage and processing engine, Neo4j’s graph database delivers an intuitive, flexible and secure database for unique, actionable insights.
  • 12
    Apache HBase

    The Apache Software Foundation

    Use Apache HBase™ when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. Automatic failover support between RegionServers. Easy to use Java API for client access. Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options. Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX.
  • 13
    Google Cloud Bigtable
    Google Cloud Bigtable is a fully managed, scalable NoSQL database service for large analytical and operational workloads. Fast and performant: Use Cloud Bigtable as the storage engine that grows with you from your first gigabyte to petabyte-scale for low-latency applications as well as high-throughput data processing and analytics. Seamless scaling and replication: Start with a single node per cluster, and seamlessly scale to hundreds of nodes dynamically supporting peak demand. Replication also adds high availability and workload isolation for live serving apps. Simple and integrated: Fully managed service that integrates easily with big data tools like Hadoop, Dataflow, and Dataproc. Plus, support for the open source HBase API standard makes it easy for development teams to get started.
  • 14
    TiDB

    PingCAP

    An open-source, cloud-native, distributed SQL database for elastic scale and real-time analytics. Supported by a wealth of open-source data migration tools in the ecosystem, TiDB gives you the freedom to choose your own vendor and avoid lock-in. Purpose-built to deliver SQL at scale, TiDB eliminates the scaling problems of traditional relational databases without intrusion into your application. As an HTAP database platform, it enables real-time situational awareness and decision making on live transactional data and eliminates friction between IT and business goals. TiDB is ACID-compliant and strongly consistent. You can use TiDB as a scale-out MySQL database with familiar SQL syntax and ecosystem tools. TiDB automatically shards your data so you don’t have to do it manually. You can simply add new nodes to scale horizontally and elastically to meet your business growth. TiDB simplifies the ETL process and automatically recovers from errors.
  • 15
    Vitess

    Vitess

    A database clustering system for horizontal scaling of MySQL. Vitess combines many important MySQL features with the scalability of a NoSQL database. Its built-in sharding features let you grow your database without adding sharding logic to your application. Vitess automatically rewrites queries that hurt database performance. It also uses caching mechanisms to mediate queries and prevent duplicate queries from simultaneously reaching your database. Vitess automatically handles functions like master failovers and backups. It uses a lock server to track and administer servers, letting your application be blissfully ignorant of database topology. Vitess eliminates the high-memory overhead of MySQL connections. Vitess servers easily handle thousands of connections at once. MySQL doesn’t natively support sharding, but you will likely need it as your database grows.
  • 16
    Apache Geode
    Build high-speed, data-intensive applications that elastically meet performance requirements at any scale. Take advantage of Apache Geode's unique technology that blends advanced techniques for data replication, partitioning and distributed processing. Apache Geode provides a database-like consistency model, reliable transaction processing and a shared-nothing architecture to maintain very low latency performance with high concurrency processing. Data can easily be partitioned (sharded) or replicated between nodes allowing performance to scale as needed. Durability is ensured through redundant in-memory copies and disk-based persistence. Super fast write-ahead-logging (WAL) persistence with a shared-nothing architecture that is optimized for fast parallel recovery of nodes or an entire cluster.
  • 17
    FoundationDB

    FoundationDB

    FoundationDB is multi-model, meaning you can store many types of data in a single database. All data is safely stored, distributed, and replicated in the Key-Value Store component. FoundationDB is easy to install, grow, and manage. It has a distributed architecture that gracefully scales out and handles faults while acting like a single ACID database. FoundationDB provides amazing performance on commodity hardware, allowing you to support very heavy loads at low cost. FoundationDB has been running in production for years and has been hardened with lessons learned. Backing FoundationDB up is an unmatched testing system based on a deterministic simulation engine. We encourage your participation in our open-source community! Join us in technical and user discussions on the community forums, and learn how to contribute.
  • 18
    Apache Accumulo

    The Apache Software Foundation

    With Apache Accumulo, users can store and manage large data sets across a cluster. Accumulo uses Apache Hadoop's HDFS to store its data and Apache ZooKeeper for consensus. While many users interact directly with Accumulo, several open source projects use Accumulo as their underlying store. To learn more about Accumulo, take the Accumulo tour, read the user manual and run the Accumulo example code. Feel free to contact us if you have any questions. Accumulo has a programming mechanism (called Iterators) that can modify key/value pairs at various points in the data management process. Every Accumulo key/value pair has its own security label, which limits query results based on user authorizations. Accumulo runs on a cluster using one or more HDFS instances. Nodes can be added or removed as the amount of data stored in Accumulo changes.
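
    The per-entry security label described above can be illustrated with a toy filter. This is a simplified sketch, not Accumulo's API: it assumes labels are boolean expressions built from `&`, `|`, and parentheses, that an empty label is readable by everyone, and that mixing `&` and `|` without parentheses is an error. The names `is_visible` and `scan` are hypothetical.

```python
import re

# A token is either a label term or one of the operators & | ( ).
TOKEN = re.compile(r"\s*([A-Za-z0-9_.:/-]+|[&|()])")


def tokenize(expression: str) -> list[str]:
    tokens, pos = [], 0
    expression = expression.strip()
    while pos < len(expression):
        match = TOKEN.match(expression, pos)
        if not match:
            raise ValueError(f"bad character at offset {pos}")
        tokens.append(match.group(1))
        pos = match.end()
    return tokens


def is_visible(expression: str, authorizations: set[str]) -> bool:
    """Return True if the label expression is satisfied by the authorizations."""
    tokens = tokenize(expression)
    if not tokens:
        return True  # assumption: an empty label is readable by everyone
    pos = 0

    def parse_term() -> bool:
        nonlocal pos
        if pos >= len(tokens):
            raise ValueError("unexpected end of expression")
        token = tokens[pos]
        pos += 1
        if token == "(":
            value = parse_expr()
            if pos >= len(tokens) or tokens[pos] != ")":
                raise ValueError("missing closing parenthesis")
            pos += 1
            return value
        if token in ("&", "|", ")"):
            raise ValueError(f"unexpected token {token!r}")
        return token in authorizations

    def parse_expr() -> bool:
        nonlocal pos
        result = parse_term()
        operator = None
        while pos < len(tokens) and tokens[pos] in ("&", "|"):
            if operator is not None and tokens[pos] != operator:
                raise ValueError("mixing & and | requires parentheses")
            operator = tokens[pos]
            pos += 1
            rhs = parse_term()  # always parse, even if the result is already known
            result = (result and rhs) if operator == "&" else (result or rhs)
        return result

    visible = parse_expr()
    if pos != len(tokens):
        raise ValueError("unexpected trailing tokens")
    return visible


def scan(entries, authorizations: set[str]):
    """Keep only the (key, value) pairs whose label the reader satisfies."""
    return [(key, value) for key, label, value in entries
            if is_visible(label, authorizations)]
```

    For example, with entries labeled `""`, `"admin"`, and `"admin&(audit|finance)"`, a reader holding only `finance` sees just the unlabeled entry, while a reader holding `admin` and `audit` sees all three.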
  • 19
    HerdDB

    Diennea

    HerdDB is a distributed SQL database implemented in Java. It has been designed to be embeddable in any Java Virtual Machine. It is optimized for fast "writes" and primary key read/update access patterns. HerdDB is designed to manage hundreds of tables. It is simple to add and remove hosts and to reconfigure tablespaces to easily distribute the load on multiple systems. HerdDB leverages Apache ZooKeeper and Apache BookKeeper to build a fully replicated, shared-nothing architecture without any single point of failure. At the low level HerdDB is very similar to a key-value NoSQL database. On top of that, an SQL abstraction layer and JDBC Driver support enable every user to leverage existing know-how and port existing applications to HerdDB. At Diennea we developed EmailSuccess, a powerful MTA (Mail Transfer Agent), designed to deliver millions of email messages per hour to inboxes all around the world,
  • 20
    BigchainDB

    BigchainDB

    With high throughput, low latency, powerful query functionality, decentralized control, immutable data storage and built-in asset support, BigchainDB is like a database with blockchain characteristics. BigchainDB allows developers and enterprises to deploy blockchain proof-of-concepts, platforms and applications with a blockchain database, supporting a wide range of industries and use cases. Rather than trying to enhance blockchain technology, BigchainDB starts with a big data distributed database and then adds blockchain characteristics: decentralized control, immutability and the transfer of digital assets. No single point of control. No single point of failure. Decentralized control via a federation of voting nodes makes for a P2P network. Write and run any MongoDB query to search the contents of all stored transactions, assets, metadata and blocks. Powered by MongoDB itself.
  • 21
    OceanBase

    OceanBase

    OceanBase eliminates the complexity of traditional sharding databases, enabling you to effortlessly scale your database to meet ever-growing workloads, whether horizontally, vertically, or even at the tenant level. This facilitates on-the-fly scaling and linear performance growth without downtime or necessitating changes to applications in high-concurrency scenarios, ensuring quicker and more reliable responses to performance-intensive critical workloads. Empower mission-critical workloads and performance-intensive applications across both OLTP and OLAP scenarios, all while maintaining full compatibility with MySQL. 100% ACID Compliance, natively supports distributed transactions with multi-replica strong synchronization built upon Paxos protocols. Experience ultimate query performance that your mission-critical and time-sensitive workloads can depend on. This effectively eliminates downtime, and ensures your mission-critical workload remains always available.
  • 22
    ArangoDB

    ArangoDB

    Natively store data for graph, document and search needs. Utilize feature-rich access with one query language. Map data natively to the database and access it with the best patterns for the job – traversals, joins, search, ranking, geospatial, aggregations – you name it. Polyglot persistence without the costs. Easily design, scale and adapt your architectures to changing needs and with much less effort. Combine the flexibility of JSON with semantic search and graph technology for next generation feature extraction even for large datasets.
  • 23
    Hazelcast

    Hazelcast

    In-Memory Computing Platform. The digital world is different. Microseconds matter. That's why the world's largest organizations rely on us to power their most time-sensitive applications at scale. New data-enabled applications can deliver transformative business power – if they meet today’s requirement of immediacy. Hazelcast solutions complement virtually any database to deliver results that are significantly faster than a traditional system of record. Hazelcast’s distributed architecture provides redundancy for continuous cluster up-time and always available data to serve the most demanding applications. Capacity grows elastically with demand, without compromising performance or availability. The fastest in-memory data grid, combined with third-generation high-speed event processing, delivered through the cloud.
  • 24
    Dgraph

    Hypermode

    Dgraph is an open source, low-latency, high-throughput, native and distributed graph database. Designed to easily scale to meet the needs of small startups as well as large companies with massive amounts of data, Dgraph can handle terabytes of structured data running on commodity hardware with low latency for real-time user queries. It addresses business needs and use cases involving diverse social and knowledge graphs, real-time recommendation engines, semantic search, pattern matching and fraud detection, serving relationship data, and serving web apps.
  • 25
    Holochain

    Holochain

    An end-to-end open source P2P app framework. Local circles of trust provide data integrity without centralized authorities. Holochain delivers the promises of blockchain with a mashup of proven tech that provides self-owned data, a distributed database, and peer accountability. Holochain helps by creating an alternative to the dominant centralized systems of the Internet, protecting our ability to make our own choices, and giving trustworthy information we can act on. We call this 'digital agency', and we believe it will give us the power to face today's complex problems, together. Access other Holochain apps as if they were part of your codebase. No HTTP client, just function calls with optional access control. Compute and data live at the edges, which means there’s no infrastructure for you to maintain and defend. Holochain automatically adapts to disruptions and threats.
  • 26
    Apache Ignite

    Apache Ignite

    Use Ignite as a traditional SQL database by leveraging JDBC drivers, ODBC drivers, or the native SQL APIs that are available for Java, C#, C++, Python, and other programming languages. Seamlessly join, group, aggregate, and order your distributed in-memory and on-disk data. Accelerate your existing applications by 100x using Ignite as an in-memory cache or in-memory data grid that is deployed over one or more external databases. Think of a cache that you can query with SQL, transact, and compute on. Build modern applications that support transactional and analytical workloads by using Ignite as a database that scales beyond the available memory capacity. Ignite allocates memory for your hot data and goes to disk whenever applications query cold records. Execute kilobyte-size custom code over petabytes of data. Turn your Ignite database into a distributed supercomputer for low-latency calculations, complex analytics, and machine learning.
  • 27
    RocksDB

    RocksDB

    RocksDB uses a log-structured database engine, written entirely in C++, for maximum performance. Keys and values are just arbitrarily-sized byte streams. RocksDB is optimized for fast, low-latency storage such as flash drives and high-speed disk drives. RocksDB exploits the full potential of high read/write rates offered by flash or RAM. RocksDB provides everything from basic operations, such as opening and closing a database and reading and writing, to more advanced operations such as merging and compaction filters. RocksDB is adaptable to different workloads. From database storage engines such as MyRocks to application data caching to embedded workloads, RocksDB can be used for a variety of data needs.
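
    To make "log-structured" and "compaction" concrete, here is a toy key-value store in which writes go to an in-memory table, full tables are flushed as immutable sorted runs, and compaction merges the runs. It is a teaching sketch only: the class `TinyLSM`, its methods, and the flush threshold are hypothetical and do not reflect RocksDB's API or on-disk format.

```python
import bisect

TOMBSTONE = None  # marker recorded when a key is deleted


class TinyLSM:
    def __init__(self, memtable_limit: int = 4):
        self.memtable_limit = memtable_limit
        self.memtable = {}  # newest writes, mutable
        self.runs = []      # immutable sorted runs (keys, values), newest first

    def put(self, key: bytes, value) -> None:
        self.memtable[key] = value
        if len(self.memtable) >= self.memtable_limit:
            self.flush()

    def delete(self, key: bytes) -> None:
        # A delete is just a write of a tombstone that shadows older values.
        self.put(key, TOMBSTONE)

    def flush(self) -> None:
        """Freeze the memtable into a sorted, immutable run."""
        if not self.memtable:
            return
        items = sorted(self.memtable.items(), key=lambda kv: kv[0])
        self.runs.insert(0, ([k for k, _ in items], [v for _, v in items]))
        self.memtable = {}

    def get(self, key: bytes):
        """Check the memtable first, then runs from newest to oldest."""
        if key in self.memtable:
            return self.memtable[key]
        for keys, values in self.runs:
            i = bisect.bisect_left(keys, key)
            if i < len(keys) and keys[i] == key:
                return values[i]  # a tombstone yields None, hiding older runs
        return None

    def compact(self) -> None:
        """Merge all runs into one; newer values win and tombstones are dropped."""
        merged = {}
        for keys, values in reversed(self.runs):  # oldest first
            merged.update(zip(keys, values))
        live = sorted(
            ((k, v) for k, v in merged.items() if v is not TOMBSTONE),
            key=lambda kv: kv[0],
        )
        self.runs = [([k for k, _ in live], [v for _, v in live])] if live else []
```

    Writes never modify existing runs, which is what makes this layout friendly to fast sequential storage. The cost is that reads may have to consult several runs until compaction merges them.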
  • 28
    Yugabyte

    Yugabyte

    The Leading High-Performance Distributed SQL Database. Open source, cloud native relational DB for powering global, internet-scale apps. Single-Digit Millisecond Latency. Build blazing fast cloud applications by serving queries directly from the DB. Massive Scale. Achieve millions of transactions per second and store multiple TBs of data per node. Geo-Distribution. Deploy across regions and clouds with synchronous or multi-master replication. Built for Cloud Native Architectures. Develop, deploy and operationalize modern applications faster than ever before with YugabyteDB. Gain Developer Agility. Leverage the full power of PostgreSQL-compatible SQL and distributed ACID transactions. Operate Resilient Services. Ensure continuous availability even when underlying compute, storage or network fails. Scale On-Demand. Add and remove nodes at will. Say no to over-provisioned clusters forever. Lower User Latency.