Alternatives to Apache HBase
Compare Apache HBase alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Apache HBase in 2025. Compare features, ratings, user reviews, pricing, and more from Apache HBase competitors and alternatives in order to make an informed decision for your business.
-
1
Redis
Redis Labs
Redis Labs: home of Redis. Redis Enterprise is the best version of Redis. Go beyond cache; try Redis Enterprise free in the cloud using NoSQL & data caching with the world’s fastest in-memory database. Run Redis at scale, enterprise grade resiliency, massive scalability, ease of management, and operational simplicity. DevOps love Redis in the Cloud. Developers can access enhanced data structures, a variety of modules, and rapid innovation with faster time to market. CIOs love the confidence of working with 99.999% uptime best in class security and expert support from the creators of Redis. Implement relational databases, active-active, geo-distribution, built in conflict distribution for simple and complex data types, & reads/writes in multiple geo regions to the same data set. Redis Enterprise offers flexible deployment options, cloud on-prem, & hybrid. Redis Labs: home of Redis. Redis JSON, Redis Java, Python Redis, Redis on Kubernetes & Redis gui best practices.Starting Price: Free -
2
Amazon DynamoDB
Amazon
Amazon DynamoDB is a key-value and document database that delivers single-digit millisecond performance at any scale. It's a fully managed, multi-region, Multimaster, durable database with built-in security, backup and restore, and in-memory caching for internet-scale applications. DynamoDB can handle more than 10 trillion requests per day and can support peaks of more than 20 million requests per second. Many of the world's fastest-growing businesses such as Lyft, Airbnb, and Redfin as well as enterprises such as Samsung, Toyota, and Capital One depend on the scale and performance of DynamoDB to support their mission-critical workloads. Focus on driving innovation with no operational overhead. Build out your game platform with player data, session history, and leaderboards for millions of concurrent users. Use design patterns for deploying shopping carts, workflow engines, inventory tracking, and customer profiles. DynamoDB supports high-traffic, extreme-scaled events. -
3
RavenDB
RavenDB
RavenDB is the pioneer NoSQL Document Database that is fully transactional (ACID) across your database and throughout your cluster. At a fraction of the total cost of ownership (TCO), our open source distributed database offers high availability and high performance with zero administration. It is designed as an easy to use all-in-one database which minimizes the need for third party addons, tools, or support to boost developer productivity and get your project into production fast. You can setup and secure a data cluster in minutes and deploy in the cloud, on-premise or in a hybrid environment. RavenDB offers a Database as a Service solution, allowing you to pass on all your database operations to us so you can focus exclusively on your application. RavenDB has a built-in storage engine, Voron, that operates at speeds up to 1 million reads per second and 150,000 writes per second on a single node using simple commodity hardware to increase your application’s performance. -
4
Couchbase
Couchbase
Unlike other NoSQL databases, Couchbase provides an enterprise-class, multicloud to edge database that offers the robust capabilities required for business-critical applications on a highly scalable and available platform. As a distributed cloud-native database, Couchbase runs in modern dynamic environments and on any cloud, either customer-managed or fully managed as-a-service. Couchbase is built on open standards, combining the best of NoSQL with the power and familiarity of SQL, to simplify the transition from mainframe and relational databases. Couchbase Server is a multipurpose, distributed database that fuses the strengths of relational databases such as SQL and ACID transactions with JSON’s versatility, with a foundation that is extremely fast and scalable. It’s used across industries for things like user profiles, dynamic product catalogs, GenAI apps, vector search, high-speed caching, and much more. -
5
Apache Cassandra
Apache Software Foundation
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure make it the perfect platform for mission-critical data. Cassandra's support for replicating across multiple datacenters is best-in-class, providing lower latency for your users and the peace of mind of knowing that you can survive regional outages. -
6
Apache Hive
Apache Software Foundation
The Apache Hive data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command line tool and JDBC driver are provided to connect users to Hive. Apache Hive is an open source project run by volunteers at the Apache Software Foundation. Previously it was a subproject of Apache® Hadoop®, but has now graduated to become a top-level project of its own. We encourage you to learn about the project and contribute your expertise. Traditional SQL queries must be implemented in the MapReduce Java API to execute SQL applications and queries over distributed data. Hive provides the necessary SQL abstraction to integrate SQL-like queries (HiveQL) into the underlying Java without the need to implement queries in the low-level Java API. -
7
Apache Kudu
The Apache Software Foundation
A Kudu cluster stores tables that look just like tables you're used to from relational (SQL) databases. A table can be as simple as a binary key and value, or as complex as a few hundred different strongly-typed attributes. Just like SQL, every table has a primary key made up of one or more columns. This might be a single column like a unique user identifier, or a compound key such as a (host, metric, timestamp) tuple for a machine time-series database. Rows can be efficiently read, updated, or deleted by their primary key. Kudu's simple data model makes it a breeze to port legacy applications or build new ones, no need to worry about how to encode your data into binary blobs or make sense of a huge database full of hard-to-interpret JSON. Tables are self-describing, so you can use standard tools like SQL engines or Spark to analyze your data. Kudu's APIs are designed to be easy to use. -
8
Google Cloud Bigtable
Google
Google Cloud Bigtable is a fully managed, scalable NoSQL database service for large analytical and operational workloads. Fast and performant: Use Cloud Bigtable as the storage engine that grows with you from your first gigabyte to petabyte-scale for low-latency applications as well as high-throughput data processing and analytics. Seamless scaling and replication: Start with a single node per cluster, and seamlessly scale to hundreds of nodes dynamically supporting peak demand. Replication also adds high availability and workload isolation for live serving apps. Simple and integrated: Fully managed service that integrates easily with big data tools like Hadoop, Dataflow, and Dataproc. Plus, support for the open source HBase API standard makes it easy for development teams to get started. -
9
Hypertable
Hypertable
Hypertable delivers scalable database capacity at maximum performance to speed up your big data application and reduce your hardware footprint. Hypertable delivers maximum efficiency and superior performance over the competition which translates into major cost savings. A proven scalable design that powers hundreds of Google services. All the benefits of open source with a strong and thriving community. C++ implementation for optimum performance. 24/7/365 support for your business-critical big data application. Unparalleled access to Hypertable brain power by the employer of all core Hypertable developers. Hypertable was designed for the express purpose of solving the scalability problem, a problem that is not handled well by a traditional RDBMS. Hypertable is based on a design developed by Google to meet their scalability requirements and solves the scale problem better than any of the other NoSQL solutions out there. -
10
DataStax
DataStax
The Open, Multi-Cloud Stack for Modern Data Apps. Built on open-source Apache Cassandra™. Global-scale and 100% uptime without vendor lock-in. Deploy on multi-cloud, on-prem, open-source, and Kubernetes. Elastic and pay-as-you-go for improved TCO. Start building faster with Stargate APIs for NoSQL, real-time, reactive, JSON, REST, and GraphQL. Skip the complexity of multiple OSS projects and APIs that don’t scale. Ideal for commerce, mobile, AI/ML, IoT, microservices, social, gaming, and richly interactive applications that must scale-up and scale-down with demand. Get building modern data applications with Astra, a database-as-a-service powered by Apache Cassandra™. Use REST, GraphQL, JSON with your favorite full-stack framework Richly interactive apps that are elastic and viral-ready from Day 1. Pay-as-you-go Apache Cassandra DBaaS that scales effortlessly and affordably. -
11
ScyllaDB
ScyllaDB
ScyllaDB is the database for data-intensive apps that require high performance and low latency. It enables teams to harness the ever-increasing computing power of modern infrastructures – eliminating barriers to scale as data grows. Unlike any other database, ScyllaDB is a distributed NoSQL database fully compatible with Apache Cassandra and Amazon DynamoDB, yet is built with deep architectural advancements that enable exceptional end-user experiences at radically lower costs. Over 400 game-changing companies like Disney+ Hotstar, Expedia, FireEye, Discord, Zillow, Starbucks, Comcast, and Samsung use ScyllaDB for their toughest database challenges. ScyllaDB is available as free open source software, a fully-supported enterprise product, and a fully managed database-as-a-service (DBaaS) on multiple cloud providers. -
12
Azure Table Storage
Microsoft
Use Azure Table storage to store petabytes of semi-structured data and keep costs down. Unlike many data stores—on-premises or cloud-based—Table storage lets you scale up without having to manually shard your dataset. Availability also isn’t a concern: using geo-redundant storage, stored data is replicated three times within a region—and an additional three times in another region, hundreds of miles away. Table storage is excellent for flexible datasets—web app user data, address books, device information, and other metadata—and lets you build cloud applications without locking down the data model to particular schemas. Because different rows in the same table can have a different structure—for example, order information in one row, and customer information in another—you can evolve your application and table schema without taking it offline. Table storage embraces a strong consistency model. -
13
GridGain
GridGain Systems
The enterprise-grade platform built on Apache Ignite that provides in-memory speed and massive scalability for data-intensive applications and real-time data access across datastores and applications. Upgrade from Ignite to GridGain with no code changes and deploy your clusters securely at global scale with zero downtime. Perform rolling upgrades of your production clusters with no impact on application availability. Replicate across globally distributed data centers to load balance workloads and prevent downtime from regional outages. Secure your data at rest and in motion, and ensure compliance with security and privacy standards. Easily integrate with your organization's authentication and authorization system. Enable full data and user activity auditing. Create automated schedules for full and incremental backups. Restore your cluster to the last stable state with snapshots and point-in-time recovery. -
14
ClickHouse
ClickHouse
ClickHouse is a fast open-source OLAP database management system. It is column-oriented and allows to generate analytical reports using SQL queries in real-time. ClickHouse's performance exceeds comparable column-oriented database management systems currently available on the market. It processes hundreds of millions to more than a billion rows and tens of gigabytes of data per single server per second. ClickHouse uses all available hardware to its full potential to process each query as fast as possible. Peak processing performance for a single query stands at more than 2 terabytes per second (after decompression, only used columns). In distributed setup reads are automatically balanced among healthy replicas to avoid increasing latency. ClickHouse supports multi-master asynchronous replication and can be deployed across multiple datacenters. All nodes are equal, which allows avoiding having single points of failure. -
15
Apache Accumulo
Apache Corporation
With Apache Accumulo, users can store and manage large data sets across a cluster. Accumulo uses Apache Hadoop's HDFS to store its data and Apache ZooKeeper for consensus. While many users interact directly with Accumulo, several open source projects use Accumulo as their underlying store. To learn more about Accumulo, take the Accumulo tour, read the user manual and run the Accumulo example code. Feel free to contact us if you have any questions. Accumulo has a programming mechanism (called Iterators) that can modify key/value pairs at various points in the data management process. Every Accumulo key/value pair has its own security label which limits query results based off user authorizations. Accumulo runs on a cluster using one or more HDFS instances. Nodes can be added or removed as the amount of data stored in Accumulo changes. -
16
eXtremeDB
McObject
How is platform independent eXtremeDB different? - Hybrid data storage. Unlike other IMDS, eXtremeDB can be all-in-memory, all-persistent, or have a mix of in-memory tables and persistent tables - Active Replication Fabric™ is unique to eXtremeDB, offering bidirectional replication, multi-tier replication (e.g. edge-to-gateway-to-gateway-to-cloud), compression to maximize limited bandwidth networks and more - Row & Columnar Flexibility for Time Series Data supports database designs that combine row-based and column-based layouts, in order to best leverage the CPU cache speed - Embedded and Client/Server. Fast, flexible eXtremeDB is data management wherever you need it, and can be deployed as an embedded database system, and/or as a client/server database system -A hard real-time deterministic option in eXtremeDB/rt Designed for use in resource-constrained, mission-critical embedded systems. Found in everything from routers to satellites to trains to stock markets worldwide -
17
Azure Cosmos DB
Microsoft
Azure Cosmos DB is a fully managed NoSQL database service for modern app development with guaranteed single-digit millisecond response times and 99.999-percent availability backed by SLAs, automatic and instant scalability, and open source APIs for MongoDB and Cassandra. Enjoy fast writes and reads anywhere in the world with turnkey multi-master global distribution. Reduce time to insight by running near-real time analytics and AI on the operational data within your Azure Cosmos DB NoSQL database. Azure Synapse Link for Azure Cosmos DB seamlessly integrates with Azure Synapse Analytics without data movement or diminishing the performance of your operational data store. -
18
Aerospike
Aerospike
Aerospike is the global leader in next-generation, real-time NoSQL data solutions for any scale. Aerospike enterprises overcome seemingly impossible data bottlenecks to compete and win with a fraction of the infrastructure complexity and cost of legacy NoSQL databases. Aerospike’s patented Hybrid Memory Architecture™ delivers an unbreakable competitive advantage by unlocking the full potential of modern hardware, delivering previously unimaginable value from vast amounts of data at the edge, to the core and in the cloud. Aerospike empowers customers to instantly fight fraud; dramatically increase shopping cart size; deploy global digital payment networks; and deliver instant, one-to-one personalization for millions of customers. Aerospike customers include Airtel, Banca d’Italia, Nielsen, PayPal, Snap, Verizon Media and Wayfair. The company is headquartered in Mountain View, Calif., with additional locations in London; Bengaluru, India; and Tel Aviv, Israel. -
19
InfinityDB
InfinityDB
InfinityDB Embedded is a Java NoSQL database, a hierarchical sorted key value store. It is high-performance, multi-core, flexible, and maintenance-free. InfinityDB Encrypted database and InfinityDB Client/Server database are now available as well. InfinityDB has the highest available performance, according to our customers and the provided performance tests: Multi-core overlapping operations scale almost linearly in thread count, threads use fair scheduling, with very low inter-thread interference, random I/O scales logarithmically in file size, with no size limit, caches grow only as used, and are packed efficiently, database open is immediate, even for recovery after abrupt exit. -
20
Riak KV
Riak
At Riak, we are distributed systems experts and we work with Application teams to overcome these distributed system challenges. Riak’s Riak® is a distributed NoSQL database that delivers unmatched Resiliency beyond typical “high availability” offerings. Innovative technology to ensure data accuracy and never lose a write. Massive scale on commodity hardware. Common code foundation with true multi-model support. Riak® provides all this, while still focused on ease of operations. Chose Riak® KV flexible key-value data model for web scale profile and session management, real-time big data, catalog, content management, customer 360, digital messaging, and more use cases. Chose Riak® TS for IoT and time series use cases. When seconds of latency can cost thousands of dollars and an outage millions, the call for scalable, highly available databases that are easy to operationalize is resoundingly clear. Riak performs as promised and keeps the lights on.Starting Price: $0 -
21
FoundationDB
FoundationDB
FoundationDB is multi-model, meaning you can store many types data in a single database. All data is safely stored, distributed, and replicated in the Key-Value Store component. FoundationDB is easy to install, grow, and manage. It has a distributed architecture that gracefully scales out, and handles faults while acting like a single ACID database. FoundationDB provides amazing performance on commodity hardware, allowing you to support very heavy loads at low cost. FoundationDB has been running in production for years and been hardened with lessons learned. Backing FoundationDB up is an unmatched testing system based on a deterministic simulation engine. We encourage your participation in our open-source community! Join us in technical and user discussions on the community forums, and learn how to contribute. -
22
Apache Ignite
Apache Ignite
Use Ignite as a traditional SQL database by leveraging JDBC drivers, ODBC drivers, or the native SQL APIs that are available for Java, C#, C++, Python, and other programming languages. Seamlessly join, group, aggregate, and order your distributed in-memory and on-disk data. Accelerate your existing applications by 100x using Ignite as an in-memory cache or in-memory data grid that is deployed over one or more external databases. Think of a cache that you can query with SQL, transact, and compute on. Build modern applications that support transactional and analytical workloads by using Ignite as a database that scales beyond the available memory capacity. Ignite allocates memory for your hot data and goes to disk whenever applications query cold records. Execute kilobyte-size custom code over petabytes of data. Turn your Ignite database into a distributed supercomputer for low-latency calculations, complex analytics, and machine learning. -
23
Apache Parquet
The Apache Software Foundation
We created Parquet to make the advantages of compressed, efficient columnar data representation available to any project in the Hadoop ecosystem. Parquet is built from the ground up with complex nested data structures in mind, and uses the record shredding and assembly algorithm described in the Dremel paper. We believe this approach is superior to simple flattening of nested namespaces. Parquet is built to support very efficient compression and encoding schemes. Multiple projects have demonstrated the performance impact of applying the right compression and encoding scheme to the data. Parquet allows compression schemes to be specified on a per-column level, and is future-proofed to allow adding more encodings as they are invented and implemented. Parquet is built to be used by anyone. The Hadoop ecosystem is rich with data processing frameworks, and we are not interested in playing favorites. -
24
Greenplum
Greenplum Database
Greenplum Database® is an advanced, fully featured, open source data warehouse. It provides powerful and rapid analytics on petabyte scale data volumes. Uniquely geared toward big data analytics, Greenplum Database is powered by the world’s most advanced cost-based query optimizer delivering high analytical query performance on large data volumes. Greenplum Database® project is released under the Apache 2 license. We want to thank all our current community contributors and are interested in all new potential contributions. For the Greenplum Database community no contribution is too small, we encourage all types of contributions. An open-source massively parallel data platform for analytics, machine learning and AI. Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas. Experience the fully featured, integrated, open source analytics platform. -
25
RocksDB
RocksDB
RocksDB uses a log structured database engine, written entirely in C++, for maximum performance. Keys and values are just arbitrarily-sized byte streams. RocksDB is optimized for fast, low latency storage such as flash drives and high-speed disk drives. RocksDB exploits the full potential of high read/write rates offered by flash or RAM. RocksDB provides basic operations such as opening and closing a database, reading and writing to more advanced operations such as merging and compaction filters. RocksDB is adaptable to different workloads. From database storage engines such as MyRocks to application data caching to embedded workloads, RocksDB can be used for a variety of data needs. -
26
LeanXcale
LeanXcale
LeanXcale is a fast and scalable database that combines the characteristics of SQL and NoSQL. It is built to ingest massive batch and real-time data pipelines and make it available through SQL or GIS for any use, such as operational applications, analytics, dashboarding, or machine learning processing. No matter what stack you use, LeanXcale provides you both SQL and NoSQL interfaces. KiVi storage engine is a relational key-value data store. Users can access the data not only through the standard SQL API but also through a direct ACID key-value interface. This key-value interface allows users to perform data ingestion at very high rates and very efficiently by avoiding SQL processing overhead. Highly-scalable, efficient and distributed storage engine distributed data along the cluster to improve the performance and increase the reliability.Starting Price: $0.127 per GB per month -
27
Dgraph
Hypermode
Dgraph is an open source, low-latency, high throughput, native and distributed graph database. Designed to easily scale to meet the needs of small startups as well as large companies with massive amounts of data, DGraph can handle terabytes of structured data running on commodity hardware with low latency for real time user queries. It addresses business needs and uses cases involving diverse social and knowledge graphs, real-time recommendation engines, semantic search, pattern matching and fraud detection, serving relationship data, and serving web apps. -
28
LedisDB
LedisDB
Ledisdb is a high-performance NoSQL database library and server written in Go. It's similar to Redis but store data in disk. It supports many data structures including kv, list, hash, zset, set. LedisDB now supports multiple different databases as backends. -
29
BangDB
BangDB
BangDB natively integrates AI, streaming, graph, analytics within the DB itself to enable users to deal with complex data of different kinds, such as text, images, videos, objects etc. for real time data processing and analysis Ingest or stream any data, process it, train models, do prediction, find patterns, take action and automate all these to enable use cases such as IOT monitoring, fraud or disruption prevention, log analysis, lead generation, 1-on-1 personalisation and many more. Today’s use cases require different kinds of data to be ingested, processed, and queried at the same time for a given problem. BangDB supports most of the useful data formats to allow user to solve the problem in a simple manner. Rise of real time data pushes for real time streaming and predictive data analytics for advanced and optimized business operations.Starting Price: $2,499 per year -
30
Alibaba Cloud Tablestore
Alibaba Cloud
Tablestore enables seamless expansion of data size and access concurrency through data sharding and server load balancer technologies, providing storage of and real-time access to massive structured data. Three copies of data with high consistency, full host, service high availability and data high reliability. Provides full/incremental data tunnels, seamlessly interconnecting with various products for big data analysis and real-time stream computing. Distributed architecture, single table auto scaling, support of 10-PB-level data and 10-million-level access concurrency. Multi-dimensional and multi-level security protection and resource access management to ensure data security. The low latency, high concurrency, elastic resources and Pay-As-You-Go billing method of this service enables your risk control system to always operate in optimal conditions, allowing you to strictly control transaction risks.Starting Price: $0.00010 per GB -
31
IBM Cloudant
IBM
IBM Cloudant® is a distributed database that is optimized for handling heavy workloads that are typical of large, fast-growing web and mobile apps. Available as an SLA-backed, fully managed IBM Cloud™ service, Cloudant elastically scales throughput and storage independently. Instantly deploy an instance, create databases and independently scale throughput capacity and data storage to meet your application requirements. Encrypt all data, with optional user-defined encryption key management through IBM Key Protect, and integrate with IBM Identity and Access Management. Get continuous availability as Cloudant distributes data across availability zones and 6 regions for app performance and disaster recovery requirements. Get continuous availability as Cloudant distributes data across availability zones and 6 regions for app performance and disaster recovery requirements. -
32
AllegroGraph
Franz Inc.
AllegroGraph is a breakthrough solution that allows infinite data integration through a patented approach unifying all data and siloed knowledge into an Entity-Event Knowledge Graph solution that can support massive big data analytics. AllegroGraph utilizes unique federated sharding capabilities that drive 360-degree insights and enable complex reasoning across a distributed Knowledge Graph. AllegroGraph provides users with an integrated version of Gruff, a unique browser-based graph visualization software tool for exploring and discovering connections within enterprise Knowledge Graphs. Franz’s Knowledge Graph Solution includes both technology and services for building industrial strength Entity-Event Knowledge Graphs based on best-of-class tools, products, knowledge, skills and experience. -
33
Hazelcast
Hazelcast
In-Memory Computing Platform. The digital world is different. Microseconds matter. That's why the world's largest organizations rely on us to power their most time-sensitive applications at scale. New data-enabled applications can deliver transformative business power – if they meet today’s requirement of immediacy. Hazelcast solutions complement virtually any database to deliver results that are significantly faster than a traditional system of record. Hazelcast’s distributed architecture provides redundancy for continuous cluster up-time and always available data to serve the most demanding applications. Capacity grows elastically with demand, without compromising performance or availability. The fastest in-memory data grid, combined with third-generation high-speed event processing, delivered through the cloud. -
34
InterSystems IRIS
InterSystems
InterSystems IRIS is a complete cloud-first data platform that includes a multi-model transactional data management engine, an application development platform, and interoperability engine, and an open analytics platform. It is the next generation of our proven data management software.It includes the capabilities of InterSystems Cache and Ensemble, plus a wealth of exciting new capabilities to make it easy to build and deploy cloud based, analytics-intensive enterprise applications with even greater performance and scalability. InterSystems IRIS provides a set of APIs to operate with transactional persistent data simultaneously: key-value, relational, object, document, multidimensional. Data can be managed by SQL, Java, node.js, .NET, C++, Python, and native server-side ObjectScript language. InterSystems IRIS includes -
35
Apache CouchDB
The Apache Software Foundation
Apache CouchDB™ lets you access your data where you need it. The Couch Replication Protocol is implemented in a variety of projects and products that span every imaginable computing environment from globally distributed server-clusters, over mobile phones to web browsers. Store your data safely, on your own servers, or with any leading cloud provider. Your web- and native applications love CouchDB, because it speaks JSON natively and supports binary data for all your data storage needs. The Couch Replication Protocol lets your data flow seamlessly between server clusters to mobile phones and web browsers, enabling a compelling offline-first user-experience while maintaining high performance and strong reliability. CouchDB comes with a developer-friendly query language, and optionally MapReduce for simple, efficient, and comprehensive data retrieval. -
36
Oracle NoSQL Database
Oracle
Oracle NoSQL Database is designed to handle high-volume, high-velocity data applications requiring low-latency responses and flexible data models. It supports JSON, table, and key-value data types, and operates both on-premise and as a cloud service. The database scales elastically to meet dynamic workloads and provides distributed data storage across multiple shards, ensuring high availability and rapid failover. It includes Python, Node.js, Java, C, C#, and REST API drivers for easy application development. Additionally, it integrates with Oracle products such as IoT, Golden Gate, and Fusion Middleware. Oracle NoSQL Database Cloud Service is a fully managed service, freeing developers from backend infrastructure management. Oracle NoSQL Database Cloud Service is a fully managed database service for developers who want to focus on application development without dealing with the hassle of managing the back-end hardware and software infrastructure. -
37
Apache Druid
Druid
Apache Druid is an open source distributed data store. Druid’s core design combines ideas from data warehouses, timeseries databases, and search systems to create a high performance real-time analytics database for a broad range of use cases. Druid merges key characteristics of each of the 3 systems into its ingestion layer, storage format, querying layer, and core architecture. Druid stores and compresses each column individually, and only needs to read the ones needed for a particular query, which supports fast scans, rankings, and groupBys. Druid creates inverted indexes for string values for fast search and filter. Out-of-the-box connectors for Apache Kafka, HDFS, AWS S3, stream processors, and more. Druid intelligently partitions data based on time and time-based queries are significantly faster than traditional databases. Scale up or down by just adding or removing servers, and Druid automatically rebalances. Fault-tolerant architecture routes around server failures. -
38
MariaDB
MariaDB
MariaDB Platform is a complete enterprise open source database solution. It has the versatility to support transactional, analytical and hybrid workloads as well as relational, JSON and hybrid data models. And it has the scalability to grow from standalone databases and data warehouses to fully distributed SQL for executing millions of transactions per second and performing interactive, ad hoc analytics on billions of rows. MariaDB can be deployed on prem on commodity hardware, is available on all major public clouds and through MariaDB SkySQL as a fully managed cloud database. To learn more, visit mariadb.com. -
39
qikkDB
qikkDB
QikkDB is a GPU accelerated columnar database, delivering stellar performance for complex polygon operations and big data analytics. When you count your data in billions and want to see real-time results you need qikkDB. We support Windows and Linux operating systems. We use Google Tests as the testing framework. There are hundreds of unit tests and tens of integration tests in the project. For development on Windows, Microsoft Visual Studio 2019 is recommended, and its dependencies are CUDA version 10.2 minimal, CMake 3.15 or newer, vcpkg, boost. For development on Linux, the dependencies are CUDA version 10.2 minimal, CMake 3.15 or newer, and boost. This project is licensed under the Apache License, Version 2.0. You can use an installation script or dockerfile to install qikkDB. -
40
JanusGraph
JanusGraph
JanusGraph is a scalable graph database optimized for storing and querying graphs containing hundreds of billions of vertices and edges distributed across a multi-machine cluster. JanusGraph is a project under The Linux Foundation, and includes participants from Expero, Google, GRAKN.AI, Hortonworks, IBM and Amazon. Elastic and linear scalability for a growing data and user base. Data distribution and replication for performance and fault tolerance. Multi-datacenter high availability and hot backups. All functionality is totally free. No need to buy commercial licenses. JanusGraph is fully open source under the Apache 2 license. JanusGraph is a transactional database that can support thousands of concurrent users executing complex graph traversals in real time. Support for ACID and eventual consistency. In addition to online transactional processing (OLTP), JanusGraph supports global graph analytics (OLAP) with its Apache Spark integration. -
41
KeyDB
KeyDB
KeyDB maintains full compatibility with Redis modules, API and protocol. Seamlessly drop in KeyDB and maintain full compatibility with your existing clients, scripts and configurations. Multi-Master mode uses a single replicated dataset across many nodes to serve both read and write operations Nodes can be replicated cross-region to offer submillisecond latencies to local clients. Cluster mode allows unlimited read and write scaling by splitting the dataset across shards. This allows unlimited scaling, and also support high availability through replica nodes. KeyDB offers new community driven commands that enable you to do more with your data. Add your own commands and functionality using JavaScript with the ModJS module. ModJS lets you write functions in javascript that can in turn be called directly by KeyBD. The example to the left shows and example of a javascript function that would be loaded with the module. It can then be called directly from your client. -
42
CrateDB
CrateDB
The enterprise database for time series, documents, and vectors. Store any type of data and combine the simplicity of SQL with the scalability of NoSQL. CrateDB is an open source distributed database running queries in milliseconds, whatever the complexity, volume and velocity of data. -
43
Apache Trafodion
Apache Software Foundation
Apache Trafodion is a webscale SQL-on-Hadoop solution enabling transactional or operational workloads on Apache Hadoop. Trafodion builds on the scalability, elasticity, and flexibility of Hadoop. Trafodion extends Hadoop to provide guaranteed transactional integrity, enabling new kinds of big data applications to run on Hadoop. Full-functioned ANSI SQL language support. JDBC/ODBC connectivity for Linux/Windows clients. Distributed ACID transaction protection across multiple statements, tables, and rows. Performance improvements for OLTP workloads with compile-time and run-time optimizations. Support for large data sets using a parallel-aware query optimizer. Reuse existing SQL skills and improve developer productivity. Distributed ACID transactions guarantee data consistency across multiple rows and tables. Interoperability with existing tools and applications. Hadoop and Linux distribution neutral. Easy to add to your existing Hadoop infrastructure.Starting Price: Free -
44
QuerySurge
RTTS
QuerySurge leverages AI to automate the data validation and ETL testing of Big Data, Data Warehouses, Business Intelligence Reports and Enterprise Apps/ERPs with full DevOps functionality for continuous testing. Use Cases - Data Warehouse & ETL Testing - Hadoop & NoSQL Testing - DevOps for Data / Continuous Testing - Data Migration Testing - BI Report Testing - Enterprise App/ERP Testing QuerySurge Features - Projects: Multi-project support - AI: automatically create datas validation tests based on data mappings - Smart Query Wizards: Create tests visually, without writing SQL - Data Quality at Speed: Automate the launch, execution, comparison & see results quickly - Test across 200+ platforms: Data Warehouses, Hadoop & NoSQL lakes, databases, flat files, XML, JSON, BI Reports - DevOps for Data & Continuous Testing: RESTful API with 60+ calls & integration with all mainstream solutions - Data Analytics & Data Intelligence: Analytics dashboard & reports -
45
Google Cloud Memorystore
Google
Reduce latency with scalable, secure, and highly available in-memory service for Redis and Memcached. Memorystore automates complex tasks for open source Redis and Memcached like enabling high availability, failover, patching, and monitoring so you can spend more time coding. Start with the lowest tier and smallest size and then grow your instance with minimal impact. Memorystore for Memcached can support clusters as large as 5 TB supporting millions of QPS at very low latency. Memorystore for Redis instances are replicated across two zones and provide a 99.9% availability SLA. Instances are monitored constantly and with automatic failover—applications experience minimal disruption. Choose from the two most popular open source caching engines to build your applications. Memorystore supports both Redis and Memcached and is fully protocol compatible. Choose the right engine that fits your cost and availability requirements. -
46
Luna for Apache Cassandra
DataStax
Luna is a subscription to the Apache Cassandra support and expertise at DataStax. It allows you to enjoy all the benefits of open-source Cassandra, with the peace of mind knowing you have direct access to the team that authored the majority of the code and supported some of the largest deployments in the world. Best practices, advice, and SLA-based support to keep your Cassandra deployment in top shape. Scale without compromising on performance or latency to seamlessly manage the most demanding real-time workloads. Create real-time and highly-interactive customer experiences with blisteringly fast read and writes. Luna provides assistance with resolving issues and following best practices with Cassandra clusters. Services provide help through the full application life cycle, with a deeper integration in your team working together on implementation. -
47
Amazon Neptune
Amazon
Amazon Neptune is a fast, reliable, fully managed graph database service that makes it easy to build and run applications that work with highly connected datasets. The core of Amazon Neptune is a purpose-built, high-performance graph database engine optimized for storing billions of relationships and querying the graph with milliseconds latency. Amazon Neptune supports popular graph models Property Graph and W3C's RDF, and their respective query languages Apache TinkerPop Gremlin and SPARQL, allowing you to easily build queries that efficiently navigate highly connected datasets. Neptune powers graph use cases such as recommendation engines, fraud detection, knowledge graphs, drug discovery, and network security. Proactively detect and investigate IT infrastructure using a layered security approach. Visualize all infrastructure to plan, predict and mitigate risk. Build graph queries for near-real-time identity fraud pattern detection in financial and purchase transactions. -
48
AntDB
Antdb AsiaInfo
AntDB is a cloud-native, distributed relational database developed by AsiaInfo Technologies, designed to handle high-performance online transaction processing and online analytical processing workloads. AntDB has been serving over 1 billion subscribers across 24 provinces in China, supporting massive business data related to calls, internet access, payments, and billing. AntDB's cloud-native distributed architecture supports online scalability, data consistency, and high availability across data centers. It is compatible with SQL2016 standards and integrates seamlessly with various domestic ecosystems, including mainstream CPUs and operating systems. The platform offers features such as automatic high availability, online elastic capacity expansion, and read/write splitting at the kernel level to efficiently manage traffic loads during peak periods. AntDB has been successfully commercialized in industries like telecommunications, finance, transportation, and energy.Starting Price: Free -
49
OrbitDB
OrbitDB
OrbitDB is a serverless, distributed, peer-to-peer database that utilizes IPFS for data storage and Libp2p Pubsub for automatic synchronization across peers. It employs Merkle-CRDTs to ensure conflict-free database writes and merges, making it suitable for decentralized applications, blockchain integrations, and local-first web apps. OrbitDB offers various database types tailored to different use cases: 'events' for immutable append-only logs, 'documents' for JSON document storage indexed by a specified key, 'keyvalue' for traditional key-value pairs, and 'keyvalue-indexed' for LevelDB-indexed key-value data. All these databases are built atop OpLog, an immutable, cryptographically verifiable, operation-based CRDT structure. The JavaScript implementation supports both browser and Node.js environments, with a Go version maintained by the Berty project.Starting Price: Free -
50
InterSystems Caché
InterSystems
InterSystems Caché® is a high-performance database that powers transaction processing applications around the world. It is used for everything from mapping a billion stars in the Milky Way, to processing a billion equity trades in a day, to managing smart energy grids. Caché is a multi-model (object, relational, key-value) DBMS and application server developed by InterSystems. InterSystems Caché provides several APIs to operate with same data simultaneously: key-value, relational, object, document, multi-dimensional. Data can be managed via SQL, Java, node.js, .NET, C++, Python. Caché also provides an application server which hosts web apps (CSP), REST, SOAP, web sockets and other types of TCP access for Caché data.