Alternatives to Apache Kudu
Compare Apache Kudu alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Apache Kudu in 2025. Compare features, ratings, user reviews, pricing, and more from Apache Kudu competitors and alternatives in order to make an informed decision for your business.
-
1
StarTree
StarTree
StarTree Cloud is a fully-managed real-time analytics platform designed for OLAP at massive speed and scale for user-facing applications. Powered by Apache Pinot, StarTree Cloud provides enterprise-grade reliability and advanced capabilities such as tiered storage, scalable upserts, plus additional indexes and connectors. It integrates seamlessly with transactional databases and event streaming platforms, ingesting data at millions of events per second and indexing it for lightning-fast query responses. StarTree Cloud is available on your favorite public cloud or for private SaaS deployment.
• Gain critical real-time insights to run your business
• Seamlessly integrate data streaming and batch data
• High performance in throughput and low-latency at petabyte scale
• Fully-managed cloud service
• Tiered storage to optimize cloud performance & spend
• Fully-secure & enterprise-ready
-
2
Redis
Redis Labs
Redis Labs: home of Redis. Redis Enterprise is the best version of Redis. Go beyond cache; try Redis Enterprise free in the cloud using NoSQL & data caching with the world’s fastest in-memory database. Run Redis at scale with enterprise-grade resiliency, massive scalability, ease of management, and operational simplicity. DevOps love Redis in the Cloud. Developers can access enhanced data structures, a variety of modules, and rapid innovation with faster time to market. CIOs love the confidence of working with 99.999% uptime, best-in-class security, and expert support from the creators of Redis. Implement relational databases, active-active, geo-distribution, built-in conflict distribution for simple and complex data types, & reads/writes in multiple geo regions to the same data set. Redis Enterprise offers flexible deployment options: cloud, on-prem, & hybrid. Redis JSON, Redis Java, Python Redis, Redis on Kubernetes & Redis GUI best practices. Starting Price: Free -
3
Amazon Redshift
Amazon
More customers pick Amazon Redshift than any other cloud data warehouse. Redshift powers analytical workloads for Fortune 500 companies, startups, and everything in between. Companies like Lyft have grown with Redshift from startups to multi-billion dollar enterprises. No other data warehouse makes it as easy to gain new insights from all your data. With Redshift you can query petabytes of structured and semi-structured data across your data warehouse, operational database, and your data lake using standard SQL. Redshift lets you easily save the results of your queries back to your S3 data lake using open formats like Apache Parquet to further analyze from other analytics services like Amazon EMR, Amazon Athena, and Amazon SageMaker. Redshift is the world’s fastest cloud data warehouse and gets faster every year. For performance intensive workloads you can use the new RA3 instances to get up to 3x the performance of any cloud data warehouse. Starting Price: $0.25 per hour -
4
RavenDB
RavenDB
RavenDB is the pioneer NoSQL Document Database that is fully transactional (ACID) across your database and throughout your cluster. At a fraction of the total cost of ownership (TCO), our open source distributed database offers high availability and high performance with zero administration. It is designed as an easy-to-use all-in-one database which minimizes the need for third-party add-ons, tools, or support to boost developer productivity and get your project into production fast. You can set up and secure a data cluster in minutes and deploy in the cloud, on-premises, or in a hybrid environment. RavenDB offers a Database as a Service solution, allowing you to pass on all your database operations to us so you can focus exclusively on your application. RavenDB has a built-in storage engine, Voron, that operates at speeds up to 1 million reads per second and 150,000 writes per second on a single node using simple commodity hardware to increase your application’s performance. -
5
Apache Hudi
The Apache Software Foundation
Hudi is a rich platform to build streaming data lakes with incremental data pipelines on a self-managing database layer, while being optimized for lake engines and regular batch processing. Hudi maintains a timeline of all actions performed on the table at different instants of time that helps provide instantaneous views of the table, while also efficiently supporting retrieval of data in the order of arrival. A Hudi instant consists of an action type, an instant time, and a state. Hudi provides efficient upserts by mapping a given hoodie key consistently to a file id via an indexing mechanism. This mapping between record key and file group/file id never changes once the first version of a record has been written to a file. In short, the mapped file group contains all versions of a group of records. -
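The key-to-file mapping described for Hudi can be illustrated with a toy index. This is a minimal sketch and not Hudi's actual API: the class `ToyUpsertIndex`, its round-robin file assignment, and the `file-N` names are all hypothetical and chosen only to show that a key stays pinned to the file it was first written to.

```python
class ToyUpsertIndex:
    """Toy model of an upsert index: each record key is pinned to one file id."""

    def __init__(self, num_file_groups):
        self.num_file_groups = num_file_groups
        self.key_to_file = {}  # record key -> file id, fixed after the first write
        self.files = {}        # file id -> {record key: [versions]}

    def upsert(self, key, value):
        # The first write picks a file id (hypothetical round-robin scheme);
        # later writes for the same key reuse the existing mapping.
        if key not in self.key_to_file:
            file_id = f"file-{len(self.key_to_file) % self.num_file_groups}"
            self.key_to_file[key] = file_id
        file_id = self.key_to_file[key]
        self.files.setdefault(file_id, {}).setdefault(key, []).append(value)
        return file_id


index = ToyUpsertIndex(num_file_groups=2)
first = index.upsert("user-1", {"name": "Ada"})
index.upsert("user-2", {"name": "Bob"})
second = index.upsert("user-1", {"name": "Ada L."})
```

Both writes for `user-1` land in the same file, so that file holds every version of the record, which is the property the description relies on.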
6
Apache Parquet
The Apache Software Foundation
We created Parquet to make the advantages of compressed, efficient columnar data representation available to any project in the Hadoop ecosystem. Parquet is built from the ground up with complex nested data structures in mind, and uses the record shredding and assembly algorithm described in the Dremel paper. We believe this approach is superior to simple flattening of nested namespaces. Parquet is built to support very efficient compression and encoding schemes. Multiple projects have demonstrated the performance impact of applying the right compression and encoding scheme to the data. Parquet allows compression schemes to be specified on a per-column level, and is future-proofed to allow adding more encodings as they are invented and implemented. Parquet is built to be used by anyone. The Hadoop ecosystem is rich with data processing frameworks, and we are not interested in playing favorites. -
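The idea of choosing compression per column can be sketched with the Python standard library. This is not the Parquet file format or any Parquet library API: the `column_levels` settings, the JSON encoding, and the helper names are assumptions made for illustration, and nested-record shredding is not shown.

```python
import json
import zlib

rows = [
    {"id": 1, "country": "US"},
    {"id": 2, "country": "US"},
    {"id": 3, "country": "DE"},
]

# Hypothetical per-column settings: a zlib level per column (0 = no compression).
column_levels = {"id": 0, "country": 9}


def write_columns(rows, levels):
    """Pivot rows into columns and compress each column with its own setting."""
    encoded = {}
    for name, level in levels.items():
        values = [row[name] for row in rows]
        encoded[name] = zlib.compress(json.dumps(values).encode("utf-8"), level)
    return encoded


def read_column(encoded, name):
    """Decode a single column without touching the others."""
    return json.loads(zlib.decompress(encoded[name]).decode("utf-8"))


encoded = write_columns(rows, column_levels)
countries = read_column(encoded, "country")
```

Because each column is stored and compressed separately, a reader that only needs `country` never has to decode `id`.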
7
Apache HBase
The Apache Software Foundation
Use Apache HBase™ when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. Automatic failover support between RegionServers. Easy to use Java API for client access. Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options. Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX. -
8
Google Cloud Bigtable
Google
Google Cloud Bigtable is a fully managed, scalable NoSQL database service for large analytical and operational workloads. Fast and performant: Use Cloud Bigtable as the storage engine that grows with you from your first gigabyte to petabyte-scale for low-latency applications as well as high-throughput data processing and analytics. Seamless scaling and replication: Start with a single node per cluster, and seamlessly scale to hundreds of nodes dynamically supporting peak demand. Replication also adds high availability and workload isolation for live serving apps. Simple and integrated: Fully managed service that integrates easily with big data tools like Hadoop, Dataflow, and Dataproc. Plus, support for the open source HBase API standard makes it easy for development teams to get started. -
9
ClickHouse
ClickHouse
ClickHouse is a fast open-source OLAP database management system. It is column-oriented and allows users to generate analytical reports using SQL queries in real time. ClickHouse's performance exceeds comparable column-oriented database management systems currently available on the market. It processes hundreds of millions to more than a billion rows and tens of gigabytes of data per single server per second. ClickHouse uses all available hardware to its full potential to process each query as fast as possible. Peak processing performance for a single query stands at more than 2 terabytes per second (after decompression, only used columns). In a distributed setup, reads are automatically balanced among healthy replicas to avoid increasing latency. ClickHouse supports multi-master asynchronous replication and can be deployed across multiple datacenters. All nodes are equal, which avoids single points of failure. -
10
Apache Druid
Druid
Apache Druid is an open source distributed data store. Druid’s core design combines ideas from data warehouses, timeseries databases, and search systems to create a high performance real-time analytics database for a broad range of use cases. Druid merges key characteristics of each of the 3 systems into its ingestion layer, storage format, querying layer, and core architecture. Druid stores and compresses each column individually, and only needs to read the ones needed for a particular query, which supports fast scans, rankings, and groupBys. Druid creates inverted indexes for string values for fast search and filter. Out-of-the-box connectors for Apache Kafka, HDFS, AWS S3, stream processors, and more. Druid intelligently partitions data based on time and time-based queries are significantly faster than traditional databases. Scale up or down by just adding or removing servers, and Druid automatically rebalances. Fault-tolerant architecture routes around server failures. -
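An inverted index over string values, as mentioned for Druid, can be sketched in a few lines. This is an illustrative model only and not Druid's storage format (this sketch uses plain Python sets); the sample rows and the function name `build_inverted_index` are hypothetical.

```python
from collections import defaultdict

rows = [
    {"page": "home", "country": "US"},
    {"page": "docs", "country": "DE"},
    {"page": "home", "country": "DE"},
]


def build_inverted_index(rows, column):
    """Map each distinct string value to the set of row ids that contain it."""
    index = defaultdict(set)
    for row_id, row in enumerate(rows):
        index[row[column]].add(row_id)
    return index


page_index = build_inverted_index(rows, "page")
country_index = build_inverted_index(rows, "country")

# Filter "page = 'home' AND country = 'DE'" by intersecting row-id sets,
# without scanning the rows themselves.
matches = sorted(page_index["home"] & country_index["DE"])
```

The filter is answered from the two indexes alone; only the matching row ids would then need to be read from the column data.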
11
CrateDB
CrateDB
The enterprise database for time series, documents, and vectors. Store any type of data and combine the simplicity of SQL with the scalability of NoSQL. CrateDB is an open source distributed database running queries in milliseconds, whatever the complexity, volume and velocity of data. -
12
Greenplum
Greenplum Database
Greenplum Database® is an advanced, fully featured, open source data warehouse. It provides powerful and rapid analytics on petabyte scale data volumes. Uniquely geared toward big data analytics, Greenplum Database is powered by the world’s most advanced cost-based query optimizer delivering high analytical query performance on large data volumes. Greenplum Database® project is released under the Apache 2 license. We want to thank all our current community contributors and are interested in all new potential contributions. For the Greenplum Database community no contribution is too small, we encourage all types of contributions. An open-source massively parallel data platform for analytics, machine learning and AI. Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas. Experience the fully featured, integrated, open source analytics platform. -
13
Apache Cassandra
Apache Software Foundation
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure make it the perfect platform for mission-critical data. Cassandra's support for replicating across multiple datacenters is best-in-class, providing lower latency for your users and the peace of mind of knowing that you can survive regional outages. -
14
HerdDB
Diennea
HerdDB is a SQL distributed database implemented in Java. It has been designed to be embeddable in any Java Virtual Machine. It is optimized for fast "writes" and primary key read/update access patterns. HerdDB is designed to manage hundreds of tables. It is simple to add and remove hosts and to reconfigure tablespaces to easily distribute the load on multiple systems. HerdDB leverages Apache Zookeeper and Apache Bookkeeper to build a fully replicated, shared-nothing architecture without any single point of failure. At the low level HerdDB is very similar to a key-value NoSQL database. On top of that an SQL abstraction layer and JDBC Driver support enables every user to leverage existing know-how and port existing applications to HerdDB. At Diennea we developed EmailSuccess, a powerful MTA (Mail Transfer Agent), designed to deliver millions of email messages per hour to inboxes all around the world, -
15
Rockset
Rockset
Real-Time Analytics on Raw Data. Live ingest from S3, Kafka, DynamoDB & more. Explore raw data as SQL tables. Build amazing data-driven applications & live dashboards in minutes. Rockset is a serverless search and analytics engine that powers real-time apps and live dashboards. Operate directly on raw data, including JSON, XML, CSV, Parquet, XLSX or PDF. Plug data from real-time streams, data lakes, databases, and data warehouses into Rockset. Ingest real-time data without building pipelines. Rockset continuously syncs new data as it lands in your data sources without the need for a fixed schema. Use familiar SQL, including joins, filters, and aggregations. It’s blazing fast, as Rockset automatically indexes all fields in your data. Serve fast queries that power the apps, microservices, live dashboards, and data science notebooks you build. Scale without worrying about servers, shards, or pagers. Starting Price: Free -
16
eXtremeDB
McObject
How is platform-independent eXtremeDB different?
- Hybrid data storage. Unlike other IMDS, eXtremeDB can be all-in-memory, all-persistent, or have a mix of in-memory tables and persistent tables
- Active Replication Fabric™ is unique to eXtremeDB, offering bidirectional replication, multi-tier replication (e.g. edge-to-gateway-to-gateway-to-cloud), compression to maximize limited bandwidth networks and more
- Row & Columnar Flexibility for Time Series Data supports database designs that combine row-based and column-based layouts, in order to best leverage the CPU cache speed
- Embedded and Client/Server. Fast, flexible eXtremeDB is data management wherever you need it, and can be deployed as an embedded database system, and/or as a client/server database system
- A hard real-time deterministic option in eXtremeDB/rt
Designed for use in resource-constrained, mission-critical embedded systems. Found in everything from routers to satellites to trains to stock markets worldwide -
17
Couchbase
Couchbase
Unlike other NoSQL databases, Couchbase provides an enterprise-class, multicloud to edge database that offers the robust capabilities required for business-critical applications on a highly scalable and available platform. As a distributed cloud-native database, Couchbase runs in modern dynamic environments and on any cloud, either customer-managed or fully managed as-a-service. Couchbase is built on open standards, combining the best of NoSQL with the power and familiarity of SQL, to simplify the transition from mainframe and relational databases. Couchbase Server is a multipurpose, distributed database that fuses the strengths of relational databases such as SQL and ACID transactions with JSON’s versatility, with a foundation that is extremely fast and scalable. It’s used across industries for things like user profiles, dynamic product catalogs, GenAI apps, vector search, high-speed caching, and much more. -
18
Apache Pinot
The Apache Software Foundation
Pinot is designed to answer OLAP queries with low latency on immutable data. Pluggable indexing technologies: Sorted Index, Bitmap Index, Inverted Index. Joins are currently not supported, but this problem can be overcome by using Trino or PrestoDB for querying. SQL-like language that supports selection, aggregation, filtering, group by, order by, and distinct queries on data. Consists of both offline and real-time tables. Use the real-time table only to cover segments for which offline data may not be available yet. Detect the right anomalies by customizing the anomaly detection flow and notification flow. -
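The split between offline and real-time tables can be pictured as a time boundary: offline data answers everything up to the boundary, and real-time data only fills in what comes after. The sketch below is a simplified assumption about how such a merge could work, not Pinot's actual broker logic; the `day` field, the sample rows, and `query_hybrid` are hypothetical.

```python
offline_rows = [{"day": 1, "clicks": 10}, {"day": 2, "clicks": 7}]
realtime_rows = [{"day": 2, "clicks": 7}, {"day": 3, "clicks": 4}]


def query_hybrid(offline, realtime):
    """Serve offline rows up to the boundary and real-time rows only after it."""
    # Assumed boundary: the latest day already covered by the offline table.
    boundary = max((row["day"] for row in offline), default=float("-inf"))
    fresh = [row for row in realtime if row["day"] > boundary]
    return offline + fresh


result = query_hybrid(offline_rows, realtime_rows)
total_clicks = sum(row["clicks"] for row in result)
```

Day 2 exists in both tables but is counted once, because the real-time copy falls on the offline side of the boundary and is skipped.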
19
kdb+
KX Systems
A high-performance cross-platform historical time-series columnar database featuring:
- An in-memory compute engine
- A real-time streaming processor
- An expressive query and programming language called q
kdb+ powers kdb Insights portfolio and KDB.AI, together delivering time-oriented data insights and generative AI capabilities to the world’s leading enterprise organizations. Independently benchmarked as the fastest in-memory, columnar analytics database available, kdb+ delivers unmatched value to businesses operating in the toughest data environments. kdb+ improves decision-making processes to help navigate rapidly changing data landscapes. -
20
CockroachDB
Cockroach Labs
CockroachDB: Cloud-native, distributed SQL. Your cloud applications deserve a cloud-native database. Cloud-based apps and services deserve a database that scales across clouds, eases operational complexity, and improves reliability. CockroachDB delivers resilient, distributed SQL with ACID transactions and data partitioned by location. Automate operations for mission-critical applications by pairing CockroachDB with orchestration tools like Kubernetes and Mesosphere DC/OS. Every node can service both reads and writes so that you can scale query throughput and database capacity by simply adding more endpoints. Just add new nodes to CockroachDB, and it automatically rebalances data, completely removing the pain of manual sharding. As demand shifts, CockroachDB detects hotspots and intelligently distributes data to maintain performance. Tune your database at the row level so that data lives close to your users and you can minimize query latency. -
21
MariaDB
MariaDB
MariaDB Platform is a complete enterprise open source database solution. It has the versatility to support transactional, analytical and hybrid workloads as well as relational, JSON and hybrid data models. And it has the scalability to grow from standalone databases and data warehouses to fully distributed SQL for executing millions of transactions per second and performing interactive, ad hoc analytics on billions of rows. MariaDB can be deployed on prem on commodity hardware, is available on all major public clouds and through MariaDB SkySQL as a fully managed cloud database. To learn more, visit mariadb.com. -
22
DataStax
DataStax
The Open, Multi-Cloud Stack for Modern Data Apps. Built on open-source Apache Cassandra™. Global-scale and 100% uptime without vendor lock-in. Deploy on multi-cloud, on-prem, open-source, and Kubernetes. Elastic and pay-as-you-go for improved TCO. Start building faster with Stargate APIs for NoSQL, real-time, reactive, JSON, REST, and GraphQL. Skip the complexity of multiple OSS projects and APIs that don’t scale. Ideal for commerce, mobile, AI/ML, IoT, microservices, social, gaming, and richly interactive applications that must scale up and scale down with demand. Get building modern data applications with Astra, a database-as-a-service powered by Apache Cassandra™. Use REST, GraphQL, JSON with your favorite full-stack framework. Richly interactive apps that are elastic and viral-ready from Day 1. Pay-as-you-go Apache Cassandra DBaaS that scales effortlessly and affordably. -
23
OpenText Analytics Database is a high-performance, scalable analytics platform that enables organizations to analyze massive data sets quickly and cost-effectively. It supports real-time analytics and in-database machine learning to deliver actionable business insights. The platform can be deployed flexibly across hybrid, multi-cloud, and on-premises environments to optimize infrastructure and reduce total cost of ownership. Its massively parallel processing (MPP) architecture handles complex queries efficiently, regardless of data size. OpenText Analytics Database also features compatibility with data lakehouse architectures, supporting formats like Parquet and ORC. With built-in machine learning and broad language support, it empowers users from SQL experts to Python developers to derive predictive insights.
-
24
PolarDB-X
Alibaba Cloud
PolarDB-X has been tried and tested in Tmall Double 11 shopping festivals, and has helped customers in industries such as finance, logistics, energy, e-commerce, and public service to address business challenges. Linearly increases storage space to provide petabyte-scale storage, making storage bottlenecks of standalone databases a thing of the past. Provides the massively parallel processing (MPP) capabilities to significantly improve the efficiency of complex analysis and queries on vast amounts of data. Provides extensive algorithms to distribute data across multiple storage nodes, effectively reducing the volume of data stored in a single table. Starting Price: $10,254.44 per year -
25
Citus
Citus Data
Citus gives you the Postgres you love, plus the superpower of distributed tables. 100% open source. Now with schema-based and row-based sharding, plus Postgres 16 support. Scale Postgres by distributing data & queries. You can start with a single Citus node, then add nodes & rebalance shards when you need to grow. Speed up queries by 20x to 300x (or more) through parallelism, keeping more data in memory, higher I/O bandwidth, and columnar compression. Citus is an extension (not a fork) to the latest Postgres versions, so you can use your familiar SQL toolset & leverage your Postgres expertise. Reduce your infrastructure headaches by using a single database for both your transactional and analytical workloads. Download and use Citus open source for free. You can manage Citus yourself, embrace open source, and help us improve Citus via GitHub. Focus on your application & forget about your database. Run your app on Citus in the cloud with Azure Cosmos DB for PostgreSQL. Starting Price: $0.27 per hour -
26
TiDB
PingCAP
An open-source, cloud-native, distributed SQL database for elastic scale and real-time analytics. Supported by a wealth of open-source data migration tools in the ecosystem, TiDB gives you the freedom to choose your own vendor and avoid lock-in. Purposely built to deliver SQL at scale, TiDB eliminates the scaling problems of traditional relational databases without intrusion to your application. HTAP database platform that enables real-time situation awareness and decision making on live transactional data and eliminates friction between IT and business goals. TiDB is ACID-compliant and strongly consistent. You can use TiDB as a scale-out MySQL database with familiar SQL syntaxes and ecosystem tools. TiDB automatically shards your data so you don’t have to do it manually. You can simply add new nodes to scale horizontally and elastically to meet your business growth. TiDB simplifies the ETL process and automatically recovers from errors. -
27
GaussDB
Huawei Cloud
GaussDB (for MySQL) is a next generation MySQL-compatible, enterprise-class distributed database service. It uses a decoupled compute and storage architecture and data functions virtualization (DFV) storage that auto-scales up to 128 TB per DB instance. There is virtually no risk of data loss. It supports millions of QPS throughputs and cross-AZ deployment, combining the performance and reliability of commercial databases with the flexibility of open source databases. By decoupling compute and storage, connecting them through RDMA, and using a "log as database" architecture, you can get seven times the performance of open-source databases. To scale read capacity and performance, you can add up to 15 read replicas for a primary node within minutes. GaussDB (for MySQL) is fully compatible with MySQL. You can easily migrate your MySQL databases to GaussDB (for MySQL) without reconstructing existing applications and without sharding. Starting Price: $2,586.04 per month -
28
Blazegraph
Blazegraph
Blazegraph™ DB is an ultra-high-performance graph database supporting Blueprints and RDF/SPARQL APIs. It supports up to 50 billion edges on a single machine. It is in production use for Fortune 500 customers such as EMC, Autodesk, and many others. It supports key Precision Medicine applications and has widespread usage for life science applications. It is used extensively to support cyber analytics in commercial and government applications. It powers the Wikimedia Foundation's Wikidata Query Service. You can choose an executable jar, war file, or tar.gz distribution. Blazegraph is designed to be easy to use and get started with. It ships without SSL or authentication by default for this reason. For production deployments, we strongly recommend you enable SSL, authentication, and appropriate network configurations. -
29
Vitess
Vitess
A database clustering system for horizontal scaling of MySQL. Vitess combines many important MySQL features with the scalability of a NoSQL database. Its built-in sharding features let you grow your database without adding sharding logic to your application. Vitess automatically rewrites queries that hurt database performance. It also uses caching mechanisms to mediate queries and prevent duplicate queries from simultaneously reaching your database. Vitess automatically handles functions like master failovers and backups. It uses a lock server to track and administer servers, letting your application be blissfully ignorant of database topology. Vitess eliminates the high-memory overhead of MySQL connections. Vitess servers easily handle thousands of connections at once. MySQL doesn’t natively support sharding, but you will likely need it as your database grows. -
30
OrbitDB
OrbitDB
OrbitDB is a serverless, distributed, peer-to-peer database that utilizes IPFS for data storage and Libp2p Pubsub for automatic synchronization across peers. It employs Merkle-CRDTs to ensure conflict-free database writes and merges, making it suitable for decentralized applications, blockchain integrations, and local-first web apps. OrbitDB offers various database types tailored to different use cases: 'events' for immutable append-only logs, 'documents' for JSON document storage indexed by a specified key, 'keyvalue' for traditional key-value pairs, and 'keyvalue-indexed' for LevelDB-indexed key-value data. All these databases are built atop OpLog, an immutable, cryptographically verifiable, operation-based CRDT structure. The JavaScript implementation supports both browser and Node.js environments, with a Go version maintained by the Berty project. Starting Price: Free -
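The conflict-free merge of an operation log can be sketched as follows. This is a simplified model and not OrbitDB's API or its OpLog format: the operation fields, the use of a plain `clock` integer with an `author` tie-break, and the helper names are assumptions, and the links between entries that make up a real Merkle structure are left out.

```python
import hashlib
import json


def make_op(key, value, clock, author):
    """Build an immutable operation identified by the hash of its content."""
    body = {"key": key, "value": value, "clock": clock, "author": author}
    payload = json.dumps(body, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest(), body


def merge(log_a, log_b):
    """Merge two logs by union; identical ops share a hash, so duplicates collapse."""
    return {**log_a, **log_b}


def materialize(log):
    """Replay ops in a deterministic order (clock, then author) into a key-value view."""
    state = {}
    for op in sorted(log.values(), key=lambda o: (o["clock"], o["author"])):
        state[op["key"]] = op["value"]
    return state


peer_a = dict([make_op("color", "red", 1, "a")])
peer_b = dict([make_op("color", "blue", 1, "b"), make_op("size", "L", 2, "b")])

view_ab = materialize(merge(peer_a, peer_b))
view_ba = materialize(merge(peer_b, peer_a))
```

Both peers arrive at the same view regardless of merge order, because the merge is a set union and the replay order depends only on the operations themselves.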
31
ArangoDB
ArangoDB
Natively store data for graph, document and search needs. Utilize feature-rich access with one query language. Map data natively to the database and access it with the best patterns for the job – traversals, joins, search, ranking, geospatial, aggregations – you name it. Polyglot persistence without the costs. Easily design, scale and adapt your architectures to changing needs and with much less effort. Combine the flexibility of JSON with semantic search and graph technology for next generation feature extraction even for large datasets. -
32
JanusGraph
JanusGraph
JanusGraph is a scalable graph database optimized for storing and querying graphs containing hundreds of billions of vertices and edges distributed across a multi-machine cluster. JanusGraph is a project under The Linux Foundation, and includes participants from Expero, Google, GRAKN.AI, Hortonworks, IBM and Amazon. Elastic and linear scalability for a growing data and user base. Data distribution and replication for performance and fault tolerance. Multi-datacenter high availability and hot backups. All functionality is totally free. No need to buy commercial licenses. JanusGraph is fully open source under the Apache 2 license. JanusGraph is a transactional database that can support thousands of concurrent users executing complex graph traversals in real time. Support for ACID and eventual consistency. In addition to online transactional processing (OLTP), JanusGraph supports global graph analytics (OLAP) with its Apache Spark integration. -
33
rqlite
rqlite
The lightweight, user-friendly, distributed relational database built on SQLite. Fault tolerance and high availability with zero hassle. rqlite is a distributed relational database that combines the simplicity of SQLite with the robustness of a fault-tolerant, highly available system. It's developer-friendly, its operation is straightforward, and it's designed for reliability with minimal complexity. Deploy in seconds, with no complex configurations. Seamlessly integrates with modern cloud infrastructures. Built on SQLite, the world’s most popular database. Supports full-text search, Vector Search, and JSON documents. Access controls and encryption for secure deployments. Rigorous, automated testing ensures high quality. Clustering provides high availability and fault tolerance. Automatic node discovery simplifies clustering. -
34
Amazon Aurora
Amazon
Amazon Aurora is a MySQL and PostgreSQL-compatible relational database built for the cloud, that combines the performance and availability of traditional enterprise databases with the simplicity and cost-effectiveness of open source databases. Amazon Aurora is up to five times faster than standard MySQL databases and three times faster than standard PostgreSQL databases. It provides the security, availability, and reliability of commercial databases at 1/10th the cost. Amazon Aurora is fully managed by Amazon Relational Database Service (RDS), which automates time-consuming administration tasks like hardware provisioning, database setup, patching, and backups. Amazon Aurora features a distributed, fault-tolerant, self-healing storage system that auto-scales up to 64TB per database instance. It delivers high performance and availability with up to 15 low-latency read replicas, point-in-time recovery, continuous backup to Amazon S3, and replication across three Availability Zones. Starting Price: $0.02 per month -
35
SingleStore
SingleStore
SingleStore (formerly MemSQL) is a distributed, highly-scalable SQL database that can run anywhere. We deliver maximum performance for transactional and analytical workloads with familiar relational models. SingleStore is a scalable SQL database that ingests data continuously to perform operational analytics for the front lines of your business. Ingest millions of events per second with ACID transactions while simultaneously analyzing billions of rows of data in relational SQL, JSON, geospatial, and full-text search formats. SingleStore delivers ultimate data ingestion performance at scale and supports built in batch loading and real time data pipelines. SingleStore lets you achieve ultra fast query response across both live and historical data using familiar ANSI SQL. Perform ad hoc analysis with business intelligence tools, run machine learning algorithms for real-time scoring, perform geoanalytic queries in real time. Starting Price: $0.69 per hour -
36
ScyllaDB
ScyllaDB
ScyllaDB is the database for data-intensive apps that require high performance and low latency. It enables teams to harness the ever-increasing computing power of modern infrastructures – eliminating barriers to scale as data grows. Unlike any other database, ScyllaDB is a distributed NoSQL database fully compatible with Apache Cassandra and Amazon DynamoDB, yet is built with deep architectural advancements that enable exceptional end-user experiences at radically lower costs. Over 400 game-changing companies like Disney+ Hotstar, Expedia, FireEye, Discord, Zillow, Starbucks, Comcast, and Samsung use ScyllaDB for their toughest database challenges. ScyllaDB is available as free open source software, a fully-supported enterprise product, and a fully managed database-as-a-service (DBaaS) on multiple cloud providers. -
37
RocksDB
RocksDB
RocksDB uses a log-structured database engine, written entirely in C++, for maximum performance. Keys and values are just arbitrarily-sized byte streams. RocksDB is optimized for fast, low-latency storage such as flash drives and high-speed disk drives. RocksDB exploits the full potential of high read/write rates offered by flash or RAM. RocksDB provides everything from basic operations, such as opening and closing a database and reading and writing, to more advanced operations such as merging and compaction filters. RocksDB is adaptable to different workloads. From database storage engines such as MyRocks to application data caching to embedded workloads, RocksDB can be used for a variety of data needs. -
38
BigchainDB
BigchainDB
With high throughput, low latency, powerful query functionality, decentralized control, immutable data storage and built-in asset support, BigchainDB is like a database with blockchain characteristics. BigchainDB allows developers and enterprises to deploy blockchain proof-of-concepts, platforms and applications with a blockchain database, supporting a wide range of industries and use cases. Rather than trying to enhance blockchain technology, BigchainDB starts with a big data distributed database and then adds blockchain characteristics: decentralized control, immutability and the transfer of digital assets. No single point of control. No single point of failure. Decentralized control via a federation of voting nodes makes for a P2P network. Write and run any MongoDB query to search the contents of all stored transactions, assets, metadata and blocks. Powered by MongoDB itself. -
39
Nebula Graph
vesoft
The graph database built for super large-scale graphs with milliseconds of latency. We are continuing to collaborate with the community to prepare, popularize and promote the graph database. Nebula Graph only allows authenticated access via role-based access control. Nebula Graph supports multiple storage engine types and the query language can be extended to support new algorithms. Nebula Graph provides low-latency reads and writes, while still maintaining high throughput to simplify the most complex data sets. With a shared-nothing distributed architecture, Nebula Graph offers linear scalability. Nebula Graph's SQL-like query language is easy to understand and powerful enough to meet complex business needs. With horizontal scalability and a snapshot feature, Nebula Graph guarantees high availability even in case of failures. Large Internet companies like JD, Meituan, and Xiaohongshu have deployed Nebula Graph in production environments. -
40
HCL OneDB
HCL Software
Build and run distributed, database-driven enterprise applications with the highest levels of availability, scalability, and performance completely cloud native. For enterprises just starting their cloud native journey or those already executing a multi-cloud strategy, OneDB offers the flexibility, reliability, and ease-of-use needed to meet your application needs. Capturing the value of data for insight and actionable intelligence is made easier through fully automated database administration. You can drastically reduce the need for deep technical expertise to launch new ideas and still stay ahead of the competition. OneDB is great for application development. From broad support of interfaces and APIs to extensive programming language support, developers will find everything they need with OneDB. HCL offers the most versatile cloud native database in the market. -
41
Fauna
Fauna
Fauna is a data API for modern applications that facilitates rich clients with serverless backends by providing a web-native interface with support for GraphQL and custom business logic, frictionless integration with the serverless ecosystem, a no-compromise multi-cloud architecture you can trust and grow with, and total freedom from database operations. Instantly create multiple databases in one account, leveraging multi-tenancy for development or customer-facing use cases. Create a distributed database across one geography or the globe in just three clicks and easily import existing data. Scale seamlessly without ever managing servers, clusters, data partitioning, or replication. Track usage and consumption-based billing in near real time via a dashboard.Starting Price: Free -
42
Grakn
Grakn Labs
Building intelligent systems starts at the database. Grakn is an intelligent database - a knowledge graph. An insanely intuitive & expressive data schema, with constructs to define hierarchies, hyper-entities, hyper-relations and rules, to build rich knowledge models. An intelligent language that performs logical inference of data types, relationships, attributes and complex patterns, at runtime, and over distributed & persisted data. Out-of-the-box distributed analytics algorithms (Pregel and MapReduce), accessible through simple queries in the language. Strong abstraction over low-level patterns, enabling simpler expressions of complex constructs, while the system figures out the optimal query execution. Scale your enterprise Knowledge Graph with Grakn KGMS and Workbase. A distributed database designed to scale over a network of computers through partitioning and replication. -
43
FoundationDB
FoundationDB
FoundationDB is multi-model, meaning you can store many types of data in a single database. All data is safely stored, distributed, and replicated in the Key-Value Store component. FoundationDB is easy to install, grow, and manage. It has a distributed architecture that gracefully scales out and handles faults while acting like a single ACID database. FoundationDB provides amazing performance on commodity hardware, allowing you to support very heavy loads at low cost. FoundationDB has been running in production for years and has been hardened with lessons learned. Backing FoundationDB up is an unmatched testing system based on a deterministic simulation engine. We encourage your participation in our open-source community! Join us in technical and user discussions on the community forums, and learn how to contribute. -
44
HarperDB
HarperDB
HarperDB is a distributed systems platform that combines database, caching, application, and streaming functions into a single technology. With it, you can start delivering global-scale back-end services with less effort, higher performance, and lower cost than ever before. Deploy user-programmed applications and pre-built add-ons on top of the data they depend on for a high throughput, ultra-low latency back end. Lightning-fast distributed database delivers orders of magnitude more throughput per second than popular NoSQL alternatives while providing limitless horizontal scale. Native real-time pub/sub communication and data processing via MQTT, WebSocket, and HTTP interfaces. HarperDB delivers powerful data-in-motion capabilities without layering in additional services like Kafka. Focus on features that move your business forward, not fighting complex infrastructure. You can't change the speed of light, but you can put less light between your users and their data.Starting Price: Free -
45
AntDB
Antdb AsiaInfo
AntDB is a cloud-native, distributed relational database developed by AsiaInfo Technologies, designed to handle high-performance online transaction processing and online analytical processing workloads. AntDB has been serving over 1 billion subscribers across 24 provinces in China, supporting massive business data related to calls, internet access, payments, and billing. AntDB's cloud-native distributed architecture supports online scalability, data consistency, and high availability across data centers. It is compatible with SQL2016 standards and integrates seamlessly with various domestic ecosystems, including mainstream CPUs and operating systems. The platform offers features such as automatic high availability, online elastic capacity expansion, and read/write splitting at the kernel level to efficiently manage traffic loads during peak periods. AntDB has been successfully commercialized in industries like telecommunications, finance, transportation, and energy.Starting Price: Free -
46
Apache Geode
Apache
Build high-speed, data-intensive applications that elastically meet performance requirements at any scale. Take advantage of Apache Geode's unique technology that blends advanced techniques for data replication, partitioning and distributed processing. Apache Geode provides a database-like consistency model, reliable transaction processing and a shared-nothing architecture to maintain very low latency performance with high concurrency processing. Data can easily be partitioned (sharded) or replicated between nodes allowing performance to scale as needed. Durability is ensured through redundant in-memory copies and disk-based persistence. Super fast write-ahead-logging (WAL) persistence with a shared-nothing architecture that is optimized for fast parallel recovery of nodes or an entire cluster. -
47
Holochain
Holochain
An end-to-end open source P2P app framework. Local circles of trust provide data integrity without centralized authorities. Holochain delivers the promises of blockchain with a mashup of proven tech that provides self-owned data, a distributed database, and peer accountability. Holochain helps by creating an alternative to the dominant centralized systems of the Internet, protecting our ability to make our own choices, and giving trustworthy information we can act on. We call this 'digital agency', and we believe it will give us the power to face today's complex problems, together. Access other Holochain apps as if they were part of your codebase. No HTTP client, just function calls with optional access control. Compute and data live at the edges, which means there’s no infrastructure for you to maintain and defend. Holochain automatically adapts to disruptions and threats. -
48
Azure Table Storage
Microsoft
Use Azure Table storage to store petabytes of semi-structured data and keep costs down. Unlike many data stores—on-premises or cloud-based—Table storage lets you scale up without having to manually shard your dataset. Availability also isn’t a concern: using geo-redundant storage, stored data is replicated three times within a region—and an additional three times in another region, hundreds of miles away. Table storage is excellent for flexible datasets—web app user data, address books, device information, and other metadata—and lets you build cloud applications without locking down the data model to particular schemas. Because different rows in the same table can have a different structure—for example, order information in one row, and customer information in another—you can evolve your application and table schema without taking it offline. Table storage embraces a strong consistency model. -
49
InfiniDB
Database of Databases
InfiniDB is a column-store DBMS optimized for OLAP workloads. It has a distributed architecture to support Massively Parallel Processing (MPP). It uses MySQL as its front-end and supports all MySQL syntaxes, including foreign keys, so users familiar with MySQL can quickly migrate to InfiniDB and connect to it using any MySQL connector. InfiniDB applies MVCC for concurrency control and uses the term System Change Number (SCN) to indicate a version of the system. Its Block Resolution Manager (BRM) uses three structures, the version buffer, the version substitution structure, and the version buffer block manager, to manage multiple versions. InfiniDB applies deadlock detection to resolve conflicts. For each column, InfiniDB applies range partitioning and stores the minimum and maximum value of each partition in a small structure called the extent map. -
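The extent-map idea described above, keeping the minimum and maximum value of each partition so that a range query can skip partitions that cannot match, can be illustrated with a small sketch. This is not InfiniDB's implementation: the names `Extent`, `build_extents`, and `scan_range` are hypothetical, and the sketch assumes fixed-size partitions of a single integer column.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Extent:
    """One partition of a column plus its recorded min and max (hypothetical type)."""
    values: List[int]
    min_value: int
    max_value: int


def build_extents(column: List[int], extent_size: int) -> List[Extent]:
    """Split a column into fixed-size partitions and record each one's min/max."""
    extents = []
    for start in range(0, len(column), extent_size):
        chunk = column[start:start + extent_size]
        extents.append(Extent(chunk, min(chunk), max(chunk)))
    return extents


def scan_range(extents: List[Extent], low: int, high: int) -> Tuple[List[int], int]:
    """Return values in [low, high] and the number of partitions actually read."""
    matches: List[int] = []
    scanned = 0
    for extent in extents:
        # Skip the partition if its min/max range cannot overlap the query range.
        if extent.max_value < low or extent.min_value > high:
            continue
        scanned += 1
        matches.extend(v for v in extent.values if low <= v <= high)
    return matches, scanned


# Illustrative data: 100 sorted values in 10 partitions of 10 values each.
column = list(range(100))
extents = build_extents(column, 10)
matches, scanned = scan_range(extents, 25, 34)
```

In this toy data set the query range overlaps only two of the ten partitions, so the other eight are skipped without reading their values. The benefit depends on how well the column's values cluster within partitions.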
50
Sadas Engine
Sadas
Sadas Engine is the fastest Columnar Database Management System, both in the cloud and on premises. Turn data into information with a columnar Database Management System able to perform 100 times faster than transactional DBMSs and able to carry out searches on huge quantities of data spanning periods even longer than 10 years. Every day we work to ensure impeccable service and appropriate solutions to enhance the activities of your specific business. SADAS srl, a company of the AS Group, is dedicated to the development of Business Intelligence solutions, data analysis applications and DWH tools, relying on cutting-edge technology. The company operates in many sectors: banking, insurance, leasing, commercial, media and telecommunications, and in the public sector. Innovative software solutions for daily management needs and decision-making processes, in any sector.