Alternatives to Pravega

Compare Pravega alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Pravega in 2026. Compare features, ratings, user reviews, pricing, and more from Pravega competitors and alternatives in order to make an informed decision for your business.

  • 1
    StreamNative

    StreamNative

    StreamNative redefines streaming infrastructure by seamlessly integrating Kafka, MQ, and other protocols into a single, unified platform, providing unparalleled flexibility and efficiency for modern data processing needs. StreamNative offers a unified solution that adapts to the diverse requirements of streaming and messaging in a microservices-driven environment. By providing a comprehensive and intelligent approach to messaging and streaming, StreamNative empowers organizations to navigate the complexities and scalability of the modern data ecosystem with efficiency and agility. Apache Pulsar’s unique architecture decouples the message serving layer from the message storage layer to deliver a mature cloud-native data-streaming platform. Scalable and elastic to adapt to rapidly changing event traffic and business needs. Scale up to millions of topics with architecture that decouples computing and storage.
    Starting Price: $1,000 per month
  • 2
    Astra Streaming
    Responsive applications keep users engaged and developers inspired. Rise to meet these ever-increasing expectations with the DataStax Astra Streaming service platform. DataStax Astra Streaming is a cloud-native messaging and event streaming platform powered by Apache Pulsar. Astra Streaming allows you to build streaming applications on top of an elastically scalable, multi-cloud messaging and event streaming platform. Astra Streaming is powered by Apache Pulsar, the next-generation event streaming platform which provides a unified solution for streaming, queuing, pub/sub, and stream processing. Astra Streaming is a natural complement to Astra DB. Using Astra Streaming, existing Astra DB users can easily build real-time data pipelines into and out of their Astra DB instances. With Astra Streaming, avoid vendor lock-in and deploy on any of the major public clouds (AWS, GCP, Azure) compatible with open-source Apache Pulsar.
  • 3
    Apache Kafka

    The Apache Software Foundation

    Apache Kafka® is an open-source, distributed streaming platform. Scale production clusters up to a thousand brokers, trillions of messages per day, petabytes of data, hundreds of thousands of partitions. Elastically expand and contract storage and processing. Stretch clusters efficiently over availability zones or connect separate clusters across geographic regions. Process streams of events with joins, aggregations, filters, transformations, and more, using event-time and exactly-once processing. Kafka’s out-of-the-box Connect interface integrates with hundreds of event sources and event sinks including Postgres, JMS, Elasticsearch, AWS S3, and more. Read, write, and process streams of events in a vast array of programming languages.
  • 4
    Amazon MSK
    Amazon Managed Streaming for Apache Kafka (Amazon MSK) is a fully managed service that makes it easy for you to build and run applications that use Apache Kafka to process streaming data. Apache Kafka is an open-source platform for building real-time streaming data pipelines and applications. With Amazon MSK, you can use native Apache Kafka APIs to populate data lakes, stream changes to and from databases, and power machine learning and analytics applications. Apache Kafka clusters are challenging to set up, scale, and manage in production. When you run Apache Kafka on your own, you need to provision servers, configure Apache Kafka manually, replace servers when they fail, orchestrate server patches and upgrades, architect the cluster for high availability, ensure data is durably stored and secured, set up monitoring and alarms, and carefully plan scaling events to support load changes.
    Starting Price: $0.0543 per hour
  • 5
    Apache Geode
    Build high-speed, data-intensive applications that elastically meet performance requirements at any scale. Take advantage of Apache Geode's unique technology that blends advanced techniques for data replication, partitioning and distributed processing. Apache Geode provides a database-like consistency model, reliable transaction processing and a shared-nothing architecture to maintain very low latency performance with high concurrency processing. Data can easily be partitioned (sharded) or replicated between nodes allowing performance to scale as needed. Durability is ensured through redundant in-memory copies and disk-based persistence. Super fast write-ahead-logging (WAL) persistence with a shared-nothing architecture that is optimized for fast parallel recovery of nodes or an entire cluster.
  • 6
    WarpStream

    WarpStream

    WarpStream is an Apache Kafka-compatible data streaming platform built directly on top of object storage, with no inter-AZ networking costs, no disks to manage, and infinitely scalable, all within your VPC. WarpStream is deployed as a stateless and auto-scaling agent binary in your VPC with no local disks to manage. Agents stream data directly to and from object storage with no buffering on local disks and no data tiering. Create new “virtual clusters” in our control plane instantly. Support different environments, teams, or projects without managing any dedicated infrastructure. WarpStream is protocol compatible with Apache Kafka, so you can keep using all your favorite tools and software. No need to rewrite your application or use a proprietary SDK. Just change the URL in your favorite Kafka client library and start streaming. Never again have to choose between reliability and your budget.
    Starting Price: $2,987 per month
  • 7
    Red Hat OpenShift Streams
    Red Hat® OpenShift® Streams for Apache Kafka is a managed cloud service that provides a streamlined developer experience for building, deploying, and scaling new cloud-native applications or modernizing existing systems. Red Hat OpenShift Streams for Apache Kafka makes it easy to create, discover, and connect to real-time data streams no matter where they are deployed. Streams are a key component for delivering event-driven and data analytics applications. The combination of seamless operations across distributed microservices, large data transfer volumes, and managed operations allows teams to focus on team strengths, speed up time to value, and lower operational costs. OpenShift Streams for Apache Kafka includes a Kafka ecosystem and is part of a family of cloud services—and the Red Hat OpenShift product family—which helps you build a wide range of data-driven solutions.
  • 8
    Speedb

    Speedb

    The next-generation key-value storage engine. Speedb is 100% RocksDB compatible, enhancing stability, efficiency, and overall performance. Join the Hive, Speedb’s open-source community, to interact, improve, and share knowledge and best practices on RocksDB. Speedb is a compatible alternative for LevelDB and RocksDB users who would like to take their application to the next level. When using event streaming platforms like Kafka, Flink, Spark, Splunk, Elastic, or others, consider using Speedb to enhance their performance. The increase in metadata in modern data sets is causing significant performance issues for many applications. With Speedb you can keep costs low and ensure your applications continue to run smoothly even under heavy loads. When it comes to making a choice to upgrade or deploy a new key-value store with your platform, Speedb is up for the challenge. By seamlessly integrating Speedb's advanced key-value storage engine with your projects, you'll experience immediate relief.
    Starting Price: Free
  • 9
    Axual

    Axual

    Axual is Kafka-as-a-Service for DevOps teams. Empower your team to unlock insights and drive decisions with our intuitive Kafka platform. Axual offers the ultimate solution for enterprises looking to seamlessly integrate data streaming into their core IT infrastructure. Our all-in-one Kafka platform is designed to eliminate the need for extensive technical knowledge or skills, and provides a ready-made solution that delivers all the benefits of event streaming without the hassle. The Axual Platform is an all-in-one solution, designed to help you simplify and enhance the deployment, management, and utilization of real-time data streaming with Apache Kafka. By providing an array of features that cater to the diverse needs of modern enterprises, the Axual Platform enables organizations to harness the full potential of data streaming while minimizing complexity and operational overhead.
  • 10
    Baidu Messaging System
    Baidu Messaging System (BMS) is a distributed, scalable, hosted message queue service with high throughput. It collects massive amounts of data from websites, devices, or applications, such as user browsing, clicks, and searches, for real-time analysis. BMS is a hosted service based on Apache Kafka. Kafka is a distributed, multi-partition, and multi-replica messaging service. The producer interacts with the consumer asynchronously through the message queue, so neither waits for the other. Compared with traditional messaging services, BMS encapsulates the Kafka cluster details and provides them in the form of a hosted service. You can use BMS directly to integrate with massively distributed applications without having to consider cluster operations, and you pay only for what you use.
  • 11
    Azure Event Hubs
    Event Hubs is a fully managed, real-time data ingestion service that’s simple, trusted, and scalable. Stream millions of events per second from any source to build dynamic data pipelines and immediately respond to business challenges. Keep processing data during emergencies using the geo-disaster recovery and geo-replication features. Integrate seamlessly with other Azure services to unlock valuable insights. Allow existing Apache Kafka clients and applications to talk to Event Hubs without any code changes—you get a managed Kafka experience without having to manage your own clusters. Experience real-time data ingestion and microbatching on the same stream. Focus on drawing insights from your data instead of managing infrastructure. Build real-time big data pipelines and respond to business challenges right away.
    Starting Price: $0.03 per hour
  • 12
    Apache Pulsar

    Apache Software Foundation

    Apache Pulsar is a cloud-native, distributed messaging and streaming platform originally created at Yahoo! and now a top-level Apache Software Foundation project. Easy to deploy, lightweight compute process, developer-friendly APIs, no need to run your own stream processing engine. Run in production at Yahoo! scale for over 5 years, with millions of messages per second across millions of topics. Built from the ground up as a multi-tenant system. Supports isolation, authentication, authorization and quotas. Configurable replication between data centers across multiple geographic regions. Persistent message storage based on Apache BookKeeper. IO-level isolation between write and read operations. REST admin API for provisioning, administration, tools and monitoring.
  • 13
    HarperDB

    HarperDB

    HarperDB is a distributed systems platform that combines database, caching, application, and streaming functions into a single technology. With it, you can start delivering global-scale back-end services with less effort, higher performance, and lower cost than ever before. Deploy user-programmed applications and pre-built add-ons on top of the data they depend on for a high throughput, ultra-low latency back end. Lightning-fast distributed database delivers orders of magnitude more throughput per second than popular NoSQL alternatives while providing limitless horizontal scale. Native real-time pub/sub communication and data processing via MQTT, WebSocket, and HTTP interfaces. HarperDB delivers powerful data-in-motion capabilities without layering in additional services like Kafka. Focus on features that move your business forward, not fighting complex infrastructure. You can't change the speed of light, but you can put less light between your users and their data.
    Starting Price: Free
  • 14
    Pulsar

    Pulsar-Edit

    A community-led hyper-hackable text editor. Pulsar works across operating systems. Use it on OS X, Windows, or Linux. Search and install new packages or create your own right from Pulsar. Pulsar helps you write code faster with a smart and flexible autocomplete. Easily browse and open a single file, a whole project, or multiple projects in one window. Split your Pulsar interface into multiple panes to compare and edit code across files. Find, preview, and replace text as you type in a file or across all your projects.
    Starting Price: Free
  • 15
    Samza

    Apache Software Foundation

    Samza allows you to build stateful applications that process data in real-time from multiple sources including Apache Kafka. Battle-tested at scale, it supports flexible deployment options to run on YARN or as a standalone library. Samza provides extremely low latencies and high throughput to analyze your data instantly. Scales to several terabytes of state with features like incremental checkpoints and host-affinity. Samza is easy to operate with flexible deployment options: YARN, Kubernetes, or standalone. Ability to run the same code to process both batch and streaming data. Integrates with several sources including Kafka, HDFS, AWS Kinesis, Azure Event Hubs, K-V stores, and Elasticsearch.
  • 16
    IBM Event Streams
    IBM Event Streams is a fully managed event streaming platform built on Apache Kafka, designed to help enterprises process and respond to real-time data streams. With capabilities for machine learning integration, high availability, and secure cloud deployment, it enables organizations to create intelligent applications that react to events as they happen. The platform supports multi-cloud environments, disaster recovery, and geo-replication, making it ideal for mission-critical workloads. IBM Event Streams simplifies building and scaling real-time, event-driven solutions, ensuring data is processed quickly and efficiently.
  • 17
    Pulsar Music Player

    Rhythm Software

    Gorgeous user interface and animation following material design. Automatically downloads missing album covers and artist images. User-friendly interface for managing and playing your music, easy and effective. Application size is only 2.6M, but it is still a full-featured music player. Pulsar helps you manage and play music from every perspective, and it contains almost everything you need in a music player. Pulsar supports standard music file types including mp3, aac, flac, ogg, wav, and others. If you cannot find your music in Pulsar, click the “rescan library” menu item in the action bar to rescan the device.
    Starting Price: $2.99 one-time payment
  • 18
    Equalum

    Equalum

    Equalum’s continuous data integration & streaming platform is the only solution that natively supports real-time, batch, and ETL use cases under one, unified platform with zero coding required. Make the move to real-time with a fully orchestrated, drag-and-drop, no-code UI. Experience rapid deployment, powerful transformations, and scalable streaming data pipelines in minutes. Multi-modal, robust, and scalable CDC enabling real-time streaming and data replication. Tuned for best-in-class performance no matter the source. The power of open-source big data frameworks, without the hassle. Equalum harnesses the scalability of open-source data frameworks such as Apache Spark and Kafka in the Platform engine to dramatically improve the performance of streaming and batch data processes. Organizations can increase data volumes while improving performance and minimizing system impact using this best-in-class infrastructure.
  • 19
    Pulsar

    Pulsar AI

    Whenever customers reply to your listings, PulsarAI takes over the conversation. It automates all communications in email and SMS and drives the conversation to the goal: an appointment! Pulsar understands what customers ask about (prices, features, terms, etc.) by using integration with your existing systems. It will answer multiple questions in a two-way conversation while continually attempting to schedule an appointment. If leads are not converted, it follows up until they're re-engaged. Every follow-up strategy is tailored to each customer. When a lead comes in from any source, Pulsar reads the question, generates a response using inventory data, and sends it back to the customer. By immediately answering every lead, Pulsar keeps the conversation going and converts it into an appointment. Appointments, calls, and follow-ups are scheduled automatically in CRM and notifications are sent to the dealership team.
  • 20
    Oracle Cloud Infrastructure Streaming
    Streaming service is a real-time, serverless, Apache Kafka-compatible event streaming platform for developers and data scientists. Streaming is tightly integrated with Oracle Cloud Infrastructure (OCI), Database, GoldenGate, and Integration Cloud. The service also provides out-of-the-box integrations for hundreds of third-party products across categories such as DevOps, databases, big data, and SaaS applications. Data engineers can easily set up and operate big data pipelines. Oracle handles all infrastructure and platform management for event streaming, including provisioning, scaling, and security patching. With the help of consumer groups, Streaming can provide state management for thousands of consumers. This helps developers easily build applications at scale.
  • 21
    Estuary Flow
    Estuary Flow is a new kind of DataOps platform that empowers engineering teams to build real-time, data-intensive applications at scale with minimal friction. This platform unifies a team’s databases, pub/sub systems, and SaaS around their data, without requiring new investments in infrastructure or development.
    Starting Price: $200/month
  • 22
    Aiven for Apache Kafka
    Apache Kafka as a fully managed service, with zero vendor lock-in and a full set of capabilities to build your streaming pipeline. Set up fully managed Kafka in less than 10 minutes — directly from our web console or programmatically via our API, CLI, Terraform provider or Kubernetes operator. Easily connect it to your existing tech stack with over 30 connectors, and feel confident in your setup with logs and metrics available out of the box via the service integrations. A fully managed distributed data streaming platform, deployable in the cloud of your choice. Ideal for event-driven applications, near-real-time data transfer and pipelines, stream analytics, and any other case where you need to move a lot of data between applications — and quickly. With Aiven’s hosted and managed-for-you Apache Kafka, you can set up clusters, deploy new nodes, migrate clouds, and upgrade existing versions — in a single mouse click — and monitor them through a simple dashboard.
    Starting Price: $200 per month
  • 23
    Exein

    Exein

    The IoT edge that protects from inside. Continuously monitoring and identifying threats at every stage of development. For all your devices. Automate the analysis and identification of security vulnerabilities inside your devices. All in one place, with intelligent prioritization and proprietary rating. Get all the intel you need about the security of your device in one place: simple and convenient. Know exactly where the most dangerous vulnerabilities are, and address them in a smart way. Pulsar is a modern runtime threat detection and response engine. Designed for IoT and edge computing, Pulsar is optimized for performance, runtime cost, and edge security. Pulsar's modular architecture is written entirely in Rust, a modern and secure language. Pulsar combines its edge AI threat detection engines with deterministic security policies to achieve state-of-the-art performance.
  • 24
    Pulsar

    Pulsar

    Pulsar is a cloud-connected real-time machine monitoring and Overall Equipment Effectiveness (OEE) platform designed to give manufacturers full visibility into shop-floor operations, production performance, and machine health so they can detect issues, eliminate downtime, and improve productivity without relying on manual logs or spreadsheets. It uses industrial IoT sensors and hardware that connect to machines of any type, age, or model without altering PLCs and stream availability, performance, speed, cycle counts, downtime causes, and production output data to intuitive dashboards accessible from any device. Pulsar automatically tracks key manufacturing KPIs, like OEE, utilization, throughput, and quality, provides advanced real-time analytics, historical reporting, and custom configurable alerts via SMS, email, or TV dashboards, and includes a digital Andon system so teams can spot and respond to production issues instantly.
  • 25
    Google Cloud Pub/Sub
    Google Cloud Pub/Sub. Scalable, in-order message delivery with pull and push modes. Auto-scaling and auto-provisioning with support from zero to hundreds of GB/second. Independent quota and billing for publishers and subscribers. Global message routing to simplify multi-region systems. High availability made simple. Synchronous, cross-zone message replication and per-message receipt tracking ensure reliable delivery at any scale. No planning, auto-everything. Auto-scaling and auto-provisioning with no partitions eliminate planning and ensure workloads are production-ready from day one. Advanced features, built in. Filtering, dead-letter delivery, and exponential backoff without sacrificing scale help simplify your applications. A fast, reliable way to land small records at any volume, an entry point for real-time and batch pipelines feeding BigQuery, data lakes and operational databases. Use it with ETL/ELT pipelines in Dataflow.
  • 26
    GlassFlow

    GlassFlow

    GlassFlow is a serverless, event-driven data pipeline platform designed for Python developers. It enables users to build real-time data pipelines without the need for complex infrastructure like Kafka or Flink. By writing Python functions, developers can define data transformations, and GlassFlow manages the underlying infrastructure, offering auto-scaling, low latency, and optimal data retention. The platform supports integration with various data sources and destinations, including Google Pub/Sub, AWS Kinesis, and OpenAI, through its Python SDK and managed connectors. GlassFlow provides a low-code interface for quick pipeline setup, allowing users to create and deploy pipelines within minutes. It also offers features such as serverless function execution, real-time API connections, and alerting and reprocessing capabilities. The platform is designed to simplify the creation and management of event-driven data pipelines, making it accessible for Python developers.
    Starting Price: $350 per month
  • 27
    Google Cloud Managed Service for Kafka
    Google Cloud's Managed Service for Apache Kafka is a fully managed and scalable service that simplifies the deployment, management, and maintenance of Apache Kafka clusters. It automates operational tasks such as provisioning, patching, and scaling, allowing users to focus on building applications without the complexities of infrastructure management. It ensures high availability and reliability by replicating data across multiple zones, safeguarding against potential failures. It also offers seamless integration with other Google Cloud services, enabling users to create robust data processing pipelines. Security is a priority, with features like encryption at rest and in transit, identity and access management, and network isolation to protect data. Google Cloud Managed Service for Kafka supports both public and private networking configurations, providing flexibility in connectivity options.
    Starting Price: $0.09 per hour
  • 28
    Waterstream

    SimpleMatter

    Waterstream turns your Kafka-compatible platform into a full-fledged MQTT broker. Connect millions of clients to your data streaming platform with no code, no integration pipelines, and no additional storage. Waterstream implements a bidirectional layer between Kafka and MQTT clients. Forget managing external MQTT clusters, integration pipelines to code, and data duplication. Waterstream scales out linearly. For most operations, its nodes don’t depend on each other. Add more instances to support an increasing number of clients. Waterstream requires only Kafka to operate. The built-in persistence benefits of using Kafka are all included: high availability, high throughput, and low latency.
  • 29
    SiteWhere

    SiteWhere

    SiteWhere infrastructure and microservices are deployed on Kubernetes, allowing for deployment on-premises or on almost any cloud provider. Highly-available configurations of Apache Kafka, ZooKeeper, and HashiCorp Consul provide infrastructure. Each microservice scales independently and integrates automatically. Complete multitenant IoT ecosystem including device management, event ingestion, big data event storage, REST APIs, data integration, and much more. Distributed architecture built with Java microservices running on Docker infrastructure with Apache Kafka processing pipeline. SiteWhere CE will always be open source and free for private as well as commercial use. The SiteWhere team offers free basic support and a steady stream of new features.
  • 30
    Yandex Managed Service for Apache Kafka
    Focus on developing data stream processing applications and don’t waste time maintaining the infrastructure. Managed Service for Apache Kafka is responsible for managing Zookeeper brokers and clusters, configuring clusters, and updating their versions. Distribute your cluster brokers across different availability zones and set the replication factor to ensure the desired level of fault tolerance. The service analyzes the metrics and status of the cluster and automatically replaces it if one of the nodes fails. For each topic, you can set the replication factor, log cleanup policy, compression type, and maximum number of messages to make better use of computing, network, and disk resources. You can add brokers to your cluster with just a click of a button to improve its performance, or change the class of high-availability hosts without stopping them or losing any data.
  • 31
    ksqlDB

    Confluent

    Now that your data is in motion, it’s time to make sense of it. Stream processing enables you to derive instant insights from your data streams, but setting up the infrastructure to support it can be complex. That’s why Confluent developed ksqlDB, the database purpose-built for stream processing applications. Make your data immediately actionable by continuously processing streams of data generated throughout your business. ksqlDB’s intuitive syntax lets you quickly access and augment data in Kafka, enabling development teams to seamlessly create real-time innovative customer experiences and fulfill data-driven operational needs. ksqlDB offers a single solution for collecting streams of data, enriching them, and serving queries on new derived streams and tables. That means less infrastructure to deploy, maintain, scale, and secure. With fewer moving parts in your data architecture, you can focus on what really matters: innovation.
  • 32
    Baidu AI Cloud Stream Computing
    Baidu Stream Computing (BSC) provides real-time streaming data processing with low latency, high throughput, and high accuracy. It is fully compatible with Spark SQL, so complex business logic can be expressed in easy-to-use SQL statements, and it provides full life cycle management for stream computing jobs. BSC integrates deeply with multiple Baidu AI Cloud storage products as the upstream and downstream of stream computing, including Baidu Kafka, RDS, BOS, IOT Hub, Baidu ElasticSearch, TSDB, SCS, and others. It also provides comprehensive job monitoring: users can view a job's monitoring indicators and set alarm rules to protect the job.
  • 33
    Kapacitor

    InfluxData

    Kapacitor is a native data processing engine for InfluxDB 1.x and is an integrated component in the InfluxDB 2.0 platform. Kapacitor can process both stream and batch data from InfluxDB, acting on this data in real-time via its programming language TICKscript. Today’s modern applications require more than just dashboarding and operator alerts—they need the ability to trigger actions. Kapacitor’s alerting system follows a publish-subscribe design pattern. Alerts are published to topics and handlers subscribe to a topic. This pub/sub model and the ability for these to call User Defined Functions make Kapacitor very flexible to act as the control plane in your environment, performing tasks like auto-scaling, stock reordering, and IoT device control. Kapacitor provides a simple plugin architecture, or interface, that allows it to integrate with any anomaly detection engine.
    Starting Price: $0.002 per GB per hour
  • 34
    Pulsar

    Venera Technologies

    Pulsar is an on-premises content quality verification solution and the world's fastest file-based automated QC system. It streamlines file-based QC workflows across content types and stages, reducing cost and increasing efficiency. Pulsar delivers high-speed performance – for HD content, it is six times faster than real-time speed and offers near real-time analysis for JPEG-2000 4K content. It supports various file formats and standards, including Dolby Vision, HDR-10, and HDR-10+ content, and has comprehensive support for IMF packages. And by providing in-built support for adaptive bitrate formats as well as fragmented MP4, it also automates the QC of Video-on-Demand assets.
  • 35
    Macrometa

    Macrometa

    We deliver a geo-distributed real-time database, stream processing and compute runtime for event-driven applications across up to 175 worldwide edge data centers. App & API builders love our platform because we solve the hardest problems of sharing mutable state across 100s of global locations, with strong consistency & low latency. Macrometa enables you to surgically extend your existing infrastructure to bring part of or your entire application closer to your end users. This allows you to improve performance, user experience, and comply with global data governance laws. Macrometa is a serverless, streaming NoSQL database, with integrated pub/sub and stream data processing and compute engine. Create stateful data infrastructure, stateful functions & containers for long running workloads, and process data streams in real time. You do the code, we do all the ops and orchestration.
  • 36
    HStreamDB
    A streaming database is purpose-built to ingest, store, process, and analyze massive data streams. It is a modern data infrastructure that unifies messaging, stream processing, and storage to help get value out of your data in real-time. Ingest massive amounts of data continuously generated from various sources, such as IoT device sensors. Store millions of data streams reliably in a specially designed distributed streaming data storage cluster. Consume data streams in real-time as fast as from Kafka by subscribing to topics in HStreamDB. With the permanent data stream storage, you can playback and consume data streams anytime. Process data streams based on event-time with the same familiar SQL syntax you use to query data in a relational database. You can use SQL to filter, transform, aggregate, and even join multiple data streams.
    Starting Price: Free
  • 37
    Apache Mesos

    Apache Mesos

    Apache Software Foundation

    Mesos is built using the same principles as the Linux kernel, only at a different level of abstraction. The Mesos kernel runs on every machine and provides applications (e.g., Hadoop, Spark, Kafka, Elasticsearch) with APIs for resource management and scheduling across entire datacenter and cloud environments. Native support for launching containers with Docker and AppC images. Support for running cloud native and legacy applications in the same cluster with pluggable scheduling policies. HTTP APIs for developing new distributed applications, for operating the cluster, and for monitoring. Built-in Web UI for viewing cluster state and navigating container sandboxes.
  • 38
    Apache Druid
    Apache Druid is an open source distributed data store. Druid’s core design combines ideas from data warehouses, timeseries databases, and search systems to create a high performance real-time analytics database for a broad range of use cases. Druid merges key characteristics of each of the 3 systems into its ingestion layer, storage format, querying layer, and core architecture. Druid stores and compresses each column individually, and only needs to read the ones needed for a particular query, which supports fast scans, rankings, and groupBys. Druid creates inverted indexes for string values for fast search and filter. Out-of-the-box connectors for Apache Kafka, HDFS, AWS S3, stream processors, and more. Druid intelligently partitions data based on time and time-based queries are significantly faster than traditional databases. Scale up or down by just adding or removing servers, and Druid automatically rebalances. Fault-tolerant architecture routes around server failures.
  • 39
    Eclipse Streamsheets
    Build professional applications to automate workflows, continuously monitor operations, and control processes in real-time. Your solutions run 24/7 on servers in the cloud and on the edge. Thanks to the spreadsheet user interface, you do not have to be a software developer. Instead of writing program code, you drag-and-drop data, fill cells with formulas, and design charts in a way you already know. All the protocols you need to connect to sensors and machines, such as MQTT, REST, and OPC UA, are on board. Streamsheets natively handles streaming data sources such as MQTT and Kafka. Pick up a topic stream, transform it and blast it back out into the endless streaming world. REST opens up the world: Streamsheets lets you connect to any web service or lets them connect to you. Streamsheets run in the cloud, on your servers, but also on edge devices like a Raspberry Pi.
  • 40
    DeltaStream

    DeltaStream

    DeltaStream

    DeltaStream is a unified serverless stream processing platform that integrates with streaming storage services. Think of it as the compute layer on top of your streaming storage. It provides the functionality of streaming analytics (stream processing) and streaming databases, along with additional features that make it a complete platform to manage, process, secure and share streaming data. DeltaStream provides a SQL-based interface where you can easily create stream processing applications such as streaming pipelines, materialized views, microservices and many more. It has a pluggable processing engine and currently uses Apache Flink as its primary stream processing engine. DeltaStream is more than just a query processing layer on top of Kafka or Kinesis. It brings relational database concepts to the data streaming world, including namespacing and role-based access control, enabling you to securely access, process and share your streaming data regardless of where it is stored.
  • 41
    Airy Messenger
    From Conversational AI assistants to customer service experiences, own your own conversational platform with open-source Airy. Airy Core is an open source, fully-featured, production-ready conversational platform. With Airy you can process conversational data from a variety of sources. Since Airy's infrastructure is built around Apache Kafka, it can process a large amount of conversations and messages simultaneously and stream the relevant conversational data to wherever you need it. Connect anything from our free open-source live chat plugin to Facebook Messenger & Google's Business Messages to your Airy Core. This is all possible through an ingestion platform that heavily relies on Apache Kafka to process incoming webhook data from different sources. We make sense of the data and reshape it into source-independent contacts, conversations, and messages.
  • 42
    Azure Storage

    Azure Storage

    Microsoft

    The Azure Storage platform is Microsoft's cloud storage solution for modern data storage scenarios. Azure Storage offers highly available, massively scalable, durable, and secure storage for a variety of data objects in the cloud. Azure Storage data objects are accessible from anywhere in the world over HTTP or HTTPS via a REST API. Azure Storage also offers client libraries for developers building applications or services with .NET, Java, Python, JavaScript, C++, and Go. Developers and IT professionals can use Azure PowerShell and Azure CLI to write scripts for data management or configuration tasks. The Azure portal and Azure Storage Explorer provide user-interface tools for interacting with Azure Storage. Durable and highly available. Redundancy ensures that your data is safe in the event of transient hardware failures. You can also opt to replicate data across data centers or geographical regions for additional protection from local catastrophes or natural disasters.
  • 43
    Rackspace

    Rackspace

    Rackspace

    Enhanced full-lifecycle cloud native development capabilities to help customers build modern applications for the future. Unlock the full potential of the cloud today with applications architected for tomorrow. Traditional approaches to cloud adoption focused on infrastructure and application migration, with very little attention to the underlying code. And while the cloud has always delivered the benefits of elasticity and scale, it can’t unleash its full potential until the code in your applications has been updated. Modern applications, built with cloud native technologies and modern architectures, allow you to access the full potential of the cloud, while increasing agility and helping you to accelerate innovation. Build self-healing, auto-scaling applications, unchained from the limitation of servers. Serverless architectures offer the highest efficiency and cost benefits of the cloud while pushing nearly all infrastructure and software management to the platform.
  • 44
    PubSub+ Platform
    Solace PubSub+ Platform helps enterprises design, deploy and manage event-driven systems across hybrid and multi-cloud and IoT environments so they can be more event-driven and operate in real-time. The PubSub+ Platform includes the powerful PubSub+ Event Brokers, event management capabilities with PubSub+ Event Portal, as well as monitoring and integration capabilities all available via a single cloud console. PubSub+ allows easy creation of an event mesh, an interconnected network of event brokers, allowing for seamless and dynamic data movement across highly distributed network environments. PubSub+ Event Brokers can be deployed as fully managed cloud services, self-managed software in private cloud or on-premises environments, or as turnkey hardware appliances for unparalleled performance and low TCO. PubSub+ Event Portal is a complimentary toolset for design and governance of event-driven systems including both Solace and Kafka-based event broker environments.
  • 45
    Materialize

    Materialize

    Materialize

    Materialize is a reactive database that delivers incremental view updates. We help developers easily build with streaming data using standard SQL. Materialize can connect to many different external sources of data without pre-processing. Connect directly to streaming sources like Kafka, Postgres databases, CDC, or historical sources of data like files or S3. Materialize allows you to query, join, and transform data sources in standard SQL and presents the results as incrementally updated materialized views. Queries are maintained and continually updated as new data streams in. With incrementally updated views, developers can easily build data visualizations or real-time applications. Building with streaming data can be as simple as writing a few lines of SQL.
    Starting Price: $0.98 per hour
  • 46
    Xeotek

    Xeotek

    Xeotek

    Xeotek helps companies develop and explore their data applications and streams faster with Xeotek's powerful desktop and web application. Xeotek KaDeck was designed to be used by developers, operations, and business users alike. Because business users, developers, and operations jointly gain insight into data and processes via KaDeck, the whole team benefits: fewer misunderstandings, less rework, more transparency. Xeotek KaDeck puts you in control of your data streams. Save hours of work by gaining insights at the data and application level in projects or day-to-day operations. Export, filter, transform and manage data streams in KaDeck with ease. Run JavaScript (NodeV4) code, transform & generate test data, view & change consumer offsets, manage your streams or topics, Kafka Connect instances, schema registry, and ACLs – all from one convenient user interface.
  • 47
    PeerDB

    PeerDB

    PeerDB

    If Postgres is at the core of your business and is a major source of data, PeerDB provides a fast, simple, and cost-effective way to replicate data from Postgres to data warehouses, queues, and storage. Designed to run at any scale, and tailored for data stores. PeerDB uses replication messages from the Postgres replication slot to replay the schema messages. Alerts for slot growth and connections. Native support for Postgres TOAST columns and large JSONB columns for IoT. Optimized query design to reduce warehouse costs; particularly useful for Snowflake and BigQuery. Support for partitioned tables via both publish. Blazing fast and consistent initial load by transaction snapshotting and CTID scans. High availability, in-place upgrades, autoscaling, advanced logs, metrics and monitoring dashboards, and burstable instance types suitable for dev environments.
    Starting Price: $250 per month
  • 48
    Artie

    Artie

    Artie

    Stream only the data that has changed to the destination. Eliminate data latency and reduce computational overhead. Change data capture (CDC) is a highly efficient method to sync data. Log-based replication is a non-intrusive way to replicate data in real time and does not impact source database performance. Set up the end-to-end solution in minutes, with zero pipeline maintenance. Let your data teams work on higher-value projects. Setting up Artie takes just a few simple steps. Artie will handle backfilling historical data and continuously stream new changes to the final table as they occur. Artie ensures data consistency and high reliability. In the event of an outage, Artie leverages offsets in Kafka to pick up where it left off, which helps maintain high data integrity while avoiding the burden of performing full re-syncs.
    Starting Price: $231 per month
  • 49
    Confluent

    Confluent

    Confluent

    Infinite retention for Apache Kafka® with Confluent. Be infrastructure-enabled, not infrastructure-restricted. Legacy technologies require you to choose between being real-time or highly-scalable. Event streaming enables you to innovate and win by being both real-time and highly-scalable. Ever wonder how your rideshare app analyzes massive amounts of data from multiple sources to calculate real-time ETA? Ever wonder how your credit card company analyzes millions of credit card transactions across the globe and sends fraud notifications in real-time? The answer is event streaming. Move to microservices. Enable your hybrid strategy through a persistent bridge to cloud. Break down silos to demonstrate compliance. Gain real-time, persistent event transport. The list is endless.
  • 50
    Conduktor

    Conduktor

    Conduktor

    We created Conduktor, the all-in-one friendly interface to work with the Apache Kafka ecosystem. Develop and manage Apache Kafka with confidence using Conduktor DevTools, the all-in-one Apache Kafka desktop client, and save time for your entire team. Apache Kafka is hard to learn and to use. Made by Kafka lovers, Conduktor's best-in-class user experience is loved by developers. Conduktor offers more than just an interface over Apache Kafka. It gives you and your teams control of your whole data pipeline, thanks to our integration with most technologies around Apache Kafka. It provides you and your teams with the most complete tool on top of Apache Kafka.