OpenObserve
OpenObserve is an open source observability platform for logs, metrics, and traces that emphasizes high performance, scalability, and dramatically lower cost. It supports petabyte-scale observability thanks to features like data compression using columnar storage and the ability to use “bring your own bucket” storage (local disk, S3, GCS, Azure Blob, etc.). It is written in Rust, uses the DataFusion query engine to directly query Parquet files, and provides a stateless, horizontally scalable architecture with caching (both result and disk) to maintain speed under heavy load. It embraces open standards (OpenTelemetry compatibility, vendor-neutral APIs), so it fits into existing monitoring/logging workflows. Key modules include logs, metrics, traces, frontend monitoring, pipelines, alerts, and dashboards/visualizations.
Learn more
IBM Cloud SQL Query
Serverless, interactive querying for analyzing data in IBM Cloud Object Storage. Query your data directly where it is stored, there's no ETL, no databases, and no infrastructure to manage. IBM Cloud SQL Query uses Apache Spark, an open-source, fast, extensible, in-memory data processing engine optimized for low latency and ad hoc analysis of data. No ETL or schema definition needed to enable SQL queries. Analyze data where it sits in IBM Cloud Object Storage using our query editor and REST API. Run as many queries as you need; with pay-per-query pricing, you pay only for the data scan. Compress or partition data to drive savings and performance. IBM Cloud SQL Query is highly available and executes queries using compute resources across multiple facilities. IBM Cloud SQL Query supports a variety of data formats such as CSV, JSON and Parquet, and allows for standard ANSI SQL.
Learn more
Tenzir
Tenzir is a data pipeline engine specifically designed for security teams, facilitating the collection, transformation, enrichment, and routing of security data throughout its lifecycle. It enables users to seamlessly gather data from various sources, parse unstructured data into structured formats, and transform it as needed. It optimizes data volume, reduces costs, and supports mapping to standardized schemas like OCSF, ASIM, and ECS. Tenzir ensures compliance through data anonymization features and enriches data by adding context from threats, assets, and vulnerabilities. It supports real-time detection and stores data efficiently in Parquet format within object storage systems. Users can rapidly search and materialize necessary data and reactivate at-rest data back into motion. Tension is built for flexibility, allowing deployment as code and integration into existing workflows, ultimately aiming to reduce SIEM costs and provide full control.
Learn more
Polars
Knowing of data wrangling habits, Polars exposes a complete Python API, including the full set of features to manipulate DataFrames using an expression language that will empower you to create readable and performant code. Polars is written in Rust, uncompromising in its choices to provide a feature-complete DataFrame API to the Rust ecosystem. Use it as a DataFrame library or as a query engine backend for your data models.
Learn more