Apache DataFusion

Apache DataFusion

Apache Software Foundation
Keen

Keen

Keen.io
+
+

Related Products

  • StarTree
    25 Ratings
    Visit Website
  • RaimaDB
    5 Ratings
    Visit Website
  • Google Cloud Platform
    56,320 Ratings
    Visit Website
  • Google Cloud BigQuery
    1,734 Ratings
    Visit Website
  • DbVisualizer
    489 Ratings
    Visit Website
  • TeamDesk
    92 Ratings
    Visit Website
  • MongoDB Atlas
    1,632 Ratings
    Visit Website
  • Quickbase
    2,607 Ratings
    Visit Website
  • Ninox
    542 Ratings
    Visit Website
  • ToucanTech
    168 Ratings
    Visit Website

About

Apache DataFusion is an extensible, high-performance query engine written in Rust that utilizes Apache Arrow as its in-memory format. Designed for developers building data-centric systems such as databases, data frames, machine learning, and streaming applications, DataFusion offers SQL and DataFrame APIs, a vectorized, multi-threaded, streaming execution engine, and support for partitioned data sources. It natively supports formats like CSV, Parquet, JSON, and Avro, and allows for seamless integration with object stores including AWS S3, Azure Blob Storage, and Google Cloud Storage. The engine features a comprehensive query planner, a state-of-the-art optimizer with capabilities like expression coercion and simplification, projection and filter pushdown, sort and distribution-aware optimizations, and automatic join reordering. DataFusion is highly customizable, enabling the addition of user-defined scalar, aggregate, and window functions, custom data sources, query languages, etc.

About

Keen is the fully managed event streaming platform. Built upon trusted Apache Kafka, we make it easier than ever for you to collect massive volumes of event data with our real-time data pipeline. Use Keen’s powerful REST API and SDKs to collect event data from anything connected to the internet. Our platform allows you to store your data securely decreasing your operational and delivery risk with Keen. With storage infrastructure powered by Apache Cassandra, data is totally secure through transfer through HTTPS and TLS, then stored with multi-layer AES encryption. Once data is securely stored, utilize our Access Keys to be able to present data in arbitrary ways without having to re-architect your security or data model. Or, take advantage of Role-based Access Control (RBAC), allowing for completely customizable permission tiers, down to specific data points or queries.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Professional developers and data engineers seeking a solution for building data-centric systems

Audience

B2B SaaS and B2C SaaS companies and DevOps teams

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

Free
Free Version
Free Trial

Pricing

$149 per month
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Apache Software Foundation
Founded: 2019
United States
datafusion.apache.org

Company Information

Keen.io
Founded: 2011
United States
keen.io

Alternatives

AnySQL Maestro

AnySQL Maestro

SQL Maestro Group

Alternatives

HyperSQL DataBase

HyperSQL DataBase

The hsql Development Group

Categories

Categories

Embedded Analytics Features

Ad hoc Query
Application Development
Benchmarking
Dashboard
Interactive Reports
Mobile Reporting
Multi-User Collaboration
Self Service Analytics
Streaming Analytics
Visual Workflow Management

Master Data Management Features

Data Governance
Data Masking
Data Source Integrations
Hierarchy Management
Match & Merge
Metadata Management
Multi-Domain
Process Management
Relationship Mapping
Visualization

Integrations

Amazon S3
Apache Arrow
Apache Avro
Apache Parquet
Bold BI
Causal
EMnify
Google Sheets
JSON
Kixie PowerCall & SMS
Pure360
Python
RudderStack
Runscope
Rust
Segment
Stackpile
Stripe
Style Intelligence
Twilio

Integrations

Amazon S3
Apache Arrow
Apache Avro
Apache Parquet
Bold BI
Causal
EMnify
Google Sheets
JSON
Kixie PowerCall & SMS
Pure360
Python
RudderStack
Runscope
Rust
Segment
Stackpile
Stripe
Style Intelligence
Twilio
Claim Apache DataFusion and update features and information
Claim Apache DataFusion and update features and information
Claim Keen and update features and information
Claim Keen and update features and information