Apache Hudi

Apache Hudi

Apache Corporation
+
+

Related Products

  • AnalyticsCreator
    46 Ratings
    Visit Website
  • Teradata VantageCloud
    992 Ratings
    Visit Website
  • Google Cloud BigQuery
    1,939 Ratings
    Visit Website
  • Docket
    58 Ratings
    Visit Website
  • Secure Eraser
    11 Ratings
    Visit Website
  • Kamatera
    152 Ratings
    Visit Website
  • BrewPOS
    8 Ratings
    Visit Website
  • Curtain LogTrace File Activity Monitoring
    4 Ratings
    Visit Website
  • Cerberus FTP Server
    159 Ratings
    Visit Website
  • TeamDesk
    92 Ratings
    Visit Website

About

Hudi is a rich platform to build streaming data lakes with incremental data pipelines on a self-managing database layer, while being optimized for lake engines and regular batch processing. Hudi maintains a timeline of all actions performed on the table at different instants of time that helps provide instantaneous views of the table, while also efficiently supporting retrieval of data in the order of arrival. A Hudi instant consists of the following components. Hudi provides efficient upserts, by mapping a given hoodie key consistently to a file id, via an indexing mechanism. This mapping between record key and file group/file id, never changes once the first version of a record has been written to a file. In short, the mapped file group contains all versions of a group of records.

About

Powered by Apache Doris, VeloDB is a modern data warehouse for lightning-fast analytics on real-time data at scale. Push-based micro-batch and pull-based streaming data ingestion within seconds. Storage engine with real-time upsert、append and pre-aggregation. Unparalleled performance in both real-time data serving and interactive ad-hoc queries. Not just structured but also semi-structured data. Not just real-time analytics but also batch processing. Not just run queries against internal data but also work as a federate query engine to access external data lakes and databases. Distributed design to support linear scalability. Whether on-premise deployment or cloud service, separation or integration of storage and compute, resource usage can be flexibly and efficiently adjusted according to workload requirements. Built on and fully compatible with open source Apache Doris. Support MySQL protocol, functions, and SQL for easy integration with other data tools.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Data Warehouse solution that helps companies with streaming primitives over hadoop compatible storages

Audience

Organizations interested in a powerful real-time data warehouse

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Apache Corporation
Founded: 1954
United States
hudi.apache.org

Company Information

VeloDB
Founded: 2023
Singapore
www.velodb.io

Alternatives

Apache Iceberg

Apache Iceberg

Apache Software Foundation

Alternatives

Apache Doris

Apache Doris

The Apache Software Foundation
Apache Doris

Apache Doris

The Apache Software Foundation

Categories

Categories

Integrations

Apache Doris
Apache Flink
Apache Kafka
Apache Spark
MySQL
AWS Marketplace
Alluxio
Amazon Athena
Amazon Redshift
Apache Cassandra
Apache Hive
Azure Data Lake
CelerData Cloud
DataHub
Hadoop
PostgreSQL
Presto
PuppyGraph
dbt
e6data

Integrations

Apache Doris
Apache Flink
Apache Kafka
Apache Spark
MySQL
AWS Marketplace
Alluxio
Amazon Athena
Amazon Redshift
Apache Cassandra
Apache Hive
Azure Data Lake
CelerData Cloud
DataHub
Hadoop
PostgreSQL
Presto
PuppyGraph
dbt
e6data
Claim Apache Hudi and update features and information
Claim Apache Hudi and update features and information
Claim VeloDB and update features and information
Claim VeloDB and update features and information