File

Attribute	Apache ORC	Apache Hudi
Name	Apache ORC	Apache Hudi
Description	ORC is a self-describing type-aware columnar file format designed for Hadoop workloads.	Apache Hudi is a transactional data lake platform that brings database and data warehouse capabilities to the data lake. Utilises data stored in either parquet or orc.
License	Apache license 2.0	Apache license 2.0
Source code	https://github.com/apache/orc	https://github.com/apache/hudi
Website	https://orc.apache.org/	https://hudi.apache.org/
Year created	2013	2016
Company	Hortonworks, Facebook	Uber
Language support	java, scala, c++, python
Use cases	Write once read many, Analytics, Efficient storage, ACID transactions	Incremental data processing, Data upserts, Change Data Capture (CDC), ACID transactions
Is human readable	no	no
Orientation	row	column or row
Has type system	yes	yes
Has nested structure support	yes	yes
Has native compression	yes	yes
Has encoding support	yes	yes
Has constraint support	no	yes
Has acid support	no	yes
Has metadata	yes	yes
Has encryption support	yes	maybe
Data processing framework support	Apache Flink, Apache Gobblin, Apache Hadoop, Apache NiFi, Apache Pig, Apache Spark,	Apache Spark, Apache Flink,
Analytics query support	Apache Impala, Apache Druid, Apache Hive, Apache Pinot, AWS Athena, BigQuery, Clickhouse, Firebolt, Presto, Trino,	Apache Hive, Apache Impala, AWS Athena, BigQuery, Clickhouse, Presto, Trino,