Apache Iceberg vs. Apache Parquet

Attribute	Apache Iceberg	Apache Parquet
Name	Apache Iceberg	Apache Parquet
Description	Apache Iceberg is a high-performance table format for huge analytic tables. It stores table data in Parquet, Avro, or ORC files.	Apache Parquet is an open source, column-oriented data file format designed for efficient data storage and retrieval.
License	Apache License 2.0	Apache License 2.0
Source code	https://github.com/apache/iceberg	https://github.com/apache/parquet-format
Website	https://iceberg.apache.org/	https://parquet.apache.org/
Year created	2017	2013
Company	Netflix	Twitter, Cloudera
Language support		Java, Scala, C++, Python, R, PHP
Use cases	Write once, read many; analytics; efficient storage; ACID transactions	Write once, read many; analytics; efficient storage; column-based queries
Is human readable	no	no
Orientation	column or row	column
Has type system	yes	yes
Has nested structure support	yes	yes
Has native compression	yes	yes
Has encoding support	yes	yes
Has constraint support	no	no
Has ACID support	yes	no
Has metadata	yes	yes
Has encryption support	maybe	yes
Data processing framework support	Apache Drill, Apache Flink, Apache Gobblin, Apache Pig, Apache Spark	Apache Beam, Apache Drill, Apache Flink, Apache Spark
Analytics query support	Apache Impala, Apache Druid, Apache Hive, AWS Athena, BigQuery, ClickHouse, Dremio, DuckDB, Presto, Trino	Apache Hive, Apache Impala, Apache Druid, Apache Pinot, AWS Athena, Azure Synapse, BigQuery, ClickHouse, Dremio, DuckDB, Firebolt