| Name |
CSV |
Apache Iceberg |
| Description |
Comma-Separated Values (CSV) is a text file format that uses commas to separate values in plain text. |
Iceberg is a high-performance format for huge analytic tables. Utilises data stored in either parquet, avro, or orc. |
| Source code |
|
https://github.com/apache/iceberg |
| Website |
https://www.rfc-editor.org/rfc/rfc4180.html |
https://iceberg.apache.org/ |
| Language support |
java, scala, c++, python, r, php, go |
|
| License |
N/A |
Apache license 2.0 |
| Year created |
0 |
2017 |
| Company |
|
Netflix |
| Use cases |
|
Write once read many, Analytics, Efficient storage, ACID transactions |
| Is human readable |
yes
|
no
|
| Orientation |
row |
column or row |
| Has type system |
no
|
yes
|
| Has nested structure support |
no
|
yes
|
| Has native compression |
no
|
yes
|
| Has encoding support |
no
|
yes
|
| Has constraint support |
no
|
no
|
| Has acid support |
no
|
yes
|
| Has metadata |
no
|
yes
|
| Has encryption support |
no
|
maybe
|
| Data processing framework support |
Apache Beam,
Apache Drill,
Apache Flink,
Apache Gobblin,
Apache Hive,
Apache NiFi,
Apache Pig,
Apache Spark,
|
Apache Drill,
Apache Flink,
Apache Gobblin,
Apache Pig,
Apache Spark,
|
| Analytics query support |
Apache Impala,
Apache Druid,
Apache Pinot,
AWS Athena,
Azure Synapse,
BigQuery,
Clickhouse,
Dremio,
DuckDB,
Firebolt,
|
Apache Impala,
Apache Druid,
Apache Hive,
AWS Athena,
BigQuery,
Clickhouse,
Dremio,
DuckDB,
Presto,
Trino,
|