Name |
Delta Lake |
Apache Avro |
Description |
Delta Lake is an open-source storage framework that enables building a Lakehouse architecture. |
Apache Avro is the leading serialization format for record data, and first choice for streaming data pipelines. |
License |
Apache license 2.0 |
Apache license 2.0 |
Source code |
https://github.com/delta-io/delta |
https://github.com/apache/avro |
Website |
https://delta.io/ |
https://avro.apache.org/ |
Year created |
2019 |
2009 |
Company |
Databricks |
Apache |
Language support |
scala, java, python, rust |
java, c++, c#, c, python, javascript, perl, ruby, php, rust |
Use cases |
Write once read many, Analytics, Efficient storage, ACID transactions |
Stream processing, Analytics, Efficient data exchange |
Is human readable |
no
|
no
|
Orientation |
column |
row |
Has type system |
yes
|
yes
|
Has nested structure support |
yes
|
yes
|
Has native compression |
yes
|
yes
|
Has encoding support |
yes
|
yes
|
Has constraint support |
yes
|
no
|
Has acid support |
yes
|
no
|
Has metadata |
yes
|
yes
|
Has encryption support |
maybe
|
no
|
Data processing framework support |
Apache Drill,
Apache Flink,
Apache Spark,
|
Apache Flink,
Apache Gobblin,
Apache NiFi,
Apache Pig,
Apache Spark,
|
Analytics query support |
Apache Hive,
AWS Athena,
Azure Synapse,
BigQuery,
Clickhouse,
Dremio,
Presto,
Trino,
|
Apache Impala,
Apache Druid,
Apache Hive,
Apache Pinot,
AWS Athena,
BigQuery,
Clickhouse,
Firebolt,
|