the.com/lakehouse
a data warehouse and a data lake had a kid, and it does chores now.
means a data architecture that stores raw files like a data lake but adds the structure, transactions, and speed of a data warehouse on top.
from coined around 2020 by databricks engineers frustrated with copying data between cheap sprawling lakes and expensive tidy warehouses, so they built one system that could be both.
key techtable formats like delta lake, iceberg, hudi
selling pointone copy of data, not two pipelines
tradeoffstill catching up on warehouse-grade query speed