the.com/databricks
the company that sold shovels to everyone mining the big-data gold rush, then bought the mine.
means a data and AI platform built around apache spark that lets companies store, process, and analyze massive datasets, then build machine learning models on top of them.
from founded in 2013 by the creators of apache spark, the uc berkeley research project that outran hadoop mapreduce; they built a company to commercialize it and named it after the bricks of data infrastructure it stacks together.
valuation 2024valued around 43 billion dollars privately
lakehouse termcoined lakehouse to merge data lakes and warehouses
spark origingrew directly out of the open source spark project
snowflake rivalfierce rivalry with snowflake over data platform dominance
for instance
unity catalog — its governance layer for tracking data and ai assets across clouds
mosaic ml acquisition — bought mosaicml for 1.3 billion dollars in 2023 for llm training
dbrx model — released its own open source large language model in 2024