Week 1: Lakehouse Architecture & Platform

Overview

Understand the evolution of data architectures from data warehouses to data lakes to the data lakehouse. Explore the Databricks platform: workspace navigation, Unity Catalog hierarchy, and compute resources.

Topics

#TypeTitleDuration
1.1.1VideoData Architecture Evolution8 min
1.1.2VideoLakehouse Architecture10 min
1.1.3VideoDatabricks and the Lakehouse8 min
1.2.1VideoDatabricks Overview10 min
1.2.2VideoWorkspace, Catalog & Data12 min
1.3.1VideoCompute Resources8 min
LabLakehouse Concepts30 min
LabWorkspace & Catalog30 min
QuizLakehouse Architecture15 min

Key Concepts

Data Architecture Evolution

EraArchitectureStrengthsWeaknesses
1980s–2000sData WarehouseACID, schema, BIExpensive, rigid, no unstructured
2010sData LakeCheap, flexible, any formatNo ACID, quality issues, "data swamp"
2020s+Data LakehouseBest of bothRequires modern platform

Lakehouse Properties

A data lakehouse provides:

  • ACID transactions on data lake storage (via Delta Lake)
  • Schema enforcement and evolution for data quality
  • Direct BI access to source data (no ETL to warehouse)
  • Unified batch and streaming in one architecture
  • Open formats (Parquet + Delta) — no vendor lock-in
  • Governance via Unity Catalog

Databricks Platform Architecture

  • Control Plane: Managed by Databricks — workspace UI, job scheduling, notebooks
  • Data Plane: Runs in your cloud account — compute clusters, data storage, processing
  • Unity Catalog: Three-level namespace (Metastore > Catalog > Schema > Table)
  • Compute Options: All-purpose clusters, job clusters, SQL warehouses, serverless

Certification Topics

Key accreditation concepts from this week:

  1. A data lakehouse combines warehouse reliability with lake flexibility
  2. Delta Lake provides ACID transactions on data lake storage
  3. Unity Catalog provides unified governance across all data assets
  4. The control plane is managed by Databricks; the data plane runs in your cloud
  5. Photon accelerates SQL queries without requiring code changes
  6. Open formats prevent vendor lock-in

Demo Code