A real-time data engineering project that ingests live Wikipedia edit events and processes them using the Databricks Lakehouse Medallion Architecture
apache-spark data-engineering databricks real-time-data data-pipeline real-time-analytics spark-structured-streaming etl-pipeline delta-lake streaming-data-pipelines databricks-sql medallion-architecture lakehouse-architectures event-stream-processing json-stream-processing
-
Updated
Mar 7, 2026 - Jupyter Notebook