Handling Real Time insights of Delta Live Tables with Change Data Capture in Databricks
<p>Last week I was able to take a deep dive of <a href="https://matthewsalminen.medium.com/real-time-data-processing-with-delta-live-tables-use-cases-and-best-practices-for-databricks-2009a9a6fc16" rel="noopener">Delta Live Tables </a>in Databricks on how DLTs can handle your real-time data ingestion. Without going too far in depth, I wanted to explain a useful way of monitoring data changes as your DLTs consistently run with new updates to your records. The way we can monitor these data changes is by using <em>Change Data Capture</em>.</p>
<blockquote>
<p><strong>What is Change Data Capture? </strong>Change Data Capture or CDC is a process of identifying any data changes made to your data sources and moving those changes to the target. It enables tracking and recording of specific actions of your DLTs, primarily inserts, updates, and deletes to your data.</p>
</blockquote>
<p>Using CDC is useful as it monitors any changes to your data between different sources/DLTs and keeps history of these changes over time. CDC works alongside Delta lake and your metadata to ensure all changes are tracked accordingly.</p>
<p><a href="https://matthewsalminen.medium.com/handling-real-time-insights-of-delta-live-tables-with-change-data-capture-in-databricks-42f4394611a"><strong>Learn More</strong></a></p>