Tag: Airflow

Simplify Airflow DAG Creation and Maintenance with Hamilton in 8 minutes

This post walks you through the benefits of having two open source projects, Hamilton and Airflow, and their directed acyclic graphs (DAGs) work in tandem. At a high level Airflow is responsible for orchestration (think macro) and Hamilton helps author clean and maintainable...

Simplify Airflow DAG Creation and Maintenance with Hamilton in 8 minutes

This post walks you through the benefits of having two open source projects, Hamilton and Airflow, and their directed acyclic graphs (DAGs) work in tandem. At a high level Airflow is responsible for orchestration (think macro) and Hamilton helps author clean and maintainable...

Data Engineering Project: Twitter Airflow Data Pipeline

Well, nowadays social media is abuzz with the legendary fight between Meta’s CEO Mark Zuckerberg and X’s owner Elon Musk. It has even escalated to the point of a cage fight between the two tech giants. Well we all know how that could turn out.   Musk Vs Zuckerberg Cage Fight ...

Data Engineering Project — IMDB Movie Analysis

In this article, I will create a data pipeline for transferring and analyzing movie data from IMDb. The data pipeline will be created using the following tools: Data ingestion: Web scraping from IMDB using Python Data storage: Google BigQuery Data analysis: DBT Data visualization: Pow...

Simplify Airflow DAG Creation and Maintenance with Hamilton in 8 minutes

This post walks you through the benefits of having two open source projects, Hamilton and Airflow, and their directed acyclic graphs (DAGs) work in tandem. At a high level Airflow is responsible for orchestration (think macro) and Hamilton helps author clean and maintainable...

Unlocking MLOps using Airflow: A Comprehensive Guide to ML System Orchestration

This tutorial represents lesson 4 out of a 7-lesson course that will walk you step-by-step through how to design, implement, and deploy an ML system using MLOps good practices. During the course, you will build a production-ready model to forecast energy consumption levels f...

Hello World! Airflow + Docker

Introduction What is Airflow? Airflow is a platform to programmatically author, schedule and monitor workflows. Airflow is invaluable for data professionals seeking to automate ETL workflows, streamline machine learning processes, or efficiently handle any task that requires automation with...

Hello World! Airflow + Docker

Airflow is a platform to programmatically author, schedule and monitor workflows. Airflow is invaluable for data professionals seeking to automate ETL workflows, streamline machine learning processes, or efficiently handle any task that requires automation within the realm of data management. ...

Setting up Apache-Airflow in Windows using WSL 2

In the previous story, you learned to set up Ubuntu 20.04 on Windows 10 as Linux Subsystem Distribution. In this article, I will walk you through the installation process of Apache Airflow in WSL 2 using a virtual environment. Installation of pip on WSL 2 To set up a virt...

Install Airflow on Windows without Docker or Virtual Box in 5 mins

Apache Airflow is an open-source platform used for orchestrating complex data workflows. It offers a powerful toolset for managing, scheduling, and monitoring workflows in a distributed environment. If you are a data engineer, scientist, or analyst, chances are you have heard of Airflow and its...

Simple airflow DAG using windows part 1

Airflow was developed by AirBNB software team in 2014, and it was made available for everyone. Apache Airflow is an open-source tool designed for orchestrating, scheduling, and monitoring data workflows. Airflow is a framework which has a UI, and can be used using python. The key factors that made a...