Azure Data Engineering Project: Insightful Analytics with Sakila Database

Welcome to the GitHub repository for my Azure Data Engineering project! This repository contains the code and resources used to turn the Sakila MySQL sample database into business intelligence reports using Azure's cloud services.

Project Overview

In this project, I convert raw CSV data from the Sakila database into answers to concrete business questions. The pipeline covers data ingestion, storage, transformation, and visualization, all within Azure's ecosystem.

What's Inside:

Data Ingestion Scripts: Scripts used with Azure Data Factory to ingest data from raw GitHub URLs.
Data Transformation Notebooks: Azure Databricks notebooks containing Spark code for data transformations.
Visualization Dashboards: Samples of, or links to, the Power BI dashboards created from the processed data.
Documentation: Detailed explanations of the processes and code.
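
As a rough illustration of the ingestion step, the sketch below builds the kind of raw GitHub URLs that an Azure Data Factory HTTP source could be pointed at. The owner and repository name are this repository's; the branch name `main`, the `Data` folder as the CSV location, and the table file names are assumptions made for the example, not a listing of the actual files.

```python
from urllib.parse import quote

RAW_HOST = "https://raw.githubusercontent.com"


def raw_csv_url(owner, repo, branch, folder, table):
    """Return the raw-content URL for one table's CSV file."""
    parts = (owner, repo, branch, folder, f"{table}.csv")
    # Percent-encode each path segment so spaces or odd characters stay valid.
    path = "/".join(quote(part) for part in parts)
    return f"{RAW_HOST}/{path}"


# Hypothetical table names, used only for illustration.
TABLES = ["customer", "payment", "rental"]

urls = [
    raw_csv_url("sarmadafzalj", "AzureDataEngineering", "main", "Data", table)
    for table in TABLES
]

for url in urls:
    print(url)
```

A list like this could feed a parameterized copy activity, one iteration per table, so that adding a table means adding one name rather than one more pipeline.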

Tech Stack

Azure Data Factory: For data ingestion.
Azure Data Lake Storage Gen2: For primary data storage.
Azure Databricks: For data processing and transformation.
Power BI: For creating insightful visualizations.
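
To make the storage layer a little more concrete, here is a small sketch of how a Databricks notebook typically addresses files in the data lake, using the `abfss://` URI scheme. The storage account, container, and folder names below are placeholders, not the ones used in this project.

```python
def adls_uri(account, container, path):
    """Build an abfss:// URI for a path inside a data lake container."""
    clean_path = path.strip("/")  # avoid doubled or trailing slashes
    return f"abfss://{container}@{account}.dfs.core.windows.net/{clean_path}"


# Placeholder names, chosen only for this example.
raw_customers = adls_uri("examplestorage", "raw", "/sakila/customer.csv")
print(raw_customers)
```

In a notebook, a URI like this would usually be handed to a Spark reader, for example `spark.read.csv(raw_customers, header=True)`, once the cluster has been granted access to the storage account.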

Key Questions Answered

Who are our top 5 most valuable customers?
Which employees have processed the most orders?
How do sales trends vary across offices over the years?
What's the total sales figure for each year?
Which products are selling the least?
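
Questions such as the top customers and the yearly sales totals come down to group-and-sum aggregations. The project's notebooks use Spark for transformations; the sketch below uses only the Python standard library so it runs anywhere. The column names (`customer_id`, `amount`, `payment_date`) and the sample rows are made up for illustration and are not taken from the project's data.

```python
from collections import defaultdict
from decimal import Decimal

# Made-up sample rows; real data would come from the ingested CSV files.
payments = [
    {"customer_id": 1, "amount": "9.99", "payment_date": "2005-06-15"},
    {"customer_id": 2, "amount": "4.99", "payment_date": "2005-07-01"},
    {"customer_id": 1, "amount": "2.99", "payment_date": "2006-02-14"},
    {"customer_id": 3, "amount": "7.99", "payment_date": "2006-02-14"},
    {"customer_id": 2, "amount": "0.99", "payment_date": "2006-03-01"},
]


def top_customers(rows, n=5):
    """Return the n customers with the highest total payments."""
    totals = defaultdict(Decimal)
    for row in rows:
        totals[row["customer_id"]] += Decimal(row["amount"])
    # Highest total first; ties are broken by customer id for a stable order.
    return sorted(totals.items(), key=lambda item: (-item[1], item[0]))[:n]


def sales_by_year(rows):
    """Return total payments per calendar year, ordered by year."""
    totals = defaultdict(Decimal)
    for row in rows:
        year = int(row["payment_date"][:4])  # dates are ISO formatted
        totals[year] += Decimal(row["amount"])
    return dict(sorted(totals.items()))


print(top_customers(payments, n=2))
print(sales_by_year(payments))
```

`Decimal` is used instead of `float` so that currency totals do not pick up binary rounding errors. In Spark, the same logic would be a `groupBy` followed by a sum and an ordering.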

Snapshot of ADF pipeline

Snapshot of Power BI dashboard

Contribute

Feel free to fork this repository, experiment with the code, and suggest improvements! If you have any questions or feedback, don't hesitate to open an issue or submit a pull request.

Repository: sarmadafzalj/AzureDataEngineering