Data Engineer @AstraZeneca | Founder @DataMasteryLab | AI & Big Data Architect | YouTuber @CodeWithYu | Teaching 50K+ students worldwide
I build AI-powered, production-grade data systems and architect big data solutions for the future:
- π€ AI & Machine Learning (MLOps, LLMs, Generative AI, Vector Databases)
- π§ Big Data Engineering (Spark, Kafka, Airflow, Flink, dbt)
- βοΈ Cloud & Distributed Systems
- π Real-time Streaming & Intelligent Analytics
- π§ AI-Native Data Platforms & Data Mesh
Building the future of data with:
- Generative AI integration into data pipelines
- Real-time ML systems and intelligent data workflows
- Scalable big data architectures for AI workloads
- LLM fine-tuning and RAG (Retrieval-Augmented Generation)
- Next-gen data platforms and AI-powered analytics
- π Data Mastery Lab - My AI & Data educational platform
- π₯ YouTube - Code With Yu - End-to-end data engineering tutorials
- βοΈ Medium - 3K+ followers | Writing about AI, Big Data & Future Tech
- πΌ LinkedIn - Let's connect!
- π Udemy - Teaching AI-powered data engineering & emerging technologies
MSc in Computational Intelligence and Data Analytics | Cranfield University
Empowering the next generation of data & AI professionals to build intelligent, scalable, future-ready solutions.
π« Let's collaborate on AI and big data projects!
- Real-Time Stock Market Anomaly Detection Using Machine Learning: An End-to-End Data Engineeringβ¦
- Building Realtime Data Warehouse with Apache Airflow, Redpanda, Pinot and Superset
- Decodable vs. AWS Managed Service for Apache Flink (MSF): An End-to-End Data Engineering Showdown
- Apache Spark vs Apache Flink: Choosing the Right Tools and Technologies
- End to End Data Engineering for Data Lakehouse with Airflow, Minio, Kafka, Apache Spark, Apacheβ¦




