This is a curated list of publicly accessible machine learning courses from top universities such as Berkeley, Harvard, Stanford, and MIT. It also includes machine learning project case studies from large and experienced companies. The list is broken down by topics and areas of specialization. Python is the preferred language of choice as it covers end-to-end machine learning engineering.
Special thanks to the schools for making their course videos and assignments publicly available.
This awesome list uses the following conventions:
- β indicates a highly recommended course that is foundational or excellent for that topic
- πΊ indicates a link to video lectures for that course
- Course Name - Brief description of the course and what you will learn.
- Shortest Path to LLM / Agents
- TL;DR
- Computer Science
- Math and Statistics
- Artificial Intelligence
- Machine Learning
- Machine Learning Engineering
- Deep Learning Overview
- Specializations
- Case Studies
- License
- Contributing
Bare minimum list of courses to go through for basic background knowledge in LLM and AI Agents.
-
Berkeley CS188: Artificial Intelligence - Foundational course covering search, planning, and reasoning essential for understanding AI agents. β
-
Stanford CS231n: Convolutional Neural Networks for Visual Recognition - Deep learning for computer vision with practical assignments. [Assignment 2 Solution, Assignment 3 Solution] β
-
Stanford CS224n: Natural Language Processing with Deep Learning - Covers neural networks for NLP, language models, and transformers. [Reference Solutions] β
-
Berkeley CS285: Deep Reinforcement Learning - Comprehensive course on deep RL algorithms and policy gradient methods. β
-
Stanford CS336: Language Modeling from Scratch - Modern approach to building language models from first principles. β
Bare minimum list of courses to go through for basic knowledge in machine learning engineering.
-
MIT: The Missing Semester of Your CS Education - Essential practical skills for computer science and software development.
-
edX Harvard: CS50x: Introduction to Computer Science - Comprehensive introduction to computer science fundamentals.
-
MIT 18.05: Introduction to Probability and Statistics - Foundation for statistical understanding in machine learning.
-
Columbia COMS W4995: Applied Machine Learning - Applied ML with practical projects and real-world examples. πΊ
-
MIT 18.06: Linear Algebra - Essential mathematical foundation for machine learning.
-
Berkeley CS182: Designing, Visualizing, and Understanding Deep Neural Networks - Deep learning fundamentals with visual explanations and code. πΊ [Reference Solutions]
-
Berkeley: Full Stack Deep Learning - End-to-end ML engineering covering infrastructure and deployment.
Foundational computer science, Python, and SQL skills for machine learning engineering.
-
Grokking Algorithms - Visual introduction to algorithms and data structures with clear illustrations.
-
Google Python Style Guide - Industry standard Python code style and conventions.
-
Python Design Patterns - Common Python patterns and idioms for writing clean code.
-
Python3 Patterns - Python 3 specific patterns and best practices.
-
Design Patterns: Elements of Reusable Object-Oriented Software 1st Edition - Foundational book on software design patterns.
-
MIT: The Missing Semester of Your CS Education - Essential practical tools and skills for software development. β
-
edX MITX: Introduction to Computer Science and Programming Using Python - Learn Python fundamentals through problem-solving and applications. β
-
edX Harvard: CS50x: Introduction to Computer Science - Comprehensive CS fundamentals from Harvard University.
-
SQL for Data Analysis - Learn SQL for querying and analyzing data.
-
PostgreSQL Exercises - Hands-on SQL practice with real-world scenarios.
-
U Waterloo: CS794: Optimization for Data Science - Optimization techniques essential for machine learning.
-
Berkeley CS 170: Efficient Algorithms and Intractable Problems - Study of algorithms, computational complexity, and NP-completeness.
-
Berkeley CS 294-165: Sketching Algorithms - Algorithms for processing massive datasets efficiently.
-
MIT 6.824: Distributed Systems - Foundations of distributed systems and fault tolerance. πΊ
Linear algebra, statistics, and mathematical foundations for machine learning.
- NIST Engineering Statistics Handbook - Comprehensive reference on statistical methods and applications.
-
MIT 18.05: Introduction to Probability and Statistics - Essential probability and statistics for understanding machine learning. β
-
MIT 18.06: Linear Algebra - Comprehensive linear algebra covering vectors, matrices, and eigenvalues. β
-
Stanford Stats216: Statistical Learning - Statistical methods for learning from data with R labs. β
-
CalTech: Learning From Data - Theoretical foundations of machine learning and generalization.
-
A Students Guide to Bayesian Statistics - Introduction to Bayesian methods and probabilistic thinking.
-
Introduction to Linear Algebra for Applied Machine Learning with Python - Practical linear algebra with Python applications.
Artificial Intelligence is the superset of Machine Learning. These courses provide a high-level understanding of the field of AI, including searching, planning, logic, constraint optimization, and machine learning.
- Artificial Intelligence: A Modern Approach - Comprehensive textbook on AI algorithms and techniques.
-
Berkeley CS188: Artificial Intelligence - Foundational AI course covering search, planning, reasoning, and learning. β
-
edX ColumbiaX: Artificial Intelligence - Core AI concepts and algorithms with programming projects. [Reference Solutions]
Core machine learning theory and applied methods.
-
Mathematics for Machine Learning - Essential mathematics for understanding machine learning algorithms.
-
Concise Machine Learning - Concise overview of key machine learning concepts.
-
The Elements of Statistical Learning - Statistical foundations of supervised learning.
-
Mining of Massive Datasets - Algorithms for processing and analyzing large-scale data.
-
Pattern Recognition and Machine Learning - Comprehensive coverage of pattern recognition techniques. [Codes]
-
Cross-Industry Process for Data Mining methodology - Standard process for data mining and analytics projects.
-
Columbia COMS W4995: Applied Machine Learning - Applied ML with hands-on projects and real-world problem solving. πΊ β
-
Stanford CS229: Machine Learning - Comprehensive ML course covering supervised, unsupervised, and reinforcement learning. πΊ
-
Harvard CS 109A Data Science - Data science fundamentals and machine learning methods.
-
edX ColumbiaX: Machine Learning - Core machine learning algorithms and applications.
-
Berkeley CS294: Fairness in Machine Learning - Ethical considerations and fairness in machine learning systems.
-
Google: Machine Learning Crash Course - Practical introduction to machine learning from Google.
-
Google: AI Education - Comprehensive AI and ML educational resources from Google.
-
Google: Applied Machine Learning Intensive - Hands-on applied ML training from Google.
-
Cornell Tech CS5785: Applied Machine Learning - Applied ML techniques with programming assignments. πΊ
-
Probabilistic Machine Learning (Summer 2020) - Probabilistic approaches to machine learning. πΊ
-
AutoML - Automated Machine Learning - Techniques for automating machine learning workflows.
-
MIT: Data Centric AI - Focus on data quality and engineering for AI systems.
These courses help you bridge the gap from training machine learning models to deploying AI systems in the real world.
-
Machine Learning Engineering - Practical guide to building production ML systems.
-
Machine Learning System Design - Design principles for ML systems at scale.
-
Microsoft Commercial Software Engineering ML Fundamentals - ML engineering best practices from Microsoft.
-
Google Rules of ML - Practical rules for effective ML projects.
-
The Twelve Factors App - Principles for building scalable software applications.
-
Feature Engineering and Selection: A Practical Approach for Predictive Models - Feature engineering techniques and best practices.
-
Continuous Delivery for Machine Learning - CI/CD practices for machine learning systems.
-
Berkeley: Full Stack Deep Learning - End-to-end ML engineering from research to production. β
-
Stanford: CS 329S: Machine Learning Systems Design - Design and architecture of production ML systems. β
-
CMU: Machine Learning in Production - ML system design with focus on quality and reliability. github
-
Andrew Ng: Bridging AI's Proof-of-Concept to Production Gap - Practical insights on moving ML from research to production.
-
Facebook Field Guide to Machine Learning - Machine learning practices from Facebook.
-
Udemy: Deployment of Machine Learning Models - Practical guide to deploying ML models in production. β
-
Spark - Large-scale data processing with Apache Spark.
-
Udemy: The Complete Hands On Course To Master Apache Airflow - Workflow orchestration and scheduling for data pipelines.
Basic overview and foundations of deep learning.
-
Deep Learning - Comprehensive textbook on deep learning methods and theory.
-
Dive into Deep Learning - Interactive deep learning book with code examples.
-
The Matrix Calculus You Need For Deep Learning - Essential calculus concepts for understanding deep learning.
-
Berkeley CS182: Designing, Visualizing and Understanding Deep Neural Networks - Deep learning fundamentals with focus on understanding neural networks. πΊ [Reference Solutions]
-
Stanford CS 25: Transformers - Comprehensive course on transformer architectures and their applications. πΊ
-
Deeplearning.ai Deep Learning Specialization - Comprehensive specialization covering neural networks and deep learning. [Reference Solutions] β
-
NYU: Deep Learning - Deep learning course using PyTorch with code examples.
Recommendation systems are used when users do not know what they want and cannot use keywords to describe their needs. These systems learn user preferences and predict items of interest.
-
Mining of Massive Datasets - Foundational algorithms for recommendation systems at scale.
-
Speech and Language Processing - Natural language understanding for recommendation and search.
-
Dive into Deep Learning: Chapter 16 Recommender Systems - Deep learning approaches to recommendation systems.
- Stanford CS246: Mining Massive Data Sets - Algorithms for mining and analyzing massive datasets including recommendation systems.
Search and ranking systems are used when users have specific information needs and can use keywords to describe their queries.
- Introduction to Information Retrieval - Comprehensive introduction to information retrieval and search technology.
-
Stanford CS224U: Natural Language Understanding - NLU and Information Retrieval - NLU methods for information retrieval and question answering.
-
TU Wein: Crash Course IR - Fundamentals - Fundamentals of information retrieval systems.
-
UIUC: Text Retrieval and Search Engines - Search engine technology and text retrieval methods.
-
Stanford CS276: Information Retrieval and Web Search - Web-scale search and ranking algorithms.
-
University of Freiburg: Information Retrieval - Information retrieval concepts and implementation. πΊ
Modern NLP leverages deep learning and language models to understand and generate human language. Large language models have dramatically improved language understanding and generation capabilities.
-
Deep Learning - Deep learning methods for NLP and sequence models.
-
Introduction to Natural Language Processing - Foundations of natural language processing.
-
Speech and Language Processing - Comprehensive NLP textbook covering linguistic and statistical methods.
-
Stanford CS224n: Natural Language Processing with Deep Learning - Neural networks and deep learning for NLP covering word embeddings, RNNs, and transformers. [Reference Solutions] β
-
Berkeley CS182: Designing, Visualizing, and Understanding Deep Neural Networks - Deep neural network fundamentals applicable to NLP. πΊ [Reference Solutions]
-
Stanford: CS336: Language Modeling from Scratch - Modern language models built from first principles. β
-
NYU: DS-GA 1011 Natural Language Processing with Representation Learning - NLP with representation learning and neural methods.
-
Deeplearning.ai Natural Language Processing Specialization - Comprehensive NLP specialization from DeepLearning.AI. [Reference Solutions]
Computer vision systems extract meaning from images and video. Modern vision-language models combine visual and textual understanding for comprehensive scene interpretation.
- Deep Learning - Deep learning methods for computer vision tasks.
-
Stanford CS231n: Convolutional Neural Networks for Visual Recognition - Comprehensive course on CNNs for image recognition and understanding. [Assignment 2 Solution, Assignment 3 Solution] β
-
Berkeley CS182: Designing, Visualizing, and Understanding Deep Neural Networks - Deep neural network fundamentals with applications to vision. πΊ [Reference Solutions]
Unsupervised learning discovers patterns in data without labeled examples. Generative models learn to create new data samples with similar properties to the training data.
-
Stanford CS236: Deep Generative Models - Generative models including VAEs, GANs, and flow-based models.
-
Berkeley CS294-158: Deep Unsupervised Learning - Deep learning methods for unsupervised learning tasks.
Foundation models are large models trained on broad data that can be adapted to many downstream tasks. These courses cover language models, multi-modal models, and model adaptation.
-
Stanford CS324: Large Language Models (Winter 2022) - Comprehensive course on large language models covering architecture, training, and applications.
-
Stanford CS324: Advances in Foundation Models (Winter 2023) - Advanced topics in foundation models including multi-modal and domain-specific models.
Reinforcement learning enables agents to learn optimal behaviors through interaction with environments. These courses cover policy gradient methods, value-based learning, and deep reinforcement learning.
-
Reinforcement Learning - Foundational textbook on RL by Richard Sutton and Andrew Barto.
-
Deep Learning - Deep learning methods for reinforcement learning.
-
Coursera: Reinforcement Learning Specialization - Comprehensive RL specialization recommended by Richard Sutton, the author of the foundational RL textbook. β
-
Berkeley CS182: Designing, Visualizing, and Understanding Deep Neural Networks - Deep neural network fundamentals with RL applications. πΊ [Reference Solutions]
-
Stanford CS234: Reinforcement Learning - Reinforcement learning algorithms and applications.
-
Berkeley CS285: Deep Reinforcement Learning - Deep RL covering policy gradients, Q-learning, and actor-critic methods. β
-
CS 330: Deep Multi-Task and Meta Learning - Meta-learning and multi-task learning for fast adaptation. Videos
-
Berkeley: Deep Reinforcement Learning Bootcamp - Intensive bootcamp on deep reinforcement learning fundamentals.
-
OpenAI Spinning Up - Practical introduction to deep RL from OpenAI.
-
IDS at Stanford RL Forum - Video 1 of RL forum discussion. Video 2 Slides
Robotics applies machine learning and control theory to physical systems. These courses cover kinematics, dynamics, planning, and learning for robotic control.
π€
-
ColumbiaX: CSMM.103x Robotics - Robotics fundamentals covering kinematics, dynamics, and control.
-
UC Berkeley CS 287: Advanced Robotics - Advanced robotics covering planning, learning, and control for manipulation.
Technical case studies from companies applying and scaling machine learning systems.
-
Google: Best Practices for Machine Learning - Rules and best practices for machine learning projects from Google.
-
ByteDance: How TikTok Wins The Social Media Recommendation System War - Technical breakdown of TikTok's recommendation algorithm. (transcription)
-
NerdWallet: How NerdWallet Dialed Machine Learning Up to 11 - Case study on scaling ML infrastructure at fintech company.
-
AI Dungeon: How we scaled AI Dungeon 2 to support over 1,000,000 users - Infrastructure lessons from scaling a generative AI application.
-
Spotify: The Winding Road to Better Machine Learning Infrastructure Through Tensorflow Extended and Kubeflow - Building ML infrastructure for music recommendation at Spotify.
-
Trigo: How Trigo built a scalable AI development & deployment pipeline for Frictionless Retail - AI/ML pipeline development for computer vision retail applications.
All books, blogs, and courses are owned by their respective authors.
This compilation and reference solutions are released under the CC0 1.0 Universal license, which means:
- You are free to use this compilation for any purpose (personal, educational, commercial)
- No permission or attribution is required
- All copyrighted content referenced remains owned by its original authors
When using reference solutions or citing this work, you may optionally use:
@misc{leehanchung,
author = {Lee, Hanchung},
title = {Awesome Full Stack Machine Learning Engineering Courses},
year = {2020},
howpublished = {GitHub Repository},
url = {https://github.com/leehanchung/awesome-full-stack-machine-learning-courses}
}See LICENSE for full CC0 1.0 Universal license details.
Contributions are welcome! If you have course recommendations, case studies, or improvements to share, please follow the contribution guidelines.
To contribute:
- Fork the repository
- Create your feature branch
- Add your changes following the format:
- [Name](URL) - Brief description. - Submit a pull request
Thank you for helping improve this resource!











