Skip to content

Latest commit

 

History

History
297 lines (260 loc) · 106 KB

rl.md

File metadata and controls

297 lines (260 loc) · 106 KB
description
强化学习 让我以另一种方式玩游戏吧

RL

深度强化学习训练智能体:超级玛丽

DQN in Pytorch Stream 3 of N | Atari Breakout + Logging and Monitoring

DeepMind Made A Superhuman AI For 57 Atari Games! 🕹 Two Minute Papers

Python Flappy Bird AI Tutorial (with NEAT) - Creating the Bird

A.I. Flappy Bird without Libraries from SCRATCH (Python/PyCharm) Max Teaches Tech

How to Solve a Basic Reinforcement Learning Example | RL Hello World

An introduction to Reinforcement Learning

平台

腾讯开悟(s, ) Reinforcement Learning(quora, )
TutsNode.com

OpenAI

OpenAI(site, git, Baselines, GPT-3) openai Research index s
OpenAI CEO, CTO on risks and how AI will reshape society ABC News
Sam Altman: OpenAI CEO on GPT-4, ChatGPT, and the Future of AI | Lex Fridman Podcast #367
Breakthrough potential of AI | Sam Altman | MIT 2023 Imagination in Action
OpenAI CEO Sam Altman testifies at Senate artificial intelligence hearing | full video CBS News
LIVE: OpenAI CEO Sam Altman testifies during Senate hearing on AI oversight — 05/16/23 CNBC Television
Open AI CEO第一次国会听证会内容介绍 Jeff科技视角
【OpenAI】萨姆奥特曼 Sam Altman出席国会听证会 | 积极拥抱政府监管 | AI企业要上牌照 | 建议成立国际组织 | AI将创造更多就业 | 不为赚钱只因热爱 最佳拍档

UP

Yuxi Li(u, ) Code Bullet(u, )
ClarityCoders u Greer Viau(u, )
DeepMind u Machine Learning with Phil u
Lex Fridman u 蓝仔的十八般武艺 抖音号: lanzai8888
Shusen Wang u en AI探长 抖音号: AITanzhang
Yifei Hu u Emergent Garden u
学渣程序员 抖音号:67129424878 Edan Meyer u
AI Prism u Saasha Nair u
Pourquoi (布瓜的世界) u

课程

强化学习基础(本科课程)-北京邮电大学 刘先生
秒懂强化学习 Reinforcement Learning 莫烦Python
强化学习 Reinforcement Learning Python 教学 教程 莫烦Python
什么是深度强化学习(DRL)?【知多少】 KnowingAI知智
什么是强化学习(Reinforcement Learning)?【知多少】KnowingAI知智
在Unity環境中訓練強化學習AI! AI葵
Tim & Heinrich — Democraticizing Reinforcement Learning Research Weights & Biases
Train AI to Play Snake – Reinforcement Learning Course (Python, PyTorch, Pygame) freeCodeCamp
Reinforcement learning with Snake-RL - Made with TensorFlow.js TensorFlow
Algorithmic SNAKES! (AI compilation) AlphaPhoenix
How does electricity find the "Path of Least Resistance"? AlphaPhoenix
贪吃蛇游戏数学算法人工智能AI创造世界纪录 Oziter茅 哈密尔顿回路
代码编程 Oziter茅 华容道
分步详解C语言贪吃蛇游戏 大雄的公开课
【Python】60行搞定贪吃蛇小游戏 Bennett Poitier
【python游戏编程教程】【小白友好版】贪吃蛇 Stephanie_程序媛 五子棋 三子棋 联机
我用30天写了一个完美的贪吃蛇AI 林亦LYi
MIT 6.S191: Reinforcement Learning Alexander Amini list
AI Learns to Play Normalized Nerd
Deep Maths - machine learning and mathematics Oxford Mathematics

Using AI to accelerate scientific discovery - Demis Hassabis (Crick Insight Lecture Series)

DeepMind

DeepMind x UCL | Reinforcement Learning Course 2018 DeepMind
Alpha Go weibin zhuang
CS885 Reinforcement Learning - Spring 2020 Pascal Poupart
CS885 Reinforcement Learning - Spring 2018 - University of Waterloo Pascal Poupart

CS234: Reinforcement Learning

深度强化学习完整版-2020秋-UC Berkeley CS285 by Sergey Levine Math4AI
Reinforcement Learning with Python(Nicholas Renotte)
A.I. Learns to play Flappy Bird(Code Bullet)
AI Learns to play... Code Bullet
AI is programmed to play... Code Bullet
AI Plays Flappy Bird - NEAT Python Tech With Tim
Python Pong AI Tutorial - Using NEAT Tech With Tim
Neural Network Learns to Play Snake Greer Viau
Reinforcement Learning Yannic Kilcher RE•WORK NPTEL-NOC IITM
Reinforcement Learning - Goal Oriented Intelligence deeplizard
Reinforcement Learning - Developing Intelligent Agents deeplizard
MarI/O - Machine Learning for Video Games SethBling
MarIQ -- Q-Learning Neural Network for Mario Kart -- 2M Sub Special SethBling
Reinforcement Learning - David Silver
Reinforcement Learning by David Silver 道法自然
Reinforcement Learning - Emma Brunskill | Stanford - OnlineHub Rahul Madhavan
reinforcement learning Matlab Raony Maia Fontes
秒懂强化学习 Reinforcement Learning 莫烦Python
强化学习基础(张志华)-北京大学 刘先生
深度强化学习基础 Shusen Wang
决胜AI-强化学习实战系列视频课程 唐宇迪 网易云课堂
讓人工智慧玩捉迷藏,最後居然發展出連人類都想不到的策略!? | 一探啾竟 第80集 | 啾啾鞋
OpenAI Plays Hide and Seek…and Breaks The Game! 🤖 Two Minute Papers
這是我看過最廢的人工智慧了... 啾啾鞋
CS 294-112 Deep Reinforcement Learning UC Berkeley coursehero eecs CAL ESG - EECS reddit
Python Bots Playing Games and More!! ClarityCoders
A.I. Battles ClarityCoders
Python Reinforcement Learning using Gymnasium – Full Course freeCodeCamp
Reinforcement Learning Course: Intro to Advanced Actor Critic Methods freeCodeCamp
Reinforcement Learning Course - Full Machine Learning Tutorial freeCodeCamp
Python AI Learns to Play the Chrome Dinosaur Game | Made with Pygame and NEAT enigma git
Build a Chrome Dino Game AI Model with Python | AI Learns to Play Dino Game Nicholas Renotte

Python A.I. (N.E.A.T.) Max teaches Tech

Chrome Dinosaur in Pygame Max teaches Tech

Pygame Tutorials Max teaches Tech

Python AI Learns To Play Flappy Bird! | Python NEAT and Pygame enigma git
Flappy Bird Tutorial Max teaches Tech
Intro to Reinforcement Learning 强化学习纲要 Bolei Zhou git
Reinforcement Learning sentdex
Reinforcement Learning with Stable Baselines 3 sentdex
Physics Simulator w/ Robot Dog sentdex
Starcraft 2 AI sentdex
永不坠落的小鸟—游戏中的人工智能 开发者学堂
An introduction to Reinforcement Learning Arxiv Insights
Reinforcement Learning with sparse rewards Arxiv Insights
An introduction to Policy Gradient methods - Deep Reinforcement Learning Arxiv Insights
Learning to Walk via Deep Reinforcement Learning Jie Tan
Equivariant RL Simons Institute

What is the Statistical Complexity of Reinforcement Learning?

强化学习的统计复杂性

Simons Institute

AI Learns To Draw New Pokemon Jabrils
Making My First Machine Learning Game Jabrils
Advanced Topics in Reinforcement Learning DeepPavlov

This AI Learned Boxing…With Serious Knockout Power! 🥊 Two Minute Papers

Control Strategies for Physically Simulated Characters Performing Two-player Competitive Sports Meta Research

Deep Reinforcement Learning in Python Tutorial - A Course on How to Implement Deep Learning Papers freeCodeCamp
Q Learning In Reinforcement Learning | Q Learning Example | Machine Learning Tutorial | Simplilearn
Artificial Intelligence Lessons Dr. Daniel Soper
Reinforcement Learning Steve Brunton
Reinforcement learning with TensorFlow Agents TensorFlow
TensorFlow and deep reinforcement learning, without a PhD (Google I/O '18) TensorFlow
The fastest matrix multiplication algorithm Dr. Trefor Bazett
Deep Reinforcement Learning: CS 285 Fall 2021 (UC Berkeley) RAIL
Deep Reinforcement Learning: CS 285 Fall 2020 RAIL
David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86 Lex Fridman
AI's Game Playing Challenge - Computerphile
Google's Deep Mind Explained! - Self Learning A.I. ColdFusion
Teach AI To Play Snake! Reinforcement Learning With PyTorch and Pygame Python Engineer
Download Practical AI with Python and Reinforcement Learning tut4dev
NVIDIA’s New AI Trained For 10 Years! But How? 🤺 Two Minute Papers
NVIDIA’s AI Plays Minecraft After 33 Years of Training! 🤖 Two Minute Papers
DeepMind Makes Prototyping Papers Easy with ACME Machine Learning with Phil
Deep Reinforcement Learning Tutorials - All Videos Machine Learning with Phil
Advanced Actor Critic and Policy Gradient Methods Machine Learning with Phil
Learning RL Algorithms via ML Edan Meyer
Research Talk: Dueling network architectures for deep reinforcement learning Stanford Scholar
Tutorial - Search Solutions 2020 - IRSG BCS Member Groups
혁펜하임의 “트이는” 강화 학습 (Reinforcement learning) 혁펜하임
Code Frozen Game Using Reinforcement Learning | OpenAI Gym | Python Project AI Sciences
Creating binance trading bot GUI | Python | Live trading AI Sciences
Fundamentals of Reinforcement Learning AI Sciences
深度強化學習簡介 (Deep Reinforcement Learning) Kuan-Ting Lai
Taipei Tech Deep Reinforcement Learning Kuan-Ting Lai
Ubisoft’s New AI: Breathing Life Into Games! Two Minute Papers
Superintelligence: Science or Fiction? | Elon Musk & Other Great Minds Future of Life Institute

Reinforcement Learning in 3 Hours

Reinforcement Learning Fundamentals Mutual Information
Reinforcement Learning AI Insights - Rituraj Kaushik
[Tutorialsplanet.NET] Udemy - Advanced AI Deep Reinforcement Learning in Python

[Tutorialsplanet.NET] Udemy - Artificial Intelligence Reinforcement Learning in Python

[Tutorialsplanet.NET] Udemy - Artificial Intelligence Reinforcement Learning in Python

Reinforcement Learning Krish Naik
Data-driven Optimization Workshop: Deep Reinforcement Learning in Supply Chain Optimizations Microsoft Research
【强化学习的数学原理】课程视频合集(从零开始透彻理解强化学习)Aerial robotics @ Westlake University
Talk | 悉尼科技大学在读博士生胡思逸:MARLlib,全新的多智能体强化学习框架 将门-TechBeat技术社区
Reinforcement Learning for Simple UAV Navigation Huy Pham

Reinforcement Learning: An Introduction pdf stanford Second edition, in progress

强化学习是一种机器学习的类型,涉及代理通过反复试验来学习如何在环境中做出决策。代理的目标是最大化由环境给出的奖励信号。代理学习采取导致最大可能奖励的行动,同时避免导致负面结果的行动。

Richard S. Sutton和Andrew G. Barto的《强化学习导论》一书全面介绍了强化学习领域。该书涵盖价值函数、蒙特卡罗方法、时序差分学习和策略梯度等主题。

该书的第一版于1998年出版,第二版目前正在编写中。第二版根据领域内最新进展更新了材料,并增加了有关深度强化学习和多智能体强化学习的新章节。

该书被广泛认为是关于强化学习的最权威的文本之一,并被该领域的研究人员和实践者用作参考。它适合本科和研究生学生,并为任何对学习或从事强化学习感兴趣的人提供了坚实的基础。

Learning From Passive Data Explained Edan Meyer

算法

DQN
Playing Atari with Deep Reinforcement Learning arxiv pdf pdf 2013.12
Reinforcement Learning - Ep. 30 (Deep Learning SIMPLIFIED) DeepLearning.TV
CURL: Contrastive Unsupervised Representations for Reinforcement Learning Machine Learning Street Talk
RLHF
Reinforcement Learning from Human Feedback: From Zero to chatGPT HuggingFace
John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges Berkeley EECS
How ChatGPT works - From Transformers to Reinforcement Learning with Human Feedback (RLHF) John Tan Chong Min
RLHF+CHATGPT: What you must know Machine Learning Street Talk
AI Safety, RLHF, and Self-Supervision - Jared Kaplan | Stanford MLSys #79 Stanford MLSys Seminars
【分享】State of GPT(GPT的现状)中文字幕精校版 | Andrej Karpathy 微软Build大会精彩演讲 | GPT状态和原理 | 解密OpenAI模型训练 最佳拍档
【機器學習 2023】(生成式 AI) Hung-yi Lee
PPO
RL — Proximal Policy Optimization (PPO) Explained medium
强化学习与ChatGPT:PPO 算法介绍和实际应用(中文介绍) Pourquoi (布瓜的世界)
Python Reinforcement Learning using Stable baselines. Mario PPO ClarityCoders
Evolution
Google AI Simulates Evolution On A Computer! 🦖 Two Minute Papers

具身智能 Embodied AI

【人工智能】具身智能:下一个AI浪潮 | 稚晖君 | Embodied AI | 什么是具身智能 | 目前发展阶段 | 挑战与困难 | 智元远征A1机器人 最佳拍档

游戏AI

Voyager
【人工智能】全新AI智能体Voyager | 自己学会玩minecraft | 全场景终身学习 | 性能完胜AutoGPT | 英伟达Nvidia最新发布 | NPC取代人类玩家 最佳拍档 无梯度架构 终身学习
arxiv MineDojo/Voyager
NVIDIA’s New AI Mastered Minecraft 15X Faster! Two Minute Papers

其他

DQN_HollowKnight(git, v, )
快手斗地主 DouZero(arxiv, git, s, reddit, paperswithcode, dczha, ) v
俄罗斯方块Tetris AI Learns to Play Tetris [Cocos Creator/TypeScript] Archi Tsai
Coding Adventure: Chess AI Sebastian Lague
How To Hack The Google Chrome Dinosaur Game [PYTHON] | Only 10 Lines Of Coding | Pyautogui | Numpy Know-How
Deep Reinforcement Learning in Python Tutorial freeCodeCamp
AI's Game Playing Challenge - Computerphile
AlphaStar: The inside story DeepMind
David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86 Lex Fridman
AlphaZero from Scratch – Machine Learning Tutorial freeCodeCamp
The story of AlphaGo DeepMind
AlphaGo full movie HD Zucci
阿尔法狗用什么算法击败李世石?《阿尔法围棋》 | 看电影了没
DeepMind AlphaStar Analysis and Impressions (StarCraft II) brownbear
StarCraft 2: Google DeepMind AlphaStar (A.I.) vs Pro Gamer! LowkoTV
Reinforcement Learning for Stock Prediction Siraj Raval
DeepMind's New AI: As Smart As An Engineer... Kind Of! 🤯 Two Minute Papers
Artificial Intelligence & Machine Learning ForrestKnight
Coding Challenge #71: Minesweeper The Coding Train
How to play Minesweeper Eric Buffington
Python Game Development Project Using OOP – Minesweeper Tutorial (w/ Tkinter) freeCodeCamp
蘑菇书 Easy RL 强化学习教程 s epubit datawhalechina/easy-rl errata db
alibaba/EasyReinforcementLearning
Reinforcement Learning in 3 Hours | Full Course using Python Nicholas Renotte
Discovering novel algorithms with AlphaTensor deepmind v
Deepmind AlphaTensor Algorithmic Discovery with AI | Paper + Code Simon Lermen AI
【線性代數 2022 (課程補充)】AlphaTensor: 用增強式學習 (Reinforcement Learning) 找出更有效率的矩陣相乘演算法 Hung-yi Lee

DRL, Deep Reinforcement Learning, 2018 Hung-yi Lee

ML Lecture 23-1: Deep Reinforcement Learning Hung-yi Lee

Machine Learning (Hung-yi Lee, NTU) Hung-yi Lee

This is a game changer! (AlphaTensor by DeepMind explained) Yannic Kilcher
AlphaFold 2 论文精读【论文精读】 Mu Li
Deep Reinforcement Learning with OpenAI Gym in Python NeuralNine
格斗之王!AI写出来的AI竟然这么强! 林亦LYi
DeepMind’s AI Athletes Play In The Real World! Two Minute Papers colab
AirSim

AirSim是由微软开发的一个开源的模拟器,用于模拟无人机、汽车和机器人等各种类型的机器人的行为和环境。它提供了高度可定制的环境,允许用户在虚拟场景中测试各种机器人算法,包括视觉SLAM、路径规划、控制等等。

AirSim的最大特点是其高度逼真的图形渲染引擎和物理模拟引擎。它使用了虚幻引擎作为渲染引擎,并使用了现代计算机图形学技术来模拟各种物理现象,例如惯性、空气阻力、摩擦力等等,以使得机器人在仿真环境中的行为和现实世界中的行为尽量相似。

AirSim还提供了一套API,使得用户可以轻松地控制和监测机器人的状态。这些API可以用C++、Python和ROS等语言和框架进行访问。

总之,AirSim为机器人研究和开发人员提供了一个快速、高效、低成本的测试平台,可以加速机器人技术的发展。

rl + 无人机

{% file src="../.gitbook/assets/Bilgin E. Mastering Reinforcement Learning with Python 2021.pdf" %}