-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Labels
studyStudy research papers, etc.Study research papers, etc.
Description
Tao Bian 의 value iteration (VI) 기반 CT ADP 를 리뷰한다 [1, 2].
#10 은 해당 논문의 구현을 다룸.
Refs
[1] T. Bian and Z.-P. Jiang, “Value Iteration, Adaptive Dynamic Programming, and Optimal Control of Nonlinear Systems,” in 2016 IEEE 55th Conference on Decision and Control (CDC), Las Vegas, NV, USA, Dec. 2016, pp. 3375–3380. doi: 10.1109/CDC.2016.7798777.
[2] T. Bian and Z.-P. Jiang, “Reinforcement Learning and Adaptive Optimal Control for Continuous-Time Nonlinear Systems: A Value Iteration Approach,” IEEE Trans. Neural Netw. Learning Syst., pp. 1–10, 2021, doi: 10.1109/TNNLS.2020.3045087.
Metadata
Metadata
Labels
studyStudy research papers, etc.Study research papers, etc.