Skip to content

Initial stabilising gain 이 필요없는 Tao Bian's CT VI ADP 리뷰 #14

@JinraeKim

Description

@JinraeKim

Tao Bian 의 value iteration (VI) 기반 CT ADP 를 리뷰한다 [1, 2].

#10 은 해당 논문의 구현을 다룸.

Refs

[1] T. Bian and Z.-P. Jiang, “Value Iteration, Adaptive Dynamic Programming, and Optimal Control of Nonlinear Systems,” in 2016 IEEE 55th Conference on Decision and Control (CDC), Las Vegas, NV, USA, Dec. 2016, pp. 3375–3380. doi: 10.1109/CDC.2016.7798777.
[2] T. Bian and Z.-P. Jiang, “Reinforcement Learning and Adaptive Optimal Control for Continuous-Time Nonlinear Systems: A Value Iteration Approach,” IEEE Trans. Neural Netw. Learning Syst., pp. 1–10, 2021, doi: 10.1109/TNNLS.2020.3045087.

Metadata

Metadata

Labels

studyStudy research papers, etc.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions