学术活动

学术活动 学院首页 > 学术活动 > 正文
学术报告_Reinforcement Learning Structures for Real-Time Optimal Control and Differential Games

作者:   时间:2019-05-07   点击数:

Reinforcement Learning Structures for

Real-Time Optimal Control and Differential Games

F. L. Lewis, National Academy of Inventors. Fellow IEEE, InstMC, and IFAC

Moncrief-O’Donnell Endowed Chair andHead,Advanced Controls & Sensors Group

UTA Research Institute (UTARI), The University of Texas at Arlington, USA

文本框: Abstract:This talk will discuss some new adaptive control structures for learning online the solutions to optimal control problems and multi-player differential games. Techniques from reinforcement learning are used to design a new family of adaptive controllers based on actor-critic mechanisms that converge in real time to optimal control and game theoretic solutions. Continuous-time systems are considered. Application of reinforcement learning to continuous-time (CT) systems has been hampered because the system Hamiltonian contains the full system dynamics. Using our technique known as Integral Reinforcement Learning (IRL), we will develop reinforcement learning methods that do not require knowledge of the system drift dynamics. In the linear quadratic (LQ) case, the new RL adaptive control algorithms learn the solution to the Riccati equation by adaptation along the system motion trajectories. In the case of nonlinear systems with general performance measures, the algorithms learn the (approximate smooth local) solutions of HJ or HJI equations. New algorithms will be presented for solving online the non zero-sum and zero-sum multi-player games. Each player maintains two adaptive learning structures, a critic network and an actor network. The result is an adaptive control system that learns based on the interplay of agents in a game, to deliver true online gaming behavior. A new class of Graphical Games is introduced that takes into account limited information flow in networked systems. A new Experience Replay technique is given that uses past data for present learning and significantly speeds up convergence. New methods of Off-policy Learning allow learning of optimal solutions without knowing any dynamic information. New RL methods in Optimal Tracking allow solution of the Output Regulator Equations for heterogeneous multi-agent systems.

BiosketchF.L. Lewis:Member, National Academy of Inventors. Fellow IEEE, Fellow IFAC, Fellow AAAS, Fellow U.K. Institute of Measurement & Control, PE Texas, U.K. Chartered Engineer. UTA Distinguished Scholar Professor, UTA Distinguished Teaching Professor, and Moncrief-O’Donnell Chair at The University of Texas at Arlington Research Institute. Founding Member Mediterranean Control Association. Qian Ren Thousand Talents Consulting Professor, Northeastern University, Shenyang, China.Ranked at position 81 worldwide, 59 in the USA, and 3 in Texas of all scientists in Computer Science and Electronics, by Guide2Research. Bachelor's Degree in Physics/EE and MSEE at Rice University, MS in Aeronautical Engineering at Univ. W. Florida, Ph.D. at Ga. Tech. He works in feedback control, reinforcement learning, intelligent systems, and distributed control systems. Author of 7 U.S. patents, 410 journal papers, 426 conference papers, 20 books, 48 chapters, and 12 journal special issues. He received the Fulbright Research Award, NSF Research Initiation Grant, ASEETerman Award,Int. Neural Network Soc.Gabor Award2009, U.K. Inst. Measurement & ControlHoneywell Field Engineering Medal2009. Received AACCRagazziniEducation Award 2018, IEEE Computational Intelligence SocietyNeural Networks Pioneer Award2012 and AIAAIntelligent Systems Award2016.IEEE Control Systems SocietyDistinguished Lecturer.Project 111 Professor at Northeastern University, China. Distinguished Foreign Scholar at Chongqing Univ. China. Received Outstanding Service Award from Dallas IEEE Section, selected as Engineer of the Year by Ft. Worth IEEE Section. Listed in Ft. Worth Business Press Top 200 Leaders in Manufacturing. Received the 2010IEEE Region 5 Outstanding Engineering Educator Award and the 2010 UTA Graduate Dean’s Excellence in Doctoral Mentoring Award. Elected to UTA Academy of Distinguished Teachers 2012. Texas Regents Outstanding Teaching Award 2013.He served on the NAE Committee on Space Station in 1995.

©2019控制科学与工程 学院 山东大学千佛山校区
山东省济南市经十路17923号 邮编250061