Control system and technique employing reinforcement learning having stability and learning phases,申请号US20020197731-传众专利搜索

发明名称	Control system and technique employing reinforcement learning having stability and learning phases
摘要	A feedback control system for automatic on-line training of a controller for a plant, the system having a reinforcement learning agent connected in parallel with the controller. The learning agent comprises an actor network and a critic network operatively arranged to carry out at least one sequence of a stability phase followed by a learning phase. During the stability phase, a multi-dimensional boundary of values is determined. During the learning phase, a plurality of updated weight values is generated in connection with the on-line training, if and until one of the updated weight values reaches the boundary, at which time a next sequence is carried out to determine a next multi-dimensional boundary of values followed by a next learning phase. Also, a method for automatic on-line training of a feedback controller within a system comprising the controller and a plant by employing a reinforcement learning agent comprising a neural network to carry out at least one sequence comprising a stability phase followed by a learning phase. Further included, a computer executable program code on a computer readable storage medium, for on-line training of a feedback controller within a system comprising the controller and a plant.
申请公布号	US2003074338(A1)	申请公布日期	2003.04.17
申请号	US20020197731	申请日期	2002.07.18
申请人	YOUNG PETER M.;ANDERSON CHARLES;HITTLE DOUGLAS C.;KRETCHMAR MATTHEW	发明人	YOUNG PETER M.;ANDERSON CHARLES;HITTLE DOUGLAS C.;KRETCHMAR MATTHEW
分类号	G05B13/02;(IPC1-7):G06E1/00;G06E3/00;G06F15/18;G06G7/00;G06N3/02	主分类号	G05B13/02
代理机构		代理人
主权项
地址