发明名称 AGENT LEARNING MACHINE
摘要 <p>The invention provides a novel highly-adaptive agent learning machine comprising a plurality of learning modules (3), each having a set of a reinforcement learning system (1) which works on an environment (4), and determines an action output for maximizing a reward provided as a result thereof, and an environment predicting system (2), which predicts a change in the environment, wherein a responsibility signal is calculated such that the smaller a prediction error of the environment predicting system (2) of each of the learning modules (3), the larger the value thereof, and the action output by the reinforcement learning system (1) is weighted in proportion to the responsibility signal, thereby providing an action with regard to the environment. The machine switches and combines actions optimum to various states or operational modes of an environment without using any specific teacher signal, and performs behaviour learning flexibly, without using any prior knowledge. &lt;IMAGE&gt;</p>
申请公布号 EP1016981(A1) 申请公布日期 2000.07.05
申请号 EP19990929751 申请日期 1999.07.08
申请人 JAPAN SCIENCE AND TECHNOLOGY CORPORATION 发明人 DOYA, KENJI;KAWATO, MITSUO
分类号 G05B13/02;G05B13/04;G06F15/18;G06N3/00;(IPC1-7):G06F15/18 主分类号 G05B13/02
代理机构 代理人
主权项
地址