发明名称 Agent learning machine
摘要 The invention provides a novel highly-adaptive agent learning machine comprising a plurality of learning modules each having a set of reinforcement learning system which works on an environment and determines an action output for maximizing a reward provided as a result thereof and an environment predicting system which predicts a change in the environment, wherein a responsibility signal is calculated such that the smaller a prediction error of the environment predicting system of each of the learning modules, the larger the value thereof, and the action output by the reinforcement learning system is weighted in proportion to the responsibility signal, thereby providing an action with regard to the environment. The machine switches and combines actions optimum to various states or operational modes of an environment without using any specific teacher signal and performs behavior learning flexibly without using any prior knowledge.
申请公布号 US6529887(B1) 申请公布日期 2003.03.04
申请号 US20000508850 申请日期 2000.05.18
申请人 AGENCY OF INDUSTRIAL SCIENCE AND TECHNOLOGY;ADVANCED TELECOMMUNICATION RESEARCH INSTITUTE INTERNATIONAL 发明人 DOYA KENJI;KAWATO MITSUO
分类号 G05B13/02;G05B13/04;G06F15/18;G06N3/00;(IPC1-7):G06F15/18 主分类号 G05B13/02
代理机构 代理人
主权项
地址