Title of invention: AGENT LEARNING MACHINE
Abstract: A novel, highly adaptable agent learning machine comprises a plurality of learning modules (3), each consisting of a reinforcement learning system (1), which acts on an environment (4) and determines an action output that maximizes the reward obtained as a result, and an environment predicting system (2), which predicts changes in the environment. The smaller the prediction error of the environment predicting system (2) of a learning module (3), the larger the responsibility signal assigned to that module. The action output of each reinforcement learning system (1) is weighted in proportion to its responsibility signal, and the resulting action is applied to the environment. In an environment with nonlinearity or nonstationarity, such as a controlled object or system, no specific teacher signal is given; the machine switches among and combines the actions best suited to the various states and operating modes of the environment. Action can thus be learned flexibly without relying on prior knowledge.
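The weighting step described in the abstract can be illustrated with a minimal sketch. The abstract states only that a smaller prediction error gives a larger responsibility signal and that action outputs are weighted in proportion to it; the Gaussian softmax form, the `sigma` parameter, and the function names below are assumptions made for illustration, not taken from the record.

```python
import math


def responsibility_signals(prediction_errors, sigma=1.0):
    """Map per-module prediction errors to weights that sum to 1.

    Assumed form: softmax over -error^2 / (2 * sigma^2), so a smaller
    error yields a larger responsibility signal. `sigma` is a
    hypothetical sharpness parameter.
    """
    scores = [-(e * e) / (2.0 * sigma * sigma) for e in prediction_errors]
    top = max(scores)  # subtract the maximum for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


def combined_action(module_actions, responsibilities):
    """Weight each module's scalar action output by its responsibility."""
    return sum(r * a for r, a in zip(responsibilities, module_actions))


# Three hypothetical learning modules: prediction errors and proposed actions.
errors = [0.1, 0.5, 2.0]
actions = [1.0, -1.0, 0.0]

lam = responsibility_signals(errors)
u = combined_action(actions, lam)
```

In this sketch the first module predicts the environment best, so its action dominates the combined output; a smaller `sigma` moves the behavior toward hard switching between modules, a larger one toward blending.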
Publication No.: CA2303874(A1)  Publication date: 2000.01.27
Application No.: CA19992303874  Filing date: 1999.07.08
Applicant: JAPAN SCIENCE AND TECHNOLOGY CORPORATION  Inventors: KAWATO, MITSUO; DOYA, KENJI
Classification: G05B13/02; G05B13/04; G06F15/18; G06N3/00; (IPC1-7): G06F15/18  Main classification: G05B13/02
Agency:  Agent:
Main claim:
Address: