Title of invention: An automated action-selection system and method, and application thereof for training prediction machines and for driving the development of self-developing devices
Abstract: To promote efficient learning of the relationships inherent in a system or setup S described by system-state and context parameters, the next action to take, affecting the setup, is determined based on the knowledge gain expected to result from that action. Knowledge gain is assessed "locally" by comparing the value of a knowledge-indicator parameter after the action with the value of this indicator on one or more previous occasions when the system-state/context parameter(s) and action variable(s) had values similar to the current ones. Preferably, the "level of knowledge" is assessed based on the accuracy of predictions made by a prediction module. This technique can be applied to train a prediction machine by causing it to participate in the selection of a sequence of actions. It can also be applied to manage the development of a self-developing device or system, which performs a sequence of actions selected according to the action-selection technique.
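The selection principle summarised in the abstract can be illustrated with a minimal Python sketch. It is not the applicants' implementation: the class name LocalProgressSelector, the parameters k and explore, the nearest-neighbour lookup, and the older-versus-newer error comparison are all assumptions chosen for illustration. Prediction error stands in for the knowledge indicator, and the expected knowledge gain of a candidate action is estimated as the drop in prediction error observed in the most similar past situations.

```python
import math
import random


def distance(a, b):
    """Euclidean distance between two equal-length numeric tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class LocalProgressSelector:
    """Hypothetical selector: picks the action with the largest local error drop."""

    def __init__(self, k=4, explore=0.1, seed=0):
        self.k = k              # number of similar past situations to compare (assumed)
        self.explore = explore  # probability of a random action (assumed)
        self.history = []       # (context + action, prediction_error), in time order
        self.rng = random.Random(seed)

    def record(self, context, action, error):
        """Store the prediction error observed after taking `action` in `context`."""
        self.history.append((tuple(context) + tuple(action), error))

    def expected_gain(self, context, action):
        """Estimate knowledge gain as older mean error minus newer mean error
        among the k past situations most similar to (context, action)."""
        query = tuple(context) + tuple(action)
        nearest = sorted(
            range(len(self.history)),
            key=lambda i: distance(self.history[i][0], query),
        )[: self.k]
        if len(nearest) < 2:
            return float("inf")  # too little experience: treat as promising
        nearest.sort()  # restore chronological order
        errors = [self.history[i][1] for i in nearest]
        half = len(errors) // 2
        older = sum(errors[:half]) / half
        newer = sum(errors[half:]) / (len(errors) - half)
        return older - newer

    def select(self, context, candidates):
        """Return the candidate action with the highest expected knowledge gain."""
        if self.rng.random() < self.explore:
            return self.rng.choice(candidates)
        return max(candidates, key=lambda a: self.expected_gain(context, a))


# Usage: errors for action (0.0,) are falling, errors for action (1.0,) are flat.
selector = LocalProgressSelector(k=4, explore=0.0)
context = (0.0,)
for falling, flat in zip([1.0, 0.8, 0.5, 0.2], [0.9, 0.9, 0.9, 0.9]):
    selector.record(context, (0.0,), falling)
    selector.record(context, (1.0,), flat)
chosen = selector.select(context, [(0.0,), (1.0,)])
print(chosen)
```

In this sketch the selector favours the region where prediction error is still decreasing, rather than the region where error is merely high, which is one way to read the "local" comparison described in the abstract.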
Publication number: EP1622072(A1) Publication date: 2006.02.01
Application number: EP20040291912 Filing date: 2004.07.27
Applicant: SONY FRANCE S.A. Inventors: KAPLAN, FREDERIC; OUDEYER, PIERRE-YVES
Classification: G06N3/00 Main classification: G06N3/00
Agency: Agent:
Main claim:
Address: