发明名称 INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM
摘要 An information processing device includes: a calculating unit configured to calculate a current-state series candidate that is a state series for an agent capable of actions reaching the current state, based on a state transition probability model obtained by performing learning of the state transition probability model stipulated by a state transition probability that a state will be transitioned according to each of actions performed by an agent capable of actions, and an observation probability that a predetermined observation value will be observed from the state, using an action performed by the agent, and an observation value observed at the agent when the agent performs an action; and a determining unit configured to determine an action to be performed next by the agent using the current-state series candidate in accordance with a predetermined strategy.
申请公布号 US2010318478(A1) 申请公布日期 2010.12.16
申请号 US20100791240 申请日期 2010.06.01
申请人 YOSHIIKE YUKIKO;KAWAMOTO KENTA;NODA KUNIAKI;SABE KOHTARO 发明人 YOSHIIKE YUKIKO;KAWAMOTO KENTA;NODA KUNIAKI;SABE KOHTARO
分类号 G06N5/02;G06F15/18 主分类号 G06N5/02
代理机构 代理人
主权项
地址