Title of invention |
INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM |
Abstract |
An information processing device includes: a calculating unit configured to calculate, based on a state transition probability model, a current-state series candidate, that is, a state series by which an agent capable of performing actions reaches the current state; and a determining unit configured to determine, in accordance with a predetermined strategy, the action to be performed next by the agent using the current-state series candidate. The state transition probability model is defined by a state transition probability that a state will transition according to each action performed by the agent, and an observation probability that a predetermined observation value will be observed from the state. The model is obtained by learning that uses the actions performed by the agent and the observation values observed at the agent when the agent performs those actions.
|
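The two units named in the abstract can be illustrated with a minimal sketch, assuming the learned model is an action-conditioned hidden Markov model. This is not the patent's method: the function names, the use of Viterbi decoding to obtain a single most likely series, and the entropy-based strategy are all hypothetical stand-ins, since the abstract only specifies "a predetermined strategy".

```python
import math

def viterbi_current_state_series(trans, obs_prob, init, actions, observations):
    """Return one most likely state series ending at the current state.

    trans[u][i][j]: probability of moving from state i to state j under action u.
    obs_prob[i][o]: probability of observing value o in state i.
    init[i]: initial probability of state i.
    observations holds o_0 plus one value per action, so it is one longer
    than actions. (Hypothetical layout, not taken from the patent.)
    """
    n = len(init)
    # delta[j]: probability of the best series so far that ends in state j
    delta = [init[i] * obs_prob[i][observations[0]] for i in range(n)]
    back = []
    for u, o in zip(actions, observations[1:]):
        new_delta, pointers = [], []
        for j in range(n):
            best_i = max(range(n), key=lambda i: delta[i] * trans[u][i][j])
            pointers.append(best_i)
            new_delta.append(delta[best_i] * trans[u][best_i][j] * obs_prob[j][o])
        delta = new_delta
        back.append(pointers)
    # Backtrack from the most likely current state
    state = max(range(n), key=lambda j: delta[j])
    series = [state]
    for pointers in reversed(back):
        state = pointers[state]
        series.append(state)
    series.reverse()
    return series

def choose_next_action(trans, series):
    """Assumed strategy: pick the action whose outcome from the current
    state is most uncertain (highest entropy of the next-state distribution)."""
    current = series[-1]

    def entropy(row):
        return -sum(p * math.log(p) for p in row if p > 0)

    return max(range(len(trans)), key=lambda u: entropy(trans[u][current]))

# Toy model: 2 states, 2 actions, 2 observation values (illustrative numbers)
trans = [
    [[0.9, 0.1], [0.1, 0.9]],  # action 0: mostly stay in place
    [[0.5, 0.5], [0.5, 0.5]],  # action 1: outcome is uncertain
]
obs_prob = [[0.8, 0.2], [0.2, 0.8]]
init = [0.5, 0.5]

series = viterbi_current_state_series(trans, obs_prob, init,
                                      actions=[0, 0], observations=[0, 0, 0])
next_action = choose_next_action(trans, series)
print(series, next_action)
```

With these toy numbers the decoded series stays in state 0 and the assumed strategy selects action 1, whose outcome from state 0 is the least predictable. A goal-directed strategy could replace `choose_next_action` without changing the decoding step.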
Publication number |
US2010318478(A1) |
Publication date |
2010.12.16 |
Application number |
US20100791240 |
Filing date |
2010.06.01 |
Applicant |
YOSHIIKE YUKIKO;KAWAMOTO KENTA;NODA KUNIAKI;SABE KOHTARO |
Inventor |
YOSHIIKE YUKIKO;KAWAMOTO KENTA;NODA KUNIAKI;SABE KOHTARO |
Classification number |
G06N5/02;G06F15/18 |
Main classification number |
G06N5/02 |
Patent agency |
|
Patent agent |
|
Principal claim |
|
Address |
|