发明名称 INFORMATION PROCESSOR, ITS METHOD AND PROVIDING MEDIUM
摘要 PROBLEM TO BE SOLVED: To generate an action plan capable of maximizing reward by less action experience. SOLUTION: In a step S1, prediction processing capable of obtaining maximum reward in a reccurent type neural network is executed by forward dynamics. In a step S2, plan generation processing is executed by reverse dynamics. Consequently a series of action difference values for obtaining the maximum reward are generated as an action plan. Processing mentioned above is repeatedly executed until it is judged that acquisition of a required action plan is done.
申请公布号 JP2000122992(A) 申请公布日期 2000.04.28
申请号 JP19990021791 申请日期 1999.01.29
申请人 SONY CORP 发明人 TANI ATSUSHI
分类号 G06F15/18;B25J13/00;G05B13/02;G05D1/02;G06N3/00;(IPC1-7):G06F15/18 主分类号 G06F15/18
代理机构 代理人
主权项
地址