情報処理装置、情報処理方法、及びプログラム,申请号JP20110224639-传众专利搜索

首页产品黄页商标征信

会员服务注册登录

法人/股东/高管

发明名称	情報処理装置、情報処理方法、及びプログラム
摘要	Provided is an information processing apparatus including: a reward estimator generating unit using action history data, which includes state data expressing a state of an agent, action data expressing an agent's action, and a reward value expressing a reward of the action, as learning data to generate, through machine learning, a reward estimator estimating the reward value from inputted state data and action data; an action selecting unit preferentially selecting an action not included in the action history data but with a high estimated reward value; and an action history adding unit causing the agent to perform the selected action and adding to the action history data the state data and action data for the action and the action's reward value in association with each other. The reward estimator is regenerated when a set of state data, action data, and the reward value is added to the action history data.
申请公布号	JP5879899(B2)	申请公布日期	2016.03.08
申请号	JP20110224639	申请日期	2011.10.12
申请人	ソニー株式会社	发明人	小林由幸
分类号	A63F13/56;A63F13/67;G06N3/08	主分类号	A63F13/56
代理机构		代理人
主权项
地址

您可能感兴趣的专利

DEVICE FOR SUPPORTING PLATTER OF WEIGHT SIZER

TRITIUM SEPARATING APPARATUS FROM WATER CONTAINING TRITIUM

UNBORNNCHILD MONITOR DEVICE

ELASTIC PLASTIC TABLEWARE

CAPACITORRSTART MOTOR

DEVICE FOR CONNECTING SHEATH OF PLASTIC CABLE

METHOD OF AND DEVICE FOR MANUFACTURING PLATE CONNECTOR WITH TEETH AND CONNECTOR

ACCOPPIATORE PER SCI AVENTE USO DI BARELLA D'EMERGENZA.

MACCHINA PER IL TAGLIO E LA SALDATURA DI FIBRE OTTICHE PER TELECOMUNI CAZIONI

SUOLA PER DOPO SCI IN MATERIALE PLASTICO STAMPATO A ZAMPA DI ELEFANTE.

SCIABOLA ELETTRONICA

MATERASSO ORTOPEDICO.

PLASTICO, PARTICOLARMENTE DI CORPI PROCEDIMENTO PER LA FABBRICAZIONE TUBOLARI A SEZIONE CIRCOLARE ODI CORPI CAVI CONTINUI IN MATERIALE POLIGONALE; APPARECCHIATURA PERL'ATTUAZIONE DEL DETTO PROCEDIMENTO E CORPI CAVI OTTENUTI MEDIANTE IL PROCEDIMENTO STESSO.