MONTE-CARLO PLANNING USING CONTEXTUAL INFORMATION, Application No. US201213348993 - 传众 Patent Search

Title of invention	MONTE-CARLO PLANNING USING CONTEXTUAL INFORMATION
Abstract	A method, system and computer program product for choosing actions in a state of a planning problem. The system simulates one or more sequences of actions, state transitions and rewards starting from the current state of the planning problem. During the simulation of performing a given action in a given state, a data record is maintained of the observed contextual state information and the observed cumulative reward resulting from the action. The system performs a regression fit on the data records, enabling estimation of expected reward as a function of contextual state. The estimates of expected reward are used to guide the choice of actions during the simulations. Upon completion of all simulations, the top-level action that obtained the highest mean reward during the simulations is recommended for execution in the current state of the planning problem.
Publication No.	US2013185039(A1)	Publication date	2013.07.18
Application No.	US201213348993	Filing date	2012.01.12
Applicant	TESAURO GERALD J.;BEYGELZIMER ALINA;SEGAL RICHARD B.;WEGMAN MARK N.;INTERNATIONAL BUSINESS MACHINES CORPORATION	Inventor	TESAURO GERALD J.;BEYGELZIMER ALINA;SEGAL RICHARD B.;WEGMAN MARK N.
Classification	G06G7/48	Main classification	G06G7/48
Agency		Agent
Main claim
Address
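
The loop described in the abstract (simulate, record context and cumulative reward per action, fit a regression, use the estimates to guide later simulations, then recommend the top-level action with the highest mean reward) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the toy line-walking problem (`step`, `GOAL`, `HORIZON`), the use of the current position as the only contextual feature, the one-variable least-squares fit (`fit_line`), and the epsilon-greedy rule in `choose`.

```python
import random
from collections import defaultdict

ACTIONS = (-1, +1)       # hypothetical toy actions: step left or right
GOAL, HORIZON = 6, 4     # hypothetical toy problem parameters


def step(state, action):
    """Toy transition: move along a line; reward 1.0 on reaching GOAL."""
    nxt = max(0, min(GOAL, state + action))
    return nxt, (1.0 if nxt == GOAL else 0.0)


def fit_line(records):
    """Least-squares fit reward ~ a + b * context; None with < 2 records."""
    n = len(records)
    if n < 2:
        return None
    mx = sum(x for x, _ in records) / n
    my = sum(y for _, y in records) / n
    sxx = sum((x - mx) ** 2 for x, _ in records)
    if sxx == 0:
        return (my, 0.0)  # all contexts identical: fall back to the mean
    b = sum((x - mx) * (y - my) for x, y in records) / sxx
    return (my - b * mx, b)


def choose(state, models, rng, eps=0.2):
    """Epsilon-greedy choice guided by the regression estimates."""
    if rng.random() < eps or any(models.get(a) is None for a in ACTIONS):
        return rng.choice(ACTIONS)
    est = {a: models[a][0] + models[a][1] * state for a in ACTIONS}
    best = max(est.values())
    return rng.choice([a for a in ACTIONS if est[a] == best])  # random ties


def plan(start, n_sims=300, seed=0):
    """Run simulations from `start` and recommend a top-level action."""
    rng = random.Random(seed)
    records = defaultdict(list)  # action -> [(context, cumulative reward)]
    top = defaultdict(list)      # top-level action -> simulated returns
    models = {}
    for _ in range(n_sims):
        state, trace, rewards = start, [], []
        for _ in range(HORIZON):
            action = choose(state, models, rng)
            trace.append((state, action))
            state, reward = step(state, action)
            rewards.append(reward)
            if reward > 0:
                break
        # Walk backwards so `ret` is the cumulative reward from each step on.
        ret = 0.0
        for (s, a), r in zip(reversed(trace), reversed(rewards)):
            ret += r
            records[a].append((float(s), ret))
        top[trace[0][1]].append(ret)  # `ret` is now the whole-run return
        models = {a: fit_line(records[a]) for a in ACTIONS}
    return max(top, key=lambda a: sum(top[a]) / len(top[a]))


if __name__ == "__main__":
    print("recommended top-level action:", plan(3))
```

In this toy setting a first step to the left from position 3 can never reach the goal within the horizon, so its mean simulated reward stays at zero and the step to the right is recommended. A realistic planner would likely use richer contextual features and a more capable regression model than the single-feature line fitted here.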
