Title of invention: Method and apparatus for reward-based learning of improved systems management policies
Abstract: In one embodiment, the present invention is a method for reward-based learning of improved systems management policies. One embodiment of the inventive method involves supplying a first policy and a reward mechanism. The first policy maps states of at least one component of a data processing system to selected management actions, while the reward mechanism generates numerical measures of value responsive to particular actions (e.g., management actions) performed in particular states of the component(s). The first policy and the reward mechanism are applied to the component(s), and results achieved through this application (e.g., observations of corresponding states, actions and rewards) are processed in accordance with reward-based learning to derive a second policy having improved performance relative to the first policy in at least one state of the component(s).
Publication number: US2007203871(A1); Publication date: 2007.08.30
Application number: US20060337311; Filing date: 2006.01.23
Applicant: TESAURO GERALD J;DAS RAJARSHI;JONG NICHOLAS K;KEPHART JEFFRREY O; Inventor: TESAURO GERALD J.;DAS RAJARSHI;JONG NICHOLAS K.;KEPHART JEFFRREY O.
Classification: G06F17/00; Main classification: G06F17/00
Agency: Agent:
Main claim:
Address:
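
The loop described in the abstract (apply a first policy, record states, actions and rewards, then derive a second policy from those observations) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the three load levels, the server-allocation actions, the reward function, the random exploration rate, and the choice of tabular batch Q-learning as the reward-based learning method.

```python
import random

STATES = [0, 1, 2]   # hypothetical load levels: low, medium, high
ACTIONS = [1, 2, 3]  # hypothetical management action: servers to allocate


def first_policy(state):
    """Hypothetical hand-written initial policy: always allocate two servers."""
    return 2


def reward(state, action):
    """Hypothetical reward mechanism: unmet demand is costly, each server slightly so."""
    demand = state + 1
    shortfall = max(0, demand - action)
    return -10.0 * shortfall - 1.0 * action


def collect(policy, steps, explore=0.2, seed=0):
    """Apply the policy, with occasional random actions, and log (s, a, r, s')."""
    rng = random.Random(seed)
    log = []
    state = rng.choice(STATES)
    for _ in range(steps):
        action = rng.choice(ACTIONS) if rng.random() < explore else policy(state)
        next_state = rng.choice(STATES)  # assumed: load changes independently of the action
        log.append((state, action, reward(state, action), next_state))
        state = next_state
    return log


def learn_q(log, gamma=0.5, alpha=0.1, sweeps=50):
    """Batch Q-learning: replay the logged observations several times."""
    q = {(s, a): 0.0 for s, a, _, _ in log}  # only observed pairs get a value
    for _ in range(sweeps):
        for s, a, r, s2 in log:
            future = [q[(s2, b)] for b in ACTIONS if (s2, b) in q]
            target = r + gamma * (max(future) if future else 0.0)
            q[(s, a)] += alpha * (target - q[(s, a)])
    return q


def derive_policy(q, fallback):
    """Second policy: best observed action per state, else defer to the fallback."""
    def policy(state):
        seen = [a for a in ACTIONS if (state, a) in q]
        if not seen:
            return fallback(state)
        return max(seen, key=lambda a: q[(state, a)])
    return policy


log = collect(first_policy, steps=2000)
q_values = learn_q(log)
second_policy = derive_policy(q_values, first_policy)
```

In this toy setting the first policy under-provisions the high-load state, so the derived policy can improve on it there, which matches the abstract's requirement of better performance in at least one state. The exploration step is needed because a purely deterministic first policy would only ever reveal the reward of one action per state.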