发明名称 情報処理装置、情報処理方法、及び、プログラム
摘要 An information processing apparatus optimizes an action in a transition model in which a number of objects in each state transits according to the action. A cost constraint acquisition unit acquires multiple cost constraints including one that constrains a total cost of the action over at multiple timings and/or multiple states. A processing unit assumes action distribution in each state at each timing as a decision variable in an optimization problem and maximizes an objective function subtracting a term based on an error between an actual number of objects with the action in each state at each timing and an estimated number of objects in each state at each timing based on state transition by the transition model, from a total reward in a whole period, satisfying the multiple cost constraints. An output unit outputs the action distribution in each state at each timing that maximizes the objective function.
申请公布号 JP5963320(B2) 申请公布日期 2016.08.03
申请号 JP20140067159 申请日期 2014.03.27
申请人 インターナショナル・ビジネス・マシーンズ・コーポレーションINTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 高橋 力矢;吉住 貴幸;水田 秀行
分类号 G06N7/00;G06Q30/02 主分类号 G06N7/00
代理机构 代理人
主权项
地址