发明名称 Risk quantification for policy deployment
摘要 Risk quantification, policy search, and automated safe policy deployment techniques are described. In one or more implementations, techniques are utilized to determine safety of a policy, such as to express a level of confidence that a new policy will exhibit an increased measure of performance (e.g., interactions or conversions) over a currently deployed policy. In order to make this determination, reinforcement learning and concentration inequalities are utilized, which generate and bound confidence values regarding the measurement of performance of the policy and thus provide a statistical guarantee of this performance. These techniques are usable to quantify risk in deployment of a policy, select a policy for deployment based on estimated performance and a confidence level in this estimate (e.g., which may include use of a policy space to reduce an amount of data processed), used to create a new policy through iteration in which parameters of a policy are iteratively adjusted and an effect of those adjustments are evaluated, and so forth.
申请公布号 GB201513640(D0) 申请公布日期 2015.09.16
申请号 GB20150013640 申请日期 2015.08.03
申请人 ADOBE SYSTEMS INCORPORATED 发明人
分类号 主分类号
代理机构 代理人
主权项
地址