Title of invention METHOD FOR THE COMPUTERIZED CONTROL AND/OR REGULATION OF A TECHNICAL SYSTEM
Abstract The invention concerns a method for the computerized control and/or regulation of a technical system. The method determines an action-selection rule (PO′) that has a low level of complexity and yet is well suited to regulating and/or controlling the technical system. The action-selection rule (PO′) is determined using an evaluation measure (EM) that is based on a distance measure and/or a reward measure and/or an action-selection rule evaluation method. The action-selection rule is then used to control and/or regulate the technical system. An advantage of the method is that the action-selection rule is comprehensible to a human expert. Preferably, the method is used for regulating and/or controlling a gas turbine and/or a wind turbine.
Publication number US2016040603(A1) Publication date 2016.02.11
Application number US201414780538 Filing date 2014.01.22
Applicant SIEMENS AKTIENGESELLSCHAFT Inventors Düll Siegmund; Hentschel Alexander; Udluft Steffen
Classification F02C9/00; G05B13/02 Main classification F02C9/00
Agency Agent
Main claim 1. A method for computerized control, regulation, or control and regulation of a technical system, the method comprising:
  characterizing a dynamic behavior of the technical system for multiple points in time in each case by a state of the technical system and an action executed on the technical system, wherein a respective action at a respective point in time results in a new state of the technical system at the next point in time;
  providing, generating, or providing and generating action selection policies, wherein a respective action selection policy specifies an action to be executed at a corresponding point in time on the technical system, in dependence on at least the state of the technical system at the corresponding point in time, and wherein each action selection policy is associated with a complexity measure that describes a complexity of the respective action selection policy that is less than or equal to a predetermined complexity threshold;
  ascertaining, from the provided, generated, or provided and generated action selection policies, the action selection policy having the highest evaluation measure by the calculation of evaluation measures, each of the evaluation measures describing the suitability of an action selection policy for the regulation, control, or regulation and control of the technical system, wherein a higher evaluation measure describes a better suitability of the action selection policy for the regulation, control, or regulation and control of the technical system, and wherein the evaluation measure of a respective action selection policy is dependent on:
    a distance measure between the respective action selection policy and a predefined optimum action selection policy, wherein decreasing distance measures represent higher evaluation measures;
    a reward measure that results upon the execution of the respective action selection policy in a simulation of the technical system, wherein higher reward measures result in higher evaluation measures;
    a quality measure for the respective action selection policy, which is determined by an action selection policy evaluation method, wherein higher quality measures result in higher evaluation measures; or
    any combination thereof;
  regulating, controlling, or regulating and controlling the technical system based on the ascertained action selection policy.
Address München DE
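
The steps of the main claim (candidate policies bounded by a complexity threshold, an evaluation measure per policy, selection of the highest-scoring policy, and control with it) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the one-dimensional toy system in `step`, the piecewise-constant `Policy` class, the branch count used as the complexity measure, the hand-written `reference_policy` standing in for the predefined optimum policy, and the unweighted combination of reward and distance in `evaluation_measure`. The quality measure from a policy evaluation method is left out.

```python
from dataclasses import dataclass
from itertools import combinations, product
from typing import Callable, List, Sequence, Tuple

ACTIONS = (-1, 0, 1)          # assumed discrete action set
THRESHOLD_GRID = (-0.5, 0.5)  # assumed candidate split points on the state axis


@dataclass(frozen=True)
class Policy:
    """Hypothetical piecewise-constant rule: one action per state interval."""
    thresholds: Tuple[float, ...]
    actions: Tuple[int, ...]  # always one more entry than thresholds

    def act(self, state: float) -> int:
        for threshold, action in zip(self.thresholds, self.actions):
            if state < threshold:
                return action
        return self.actions[-1]

    @property
    def complexity(self) -> int:
        # Assumed complexity measure: the number of rule branches.
        return len(self.actions)


def step(state: float, action: int) -> Tuple[float, float]:
    """Toy simulation: the action shifts the state; reward favors state 0."""
    next_state = state + action
    return next_state, -abs(next_state)


def generate_policies(max_complexity: int) -> List[Policy]:
    """Enumerate all policies whose complexity is at most the threshold."""
    policies = []
    for branches in range(1, max_complexity + 1):
        for thresholds in combinations(THRESHOLD_GRID, branches - 1):
            for actions in product(ACTIONS, repeat=branches):
                policies.append(Policy(thresholds, actions))
    return policies


def reward_measure(policy: Policy, start_states: Sequence[float],
                   horizon: int) -> float:
    """Average summed reward when the policy runs in the toy simulation."""
    total = 0.0
    for state in start_states:
        for _ in range(horizon):
            state, reward = step(state, policy.act(state))
            total += reward
    return total / len(start_states)


def distance_measure(policy: Policy, reference: Callable[[float], int],
                     probe_states: Sequence[float]) -> float:
    """Fraction of probe states where the policy and the reference differ."""
    mismatches = sum(policy.act(s) != reference(s) for s in probe_states)
    return mismatches / len(probe_states)


def reference_policy(state: float) -> int:
    """Stand-in for the predefined optimum policy (an assumption)."""
    if state < -0.5:
        return 1
    if state < 0.5:
        return 0
    return -1


def evaluation_measure(policy: Policy) -> float:
    """Higher reward raises the score; larger distance lowers it."""
    reward = reward_measure(policy, (-3.0, -1.0, 0.0, 2.0), horizon=5)
    distance = distance_measure(policy, reference_policy,
                                (-2.0, -1.0, 0.0, 1.0, 2.0))
    return reward - distance


def select_policy(max_complexity: int) -> Policy:
    """Pick the candidate with the highest evaluation measure."""
    return max(generate_policies(max_complexity), key=evaluation_measure)


def run_controller(policy: Policy, state: float, steps: int) -> float:
    """Apply the selected policy to the toy system and return the final state."""
    for _ in range(steps):
        state, _reward = step(state, policy.act(state))
    return state
```

In this toy setup, `select_policy(3)` picks a three-branch rule that pushes the state toward zero, and `run_controller` then applies it step by step. Because the selected policy is a short list of thresholds and actions, it can be read directly, which mirrors the comprehensibility advantage named in the abstract.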