摘要 |
PROBLEM TO BE SOLVED: To provide a control technique for learning a method of generating operation signal with which a controlled object can be safely operated even at an early stage of learning. SOLUTION: A controller is provided with functions to generate an operation signal to be applied to the controlled object 100 and to the model 400 for imitating the characteristics of the controlled object, receive an evaluation value signal calculated on the basis of a measurement signal obtained as a result of applying the operation signal to the controlled object and the model, and to learn a method of generating the operation signal so that an expected value of a total sum of the evaluation value signals obtained from the current state to a future state becomes minimum or maximum. In the controller, a first evaluation value 206 obtained on the basis of a deviation between a measurement signal 205 from the model and a target value and a second evaluation value 207 obtained on the basis of a difference in characteristics between the model and the controlled object are added, and an evaluation value signal 208 calculated on the basis of the measurement signal from the model is calculated. COPYRIGHT: (C)2007,JPO&INPIT
|