摘要 |
FIELD: physics, acoustics.SUBSTANCE: invention relates to means for recognition of human emotions from voice. Intensity of the voice and tempo, defined by the rate at which the voice appears, are detected, respectively, and intonation which reflects the picture of intensity variation in each word pronounced by the voice is detected based on the input voice signal in form of a time value. A first variation value, indicating intensity variation of the detected voice in the direction of the time axis, a second variation value, indicating tempo variation of the voice in the direction of the time axis, and a third variation value indicating intonation variation of the voice in the direction of the time axis are obtained. The voice signal of a Russian-speaking subscriber is input and intensity of the voice and tempo is then detected. Once the third variation value is obtained, the base frequency of the voice signal is detected and a fourth variation value which indicates base frequency variation in the direction of the time axis is obtained; signals expressing the emotional state of anger, fear, grief and pleasure are generated, respectively, based on said first, second, third and fourth variation values.EFFECT: high accuracy of determining the emotional state of a Russian-speaking subscriber.3 dwg |