Title of invention Real-time emotion tracking system
Abstract Devices, systems, methods, media, and programs for detecting an emotional state change in an audio signal are provided. A plurality of segments of the audio signal is received, with the plurality of segments being sequential. Each segment of the plurality of segments is analyzed, and, for each segment, an emotional state and a confidence score of the emotional state are determined. The emotional state and the confidence score of each segment are sequentially analyzed, and a current emotional state of the audio signal is tracked throughout each of the plurality of segments. For each segment, it is determined whether the current emotional state of the audio signal changes to another emotional state based on the emotional state and the confidence score of the segment.
Publication number US9047871(B2) Publication date 2015.06.02
Application number US201213712288 Filing date 2012.12.12
Applicant AT&T INTELLECTUAL PROPERTY I, L.P. Inventors Dimitriadis Dimitrios; Gilbert Mazin E.; Mishra Taniya; Schroeter Horst J.
Classification G10L21/00; G10L25/00; G10L15/00; G10L17/26 Main classification G10L21/00
Agency Greenblum & Bernstein, P.L.C. Agent Greenblum & Bernstein, P.L.C.
Independent claim 1. A device for detecting an emotional state change in an audio signal, the device comprising: a processor; and a memory storing instructions that, when executed by the processor, cause the processor to perform operations including: receiving a plurality of segments of the audio signal, the plurality of segments being sequential; sequentially analyzing each segment of the plurality of segments and determining, for each segment, an emotional state from among a plurality of emotional states and a confidence score of the emotional state; sequentially analyzing the emotional state and the confidence score of each segment and tracking a current emotional state of the audio signal throughout each of the plurality of segments; and determining, for each segment, whether the current emotional state of the audio signal changes to an other emotional state of the plurality of emotional states based on the emotional state and the confidence score of the segment, wherein the processor determines that the current emotional state of the audio signal changes to the other emotional state of the plurality of emotional states when the confidence score of the emotional state for each of a predetermined number of consecutive ones of the plurality of segments is less than a predetermined threshold.
Address Atlanta GA US
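
Illustrative sketch (not part of the patent record). The tracking rule in claim 1, where the current emotional state changes once the confidence score stays below a predetermined threshold for a predetermined number of consecutive segments, might be sketched roughly as follows. Everything here is an assumption made for illustration: the names SegmentResult and track_emotion, the two example states, the threshold of 0.5, the run length of 3, and in particular the choice to toggle between exactly two states, since the record does not say how the other state is selected. The per-segment classifier itself is outside the sketch; its outputs are taken as given.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass(frozen=True)
class SegmentResult:
    """Output of a hypothetical per-segment classifier."""
    emotion: str       # emotional state determined for the segment
    confidence: float  # confidence score of that state, assumed in [0, 1]


def track_emotion(
    results: Iterable[SegmentResult],
    states: Tuple[str, str] = ("neutral", "negative"),  # assumed two-state setup
    threshold: float = 0.5,                             # assumed threshold
    run_length: int = 3,                                # assumed consecutive count
) -> List[str]:
    """Return the tracked current state after each sequential segment.

    The state changes when `run_length` consecutive segments each have a
    confidence score below `threshold`. Toggling to the other of two states
    is an assumption of this sketch, not something the record specifies.
    """
    current = states[0]
    low_run = 0  # number of consecutive low-confidence segments seen so far
    history: List[str] = []
    for result in results:
        if result.confidence < threshold:
            low_run += 1
        else:
            low_run = 0  # a confident segment breaks the run
        if low_run >= run_length:
            current = states[1] if current == states[0] else states[0]
            low_run = 0  # start counting afresh in the new state
        history.append(current)
    return history


# Usage: seven sequential segments with made-up confidence scores.
scores = [0.9, 0.4, 0.3, 0.2, 0.8, 0.4, 0.9]
segments = [SegmentResult("neutral", s) for s in scores]
tracked = track_emotion(segments)
print(tracked)
```

With these made-up scores, segments 2 through 4 form the first run of three low-confidence segments, so the tracked state switches at segment 4 and is not switched back by the single low score at segment 6.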