Title of invention: Real-time emotion tracking system
Abstract: Devices, systems, methods, media, and programs for detecting an emotional state change in an audio signal are provided. A plurality of segments of the audio signal is received, with the plurality of segments being sequential. Each segment of the plurality of segments is analyzed, and, for each segment, an emotional state and a confidence score of the emotional state are determined. The emotional state and the confidence score of each segment are sequentially analyzed, and a current emotional state of the audio signal is tracked throughout each of the plurality of segments. For each segment, it is determined whether the current emotional state of the audio signal changes to another emotional state based on the emotional state and the confidence score of the segment.
Publication number: US9355650(B2) Publication date: 2016.05.31
Application number: US201514703107 Filing date: 2015.05.04
Applicant: AT&T INTELLECTUAL PROPERTY I, L.P. Inventors: Dimitriadis Dimitrios; Gilbert Mazin E.; Mishra Taniya; Schroeter Horst J.
Classification: G10L21/00; G10L25/00; G10L15/00; G10L25/48 Main classification: G10L21/00
Agency: Greenblum & Bernstein, P.L.C. Agent: Greenblum & Bernstein, P.L.C.
Principal claim: 1. A device for detecting an emotional state change in an audio signal, the device comprising: a processor; and a memory storing instructions that, when executed by the processor, cause the processor to perform operations including: receiving a plurality of segments of the audio signal, the plurality of segments being sequential; sequentially analyzing each segment of the plurality of segments and determining, for each segment, an emotional state from among a plurality of emotional states and a confidence score of the emotional state; sequentially analyzing the emotional state and the confidence score of each segment and tracking a current emotional state of the audio signal throughout each of the plurality of segments; and determining, for each segment, whether the current emotional state of the audio signal changes to an other emotional state of the plurality of emotional states based on the emotional state and the confidence score of the segment, wherein the processor determines that the current emotional state of the audio signal changes to the other emotional state of the plurality of emotional states when the emotional state of a predetermined number of the plurality of segments is the other emotional state with the confidence score of the emotional state of each of the predetermined number of the plurality of segments being below a predetermined threshold.
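The state-change rule recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `EmotionTracker`, `required_segments`, `threshold`, and `track`, the default values, and the emotion labels are all hypothetical. The sketch also assumes that the "predetermined number" of segments must be consecutive and that the initial current state is supplied by the caller; the claim specifies neither. Per-segment classification is assumed to have happened upstream, so each input is an (emotional state, confidence score) pair.

```python
from dataclasses import dataclass, field
from typing import Iterable, List, Optional, Tuple


@dataclass
class EmotionTracker:
    """Tracks a current emotional state across sequential segments."""
    current: str                      # assumed to be supplied by the caller
    required_segments: int = 3        # hypothetical "predetermined number"
    threshold: float = 0.8            # hypothetical "predetermined threshold"
    _candidate: Optional[str] = field(default=None, init=False, repr=False)
    _run: int = field(default=0, init=False, repr=False)

    def update(self, state: str, confidence: float) -> bool:
        """Process one segment; return True if the current state changed."""
        # Only segments that name an other state with a confidence score
        # below the threshold count toward the claimed rule. Anything else
        # resets the run (an assumption; the claim is silent on this case).
        if state == self.current or confidence >= self.threshold:
            self._candidate, self._run = None, 0
            return False
        if state == self._candidate:
            self._run += 1
        else:
            self._candidate, self._run = state, 1
        if self._run >= self.required_segments:
            self.current = state
            self._candidate, self._run = None, 0
            return True
        return False


def track(segments: Iterable[Tuple[str, float]], initial: str,
          required_segments: int = 3, threshold: float = 0.8) -> List[str]:
    """Return the tracked current state after each segment, in order."""
    tracker = EmotionTracker(initial, required_segments, threshold)
    history = []
    for state, confidence in segments:
        tracker.update(state, confidence)
        history.append(tracker.current)
    return history
```

In this sketch, three consecutive low-confidence "angry" segments following a "neutral" state cause the tracked state to switch on the third of them, while a run interrupted by a different label starts over.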
Address: Atlanta GA US