发明名称 Method of recognizing gender or age of a speaker according to speech emotion or arousal
摘要 A method of recognizing gender or age of a speaker according to speech emotion or arousal includes the following steps of A) segmentalizing speech signals into a plurality of speech segments; B) fetching the first speech segment from the plural speech segments to further acquire at least one of emotional features or arousal degree in the speech segment; C) determining whether at least one of the emotional feature and the arousal degree conforms to some condition; if yes, proceed to the step D); if no, return to the step B) and then fetch the next speech segment; D) fetching the feature indicative of gender or age from the speech segment to further acquire at least one feature parameter; and E) recognizing the at least one feature parameter to further determine the gender or age of the speaker at the currently-processed speech segment.
申请公布号 US9123342(B2) 申请公布日期 2015.09.01
申请号 US201213560596 申请日期 2012.07.27
申请人 NATIONAL CHUNG CHENG UNIVERSITY 发明人 Chen Oscal Tzyh-Chiang;Lu Ping-Tsung;Ke Jia-You
分类号 G10L21/00;G10L25/00;G10L17/26;G10L25/63 主分类号 G10L21/00
代理机构 Muncy, Geissler, Olds & Lowe, P.C. 代理人 Muncy, Geissler, Olds & Lowe, P.C.
主权项 1. A method of recognizing gender or age of a speaker according to speech emotion or arousal, comprising steps of: A) segmentalizing speech signals into a plurality of speech segments; B) fetching the first speech segment from the speech segments to further acquire an arousal degree of the speech segment; B-1) after the first speech segment is fetched from the speech segments, applying a first classification to the arousal degree of the speech segment to enable the arousal to be classified as a high degree or a low degree of arousal; C) if a determination condition is set at a greater-than-threshold condition, proceeding the step D) when the arousal degree of the speech segment is determined greater than the specific threshold, or returning to the step B) when the arousal degree of the speech segment is determined less than or equal to the specific threshold; and if the determination condition is set at a less-than-threshold condition, proceeding to step D) when the arousal degree of the speech segment is determined less than the specific threshold, or returning to the step B) when the arousal degree of the speech segment is determined greater than or equal to the specific threshold; D) fetching a feature indicative of gender or age from the speech segment to further acquire at least one feature parameter corresponding to gender or age; and E) applying recognition to the at least one feature parameter according to a gender or age recognition measure to further determine the gender or age of the speaker in the currently-processed speech segment; next, apply the step B) to the next speech segment, wherein the steps A)-E) are executed by a computer.
地址 Chia-Yi TW
您可能感兴趣的专利