发明名称 SPEECH SYLLABLE/VOWEL/PHONE BOUNDARY DETECTION USING AUDITORY ATTENTION CUES
摘要 <p>In syllable or vowel or phone boundary detection during speech, an auditory spectrum may be determined for an input window of sound and one or more multi-scale features may be extracted from the auditory spectrum. Each multi-scale feature can be extracted using a separate two-dimensional spectro-temporal receptive filter. One or more feature maps corresponding to the one or more multi-scale features can be generated and an auditory gist vector can be extracted from each of the one or more feature maps. A cumulative gist vector may be obtained through augmentation of each auditory gist vector extracted from the one or more feature maps. One or more syllable or vowel or phone boundaries in the input window of sound can be detected by mapping the cumulative gist vector to one or more syllable or vowel or phone boundary characteristics using a machine learning algorithm.</p>
申请公布号 EP2695160(A1) 申请公布日期 2014.02.12
申请号 EP20110862334 申请日期 2011.11.02
申请人 SONY COMPUTER ENTERTAINMENT INC. 发明人 KALINLI, OZLEM;CHEN, RUXIN
分类号 G10L15/04;G10L15/16;G10L15/24;G10L15/34;G10L25/03 主分类号 G10L15/04
代理机构 代理人
主权项
地址