发明名称 LEARNING SPEECH MODELS FOR MOBILE DEVICE USERS
摘要 <p>Techniques are provided to recognize a speaker's voice. In one embodiment, received audio data may be separated into a plurality of signals. For each signal, the signal may be associated with value/s for one or more features (e.g., Mel-Frequency Cepstral coefficients). The received data may be clustered (e.g., by clustering features associated with the signals). A predominate voice cluster may be identified and associated with a user. A speech model (e.g., a Gaussian Mixture Model or Hidden Markov Model) may be trained based on data associated with the predominate cluster. A received audio signal may then be processed using the speech model to, e.g.,: determine who was speaking; determine whether the user was speaking; determining whether anyone was speaking; and/or determine what words were said. A context of the device or the user may then be inferred based at least partly on the processed signal.</p>
申请公布号 WO2013006489(A1) 申请公布日期 2013.01.10
申请号 WO2012US45101 申请日期 2012.06.29
申请人 QUALCOMM INCORPORATED;GROKOP, LEONARD, HENRY;NARAYANAN, VIDYA 发明人 GROKOP, LEONARD, HENRY;NARAYANAN, VIDYA
分类号 G10L15/06 主分类号 G10L15/06
代理机构 代理人
主权项
地址