Title of Invention |
LEARNING SPEECH MODELS FOR MOBILE DEVICE USERS |
Abstract |
<p>Techniques are provided for recognizing a speaker's voice. In one embodiment, received audio data may be separated into a plurality of signals. Each signal may be associated with values for one or more features (e.g., Mel-frequency cepstral coefficients). The received data may be clustered (e.g., by clustering the features associated with the signals). A predominate voice cluster may be identified and associated with a user. A speech model (e.g., a Gaussian Mixture Model or Hidden Markov Model) may be trained based on data associated with the predominate cluster. A received audio signal may then be processed using the speech model to, e.g., determine who was speaking, determine whether the user was speaking, determine whether anyone was speaking, and/or determine what words were said. A context of the device or the user may then be inferred based at least partly on the processed signal.</p> |
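The steps outlined in the abstract (features per signal, clustering, picking the predominate cluster, training a speech model, then scoring new audio) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the application: the function names are hypothetical, synthetic 2-D vectors stand in for real Mel-frequency cepstral coefficients, k-means is one possible clustering choice, and a single diagonal Gaussian (a one-component simplification of a Gaussian Mixture Model) stands in for the speech model.

```python
import math
import random
from collections import Counter


def kmeans(points, k, iters=20, seed=0):
    """Plain k-means (an assumed clustering choice); returns one label per point."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest center by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        # Update step: move each non-empty cluster's center to its mean.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def largest_cluster(points, labels):
    """Treat the most populated cluster as the predominate (user's) voice."""
    top_label, _ = Counter(labels).most_common(1)[0]
    return [p for p, lab in zip(points, labels) if lab == top_label]


def fit_diag_gaussian(points, var_floor=1e-6):
    """Fit a per-dimension mean and variance (one-component, diagonal model)."""
    n = len(points)
    mean = [sum(col) / n for col in zip(*points)]
    var = [max(sum((x - m) ** 2 for x in col) / n, var_floor)
           for col, m in zip(zip(*points), mean)]
    return mean, var


def log_likelihood(x, model):
    """Log-density of feature vector x under the diagonal Gaussian model."""
    mean, var = model
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))


def is_user(x, model, threshold=-10.0):
    """Decide whether x sounds like the user; the threshold is an arbitrary assumption."""
    return log_likelihood(x, model) > threshold


# Synthetic stand-ins for feature vectors: the device owner speaks most often.
rng = random.Random(1)
user = [[rng.gauss(0.0, 0.5), rng.gauss(0.0, 0.5)] for _ in range(60)]
other = [[rng.gauss(5.0, 0.5), rng.gauss(5.0, 0.5)] for _ in range(25)]
frames = user + other

labels = kmeans(frames, k=2)
user_frames = largest_cluster(frames, labels)
model = fit_diag_gaussian(user_frames)
```

With the model fitted, `is_user(frame, model)` scores a new feature vector against the predominate cluster. A real system would more likely use a multi-component mixture and a threshold tuned on held-out audio.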
Publication Number |
WO2013006489(A1) |
Publication Date |
2013.01.10 |
Application Number |
WO2012US45101 |
Filing Date |
2012.06.29 |
Applicant |
QUALCOMM INCORPORATED;GROKOP, LEONARD, HENRY;NARAYANAN, VIDYA |
Inventor |
GROKOP, LEONARD, HENRY;NARAYANAN, VIDYA |
Classification |
G10L15/06 |
Main Classification |
G10L15/06 |
Agency |
|
Agent |
|
Principal Claim |
|
Address |
|