发明名称 PREDICTING PRONUNCIATION IN SPEECH RECOGNITION
摘要 An automatic speech recognition (ASR) device may be configured to predict pronunciations of textual identifiers (for example, song names, etc.) based on predicting one or more languages of origin of the textual identifier. The one or more languages of origin may be determined based on the textual identifier. The pronunciations may include a hybrid pronunciation including a pronunciation in one language, a pronunciation in a second language and a hybrid pronunciation that combines multiple languages. The pronunciations may be added to a lexicon and matched to the content item (e.g., song) and/or textual identifier. The ASR device may receive a spoken utterance from a user requesting the ASR device to access the content item. The ASR device determines whether the spoken utterance matches one of the pronunciations of the content item in the lexicon. The ASR device then accesses the content when the spoken utterance matches one of the potential textual identifier pronunciations.
申请公布号 WO2015134309(A1) 申请公布日期 2015.09.11
申请号 WO2015US17927 申请日期 2015.02.27
申请人 AMAZON TECHNOLOGIES, INC.;ADAMS, JEFFREY PENROD;PARLIKAR, ALOK ULHAS;LILLY, JEFFREY PAUL;RASTROW, ARIYA 发明人 ADAMS, JEFFREY PENROD;PARLIKAR, ALOK ULHAS;LILLY, JEFFREY PAUL;RASTROW, ARIYA
分类号 G10L13/02 主分类号 G10L13/02
代理机构 代理人
主权项
地址