发明名称 |
DYNAMICALLY BIASING LANGUAGE MODELS |
摘要 |
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech recognition. In one aspect, a method comprises receiving audio data encoding one or more utterances; performing a first speech recognition on the audio data; identifying a context based on the first speech recognition; performing a second speech recognition on the audio data that is biased towards the context; and providing an output of the second speech recognition. |
申请公布号 |
US2016104482(A1) |
申请公布日期 |
2016.04.14 |
申请号 |
US201414525826 |
申请日期 |
2014.10.28 |
申请人 |
Google Inc. |
发明人 |
Aleksic Petar;Mengibar Pedro J. Moreno |
分类号 |
G10L15/22;G10L15/18;G10L19/00;G10L15/26 |
主分类号 |
G10L15/22 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method performed by one or more computers, the method comprising:
receiving audio data encoding one or more utterances; generating a recognition lattice of the one or more utterances by performing speech recognition on the audio data using a first pass speech recognizer; determining a specific context for the one or more utterances based on the recognition lattice; in response to determining that the recognition lattice defines the specific context, generating a transcription of the one or more utterances by performing speech recognition on the audio data using a second pass speech recognizer biased towards the specific context defined by the recognition lattice; and providing an output of the transcription of the one or more utterances. |
地址 |
Mountain View CA US |