发明名称 DYNAMICALLY BIASING LANGUAGE MODELS
摘要 Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech recognition. In one aspect, a method comprises receiving audio data encoding one or more utterances; performing a first speech recognition on the audio data; identifying a context based on the first speech recognition; performing a second speech recognition on the audio data that is biased towards the context; and providing an output of the second speech recognition.
申请公布号 US2016104482(A1) 申请公布日期 2016.04.14
申请号 US201414525826 申请日期 2014.10.28
申请人 Google Inc. 发明人 Aleksic Petar;Mengibar Pedro J. Moreno
分类号 G10L15/22;G10L15/18;G10L19/00;G10L15/26 主分类号 G10L15/22
代理机构 代理人
主权项 1. A method performed by one or more computers, the method comprising: receiving audio data encoding one or more utterances; generating a recognition lattice of the one or more utterances by performing speech recognition on the audio data using a first pass speech recognizer; determining a specific context for the one or more utterances based on the recognition lattice; in response to determining that the recognition lattice defines the specific context, generating a transcription of the one or more utterances by performing speech recognition on the audio data using a second pass speech recognizer biased towards the specific context defined by the recognition lattice; and providing an output of the transcription of the one or more utterances.
地址 Mountain View CA US