发明名称 Adjusting language models using context information
摘要 Methods, systems, and apparatuses, including computer programs encoded on a computer storage medium, for adjusting language models. In one aspect, a method includes accessing audio data. Information that indicates a first context is accessed, the first context being associated with the audio data. At least one term is accessed. Information that indicates a second context is accessed, the second context being associated with the term. A similarity score is determined that indicates a degree of similarity between the second context and the first context. A language model is adjusted based on the accessed term and the determined similarity score to generate an adjusted language model. Speech recognition is performed on the audio data using the adjusted language model to select one or more candidate transcriptions for a portion of the audio data.
申请公布号 US9076445(B1) 申请公布日期 2015.07.07
申请号 US201213705228 申请日期 2012.12.05
申请人 Google Inc. 发明人 Lloyd Matthew I.
分类号 G10L15/18 主分类号 G10L15/18
代理机构 Fish & Richardson P.C. 代理人 Fish & Richardson P.C.
主权项 1. A method comprising: obtaining audio data; accessing first context information associated with the audio data, wherein the first context information indicates (i) a first geographical location, and (ii) a first time; accessing second context information associated with one or more previously typed or previously transcribed terms, wherein the second context information indicates (i) a second geographical location and (ii) a second time; determining a similarity score for the first context information and the second context information based on (i) a degree of a similarity of the second geographical location to the first geographical location and (ii) a degree of a similarity of the second time to the first time; adjusting a language model based on the similarity score to adjust a likelihood that the language model indicates the one or more previously typed or previously transcribed terms as a candidate transcription of the audio data; determining a transcription of the audio data using the adjusted language model; and outputting the transcription that was determined using the adjusted language model.
地址 Mountain View CA US