发明名称 ADAPTATION OF SPEECH RECOGNITION
摘要 A method, computer program product, and system for adapting speech recognition of a user's speech is provided. The method includes receiving a first utterance from a user having a duration below a predetermined threshold, identifying at least one further utterance from the user that provides additional information, generating a concatenated utterance by concatenating the first utterance with the at least one further utterance, transmitting the concatenated utterance to a speech recognition server, receiving a transcription of the concatenated utterance from the speech recognition server that includes a transcription of the first utterance, and extracting the transcription of the first utterance from the transcription of the concatenated utterance. The transcription of the first utterance is based on the additional information provided by the at least one further utterance.
申请公布号 US2017053643(A1) 申请公布日期 2017.02.23
申请号 US201514829785 申请日期 2015.08.19
申请人 International Business Machines Corporation 发明人 BEN-DAVID SHAY
分类号 G10L15/06;G10L15/34;G10L15/10;G10L15/187;G10L15/30;G10L15/19 主分类号 G10L15/06
代理机构 代理人
主权项 1. A method for adapting a speech recognition system, the method comprising: receiving a first utterance from a user, wherein an amount of time associated with the first utterance from the user is below a predetermined duration threshold; identifying, based on the amount of time associated with the first utterance being below the predetermined duration threshold, at least one further utterance from the user, wherein the at least one further utterance provides additional information; generating a concatenated utterance by concatenating the first utterance with the at least one further utterance; transmitting the concatenated utterance to a speech recognition server; receiving a transcription of the concatenated utterance from the speech recognition server, wherein the transcription of the concatenated utterance includes a transcription of the first utterance, and wherein the transcription of the first utterance is based on the additional information provided by the at least one further utterance; and extracting the transcription of the first utterance from the transcription of the concatenated utterance.
地址 Armonk NY US