Title of invention: Use of metadata to post process speech recognition output
Abstract: A method of using metadata stored in a computer-readable medium to assist in converting an audio stream to a text stream. The method compares personally identifiable data, such as a user's electronic address book and/or Caller/Recipient ID information (when converting voice mail to text), with the n-best results that a speech recognition engine generates for each word it outputs. A goal of this comparison is to correct possible misrecognitions, replacing a spoken proper noun such as a personal or company name with its proper textual form, or a spoken phone number with a correctly formatted phone number in Arabic numerals, and thereby improve the overall accuracy of the voice recognition system's output.
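The comparison described in the abstract can be illustrated with a minimal sketch. The record does not say how candidates are scored against the metadata, so the string-similarity measure (Python's `difflib.SequenceMatcher`), the `threshold` value, and the names `similarity` and `correct_with_metadata` are all assumptions made for illustration, not the patented method.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Case-insensitive similarity ratio in [0, 1] (assumed scoring measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def correct_with_metadata(n_best_per_word, known_names, threshold=0.8):
    """Pick one output word per slot, preferring metadata matches.

    n_best_per_word: list of candidate lists, one per spoken word,
                     ordered from most to least likely by the engine.
    known_names:     proper nouns taken from metadata such as an
                     address book or caller ID (hypothetical input).
    threshold:       minimum similarity for a metadata entry to
                     override the engine's top hypothesis (assumed).
    """
    output = []
    for candidates in n_best_per_word:
        chosen = candidates[0]  # default: the engine's top hypothesis
        best_score = 0.0
        for cand in candidates:
            for name in known_names:
                score = similarity(cand, name)
                # Override only on a strong match that beats earlier ones.
                if score >= threshold and score > best_score:
                    best_score = score
                    chosen = name  # emit the metadata's textual form
        output.append(chosen)
    return output


n_best = [["call"], ["eager", "igor", "ego"], ["tomorrow"]]
print(correct_with_metadata(n_best, ["Igor"]))
```

Here the engine's top hypothesis for the second word is "eager", but a lower-ranked candidate matches an address-book entry, so the entry's own spelling is emitted instead. Words with no strong metadata match keep the engine's first choice.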
Publication number: US8676577(B2)  Publication date: 2014.03.18
Application number: US20090415874  Filing date: 2009.03.31
Applicant: JABLOKOV IGOR RODITIS;STROHOFER, III CLIFFORD J.;WHITE MARC;JABLOKOV VICTOR RODITIS;CANYON IP HOLDINGS, LLC  Inventor: JABLOKOV IGOR RODITIS;STROHOFER, III CLIFFORD J.;WHITE MARC;JABLOKOV VICTOR RODITIS
Classification: G10L15/00  Main classification: G10L15/00
Agency:  Agent:
Main claim:
Address: