Systems and methods for improving the accuracy of a transcription using auxiliary data such as personal data,申请号US201514685364-传众专利搜索

首页产品黄页商标征信

会员服务注册登录

法人/股东/高管

发明名称	Systems and methods for improving the accuracy of a transcription using auxiliary data such as personal data
摘要	A method is described for improving the accuracy of a transcription generated by an automatic speech recognition (ASR) engine. A personal vocabulary is maintained that includes replacement words. The replacement words in the personal vocabulary are obtained from personal data associated with a user. A transcription is received of an audio recording. The transcription is generated by an ASR engine using an ASR vocabulary and includes a transcribed word that represents a spoken word in the audio recording. Data is received that is associated with the transcribed word. A replacement word from the personal vocabulary is identified, which is used to re-score the transcription and replace the transcribed word.
申请公布号	US9626969(B2)	申请公布日期	2017.04.18
申请号	US201514685364	申请日期	2015.04.13
申请人	NUANCE COMMUNICATIONS, INC.	发明人	Zavaliagkos George;Ganong, III William F.;Jost Uwe H.;Madhavapeddi Shreedhar;Clayton Gary B.
分类号	G10L15/00;G10L15/26;G10L15/24;G10L15/065;G10L15/22;G10L15/08;G10L15/30	主分类号	G10L15/00
代理机构	Perkins Coie LLP	代理人	Perkins Coie LLP
主权项	1. A method of generating a personalized transcription from an audio recording, wherein the method is performed by a mobile device in communication with a server, wherein computational resources of the server are greater than computational resources of the mobile device, the method comprising: maintaining a personal vocabulary of words on the mobile device associated with a user of the mobile device, wherein the personal vocabulary is based on personal data associated with the user; receiving, from the server, a first transcription of an audio recording, wherein the first transcription is generated by a server automatic speech recognition (ASR) engine at the server and using an ASR vocabulary associated with a population of users,wherein the first transcription includes a first word list and confidence scores associated with a plurality of words in the first word list, andwherein the first transcription includes both words that the server ASR engine identified as most likely spoken as well as alternatives to those words; receiving, from the server, audio data corresponding to at least the portion of the audio recording; generating a second transcription, wherein the second transcription is of the received audio data,wherein the second transcription comprises a second word list and confidence scores associated with a plurality of words in the second word list, andwherein the second transcription is generated by a mobile device ASR engine located on the mobile device using the maintained personal vocabulary and an acoustic model associated with the user of the mobile device; re-scoring the first transcription, the re-scoring comprising: comparing the first transcription with the second transcription, and modifying a confidence score associated with an alternative word in the first word list when the mobile device ASR engine indicates a higher confidence score for the alternative word than the confidence score attributed by the server ASR engine to the alternative word; and generating a final transcription based on the re-scored first transcription, the final transcription including a combination of most likely spoken words identified by the UASR engine as well as the re-scored alternative words identified by the mobile device ASR engine.
地址	Burlington MA US

您可能感兴趣的专利

Detection and removal of clipping in multicarrier receivers

DECT transceiver module

FLEXIBLE NON-PNEUMATIC TIRE

Method and system for realtime communication in a network with Ethernet physical layer

A low toxicity fischer-tropsch derived fuel and process for making same

Prosthetic devices

ELECTRONIC-MAIL RECEIPT PROCESSING METHOD AND PORTABLE COMMUNICATION APPARATUS FOR PRACTISING THE SAME

Secure communication between mobile terminals using private public key pairs stored on contactless smartcards

BOTTOM LOADING CLEAN ROOM AIR FILTER SUPPORT SYSTEM

Process for producing chlorinated tertiary carbon-containing hydrocarbone

A BUMPER AIRBAG WITH MULTIPLE CHAMBERS

An intelligent serial battery charger and charging block

METHOD FOR WORKFLOW PROCESSING THROUGH COMPUTER NETWORK

REMOTELY MANAGING A DATA PROCESSING SYSTEM VIA A COMMUNICATIONS NETWORK

Communication protocol for nodes connected in a daisy chain

Virtually centralized uplink scheduling