Title of invention PRIVACY-SENSITIVE SPEECH MODEL CREATION VIA AGGREGATION OF MULTIPLE USER MODELS
Abstract Techniques disclosed herein include systems and methods for privacy-sensitive training data collection for updating acoustic models of speech recognition systems. In one embodiment, the system locally creates adaptation data from raw audio data. Such adaptation data can include derived statistics and/or acoustic model update parameters. The derived statistics and/or updated acoustic model data can then be sent to a speech recognition server or third-party entity. Because the audio data and transcriptions have already been processed, the statistics or acoustic model data are devoid of any human-readable or machine-readable information that would enable reconstruction of the audio data. Thus, the converted data sent to a server does not include personal or confidential information. Third-party servers can then continually update speech models without storing users' personal and confidential utterances.
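The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the "derived statistics" are per-component sufficient statistics (soft counts and weighted feature sums) of a Gaussian mixture with equal weights and a shared spherical variance, and that the server applies a MAP-style mean update. The names `AdaptationStats`, `accumulate_stats`, `merge_stats`, `update_means` and the smoothing constant `tau` are all hypothetical. The sketch shows only that per-frame features and their ordering are not part of what is transmitted; it does not by itself demonstrate that reconstruction of the audio is impossible.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class AdaptationStats:
    """Hypothetical adaptation data: aggregate statistics only, no frames or text."""
    counts: List[float]       # zeroth-order: soft frame count per component
    sums: List[List[float]]   # first-order: posterior-weighted feature sums


def _posteriors(frame: Sequence[float], means: Sequence[Sequence[float]],
                var: float = 1.0) -> List[float]:
    # Component responsibilities, assuming equal mixture weights and a
    # shared spherical variance (a simplification made for this sketch).
    log_liks = [-sum((x - m) ** 2 for x, m in zip(frame, mean)) / (2.0 * var)
                for mean in means]
    peak = max(log_liks)  # subtract the maximum for numerical stability
    weights = [math.exp(ll - peak) for ll in log_liks]
    total = sum(weights)
    return [w / total for w in weights]


def accumulate_stats(frames: Sequence[Sequence[float]],
                     means: Sequence[Sequence[float]]) -> AdaptationStats:
    """Runs on the first-party device: reduce feature frames to statistics."""
    dim = len(means[0])
    counts = [0.0] * len(means)
    sums = [[0.0] * dim for _ in means]
    for frame in frames:
        for k, gamma in enumerate(_posteriors(frame, means)):
            counts[k] += gamma
            for d in range(dim):
                sums[k][d] += gamma * frame[d]
    return AdaptationStats(counts, sums)


def merge_stats(all_stats: Sequence[AdaptationStats]) -> AdaptationStats:
    """Runs on the server: add up statistics received from many users."""
    counts = [0.0] * len(all_stats[0].counts)
    sums = [[0.0] * len(row) for row in all_stats[0].sums]
    for stats in all_stats:
        for k in range(len(counts)):
            counts[k] += stats.counts[k]
            for d in range(len(sums[k])):
                sums[k][d] += stats.sums[k][d]
    return AdaptationStats(counts, sums)


def update_means(stats: AdaptationStats, prior_means: Sequence[Sequence[float]],
                 tau: float = 10.0) -> List[List[float]]:
    """MAP-style update: interpolate each prior mean with the pooled data."""
    return [[(tau * pm + s) / (tau + count) for pm, s in zip(prior, row)]
            for prior, row, count in zip(prior_means, stats.sums, stats.counts)]
```

In this sketch each device would call `accumulate_stats` on its own feature frames and transmit only the resulting `AdaptationStats`; the server would call `merge_stats` on what it receives and then `update_means` to refresh the shared model.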
Publication number US2015287401(A1) Publication date 2015.10.08
Application number US201514745630 Filing date 2015.06.22
Applicant Nuance Communications, Inc. Inventors Lee Antonio R.;Novak Petr;Olsen Peder Andreas;Goel Vaibhava
Classification G10L15/065;G06F21/78;G10L15/04 Main classification G10L15/065
Agency Agent
Main claim 1. A computer-implemented method of speech recognition processing, the computer-implemented method comprising: receiving a spoken utterance; storing audio data from the spoken utterance at a first-party device; creating adaptation data from the audio data via processing at the first-party device, the adaptation data being in a format that prevents reconstruction of the audio data; and transmitting the adaptation data to a third-party server.
Address Burlington MA US