发明名称 Privacy-sensitive speech model creation via aggregation of multiple user models
摘要 Techniques disclosed herein include systems and methods for privacy-sensitive training data collection for updating acoustic models of speech recognition systems. In one embodiment, the system locally creates adaptation data from raw audio data. Such adaptation can include derived statistics and/or acoustic model update parameters. The derived statistics and/or updated acoustic model data can then be sent to a speech recognition server or third-party entity. Since the audio data and transcriptions are already processed, the statistics or acoustic model data is devoid of any information that could be human-readable or machine readable such as to enable reconstruction of audio data. Thus, such converted data sent to a server does not include personal or confidential information. Third-party servers can then continually update speech models without storing personal and confidential utterances of users.
申请公布号 US9424836(B2) 申请公布日期 2016.08.23
申请号 US201514745630 申请日期 2015.06.22
申请人 Nuance Communications, Inc. 发明人 Lee Antonio R.;Novak Petr;Olsen Peder Andreas;Goel Vaibhava
分类号 G10L15/06;G10L15/00;G10L15/065;G10L15/04;G06F21/78;G06F21/62;H04L29/06 主分类号 G10L15/06
代理机构 Wolf, Greenfield & Sacks, P.C. 代理人 Wolf, Greenfield & Sacks, P.C.
主权项 1. A computer-implemented method comprising acts of: receiving, via at least one network, adaptation data generated at least in part by performing statistical processing on audio data comprising at least one user utterance; and using the adaptation data to update at least one acoustic model for use in speech recognition processing, wherein the adaptation data is in a format that prevents reconstruction of the audio data.
地址 Burlington MA US
您可能感兴趣的专利