发明名称 SYSTEM AND METHOD OF RECORDING UTTERANCES USING UNMANAGED CROWDS FOR NATURAL LANGUAGE PROCESSING
摘要 A system and method of recording utterances for building Named Entity Recognition (“NER”) models, which are used to build dialog systems in which a computer listens and responds to human voice dialog. Utterances to be uttered may be provided to users through their mobile devices, which may record the user uttering (e.g., verbalizing, speaking, etc.) the utterances and upload the recording to a computer for processing. The use of the user's mobile device, which is programmed with an utterance collection application (e.g., configured as a mobile app), facilitates the use of crowd-sourcing human intelligence tasking for widespread collection of utterances from a population of users. As such, obtaining large datasets for building NER models may be facilitated by the system and method disclosed herein.
申请公布号 US2017068656(A1) 申请公布日期 2017.03.09
申请号 US201615215114 申请日期 2016.07.20
申请人 VOICEBOX TECHNOLOGIES CORPORATION 发明人 BRAGA Daniela;ROTHWELL Spencer John;ROMANI Faraz;ELSHENAWY Ahmad Khamis;CARTER Stephen Steele;KENNEWICK Michael
分类号 G06F17/27;G10L17/24 主分类号 G06F17/27
代理机构 代理人
主权项 1. A computer implemented method of recording utterances from unmanaged crowds for natural language processing, the method being implemented in an end user device having one or more physical processors programmed with computer program instructions that, when executed by the one or more physical processors, cause the end user device to perform the method, the method comprising: receiving, by the end user device, a token from a user; obtaining, by the end user device, one or more campaign configuration parameters based on the token; configuring, by the end user device, the computer program instructions based on the one or more campaign configuration parameters; obtaining, by the end user device, one or more utterances to be uttered by the user based on the token; displaying, by the end user device, the one or more utterances to be uttered by the user; generating, by the end user device, an audio recording of the one or more utterances; and causing, by the end user device, the audio recording to be provided to a remote device via a network.
地址 Bellevue WA US