摘要 |
A system and method of recording utterances for building Named Entity Recognition (“NER”) models, which are used to build dialog systems in which a computer listens and responds to human voice dialog. Utterances to be uttered may be provided to users through their mobile devices, which may record the user uttering (e.g., verbalizing, speaking, etc.) the utterances and upload the recording to a computer for processing. The use of the user's mobile device, which is programmed with an utterance collection application (e.g., configured as a mobile app), facilitates the use of crowd-sourcing human intelligence tasking for widespread collection of utterances from a population of users. As such, obtaining large datasets for building NER models may be facilitated by the system and method disclosed herein. |
主权项 |
1. A computer implemented method of recording utterances from unmanaged crowds for natural language processing, the method being implemented in an end user device having one or more physical processors programmed with computer program instructions that, when executed by the one or more physical processors, cause the end user device to perform the method, the method comprising:
receiving, by the end user device, a token from a user; obtaining, by the end user device, one or more campaign configuration parameters based on the token; configuring, by the end user device, the computer program instructions based on the one or more campaign configuration parameters; obtaining, by the end user device, one or more utterances to be uttered by the user based on the token; displaying, by the end user device, the one or more utterances to be uttered by the user; generating, by the end user device, an audio recording of the one or more utterances; and causing, by the end user device, the audio recording to be provided to a remote device via a network. |