发明名称 System and method for providing words or phrases to be uttered by members of a crowd and processing the utterances in crowd-sourced campaigns to facilitate speech analysis
摘要 Systems and methods of providing text related to utterances, and gathering voice data in response to the text are provide herein. In various implementations, an identification token that identifies a first file for a voice data collection campaign, and a second file for a session script may be received from a natural language processing training device. The first file and the second file may be used to configure the mobile application to display a sequence of screens, each of the sequence of screens containing text of at least one utterance specified in the voice data collection campaign. Voice data may be received from the natural language processing training device in response to user interaction with the text of the at least one utterance. The voice data and the text may be stored in a transcription library.
申请公布号 US9361887(B1) 申请公布日期 2016.06.07
申请号 US201514846925 申请日期 2015.09.07
申请人 VoiceBox Technologies Corporation 发明人 Braga Daniela;Romani Faraz;Elshenawy Ahmad Khamis;Kennewick Michael
分类号 G10L15/26;G10L15/06 主分类号 G10L15/26
代理机构 Sheppard Mullin Richter & Hampton LLP 代理人 Sheppard Mullin Richter & Hampton LLP
主权项 1. A computer-implemented method, the method being implemented in a computer system having one or more physical processors programmed with computer program instructions that, when executed by the one or more physical processors, cause the computer system to perform the method, the method comprising: receiving from a natural language processing training device an identification token containing a first portion and a second portion, the first portion identifying a first file for a voice data collection campaign, and the second portion identifying a second file for a session script, the session script supporting a mobile application on the natural language processing training device; using the first file and the second file to configure the mobile application to display a sequence of screens, each of the sequence of screens containing text of at least one utterance specified in the voice data collection campaign; receiving voice data from the natural language processing training device in response to user interaction with the text of the at least one utterance; and storing the voice data and the text of the at least one utterance in a transcription library.
地址 Bellevue WA US