Title: Crowd Sourcing Audio Transcription Via Re-Speaking
Abstract: Speech audio intended for transcription into textual form is received and divided into first speech segments. A plurality of speakers is identified, each speaker being configured to repeat in spoken form a first speech segment that the speaker has listened to. For each first speech segment, a subset of speakers is determined, and the segment is sent to that subset. Second speech segments are received from the speakers, each second speech segment being a re-spoken version of a first speech segment, generated by a speaker repeating that first speech segment in spoken form. The second speech segments are processed to generate partial transcripts, which are combined to generate a complete transcript that is a textual representation of the received speech audio.
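The dividing and sending steps described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the patent: the fixed-length segmentation, the round-robin assignment, and the names `split_into_segments` and `assign_speakers` are all hypothetical.

```python
def split_into_segments(num_samples, segment_len):
    """Return (start, end) sample ranges that cover the whole audio.

    Assumption: fixed-length segments. The record does not say how
    the segment boundaries are chosen.
    """
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    return [
        (start, min(start + segment_len, num_samples))
        for start in range(0, num_samples, segment_len)
    ]


def assign_speakers(segments, speakers, per_segment):
    """Map each segment index to a subset of speakers.

    Assumption: a simple round-robin rotation. The record only states
    that a subset of speakers is determined for each segment.
    """
    if not 1 <= per_segment <= len(speakers):
        raise ValueError("per_segment must be between 1 and len(speakers)")
    count = len(speakers)
    assignment = {}
    for index in range(len(segments)):
        first = (index * per_segment) % count
        assignment[index] = [
            speakers[(first + offset) % count] for offset in range(per_segment)
        ]
    return assignment


segments = split_into_segments(num_samples=10, segment_len=4)
assignment = assign_speakers(segments, ["a", "b", "c"], per_segment=2)
```

Sending each segment to more than one speaker yields several re-spoken versions of the same segment, which a later processing step could compare.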
Publication number: US2015199966(A1) Publication date: 2015.07.16
Application number: US201414156032 Filing date: 2014.01.15
Applicant: Cisco Technology, Inc. Inventors: Paulik Matthias; Halder Vivek; Sankar Ananth
Classification: G10L15/26 Main classification: G10L15/26
Patent agency: Agent:
Main claim: 1. A method comprising: receiving a speech audio intended for transcription to textual form; dividing the received speech audio into first speech segments; identifying speakers for sending each first speech segment; sending each first speech segment to the speakers determined for the particular first speech segment; receiving second speech segments from the speakers; and processing the second speech segments to generate a complete transcript that is a textual representation corresponding to the received speech audio.
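The final processing step of the claim can be sketched as follows, assuming the re-spoken segments have already been converted to text by some speech recognizer (not shown). The majority vote among speakers and the names `pick_partial_transcript` and `combine_transcripts` are hypothetical choices for illustration. The claim does not specify how the second speech segments are processed.

```python
from collections import Counter


def pick_partial_transcript(candidates):
    """Choose one text for a segment from several re-spoken versions.

    Assumption: the most frequent candidate wins, and ties go to the
    candidate that appears first in the list.
    """
    if not candidates:
        raise ValueError("at least one candidate is required")
    counts = Counter(candidates)
    # Counter keeps insertion order and max returns the first maximum.
    return max(counts, key=counts.get)


def combine_transcripts(recognized):
    """Join the per-segment texts, in segment order, into one transcript.

    `recognized` maps a segment index to the list of texts recognized
    from the re-spoken versions of that segment.
    """
    parts = [pick_partial_transcript(recognized[i]) for i in sorted(recognized)]
    return " ".join(part.strip() for part in parts if part.strip())


transcript = combine_transcripts({
    1: ["world", "word", "world"],
    0: ["hello", "hello"],
})
```

Ordering by segment index means the complete transcript follows the original audio regardless of the order in which speakers return their re-spoken segments.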
Address: San Jose, CA, US