发明名称 Method and apparatus to model and transfer the prosody of tags across languages
摘要 A method of transferring the prosody of tag questions across languages includes extracting prosodic parameters of speech in a first language having a tag question and mapping the prosodic parameters to speech segments in a second language corresponding to the tag question. Accordingly, semantic and pragmatic intent of the tag question in the first language may be correctly conveyed in the second language.
申请公布号 US9418655(B2) 申请公布日期 2016.08.16
申请号 US201313744391 申请日期 2013.01.17
申请人 SPEECH MORPHING SYSTEMS, INC. 发明人 Yassa Fathy;Henton Caroline
分类号 G10L15/18;G10L13/00;G06F17/28;G10L25/90;G10L13/10 主分类号 G10L15/18
代理机构 Sughrue Mion, PLLC 代理人 Sughrue Mion, PLLC
主权项 1. A method to model and transfer the prosody of tag questions across languages, the method comprising: receiving speech of a first person speaking in a first language; analyzing the speech in the first language using automatic speech recognition; extracting prosodic parameters of the speech in the first language and outputting text in the first language corresponding to the speech in the first language based on the analyzing; searching the speech in the first language for a tag question in the first language; translating the text in the first language to text in a second language; outputting translated speech in the second language that is translated from the speech in the first language based on the translated text in the second language; analyzing the speech in the first language to find speech segments that correspond to the tag question in the first language; extracting a fundamental frequency from the speech segments that correspond to the tag question in the first language based on the extracted prosodic parameters of the speech in the first language; fitting a stylized smooth contour to the fundamental frequency; mapping the stylized smooth contour into a corresponding part of pitch range of the speech in the second language; stretching or contracting the stylized smooth contour over time; aligning the stylized smooth contour with corresponding speech segments in the second language that correspond to the tag question; and applying the smooth contour to the speech in the second language.
地址 Campbell CA US