Title of invention |
Method and apparatus to model and transfer the prosody of tags across languages |
Abstract |
A method of transferring the prosody of tag questions across languages includes extracting prosodic parameters of speech in a first language having a tag question and mapping the prosodic parameters to speech segments in a second language corresponding to the tag question. Accordingly, semantic and pragmatic intent of the tag question in the first language may be correctly conveyed in the second language. |
Publication number |
US9418655(B2) |
Publication date |
2016.08.16 |
Application number |
US201313744391 |
Filing date |
2013.01.17 |
Applicant |
SPEECH MORPHING SYSTEMS, INC. |
Inventors |
Yassa Fathy;Henton Caroline |
Classification codes |
G10L15/18;G10L13/00;G06F17/28;G10L25/90;G10L13/10 |
Main classification code |
G10L15/18 |
Patent agency |
Sughrue Mion, PLLC |
Patent agent |
Sughrue Mion, PLLC |
Principal claim |
1. A method to model and transfer the prosody of tag questions across languages, the method comprising:
receiving speech of a first person speaking in a first language;
analyzing the speech in the first language using automatic speech recognition;
extracting prosodic parameters of the speech in the first language and outputting text in the first language corresponding to the speech in the first language based on the analyzing;
searching the speech in the first language for a tag question in the first language;
translating the text in the first language to text in a second language;
outputting translated speech in the second language that is translated from the speech in the first language based on the translated text in the second language;
analyzing the speech in the first language to find speech segments that correspond to the tag question in the first language;
extracting a fundamental frequency from the speech segments that correspond to the tag question in the first language based on the extracted prosodic parameters of the speech in the first language;
fitting a stylized smooth contour to the fundamental frequency;
mapping the stylized smooth contour into a corresponding part of pitch range of the speech in the second language;
stretching or contracting the stylized smooth contour over time;
aligning the stylized smooth contour with corresponding speech segments in the second language that correspond to the tag question;
and applying the smooth contour to the speech in the second language. |
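The contour-handling steps of the claim (fitting a stylized smooth contour, mapping it into the second speaker's pitch range, and stretching or contracting it over time) can be illustrated with a minimal Python sketch. It is not the patented implementation. The claim does not say how the contour is fitted or mapped, so the moving-average smoothing, the log-scale range mapping, the linear resampling, and all function names below are assumptions chosen for illustration. The input is assumed to be a per-frame fundamental frequency track in Hz for the tag segment, with 0 marking unvoiced frames.

```python
import math


def interpolate_unvoiced(f0):
    """Fill unvoiced frames (value 0) by linear interpolation between voiced neighbours."""
    voiced = [i for i, v in enumerate(f0) if v > 0]
    if not voiced:
        raise ValueError("no voiced frames in tag segment")
    out = list(f0)
    for i, value in enumerate(f0):
        if value > 0:
            continue
        left = max((j for j in voiced if j < i), default=None)
        right = min((j for j in voiced if j > i), default=None)
        if left is None:
            out[i] = f0[right]          # leading gap: copy first voiced value
        elif right is None:
            out[i] = f0[left]           # trailing gap: copy last voiced value
        else:
            w = (i - left) / (right - left)
            out[i] = f0[left] + w * (f0[right] - f0[left])
    return out


def smooth(values, width=3):
    """Centered moving average (assumed stylization); the window shrinks at the edges."""
    half = width // 2
    out = []
    for i in range(len(values)):
        window = values[max(0, i - half): i + half + 1]
        out.append(sum(window) / len(window))
    return out


def map_to_range(contour, src_range, tgt_range):
    """Place each value at the same relative position (log scale) in the target pitch range."""
    s_lo, s_hi = (math.log(x) for x in src_range)
    t_lo, t_hi = (math.log(x) for x in tgt_range)
    return [
        math.exp(t_lo + (math.log(v) - s_lo) / (s_hi - s_lo) * (t_hi - t_lo))
        for v in contour
    ]


def resample(contour, n_frames):
    """Stretch or contract the contour to n_frames by linear interpolation."""
    if n_frames == 1 or len(contour) == 1:
        return [contour[0]] * n_frames
    out = []
    for k in range(n_frames):
        pos = k * (len(contour) - 1) / (n_frames - 1)
        i = int(pos)
        j = min(i + 1, len(contour) - 1)
        out.append(contour[i] + (pos - i) * (contour[j] - contour[i]))
    return out


def transfer_tag_contour(src_f0, src_range, tgt_range, tgt_frames):
    """Stylize the source tag contour, map it to the target range, and fit the target duration."""
    stylized = smooth(interpolate_unvoiced(src_f0))
    mapped = map_to_range(stylized, src_range, tgt_range)
    return resample(mapped, tgt_frames)
```

For example, `transfer_tag_contour([180, 190, 0, 210, 230, 250], (100, 300), (80, 200), 9)` takes a rising tag contour from a speaker with an assumed 100 to 300 Hz range and returns nine frames that rise within an assumed 80 to 200 Hz target range. The resulting values would then be imposed on the second-language tag segment by a pitch-modification stage, which is outside this sketch.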
地址 |
Campbell CA US |