发明名称 |
Method and Apparatus to Model and Transfer the Prosody of Tags across Languages |
摘要 |
Identify, Capture, Retain and Synthesize Non-Linguistic and Discourse Components of Speech across Languages |
申请公布号 |
US2014200892(A1) |
申请公布日期 |
2014.07.17 |
申请号 |
US201313744391 |
申请日期 |
2013.01.17 |
申请人 |
Yassa Fathy |
发明人 |
Yassa Fathy;Henton Caroline |
分类号 |
G06F17/28;G10L13/00;G10L15/18 |
主分类号 |
G06F17/28 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method and Apparatus to Model and Transfer the Prosody of Tags across Languages comprising the steps of a first person in speaking in language one (L1); where the L1 speech is recognized by the ASR; searching the speech for a known tag; searching the pieces of text that have common, cpnsistent, or idiomatic intonation patterns, translating the text to language number two (L2); examine the speech signal of L! to find the segments that correspond to the tag;
extract the fundamental frequency from those sigments and fit a smooth contour such as a cubic spline; map the stylized smooth contour into the corresponding part of the pitch range of the intended L2 synthesized speech; stretch or contract stylized smooth contour over time because the duration of the translation will be different; align the contour with the corresponding L2 segments and impose it on the synthesized L2 speech. |
地址 |
Campbell CA US |