发明名称 Method and Apparatus to Model and Transfer the Prosody of Tags across Languages
摘要 Identify, Capture, Retain and Synthesize Non-Linguistic and Discourse Components of Speech across Languages
申请公布号 US2014200892(A1) 申请公布日期 2014.07.17
申请号 US201313744391 申请日期 2013.01.17
申请人 Yassa Fathy 发明人 Yassa Fathy;Henton Caroline
分类号 G06F17/28;G10L13/00;G10L15/18 主分类号 G06F17/28
代理机构 代理人
主权项 1. A method and Apparatus to Model and Transfer the Prosody of Tags across Languages comprising the steps of a first person in speaking in language one (L1); where the L1 speech is recognized by the ASR; searching the speech for a known tag; searching the pieces of text that have common, cpnsistent, or idiomatic intonation patterns, translating the text to language number two (L2); examine the speech signal of L! to find the segments that correspond to the tag; extract the fundamental frequency from those sigments and fit a smooth contour such as a cubic spline; map the stylized smooth contour into the corresponding part of the pitch range of the intended L2 synthesized speech; stretch or contract stylized smooth contour over time because the duration of the translation will be different; align the contour with the corresponding L2 segments and impose it on the synthesized L2 speech.
地址 Campbell CA US