Title of invention |
Method and system for the automatic segmentation of an audio stream into semantic or syntactic units |
Abstract |
A digitized speech signal (600) is input to an F0 (fundamental frequency) processor that computes (610) continuous F0 data from the speech signal. Using voicing state transitions (voiced/unvoiced) as the criterion, the speech signal is presegmented (620) into segments. For each segment (630), it is evaluated (640) whether F0 is defined or undefined, i.e. whether F0 is ON or OFF. Where F0 = OFF, a candidate segment boundary is assumed and, starting from that boundary, prosodic features are computed (650). The feature values are input into a classification tree, and each candidate segment is classified, thereby revealing the existence or non-existence of a semantic or syntactic speech unit.
|
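The flow in the abstract (F0 contour, presegmentation at voicing transitions, prosodic features at each F0 = OFF candidate, tree classification) can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the F0 contour is assumed to be a list of per-frame values with `None` for unvoiced frames, the 10 ms frame length, the three features (pause duration, mean F0 before and after), and the hand-written `toy_tree` with its 0.2 s threshold are all assumptions chosen for the example. The abstract does not specify which prosodic features or which tree are used.

```python
from typing import Dict, List, Optional, Tuple

F0Contour = List[Optional[float]]  # None marks an unvoiced frame (F0 = OFF)


def presegment(f0: F0Contour) -> List[Tuple[int, int, bool]]:
    """Split the contour at voicing transitions.

    Returns (start, end_exclusive, voiced) triples covering every frame.
    """
    segments = []
    start = 0
    for i in range(1, len(f0) + 1):
        # Close the current run at the end of input or when voicing flips.
        if i == len(f0) or (f0[i] is None) != (f0[start] is None):
            segments.append((start, i, f0[start] is not None))
            start = i
    return segments


def mean_f0(f0: F0Contour, seg: Optional[Tuple[int, int, bool]]) -> Optional[float]:
    """Mean F0 over a voiced segment, or None if there is no such segment."""
    if seg is None:
        return None
    values = [v for v in f0[seg[0]:seg[1]] if v is not None]
    return sum(values) / len(values) if values else None


def candidate_features(f0: F0Contour, frame_s: float = 0.01) -> List[Dict]:
    """Compute assumed prosodic features for every F0 = OFF segment."""
    segments = presegment(f0)
    feats = []
    for k, (start, end, voiced) in enumerate(segments):
        if voiced:
            continue  # only unvoiced segments yield candidate boundaries
        before = segments[k - 1] if k > 0 else None
        after = segments[k + 1] if k + 1 < len(segments) else None
        feats.append({
            "boundary_frame": start,
            "pause_s": (end - start) * frame_s,
            "f0_before": mean_f0(f0, before),
            "f0_after": mean_f0(f0, after),
        })
    return feats


def toy_tree(feat: Dict, min_pause_s: float = 0.2) -> bool:
    """Hypothetical two-level decision tree standing in for a trained one."""
    if feat["pause_s"] < min_pause_s:
        return False  # too short to be a unit boundary
    if feat["f0_before"] is None or feat["f0_after"] is None:
        return True  # long pause at the start or end of the signal
    return feat["f0_after"] > feat["f0_before"]  # pitch reset after the pause


def find_boundaries(f0: F0Contour) -> List[int]:
    """Frame indices of candidates that the toy tree accepts as boundaries."""
    return [f["boundary_frame"] for f in candidate_features(f0) if toy_tree(f)]
```

In practice the tree would be trained on labelled speech rather than written by hand, and the F0 contour would come from a pitch tracker; both are outside the scope of this sketch.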
Publication number |
US7120575(B2) |
Publication date |
2006.10.10 |
Application number |
US20010920983 |
Filing date |
2001.08.02 |
Applicant |
INTERNATIONAL BUSINESS MACHINES CORPORATION |
Inventors |
HAASE MARTIN;KRIECHBAUM WERNER;STENZEL GERHARD |
Classification |
G10L11/04;G10L11/02;G10L15/18 |
Main classification |
G10L11/04 |
Agency |
|
Agent |
|
Main claim |
|
Address |
|