Title of invention Synchronising audio and video.
Abstract <p>A method for eliminating synchronisation errors between audio and video using speech recognition. Using separate audio and visual speech recognition techniques, the method identifies (110) visemes, that is, visual cues indicative of articulatory type, in the video content, and identifies (120) phones and their articulatory types in the audio content. Once both recognition techniques have been applied, their outputs are compared (130) to determine the relative alignment; if the streams are not aligned, a synchronisation algorithm time-adjusts one or both of the audio and visual streams to achieve synchronisation. Facial features, such as mouth movements, provide the visual cues in the video content.</p>
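The compare-and-adjust step described in the abstract can be illustrated with a minimal sketch. It assumes that both recognisers have already produced one articulatory-type label per frame at a common frame rate. The abstract does not specify how the comparison is carried out, so the brute-force lag search below is a swapped-in illustration and not the patented method. The function names `estimate_offset` and `shift_labels`, the label strings, and the `max_lag` and `pad` parameters are all hypothetical.

```python
from typing import List, Sequence


def estimate_offset(viseme_types: Sequence[str],
                    phone_types: Sequence[str],
                    max_lag: int) -> int:
    """Estimate how many frames the audio labels trail the video labels.

    Assumption (not from the source): both inputs hold one articulatory-type
    label per frame. For each candidate lag, count the frames where
    phone_types[i + lag] equals viseme_types[i] and keep the lag with the
    most matches. Ties are resolved in favour of the smallest absolute lag.
    """
    best_lag = 0
    best_matches = -1
    for lag in range(-max_lag, max_lag + 1):
        matches = 0
        for i, viseme in enumerate(viseme_types):
            j = i + lag
            if 0 <= j < len(phone_types) and phone_types[j] == viseme:
                matches += 1
        better = matches > best_matches
        tie_but_closer = matches == best_matches and abs(lag) < abs(best_lag)
        if better or tie_but_closer:
            best_lag, best_matches = lag, matches
    return best_lag


def shift_labels(labels: Sequence[str], lag: int, pad: str = "sil") -> List[str]:
    """Time-adjust a label stream by `lag` frames, padding with `pad`."""
    n = len(labels)
    return [labels[i + lag] if 0 <= i + lag < n else pad for i in range(n)]


if __name__ == "__main__":
    # Hypothetical per-frame articulatory types from the visual recogniser.
    video = ["sil", "bilabial", "open", "open", "dental",
             "sil", "sil", "bilabial", "sil", "sil"]
    # The same content from the audio recogniser, delayed by two frames.
    audio = ["sil", "sil"] + video[:-2]

    lag = estimate_offset(video, audio, max_lag=4)
    realigned = shift_labels(audio, lag)
    print(lag, realigned == video)
```

A positive lag means the audio trails the video, so the audio labels are moved earlier by that many frames. A real system would work on timestamps and tolerate recognition errors rather than requiring exact label matches.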
Publication number GB2366110(A) Publication date 2002.02.27
Application number GB20010014988 Filing date 2001.06.20
Applicant * INTERNATIONAL BUSINESS MACHINES CORPORATION Inventors PAUL S * COHEN;JOHN R * DILDINE;EDWARD J * GLEASON
Classification G10L15/24;H04N21/2368;H04N21/43;H04N21/434;(IPC1-7):H04N7/52 Main classification G10L15/24
Agency Agent
Principal claim
Address