发明名称 Method and apparatus for presenting images representative of an utterance with corresponding decoded speech
摘要 Apparatus for presenting images representative of one or more words in an utterance with corresponding decoded speech includes, in one aspect, a visual detector for capturing images of body movements (e.g., lip and/or mouth movements) corresponding to the one or more words in the utterance coupled to a visual feature extractor. The visual feature extractor receives time information from an automatic speech recognition (ASR) system and operatively processes the captured images from the visual detector to generate one or more image segments based on the time information relating to one or more decoded words in the utterance, each image segment corresponding to a decoded word in the utterance. An image player coupled to the visual feature extractor presents an image segment with a corresponding decoded word. The image segment may be presented as an animation of successive images in time, whereby a user is provided multiple sources of information for comprehending the utterance and can more easily ascertain the relationship between the body movements and the corresponding decoded speech.
申请公布号 US2002161582(A1) 申请公布日期 2002.10.31
申请号 US20010844120 申请日期 2001.04.27
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 BASSON SARA H.;KANEVSKY DIMITRI;SORENSEN JEFFREY SCOTT
分类号 G10L21/06;(IPC1-7):G10L13/00 主分类号 G10L21/06
代理机构 代理人
主权项
地址