发明名称 Multimedia data management by speech recognizer annotation
摘要 A method and an apparatus for multimedia data management are disclosed. The method provides an indexing and retrieval scheme for digital photos with speech annotations based on image-like patterns transformed from the recognized syllable candidates. For annotated spoken content, the recognized n-best syllable candidates are transformed into a sequence of syllable-transformed patterns. Eigen-image analysis is further adopted to extract the significant information to reduce the dimensionality. Vector quantization is applied to quantize the syllable-transformed patterns into feature vectors for indexing. The invention of indexing scheme reduces the dimensionality and noise of data, and achieves better performance of 16.26% for speech annotated photo retrieval.
申请公布号 US7739110(B2) 申请公布日期 2010.06.15
申请号 US20060565628 申请日期 2006.12.01
申请人 INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE 发明人 WU CHUNG-HSIEN;LAI YU-SHENG;HUANG CHIEN-LIN;KANG CHIA-HAO
分类号 G10L15/00;G06F17/30;G10L15/08 主分类号 G10L15/00
代理机构 代理人
主权项
地址