发明名称 |
Multimedia data management by speech recognizer annotation |
摘要 |
A method and an apparatus for multimedia data management are disclosed. The method provides an indexing and retrieval scheme for digital photos with speech annotations based on image-like patterns transformed from the recognized syllable candidates. For annotated spoken content, the recognized n-best syllable candidates are transformed into a sequence of syllable-transformed patterns. Eigen-image analysis is further adopted to extract the significant information to reduce the dimensionality. Vector quantization is applied to quantize the syllable-transformed patterns into feature vectors for indexing. The invention of indexing scheme reduces the dimensionality and noise of data, and achieves better performance of 16.26% for speech annotated photo retrieval.
|
申请公布号 |
US7739110(B2) |
申请公布日期 |
2010.06.15 |
申请号 |
US20060565628 |
申请日期 |
2006.12.01 |
申请人 |
INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE |
发明人 |
WU CHUNG-HSIEN;LAI YU-SHENG;HUANG CHIEN-LIN;KANG CHIA-HAO |
分类号 |
G10L15/00;G06F17/30;G10L15/08 |
主分类号 |
G10L15/00 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|