发明名称 |
System and method for hierarchical segmentation with latent semantic indexing in scale space |
摘要 |
A system and method for automatically generating a hierarchical table of contents or outline for indexing a document and identifying clusters of related information in the document. The document may comprise text, audio, video, or a multimedia presentation. The invention employs a unique and novel combination of latent semantic indexing techniques to identify related blocks and major topic changes within the document with scale space segmentation techniques to respectively identify self-similar blocks within the document and to thus find topic changes of various sizes at block edges. The invention then produces a visual presentation of the semantic structure of the document.
|
申请公布号 |
US2004205461(A1) |
申请公布日期 |
2004.10.14 |
申请号 |
US20010034523 |
申请日期 |
2001.12.28 |
申请人 |
INTERNATIONAL BUSINESS MACHINES CORPORATION |
发明人 |
KAUFMAN JAMES H.;PONCELEON DULCE BEATRIZ;SLANEY MALCOLM |
分类号 |
G06F17/27;(IPC1-7):G09G5/12;G06F17/24 |
主分类号 |
G06F17/27 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|