发明名称 |
METHODS AND SYSTEMS FOR CATEGORIZING AND INDEXING HUMAN-READABLE DATA |
摘要 |
Systems and methods (20, 200) for processing content packages such as human-readable documents identify and analyze content type. Structural (300) and logical (500) evaluation of a content package is performed, followed by analysis and indexing of concepts within the package. Analysis and identification of concepts and sub-concepts may be an iterative process. Concepts are indexed (800) in accordance with different rule sets representing different consumer needs and perspectives. Customers can then use the indices to navigate large groups of content packages based on the concepts contained within those packages and also on keywords associated with concepts. |
申请公布号 |
WO2004015905(A2) |
申请公布日期 |
2004.02.19 |
申请号 |
WO2003US24097 |
申请日期 |
2003.08.01 |
申请人 |
REUTERS RESEARCH INC. |
发明人 |
MAHONEY, JOHN;BOROVIKOV, DMITRY;CURTIS, KEVIN;KOLFMAN, MICHAEL |
分类号 |
G06F9/45;G06F17/30;H04L |
主分类号 |
G06F9/45 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|