Title of Invention DOCUMENT ANALYSIS SYSTEM
Abstract An information processing apparatus (5) is provided comprising: a lexicon generation module (22) operable to process a set of documents (1) to identify key words (2) present in the documents; a link generation module (24) operable to generate network data (3) linking documents which share the same or semantically related key words identified by the lexicon generation module; and a network analysis module (26) operable to associate documents with metric values based upon the patterns of connectivity of the network data generated by the link generation module. The metric values associated with documents in the set can be utilized to select documents or groups of associated documents for further processing or indexing.
Publication No. US2011191345(A1) Publication Date 2011.08.04
Application No. US201113015832 Filing Date 2011.01.28
Applicant E-THERAPEUTICS PLC Inventor YOUNG MALCOLM P.
Classification G06F7/00;G06F17/30 Main Classification G06F7/00
Agency Agent
Principal Claim
Address
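
The three modules named in the abstract (lexicon generation, link generation, network analysis) can be illustrated with a minimal sketch. This is not the patented implementation: the stop-word list, the length-based key-word rule, the function names, and the sample documents are all hypothetical, and degree (the number of linked documents) is an assumed stand-in for the unspecified connectivity metric. Only exact key-word matches are linked here; the "semantically related" matching mentioned in the abstract is omitted.

```python
import re
from itertools import combinations

# Hypothetical stop-word list; the record does not say how key words are chosen.
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "with"}


def generate_lexicon(documents):
    """Lexicon step (assumed rule): key words are non-stop-word tokens
    longer than three letters. Returns {doc_id: set of key words}."""
    lexicon = {}
    for doc_id, text in documents.items():
        tokens = re.findall(r"[a-z]+", text.lower())
        lexicon[doc_id] = {t for t in tokens if t not in STOP_WORDS and len(t) > 3}
    return lexicon


def generate_links(lexicon):
    """Link step: connect every pair of documents sharing at least one
    identical key word. Returns {(doc_a, doc_b): shared key words}."""
    links = {}
    for a, b in combinations(sorted(lexicon), 2):
        shared = lexicon[a] & lexicon[b]
        if shared:
            links[(a, b)] = shared
    return links


def degree_metric(lexicon, links):
    """Network analysis step (assumed metric): count how many other
    documents each document is linked to."""
    degree = {doc_id: 0 for doc_id in lexicon}
    for a, b in links:
        degree[a] += 1
        degree[b] += 1
    return degree


# Hypothetical sample documents, for illustration only.
documents = {
    "d1": "Kinase inhibitors reduce tumour growth",
    "d2": "Tumour growth depends on kinase signalling",
    "d3": "Network analysis of protein interactions",
    "d4": "Protein kinase networks in yeast",
}

lexicon = generate_lexicon(documents)
links = generate_links(lexicon)
metrics = degree_metric(lexicon, links)

# Rank documents by metric value, e.g. to select candidates for further processing.
ranked = sorted(metrics, key=metrics.get, reverse=True)
print(ranked[0], metrics)
```

In this toy data set, "network" and "networks" are treated as different key words because no stemming or semantic matching is applied; a fuller version would need some notion of relatedness between terms to reflect the abstract.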