发明名称 Identifying conceptual gaps in a knowledge base
摘要 A method and system for augmenting a corpus with documents on concepts not sufficiently covered within the corpus is provided. The augmentation system generates a corpus concept graph from the documents of a corpus. A corpus concept graph represents concepts of the documents as nodes and related concepts as links between nodes. To generate a corpus concept graph, the augmentation system identifies the concepts that are related within each document of the corpus and adds nodes and links to the corpus concept graph for related concepts. The augmentation system analyzes the corpus concept graph to determine whether the relatedness of concepts of the documents of the corpus is sufficient. If the relatedness of a pair of concepts is not sufficient, then the augmentation system attempts to identify documents not already in the corpus that are related to the concepts that are not sufficiently related.
申请公布号 US2007094210(A1) 申请公布日期 2007.04.26
申请号 US20050218667 申请日期 2005.09.02
申请人 THE BOARD OF TRUSTEES OF THE UNIVERSITY OF ILLINOIS 发明人 CRAIG ALAN;LEETARU KALEV
分类号 G06N5/02 主分类号 G06N5/02
代理机构 代理人
主权项
地址