发明名称 Generating an Academic Topic Graph from Digital Documents
摘要 Documents of a content management system are classified into a hierarchical taxonomy comprising a hierarchy of nodes, such that each document is associated with a node in the hierarchical taxonomy. For each of a plurality of topics extracted from the documents, a topic extraction system determines an affinity of the topic to respective nodes of the hierarchical taxonomy. Based on the determined affinities, a topic graph is generated for display to a user. The topic graph identifies one or more nodes of the hierarchical taxonomy and a plurality of topics associated with each of the one or more nodes, and each topic is linked to a corresponding node in the topic graph. Responsive to receiving a selection of a topic in the topic graph, identifiers of documents from which the selected topic was extracted are displayed.
申请公布号 US2016034757(A1) 申请公布日期 2016.02.04
申请号 US201414448983 申请日期 2014.07.31
申请人 Chegg, Inc. 发明人 Chhichhia Charmy;Le Chevalier Vincent
分类号 G06K9/00 主分类号 G06K9/00
代理机构 代理人
主权项 1. A method for generating a topic graph from digital documents in a content management system, the method comprising: accessing a plurality of documents classified into a hierarchical taxonomy, the hierarchical taxonomy comprising a hierarchy of nodes, each document associated with a plurality of nodes of the hierarchical taxonomy; extracting a plurality of topics from the documents; determining for each extracted topic, an affinity of the topic to respective nodes of the hierarchical taxonomy that are associated with documents from which the topic was extracted; generating based on the determined affinities, a topic graph for display to a user, the topic graph identifying one or more nodes of the hierarchical taxonomy and a plurality of topics associated with each of the one or more nodes, each topic linked to a corresponding node in the topic graph; displaying the topic graph to the user; and responsive to receiving a selection from the user of a topic in the topic graph, displaying identifiers of documents from which the selected topic was extracted.
地址 Santa Clara CA US