Title of Invention |
Generating an Academic Topic Graph from Digital Documents |
Abstract |
Documents of a content management system are classified into a hierarchical taxonomy comprising a hierarchy of nodes, such that each document is associated with a node in the hierarchical taxonomy. For each of a plurality of topics extracted from the documents, a topic extraction system determines an affinity of the topic to respective nodes of the hierarchical taxonomy. Based on the determined affinities, a topic graph is generated for display to a user. The topic graph identifies one or more nodes of the hierarchical taxonomy and a plurality of topics associated with each of the one or more nodes, and each topic is linked to a corresponding node in the topic graph. Responsive to receiving a selection of a topic in the topic graph, identifiers of documents from which the selected topic was extracted are displayed. |
Application Publication Number |
US2016034757(A1) |
Application Publication Date |
2016.02.04 |
Application Number |
US201414448983 |
Application Date |
2014.07.31 |
Applicant |
Chegg, Inc. |
Inventors |
Chhichhia Charmy;Le Chevalier Vincent |
Classification Number |
G06K9/00 |
Main Classification Number |
G06K9/00 |
Agency |
|
Agent |
|
Principal Claim |
1. A method for generating a topic graph from digital documents in a content management system, the method comprising:
accessing a plurality of documents classified into a hierarchical taxonomy, the hierarchical taxonomy comprising a hierarchy of nodes, each document associated with a plurality of nodes of the hierarchical taxonomy;
extracting a plurality of topics from the documents;
determining for each extracted topic, an affinity of the topic to respective nodes of the hierarchical taxonomy that are associated with documents from which the topic was extracted;
generating based on the determined affinities, a topic graph for display to a user, the topic graph identifying one or more nodes of the hierarchical taxonomy and a plurality of topics associated with each of the one or more nodes, each topic linked to a corresponding node in the topic graph;
displaying the topic graph to the user; and
responsive to receiving a selection from the user of a topic in the topic graph, displaying identifiers of documents from which the selected topic was extracted. |
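The claimed steps can be illustrated with a minimal Python sketch. This record does not define how affinity is computed, so the sketch assumes one simple measure: the fraction of a topic's source documents that are classified under a given node. The sample data, the function names (`topic_affinities`, `build_topic_graph`, `documents_for_topic`), and the rule of linking each topic to its highest-affinity node are all hypothetical and are not taken from the patent.

```python
from collections import Counter, defaultdict

# Hypothetical input: each document carries an identifier, the taxonomy nodes
# it is classified under, and the topics already extracted from it.
DOCUMENTS = [
    {"id": "doc1", "nodes": ["Science/Biology"], "topics": ["cell", "mitosis"]},
    {"id": "doc2", "nodes": ["Science/Biology"], "topics": ["cell"]},
    {"id": "doc3", "nodes": ["Science/Chemistry"], "topics": ["cell", "ion"]},
]


def topic_affinities(documents):
    """Assumed affinity: share of a topic's documents classified under a node."""
    per_topic_node = defaultdict(Counter)
    docs_per_topic = Counter()
    for doc in documents:
        for topic in set(doc["topics"]):
            docs_per_topic[topic] += 1
            for node in set(doc["nodes"]):
                per_topic_node[topic][node] += 1
    return {
        topic: {node: n / docs_per_topic[topic] for node, n in nodes.items()}
        for topic, nodes in per_topic_node.items()
    }


def build_topic_graph(affinities):
    """Link each topic to its highest-affinity node (an assumed linking rule)."""
    graph = defaultdict(list)
    for topic, by_node in affinities.items():
        # Sorting first makes ties resolve to the alphabetically first node.
        best_node = max(sorted(by_node), key=by_node.get)
        graph[best_node].append(topic)
    return {node: sorted(topics) for node, topics in graph.items()}


def documents_for_topic(documents, topic):
    """Return identifiers of the documents from which the topic was extracted."""
    return [doc["id"] for doc in documents if topic in doc["topics"]]


affinities = topic_affinities(DOCUMENTS)
graph = build_topic_graph(affinities)
selected = documents_for_topic(DOCUMENTS, "cell")
```

In this sketch, selecting the topic "cell" returns the identifiers of all three sample documents, mirroring the final step of the claim. A real system would replace the assumed affinity measure and linking rule with whatever the full specification describes.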
Address |
Santa Clara CA US |