Title of Invention |
METHOD AND SYSTEM FOR IDENTIFYING SIGNIFICANT TOPICS OF A DOCUMENT |
Abstract |
A "domain-general" method for representing the "sense" of a document includes the steps of extracting a list of simplex noun phrases (16) representing candidate significant topics in the document, clustering the simplex noun phrases by head (18), and ranking the simplex noun phrases (20) according to a significance measure to indicate the relative importance of the simplex noun phrases as significant topics of the document. Furthermore, the output can be filtered in a variety of ways, both for automatic processing and for presentation to users.
|
Application Publication Number |
WO0010100(A1) |
Application Publication Date |
2000.02.24 |
Application Number |
WO1999US18449 |
Filing Date |
1999.08.13 |
Applicant |
THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK;WACHOLDER, FAYE, PENINA |
Inventor |
WACHOLDER, FAYE, PENINA |
Classification |
G06F17/27;G06F17/30;(IPC1-7):G06F17/27;G06F17/28 |
Main Classification |
G06F17/27 |
Agency |
|
Agent |
|
Main Claim |
|
Address |
|
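
The three steps named in the abstract (extract simplex noun phrases, cluster them by head, rank them) can be illustrated with a minimal Python sketch. It is not the method claimed in the application; it rests on several assumptions that do not come from the record: the input is already part-of-speech tagged with Penn Treebank style tags, a simplex noun phrase is approximated as a maximal run of adjectives and nouns ending in a noun, the head is taken to be the last word of the phrase, and the significance measure is simply the number of phrases sharing a head. The function names `extract_simplex_nps`, `cluster_by_head`, and `rank_heads` are hypothetical.

```python
from collections import defaultdict

# Assumed tag set (Penn Treebank style); not specified in the record.
NOUN_TAGS = {"NN", "NNS", "NNP", "NNPS"}
MODIFIER_TAGS = {"JJ"} | NOUN_TAGS


def extract_simplex_nps(tagged_tokens):
    """Collect maximal adjective/noun runs that end in a noun."""
    phrases, current = [], []
    # A sentinel token flushes the final run at the end of the input.
    for word, tag in list(tagged_tokens) + [("", "END")]:
        if tag in MODIFIER_TAGS:
            current.append((word, tag))
            continue
        # Drop trailing adjectives so the phrase ends in its head noun.
        while current and current[-1][1] not in NOUN_TAGS:
            current.pop()
        if current:
            phrases.append(tuple(w.lower() for w, _ in current))
        current = []
    return phrases


def cluster_by_head(phrases):
    """Group phrases by head, assumed here to be the last word."""
    clusters = defaultdict(list)
    for phrase in phrases:
        clusters[phrase[-1]].append(phrase)
    return dict(clusters)


def rank_heads(clusters):
    """Order heads by an assumed significance measure: cluster size."""
    return sorted(clusters, key=lambda head: (-len(clusters[head]), head))


if __name__ == "__main__":
    tagged = [
        ("The", "DT"), ("filing", "NN"), ("system", "NN"), ("stores", "VBZ"),
        ("old", "JJ"), ("files", "NNS"), ("and", "CC"), ("the", "DT"),
        ("operating", "NN"), ("system", "NN"), ("indexes", "VBZ"), ("them", "PRP"),
    ]
    clusters = cluster_by_head(extract_simplex_nps(tagged))
    for head in rank_heads(clusters):
        print(head, clusters[head])
```

In this toy input the head "system" collects two phrases ("filing system" and "operating system") and so ranks above "files". A fuller treatment would add lemmatization and a filtering step on the ranked output, which the abstract mentions but does not detail.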