Title of invention METHOD AND SYSTEM FOR IDENTIFYING SIGNIFICANT TOPICS OF A DOCUMENT
Abstract A "domain-general" method for representing the "sense" of a document includes the steps of extracting a list of simplex noun phrases (16) representing candidate significant topics in the document, clustering the simplex noun phrases by head (18), and ranking the simplex noun phrases (20) according to a significance measure to indicate the relative importance of the simplex noun phrases as significant topics of the document. Furthermore, the output can be filtered in a variety of ways, both for automatic processing and for presentation to users.
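As a rough illustration of the clustering (18) and ranking (20) steps named in the abstract, the following Python sketch groups already-extracted simplex noun phrases by head and orders the groups. It is not the patented method. The function names `cluster_by_head` and `rank_clusters` are hypothetical, the head is assumed to be the last word of each phrase, and cluster size stands in for the significance measure, which the abstract does not specify. Phrase extraction (16) is assumed to have happened upstream.

```python
from collections import defaultdict


def cluster_by_head(noun_phrases):
    """Group phrases by head word.

    Assumption: the head of a simplex noun phrase is its last word.
    """
    clusters = defaultdict(list)
    for phrase in noun_phrases:
        words = phrase.lower().split()
        if not words:
            continue  # skip empty strings
        clusters[words[-1]].append(phrase)
    return dict(clusters)


def rank_clusters(clusters):
    """Order clusters by size, largest first; ties are broken alphabetically.

    Assumption: cluster size is only a stand-in for the significance measure.
    """
    return sorted(clusters.items(), key=lambda item: (-len(item[1]), item[0]))


# Hypothetical input: phrases assumed to come from an upstream extraction step.
phrases = [
    "noun phrase",
    "simplex noun phrase",
    "significant topic",
    "candidate topic",
    "long phrase",
    "document",
]
ranked = rank_clusters(cluster_by_head(phrases))
for head, members in ranked:
    print(head, members)
```

Filtering the output, as the abstract mentions, could be as simple as dropping clusters below a chosen size before presenting `ranked` to a user; the threshold would be a design choice, not something the abstract defines.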
Publication number WO0010100(A1) Publication date 2000.02.24
Application number WO1999US18449 Filing date 1999.08.13
Applicant THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK;WACHOLDER, FAYE, PENINA Inventor WACHOLDER, FAYE, PENINA
Classification G06F17/27;G06F17/30;(IPC1-7):G06F17/27;G06F17/28 Main classification G06F17/27
Agency Agent
Principal claim
Address