发明名称 CUT AND PASTE DOCUMENT SUMMARIZATION SYSTEM AND METHOD
摘要 A summary of an input document is generated by extracting at least one sentence from the document and parsing the extracted sentences into components, such as in a parse tree (110). Sentence reduction processing is performed to mark components which can be removed from the parse trees (135) . Sentence reduction can include context importance processing, probabilistic processing, and linguistic knowledge based processing, probabilistic processing includes identifying sentence combination operations and establishing rules for applying the sentence combination operations to mark the parse trees to merge at least two sentences (140). Sentence combination processing also provides a paste operation to operate on the marked componen ts to effect the indicated removal and combination of sentence components, thereby generating summary sentences for the input document.
申请公布号 CA2363834(A1) 申请公布日期 2001.01.25
申请号 CA20002363834 申请日期 2000.02.22
申请人 THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK 发明人 MCKEOWN, KATHLEEN, R.;JING, HONGYAN
分类号 G06F17/27;(IPC1-7):G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项
地址