发明名称 Method and apparatus for automatic document summarization
摘要 Regions of a document such as sentences and blocks of sentences are scored and classified based upon their scores. An abstract of the document can be formed from the classified sentences. Sentences are classified by the use of words classified as stop words and vanish words. Sentences are scored based on the number of stop words and the number of strings of connected stop words, called stop-word runs, contained in the sentence. Passionate sentences, which usually contain information which the writer has strong feelings about, such as joy, admiration, or sadness, are identified. This method can also select sentences that are contrapassionate, which the writer may either have to strengthen or have inserted to complete the record and provide continuity or information.
申请公布号 US5638543(A) 申请公布日期 1997.06.10
申请号 US19930071114 申请日期 1993.06.03
申请人 XEROX CORPORATION 发明人 PEDERSEN, JAN O.;TUKEY, JOHN W.
分类号 G06F17/21;G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/21
代理机构 代理人
主权项
地址