Title of invention DOCUMENT SUMMARIZING DEVICE
Abstract PURPOSE: To extract important parts of good quality from a document by simple processing, without assuming a specific document format or context. CONSTITUTION: A document analysis part 5 decomposes the document input from a document input part 1 into paragraphs, sentences, and words. A context vector generation part 6 generates context vectors for the sentences, the paragraphs, and the document by using a word dictionary 8. A context vector comparison part 7 calculates the distances between context vectors by comparing the document with the sentences of each paragraph, the document with the respective paragraphs, each paragraph with its own sentences, and the document with the respective sentences. By referring to these distances, a document processing part 4 generates two kinds of summaries, consisting of the paragraph closest to the document and the plural sentences closest to the document, and two further kinds of summaries, consisting of the sentences, selected paragraph by paragraph, that are closest to the document and the sentences that are closest to their respective paragraphs. Thus, analyzing the input document with context vectors allows important parts of good quality to be extracted by simple processing, without assuming a specific document format or context.
Publication No. JPH06215049(A) Publication date 1994.08.05
Application No. JP19930007427 Filing date 1993.01.20
Applicant SHARP CORP Inventors INUI TAKAO;KARASHI IKUO;ISHIKURA KENICHIROU
Classification G06F17/21;G06F17/27;G06F17/30 Main classification G06F17/21
Agency Agent
Main claim
Address
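
The pipeline described in the abstract (decompose, build context vectors, compare distances, pick the closest units) can be illustrated with a minimal Python sketch. The abstract does not say how a context vector is computed or which distance is used, so the sketch assumes that a unit's vector is the sum of its words' dictionary vectors and that distance is cosine distance. The dictionary contents and all function names are hypothetical, chosen only for illustration.

```python
import math
import re

# Hypothetical word dictionary: each known word maps to a 2-dimensional vector.
# The real dictionary format is not described in the abstract.
WORD_DICTIONARY = {
    "vector": (1.0, 0.0), "document": (1.0, 0.0), "summary": (1.0, 0.0),
    "rain": (0.0, 1.0), "weather": (0.0, 1.0),
}
DIM = len(next(iter(WORD_DICTIONARY.values())))


def analyze(document):
    """Split a document into paragraphs (blank-line separated), each a list of sentences."""
    paragraphs = []
    for block in re.split(r"\n\s*\n", document.strip()):
        sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", block) if s.strip()]
        if sentences:
            paragraphs.append(sentences)
    return paragraphs


def context_vector(text):
    """Assumption: a unit's context vector is the sum of its words' dictionary vectors."""
    vec = [0.0] * DIM
    for word in re.findall(r"[a-z]+", text.lower()):
        for i, x in enumerate(WORD_DICTIONARY.get(word, ())):
            vec[i] += x
    return vec


def distance(a, b):
    """Assumption: cosine distance; a zero vector is treated as maximally distant."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0
    return 1.0 - sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def summarize(document, top_n=2):
    """Build the four kinds of summaries named in the abstract."""
    paragraphs = analyze(document)
    doc_vec = context_vector(document)
    para_texts = [" ".join(p) for p in paragraphs]
    all_sentences = [s for p in paragraphs for s in p]

    def to_doc(text):
        return distance(doc_vec, context_vector(text))

    return {
        "closest_paragraph": min(para_texts, key=to_doc),
        "closest_sentences": sorted(all_sentences, key=to_doc)[:top_n],
        "per_paragraph_closest_to_document": [min(p, key=to_doc) for p in paragraphs],
        "per_paragraph_closest_to_paragraph": [
            min(p, key=lambda s, pv=context_vector(t): distance(pv, context_vector(s)))
            for p, t in zip(paragraphs, para_texts)
        ],
    }


DOCUMENT = (
    "The device builds a vector for each document. "
    "A summary of the document ignores the weather.\n\n"
    "The weather was bad. Rain delayed the summary."
)
summaries = summarize(DOCUMENT)
for kind, value in summaries.items():
    print(kind, "->", value)
```

In this sketch the four dictionary entries of the result correspond to the four summaries in the abstract. A real device would presumably use a much larger dictionary and may weight or normalize vectors differently.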