Title of Invention |
DOCUMENT SUMMARIZATION USING NOUN AND SENTENCE RANKING |
Abstract |
Systems and methods are provided for summarization of electronic text documents. Nouns and sentences are identified in a text document, and the most-prevalent nouns are then identified based on frequency. Each sentence in the document is scored by assigning points for the cumulative presence or absence of each of the most-prevalent nouns. A tag cloud consisting of the most-prevalent nouns is displayed together with the highest-scoring sentences, thereby providing context for the nouns in the tag cloud. |
Application Publication Number |
US2014229159(A1) |
Application Publication Date |
2014.08.14 |
Application Number |
US201313763864 |
Filing Date |
2013.02.11 |
Applicant |
APPSENSE LIMITED |
Inventor |
BRANTON Paul Keith |
Classification Number |
G06F17/21 |
Main Classification Number |
G06F17/21 |
Agency |
|
Agent |
|
Principal Claim |
1. A computerized method for providing a summary of a text document, comprising:
identifying at least some nouns and sentences in the text document;
for at least some of the identified nouns, counting the number of times the identified nouns appear in the text document;
identifying a predetermined number of most-prevalent nouns based on the number of times the identified nouns appear in the text document;
scoring each of the identified sentences as a function of the number of times the predetermined number of most-prevalent nouns occurs therein;
displaying at least some of the predetermined number of most-prevalent nouns, wherein the size of each displayed noun is a function of the number of times the displayed noun appears in the text document; and
displaying a predetermined number of the scored sentences that have the highest scores, the displayed scored sentences being displayed in proximity to the displayed nouns. |
Address |
Warrington GB |
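
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the applicant's implementation; it only shows one way the recited steps might fit together. Several pieces are assumptions made for illustration: the function names (`split_sentences`, `tokenize`, `summarize`), the caller-supplied `is_noun` predicate (standing in for real part-of-speech tagging, which the record does not specify), the simple occurrence-count score, and the 10 to 24 point font-size mapping for the tag cloud.

```python
import re
from collections import Counter


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase the sentence and return its alphabetic words."""
    return re.findall(r"[a-z]+", sentence.lower())


def summarize(text, is_noun, top_nouns=3, top_sentences=2):
    """Return (tag_cloud, summary_sentences) for the given text.

    is_noun is a hypothetical predicate supplied by the caller; a real
    system would likely use a part-of-speech tagger instead.
    """
    sentences = split_sentences(text)

    # Count how often each identified noun appears in the document.
    counts = Counter(w for s in sentences for w in tokenize(s) if is_noun(w))

    # Keep a predetermined number of most-prevalent nouns.
    prevalent = dict(counts.most_common(top_nouns))

    # Score each sentence by how many times the prevalent nouns occur in it
    # (one possible scoring function among many).
    scores = [sum(1 for w in tokenize(s) if w in prevalent) for s in sentences]

    # Pick the highest-scoring sentences; ties go to the earlier sentence.
    best = sorted(range(len(sentences)), key=lambda i: (-scores[i], i))[:top_sentences]

    # Assumed font-size mapping: scale each noun between 10 and 24 points
    # in proportion to its count.
    max_count = max(prevalent.values(), default=1)
    cloud = {noun: 10 + round(14 * n / max_count) for noun, n in prevalent.items()}

    # Return the chosen sentences in their original document order.
    return cloud, [sentences[i] for i in sorted(best)]


if __name__ == "__main__":
    # Tiny hand-made noun lexicon, used only for this demonstration.
    lexicon = {"cat", "mat", "dog", "birds"}
    doc = "The cat sat on the mat. The cat chased the dog. The dog barked. Birds sing."
    cloud, summary = summarize(doc, lexicon.__contains__, top_nouns=2)
    print(cloud)
    print(summary)
```

The sketch returns data rather than rendering anything; a display layer would draw each noun at its computed size and place the selected sentences near the tag cloud, as the claim describes.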