Title of invention |
Apparatus for and method of summarising text |
Abstract |
Apparatus for identifying topics of document data has: a word ranker (171) for ranking words that are present in or representative of the content of the document data; a co-occurrence ranker (172) for ranking co-occurrences of words that are present in or representative of the content of the document data; a phrase ranker (170) for ranking phrases in the document data; a word selector (174) for selecting the highest ranking words; a co-occurrence identifier (176) for identifying which of the highest ranking co-occurrences contain at least one of the highest ranking words; a phrase identifier (177) for identifying the phrases containing at least one word from the identified co-occurrences; a phrase selector (178) for selecting the highest ranking one or ones of the identified phrases as the topic or topics of the document data; and an output device (40) for outputting data relating to the selected topics.
|
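The selection pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not say how words, co-occurrences, or phrases are scored, so plain frequency counts are assumed here, co-occurrence is assumed to mean two distinct words appearing in the same sentence, and candidate phrases are assumed to be supplied by the caller. The names `identify_topics`, `tokenize`, `rank`, `STOPWORDS`, and the parameters `top_words`, `top_pairs`, and `top_topics` are all hypothetical.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical stop-word list; the abstract does not specify one.
STOPWORDS = {"a", "an", "the", "to", "of", "and", "in", "is"}


def tokenize(sentence):
    """Lowercase a sentence and return its non-stop-word tokens."""
    return [w for w in re.findall(r"[a-z]+", sentence.lower()) if w not in STOPWORDS]


def rank(counter):
    """Return keys ordered by descending count, ties broken by key order."""
    return [k for k, _ in sorted(counter.items(), key=lambda kv: (-kv[1], kv[0]))]


def identify_topics(sentences, phrases, top_words=2, top_pairs=2, top_topics=1):
    token_lists = [tokenize(s) for s in sentences]

    # Word ranker (171): assumed to rank by raw frequency.
    word_counts = Counter(w for toks in token_lists for w in toks)

    # Co-occurrence ranker (172): assumed to count word pairs sharing a sentence.
    pair_counts = Counter()
    for toks in token_lists:
        for pair in combinations(sorted(set(toks)), 2):
            pair_counts[pair] += 1

    # Phrase ranker (170): assumed to count occurrences of each candidate phrase.
    text = " ".join(s.lower() for s in sentences)
    phrase_counts = Counter({p: text.count(p.lower()) for p in phrases})

    # Word selector (174): keep the highest ranking words.
    best_words = set(rank(word_counts)[:top_words])

    # Co-occurrence identifier (176): of the highest ranking pairs,
    # keep those containing at least one of the highest ranking words.
    kept_pairs = [p for p in rank(pair_counts)[:top_pairs] if best_words & set(p)]
    pair_words = {w for p in kept_pairs for w in p}

    # Phrase identifier (177): phrases sharing a word with a kept pair.
    candidates = [p for p in phrases if pair_words & set(p.lower().split())]

    # Phrase selector (178): the highest ranking identified phrases are the topics.
    ordered = [p for p in rank(phrase_counts) if p in candidates]
    return ordered[:top_topics]


if __name__ == "__main__":
    sentences = [
        "Speech recognition converts speech to text.",
        "Speech recognition needs a language model.",
        "The language model scores text.",
        "Speech recognition is hard.",
    ]
    phrases = ["speech recognition", "language model", "scores text"]
    print(identify_topics(sentences, phrases))  # ['speech recognition']
```

In this toy input the pair (language, model) is among the two highest ranking co-occurrences but contains neither of the two highest ranking words, so the phrase "language model" is filtered out before phrase selection. Raising `top_words` to 3 lets it through as a second topic.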
Publication number |
US2004225667(A1) |
Publication date |
2004.11.11 |
Application number |
US20040797107 |
Filing date |
2004.03.11 |
Applicant |
CANON KABUSHIKI KAISHA |
Inventors |
HU JIAWEI;IMLAH WILLIAM GEORGE |
Classification (IPC) |
G06F17/00;G06F17/28;(IPC1-7):G06F17/00 |
Main classification |
G06F17/00 |
Patent agency |
|
Patent agent |
|
Principal claim |
|
Address |
|