Abstract |
A time-series document summarization device (201) outputs a summary sentence for a document-of-interest collection, which is the target document collection. The time-series document summarization device (201) comprises: a background topic word extraction part (20) configured to acquire a set consisting of the document-of-interest collection and a document-of-interest topic word that is a feature word of the document-of-interest collection, together with a reference-use document collection that is a document collection different from the document-of-interest collection, and to extract, from the reference-use document collection, a background topic word representing a topic that forms the background of the topic described in the document-of-interest collection; and a representative character string extraction part (30) configured to extract, from among the character strings included in the document-of-interest collection, a representative character string that includes both the document-of-interest topic word and the background topic word, as the summary sentence of the document-of-interest collection. |
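
The two parts described above can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not say how background topic words are scored, so simple co-occurrence counting with the topic word is assumed here, and the function names, the stop-word list, and the sample documents are all hypothetical.

```python
import re
from collections import Counter

# Assumption: a tiny stop-word list, for illustration only.
STOP_WORDS = {"the", "a", "an", "after", "since", "had", "been", "was", "of"}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def extract_background_topic_words(topic_word, reference_docs, top_k=1):
    """Rough stand-in for part (20): pick words that co-occur most often
    with the topic word in the reference-use document collection.
    Co-occurrence counting is an assumed heuristic, not the source's method."""
    counts = Counter()
    for doc in reference_docs:
        tokens = set(tokenize(doc))
        if topic_word in tokens:
            counts.update(tokens - STOP_WORDS - {topic_word})
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ranked[:top_k]]


def extract_representative_strings(topic_word, background_words, docs_of_interest):
    """Rough stand-in for part (30): keep sentences of the document-of-interest
    collection that contain the topic word and at least one background word."""
    candidates = []
    for doc in docs_of_interest:
        for sentence in re.split(r"(?<=[.!?])\s+", doc):
            tokens = set(tokenize(sentence))
            hits = sum(1 for word in background_words if word in tokens)
            if topic_word in tokens and hits:
                candidates.append((hits, sentence.strip()))
    # Prefer sentences covering more background words, then shorter ones.
    candidates.sort(key=lambda c: (-c[0], len(c[1])))
    return [sentence for _, sentence in candidates]


# Hypothetical sample data.
reference_docs = [
    "The earthquake damaged the nuclear plant.",
    "Engineers inspected the plant after the earthquake.",
    "Stock markets closed higher.",
]
docs_of_interest = [
    "Officials approved the plant restart. "
    "The plant had been idle since the earthquake.",
]

background = extract_background_topic_words("plant", reference_docs)
summary = extract_representative_strings("plant", background, docs_of_interest)
print(background)
print(summary)
```

In this sketch, the reference-use collection supplies context ("earthquake") that the document-of-interest topic word ("plant") alone does not carry, and only the sentence containing both words is kept as a summary candidate.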