摘要 |
<p>In the present invention, documents relevant to a specific topic are suitably extracted from documents such as a plurality of tweets. A relevant document extraction device (10) is provided with the following: a default topic tag storage unit (141) that stores a default topic tag indicating a topic; a document storage unit (100) that stores a plurality of documents; a morpheme analysis unit (110) that divides documents into morphemes; a topic tag estimation unit (130) that extracts a document that includes the default topic tag from a plurality of documents, and calculates the frequency of appearance of terms in the extracted document; and a topic ID assigning unit (150) that extract a document relevant to the topic from information based on the calculated frequency of appearance.</p> |