Abstract |
A topic extraction device and method are disclosed. The topic extraction device extracts initial topics from a document by using latent Dirichlet allocation (LDA). It then compares the similarity of the words included in the extracted initial topics to correct topics that were extracted repeatedly or that mix several topics, and outputs the corrected result as the final topics of the document. |
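
The correction of repeatedly extracted topics can be illustrated with a minimal sketch. The abstract does not specify the similarity measure, the threshold, or the merging procedure, so everything below is an assumption made for illustration: the initial topics are taken to be lists of top words (as an LDA run might produce), similarity is measured as Jaccard overlap between word sets, and the function name `merge_duplicate_topics` and the cutoff `threshold=0.5` are hypothetical. Correction of mixed topics is not covered here.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two collections of words (assumed measure)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def merge_duplicate_topics(topics, threshold=0.5):
    """Merge initial topics whose word sets overlap heavily.

    `topics` is a list of word lists, e.g. the top words of each LDA topic.
    `threshold` is a hypothetical cutoff, not a value taken from the source.
    """
    # Union-find over topic indices, so chains of near-duplicates end up together.
    parent = list(range(len(topics)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(topics)), 2):
        if jaccard(topics[i], topics[j]) >= threshold:
            ri, rj = find(i), find(j)
            if ri != rj:
                # Keep the smaller index as the representative of the group.
                parent[max(ri, rj)] = min(ri, rj)

    # Collect the words of each group, preserving first-seen order.
    merged = {}
    for idx, words in enumerate(topics):
        bucket = merged.setdefault(find(idx), [])
        for word in words:
            if word not in bucket:
                bucket.append(word)
    return list(merged.values())


# Made-up initial topics; the first two are near-duplicates.
initial_topics = [
    ["price", "market", "stock", "trade"],
    ["stock", "market", "price", "share"],
    ["team", "game", "score", "player"],
]
final_topics = merge_duplicate_topics(initial_topics)
```

With this made-up input, the first two topics share three of five distinct words (Jaccard 0.6), so they are merged into one, and `final_topics` holds two topics instead of three. A comparison based on word embeddings or on topic-word distributions could be substituted for the Jaccard overlap without changing the overall structure.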