摘要 |
An approach is provided to analyze posts included threads of an online forum. The analyzing identifies a main topic related to a parent post of the thread. Child posts of the thread are selected with the parent post being a parent to each of the child posts. Child topics are identified for each of the child posts. A relevance of each of the child posts is determined by comparing the identified main topic to each of the identified child topics. Child posts are selected based on the relevance of the child posts. Parent post data is ingested into a corpus utilized by a question answering (QA) system. Data from the selected child posts is also ingested into the corpus. |