发明名称 APPARATUS AND METHOD FOR SEARCHING INFORMATION BASED ON WIKIPEDIA'S CONTENTS
摘要 The present invention is to provide an apparatus for searching information based on Wikipedia's contents comprising: a document converting part extracting fulltext documents, section title documents, info-box documents, category documents and definition statement documents from Wikipedia original documents and generating at least one of Wikipedia documents for questions and answers; a document indexing part analyzing the Wikipedia document for questions and answers, extracting POS-based index terms from the Wikipedia document for questions and answers, and generating a Wikipedia document index for questions and answers; a question analyzing part receiving a natural language question, analyzing a question pattern, an answer pattern and a question focus from the natural language question, and extracting document search keywords; a document searching part performing document search by using the document search keywords from the Wikipedia document index for questions and answers and generating document search result from each Wikipedia document index for questions and answers; an answer extracting part extracting first answers by using information about the question pattern, the answer pattern and the question focus from the document search result; and an answer integrating part integrating and prioritizing the first answer and generating a second answer.
申请公布号 US2015193505(A1) 申请公布日期 2015.07.09
申请号 US201414260828 申请日期 2014.04.24
申请人 Electronics and Telecommunications Research Institute 发明人 RYU Pum-Mo;KIM Hyun-Ki;PARK Sang-Kyu;BAE Yong-Jin;HEO Jeong;OH Hyo-Jung;LEE Chung-Hee;LIM Soo-Jong;JANG Myung-Gil;CHOI Mi-Ran;CHOI Yoon-Jae;YOON Yeo-Chan;JO Yo-Han
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. An apparatus for searching information based on Wikipedia's contents comprising: a document converting part extracting fulltext documents, section title documents, info-box documents, category documents and definition statement documents from Wikipedia original documents and generating at least one of Wikipedia documents for questions and answers; a document indexing part analyzing Wikipedia document for questions and answers, extracting POS-based index terms from the Wikipedia document for questions and answers, and generating a Wikipedia document index for questions and answers; a question analyzing part receiving a natural language question, analyzing a question pattern, an answer pattern and a question focus from the natural language question, and extracting document search keywords; a document searching part performing document search by using the document search keywords from the Wikipedia document index for questions and answers and generating document search result from each Wikipedia document index for questions and answers; an answer extracting part extracting first answers by using information about the question pattern, the answer pattern and the question focus from the document search result; and an answer integrating part integrating and prioritizing the first answer and generating a second answer.
地址 Daejeon KR