发明名称 SEMANTIC ENRICHMENT BY EXPLOITING TOP-K PROCESSING
摘要 <p>Proper representation of the meaning of texts is crucial to enhancing many data mining and information retrieval tasks, including clustering, computing semantic relatedness between texts, and searching. Representing of texts in the concept-space derived from Wikipedia has received growing attention recently, due to its comprehensiveness and expertise. This concept-based representation is capable of extracting semantic relatedness between texts that cannot be deduced with the bag of words model. A key obstacle, however, for using Wikipedia as a semantic interpreter is that the sheer size of the concepts derived from Wikipedia makes it hard to efficiently map texts into concept-space. An efficient algorithm is proved which is able to represent the meaning of a text by using the concepts that best match it. In particular, this approach first computes the approximate top- concepts that are most relevant to the given text. These concepts are then leverage to represent the meaning of the given text.</p>
申请公布号 EP2691845(A2) 申请公布日期 2014.02.05
申请号 EP20110790440 申请日期 2011.06.03
申请人 THOMSON LICENSING 发明人 KIM, JONG, WOOK;KASHYAP, ASHWIN, S.;LI, DEKAI;BHAMIDIPATI, SANDILYA;PATEL, BANKIM, A.;SRIDHAR, AVINASH;MATHUR, SAURABH
分类号 G06F7/00 主分类号 G06F7/00
代理机构 代理人
主权项
地址