Title: TEXT REPRESENTATION METHOD AND APPARATUS
Abstract: The present invention relates to text analysis and discloses a text representation method. Aspects include identifying concepts in the text by using a knowledge base, determining relationships between the concepts, and generating a concept graph by using the relationships between the concepts. Aspects also include determining connected components of the concept graph, calculating weights of the connected components, and determining the concepts representing the text according to the weights of the connected components. By using the correlation between concepts in a knowledge base and the connected-component theory of graphs, the method finds the set of concepts that best represents the subject of the text and removes concepts irrelevant to the subject, thus improving the accuracy of the text representation and reducing noise.
Publication No.: US2016154803(A1) Publication date: 2016.06.02
Application No.: US201514967315 Filing date: 2015.12.13
Applicant: International Business Machines Corporation Inventors: Cao Feng; Ni Yuan; Xu Qiongkai; Zhu Hui Jia
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Principal claim: 1. A text representation method, comprising: identifying concepts in a text by using a knowledge base and determining relationship between the concepts; generating a concept graph by using the relationship between the concepts; determining connected components of the concept graph; calculating weight of the connected components; determining the concepts representing the text according to the weight of the connected components.
Address: Armonk NY US
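
The steps recited in the claim (identify concepts, build a concept graph, find its connected components, weight them, select the representing concepts) can be illustrated with a minimal Python sketch. It is not the patented implementation. The toy knowledge base (`KB_CONCEPTS`, `KB_RELATIONS`), the function names, the weighting rule (sum of concept mention counts), and the choice to keep only the single heaviest component are all assumptions made for illustration; the record above does not specify them.

```python
from collections import defaultdict

# Hypothetical toy knowledge base: surface form -> concept name.
KB_CONCEPTS = {
    "apple": "Apple Inc.",
    "iphone": "iPhone",
    "ios": "iOS",
    "banana": "Banana",
    "fruit": "Fruit",
    "river": "River",
}
# Hypothetical undirected relations between concepts in the knowledge base.
KB_RELATIONS = [
    ("Apple Inc.", "iPhone"),
    ("iPhone", "iOS"),
    ("Banana", "Fruit"),
]


def identify_concepts(text):
    """Map tokens of the text to knowledge-base concepts and count mentions."""
    counts = defaultdict(int)
    for token in text.lower().split():
        token = token.strip(".,;:!?")
        if token in KB_CONCEPTS:
            counts[KB_CONCEPTS[token]] += 1
    return dict(counts)


def build_concept_graph(concepts):
    """Build an adjacency map with an edge for each relation whose ends both occur."""
    graph = {c: set() for c in concepts}
    for a, b in KB_RELATIONS:
        if a in graph and b in graph:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def connected_components(graph):
    """Return the connected components of the graph via depth-first search."""
    seen, components = set(), []
    for start in graph:
        if start in seen:
            continue
        stack, component = [start], set()
        seen.add(start)
        while stack:
            node = stack.pop()
            component.add(node)
            for neighbor in graph[node]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    stack.append(neighbor)
        components.append(component)
    return components


def represent_text(text):
    """Return the concepts of the highest-weight connected component."""
    counts = identify_concepts(text)
    components = connected_components(build_concept_graph(counts))
    if not components:
        return set()
    # Assumed weight: total number of mentions of the component's concepts.
    return max(components, key=lambda comp: sum(counts[c] for c in comp))
```

For the input "Apple released a new iPhone running iOS. I ate a banana by the river.", this sketch groups Apple Inc., iPhone, and iOS into one component and leaves Banana and River as isolated concepts, so the isolated concepts are dropped as noise under the assumed weighting.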