发明名称 |
TEXT REPRESENTATION METHOD AND APPARATUS |
摘要 |
The present invention relates to text analysis, and discloses a text representation method. Aspects include identifying concepts in the text by using a knowledge base and determining relationship between the concepts and generating a concept graph by using the relationship between the concepts. Aspects also include determining connected components of the concept graph; calculating weight of the connected components and determining the concepts representing the text according to the weight of the connected components. By using correlation between concepts in a knowledge base and according to connected component theory of a graph, finds out a set of concepts which best represents subject of the text, and removes concepts irrelevant to the subject, thus improving accuracy of text representation and reducing noise. |
申请公布号 |
US2016154803(A1) |
申请公布日期 |
2016.06.02 |
申请号 |
US201514967315 |
申请日期 |
2015.12.13 |
申请人 |
International Business Machines Corporation |
发明人 |
Cao Feng;Ni Yuan;Xu Qiongkai;Zhu Hui Jia |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
1. A text representation method, comprising:
identifying concepts in a text by using a knowledge base and determining relationship between the concepts; generating a concept graph by using the relationship between the concepts; determining connected components of the concept graph; calculating weight of the connected components; determining the concepts representing the text according to the weight of the connected components. |
地址 |
Armonk NY US |