发明名称 |
Method and apparatus for extracting and structuring domain terms |
摘要 |
A method of automatically categorizing terms extracted from a text corpus is comprised of identifying lexical atoms in a text corpus as terms. The identified terms are extracted based on a relation that exists between the terms. A weight is assigned to each relation. A graphical representation of the relationships among terms is constructed by using terms as vertices and relations as weighted links between the vertices. A vertex score is calculated for each of the vertices of the graph. Each term is categorized based on its vertex score. The graphical representation may be revised based on its structure and/or the calculated vertex scores. Because of the rules governing abstracts, this abstract should not be used to construe the claims.
|
申请公布号 |
US2007016863(A1) |
申请公布日期 |
2007.01.18 |
申请号 |
US20060482344 |
申请日期 |
2006.07.07 |
申请人 |
QU YAN;ABDULJALEEL NASREEN |
发明人 |
QU YAN;ABDULJALEEL NASREEN |
分类号 |
G06F17/00 |
主分类号 |
G06F17/00 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|