发明名称 |
FRAMEWORK FOR DOCUMENT KNOWLEDGE EXTRACTION |
摘要 |
A knowledge extraction framework may iteratively enrich an ontology that is used to classify structured knowledge obtained from web pages based on structured knowledge previously acquired from other web pages. The framework may enable a user to define the ontology for extracting structured knowledge from a plurality of web pages. The framework applies the ontology using a supervised extraction algorithm to extract seed information from a set of web pages. The framework further applies an unsupervised extraction algorithm to extract the structured knowledge from an additional set of web pages. The framework subsequently maps the structured knowledge to the ontology based on the seed information to enrich the ontology.
|
申请公布号 |
US2013246435(A1) |
申请公布日期 |
2013.09.19 |
申请号 |
US201213419690 |
申请日期 |
2012.03.14 |
申请人 |
YAN JUN;JI LEI;WILD EDWARD W.;LI YI;LIU NING;CHEN ZHENG;MICROSOFT CORPORATION |
发明人 |
YAN JUN;JI LEI;WILD EDWARD W.;LI YI;LIU NING;CHEN ZHENG |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|