Title of invention: Reference resolution for text enrichment and normalization in mining mixed data
Abstract: A method for enrichment of text which enables mixed data mining includes generating a model for structured data found in tables of a database. In the model, semantically-linked terms are associated with referents, such as field names or cell content of the fields, of the structured data. The referents may be a business object or refer to a business object. A plurality of candidate referring entities in textual data in the database, such as chunks of free text, is identified. For each candidate referring entity, a similarity measure between the candidate referring entity in the textual data and the model is computed to identify referring entities of the candidate referring entities and corresponding business objects/referents to which the referring entities refer. The textual data is enriched with information derived from the business objects.
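The steps named in the abstract (build a model of referents from structured tables, identify candidate referring entities in free text, score each candidate against the model, enrich the text) can be sketched roughly as follows. This is an illustrative sketch only, not the method claimed in the application: the abstract does not specify the similarity measure, so the character-level ratio from Python's `difflib.SequenceMatcher` is swapped in as a stand-in, and word n-grams are assumed as candidates. All names (`build_model`, `candidates`, `resolve`, `enrich`), the `printers` table, the synonym list, and the 0.9 threshold are hypothetical.

```python
from difflib import SequenceMatcher


def build_model(tables, synonyms=None):
    """Associate terms (field names, cell values, linked terms) with referents."""
    model = {}  # lower-cased term -> referent description
    for table, rows in tables.items():
        for row in rows:
            for field, value in row.items():
                model[field.lower()] = {"table": table, "field": field}
                model[str(value).lower()] = {
                    "table": table, "field": field, "value": value,
                }
    # Assumption: semantically linked terms are supplied as a synonym map
    # and simply point at the referent of their target term.
    for term, target in (synonyms or {}).items():
        if target.lower() in model:
            model[term.lower()] = model[target.lower()]
    return model


def candidates(text, max_len=3):
    """Yield word n-grams (up to max_len words) as candidate referring entities."""
    words = text.split()
    for n in range(1, max_len + 1):
        for i in range(len(words) - n + 1):
            yield " ".join(words[i:i + n])


def resolve(text, model, threshold=0.9):
    """Return (candidate, referent, score) for candidates close to a model term."""
    matches = []
    for cand in candidates(text):
        key = cand.lower().strip(".,;:")
        # Stand-in similarity measure: difflib's character-level ratio.
        best = max(model, key=lambda t: SequenceMatcher(None, key, t).ratio())
        score = SequenceMatcher(None, key, best).ratio()
        if score >= threshold:
            matches.append((cand, model[best], score))
    return matches


def enrich(text, model, threshold=0.9):
    """Tag each resolved referring entity with the table and field it refers to."""
    for cand, referent, _ in resolve(text, model, threshold):
        tag = f"{cand} [{referent['table']}.{referent['field']}]"
        text = text.replace(cand, tag, 1)
    return text


# Hypothetical structured data and free-text note.
tables = {"printers": [{"model": "WorkCentre 7132", "status": "discontinued"}]}
model = build_model(tables, synonyms={"device": "model"})
note = "Customer reports the Workcentre 7132 jams often."
print(enrich(note, model))
```

The sketch does not handle overlapping matches; a fuller treatment would keep only the best-scoring candidate per text span before tagging.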
Publication number: US2008027893(A1)    Publication date: 2008.01.31
Application number: US20060493085    Filing date: 2006.07.26
Applicant: XEROX CORPORATION    Inventors: CAVESTRO BRUNO; RENDERS JEAN-MICHEL
Classification: G06F17/30; G06F7/00    Main classification: G06F17/30
Agency:    Agent:
Main claim:
Address: