发明名称 EXTRACTION OF ATTRIBUTES AND VALUES FROM NATURAL LANGUAGE DOCUMENTS
摘要 One or more classification algorithms are applied to at least one natural language document in order to extract both attributes and values of a given product. Supervised classification algorithms, semi-supervised classification algorithms, unsupervised classification algorithms or combinations of such classification algorithms may be employed for this purpose. The at least one natural language document may be obtained via a public communication network. Two or more attributes (or two or more values) thus identified may be merged to form one or more attribute phrases or value phrases. Once attributes and values have been extracted in this manner, association or linking operations may be performed to establish attribute-value pairs that are descriptive of the product. In a presently preferred embodiment, an (unsupervised) algorithm is used to generate seed attributes and values which can then support a supervised or semi-supervised classification algorithm.
申请公布号 US2012036100(A1) 申请公布日期 2012.02.09
申请号 US201113197906 申请日期 2011.08.04
申请人 PROBST KATHARINA;GHANI RAYID;FANO ANDREW E.;KREMA MARKO;LIU YAN;ACCENTURE GLOBAL SERVICES LIMITED 发明人 PROBST KATHARINA;GHANI RAYID;FANO ANDREW E.;KREMA MARKO;LIU YAN
分类号 G06N5/02 主分类号 G06N5/02
代理机构 代理人
主权项
地址
您可能感兴趣的专利