发明名称 A method for the extraction of relation patterns from articles
摘要 <p>A method for building a knowledge base containing entailment relations, therefore comprises the steps of: a) providing at least one input pattern ( p ) with N pattern slots (N&gt;1), said input pattern ( p ) expressing a specific semantic relation between N entities that fill the N pattern slots of the input pattern ( p ) as slot fillers, b) providing at least one cluster ( c ) of articles, said articles of said cluster ( c ) relating to a common main topic; c) processing said articles with respect to the input pattern ( p ) and identifying the identities which match the semantic type of the N pattern slots; d) if said at least one input pattern matches a portion of an article ( a ) of said at least one cluster ( c ): i) storing the N slot fillers (s 1 , s 2 , ... , s N ), which match the slots of the pattern (p), and a cluster identifier lc of the cluster ( c ) into a first table S , wherein the N-tuple (s 1 , s 2 , ... , s N ) and the cluster identifier l c of the associated cluster ( c ) form one element of said table S ; ii) for each element of table S, identifying appearances of the slot fillers (s 1 , s 2 , ... , s N ) in a plurality of articles of cluster ( c ) and for each appearance so identified, storing the slot fillers (s 1 , s 2 , ... , s N ) together with the sentence in which they occur into a second table C 0 ; iii) from the sentences stored in table C 0 , extracting patterns which span over the corresponding N slot fillers (s 1 , s 2 , ... , s N ), said extracted pattern expressing a semantic relation between said N slot fillers; and iv) storing said extracted pattern together with said input pattern as entailment relation into said knowledge base.</p>
申请公布号 EP1983444(A1) 申请公布日期 2008.10.22
申请号 EP20070106252 申请日期 2007.04.16
申请人 THE EUROPEAN COMMUNITY, REPRESENTED BY THE EUROPEAN COMMISSION 发明人 TANEV, HRISTO TANEV
分类号 G06F17/30;G06F17/27 主分类号 G06F17/30
代理机构 代理人
主权项
地址