发明名称 Generation of a semantic model from textual listings
摘要 A corpus of textual listings is received and main concept words and attribute words therein are identified via an iterative process of parsing listings and expanding a semantic model. During the parsing phase, the corpus of textual listings is parsed to tag one or more head noun words and/or one or more identifier words in each listing based on previously identified main concept words or using a head noun identification rule. Once substantially each listing in the corpus has been parsed in this manner, the expansion phase assigns head noun words as main concept words and modifier words as attribute words, where possible. During the next iteration, the newly identified main concept words and/or attribute words are used to further parse the listings. These iterations are repeated until a termination condition is reached. Remaining words in the corpus are clustered based on the main concept words and attribute words.
申请公布号 US9594747(B2) 申请公布日期 2017.03.14
申请号 US201615003344 申请日期 2016.01.21
申请人 Accenture Global Services Limited 发明人 Kim Doo Soon;Yeh Peter Z.;Verma Kunal
分类号 G06F17/27;G06Q30/02 主分类号 G06F17/27
代理机构 Harrity & Harrity, LLP 代理人 Harrity & Harrity, LLP
主权项 1. A system comprising: a processing device to: identify main concept words and attribute words in textual listings;cluster words, in the textual listings, based on at least one of the main concept words or the attribute words according to at least one clustering rule, the at least one clustering rule including at least one of: a first rule preventing clustering of words based on a frequency of appearance of words in a same textual listing,a second rule preventing clustering of a quantitative attribute word with a qualitative attribute word, ora third rule indicating clustering of two words when characters of a first word, of the two words, are included in a second word of the two words; andprovide, after clustering the words, the main concept words and the attribute words as at least a portion of a semantic model, the semantic model being used for subsequent clustering.
地址 Dublin IE