发明名称 ITEM LISTING CATEGORIZATION SYSTEM
摘要 Techniques for categorizing item listings based on parsing item listing titles are described. According to various embodiments, listing titles of one or more item listings on a marketplace website are accessed, the item listings being associated with a particular product category in a product category structure of the marketplace website. Words in each of the listing titles may then be converted to semantic tokens in a token symbol space, based on a tokenization process. Thereafter, n-gram modeling may be performed on the tokens corresponding to each of the listing titles of the item listings in the particular product category. One or more dominant n-gram models associated with the listing titles of the item listings in the particular product category may then be identified.
申请公布号 US2015052143(A1) 申请公布日期 2015.02.19
申请号 US201313966160 申请日期 2013.08.13
申请人 Liu Ming;Raman Suresh;Li Rui 发明人 Liu Ming;Raman Suresh;Li Rui
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computer-implemented method comprising: accessing listing titles of one or more item listings on a marketplace website, the item listings being associated with a particular product category in a product category structure of the marketplace website; converting words in each of the listing titles to semantic tokens in a token symbol space, based on a tokenization process; performing n-gram modeling on the tokens corresponding to each of the listing titles of the item listings in the particular product category; and identifying, by a machine having a memory and at least one processor, one or more dominant n-gram models associated with the listing titles of the item listings in the particular product category.
地址 Palo Alto CA US