Title of Invention |
ITEM LISTING CATEGORIZATION SYSTEM |
Abstract |
Techniques for categorizing item listings based on parsing item listing titles are described. According to various embodiments, listing titles of one or more item listings on a marketplace website are accessed, the item listings being associated with a particular product category in a product category structure of the marketplace website. Words in each of the listing titles may then be converted to semantic tokens in a token symbol space, based on a tokenization process. Thereafter, n-gram modeling may be performed on the tokens corresponding to each of the listing titles of the item listings in the particular product category. One or more dominant n-gram models associated with the listing titles of the item listings in the particular product category may then be identified. |
Application Publication Number |
US2015052143(A1) |
Application Publication Date |
2015.02.19 |
Application Number |
US201313966160 |
Filing Date |
2013.08.13 |
Applicant |
Liu Ming;Raman Suresh;Li Rui |
Inventor |
Liu Ming;Raman Suresh;Li Rui |
Classification Number |
G06F17/30 |
Main Classification Number |
G06F17/30 |
Agency |
|
Agent |
|
Principal Claim |
1. A computer-implemented method comprising:
accessing listing titles of one or more item listings on a marketplace website, the item listings being associated with a particular product category in a product category structure of the marketplace website;
converting words in each of the listing titles to semantic tokens in a token symbol space, based on a tokenization process;
performing n-gram modeling on the tokens corresponding to each of the listing titles of the item listings in the particular product category; and
identifying, by a machine having a memory and at least one processor, one or more dominant n-gram models associated with the listing titles of the item listings in the particular product category. |
Address |
Palo Alto CA US |
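
The steps recited in the abstract and the principal claim (tokenizing listing titles, n-gram modeling over the tokens, and identifying dominant n-grams within a category) can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration and is not taken from the record: the `LEXICON` mapping and its token symbols (`BRAND`, `PRODUCT`, and so on), the fallback symbols `NUM` and `WORD`, the function names, and the reading of "dominant" as "appears in at least a given share of the titles in the category".

```python
from collections import Counter

# Hypothetical lexicon mapping lowercase words to semantic token symbols.
# The record does not define the token symbol space; these symbols are assumed.
LEXICON = {
    "apple": "BRAND", "samsung": "BRAND",
    "iphone": "PRODUCT", "galaxy": "PRODUCT",
    "black": "COLOR", "white": "COLOR",
    "new": "CONDITION", "used": "CONDITION",
}


def tokenize(title):
    """Convert each word of a listing title to a semantic token symbol."""
    tokens = []
    for word in title.lower().split():
        if word in LEXICON:
            tokens.append(LEXICON[word])
        elif any(ch.isdigit() for ch in word):
            tokens.append("NUM")   # assumed symbol for model numbers, sizes, etc.
        else:
            tokens.append("WORD")  # assumed catch-all symbol
    return tokens


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token sequence as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def dominant_ngrams(titles, n=2, min_share=0.5):
    """Return (n-gram, share) pairs for n-grams found in at least
    min_share of the titles. The threshold rule is an assumption;
    the record does not say how dominance is decided."""
    total = len(titles)
    if total == 0:
        return []
    doc_freq = Counter()
    for title in titles:
        # Count each n-gram at most once per title.
        doc_freq.update(set(ngrams(tokenize(title), n)))
    return [(gram, count / total)
            for gram, count in doc_freq.most_common()
            if count / total >= min_share]


if __name__ == "__main__":
    # Made-up titles standing in for listings in one product category.
    category_titles = [
        "New Apple iPhone 5 Black",
        "Used Apple iPhone 4S White",
        "Samsung Galaxy S4 White New",
    ]
    for gram, share in dominant_ngrams(category_titles, n=2, min_share=0.5):
        print(gram, round(share, 2))
```

Counting each n-gram once per title, rather than once per occurrence, keeps a single repetitive title from dominating the category statistics. That is a design choice of this sketch, not something the record specifies.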