发明名称 NEAR-DUPLICATE FILTERING IN SEARCH ENGINE RESULT PAGE OF AN ONLINE SHOPPING SYSTEM
摘要 Reducing near-duplicate entries in online shopping system search results. For each pair of entries in a set of entries, each entry characterizing a product in a data store of an online shopping system and each entry characterized by a set of attributes, determining a distance between the entries in the pair based on the attributes. Determining entry clusters from a graph formed with each determined distance as an edge between nodes representing the entries used to determine the distance, each entry cluster identified by cluster identifier. Returning an ordered list of results responsive to the query from the data store of an online shopping system, filtered as a function of at least one of the distance and the cluster identifier.
申请公布号 US2016232591(A1) 申请公布日期 2016.08.11
申请号 US201615134240 申请日期 2016.04.20
申请人 GOOGLE INC. 发明人 Hu Liang;Chen Lijie;Zhang Hao
分类号 G06Q30/06;G06F17/30 主分类号 G06Q30/06
代理机构 代理人
主权项 1. A computer-implemented method to reduce same merchant near-duplicate entries in online shopping system search results, comprising: for each pair of entries in a set of entries from the same merchant, each entry characterizing a product in a data store of an online shopping system and each entry characterized by a set of quantified attributes, determining, by one or more computing devices, a distance between the entries in the pair in a vector space of the quantified attributes; determining, by the one or more computing devices, clusters of entries as a function of the determined distance between each pair of entries; receiving, by the one or more computing devices, a query directed to the data store; and returning, by the one or more computing devices, an ordered list of results responsive to the query from the data store of an online shopping system, filtered as a function of the determined distance to reduce the number of near duplicate entries in the search results.
地址 Mountain View CA US