Independent claim |
1. A computer-implemented method to reduce same merchant near-duplicate entries in online shopping system search results, comprising:
for each pair of entries in a set of entries from the same merchant, each entry characterizing a product in a data store of an online shopping system and each entry characterized by a set of quantified attributes, determining, by one or more computing devices, a distance between the entries in the pair in a vector space of the quantified attributes;
determining, by the one or more computing devices, clusters of entries as a function of the determined distance between each pair of entries;
receiving, by the one or more computing devices, a query directed to the data store; and
returning, by the one or more computing devices, an ordered list of results responsive to the query from the data store of an online shopping system, filtered as a function of the determined distance to reduce the number of near-duplicate entries in the search results. |
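The steps recited in the claim can be illustrated with a minimal sketch. The claim does not fix a distance metric, a clustering rule, or a filtering policy, so everything specific here is an assumption made for illustration: Euclidean distance, a single-linkage rule that merges any pair closer than a hypothetical `threshold`, and a filter that keeps only the highest-ranked entry of each cluster. The names `cluster_entries` and `filter_results` are likewise hypothetical.

```python
from itertools import combinations
from math import dist  # Euclidean distance, Python 3.8+


def cluster_entries(entries, threshold):
    """Group one merchant's entries into clusters of near-duplicates.

    entries: dict mapping entry id -> tuple of quantified attributes.
    threshold: assumed cutoff; pairs at or below it are merged.
    Returns a dict mapping entry id -> cluster label.
    """
    parent = {entry_id: entry_id for entry_id in entries}

    def find(x):
        # Follow parent links to the cluster root, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Determine a distance for each pair of entries in attribute space,
    # then merge pairs that fall within the threshold.
    for a, b in combinations(entries, 2):
        if dist(entries[a], entries[b]) <= threshold:
            parent[find(a)] = find(b)

    return {entry_id: find(entry_id) for entry_id in entries}


def filter_results(ranked_ids, cluster_of):
    """Keep the first (highest-ranked) entry from each cluster."""
    seen = set()
    kept = []
    for entry_id in ranked_ids:
        # Entries with no cluster label are treated as their own cluster.
        label = cluster_of.get(entry_id, entry_id)
        if label not in seen:
            seen.add(label)
            kept.append(entry_id)
    return kept


# Three entries from one merchant; "a" and "b" differ only slightly.
entries = {"a": (1.0, 2.0), "b": (1.1, 2.0), "c": (5.0, 5.0)}
clusters = cluster_entries(entries, threshold=0.5)

# Ordered results for some query, before filtering.
print(filter_results(["b", "c", "a"], clusters))  # ['b', 'c']
```

In this sketch the clusters are computed ahead of the query, matching the order of the claimed steps, so that filtering at query time is a single pass over the ranked list.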