Title of invention: Duplicate item detection system and method
Abstract: A method of detecting contextual duplicate items can include identifying a plurality of representations of items in a data repository, each item representation including one or more textual attributes. A degree of fit between one item representation's attributes and another item can be calculated. The degree of fit can reflect the relevance of the attributes of one item to the other item. A degree of association between the two item representations can be calculated based at least in part on the calculated degree of fit. The degree of association can reflect the similarity of the two items, and it can be assessed to determine whether the items are contextual duplicates.
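The flow described in the abstract (degree of fit, then degree of association, then a duplicate decision) can be sketched as follows. This is only an illustrative sketch and not the patented method: the abstract does not define how either score is computed, so the token-overlap fit, the two-direction average, the 0.8 threshold, and all function names here are assumptions chosen for illustration.

```python
def tokenize(text):
    """Lowercase a textual attribute and split it on whitespace."""
    return text.lower().split()


def degree_of_fit(attributes, other_attributes):
    """Assumed fit score: the fraction of tokens in `attributes` that
    also occur somewhere in the other item's attributes (0.0 to 1.0)."""
    tokens = [tok for attr in attributes for tok in tokenize(attr)]
    if not tokens:
        return 0.0
    other_tokens = {tok for attr in other_attributes for tok in tokenize(attr)}
    matched = sum(1 for tok in tokens if tok in other_tokens)
    return matched / len(tokens)


def degree_of_association(attrs_a, attrs_b):
    """Assumed association score: the mean of the fit in both directions,
    so the result does not depend on argument order."""
    return (degree_of_fit(attrs_a, attrs_b) + degree_of_fit(attrs_b, attrs_a)) / 2


def are_contextual_duplicates(attrs_a, attrs_b, threshold=0.8):
    """Assess the association against a hypothetical threshold."""
    return degree_of_association(attrs_a, attrs_b) >= threshold


if __name__ == "__main__":
    item_a = ["Acme Widget 3000", "blue"]
    item_b = ["acme widget 3000", "Blue"]
    item_c = ["Garden Hose", "green"]
    print(are_contextual_duplicates(item_a, item_b))
    print(are_contextual_duplicates(item_a, item_c))
```

Averaging the two directional fits is one simple way to turn an asymmetric relevance score into a symmetric similarity; a real system would likely weight attributes and tokens rather than count them equally.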
Publication number: US7827186(B2)  Publication date: 2010.11.02
Application number: US20070863987  Filing date: 2007.09.28
Applicant: AMAZON TECHNOLOGIES, INC.  Inventor: HICKS CORY
Classification: G06F7/00; G06F17/30  Main classification: G06F7/00
Agency:  Agent:
Principal claim:
Address: