Title of invention: Reliability of duplicate document detection algorithms
Abstract: In a single-signature duplicate document detection system, a secondary set of attributes is used in addition to a primary set of attributes to improve the precision of the system. When the projection of a document onto the primary set of attributes falls below a threshold, the secondary set of attributes supplements the primary lexicon so that the projection rises above the threshold.
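The supplementing step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that the "projection" is the set of distinct document tokens found in an attribute set, that the threshold is a count of those tokens, and that the single signature is a SHA-1 hash of the sorted projection. The function name `signature`, the `threshold` parameter, and the example lexicons are all hypothetical.

```python
import hashlib


def signature(text, primary, secondary, threshold=2):
    """Return a single signature for `text` (illustrative sketch only)."""
    # Naive whitespace tokenization; an assumption, not taken from the source.
    tokens = set(text.lower().split())

    # Projection of the document onto the primary set of attributes.
    projection = tokens & primary

    # If the primary projection is below the threshold, supplement it
    # with matches from the secondary set of attributes.
    if len(projection) < threshold:
        projection |= tokens & secondary

    # Hash the sorted projection so token order does not matter.
    canonical = " ".join(sorted(projection))
    return hashlib.sha1(canonical.encode("utf-8")).hexdigest()


# Hypothetical lexicons and documents, for illustration only.
primary = {"loan", "mortgage", "credit", "rate"}
secondary = {"offer", "free", "apply"}

doc_a = "Apply now for a free loan offer"
doc_b = "free LOAN offer apply today"
doc_c = "mortgage rate and credit offer"

sig_a = signature(doc_a, primary, secondary)
sig_b = signature(doc_b, primary, secondary)
sig_c = signature(doc_c, primary, secondary)
```

Here `doc_a` and `doc_b` each match only one primary term, so the secondary terms are added and both documents receive the same signature. `doc_c` matches three primary terms, so its signature is computed from the primary projection alone. This sketch adds every matching secondary term and does not check that the supplemented projection actually reaches the threshold.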
Publication number: US7392262(B1)  Publication date: 2008.06.24
Application number: US20040016959  Filing date: 2004.12.21
Applicant: AOL LLC  Inventors: ALSPECTOR JOSHUA; KOLCZ ALEKSANDER; CHOWDHURY ABDUR R.
Classification: G06F17/00  Main classification: G06F17/00
Agency:  Agent:
Principal claim:
Address: