发明名称 Locating potentially identical objects across multiple computers based on stochastic partitioning of workload
摘要 Potentially identical objects (e.g., files) are located across multiple computers based on stochastic partitioning of workload. For each of a plurality of objects stored on a plurality of computers in a network, a portion of object information corresponding to the object is selected. The object information can be generated in a variety of manners (e.g., based on hashing the object, based on characteristics of the object, and so forth). Any of a variety of portions of the object information can be used (e.g., the least significant bits of the object information). A stochastic partitioning process is then used to identify which of the plurality of computers to communicate the object information to for identification of potentially identical objects on the plurality of computers.
申请公布号 US7519623(B2) 申请公布日期 2009.04.14
申请号 US20040991656 申请日期 2004.11.18
申请人 MICROSOFT CORPORATION 发明人 DOUCEUR JOHN R.;THEIMER MARVIN M.;ADYA ATUL;BOLOSKY WILLIAM J.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址