发明名称 |
Duplicate filtering in a data processing environment |
摘要 |
A data processing method is provided. The method comprises collecting a stream of data records from one or more devices in a network; loading one or more persistent indexes associated with the stream of data records into memory; identifying duplicate data records in the stream of data records using the in-memory indexes; and updating a repository such that the duplicate data records are not stored in the repository or managed differently than non-duplicate data records. |
申请公布号 |
US8180739(B2) |
申请公布日期 |
2012.05.15 |
申请号 |
US20090509507 |
申请日期 |
2009.07.27 |
申请人 |
ARDITI JOEL;BERK DAVID HAROLD;GILAT DAGAN;KRUTYOLKIN SERGEY;LANDAU ARIEL;SHANI URI;INTERNATIONAL BUSINESS MACHINES CORPORATION |
发明人 |
ARDITI JOEL;BERK DAVID HAROLD;GILAT DAGAN;KRUTYOLKIN SERGEY;LANDAU ARIEL;SHANI URI |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|