Title of invention |
Efficient indexing of error tolerant set containment |
Abstract |
The claimed subject matter provides a method and a system for the efficient indexing of error-tolerant set containment. An exemplary method comprises obtaining a frequency threshold and a query set. All tokens or token sets within the query set are determined, and then all minimal infrequent tokens or minimal infrequent token sets of the data records are found and used to build an index. The minimal infrequent tokens or token sets are processed in a fixed order, and a collection of signatures is then determined for each of them. |
Publication number |
US8606771(B2) |
Publication date |
2013.12.10 |
Application number |
US20100973909 |
Filing date |
2010.12.21 |
Applicant |
ARASU ARVIND;AGRAWAL PARAG;SHRIRAGHAV KAUSHIK;MICROSOFT CORPORATION |
Inventor |
ARASU ARVIND;AGRAWAL PARAG;SHRIRAGHAV KAUSHIK |
Classification number |
G06F7/00;G06F17/30 |
Main classification number |
G06F7/00 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|
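
The abstract refers to finding minimal infrequent token sets against a frequency threshold. The record does not define these terms, so the following is only a minimal sketch under stated assumptions: a token set is treated as infrequent when at most `threshold` data records contain all of its tokens, and as minimal when none of its proper subsets is infrequent. The function names, the sample records, and the brute-force enumeration are hypothetical illustrations, not the patented method, and the signature-generation step is not covered.

```python
from itertools import combinations


def support(token_set, records):
    """Count the records that contain every token in token_set."""
    return sum(1 for record in records if token_set <= record)


def minimal_infrequent_subsets(query, records, threshold):
    """Return the minimal infrequent subsets of `query`.

    Assumption: "infrequent" means support <= threshold.
    Brute-force enumeration, suitable only for small query sets.
    """
    tokens = sorted(query)  # process tokens in a fixed order
    minimal = []
    # Enumerate candidates by increasing size so that any infrequent
    # proper subset is discovered before its supersets are examined.
    for size in range(1, len(tokens) + 1):
        for combo in combinations(tokens, size):
            candidate = frozenset(combo)
            # A superset of a known minimal infrequent set is not minimal.
            if any(found <= candidate for found in minimal):
                continue
            if support(candidate, records) <= threshold:
                minimal.append(candidate)
    return minimal


# Hypothetical data records, each a set of tokens.
records = [
    {"a", "b"},
    {"a", "c"},
    {"a", "b", "c"},
    {"b", "d"},
]

pair_result = minimal_infrequent_subsets({"a", "b", "c"}, records, threshold=1)
single_result = minimal_infrequent_subsets({"a", "b", "d"}, records, threshold=1)
```

In this sample, every single token of the query `{"a", "b", "c"}` appears in more than one record, so only the pair `{"b", "c"}` qualifies. For the query `{"a", "b", "d"}`, the token `"d"` is already infrequent on its own. An index keyed on such sets would, under these assumptions, touch only short record lists at query time.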