发明名称 |
Systems and methods for using provenance information for data retention in stream-processing |
摘要 |
A system and method for determining data usage based on provenance information, in a stream-processing system, includes progressively setting usage information for output stream data objects (SDOs), determining input SDOs that an output SDO depends on, based on a provenance dependency function; recursively feeding back the usage information for a subset of SDOs that can be discarded; and discarding the subset of SDOs. A system and method for data retention based on usage information, in a stream-processing system, includes managing retention of SDOs by deleting SDOs that are determined to be of null usage; and enhancing retention characteristics of SDOs that are deemed to have usage. |
申请公布号 |
US8856313(B2) |
申请公布日期 |
2014.10.07 |
申请号 |
US200711939176 |
申请日期 |
2007.11.13 |
申请人 |
International Business Machines Corporation |
发明人 |
Amini Lisa;Venkatramani Chitra |
分类号 |
G06F15/173;G06F17/30;G06F7/06;G06F9/46;H04L29/06;H04L9/32;H04L12/24 |
主分类号 |
G06F15/173 |
代理机构 |
Tutunjian & Bitetto, P.C. |
代理人 |
Tutunjian & Bitetto, P.C. ;Stock William |
主权项 |
1. A method for determining data usage based on provenance information, in a stream-processing system, the method comprising:
progressively setting usage information, comprising a usage count corresponding to a number of downstream recipients, for output stream data objects (SDOs) comprising information that is transmitted in the stream-processing system; determining with a processor input SDOs that an output SDO depends on, based on a provenance dependency function; recursively feeding back the usage information for a subset of SDOs that can be discarded; and discarding the subset of SDOs. |
地址 |
Armonk NY US |