发明名称 Data processing apparatus and method of processing data
摘要 Data processing apparatus comprising: a chunk store having a plurality of chunk sections, each operable to store specimen data chunks, the apparatus being operable to: process an input data set into one or more input data chunks; identify a specimen data chunk in one of said chunk sections which corresponds to a first input data chunk; identify a second input data chunk not corresponding to a specimen data chunk in the chunk store; and store the second input data chunk as a specimen data chunk in proximity to the identified specimen data chunk corresponding to the first input data chunk.
申请公布号 US8838541(B2) 申请公布日期 2014.09.16
申请号 US200712671334 申请日期 2007.10.25
申请人 Hewlett-Packard Development Company, L.P. 发明人 Camble Peter Thomas;Trezise Gregory Keith
分类号 G06F7/00;G06F17/00;G06F11/14 主分类号 G06F7/00
代理机构 代理人
主权项 1. An apparatus comprising: a chunk store having a plurality of chunk sections each storing specimen data chunks; a manifest store for containing a manifest representing at least part of a data set and having references to said chunk sections; at least one processor configured to: process an input data set into input data chunks; identify, using the manifest, a specimen data chunk in a given one of said chunk sections which corresponds to a first of the input data chunks; identify a second of the input data chunks not corresponding to a specimen data chunk in the chunk store; store the second input data chunk as a specimen data chunk in deliberate proximity to the identified specimen data chunk, wherein the storing in deliberate proximity results in selecting the given chunk section rather than another of said chunk sections to store the second input data chunk as a specimen data chunk; associate a specimen data chunk in at least one chunk section with a back-reference to a manifest referencing that specimen data chunk; determine when a given specimen data chunk is not associated with a back-reference to a manifest; delete the given specimen data chunk from a particular chunk section after a predetermined time period or number of iterations in response to determining that the given specimen data chunk is not associated with a back-reference to a manifest; and after the deleting, reduce fragmentation of the particular chunk section by rearranging chunks remaining in the particular chunk section.
地址 Houston TX US