Title of invention: Iterating in parallel for deduplication
Abstract: A method is used in iterating in parallel for deduplication. Based on an iteration scheme, a collection of sections is selected from a set of storage extents. Each section of the collection includes a subset of the contents of one storage extent of the set. Based on the iteration scheme, the sections of the collection are arranged in an ordered arrangement. Based on the ordered arrangement, a deduplicating technique is applied in parallel to each section of the collection.
Publication number: US8977812(B1) Publication date: 2015.03.10
Application number: US201113075487 Filing date: 2011.03.30
Applicant: EMC Corporation Inventors: Chen Xiangping; de Forest Miles A.; Zhou Siyu; Mullis Samuel L.; Spadafora Brian M.; Wan Li
Classification: G06F13/00; G06F13/28 Main classification: G06F13/00
Agency: Agents: Bhayana Deepika; Reyes Jason A.; Gupta Krishnendu
Main claim: 1. A method for use in iterating in parallel for deduplication, the method comprising: selecting a collection of sections from a set of storage extents based on an iteration scheme, wherein a deduplication domain includes the set of storage extents, wherein each storage extent of the set of storage extents is apportioned into a set of sections, wherein each section of the collection of sections includes subset of the contents of a storage extent of the set of storage extents, wherein the contents of a storage extent includes a set of data blocks; arranging each section of the collection of sections in an ordered arrangement based on the iteration scheme selected from a list of iteration schemes for applying a deduplicating technique, wherein the list of iteration schemes include a parallel iteration scheme, wherein each iteration scheme of the list of iteration schemes indicates a manner in which the set of storage extents is iterated for applying the deduplicating technique, wherein the ordered arrangement indicates an order in which each section of the collection of sections is processed for applying the deduplicating technique; and based on the ordered arrangement, applying the deduplicating technique in parallel to each section of the collection of sections, wherein data blocks in each section of the collection of sections are deduplicated in parallel based on the ordered arrangement, wherein a number of sections of the collection of sections selected for applying the deduplicating technique in parallel are based on a set of processes used for deduplication, wherein each section of the number of sections is iterated by a process of the set of processes.
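Illustrative sketch (not part of the patent record): the steps recited in the claim can be pictured roughly as follows. This is a minimal Python sketch under several assumptions that the claim does not state: extents are modeled as in-memory lists of byte-string data blocks, sections are fixed-size slices, the "parallel iteration scheme" is taken to mean interleaving sections across extents, duplicate detection uses SHA-256 digests, and the "set of processes" is modeled with a thread pool. All function names (`apportion`, `ordered_sections`, `digest_section`, `deduplicate`) are hypothetical.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def apportion(extent, section_size):
    """Split one extent (a list of data blocks) into fixed-size sections."""
    return [extent[i:i + section_size]
            for i in range(0, len(extent), section_size)]


def ordered_sections(extents, section_size):
    """Assumed 'parallel' scheme: order sections by section number first,
    then by extent, so neighbouring sections come from different extents."""
    sections = []
    for extent_id, extent in enumerate(extents):
        for section_id, blocks in enumerate(apportion(extent, section_size)):
            sections.append((section_id, extent_id, blocks))
    sections.sort(key=lambda s: (s[0], s[1]))
    return sections


def digest_section(section):
    """Work done by one worker: hash every data block of one section."""
    section_id, extent_id, blocks = section
    return [(hashlib.sha256(block).hexdigest(), (extent_id, section_id, offset))
            for offset, block in enumerate(blocks)]


def deduplicate(extents, section_size=2, num_workers=2):
    """Process sections in batches sized to the number of workers.

    Returns the digest index and a list of (duplicate, original) locations,
    where a location is (extent_id, section_id, offset_in_section).
    """
    index = {}
    duplicates = []
    sections = ordered_sections(extents, section_size)
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        for start in range(0, len(sections), num_workers):
            batch = sections[start:start + num_workers]
            # Hashing runs in parallel; results are merged in the ordered
            # arrangement so the outcome is deterministic.
            for results in pool.map(digest_section, batch):
                for digest, location in results:
                    if digest in index:
                        duplicates.append((location, index[digest]))
                    else:
                        index[digest] = location
    return index, duplicates
```

In this sketch the batch size equals the number of workers, which mirrors the claim's statement that the number of sections handled in parallel depends on the set of processes used for deduplication. Merging the digests sequentially is a simplification chosen for determinism; a real system would more likely update a shared, synchronized index.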
Address: Hopkinton MA US