发明名称 |
Performing data storage operations with a cloud environment, including containerized deduplication, data pruning, and data transfer |
摘要 |
Various systems and methods may be used for performing data storage operations, including content-indexing, containerized deduplication, and policy-driven storage, within a cloud environment. The systems support a variety of clients and cloud storage sites that may connect to the system in a cloud environment that requires data transfer over wide area networks, such as the Internet, which may have appreciable latency and/or packet loss, using various network protocols, including HTTP and FTP. Methods for content indexing data stored within a cloud environment may facilitate later searching, including collaborative searching. Methods for performing containerized deduplication may reduce the strain on a system namespace, effectuate cost savings, etc. Methods may identify suitable storage locations, including suitable cloud storage sites, for data files subject to a storage policy. Further, the systems and methods may be used for providing a cloud gateway and a scalable data object store within a cloud environment. |
申请公布号 |
US9171008(B2) |
申请公布日期 |
2015.10.27 |
申请号 |
US201313850903 |
申请日期 |
2013.03.26 |
申请人 |
Commvault Systems, Inc. |
发明人 |
Prahlad Anand;Muller Marcus S.;Kottomtharayil Rajiv;Kavuri Srinivas;Gokhale Parag;Vijayan Manoj Kumar |
分类号 |
G06F7/00;G06F17/30;G06Q30/02;G06Q50/18;G06F3/06;H04L29/08;G06F11/34;H04L29/06 |
主分类号 |
G06F7/00 |
代理机构 |
Perkins Coie LLP |
代理人 |
Perkins Coie LLP |
主权项 |
1. A computer-implemented method for indexing and searching multiple content items, the method comprising:
selecting or accessing, with a secondary copy component of a computing system, at least one secondary copy of the multiple content items,
wherein the secondary copy of the multiple content items is a copy of the multiple content items and is not a primary copy of the multiple content items,wherein the primary copy is available by the computer system over a local area network, andwherein the at least one secondary copy is stored at a cloud storage site located geographically remote from the computer system; for at least some of the multiple content items included in the secondary copy, with a content indexing component of the computing system;
analyzing content of a content item, including analyzing a summary of the content item;based upon the analysis, generating metadata corresponding to the content item, wherein the metadata includes at least a logical address to the cloud storage site for accessing the content item; andstoring, in a content index, the generated metadata of the content, wherein the content index is not stored at the cloud storage site, but is locally accessible by the computer system; and identifying, with an index searching component of the computing system, one or more indexed content items based on a search query and the metadata stored within the content index. |
地址 |
Tinton Falls NJ US |