发明名称 INDEXING OF FILE IN A HADOOP CLUSTER
摘要 A file indexing system for indexing a file to be stored onto a distributed file system includes a segmentation module to segment the file into a plurality of segments. The file indexing system further includes an index generation module to initiate indexing of the file through a plurality of nodes of a Hadoop cluster, where each of the plurality of nodes indexes one or more segments from amongst the plurality of segments to generate at least one index corresponding to the one or more segments. The file indexing system further includes an index transfer module to store the at least one index onto the distributed file system.
申请公布号 US2015120695(A1) 申请公布日期 2015.04.30
申请号 US201414498598 申请日期 2014.09.26
申请人 TATA CONSULTANCY SERVICES LIMITED 发明人 Vasu Arun;Kurunthala Jishnu
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A file indexing system for indexing a file to be stored onto a distributed file system, the file indexing system comprising: a processor; a segmentation module coupled to the processor to segment the file into a plurality of segments; an index generation module coupled to the processor to initiate indexing of the file through a plurality of nodes of a Hadoop cluster, wherein each of the plurality of nodes indexes one or more segments from amongst the plurality of segments to generate at least one index corresponding to the one or more segments; and an index transfer module coupled to the processor to store the at least one index onto the distributed file system.
地址 Mumbai IN