Abstract |
Described herein is technology for, among other things, distributed indexing of file content. Content-based indexing of a file involves determining whether content-based index information for the file is available from an external source. This avoids repeating content analysis that has already been performed, which is time-consuming and computationally intensive, especially for non-text files. If the content-based index information is available, it is received from the external source and may be stored. If it is not available or is incomplete, content-based index information for the file is generated and stored, and the generated information is shared with the external source. Once content analysis of the file has been performed, the resulting content-based index information is available and can be shared as needed, so the same content analysis need not be repeated on the file. |
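
The flow summarized above can be sketched as follows. This is a minimal illustration under stated assumptions, not the method as claimed: the names `ExternalIndexSource`, `file_identity`, and `index_file`, the `"complete"` flag, and the choice to identify a file by a SHA-256 hash of its content are all hypothetical and do not come from the abstract. The external source is modeled as an in-memory dictionary standing in for whatever peer or server would hold shared index information.

```python
import hashlib
from typing import Callable, Dict, Optional


class ExternalIndexSource:
    """Hypothetical stand-in for the external source of index information."""

    def __init__(self) -> None:
        self._store: Dict[str, dict] = {}

    def lookup(self, file_id: str) -> Optional[dict]:
        # Return previously shared index information, or None if absent.
        return self._store.get(file_id)

    def publish(self, file_id: str, index_info: dict) -> None:
        # Share index information so that others need not regenerate it.
        self._store[file_id] = index_info


def file_identity(content: bytes) -> str:
    # Assumption: identical content is recognized by its SHA-256 digest.
    return hashlib.sha256(content).hexdigest()


def index_file(
    content: bytes,
    external: ExternalIndexSource,
    local_index: Dict[str, dict],
    analyze: Callable[[bytes], dict],
) -> dict:
    """Reuse shared index information when possible; otherwise generate it."""
    file_id = file_identity(content)
    info = external.lookup(file_id)

    # Not available, or available but incomplete: run the costly analysis
    # locally and share the result with the external source.
    if info is None or not info.get("complete", False):
        info = analyze(content)
        external.publish(file_id, info)

    # In either case, store the index information locally.
    local_index[file_id] = info
    return info
```

With this sketch, two machines that call `index_file` on the same content against one shared `ExternalIndexSource` trigger the `analyze` callback only once; the second call receives the stored result instead of repeating the analysis.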