摘要 |
PROBLEM TO BE SOLVED: To provide an information collection system for collecting wide information related in meaning by a desired amount, based on a given retrieval key. SOLUTION: This information collection system determines a level of relevancy between retrieval key correspondence retrieval result information retrieved by a focused crawler, based on the retrieval key, and linked-target correspondence retrieval result information retrieved by the focused crawler, based on linked-target information included in the retrieval key correspondence retrieval result information, and extracts a related word having high relevancy with a representative keyword of the retrieval key correspondence retrieval result information, out of the retrieval key correspondence retrieval result information. The information collection system specifies, as new retrieval keys, a URL used for the linked-target correspondence retrieval result information having high relevancy with the retrieval key correspondence retrieval result information, and the extracted related word of high relevancy. The information collection system limits a retrieval frequency using the new retrieval keys of the focused crawler, and collects information by the focused crawler. COPYRIGHT: (C)2011,JPO&INPIT
|