Abstract |
PROBLEM TO BE SOLVED: To provide an apparatus, a method, and a program for creating a link collection, capable of automatically creating a directory-type link collection page. SOLUTION: Document files containing URLs are acquired from e-mails, newsgroups, bulletin boards, and the like. The URLs are extracted from the files, and a predetermined number of text passages preceding and following each URL are extracted as candidate introductory texts for that URL. The full document (main body) published at the site specified by each URL is then acquired, and its document vector is compared with the document vectors of the candidates; the candidate with the highest similarity to the main body is selected as the introductory text for that URL. Finally, the selected introductory texts are classified by category and output, together with their URLs, as a link collection page in HTML format.
|
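The pipeline described in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch under several assumptions that the abstract does not confirm: candidates are taken to be sentences, document vectors are plain term-frequency counts compared by cosine similarity, categories come from a hypothetical keyword table (`CATEGORIES`), and page bodies are supplied by a caller-provided `fetch_body` function instead of being downloaded. All function names here are invented for illustration.

```python
import html
import math
import re
from collections import Counter, defaultdict

URL_RE = re.compile(r"https?://[^\s<>\"']+")

# Hypothetical keyword table; the abstract does not say how categories are assigned.
CATEGORIES = {
    "Programming": {"python", "code", "algorithms"},
    "Travel": {"hotel", "flight", "itinerary"},
}


def vectorize(text):
    """Term-frequency vector (an assumed, simple form of 'document vector')."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm = math.sqrt(sum(c * c for c in a.values())) * math.sqrt(sum(c * c for c in b.values()))
    return dot / norm if norm else 0.0


def candidates(message, url, window=2):
    """Up to `window` sentences before and after the sentence containing `url`."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+|\n+", message) if s.strip()]
    idx = next(i for i, s in enumerate(sentences) if url in s)
    return sentences[max(0, idx - window):idx] + sentences[idx + 1:idx + 1 + window]


def pick_introduction(message, url, body, window=2):
    """Choose the candidate most similar to the main body of the linked page."""
    body_vec = vectorize(body)
    return max(candidates(message, url, window),
               key=lambda s: cosine(vectorize(s), body_vec), default="")


def classify(text):
    """Assign the category whose keywords overlap most with the text."""
    tokens = set(vectorize(text))
    best = max(CATEGORIES, key=lambda name: len(CATEGORIES[name] & tokens))
    return best if CATEGORIES[best] & tokens else "Other"


def build_link_page(messages, fetch_body, window=2):
    """Group (URL, introduction) pairs by category and render them as HTML."""
    by_category = defaultdict(list)
    for message in messages:
        for url in URL_RE.findall(message):
            intro = pick_introduction(message, url, fetch_body(url), window)
            by_category[classify(intro)].append((url, intro))
    parts = ["<html><body>"]
    for category in sorted(by_category):
        parts.append(f"<h2>{html.escape(category)}</h2><ul>")
        for url, intro in by_category[category]:
            link = html.escape(url)
            parts.append(f'<li><a href="{link}">{link}</a>: {html.escape(intro)}</li>')
        parts.append("</ul>")
    parts.append("</body></html>")
    return "\n".join(parts)


# Example input: one message and a stand-in for the fetched page body.
MESSAGE = (
    "Hi all, the weather was lovely this weekend.\n"
    "Here is a nice tutorial on sorting algorithms in Python.\n"
    "https://example.org/sorting\n"
    "See you at the meeting on Friday.\n"
)
BODIES = {
    "https://example.org/sorting":
        "This tutorial explains sorting algorithms such as quicksort and mergesort, with Python code.",
}
page = build_link_page([MESSAGE], lambda url: BODIES.get(url, ""))
print(page)
```

In this example the sentence about the sorting tutorial shares the most terms with the page body, so it is selected over the unrelated sentences around the URL and filed under the hypothetical "Programming" category. A real system would likely need term weighting (for example TF-IDF) and a more robust classifier; those choices are outside what the abstract states.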