发明名称 |
DEVICE FOR FINDING OUT MIRROR SITE GROUP ON WWW, METHOD FOR FINDING OUT MIRROR SITE, PROGRAM FOR THE METHOD, AND STORAGE MEDIUM RECORDING THE PROGRAM |
摘要 |
PROBLEM TO BE SOLVED: To find out a proper mirror site group and to enhance the processing efficiency. SOLUTION: A page which forms a top page of a Web site is estimated from a large Web page group (S1), a site group is determined from the top page estimated for the Web page group and a page linked thereto (S2), and sites having a size of a fixed value or more of the site group are selected as a processing object (S3). A file of site characteristic elements (information for link character strings, anchor character strings and internal/external links possessed by the sites) is formed (S4). Site pairs having the same characteristic element are selected as mirror site candidates (S5), and a mirror site pair is detected from the similarity of the mirror site candidate pairs (S6). COPYRIGHT: (C)2004,JPO&NCIPI
|
申请公布号 |
JP2004264926(A) |
申请公布日期 |
2004.09.24 |
申请号 |
JP20030052314 |
申请日期 |
2003.02.28 |
申请人 |
NIPPON TELEGR & TELEPH CORP <NTT> |
发明人 |
MORI KENICHI |
分类号 |
G06F17/30;G06F12/00;(IPC1-7):G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|