Title of invention: DEVICE FOR FINDING OUT MIRROR SITE GROUP ON WWW, METHOD FOR FINDING OUT MIRROR SITE, PROGRAM FOR THE METHOD, AND STORAGE MEDIUM RECORDING THE PROGRAM
Abstract: PROBLEM TO BE SOLVED: To find appropriate mirror site groups and to improve processing efficiency. SOLUTION: Pages that serve as the top page of a Web site are estimated from a large group of Web pages (S1), sites are determined from the estimated top pages and the pages linked from them (S2), and sites whose size is at or above a fixed value are selected for processing (S3). A file of site characteristic elements (the link character strings, anchor character strings, and internal/external link information of each site) is created (S4). Site pairs that share the same characteristic element are selected as mirror site candidates (S5), and mirror site pairs are detected from the similarity of the candidate pairs (S6). COPYRIGHT: (C)2004,JPO&NCIPI
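Steps S3 to S6 of the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the input layout (a dict of sites, each with a list of pages and their links), the names `site_features` and `find_mirror_pairs`, the parameters `min_pages` and `threshold`, and the use of Jaccard similarity are all assumptions made for illustration, since the abstract does not specify the similarity measure. Steps S1 and S2 (top page estimation and site determination) are assumed to have already produced the input.

```python
from collections import defaultdict
from itertools import combinations


def site_features(site):
    """S4: collect a site's characteristic elements.

    Each link is a hypothetical tuple (link_string, anchor_string, is_internal).
    """
    features = set()
    for page in site["pages"]:
        for link_string, anchor_string, is_internal in page["links"]:
            features.add((link_string, anchor_string, is_internal))
    return features


def find_mirror_pairs(sites, min_pages=2, threshold=0.8):
    """Return (site_a, site_b, similarity) for likely mirror pairs."""
    # S3: keep only sites whose size (page count here) reaches the fixed value.
    selected = {name: s for name, s in sites.items() if len(s["pages"]) >= min_pages}

    # S4: build the characteristic-element set of every selected site.
    features = {name: site_features(s) for name, s in selected.items()}

    # S5: an inverted index from element to sites; any two sites that share
    # an element become a candidate pair, so unrelated pairs are never compared.
    index = defaultdict(set)
    for name, feats in features.items():
        for feat in feats:
            index[feat].add(name)
    candidates = set()
    for names in index.values():
        candidates.update(combinations(sorted(names), 2))

    # S6: score each candidate pair (Jaccard similarity is an assumption).
    mirrors = []
    for a, b in sorted(candidates):
        fa, fb = features[a], features[b]
        similarity = len(fa & fb) / len(fa | fb)
        if similarity >= threshold:
            mirrors.append((a, b, similarity))
    return mirrors


# Hypothetical input: a.example and b.example carry identical link structure.
mirror_pages = [
    {"links": [("/docs/", "Docs", True), ("http://www.example.org/", "Home", False)]},
    {"links": [("/download/", "Download", True)]},
]
example_sites = {
    "a.example": {"pages": mirror_pages},
    "b.example": {"pages": mirror_pages},
    "c.example": {"pages": [{"links": [("/docs/", "Docs", True)]},
                            {"links": [("/blog/", "Blog", True)]}]},
    "d.example": {"pages": [{"links": [("/docs/", "Docs", True)]}]},  # dropped in S3
}
print(find_mirror_pairs(example_sites))
```

In this sketch the inverted index in S5 is what limits the work: only pairs that share at least one element are scored in S6, rather than every possible pair of sites.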
Publication number: JP2004264926(A) Publication date: 2004.09.24
Application number: JP20030052314 Filing date: 2003.02.28
Applicant: NIPPON TELEGR & TELEPH CORP <NTT> Inventor: MORI KENICHI
Classification: G06F17/30;G06F12/00;(IPC1-7):G06F17/30 Main classification: G06F17/30
Agency: Agent:
Principal claim:
Address: