Title of invention INFORMATION COLLECTION APPARATUS, SEARCH ENGINE, INFORMATION COLLECTION METHOD, AND PROGRAM
Abstract The present invention provides an information collection apparatus, an information collection method, and a program capable of effectively collecting information from information resources on a network, as well as a search engine that searches the collected information resources. An information collection apparatus of the present invention, which collects information from information resources on a network, includes: an extraction unit that acquires data from an information resource via the network and extracts a link-destination address included in the data; a calculation unit that calculates, by comparing each link-destination address with a collection rule describing a set of addresses qualified as collection targets, a score for each link-destination address that reflects a distance from the set to the link-destination information resource indicated by that address; and a judgment unit that judges, in accordance with the score calculated for the link-destination information resource, whether or not that resource is to be included in the collection target.
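The three units named in the abstract (extraction, calculation, judgment) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the collection rule is modeled as a tuple of URL prefixes, the score is modeled as a hop count (0 when an address matches the rule, otherwise one more than the score of the page that links to it), and the threshold `MAX_DISTANCE`, the regular expression, and all function names are hypothetical.

```python
import re
from urllib.parse import urljoin

# Hypothetical collection rule: URL prefixes that qualify as collection targets.
COLLECTION_RULE = ("http://example.com/docs/",)
# Hypothetical threshold: largest score still accepted as a collection target.
MAX_DISTANCE = 1

# Simplified link pattern (assumption); a real crawler would use an HTML parser.
HREF_RE = re.compile(r'href="([^"]+)"', re.IGNORECASE)


def extract_links(base_url, html):
    """Extraction unit: return absolute link-destination addresses found in html."""
    return [urljoin(base_url, href) for href in HREF_RE.findall(html)]


def score_link(link, source_score, rule=COLLECTION_RULE):
    """Calculation unit (assumed scoring): 0 if the address matches the rule,
    otherwise one hop farther from the rule set than the linking page."""
    if link.startswith(rule):
        return 0
    return source_score + 1


def is_collection_target(score, max_distance=MAX_DISTANCE):
    """Judgment unit: accept a resource whose score is within the threshold."""
    return score <= max_distance


# Usage: a page inside the rule set (score 0) links to one in-rule
# address and one out-of-rule address.
page_url = "http://example.com/docs/index.html"
page_html = '<a href="/docs/a.html">A</a> <a href="http://other.example.org/x">X</a>'
links = extract_links(page_url, page_html)
scores = {link: score_link(link, source_score=0) for link in links}
targets = [link for link, s in scores.items() if is_collection_target(s)]
```

Under these assumptions, an address that does not match the rule is still collected when it sits close enough to the rule set, and is dropped once its score exceeds `MAX_DISTANCE`. The patent's actual scoring may differ.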
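Publication No. US2011119263(A1) Publication date 2011.05.19
Application No. US200913003875 Filing date 2009.08.14
Applicant INTERNATIONAL BUSINESS MACHINES CORPORATION Inventors HAMADA SEIJI; YAMAMOTO MAKOTO
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Independent claim
Address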