发明名称 DEVICE FOR IDENTIFYING INVALID PARAMETERS IN URL, AND DEVICE AND METHOD FOR IDENTIFYING INVALID PARAMETERS
摘要 <p>The present invention relates to the technical field of search engines. Disclosed are a device for identifying invalid parameters in a URL, and a device and method for identifying invalid parameters. The device comprises a URL obtaining unit, configured to obtain URLs separately linked to a plurality of web-pages; a URL fragment combination extracting unit, configured to extract a URL fragment combination from each obtained URL separately linked to a web-page; a statistics unit, configured to collect statistics on an occurrence frequency of each URL fragment combination and determine a URL fragment combination, an occurrence frequency of which meets a preset condition, to be a target URL fragment combination; and a validity determining unit, configured to determine validity of each URL parameter in the target URL fragment combination for each target URL fragment combination according to the URL containing the target URL fragment combination. The efficiency for identifying duplicate links is enhanced, and then the efficiency for grabbing information by a search engine is improved.</p>
申请公布号 WO2015043308(A1) 申请公布日期 2015.04.02
申请号 WO2014CN83216 申请日期 2014.07.29
申请人 BEIJING QIHOO TECHNOLOGY COMPANY LIMITED;QIZHI SOFTWARE (BEIJING) COMPANY LIMITED 发明人 WEI, SHAOJUN
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址