摘要 |
<p>The present invention relates to the technical field of search engines. Disclosed are a device for identifying invalid parameters in a URL, and a device and method for identifying invalid parameters. The device comprises a URL obtaining unit, configured to obtain URLs separately linked to a plurality of web-pages; a URL fragment combination extracting unit, configured to extract a URL fragment combination from each obtained URL separately linked to a web-page; a statistics unit, configured to collect statistics on an occurrence frequency of each URL fragment combination and determine a URL fragment combination, an occurrence frequency of which meets a preset condition, to be a target URL fragment combination; and a validity determining unit, configured to determine validity of each URL parameter in the target URL fragment combination for each target URL fragment combination according to the URL containing the target URL fragment combination. The efficiency for identifying duplicate links is enhanced, and then the efficiency for grabbing information by a search engine is improved.</p> |