PURPOSE: A webpage crawling method and an apparatus thereof are provided to reduce crawling working time by increasing probability to search for content of a web page having specific information through dynamic control of crawling depth. CONSTITUTION: A web address obtaining unit(210) obtains a web address list which is linked on a route web address list. A webpage evaluating unit(220) obtains content about a web address page by visiting web addresses based on the web address list. A crawling depth controlling unit(230) controls crawling depth according to an evaluation result.
申请公布号
KR20120042529(A)
申请公布日期
2012.05.03
申请号
KR20100104246
申请日期
2010.10.25
申请人
SAMSUNG ELECTRONICS CO., LTD.;KOREA ADVANCED INSTITUTE OF SCIENCE AND TECHNOLOGY
发明人
YOON, SEUNG HYUN;MAENG, SEUNG RYOUL;HUH, JAE HYUK;SEO, SANG WON;KIM, JAE HONG;PARK, JONG SE