发明名称 SYSTEM AND METHOD FOR DOWNLOADING HYPERTEXT MARKUP LANGUAGE FORMATTED WEB PAGES
摘要 A method for downloading HTML formatted Web pages is provided. The method includes the steps of writing a URL of a Web page to be downloaded to an XQuery script; analyzing the XQuery script to obtain the URL of the HTML Web page and saving the downloaded Web page in a database as the local Web page; analyzing the contents of the local Web page to obtain target contents; converting the relative URLs of all image files to the absolute URLs; downloading all the image files according to the absolute URLs; replacing the absolute URLs of the image files with an local image file path; converting the relative URLs of the embedded links to the absolute URLs of the embedded links; saving all the converted absolute URLs in the database, creating identifiers; replacing the converted absolute URLs of the embedded links with an embedded link local path. A related system is also disclosed.
申请公布号 US2008046449(A1) 申请公布日期 2008.02.21
申请号 US20070756593 申请日期 2007.05.31
申请人 HON HAI PRECISION INDUSTRY CO., LTD. 发明人 LEE CHUNG-I;YEH CHIEN-FA;LU CHIU-HUA;JIANG ZHI-QIANG
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址