Title of Invention |
SYSTEM AND METHOD FOR DOWNLOADING HYPERTEXT MARKUP LANGUAGE FORMATTED WEB PAGES |
Abstract |
A method for downloading HTML formatted Web pages is provided. The method includes the steps of: writing the URL of a Web page to be downloaded into an XQuery script; parsing the XQuery script to obtain the URL of the HTML Web page, and saving the downloaded Web page in a database as a local Web page; analyzing the contents of the local Web page to obtain target contents; converting the relative URLs of all image files to absolute URLs; downloading all the image files according to their absolute URLs; replacing the absolute URLs of the image files with local image file paths; converting the relative URLs of the embedded links to absolute URLs; saving all the converted absolute URLs in the database and creating corresponding identifiers; and replacing the converted absolute URLs of the embedded links with embedded link local paths. A related system is also disclosed.
|
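The URL-rewriting steps in the abstract can be illustrated with a minimal sketch. It is written in Python rather than XQuery, uses regular expressions instead of a real HTML parser, and stands in for the database with two in-memory dictionaries. The function name `localize_page`, the directory names `images` and `pages`, and the integer identifiers are all assumptions made for illustration, not details from the patent. The actual download of the page and its image files is left out.

```python
import re
from urllib.parse import urljoin

# Simplified patterns (assumption): double-quoted src/href attributes only.
IMG_SRC = re.compile(r'(<img\b[^>]*?\bsrc=")([^"]*)(")', re.IGNORECASE)
A_HREF = re.compile(r'(<a\b[^>]*?\bhref=")([^"]*)(")', re.IGNORECASE)


def localize_page(html, page_url, image_dir="images", link_dir="pages"):
    """Rewrite image and link URLs in `html` to local paths.

    Returns (rewritten_html, image_map, link_map), where the two maps
    stand in for the database described in the abstract.
    """
    image_map = {}  # absolute image URL -> local image file path
    link_map = {}   # absolute link URL -> identifier (hypothetical scheme)

    def local_image(match):
        # Relative URL -> absolute URL, resolved against the page URL.
        absolute = urljoin(page_url, match.group(2))
        name = absolute.rsplit("/", 1)[-1] or "index"
        local = f"{image_dir}/{name}"
        image_map[absolute] = local  # an image downloader would consume this
        return match.group(1) + local + match.group(3)

    def local_link(match):
        absolute = urljoin(page_url, match.group(2))
        # Create an identifier the first time an absolute URL is seen.
        ident = link_map.setdefault(absolute, len(link_map) + 1)
        return f"{match.group(1)}{link_dir}/{ident}.html{match.group(3)}"

    html = IMG_SRC.sub(local_image, html)
    html = A_HREF.sub(local_link, html)
    return html, image_map, link_map
```

For example, calling `localize_page('<img src="img/a.png">', "http://example.com/docs/index.html")` records `http://example.com/docs/img/a.png` in the image map and rewrites the `src` attribute to `images/a.png`.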
Publication Number |
US2008046449(A1) |
Publication Date |
2008.02.21 |
Application Number |
US20070756593 |
Filing Date |
2007.05.31 |
Applicant |
HON HAI PRECISION INDUSTRY CO., LTD. |
Inventors |
LEE CHUNG-I;YEH CHIEN-FA;LU CHIU-HUA;JIANG ZHI-QIANG |
Classification |
G06F17/30 |
Main Classification |
G06F17/30 |
Agency |
|
Agent |
|
Principal Claim |
|
Address |
|