发明名称 |
Methods and apparatus to automatically crawl the internet using image analysis |
摘要 |
Methods and apparatus to automatically crawl the Internet using image analysis are disclosed. An example method to visually identify components of a web page includes rendering a web page in a web browser to generate an image, and visually analyzing at least a portion of the image with a machine to detect a region containing a possible web page component. The example method further includes automatically determining a type of the detected web page component and storing the web page component type and a location of the portion of the web page. |
申请公布号 |
EP2169566(A1) |
申请公布日期 |
2010.03.31 |
申请号 |
EP20090012337 |
申请日期 |
2009.09.29 |
申请人 |
THE NIELSEN COMPANY (US), LLC |
发明人 |
DELIYANNIS, ALEXANDROS |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|