Title of invention Detecting separator lines in a web page
Abstract A system and method of detecting separator lines in a web page may include determining coordinates of visible web elements on a web page, generating an edge image of the web page based on the coordinates of the web elements, filtering edges belonging to non-separator line elements within the edge image, detecting horizontal lines within the edge image, detecting vertical lines within the edge image, and filtering short lines within the edge image. A system for detecting separator lines in a web page may include a memory device, and a processor communicatively coupled to the memory, in which the processor determines coordinates of visible web elements on a web page, generates an edge image of the web page based on the coordinates of the web elements, filters edges belonging to non-separator line elements within the edge image, detects horizontal lines within the edge image, detects vertical lines within the edge image, and filters short lines within the edge image.
Publication number US8867837(B2) Publication date 2014.10.21
Application number US201013812421 Filing date 2010.07.30
Applicant Hewlett-Packard Development Company, L.P. Inventors Hou Hui-Man;Zheng Li-Wei;Jin Jian-Ming;Fan Jian;Lim Suk Hwan
Classification G06K9/34;C07D309/28;G06K9/00 Main classification G06K9/34
Agency Agent
Main claim 1. A method performed by a physical computing device comprising at least one processor for detecting separator lines in a web page comprising: determining, with the computing system, coordinates of visible web elements on a web page; generating, with the computing system, an edge image of the web page based on the coordinates of the web elements; filtering, with the computing system, edges belonging to non-separator line elements within the edge image; detecting, with the computing system, horizontal lines within the edge image; detecting, with the computing system, vertical lines within the edge image; and filtering, with the computing system, short lines within the edge image to provide an indication of the separator lines.
Address Houston TX US
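
Illustrative sketch (not part of the patent record). The steps recited in the abstract and main claim can be illustrated with a minimal Python sketch. This is not the patented implementation. The `Element` record, its `kind` tag, the `NON_SEPARATOR_KINDS` set, and the `min_len` threshold are all hypothetical names introduced here. The sketch assumes element coordinates are already known as integer pixel boxes, treats text and image elements as the "non-separator line elements", and folds that filtering step into the drawing of the edge image for brevity.

```python
from dataclasses import dataclass


@dataclass
class Element:
    """Hypothetical record for one web element's bounding box, in pixels."""
    x: int
    y: int
    w: int
    h: int
    visible: bool = True
    kind: str = "block"  # hypothetical tag, e.g. "block", "text", "image"


# Assumption: edges of these element kinds never count as separator lines.
NON_SEPARATOR_KINDS = {"text", "image"}


def edge_image(elements, width, height):
    """Draw the box outline of each kept element into a 0/1 pixel grid."""
    img = [[0] * width for _ in range(height)]
    for e in elements:
        # Skip invisible elements and filter non-separator elements.
        if not e.visible or e.kind in NON_SEPARATOR_KINDS:
            continue
        # Clip the box to the page bounds; skip boxes entirely off the page.
        x0, y0 = max(e.x, 0), max(e.y, 0)
        x1 = min(e.x + e.w - 1, width - 1)
        y1 = min(e.y + e.h - 1, height - 1)
        if x0 > x1 or y0 > y1:
            continue
        for x in range(x0, x1 + 1):  # top and bottom edges
            img[y0][x] = img[y1][x] = 1
        for y in range(y0, y1 + 1):  # left and right edges
            img[y][x0] = img[y][x1] = 1
    return img


def runs(bits):
    """Yield (start, length) for each maximal run of 1s in a sequence."""
    start = None
    for i, b in enumerate(bits):
        if b and start is None:
            start = i
        elif not b and start is not None:
            yield start, i - start
            start = None
    if start is not None:
        yield start, len(bits) - start


def detect_lines(img, min_len):
    """Return (horizontal, vertical) lines as (x, y, length) tuples.

    Rows are scanned for horizontal runs and columns for vertical runs;
    runs shorter than min_len are filtered out as short lines.
    """
    horizontal = [(x, y, n)
                  for y, row in enumerate(img)
                  for x, n in runs(row) if n >= min_len]
    vertical = [(x, y, n)
                for x, col in enumerate(zip(*img))
                for y, n in runs(col) if n >= min_len]
    return horizontal, vertical
```

As a usage example, calling `edge_image([Element(0, 0, 10, 6)], 10, 6)` and then `detect_lines(img, 5)` on the result reports the top and bottom edges of the box as horizontal lines and its left and right edges as vertical lines. Scanning rows and columns for pixel runs is one simple way to find axis-aligned lines; the source does not say which detection technique the patent uses.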