Title of invention DOCUMENT OBJECT MODEL (DOM) BASED PAGE UNIQUENESS DETECTION
Abstract DOM-based unique ID generation, including receiving a hypertext markup language (HTML) page at a computer and, in response to the receiving, identifying HTML page elements, the HTML page elements comprising parent nodes, the parent nodes comprising child nodes. The method further comprises processing each of the HTML page elements, the processing comprising: grouping the child nodes by parent node into a group of child nodes; detecting patterns in the group of child nodes in response to the grouping; reducing the group of child nodes to text strings in response to the detecting; and storing the text strings as text values in the parent nodes. A unique identifier (ID) of the HTML page is generated in response to the processing.
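The steps named in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the abstract does not say how patterns are detected or how the ID is derived. The sketch assumes that a "pattern" is a run of consecutive sibling nodes that reduce to the same text string, and that the ID is a SHA-1 digest of the root node's text value. The names `Node`, `TreeBuilder`, `reduce_node`, and `page_id` are hypothetical, and text content and attributes are ignored so that only page structure contributes to the ID.

```python
import hashlib
from html.parser import HTMLParser
from itertools import groupby

# Elements that never have children or an end tag.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """One page element: a tag, its child nodes, and a stored text value."""

    def __init__(self, tag):
        self.tag = tag
        self.children = []
        self.text_value = ""


class TreeBuilder(HTMLParser):
    """Builds a parent/child tree of elements from an HTML string."""

    def __init__(self):
        super().__init__()
        self.root = Node("#document")
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag)
        self.stack[-1].children.append(node)
        if tag not in VOID_TAGS:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Close the nearest matching open element; ignore stray end tags.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break


def reduce_node(node):
    """Reduce a node's children to a text string and store it on the node."""
    # Grouping: the children are already grouped under their parent node.
    child_strings = [reduce_node(child) for child in node.children]
    # Pattern detection (assumed): collapse runs of identical siblings.
    collapsed = [key for key, _ in groupby(child_strings)]
    # Reduction and storage: keep the resulting string on the parent.
    node.text_value = node.tag + "(" + ",".join(collapsed) + ")"
    return node.text_value


def page_id(html):
    """Return a hex ID for the page structure (SHA-1 is an assumption)."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    signature = reduce_node(builder.root)
    return hashlib.sha1(signature.encode("utf-8")).hexdigest()
```

Under these assumptions, two pages that differ only in the number of repeated rows (for example, a list with two items versus three) reduce to the same string and so receive the same ID, while a page with a different element structure receives a different one.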
Publication number US2012005211(A1) Publication date 2012.01.05
Application number US201113167170 Filing date 2011.06.23
Applicant AYOUB KHALIL;ALY HOSAM;WALSH JASON;INTERNATIONAL BUSINESS MACHINES CORPORATION Inventor AYOUB KHALIL;ALY HOSAM;WALSH JASON
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim
Address