发明名称 Methods and apparatus for computing graph similarity via sequence similarity
摘要 This disclosure describes systems and methods for identifying and correcting anomalies in web graphs. A web graph is transformed into a sequence of tokens via a walk algorithm. The sequence is fingerprinted to form a set of shingles. The singles are compared to shingles for other web graphs in order to determine similarity between web graphs. Actions are then carried out to remove anomalous web graphs and modify parameters governing web mapping in order to decrease the likelihood of future anomalous web graphs being built.
申请公布号 US8417657(B2) 申请公布日期 2013.04.09
申请号 US201113099305 申请日期 2011.05.02
申请人 DASDAN ALI;PAPADIMITRIOU PANAGIOTIS;YAHOO! INC. 发明人 DASDAN ALI;PAPADIMITRIOU PANAGIOTIS
分类号 G06F17/00;G06N5/02 主分类号 G06F17/00
代理机构 代理人
主权项
地址