Title of invention Method and apparatus for organizing data sources
Abstract A method and apparatus for organizing deep Web services are provided. In one aspect, a collection of sources and their associated attributes and/or input modes is obtained, for instance, using a crawling algorithm. This information is used to organize the sources into communities. A mining algorithm, such as the hyperclique mining algorithm, is used to obtain cliques of highly correlated attributes. A clustering algorithm, such as the hierarchical agglomerative clustering algorithm, is then used to merge the cliques of attributes into larger cliques, which the present disclosure refers to as signatures. The sources associated with each signature form a community, and a graph representation of the communities is constructed in which the vertices are communities and the edges are the shared attributes.
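The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the method claimed in the application. The record does not give the correlation measure, the clustering criterion, or the rule that ties a source to a signature, so the sketch assumes h-confidence as the hyperclique measure, Jaccard similarity of supporting sources as the merge criterion, and "exposes at least one signature attribute" as the membership rule. The cliques are found by brute force rather than by a real hyperclique miner, and all function names, thresholds, and sample sources are hypothetical.

```python
from itertools import combinations

def h_confidence(attrs, sources):
    """Support of the whole attribute set divided by the largest
    single-attribute support (assumed correlation measure)."""
    support = sum(1 for s in sources.values() if attrs <= s)
    max_single = max(sum(1 for s in sources.values() if a in s) for a in attrs)
    return support / max_single

def hypercliques(sources, min_hconf=0.6, max_size=2):
    """Brute-force stand-in for a hyperclique miner: keep the maximal
    attribute sets whose h-confidence reaches min_hconf."""
    all_attrs = sorted(set().union(*sources.values()))
    found = [frozenset(c)
             for k in range(2, max_size + 1)
             for c in combinations(all_attrs, k)
             if h_confidence(frozenset(c), sources) >= min_hconf]
    return [c for c in found if not any(c < other for other in found)]

def supporters(attrs, sources):
    """Assumed membership rule: a source supports an attribute set
    if it exposes at least one of its attributes."""
    return frozenset(name for name, s in sources.items() if attrs & s)

def jaccard(a, b):
    return len(a & b) / len(a | b) if a | b else 0.0

def merge_into_signatures(cliques, sources, min_sim=0.5):
    """Agglomerative merging: repeatedly join the two most similar
    cliques until no pair reaches min_sim."""
    sigs = [set(c) for c in cliques]

    def sim(pair):
        return jaccard(supporters(sigs[pair[0]], sources),
                       supporters(sigs[pair[1]], sources))

    while len(sigs) > 1:
        i, j = max(combinations(range(len(sigs)), 2), key=sim)
        if sim((i, j)) < min_sim:
            break
        sigs[i] |= sigs[j]
        del sigs[j]  # j > i, so index i stays valid
    return [frozenset(s) for s in sigs]

def community_graph(signatures, sources):
    """Vertices are communities (one per signature); an edge carries the
    attributes that sources in both communities have in common."""
    communities = [supporters(sig, sources) for sig in signatures]
    edges = {}
    for i, j in combinations(range(len(communities)), 2):
        attrs_i = set().union(*(sources[n] for n in communities[i]))
        attrs_j = set().union(*(sources[n] for n in communities[j]))
        shared = attrs_i & attrs_j
        if shared:
            edges[(i, j)] = shared
    return communities, edges

# Hypothetical crawl result: source name -> attributes of its query form.
sources = {
    "books_a": {"title", "author", "isbn", "price"},
    "books_b": {"title", "author", "isbn"},
    "books_c": {"title", "author"},
    "cars_a": {"make", "model", "price"},
    "cars_b": {"make", "model"},
}

cliques = hypercliques(sources)
signatures = merge_into_signatures(cliques, sources)
communities, edges = community_graph(signatures, sources)
```

On this sample data the three book-related attribute pairs merge into one signature, the car attributes form a second, and the two communities are linked by the attribute `price`, which sources in both groups expose. A production system would replace the brute-force enumeration with a dedicated hyperclique mining implementation.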
Publication number US2008040326(A1) Publication date 2008.02.14
Application number US20060503713 Filing date 2006.08.14
Applicant INTERNATIONAL BUSINESS MACHINES CORPORATION Inventors CHANG YUAN-CHI; LIM LIPYEOW; WANG MIN; ZHANG ZHEN
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Main claim
Address