发明名称 Method for enhancing search and browsing in collaborative tagging systems through learned tag hierarchies
摘要 A number of Web 2.0 sites support collaborative tagging systems, which allow users to tag resources with keywords. The tags enable search and retrieval of resources both for the user and for other users, using interfaces like a conventional search form or a tag cloud. A tag hierarchy-based search and retrieval system is provided that enhances the existing interfaces by improving search recall and allowing the discovery of even poorly annotated resources. The system uses tag co-occurrence information to automatically learn tag hierarchies. The learned hierarchies are used for automatically inferring additional tags to resources. These inferences are used to improve the recall of queries issued from a search form or via a tag cloud. The learned hierarchies can be viewed as an emergent ontology that is built up through the collaborative wisdom of a large number of users.
申请公布号 US8799294(B2) 申请公布日期 2014.08.05
申请号 US200812120663 申请日期 2008.05.15
申请人 International Business Machines Corporation 发明人 Bouillet Eric;Liu Zhen;Ranganathan Anand;Riabov Anton
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 August Law, LLC 代理人 Willinghan George;August Law, LLC
主权项 1. A method for improving search and retrieval of resources in a collaborative tagging system, the method comprising: identifying, using a computing system in communication with a plurality of computer accessible resources, relationships among a plurality of tags in the collaborative tagging system, the plurality of tags associated by a plurality of users with the plurality of resources, and the relationships among the plurality of tags comprising sub-tag relationships between given pairs of tags based on co-occurrence of those pairs of tags at a given resource and associative semantics, the associative semantics comprising common association between tags in each given pair of tags by the plurality of users; using the identified tag relationships based on tag co-occurrence statistics to create a hierarchy of tags, the hierarchy comprising a directed acyclic graph where nodes in the directed acyclic graph comprise tags and edges in the directed acyclic graph comprise the identified tag relationships; using the created hierarchy to infer additional tags for resources automatically, to increase a number of tags associated with each resource in order to increase a total number of resources uncovered by a tag-based search of the plurality of resources and to maximize the recall of a tag cloud comprising a plurality of tags; and using the tag hierarchies to include inferred tags in the description of each one of the plurality of resources, increasing the weight of top level tags in the hierarchy and removing lower level tags from the tag cloud.
地址 Armonk NY US