Title of invention: Person disambiguation using name entity extraction-based clustering
Abstract: Described is a technology for disambiguating data about persons located in search results, so that different persons having the same name can be clearly distinguished. Name entity extraction locates words (terms) that are within a certain distance of persons' names in the search results. These terms, such as location information, organization information, career information, and/or partner information, are used to disambiguate search results that correspond to different persons having the same name. In one example, each person is represented as a vector, and similarity among vectors is calculated based on weighting that corresponds to the nearness of the terms to a person and/or to the types of terms. Based on the similarity data, the person vectors that represent the same person are then merged into one cluster, so that each cluster represents (with high probability) only one distinct person.
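Publication number: US2008065623(A1)  Publication date: 2008.03.13
Application number: US20070796818  Filing date: 2007.04.30
Applicant: MICROSOFT CORPORATION  Inventors: ZENG HUA-JUN; HUANG SHEN; CHEN ZHENG; WANG JIAN
Classification: G06F17/30  Main classification: G06F17/30
Agency:  Agent:
Main claim:
Address: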
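
The abstract's example (person vectors, proximity- and type-based weighting, similarity, merging into clusters) can be illustrated with a minimal sketch. This is not the patented method: the record gives no formulas, so the type weights in `TYPE_WEIGHT`, the `1 / (1 + distance)` proximity weighting, the cosine similarity measure, the `0.3` threshold, and the single-link merging are all assumptions chosen for illustration, and every function name is hypothetical.

```python
import math
from collections import defaultdict

# Hypothetical weights per term type. The abstract names these types
# but does not give values; these numbers are assumptions.
TYPE_WEIGHT = {"location": 1.0, "organization": 1.5, "career": 1.2, "partner": 2.0}


def person_vector(terms):
    """Build a sparse vector from (term, term_type, distance) triples.

    distance is the number of words between the term and the person's
    name; nearer terms get larger weights (assumed 1 / (1 + distance)).
    """
    vec = defaultdict(float)
    for term, term_type, distance in terms:
        vec[term] += TYPE_WEIGHT.get(term_type, 1.0) / (1.0 + distance)
    return dict(vec)


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm = math.sqrt(sum(w * w for w in a.values())) * math.sqrt(
        sum(w * w for w in b.values())
    )
    return dot / norm if norm else 0.0


def cluster(vectors, threshold=0.3):
    """Merge vectors whose similarity reaches the threshold (single link).

    Returns a sorted list of clusters, each a list of vector indices.
    """
    parent = list(range(len(vectors)))

    def find(i):
        # Follow parent links to the cluster root, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(vectors)):
        for j in range(i + 1, len(vectors)):
            if cosine(vectors[i], vectors[j]) >= threshold:
                parent[find(j)] = find(i)

    groups = defaultdict(list)
    for i in range(len(vectors)):
        groups[find(i)].append(i)
    return sorted(groups.values())


# Made-up search results for one shared name: terms found near the name.
results = [
    [("Seattle", "location", 2), ("Microsoft", "organization", 1), ("engineer", "career", 3)],
    [("Microsoft", "organization", 2), ("Seattle", "location", 5)],
    [("London", "location", 1), ("painter", "career", 2)],
]
vectors = [person_vector(r) for r in results]
clusters = cluster(vectors)
print(clusters)  # [[0, 1], [2]]
```

In this made-up data the first two results share nearby location and organization terms and end up in one cluster, while the third shares no terms with them and stays separate. A production system would likely need a more careful merging rule than single link, which can chain unrelated persons together through intermediate matches.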