Title: Location estimation of social network users
Abstract: Various embodiments relate to estimating the location of social network users. In one embodiment, a plurality of social media messages generated by a given user is received. A plurality of location features is extracted from the social media messages. Each of the location features is processed with at least one classifier from an ensemble of classifiers. A location classification is generated by each of the classifiers for each of the social media messages. Each classification comprises a location and a weight associated with that location. One of the locations is selected from the location classifications as the location of the given user based on a combination of the weights of the location classifications.
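The selection step described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the abstract says only that the location is chosen "based on a combination of the weights", so summing the weights per location is an assumption, as are the names `Classification` and `select_location` and the alphabetical tie-break.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

# Hypothetical type: one classifier's output for one message, as (location, weight).
Classification = Tuple[str, float]


def select_location(classifications: Iterable[Classification]) -> str:
    """Return the location whose classifications carry the largest summed weight.

    Summation is an assumed way of combining weights; the record does not
    specify the combination rule.
    """
    totals: Dict[str, float] = defaultdict(float)
    for location, weight in classifications:
        totals[location] += weight
    if not totals:
        raise ValueError("no classifications supplied")
    # Highest total wins; ties are broken alphabetically (an assumption)
    # so that the result is deterministic.
    return min(totals, key=lambda loc: (-totals[loc], loc))
```

For example, given the classifications `("Austin", 0.6)`, `("Boston", 0.5)` and `("Boston", 0.3)`, the summed weight for Boston exceeds that for Austin, so Boston is selected even though no single Boston classification outweighs the Austin one.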
Publication No.: US9002960(B2)  Publication date: 2015.04.07
Application No.: US201213593604  Filing date: 2012.08.24
Applicant: International Business Machines Corporation  Inventors: Drews Clemens; Mahmud Jalal U.; Nichols Jeffrey W.
Classification: G06F15/16; G06F17/30  Main classification: G06F15/16
Agency: Fleit Gibbons Gutman Bongini & Bianco PL  Agent: Fleit Gibbons Gutman Bongini & Bianco PL; Grzesik Thomas
Independent claim: 1. A method comprising:
  receiving a plurality of social media messages generated by a given user;
  extracting a plurality of location features from the social media messages;
  computing, for each of the plurality of location features, a frequency of the location feature for at least one location;
  determining, for each of the plurality of location features, a number of people in the at least one location who have used the location feature in their social networking messages;
  determining, for each of the plurality of location features and based on the computed frequency and the determined number of people, if the location feature was included within social networking messages of a threshold percentage of people in the at least one location; and
  based on the location feature having been included within social networking messages of the threshold percentage of people, adding the feature to the subset of features;
  identifying at least one subset of location features from the plurality of location features that are discriminative of at least one location at a location granularity level of interest;
  processing each of the subset of location features with at least one classifier from an ensemble of classifiers;
  generating, by each of the classifiers, a location classification for each of the social media messages, each location classification comprising a location and a weight associated with that location; and
  selecting one of the locations from the location classifications as the location of the given user based on a combination of the weights of the location classifications.
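The feature-selection steps of the claim (per-location frequency, count of people using the feature, threshold percentage test) might be sketched as below. This is an illustrative reading only: the input record shape `(user_id, location, feature)`, the function name `select_features`, and the default threshold are hypothetical, and the population of a location is approximated by the users that appear in the input. The record does not say how the frequency enters the threshold decision, so here the frequency is simply computed and returned alongside each selected feature.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple

# Hypothetical input record: (user_id, location, feature), one per feature occurrence.
Usage = Tuple[str, str, str]


def select_features(
    usages: Iterable[Usage], threshold: float = 0.05
) -> Dict[Tuple[str, str], int]:
    """Keep (location, feature) pairs used by at least `threshold` of that
    location's users; map each kept pair to its occurrence frequency."""
    frequency: Dict[Tuple[str, str], int] = defaultdict(int)
    users_with: Dict[Tuple[str, str], Set[str]] = defaultdict(set)
    users_in: Dict[str, Set[str]] = defaultdict(set)

    for user, location, feature in usages:
        frequency[(location, feature)] += 1          # frequency of the feature per location
        users_with[(location, feature)].add(user)    # people who used the feature there
        users_in[location].add(user)                 # assumed population of the location

    selected: Dict[Tuple[str, str], int] = {}
    for key, users in users_with.items():
        location = key[0]
        share = len(users) / len(users_in[location])
        if share >= threshold:                       # threshold percentage of people
            selected[key] = frequency[key]
    return selected
```

With a threshold of 0.5, a feature used by two of three users in a location is kept, while one used by a single user out of three is dropped. The kept features would then be passed to the classifier ensemble.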
Address: Armonk NY US