Title of Invention |
LANGUAGE IDENTIFICATION ON SOCIAL MEDIA |
Abstract |
A method for language prediction of a social network post includes generating a social network graph comprising nodes connected by edges. Some of the nodes are user nodes representing users of a social network, and some are social network post nodes representing social network posts. At least some of the users are authors of social network posts represented by respective social network post nodes. Edges of the graph are associated with respective weights. At least one of the social network post nodes is unlabeled. Language labels are predicted for the at least one unlabeled social network post node, which includes propagating language labels through the graph. A language of the social network post is predicted based on the language labels predicted for the node representing that post and, optionally, on content-based features. |
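The propagation step described in the abstract can be illustrated with a minimal sketch. It is not the claimed method: the update rule (a weighted average of neighbor label distributions with seed nodes clamped), the function names `propagate_labels` and `predict_language`, the node identifiers, and the edge weights are all assumptions chosen for illustration, and the optional content-based features are omitted.

```python
from collections import defaultdict


def propagate_labels(edges, seeds, iterations=20):
    """Spread language label distributions over a weighted, undirected graph.

    edges: list of (node_a, node_b, weight) tuples (hypothetical format).
    seeds: {node: {language: probability}} for nodes with known labels.
    Returns {node: {language: probability}} for every node in the graph.
    """
    neighbors = defaultdict(list)
    for a, b, weight in edges:
        neighbors[a].append((b, weight))
        neighbors[b].append((a, weight))

    labels = {node: dict(dist) for node, dist in seeds.items()}
    for _ in range(iterations):
        updated = {}
        for node in neighbors:
            if node in seeds:
                # Labeled nodes are clamped to their known distribution.
                updated[node] = dict(seeds[node])
                continue
            # Unlabeled nodes take the weighted sum of neighbor distributions.
            scores = defaultdict(float)
            for other, weight in neighbors[node]:
                for lang, prob in labels.get(other, {}).items():
                    scores[lang] += weight * prob
            total = sum(scores.values())
            updated[node] = (
                {lang: s / total for lang, s in scores.items()} if total else {}
            )
        labels = updated
    return labels


def predict_language(labels, node):
    """Return the highest-scoring language for a node, or None if unknown."""
    dist = labels.get(node, {})
    return max(dist, key=dist.get) if dist else None


# Toy graph: users u1 and u2 each authored one labeled and one unlabeled post.
edges = [
    ("u1", "p1", 1.0),  # u1 authored p1 (labeled French)
    ("u1", "p3", 1.0),  # u1 authored p3 (unlabeled)
    ("u2", "p2", 1.0),  # u2 authored p2 (labeled English)
    ("u2", "p4", 1.0),  # u2 authored p4 (unlabeled)
    ("u1", "u2", 0.2),  # weak user-to-user link (assumed weight)
]
seeds = {"p1": {"fr": 1.0}, "p2": {"en": 1.0}}

labels = propagate_labels(edges, seeds)
print(predict_language(labels, "p3"))
print(predict_language(labels, "p4"))
```

In this toy graph each unlabeled post inherits mostly its author's language, because the authorship edges carry more weight than the user-to-user edge. A content-based classifier score could be combined with the propagated distribution before taking the maximum, which is the optional step the abstract mentions.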
Publication Number |
EP3073433(A1) |
Publication Date |
2016.09.28 |
Application Number |
EP20160159284 |
Filing Date |
2016.03.08 |
Applicant |
XEROX CORPORATION |
Inventors |
GALLÉ, MATTHIAS;RADFORD, WILLIAM |
Classification Number |
G06Q50/00 |
Main Classification Number |
G06Q50/00 |
Patent Agency |
|
Patent Agent |
|
Principal Claim |
|
Address |
|