Title of invention: Method, system, and program for generating training data for disambiguation
Abstract: A method for generating training data for disambiguating an entity, where an entity is a word or word string related to a topic to be analyzed. The method includes: acquiring messages sent by users, each message including at least one entity from a set of entities; grouping the messages by user to obtain, for each user, the set of messages that user sent; identifying each set of messages in which the number of different entities is greater than or equal to a first threshold value, and identifying the user corresponding to each such set as a hot user; receiving an instruction indicating an object entity to be disambiguated; determining the likelihood of co-occurrence of each keyword and the object entity in the sets of messages sent by the hot users; and determining training data for the object entity on the basis of that likelihood of co-occurrence.
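The steps in the abstract can be sketched in Python as follows. This is a minimal illustration, not the patented implementation: the function names, the naive substring and whitespace-token matching, the scoring rule (the fraction of hot-user messages containing a keyword that also contain the object entity), and the `min_score` cutoff are all assumptions made for the example, since the abstract does not define how the likelihood is computed or how training data is selected from it.

```python
from collections import Counter, defaultdict


def find_hot_users(messages, entities, min_distinct):
    """Return users whose messages mention at least `min_distinct` different entities."""
    mentioned = defaultdict(set)
    for user, text in messages:
        lowered = text.lower()
        for entity in entities:
            if entity.lower() in lowered:  # naive substring match (assumption)
                mentioned[user].add(entity)
    return {user for user, found in mentioned.items() if len(found) >= min_distinct}


def cooccurrence_scores(messages, hot_users, target):
    """Score each keyword seen in hot users' messages by how often it appears with `target`.

    Assumed score: (hot-user messages with keyword and target) / (hot-user messages with keyword).
    """
    target_l = target.lower()
    target_tokens = set(target_l.split())
    with_target = Counter()
    total = Counter()
    for user, text in messages:
        if user not in hot_users:
            continue
        lowered = text.lower()
        has_target = target_l in lowered
        # Keywords are whitespace tokens, excluding the target's own words.
        for keyword in set(lowered.split()) - target_tokens:
            total[keyword] += 1
            if has_target:
                with_target[keyword] += 1
    return {kw: with_target[kw] / total[kw] for kw in total}


def select_training_messages(messages, target, scores, min_score):
    """Pick messages that contain `target` and at least one high-scoring keyword (assumed rule)."""
    target_l = target.lower()
    strong = {kw for kw, score in scores.items() if score >= min_score}
    return [
        (user, text)
        for user, text in messages
        if target_l in text.lower() and strong & set(text.lower().split())
    ]


# Hypothetical sample data: "apple" the company vs. "apple" the fruit.
messages = [
    ("alice", "apple unveils new iphone today"),
    ("alice", "google and apple both shipped updates"),
    ("alice", "microsoft earnings beat"),
    ("bob", "apple pie recipe with cinnamon"),
    ("carol", "apple iphone camera review"),
    ("carol", "google pixel camera review"),
]
entities = {"apple", "google", "microsoft"}

hot = find_hot_users(messages, entities, min_distinct=2)
scores = cooccurrence_scores(messages, hot, "apple")
positives = select_training_messages(messages, "apple", scores, min_score=0.9)
print(sorted(hot))
print(positives)
```

In this toy data, only users who mention several different entities contribute keyword statistics, so a user who writes about a single entity in an unrelated sense (the fruit) does not influence the scores.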
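Publication number: JP5957048(B2) Publication date: 2016.07.27
Application number: JP20140166695 Application date: 2014.08.19
Applicant: インターナショナル・ビジネス・マシーンズ・コーポレーション (INTERNATIONAL BUSINESS MACHINES CORPORATION) Inventors: 伊川 洋平; 鈴木 明子
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Independent claim:
Address: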