Title: Spammer group extraction apparatus and method
Abstract: The present invention relates to a spammer group extraction apparatus and method, which extract spammer groups that interfere with fair trade and unbiased decision making by sending messages intended to slander other companies (or other persons, other products, etc.) on social network services. The spammer group extraction apparatus includes a data collection unit for collecting pieces of data corresponding to social network services. A natural language processing unit preprocesses the pieces of data using a natural language processing algorithm based on big data. An abnormal behavior detection unit detects abnormal behavior based on user identifications (IDs) respectively corresponding to the preprocessed pieces of data. A spammer extraction unit extracts a spammer group using a user ID causing the abnormal behavior and an ID of a user group including the user ID.
Publication number: US9563770(B2) Publication date: 2017.02.07
Application number: US201414324374 Filing date: 2014.07.07
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE Inventors: Kim Min Sik; Kim Ki Heon; Cho Min Kyung; Park In Sung; Moon Jong Cheoll; Park Sang Woo
Classification: H04L29/06; G06F21/55; H04L12/58 Main classification: H04L29/06
Agency: LRK Patent Law Firm Agent: LRK Patent Law Firm
Main claim: 1. A spammer group extraction method, comprising: collecting pieces of data corresponding to social network services; preprocessing the pieces of data; detecting abnormal behavior based on user identifications (IDs) respectively corresponding to pieces of data, preprocessing of which has been completed; analyzing characteristics of individual IDs and connection characteristics between IDs using user IDs and IDs of a user group including the user IDs; and extracting and displaying, based on the analysis, a spammer group using the user IDs causing the abnormal behavior and the IDs of a user group including the user IDs, wherein preprocessing the pieces of data comprises performing one or more procedures of a set of procedures consisting of: a procedure for extracting keywords from the pieces of data, a procedure for sorting the pieces of data using extracted keywords, a procedure for extracting keywords associated with respective pieces of data of the pieces of data, and a procedure for identifying characteristics of messages corresponding to the pieces of data.
Address: Daejeon KR
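
The steps recited in claim 1 (preprocess, detect abnormal behavior per user ID, extract a group) can be illustrated with a minimal Python sketch. It is not the patented implementation. The record does not state how abnormal behavior is scored or how groups are formed, so the repetition threshold (`min_repeats`), the stopword list, the rule that a group needs at least two flagged IDs, and all function names below are assumptions made for illustration only.

```python
from collections import Counter

# Hypothetical stopword list; the record does not specify one.
STOPWORDS = {"the", "a", "is", "and", "of", "to"}


def extract_keywords(text):
    """Preprocessing: lowercase, strip basic punctuation, drop stopwords."""
    words = [w.strip(".,!?").lower() for w in text.split()]
    return [w for w in words if w and w not in STOPWORDS]


def detect_abnormal_ids(messages, min_repeats=3):
    """Flag user IDs that post the same keyword set at least min_repeats times.

    The repetition rule is an illustrative assumption, not the
    criterion used in the patent.
    """
    counts = Counter()
    for user_id, text in messages:
        counts[(user_id, frozenset(extract_keywords(text)))] += 1
    return {user_id for (user_id, _), n in counts.items() if n >= min_repeats}


def extract_spammer_groups(abnormal_ids, groups):
    """Keep groups (group ID -> member IDs) containing two or more flagged IDs."""
    result = {}
    for group_id, members in groups.items():
        flagged = abnormal_ids & set(members)
        if len(flagged) >= 2:  # assumed minimum size for a "group"
            result[group_id] = sorted(flagged)
    return result


if __name__ == "__main__":
    # Made-up sample data for demonstration.
    messages = (
        [("u1", "Brand X is terrible!")] * 3
        + [("u2", "brand x terrible")] * 3
        + [("u3", "I like Brand X")]
    )
    groups = {"g1": ["u1", "u2", "u3"], "g2": ["u3", "u4"]}
    abnormal = detect_abnormal_ids(messages)
    print(extract_spammer_groups(abnormal, groups))
```

In this sketch, group membership stands in for the "connection characteristics between IDs" named in the claim. A fuller treatment would likely analyze follower or reply relationships, which the record does not detail.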