Independent claim |
1. A method for preparing a dataset of uncommon features, comprising:
retrieving a dataset comprising a plurality of social media messages stored in a memory, wherein the plurality of social media messages are authored by a plurality of users of one or more social media services;
extracting, using a processor, a plurality of features from the plurality of social media messages, wherein each of the extracted features is associated with a user that authored a social media message comprising the extracted feature; and
determining that the extracted features are uncommon features when a count for each of the extracted features exceeds a first threshold and is less than a second threshold. |
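
The three steps recited in claim 1 (retrieving, extracting, determining) can be illustrated with a minimal Python sketch. The claim does not specify what a feature is or what the count measures, so the sketch makes two assumptions: a feature is a lowercased whitespace-separated token, and the count is the number of distinct users who authored a message containing that feature. The names `extract_features` and `find_uncommon_features`, the sample messages, and the threshold values are hypothetical and are not taken from the claim.

```python
from collections import defaultdict


def extract_features(text):
    # Assumption: a feature is a lowercased whitespace-separated token.
    # The claim leaves the feature definition open.
    return set(text.lower().split())


def find_uncommon_features(messages, first_threshold, second_threshold):
    """Return {feature: count} for features whose count lies strictly
    between the two thresholds.

    messages: iterable of (user_id, text) pairs standing in for the
    retrieved dataset of social media messages.
    """
    # Associate each extracted feature with the users who authored
    # a message comprising it.
    users_by_feature = defaultdict(set)
    for user_id, text in messages:
        for feature in extract_features(text):
            users_by_feature[feature].add(user_id)

    # Assumption: the count is the number of distinct authoring users.
    # A feature is uncommon when that count exceeds the first threshold
    # and is less than the second threshold.
    return {
        feature: len(users)
        for feature, users in users_by_feature.items()
        if first_threshold < len(users) < second_threshold
    }


# Hypothetical sample data for illustration only.
messages = [
    ("alice", "coffee rain"),
    ("bob", "coffee kayak"),
    ("carol", "coffee kayak"),
    ("dave", "coffee"),
]
uncommon = find_uncommon_features(messages, first_threshold=1, second_threshold=4)
print(uncommon)
```

With these sample values, "coffee" (used by all four users) is excluded as too common and "rain" (used by one user) is excluded as too rare, leaving only "kayak". The lower bound filters out one-off noise, while the upper bound filters out features shared by nearly everyone.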