Title of invention METHOD AND SYSTEM FOR MONITORING SOCIAL MEDIA AND ANALYZING TEXT TO AUTOMATE CLASSIFICATION OF USER POSTS USING A FACET BASED RELEVANCE ASSESSMENT MODEL
Abstract A social media monitoring and text analysis system and method for automated classification of user posts on the web, using a facet-based relevance assessment model, comprises a semantic indexing server, which builds a faceted classification index of text objects, and a query server, which receives and analyzes the user's query. The processed query is then sent from the query server to the semantic indexing server through an interface in order to perform a search in the faceted classification index. The search system and method further comprise a result handler, which provides the user with a search result set comprising a list of unexpected links and a list of result elements. The unexpected links correspond to filters that allow the user to narrow down or refine the original query. The quality of the unexpected links depends on identifying the most likely topical area of focus related to the query concepts and the corresponding concepts in user posts. This is achieved by measuring the statistical co-occurrence of concepts in user posts and assigning weighted scores based on information gain and semantic density, thus establishing a relevant conceptual tag cloud that is used to validate topical focus against a set of industry-specific taxonomies or ontologies.
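The co-occurrence scoring mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the record gives no formulas, so the sketch assumes the textbook definition of information gain (the reduction in entropy of a query concept's presence once a candidate concept's presence is known) and leaves out semantic density entirely. The names `entropy`, `information_gain`, `tag_cloud` and the sample posts are hypothetical.

```python
import math
from collections import Counter


def entropy(p):
    """Binary entropy in bits; 0 when the outcome is certain."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def information_gain(posts, query_concept, candidate):
    """Entropy reduction for query_concept's presence given candidate's presence.

    Assumption: standard information gain, H(Q) - H(Q | C), over posts
    represented as sets of concepts. The source does not define its formula.
    """
    n = len(posts)
    p_query = sum(query_concept in p for p in posts) / n
    conditional = 0.0
    for subset in ([p for p in posts if candidate in p],
                   [p for p in posts if candidate not in p]):
        if subset:
            p_sub = sum(query_concept in p for p in subset) / len(subset)
            conditional += len(subset) / n * entropy(p_sub)
    return entropy(p_query) - conditional


def tag_cloud(posts, query_concept):
    """Rank concepts that co-occur with query_concept by information gain."""
    co_counts = Counter(
        c for p in posts if query_concept in p for c in p if c != query_concept
    )
    scored = [(c, information_gain(posts, query_concept, c), n)
              for c, n in co_counts.items()]
    return sorted(scored, key=lambda t: (-t[1], t[0]))


# Hypothetical posts, each already reduced to a set of concepts.
posts = [
    {"phone", "battery"}, {"phone", "battery"}, {"phone", "price"},
    {"laptop", "price"}, {"laptop", "screen"}, {"laptop", "price"},
]
cloud = tag_cloud(posts, "phone")
for concept, gain, count in cloud:
    print(f"{concept}: gain={gain:.3f}, co-occurrences={count}")
```

In this toy data "battery" appears only alongside "phone", so it ranks above "price", which is spread across both topics. A fuller system would combine such a score with further weights before checking the resulting tag cloud against a taxonomy.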
Publication number US2015254230(A1) Publication date 2015.09.10
Application number US201314432397 Filing date 2013.09.30
Applicant PAPADOPOULLOS Alkis;PLANTE Patrick Inventor Papadopoullos Alkis;Plante Patrick
Classification G06F17/27;G06F17/30 Main classification G06F17/27
Agency Agent
Main claim 1. A method for automated classification of documents and for automating classification of user posts provided on a network, the method comprising: a) detecting the one or more languages of the documents to be classified; b) discovering one or more sentences within the one or more documents; c) classifying text objects contained in the documents using a faceted classification and by discovering the polarity and objectivity of the documents; and d) categorizing the documents by extracting the categories from the documents.
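Steps a) through d) of the claim can be sketched as a small Python pipeline. This is an illustrative toy under stated assumptions, not the claimed method: language detection is approximated by stopword overlap, sentence discovery by a punctuation regex, faceted classification by keyword lookup, and polarity and objectivity by a tiny word list. Every name and word list below (`STOPWORDS`, `FACETS`, `POSITIVE`, `NEGATIVE`, `classify_post`, and so on) is hypothetical.

```python
import re

# Hypothetical toy resources; a real system would use trained models
# and industry-specific taxonomies instead of these word lists.
STOPWORDS = {
    "en": {"the", "is", "and", "but", "of", "a"},
    "fr": {"le", "la", "est", "et", "mais", "de"},
}
FACETS = {
    "product": {"phone", "battery", "screen"},
    "service": {"support", "delivery"},
}
POSITIVE = {"great", "love", "excellent"}
NEGATIVE = {"terrible", "hate", "slow"}


def tokens(text):
    """Lowercase word tokens, including common accented Latin letters."""
    return re.findall(r"[a-zà-ÿ']+", text.lower())


def detect_language(text):
    """Step a): pick the language whose stopwords overlap the text most."""
    words = set(tokens(text))
    return max(sorted(STOPWORDS), key=lambda lang: len(words & STOPWORDS[lang]))


def split_sentences(text):
    """Step b): split after sentence-ending punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def classify_sentence(sentence):
    """Step c): assign facets, polarity, and a crude objectivity flag."""
    words = tokens(sentence)
    facets = sorted(f for f, vocab in FACETS.items() if vocab & set(words))
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    polarity = "positive" if score > 0 else "negative" if score < 0 else "neutral"
    # Assumption: a sentence with no net opinion score is treated as objective.
    return {"sentence": sentence, "facets": facets,
            "polarity": polarity, "objective": score == 0}


def classify_post(text):
    """Steps a) to d): step d) collects the categories found in the sentences."""
    sentences = [classify_sentence(s) for s in split_sentences(text)]
    categories = sorted({f for s in sentences for f in s["facets"]})
    return {"language": detect_language(text),
            "sentences": sentences, "categories": categories}


result = classify_post("I love the battery of this phone. But the support is terrible!")
print(result["language"], result["categories"])
for s in result["sentences"]:
    print(s["polarity"], s["facets"], "-", s["sentence"])
```

Classifying per sentence before aggregating per post lets a single post carry opposite polarities for different facets, as in the sample text.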
Address Outremont, Québec CA