发明名称 COMMENT-COMMENT AND COMMENT-DOCUMENT ANALYSIS OF DOCUMENTS
摘要 A system and method for analyzing documents, such as posts, on-line reviews and comments from people based on topics of the documents, to determine general sentiment of users is disclosed. Topics from the documents and their corresponding sentiment polarities are extracted. The documents are regarded to be constituted by a series of topics. The sentiment for a topic is represented by a quadruple (k, so, h, i), where k is the topic, so is the sentiment opinion, h is the comment or post holder, and i is the document. A quintuple (k, sup, p, n, ne) is used to illustrate the topics and corresponding sentiments and is stored in S, where sup indicates the frequency of the topic, and p (positive), n (negative) and ne (neutral) are different types of opinions of the users. From the quintuple set S, every topic is related to three kinds of sentiment opinions (positive, negative, and neutral), enabling determination of popular topics in documents as well as the users' sentiment polarities.
申请公布号 US2017109633(A1) 申请公布日期 2017.04.20
申请号 US201514884732 申请日期 2015.10.15
申请人 SAP SE 发明人 BAI Meilin;SHI Xingtian;LI Wen-Syan
分类号 G06N5/04;G06F17/30 主分类号 G06N5/04
代理机构 代理人
主权项 1. A computer-implemented method performed by a computer system to determine sentiments of users for topics comprising: providing a plurality of on-line documents to form a corpus, wherein a document comprise one or more comments, wherein comments of the document comprise one or more holders; pre-processing the documents of the corpus to form a structure data representation of the documents; and processing the pre-processed documents, wherein the processing comprises extracting topics from the documents of the corpus,mapping the documents to the topics to form comment-topic pairs,analyzing the comment-topic pairs to determine the sentiments of the topics, andidentifying relationships of a comment-comment structure within the topics of each document in the corpus.
地址 Walldorf DE