发明名称 System, method and computer program product for identifying words within collection of text applicable to specific sentiment
摘要 A content intelligence module may implement a sentiment analysis method to identify words or phrases from user-generated content that are associated with a particular sentiment. The method may comprise grouping or splitting text into different sentiment segments, tokenizing words or phrases and/or removing stopwords across the sentiment segments, performing a frequency analysis to count the words or phrases in each sentiment segment, scaling the frequency results across the sentiment segments where necessary, and removing commonly used words from the sentiment segments. The words or phrases that are left in a specific sentiment segment are the most-used words for that sentiment segment. The word cloud module therefore allows for very quick generation of a summary around sentiment segments. A sentiment overview containing the summary can be presented to a user in connection with a selected product or service with which the user-generated content is associated.
申请公布号 US8818788(B1) 申请公布日期 2014.08.26
申请号 US201213363978 申请日期 2012.02.01
申请人 Bazaarvoice, Inc. 发明人 Mihalik Dustin;Friesenhahn Dustin;Wadhwani Luveen Rupchand
分类号 G06F17/27;G06Q30/00 主分类号 G06F17/27
代理机构 代理人
主权项 1. A method for analyzing sentiment, comprising: at a first computer: dividing a collection of text into a plurality of sentiment segments; tokenizing words or phrases in the plurality of sentiment segments; performing a frequency analysis on tokenized words or phrases in each sentiment segment of the plurality of sentiment segments; performing a scaling operation to size individual sentiment segments based on results from the frequency analysis; for each tokenized word or phrase in each sentiment segment of the plurality of sentiment segments, subtracting a first number of the tokenized word or phrase in the sentiment segment from a second number of the tokenized word or phrase in at least one other sentiment segment of the plurality of sentiment segments, thereby producing, for each sentiment segment of the plurality of sentiment segments, a list of words or phrases that apply specifically to the sentiment segment; and providing the list of words or phrases that apply specifically to the sentiment segment to a second computer over a network connection.
地址 Austin TX US