Title: Intent discovery in audio or text-based conversation
Abstract: Methods, systems, and computer program products for identifying one or more utterances that are likely to carry the intent of a speaker are provided herein. A method includes providing a transcript of utterances to a word weight scoring module to perform inverse document frequency based scoring on each word in the transcript, thereby generating a weight for each word; calculating a weight for each utterance in the transcript to generate weighted utterances by summing the weights of each constituent word in each utterance; comparing at least one weighted utterance to pre-existing example utterances carrying the intent of a speaker to determine a relevancy score for the at least one weighted utterance; and generating a ranked order of the at least one weighted utterance from highest to lowest intent relevancy score, wherein the highest intent relevancy score corresponds to the utterance which is most likely to carry intent of the speaker.
Publication number: US9620147(B2); Publication date: 2017.04.11
Application number: US201514612989; Filing date: 2015.02.03
Applicant: International Business Machines Corporation; Inventors: Deshmukh Om D.; Joshi Sachindra; Saurabh Saket; Verma Ashish
Classification: G10L21/10; G10L25/48; G10L15/18; G10L15/02; G10L15/26; Main classification: G10L21/10
Agency: Ryan, Mason & Lewis, LLP; Agent: Ryan, Mason & Lewis, LLP
Independent claim: 1. A method comprising: providing at least one transcript of utterances from a conversation between two or more parties to a word weight scoring module to perform inverse document frequency based scoring on each word in the at least one transcript, thereby generating a weight for each word, wherein the inverse document frequency based scoring measures the frequency of each word throughout the at least one transcript; calculating a weight for each utterance in the transcript to generate weighted utterances by assigning to each utterance the weight of the word with a maximum weight that occurs in each utterance; comparing at least one weighted utterance to pre-existing example utterances carrying an intent of a speaker to determine a relevancy score for the at least one weighted utterance based on similarity to the example utterances; and generating a ranked order of the at least one weighted utterance from highest to lowest intent relevancy score, wherein the highest intent relevancy score corresponds to the utterance which is most likely to carry intent of the speaker, and wherein said generating is carried out by a relevant propagation module executing on a hardware processor of a computing device.
Address: Armonk NY US
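
Illustrative sketch (not part of the patent record): the steps recited in claim 1 can be approximated in a few lines of Python. This is a minimal sketch under stated assumptions, not the patented implementation. All function names are hypothetical. Each utterance is treated as one "document" for the inverse document frequency computation, similarity to the example utterances is measured with Jaccard word overlap, and the relevancy score is the product of the utterance weight and its best similarity. The record specifies none of these three choices.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def idf_weights(utterances):
    """Assumption: each utterance counts as one document, so words that
    appear in fewer utterances receive a higher weight."""
    n = len(utterances)
    doc_freq = Counter()
    for utterance in utterances:
        doc_freq.update(set(tokenize(utterance)))
    return {word: math.log(n / count) for word, count in doc_freq.items()}


def utterance_weight(utterance, weights):
    """Weight of the highest-weighted word in the utterance (claim 1)."""
    return max((weights.get(w, 0.0) for w in tokenize(utterance)), default=0.0)


def jaccard(a, b):
    """Assumed similarity measure: word-set overlap divided by word-set union."""
    set_a, set_b = set(tokenize(a)), set(tokenize(b))
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0


def rank_by_intent(utterances, examples):
    """Return (score, utterance) pairs sorted from highest to lowest score.
    Assumption: score = utterance weight * best similarity to any example."""
    weights = idf_weights(utterances)
    scored = []
    for utterance in utterances:
        best_sim = max((jaccard(utterance, e) for e in examples), default=0.0)
        scored.append((utterance_weight(utterance, weights) * best_sim, utterance))
    return sorted(scored, key=lambda pair: pair[0], reverse=True)
```

Calling `rank_by_intent(transcript_utterances, example_utterances)` returns the utterances ordered by descending score, so the first entry is the one this sketch considers most likely to carry the speaker's intent. Replacing `max` with `sum` in `utterance_weight` gives the summed weighting described in the abstract instead.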