发明名称 Detecting Actionable Items in a Conversation among Participants
摘要 A computer-implemented technique is described herein for detecting actionable items in speech. In one manner of operation, the technique entails: receiving utterance information that expresses at least one utterance made by one participant of a conversation to at least one other participant of the conversation; converting the utterance information into recognized speech information; using a machine-trained model to recognize at least one actionable item associated with the recognized speech information; and performing at least one computer-implemented action associated the actionable item(s). The machine-trained model may correspond to a deep-structured convolutional neural network. In some implementations, the technique produces the machine-trained model using a source environment corpus that is not optimally suited for a target environment in which the model is intended to be applied. The technique further provides various adaptation techniques for adapting a source-environment model so that it better suits the target environment.
申请公布号 US2017092264(A1) 申请公布日期 2017.03.30
申请号 US201514864674 申请日期 2015.09.24
申请人 Microsoft Technology Licensing, LLC 发明人 Hakkani-Tur Dilek Zeynep;He Xiaodong;Chen Yun-Nung
分类号 G10L15/16;G10L15/20;G06N99/00;G10L15/22 主分类号 G10L15/16
代理机构 代理人
主权项 1. A method for identifying actionable items, implemented by at least one hardware processor provided by at least one computing device, comprising: receiving utterance information that expresses at least one utterance made by one participant of a conversation to at least one other participant of the conversation; converting the utterance information to recognized speech information, to provide one or more detected utterances; using a machine-trained model to recognize at least one actionable item associated with the recognized speech information; and performing at least one computer-implemented action associated with said at least one actionable item, said receiving, converting, using, and performing being executed by said at least one hardware processor without disrupting a flow of communication among participants to the conversation.
地址 Redmond WA US