发明名称 Transcription of Spoken Communications
摘要 A portion of speech is captured when spoken by a near-end user. A near-end user terminal conducts a communication session, over a network, between the near-end user and one or more far-end users, the session including a message sent to the one or more far-end users. A vetting mechanism is provided via a touchscreen user interface of the near-end user terminal, to allow the near-end user to vet an estimated transcription of the portion of speech prior to being sent to the one or more far-end users in the message. According to the vetting mechanism: (i) a first gesture performed by the near-end user through the touchscreen user interface accepts the estimated transcription to be included in a predetermined role in the sent message, whilst (ii) one or more second gestures performed by the near-end user through the touchscreen user interface each reject the estimated transcription to be sent in the message.
申请公布号 US2017085696(A1) 申请公布日期 2017.03.23
申请号 US201514858648 申请日期 2015.09.18
申请人 Microsoft Technology Licensing, LLC 发明人 Abkairov Nikolay
分类号 H04M1/725;G06F17/28 主分类号 H04M1/725
代理机构 代理人
主权项 1. A user terminal comprising: a microphone for capturing a portion of speech spoken by a near-end user of said user terminal; a network interface for connecting to a communication network; a communication client application operable to conduct a communication session, over said network, between the near-end user and one or more far-end users of one or more far-end terminals, including being operable to cause an estimated transcription of said portion of speech to be sent in a message to the one or more far-end users as part of said communication session; and a touchscreen user interface; wherein the client application is configured to implement a vetting mechanism to allow the near-end user to vet the estimated transcription via the touchscreen user interface prior to being sent in said message, and wherein according to said vetting mechanism: (i) a first gesture performed by the near-end user through the touchscreen user interface accepts the estimated transcription to be included in a predetermined role in the sent message, whilst (ii) one or more second gestures performed by the near-end user through the touchscreen user interface each reject the estimated transcription to be sent in said message, the communication client being further configured so as in response to the estimated transcription being rejected, to present one or more alternative transcriptions of said portion of speech, and to provide an option via the touchscreen user interface to select one of the one or more alternative transcriptions to be sent in said message.
地址 Redmond WA US