发明名称 Method and apparatus for modifying digital messages containing at least audio
摘要 A voice and/or video message for a user in the form of a voicemail or a video mail is edited in accordance with editing of text corresponding to audio of the message. Speech contained in the audio of the voice or video message is automatically converted to corresponding text and presented to the user via a graphical user interface. In response to various user inputs in relation to the presentation of the corresponding text, the corresponding text is modified and the digital representation of the voice or video message is modified in a manner corresponding to the modification of the corresponding text. New versions of the modified voice or video message including the modified digital representation of the voice or video message can be created and saved.
申请公布号 US9185225(B1) 申请公布日期 2015.11.10
申请号 US201113156095 申请日期 2011.06.08
申请人 Cellco Partnership 发明人 Vance Charles Terry
分类号 H04M11/00;H04M3/53;G10L19/00 主分类号 H04M11/00
代理机构 代理人
主权项 1. A computer-implemented method comprising steps of: obtaining a digital representation of a message having at least audio from a mailbox of a user in a server; automatically converting, by a processor, speech contained in the audio of the message to corresponding text; automatically converting, by the processor, the audio of the message to a corresponding spectral diagram, the spectral diagram being a frequency domain representation of the audio of the message; presenting a unified display on a computing device screen containing both the text and the spectral diagram to the user; automatically editing the text in response to one or more user inputs in relation to the presentation of the text; using the spectral diagram to remove sound in the audio that does not correspond to the text; automatically editing, by the processor, the digital representation of the message, including the audio of the message, in a manner corresponding to the editing of the text; annotating the text in response to one or more user inputs in relation to the presentation of the text; annotating the digital representation of the message, including the audio of the message, in a manner corresponding to the annotating of the text; and storing, in the server, the edited and annotated digital representation of the message.
地址 Basking Ridge NJ US