发明名称 Systems and methods for speech processing via a GUI for adjusting attack and release times
摘要 Systems and methods described herein modify audio content on an electronic device. Embodiments can be configured to detect a mode of the electronic device to determine whether the device is in a telephone mode; receive a speech signal from a speech source while the device is in the telephone mode; and process the speech signal to improve the perceived quality of the speech at a recipient when the electronic device is in a telephone mode; wherein processing the speech signal to improve the perceived quality of the speech comprises, decreasing the signal level of audio content outside of a determined frequency band relative to the signal level of the audio content within the determined frequency band; and wherein the determined frequency band is a frequency band associated a vocal range of the anticipated speech content. The electronic device further includes a graphical user interface which allows a user to adjust any or all audio parameters including very high frequency attack or release times.
申请公布号 US9449612(B2) 申请公布日期 2016.09.20
申请号 US201213603767 申请日期 2012.09.05
申请人 Yobe, Inc. 发明人 Fairey James Christopher
分类号 G10L21/00;G10L21/02;G10L21/0364;G10L21/0208;H03G5/00 主分类号 G10L21/00
代理机构 Sheppard Mullin Richter & Hampton LLP 代理人 Sheppard Mullin Richter & Hampton LLP
主权项 1. A method for modifying audio content on an electronic device with a graphical user interface (GUI), the method comprising: the electronic device detecting an audio mode in which the electronic device is operating to determine a type of audio content processing to apply to audio content based on the detected audio mode, wherein the type of audio content processing comprises speech processing when the electronic device is detected to be in a speech-related audio mode, and music processing when the electronic device is detected to be in a playback audio mode; if the detected audio mode is a speech-related audio mode, receiving a speech signal from a speech source; and processing the speech signal to improve perceived quality of the speech at a recipient; wherein processing the speech signal to improve the perceived qualify of the speech comprises, decreasing a signal level of audio content outside of a determined frequency band relative to a signal level of audio content within the determined frequency band; and wherein the determined frequency band is a frequency band associated with a vocal range of an anticipated speech content; wherein using the GUI, a user is allowed to adjust audio parameters including attack and release times allowable adjustment ranges, wherein the attack is associated with very high frequency sounds which cannot be phase shifted.
地址 St. James NY US