Title of invention |
Speech processing |
Abstract |
A technique for enhancing a speech signal captured in a noisy environment is provided. According to an example embodiment, the technique comprises obtaining a current time frame of a noise-suppressed voice signal, derived on the basis of a current time frame of a source audio signal comprising a source voice signal, detecting input voice characteristics for the current time frame of the noise-suppressed voice signal, obtaining reference voice characteristics for said current time frame, said reference voice characteristics being descriptive of the source voice signal in a noise-free or low-noise environment, and creating a current time frame of a modified voice signal by modifying said current time frame of the noise-suppressed voice signal in response to a difference between the detected input voice characteristics and the reference voice characteristics exceeding a predetermined threshold. |
Publication number |
US9530427(B2) |
Publication date |
2016.12.27 |
Application number |
US201414507290 |
Filing date |
2014.10.06 |
Applicant |
Nokia Technologies Oy |
Inventor |
Järvinen Kari Juhani |
Classification codes |
G10L15/00;G10L15/20;G10L21/0208;G10L21/0364 |
Main classification code |
G10L15/00 |
Agency |
Alston & Bird LLP |
Agent |
Alston & Bird LLP |
Principal claim |
1. An apparatus comprising at least one processor and at least one non-transitory computer-readable memory including computer program code for one or more programs, the at least one non-transitory computer-readable memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to:
obtain a current time frame of a noise-suppressed voice signal, derived on basis of a current time frame of a source audio signal comprising a source voice signal; detect input voice characteristics for the current time frame of noise-suppressed voice signal; obtain reference voice characteristics for said current time frame, said reference voice characteristics being descriptive of the source voice signal in noise-free or low-noise environment; and create a current time frame of a modified voice signal by modifying said current time frame of the noise-suppressed voice signal in response to a difference between the detected input voice characteristics and the reference voice characteristics exceeding a predetermined threshold. |
Address |
Espoo FI |
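
The frame-wise flow described in the abstract and the principal claim (detect characteristics of a noise-suppressed frame, compare them with reference characteristics, and modify the frame only when the difference exceeds a threshold) can be illustrated with a minimal sketch. This is not the patented implementation. It assumes, purely for illustration, that the "voice characteristic" is the frame's RMS level in dB, that the "difference" is the absolute level difference, and that the "modification" is a gain that moves the frame to the reference level. The names `frame_level_db`, `modify_frame`, `reference_db`, and `threshold_db` are hypothetical and do not appear in the record.

```python
import math
from typing import List, Sequence


def frame_level_db(frame: Sequence[float]) -> float:
    """Assumed stand-in for 'input voice characteristics': RMS level in dB."""
    if not frame:
        raise ValueError("frame must not be empty")
    rms = math.sqrt(sum(x * x for x in frame) / len(frame))
    # Floor the RMS so that a silent frame does not produce log10(0).
    return 20.0 * math.log10(max(rms, 1e-12))


def modify_frame(
    frame: Sequence[float],
    reference_db: float,
    threshold_db: float = 3.0,
) -> List[float]:
    """Return the frame unchanged unless its level differs from the
    reference level by more than threshold_db (an assumed threshold);
    in that case, scale the frame to the reference level."""
    input_db = frame_level_db(frame)
    diff_db = reference_db - input_db
    if abs(diff_db) <= threshold_db:
        # Difference does not exceed the threshold: no modification.
        return list(frame)
    # Difference exceeds the threshold: apply a corrective gain.
    gain = 10.0 ** (diff_db / 20.0)
    return [x * gain for x in frame]


if __name__ == "__main__":
    noise_suppressed = [0.1, -0.1, 0.1, -0.1]  # toy frame at about -20 dB
    modified = modify_frame(noise_suppressed, reference_db=-14.0)
    print(frame_level_db(noise_suppressed), frame_level_db(modified))
```

In a real system the characteristics would more plausibly be a set of spectral or prosodic features, and the reference values would come from speech recorded in a noise-free or low-noise environment; the single level feature here only keeps the control flow visible.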