发明名称 Speech processing
摘要 A technique for enhancing speech signal captured in a noisy environment is provided. According an example embodiment, the technique comprises obtaining a current time frame of a noise-suppressed voice signal, derived on basis of a current time frame of a source audio signal comprising a source voice signal, detecting input voice characteristics for the current time frame of noise-suppressed voice signal, obtaining reference voice characteristics for said current time frame, said reference voice characteristics being descriptive of the source voice signal in noise-free or low-noise environment, and creating a current time frame of a modified voice signal by modifying said current time frame of the noise-suppressed voice signal in response to a difference between the detected input voice characteristic and the reference voice characteristics exceeding a predetermined threshold.
申请公布号 US9530427(B2) 申请公布日期 2016.12.27
申请号 US201414507290 申请日期 2014.10.06
申请人 Nokia Technologies Oy 发明人 Järvinen Kari Juhani
分类号 G10L15/00;G10L15/20;G10L21/0208;G10L21/0364 主分类号 G10L15/00
代理机构 Alston & Bird LLP 代理人 Alston & Bird LLP
主权项 1. An apparatus comprising at least one processor and at least one non-transitory computer-readable memory including computer program code for one or more programs, the at least one non-transitory computer-readable memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to: obtain a current time frame of a noise-suppressed voice signal, derived on basis of a current time frame of a source audio signal comprising a source voice signal; detect input voice characteristics for the current time frame of noise-suppressed voice signal; obtain reference voice characteristics for said current time frame, said reference voice characteristics being descriptive of the source voice signal in noise-free or low-noise environment; and create a current time frame of a modified voice signal by modifying said current time frame of the noise-suppressed voice signal in response to a difference between the detected input voice characteristics and the reference voice characteristics exceeding a predetermined threshold.
地址 Espoo FI