Title of invention |
Noise reduction method, program product, and apparatus |
Abstract |
A probability model is generated as the product of two probability distributions of a mismatch vector g (or clean speech x): one with an observed value y as a factor, and one with a confidence index β for each band as a factor. MMSE estimation is executed on this probability model to estimate a clean speech estimated value x̂. As a result, each band influences the result of MMSE estimation with a degree of contribution in accordance with its level of confidence. Further, the higher the S/N ratio of the observation speech, the more the output value shifts toward the observed value. The output of a front-end is thereby optimized. |
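The shift toward the observed value can be made concrete for a single band d and a single mixture component. The following is a sketch under stated assumptions, not the patent's own derivation: μ_d and σ_d² are hypothetical names for the mean and variance of the factor that depends on y, ψ(β_d) is a hypothetical variance function that decreases as β_d grows (as in the claim's zero-mean second distribution), and the relation x = y − g between clean speech and the mismatch vector is assumed.

```latex
\mathcal{N}\!\left(g_d;\ \mu_d,\ \sigma_d^2\right)\,
\mathcal{N}\!\left(g_d;\ 0,\ \psi(\beta_d)\right)
\;\propto\;
\mathcal{N}\!\left(g_d;\
  \frac{\psi(\beta_d)}{\sigma_d^2+\psi(\beta_d)}\,\mu_d,\
  \frac{\sigma_d^2\,\psi(\beta_d)}{\sigma_d^2+\psi(\beta_d)}\right)
```

Under these assumptions, a large β_d gives a small ψ(β_d), so the estimated mismatch in that band shrinks toward zero and x̂_d approaches y_d. A small β_d leaves the mean close to μ_d, so the band is corrected by roughly the full model-based mismatch.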
Publication number |
US9087513(B2) |
Publication date |
2015.07.21 |
Application number |
US201313792310 |
Filing date |
2013.03.11 |
Applicant |
International Business Machines Corporation |
Inventors |
Ichikawa Osamu;Rennie Steven |
Classification |
G10L15/20 |
Main classification |
G10L15/20 |
Agency |
|
Agent |
Davis Jennifer R.;Dougherty Anne Vachon |
Principal claim |
1. A noise reduction method comprising:
a step of generating a confidence index for each band on the basis of a spectrum of observation speech;
a step of generating a probability model represented as a mixture multi-dimensional normal distribution having a dimension for each band, each normal distribution being represented as a product of a first normal distribution and a second normal distribution; and
a step of estimating a mismatch vector estimated value by executing MMSE estimation on the probability model, and deriving a clean speech estimated value on the basis of the mismatch vector estimated value,
wherein the first normal distribution is a probability distribution of a mismatch vector generated based on the observation speech, and
wherein the second normal distribution has a zero mean, and a variance defined as a function that outputs a smaller value as the confidence index becomes greater. |
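The claimed steps can be sketched numerically as follows. This is a minimal illustration under stated assumptions, not the patented implementation. It assumes diagonal covariances, a hypothetical variance function `confidence_to_variance` (any function that decreases with the confidence index would fit the claim), mixture parameters `weights`, `means`, and `variances` of the first distribution that are already conditioned on the observation, and the relation x = y − g between clean speech and the mismatch vector. All function and parameter names are illustrative.

```python
import numpy as np


def confidence_to_variance(beta, scale=1.0, eps=1e-6):
    """Hypothetical mapping: the variance shrinks as the confidence index grows."""
    return scale / (np.asarray(beta, dtype=float) + eps)


def mmse_mismatch(weights, means, variances, beta):
    """MMSE estimate of the mismatch vector g.

    weights:   (K,)   mixture weights of the first distribution (assumed given)
    means:     (K, D) per-component, per-band means of g
    variances: (K, D) per-component, per-band variances of g (diagonal)
    beta:      (D,)   per-band confidence index
    """
    weights = np.asarray(weights, dtype=float)
    means = np.asarray(means, dtype=float)
    variances = np.asarray(variances, dtype=float)
    psi = confidence_to_variance(beta)            # (D,) variance of the zero-mean factor

    # Product of N(g; mu, var) and N(g; 0, psi) is a scaled Gaussian whose
    # mean is mu * psi / (var + psi).
    total = variances + psi                       # (K, D)
    post_mean = means * psi / total

    # The scale factor of each product is N(mu; 0, var + psi); it re-weights
    # the mixture components. Computed in the log domain for stability.
    log_z = -0.5 * (np.log(2.0 * np.pi * total) + means ** 2 / total)
    log_w = np.log(weights) + log_z.sum(axis=1)   # (K,)
    log_w -= log_w.max()
    w = np.exp(log_w)
    w /= w.sum()

    # The MMSE estimate is the mean of the resulting mixture.
    return w @ post_mean                          # (D,)


def estimate_clean_speech(y, weights, means, variances, beta):
    """Assumed relation x = y - g, applied to the MMSE mismatch estimate."""
    y = np.asarray(y, dtype=float)
    return y - mmse_mismatch(weights, means, variances, beta)
```

With a single component of mean 2 in every band, a band with a very large confidence index leaves the observation nearly unchanged, while a band with a confidence index near zero is corrected by almost the full mean. This matches the behavior described in the abstract, where high-confidence bands pull the output toward the observed value.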
Address |
Armonk NY US |