发明名称 |
Speech processing apparatus and speech processing method |
摘要 |
A signal portion is extracted per frame having a specific duration from an input signal, thus generating a per-frame input signal. The per-frame input signal in the time domain is converted into a per-frame input signal in the frequency domain, thereby generating a spectral pattern of spectra. Peak spectra having peaks are detected in the spectral pattern. A harmonic spectrum is determined, in the peak spectra, having a harmonic structure showing a relationship between a fundamental pitch and a harmonic overtone. |
申请公布号 |
US8818806(B2) |
申请公布日期 |
2014.08.26 |
申请号 |
US201113305322 |
申请日期 |
2011.11.28 |
申请人 |
JVC KENWOOD Corporation |
发明人 |
Yamabe Takaaki |
分类号 |
G10L15/00 |
主分类号 |
G10L15/00 |
代理机构 |
Renner, Kenner, Greive, Bobak, Taylor & Weber |
代理人 |
Renner, Kenner, Greive, Bobak, Taylor & Weber |
主权项 |
1. A speech processing apparatus comprising:
a frame extraction unit configured to extract a signal portion per frame having a specific duration from an input signal that includes periodic non-speech segments, thus generating a per-frame input signal; a spectrum generation unit configured to convert the per-frame input signal in a time domain into a per-frame input signal in a frequency domain, thereby generating a spectral pattern of spectra; a peak detection unit configured to detect peak spectra having peaks in the spectral pattern by determining at least one spectrum of a first spectrum group of a predetermined number of spectra as the peak spectrum based on a predetermined criterion if an energy ratio of total energy of the first spectrum group to total energy of a second group of the predetermined number of spectra, next to the first spectrum group in the spectral pattern, is equal to or higher than a predetermined threshold level; and a harmonic-overtone determination unit configured to determine a harmonic spectrum, in the peak spectra, having a harmonic structure showing a relationship between a fundamental pitch and a harmonic overtone based on a barycentric frequency weighted by energy of each of the peak spectra; and a noise attenuation unit configured to attenuate energy corresponding to spectra obtained by removing the harmonic spectrum from the peak spectra in the spectral pattern. |
地址 |
Kanagawa-Ku, Yokohama-Shi, Kanagawa-Ken JP |