发明名称 |
Speech processing apparatus and method |
摘要 |
A speech processing apparatus and method. The speech processing apparatus includes a microphone to receive a speech signal, an analog/digital converter to convert the speech signal generated by the microphone into a digital speech signal, and an automatic gain controller to calculate an average value of the magnitude of the digital speech signal generated by the analog/digital converter in a plurality of frames, to determine in which region of a speech signal band the average value is located, the speech signal band being divided into a plurality of regions according to the strength of speech, and to adjust gain according to a location of the average value on the speech signal band so that the strength of speech has a level of an optimal region capable of processing the speech signal. Accordingly, speech recognition may be maximized without being constrained by the distance of a speech source. |
申请公布号 |
US9214163(B2) |
申请公布日期 |
2015.12.15 |
申请号 |
US201113306180 |
申请日期 |
2011.11.29 |
申请人 |
Samsung Electronics Co., Ltd. |
发明人 |
Kim Ki Beom |
分类号 |
G10L21/0316;G10L21/0364;H03G3/30 |
主分类号 |
G10L21/0316 |
代理机构 |
Harness, Dickey & Pierce, P.L.C. |
代理人 |
Harness, Dickey & Pierce, P.L.C. |
主权项 |
1. A speech processing apparatus, comprising:
a microphone configured to receive a speech signal; an analog/digital converter configured to convert the speech signal generated by the microphone into a digital speech signal; and an automatic gain controller configured to calculate an average value of the magnitude of the digital speech signal generated by the analog/digital converter in a plurality of frames, where the number of frames is predetermined, determine in which region of a speech signal band the average value is located, the speech signal band being divided into a plurality of regions according to a strength of speech based on a dynamic range of the microphone, and adjust gain according to a corresponding one of the plurality of regions to which a location of the average value belongs so that the strength of speech has a level of an optimal region where the speech signal is capable of being processed, wherein the speech signal band is divided into a plurality of regions based on a minimum limit capable of being detected by the microphone, a maximum limit level capable of being detected by the microphone, low and high levels of the optimal region, and a median point of the speech signal band. |
地址 |
Gyeonggi-Do KR |