发明名称 Unvoiced/Voiced Decision for Speech Processing
摘要 A method for speech processing includes determining an unvoicing parameter for a first frame of a speech signal and determining a smoothed unvoicing parameter for the first frame by weighting the unvoicing parameter of the first frame and a smoothed unvoicing parameter of a second frame. The unvoicing parameter reflects a speech characteristic of the first frame. The smoothed unvoicing parameter of the second frame is weighted less heavily when the smoothed unvoicing parameter of the second frame is greater than the unvoicing parameter of the first frame. The method further includes computing a difference, by a processor, between the unvoicing parameter of the first frame and the smoothed unvoicing parameter of the first frame, and determining a classification of the first frame according to the computed difference. The classification includes unvoiced speech or voiced speech. The first frame is processed in accordance with the classification of the first frame.
申请公布号 US2017110145(A1) 申请公布日期 2017.04.20
申请号 US201615391247 申请日期 2016.12.27
申请人 Huawei Technologies Co., Ltd. 发明人 Gao Yang
分类号 G10L25/78;G10L19/22;G10L25/93 主分类号 G10L25/78
代理机构 代理人
主权项 1. A method for speech processing, the method comprising: determining an unvoicing parameter for a first frame of a speech signal, wherein the unvoicing parameter reflects a speech characteristic of the first frame; determining a smoothed unvoicing parameter for the first frame by weighting the unvoicing parameter of the first frame and a smoothed unvoicing parameter of a second frame, wherein the smoothed unvoicing parameter of the second frame is weighted less heavily when the smoothed unvoicing parameter of the second frame is greater than the unvoicing parameter of the first frame; computing a difference, by a processor, between the unvoicing parameter of the first frame and the smoothed unvoicing parameter of the first frame; determining a classification of the first frame according to the computed difference, wherein the classification comprises unvoiced speech or voiced speech; and processing the first frame by the processor in accordance with the classification of the first frame.
地址 Shenzhen CN