发明名称 SPEECH ENHANCEMENT METHOD, SPEECH RECOGNITION METHOD, CLUSTERING METHOD AND DEVICE
摘要 The present invention discloses a speech enhancement method, a speech recognition method, a clustering method and a device. The method includes: selecting a feature vector clustering center best matched with the feature vector of a first frame speech part of a test speech; performing direct to the feature vectors of other frame speech parts contained in the test speech: selecting a feature vector clustering center best matched with the feature vector of the speech part from a feature vector clustering center best matched with the feature vector of a previous frame speech part to the speech part and a feature vector clustering center adjacent to the feature vector clustering center best matched with the feature vector of the previous frame speech part; and reconstructing the feature vector of the test speech according to the feature vectors of each frame speech part contained in the test speech and the selected feature vector clustering center. Because a feature capable of representing speech continuity is utilized during speech enhancement, the present invention can achieve a better speech enhancement effect relative to a traditional speech enhancement model in the prior art.
申请公布号 US2016358599(A1) 申请公布日期 2016.12.08
申请号 US201615173579 申请日期 2016.06.03
申请人 LE SHI ZHI XIN ELECTRONIC TECHNOLOGY (TIANJIN) LIMITED 发明人 WANG Yujun
分类号 G10L15/06;G10L15/02;G10L15/10 主分类号 G10L15/06
代理机构 代理人
主权项 1. A speech enhancement method, comprising: selecting a feature vector clustering center best matched with the feature vector of a first frame speech part contained in a test speech from feature vector clustering centers obtained by training by a selection unit; performing direct to the feature vectors of other frame speech parts contained in the test speech: selecting a feature vector clustering center best matched with the feature vector of the speech part from a feature vector clustering center best matched with the feature vector of a previous frame speech part to the speech part and obtained by training and a feature vector clustering center adjacent to the feature vector clustering center best matched with the feature vector of the previous frame speech part, wherein a set formed by each of the feature vector clustering centers obtained by training and at least one adjacent feature vector clustering center thereof has an ability to describe speech continuity; and reconstructing the feature vector of the test speech according to the feature vectors of each frame speech part contained in the test speech and the selected feature vector clustering center by a reconstruction unit; and performing speech recognition on a the reconstructed feature vector of the test speech by a speech recognition.
地址 Beijing CN