Title of invention: A METHOD FOR RECOVERING TARGET SPEECH BASED ON AMPLITUDE DISTRIBUTIONS OF SEPARATED SIGNALS
Abstract: The present invention provides a method for recovering target speech based on the shapes of the amplitude distributions of split spectra obtained by blind signal separation. The method includes: a first step of receiving target speech emitted from one sound source and noise emitted from another sound source, and forming mixed signals of the target speech and the noise at a first microphone and at a second microphone; a second step of performing the Fourier transform of the mixed signals from the time domain to the frequency domain, decomposing the mixed signals into two separated signals U1 and U2 by Independent Component Analysis, and, based on the transmission path characteristics of the four different paths from the two sound sources to the first and second microphones, generating the split spectra v11, v12, v21, and v22 from the separated signals U1 and U2; and a third step of extracting estimated spectra Z* corresponding to the target speech to generate a recovered spectrum group of the target speech, wherein the split spectra v11, v12, v21, and v22 are analyzed by applying criteria based on the shape of the amplitude distribution of each split spectrum, and performing the inverse Fourier transform of the recovered spectrum group from the frequency domain to the time domain to recover the target speech.
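The second and third steps can be illustrated for a single frequency bin with the minimal sketch below. It is not the claimed implementation. It assumes that the split spectra are obtained by applying the inverse of the ICA unmixing matrix `W` to each separated signal in turn, and it uses the entropy of the amplitude histogram as one possible measure of distribution shape. The abstract does not name the criterion, so both choices, along with the function names `split_spectra` and `amplitude_entropy`, are assumptions made for illustration.

```python
import numpy as np


def split_spectra(W, U):
    """Generate split spectra for one frequency bin (illustrative assumption).

    W: (2, 2) complex unmixing matrix estimated by ICA for this bin.
    U: (2, T) separated spectra U1 and U2 over T frames.
    Returns v11, v12, v21, v22, each of shape (T,): the contribution of
    U1 at microphones 1 and 2, then the contribution of U2 at microphones 1 and 2.
    """
    W_inv = np.linalg.inv(W)
    # Keep one separated signal at a time and zero the other.
    u1_only = np.vstack([U[0], np.zeros_like(U[0])])
    u2_only = np.vstack([np.zeros_like(U[1]), U[1]])
    v11, v12 = W_inv @ u1_only
    v21, v22 = W_inv @ u2_only
    return v11, v12, v21, v22


def amplitude_entropy(v, bins=20):
    """Entropy of the amplitude histogram of a spectrum series.

    Assumed shape measure: a sharply peaked amplitude distribution gives a
    low value, a flat one gives a value near log(bins).
    """
    amp = np.abs(v)
    counts, _ = np.histogram(amp, bins=bins, range=(0.0, amp.max() + 1e-12))
    p = counts / counts.sum()
    p = p[p > 0]  # skip empty bins so that log() is defined
    return float(-(p * np.log(p)).sum())
```

In use, one would compute `amplitude_entropy` for each of v11, v12, v21, and v22 in every frequency bin and pick the estimated spectrum Z* by comparing the values. Which direction of the comparison indicates speech depends on the criteria of the invention and is not settled by the abstract.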
Publication number: WO2005029467(A1)   Publication date: 2005.03.31
Application number: WO2004JP12898   Filing date: 2004.08.31
Applicant: KITAKYUSHU FOUNDATION FOR THE ADVANCEMENT OF INDUSTRY, SCIENCE AND TECHNOLOGY;KINKI UNIVERSITY;GOTANDA, HIROMU;KANEDA, KEIICHI;KOYA, TAKESHI   Inventor: GOTANDA, HIROMU;KANEDA, KEIICHI;KOYA, TAKESHI
Classification: G10L13/00;G10L19/14;G10L21/02   Main classification: G10L13/00
Agency:   Agent:
Principal claim:
Address: