Abstract |
PROBLEM TO BE SOLVED: To resolve mask permutation by using voice likeness to correct masks that are created from the arrival directions of observed sound calculated from the time difference of arrival (TDOA). SOLUTION: A sound source separation device includes: an arrival direction calculation unit 1 that calculates the arrival direction of the voice output from each sound source; a mask creation unit 2 that creates, for each arrival direction calculated by the arrival direction calculation unit 1, a mask that removes, from the observation signal spectrum on the time-frequency plane, the observation signal spectrum of observed sound other than the target voice output from the corresponding one of a plurality of sound sources; a mask re-estimation unit 3 that verifies, for each mask, how well the target voice is separated from the other observed sound based on characteristics of voice, and re-estimates the mask based on the verification result; and a mask unit 4 that uses each mask re-estimated by the mask re-estimation unit 3 to mask the observation signal spectrum of observed sound other than the target voice out of the observation signal spectrum, thereby acquiring the observation signal spectrum of the target voice. |
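
The mask creation and masking steps (units 2 and 4) can be illustrated with a minimal sketch. It is not the claimed implementation. It assumes binary masks, one arrival-direction estimate per time-frequency bin, angles in degrees with no wrap-around, and a nearest-direction assignment rule. The names `create_masks` and `apply_mask` are hypothetical. The re-estimation in unit 3 is left out because the abstract does not state the voice-likeness criterion.

```python
from typing import List

Grid = List[List[float]]         # [frame][frequency bin] arrival directions
Mask = List[List[int]]           # [frame][frequency bin] binary mask
Spectrum = List[List[complex]]   # [frame][frequency bin] complex spectrum


def create_masks(doa: Grid, source_dirs: List[float]) -> List[Mask]:
    """Build one binary mask per source (assumed rule: a bin belongs to
    the source whose direction is nearest to the bin's estimate)."""
    masks = [[[0] * len(row) for row in doa] for _ in source_dirs]
    for t, row in enumerate(doa):
        for f, angle in enumerate(row):
            nearest = min(range(len(source_dirs)),
                          key=lambda k: abs(angle - source_dirs[k]))
            masks[nearest][t][f] = 1
    return masks


def apply_mask(spectrum: Spectrum, mask: Mask) -> Spectrum:
    """Zero every bin that the mask does not assign to the target."""
    return [[x * m for x, m in zip(s_row, m_row)]
            for s_row, m_row in zip(spectrum, mask)]


# Illustrative values only: two frames, two frequency bins, two sources.
doa = [[10.0, 80.0], [75.0, 15.0]]
source_dirs = [0.0, 90.0]
spectrum = [[1 + 1j, 2 + 0j], [3 + 0j, 4 - 1j]]

masks = create_masks(doa, source_dirs)
target = apply_mask(spectrum, masks[0])  # keeps bins assigned to source 0
```

A permutation error means a mask has been attached to the wrong source at some frequencies. The sketch does not correct this; in the device described above, that is the role of the re-estimation unit 3.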