发明名称 |
Sound source-separating device and sound source-separating method |
摘要 |
A sound source-separating device includes a sound-collecting part, an imaging part, a sound signal-evaluating part, an image signal-evaluating part, a selection part that selects whether to estimate a sound source direction based on the first sound signal or the first image signal, a person position-estimating part that estimates a sound source direction using the first image signal, a sound source direction-estimating part that estimates a sound source direction, a sound source-separating part that extracts a second sound signal corresponding to the sound source direction from the first sound signal, an image-extracting part that extracts a second image signal of an area corresponding to the estimated sound source direction from the first image signal, and an image-combining part that changes a third image signal of an area other than the area for the second image signal and combines the third image signal with the second image signal. |
申请公布号 |
US9595259(B2) |
申请公布日期 |
2017.03.14 |
申请号 |
US201514833615 |
申请日期 |
2015.08.24 |
申请人 |
HONDA MOTOR CO., LTD. |
发明人 |
Mizumoto Takeshi;Nakadai Kazuhiro |
分类号 |
G10L21/0272;G10L17/02;G06T7/00;G06K9/46;G06K9/00;H04R1/40;G10L15/26;G10L21/0208;G10L21/0216;G10L21/028 |
主分类号 |
G10L21/0272 |
代理机构 |
Rankin, Hill & Clark LLP |
代理人 |
Rankin, Hill & Clark LLP |
主权项 |
1. A sound source-separating device comprising:
a sound-collecting part configured to collect sound and generate a first sound signal; an imaging part configured to capture an image and generate a first image signal; a sound signal-evaluating part configured to evaluate the first sound signal; an image signal-evaluating part configured to evaluate the first image signal; a selection part configured to select whether to estimate a sound source direction based on the first sound signal or whether to estimate a sound source direction based on the first image signal, based on an evaluation result of the first sound signal by the sound signal-evaluating part and an evaluation result of the first image signal by the image signal-evaluating part; a person position-estimating part configured to estimate a sound source direction indicating a direction of a speaker by using the first image signal when the selection part has selected that the sound source direction is estimated based on the first image signal; a sound source direction-estimating part configured to estimate a sound source direction by using the first sound signal when the selection part has selected that the sound source direction is estimated based on the first sound signal; a sound source-separating part configured to extract a second sound signal corresponding to the sound source direction from the first sound signal based on the estimated sound source direction estimated by the sound source direction-estimating part when the selection part has selected that sound source direction is estimated based on the first sound signal; an image-extracting part configured to extract a second image signal of an area corresponding to the estimated sound source direction from the first image signal, the estimated sound source direction being estimated by the person position-estimating part when the selection part has selected that sound source direction is estimated based on the first image signal; and an image-combining part configured to change a third image signal of an area other than the area for the second image signal and to combine the third image signal with the second image signal. |
地址 |
Tokyo JP |