发明名称 SOUND SEGMENT CLASSIFICATION DEVICE, SOUND SEGMENT CLASSIFICATION METHOD, AND SOUND SEGMENT CLASSIFICATION PROGRAM
摘要 A sound segment classification device that appropriately classifies sound segments of an observation signal by sound source, when the volume from a sound source fluctuates, when the number of sound sources is unknown, and even when a mixture of microphones of different types is used. The sound segment classification device (100) comprises: a vector calculation means (101) that calculates, from a time series of the power spectrum for sound signals collected by a plurality of microphones, a multidimensional vector series which is a vector series of the power spectrum having the same number of dimensions as there are microphones; a difference calculation means (104) that calculates, for each point in time in the multidimensional vector series that is divided into lengths of any time period, the difference vector between a point in time and the immediately preceding point in time; a sound source direction estimation means (105) that estimates as the sound source direction the main component of the difference vector found in a state where both non-orthogonality and exceeding spatial dimensions are permitted; and a sound segment determination means (106) that determines whether a sound source direction is a sound segment or a silence segment, for each sound source direction found using the sound source direction estimation means, using a prescribed sound characteristics index indicating the sound segment characteristics of sound signals input for each point in time.
申请公布号 WO2012105385(A1) 申请公布日期 2012.08.09
申请号 WO2012JP51553 申请日期 2012.01.25
申请人 NEC CORPORATION;ONISHI, YOSHIFUMI 发明人 ONISHI, YOSHIFUMI
分类号 G10L21/0216;G10L25/21 主分类号 G10L21/0216
代理机构 代理人
主权项
地址