发明名称 Erroneous detection determination device, erroneous detection determination method, and storage medium storing erroneous detection determination program
摘要 An erroneous detection determination device includes: a signal acquisition unit configured to acquire, from each of microphones, a plurality of audio signals relating to ambient sound including sound from a sound source in a certain direction; a result acquisition unit configured to acquire a recognition result including voice activity information indicating the inclusion of a voice activity relating to at least one of the audio signals; a calculation unit configured to calculate, for each of audio signals on the basis of the signals in respective unit times and the certain direction, a speech arrival rate representing the proportion of the sound from the certain direction to the ambient sound in each of the unit times; and an error detection unit configured to determine, on the basis of the recognition result and the speech arrival rate, whether or not the voice activity information is the result of erroneous detection.
申请公布号 US8775173(B2) 申请公布日期 2014.07.08
申请号 US201213406935 申请日期 2012.02.28
申请人 Fujitsu Limited 发明人 Matsumoto Chikako
分类号 G10L15/00;G10L25/84 主分类号 G10L15/00
代理机构 Staas & Halsey LLP 代理人 Staas & Halsey LLP
主权项 1. An erroneous detection determination device comprising: a signal acquisition unit configured to acquire, from each of a plurality of microphones, a plurality of audio signals relating to ambient sound including sound from a sound source in a certain direction; a result acquisition unit configured to acquire a recognition result including voice activity information indicating a voice activity relating to at least one of the plurality of audio signals; a calculation unit configured to calculate, on the basis of the signal of respective unit time of the plurality of audio signals and the certain direction, a speech arrival rate representing the proportion of the sound from the certain direction to the ambient sound in each of the unit times; and an error detection unit configured to determine, on the basis of the recognition result and the speech arrival rate, whether or not the voice activity information is the result of erroneous detection.
地址 Kawasaki JP