发明名称 Localizing the position of a source of a voice signal
摘要 The invention relates to localizing the position of a person speaking by using pictures of a pattern (21) on an object (20) worn by the person. The object (20) carries a complex pattern (21) that is optimized for determining the orientation of the object (20), the distance from the object to a microphone device (14) and/or to a camera (11). Moreover, the pattern (21) may be arranged for identifying the person carrying the object (20). The determination of the position of the person carrying the object (20) may be used to enhance speech recognition (SR) and/or to provide hands-free voice control of devices (DC), e.g. in hospitals or in industrial settings.
申请公布号 US8831954(B2) 申请公布日期 2014.09.09
申请号 US200912990792 申请日期 2009.05.05
申请人 Nuance Communications, Inc. 发明人 Bruekers Alphons Antonius Maria Lambertus;Sarroukh Bahaa Eddine;Kevenaar Thomas Andreas Maria
分类号 G10L21/00;G10L25/00;H04R3/00;G06K9/00;G06K9/46;G06K9/32;G10L15/20 主分类号 G10L21/00
代理机构 Wolf, Greenfield & Sacks, P.C. 代理人 Wolf, Greenfield & Sacks, P.C.
主权项 1. A system for localizing a position of a source of a voice signal, comprising: an object arranged to be worn by a person having speech organs arranged for being a source of a voice signal, said object comprising a visually detectable pattern, said pattern arranged to be placed a distance from the source of the voice signal; a camera device arranged for recording at least one picture of at least part of said pattern; at least one processor to determine a position and an orientation of said pattern in relation to the camera device on the basis of said at least one picture; a compensator configured to compensate for a difference in position between said pattern of said object and the speech organs of said person wearing the object; and a microphone device arranged for adapting to the position of said source of a voice signal based on the position and the orientation of said pattern and the difference in position between said pattern of said object and the speech organs of said person wearing the object; wherein said at least one processor is configured to determine the position of said source of the voice signal based on image processing, where the image processing is performed on said at least one picture of the at least part of said pattern.
地址 Burlington MA US