发明名称 Method of audio signal processing and hearing aid system for implementing the same
摘要 In a method of audio signal processing, a hearing aid system is configured to: collect sound information of a surrounding environment of the hearing aid system; capture an image of the surrounding environment of the hearing aid system; perform a directional signal processing operation on the sound information so as to generate an output audio signal; the output audio signal containing an extracted voice signal that comes from an area corresponding to a location of a target object in the image; and output the output audio signal.
申请公布号 US9491553(B2) 申请公布日期 2016.11.08
申请号 US201414464501 申请日期 2014.08.20
申请人 Liu Ching-Feng;Chen Hsiao-Han 发明人 Liu Ching-Feng;Chen Hsiao-Han
分类号 H04R25/00;H04N7/18;A61B5/117;G02C11/06;G10L15/25;G06K9/00;G10L21/0216 主分类号 H04R25/00
代理机构 Muncy, Geissler, Olds & Lowe, P.C. 代理人 Muncy, Geissler, Olds & Lowe, P.C.
主权项 1. A method of audio signal processing to be implemented by a hearing aid system, the hearing aid system including an image capturing module, a sound pickup module, and a processor, said method comprising: (a) collecting, using the sound pickup module, sound information of a surrounding environment of the hearing aid system; (b) capturing, using the image capturing module, an image of the surrounding environment of the hearing aid system; (c) using the processor to perform a directional signal processing operation on the sound information collected by the sound pickup module so as to generate an output audio signal, the output audio signal containing an extracted voice signal that comes from an area corresponding to a location of a target object in the image captured by the image capturing module; and (d) outputting the output audio signal; wherein step (c) includes the following sub-steps of: (c1) identifying, using the processor, presence of human face objects in the image captured by the image capturing module;(c2) determining, using the processor, object information corresponding respectively to the identified human face objects;(c3) determining, using the processor, a likelihood classification for each of the identified human face objects based on the object information corresponding thereto, wherein, for each of the identified human face objects, the likelihood classification is related to a likelihood of a person corresponding to the identified human face object being a target speaker; and (c4) selecting, using the processor, one of the identified human face objects as the target object, the selected one of the identified human face objects having the likelihood classification that indicates a relatively greater likelihood of the person corresponding thereto is a target speaker compared to other ones of the identified human face objects in the image.
地址 Kaohsiung TW