发明名称 ACOUSTIC ENVIRONMENT RECOGNIZER FOR OPTIMAL SPEECH PROCESSING
摘要 A system for providing an acoustic environment recognizer for optimal speech processing is disclosed. In particular, the system may utilize metadata obtained from various acoustic environments to assist in suppressing ambient noise interfering with a desired audio signal. In order to do so, the system may receive an audio stream including an audio signal associated with a user and including ambient noise obtained from an acoustic environment of the user. The system may obtain first metadata associated with the ambient noise, and may determine if the first metadata corresponds to second metadata in a profile for the acoustic environment. If the first metadata corresponds to the second metadata, the system may select a processing scheme for suppressing the ambient noise from the audio stream, and process the audio stream using the processing scheme. Once the audio stream is processed, the system may provide the audio stream to a destination.
申请公布号 US2017076736(A1) 申请公布日期 2017.03.16
申请号 US201615362372 申请日期 2016.11.28
申请人 AT&T Intellectual Property I, L.P. 发明人 Schroeter Horst J.;Bowen Donald J.;Dimitriadis Dimitrios B.;Ji Lusheng
分类号 G10L21/028;G10L25/72;G10L21/0216 主分类号 G10L21/028
代理机构 代理人
主权项 1. A system, comprising: a first memory that stores a first set of instructions; a first hardware processor of an acoustic environment recognizer that executes the first set of instructions to perform a first set of operations, the first set of operations comprising: obtaining, from visual content captured by a camera of a device of a user and from orientation data obtained from a sensor of the device of the user, first metadata associated with ambient noise occurring in an acoustic environment in which the user is located, wherein the orientation data corresponds with an orientation of the device of the user;selecting, based on the first metadata, the visual content, and the orientation data, a processing scheme for suppressing the ambient noise from an audio stream including an audio signal associated with the user; a second memory that stores a second set of instructions; and a second hardware processor of a speech signal enhancer that executes the second set of instructions to perform a second set of operations, the second set of operations comprising: processing the audio stream using the processing scheme in order to suppress the ambient noise in the audio stream; andproviding, after processing the audio stream using the processing scheme, the audio stream to a destination.
地址 Atlanta GA US