摘要 |
A monoscopic camera comprising one or more image sensors and a depth sensor may generate video based on two-dimensional image data captured via the one or more image sensors and corresponding depth information captured via the depth sensor. The camera may process corresponding audio for the generated video based on the captured depth information. The audio processing may comprise mitigating noise in the corresponding audio, enhancing voice quality in the corresponding audio, and/or enhancing audio quality of the corresponding audio. The camera may be operable to determine, based on the captured depth information, one or more sound paths between a source of the corresponding audio and a microphone utilized to capture the corresponding audio emanating from the source. The processing of the audio may comprise removing portions of the captured audio arriving at the microphone via one or more reflection paths. |