One embodiment of the present invention sets forth a technique for providing audio enhancement to a user of a listening device. The technique includes reproducing a first audio stream, such as an audio stream associated with a media player. The technique further includes detecting a voice trigger. The voice trigger may be associated with a name of a user of the listening device. The technique further includes pausing or attenuating the first audio stream and reproducing a second audio stream associated with ambient sound in response to detecting the voice trigger.