Abstract |
In some embodiments, a method and system generate image data, process the image data to generate listener data indicative of at least one listener characteristic (e.g., the position and/or size of each listener), and render at least one audio object (e.g., render an object-based audio program) in response to the listener data (and optionally also listener identification data). To render a program indicative of audio objects, at least one speaker feed may be generated for driving at least one speaker to emit sound indicative of one of the objects and additional sound indicative of another one of the objects, where the sound is intended to be perceived by a listener at a first position with balance and delay appropriate to the first position, and the additional sound is intended to be perceived by a listener at a second position with balance and delay appropriate to the second position. |
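
The abstract does not state how "balance and delay appropriate to" a listener position would be computed. The following is a minimal sketch of one possible approach, not the claimed method: it assumes free-field propagation, a fixed speed of sound, and inverse-distance level loss. The function name `delay_and_gain`, the constant `SPEED_OF_SOUND_M_S`, and the coordinate layout are all hypothetical and chosen only for illustration.

```python
import math

# Assumption: speed of sound in air at roughly 20 degrees C, in metres per second.
SPEED_OF_SOUND_M_S = 343.0


def delay_and_gain(speakers, listener):
    """Return one (delay_seconds, gain) pair per speaker for a listener position.

    speakers: list of (x, y) speaker positions in metres (hypothetical layout).
    listener: (x, y) listener position in metres, e.g. estimated from image data.

    Nearer speakers are delayed and attenuated so that sound from every speaker
    arrives at the listener at the same time and at the same level.
    """
    distances = [math.dist(s, listener) for s in speakers]
    farthest = max(distances)
    result = []
    for d in distances:
        # Delay nearer speakers by the extra travel time of the farthest one.
        delay = (farthest - d) / SPEED_OF_SOUND_M_S
        # Inverse-distance assumption: scale each feed by d / farthest.
        gain = d / farthest if farthest > 0 else 1.0
        result.append((delay, gain))
    return result


if __name__ == "__main__":
    stereo_pair = [(-1.0, 0.0), (1.0, 0.0)]
    # One audio object aligned for a first position, another for a second position.
    for name, position in [("first", (-1.0, 1.0)), ("second", (0.5, 2.0))]:
        print(name, delay_and_gain(stereo_pair, position))
```

Under these assumptions, a speaker feed carrying two objects could apply the first set of delays and gains to one object and the second set to the other before summing them per speaker.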