Main claim |
1. A decoder comprising:
a receiver for receiving an encoded data signal representing a plurality of audio signals, the encoded data signal comprising encoded time-frequency tiles for the plurality of audio signals, the encoded time-frequency tiles comprising non-downmix time-frequency tiles and downmix time-frequency tiles, each downmix time-frequency tile being a downmix of at least two time-frequency tiles of the plurality of audio signals and each non-downmix time-frequency tile representing only one time-frequency tile of the plurality of audio signals, and the allocation of the encoded time-frequency tiles as downmix time-frequency tiles or non-downmix time-frequency tiles reflecting spatial characteristics of the time-frequency tiles, the encoded data signal further comprising a downmix indication for time-frequency tiles of the plurality of audio signals, the downmix indication indicating whether time-frequency tiles of the plurality of audio signals are encoded as downmix time-frequency tiles or non-downmix time-frequency tiles; a generator for generating a set of output signals from the encoded time-frequency tiles, the generation of the output signals comprising an upmixing for encoded time-frequency tiles that are indicated by the downmix indication to be downmix time-frequency tiles; wherein at least one audio signal of the plurality of audio signals is represented by two downmix time-frequency tiles being downmixes of different sets of audio signals of the plurality of audio signals; and at least one downmix time-frequency tile is a downmix of an audio object not being associated with a nominal sound source position of a sound source rendering configuration and an audio channel being associated with a nominal sound source position of a sound source rendering configuration. |
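The generator's handling of the downmix indication can be illustrated with a minimal sketch. It is not the claimed implementation: the `EncodedTile` structure, the `decode_tiles` function, the use of a single real number per tile, and the per-signal upmix `gains` are all hypothetical simplifications introduced here. In particular, the claim does not specify how the upmixing is performed.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class EncodedTile:
    """Hypothetical container for one encoded time-frequency tile."""
    tile_index: int                 # position of the tile in the time-frequency grid
    value: float                    # encoded tile content (one real number, for brevity)
    is_downmix: bool                # the downmix indication for this tile
    signals: Tuple[str, ...]        # audio signals the tile represents
    gains: Tuple[float, ...] = ()   # assumed upmix gains, one per signal (downmix tiles only)


def decode_tiles(tiles: List[EncodedTile]) -> Dict[str, Dict[int, float]]:
    """Return, per output signal, a mapping from tile index to decoded value."""
    outputs: Dict[str, Dict[int, float]] = {}
    for tile in tiles:
        if tile.is_downmix:
            # A downmix tile combines at least two signals and must be upmixed.
            if len(tile.signals) < 2 or len(tile.gains) != len(tile.signals):
                raise ValueError("downmix tile needs >= 2 signals and one gain per signal")
            for name, gain in zip(tile.signals, tile.gains):
                # Assumed upmix: scale the downmix value by a per-signal gain.
                outputs.setdefault(name, {})[tile.tile_index] = gain * tile.value
        else:
            # A non-downmix tile represents exactly one signal and passes through.
            if len(tile.signals) != 1:
                raise ValueError("non-downmix tile must represent exactly one signal")
            outputs.setdefault(tile.signals[0], {})[tile.tile_index] = tile.value
    return outputs
```

In a usage of this sketch, a signal such as a front-left channel could appear in one downmix tile together with an audio object and in another downmix tile together with a front-right channel, which mirrors the two "wherein" conditions of the claim.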