发明名称 ENCODING AND DECODING OF AUDIO SIGNALS
摘要 An encoder (1201) for encoding a plurality of audio signals comprises a selector (1303) which selects a subset of time-frequency tiles to be downmixed and a subset of tiles to be non-downmix. A downmix indication is generated which indicates whether tiles are encoded as downmixed encoded tiles or as non-downmix tiles. An encoded signal comprising the encoded tiles and the downmix indication is fed to a decoder (1203) which includes a receiver (1401) for receiving the signal. A generator (1403) generates output signals from the encoded time-frequency tiles where the generation of the output signals includes an upmixing for tiles that are indicated by the downmix indication to be encoded downmixed tiles. The invention may provide more flexible and/or improved encoding/decoding and may specifically provide improved scalability, especially at higher data rates.
申请公布号 US2015142453(A1) 申请公布日期 2015.05.21
申请号 US201314413234 申请日期 2013.07.09
申请人 KONINKLIJKE PHILIPS N.V. 发明人 Oomen Arnoldus Werner Johannes;Koppens Jeroen Gerardus Henricus;Schuijers Erik Gosuinus Petrus
分类号 G10L19/26 主分类号 G10L19/26
代理机构 代理人
主权项 1. A decoder comprising: a receiver for receiving an encoded data signal representing a plurality of audio signals, the encoded data signal comprising encoded time-frequency tiles for the plurality of audio signals, the encoded time-frequency tiles comprising non-downmix time-frequency tiles and downmix time-frequency tiles, each downmix time-frequency tile being a downmix of at least two time-frequency tiles of the plurality of audio signals and each non-downmix time-frequency tile representing only one time-frequency tile of the plurality of audio signals, and the allocation of the encoded time frequency tiles as downmix-time frequency tiles or non-time frequency tiles reflecting spatial characteristics of the time frequency tiles, the encoded data signal further comprising a downmix indication for time-frequency tiles of the plurality of audio signals, the downmix indication indicating whether time-frequency tiles of the plurality of audio signals are encoded as downmix time-frequency tiles or non-downmix time-frequency tiles; a generator for generating a set of output signals from the encoded time-frequency tiles, the generation of the output signals comprising an upmixing for encoded time-frequency tiles that are indicated by the downmix indication to be downmix time-frequency tiles; wherein at least one audio signal of the plurality of audio signals is represented by two downmix time-frequency tiles being downmixes of different sets of audio signals of the plurality of audio signals; and at least one downmix time-frequency tile is a downmix of an audio object not being associated with a nominal sound source position of a sound source rendering configuration and an audio channel being associated with a nominal sound source position of a sound source rendering configuration.
地址 EINDHOVEN NL