发明名称 |
Method and apparatus for three-dimensional acoustic field encoding and optimal reconstruction |
摘要 |
A method and apparatus to encode audio with spatial information in a manner that does not depend on the exhibition setup, and to decode and play out optimally for any given exhibition setup, maximizing the sweet-spot area, and including setups with loudspeakers at different heights, and headphones. The part of the audio that requires very precise localization is encoded into a set of mono tracks with associated directional parameters, whereas the remaining audio is encoded into a set of Ambisonics tracks of a chosen order and mixture. Upon specification of a given exhibition system, the exhibition-independent format is decoded adapting to the specified system, by using different decoding methods for each assigned group. |
申请公布号 |
US9299353(B2) |
申请公布日期 |
2016.03.29 |
申请号 |
US200913142822 |
申请日期 |
2009.12.29 |
申请人 |
Dolby International AB |
发明人 |
Sole Antonio Mateos;Albo Pau Arumi |
分类号 |
G10L19/008 |
主分类号 |
G10L19/008 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method for encoding initial audio signals and related spatial information into a reproduction layout-independent format, the initial audio signals arising from any source of a plurality of sources, the method comprising:
defining a threshold directionality value to assign to one of a first group and a second group of one or more sources of the plurality of sources requiring localization; assigning a directionality coefficient to each source of the one or more sources; grouping sources with a directionality coefficient above the threshold value to the first group, wherein the first group of sources generate a first set of tracks of audio signals that require narrow localization and encoding the first group only as a set of mono audio tracks with associated metadata describing the direction of origin of the signal of each track with respect to a recording position, and its initial playback time; encoding individual audio tracks of the first group with the associated metadata to facilitate playback through a minimal number of loudspeakers about an intended location of each respective source of the first group; grouping sources with a directionality coefficient equal to or below the threshold value to the second group, wherein the second group sources generate a second set of tracks of audio signals that do not require narrow localization and encoding the second group as at least one set of Ambisonics tracks of a given order and mixture of orders; and encoding in the metadata, spread parameters associated to each source of the first group, wherein a value between 0 and 1 describes an angular width of a recorded sound image of the first group. |
地址 |
Amsterdam Zuidoost NL |