摘要 |
The present technology pertains to a speech processing device and method, an encoding device, and a program with which it is possible to reproduce audio with a higher degree of freedom. An extraction unit acquires object metadata that includes information pertaining to the position of an object and diffuseness information. A determination unit compares the diffuseness information included in the object metadata and a diffuseness threshold value, causes object audio data to be supplied to a rendering unit when the diffuseness information is less than or equal to the diffuseness threshold value, and causes the object audio data to be supplied to a gain control unit when the diffuseness information is greater than the diffuseness threshold value. The present technology can be applied, for example, to a speech processing device. |