发明名称 Audio signal loudness determination and modification in the frequency domain
摘要 Methods of, apparatuses for, and computer readable media having instructions thereon that when executed cause carrying out methods of determining and modifying the perceived loudness of a frequency domain audio signal where the frequency resolution, and corresponding temporal coverage of the frequency domain information is not constant. The frequency (and thus temporal) resolution of the perceived loudness processing is maintained constant at the longest block size. One method includes a block combiner and a loudness modification interpolator.
申请公布号 US8892426(B2) 申请公布日期 2014.11.18
申请号 US201113167593 申请日期 2011.06.23
申请人 Dolby Laboratories Licensing Corporation 发明人 Smithers Michael J.
分类号 G10L25/00;G10L19/00;G10L21/00;H03G9/02;H03G9/00 主分类号 G10L25/00
代理机构 Inventek 代理人 Rosenfeld Dov;Inventek
主权项 1. A method of perceived loudness processing of frequency domain audio data, the method comprising: accepting blocks of frequency domain audio data comprising transform coefficients that result from applying a lapped transform on overlapping blocks of time samples of audio data, wherein one or more of the accepted blocks have a longest block size, wherein the accepted blocks not having the longest block size have at least one respective short block size shorter than the longest block size, and wherein the longest block size is a respective multiple of each of the one or more short block sizes, such that each of the accepted blocks has the longest block size or one of the one or more short block sizes; for a plurality of the accepted blocks having a particular short block size of the one or more short block sizes, combining the plurality of the accepted blocks having the particular short block size to form a block of frequency domain audio data having the longest block size; and carrying out perceived loudness processing of the accepted blocks, the perceived loudness processing being at the longest block size, including: determining or accepting one or more sets of perceived loudness parameters of the accepted blocks of frequency domain audio data, or of delayed versions of the accepted blocks of frequency domain audio data, wherein each set of perceived loudness parameters comprises a respective set of perceived loudness parameter values determined at a set of critical bands, wherein each of the one or more sets of perceived loudness parameters of the accepted blocks having the particular short block size are determined at the longest block size, and wherein the one or more determined sets of perceived loudness parameters include at least one of: a set of determined values of the critical band power spectrum of the accepted blocks or of the delayed versions of the accepted blocks of frequency domain audio data determined at the set of critical bands, anda set of determined values of the specific loudness of the accepted blocks or of the delayed versions of the accepted blocks of frequency domain audio data determined at the set of critical bands.
地址 San Francisco CA US