发明名称 |
Apparatus and method for transforming audio characteristics of an audio recording |
摘要 |
A method of audio processing comprises composing one or more transformation profiles for transforming audio characteristics of an audio recording and then generating for the or each transformation profile, a metadata set comprising transformation profile data and location data indicative of where in the recording the transformation profile data is to be applied; the or each metadata set is then stored in association with the corresponding recording. A corresponding method of audio reproduction comprises reading a recording and a meta-data set associated with that recording from storage, applying transformations to the recording data in accordance with the metadata set transformation profile; and then outputting the transformed recording. |
申请公布号 |
US8825483(B2) |
申请公布日期 |
2014.09.02 |
申请号 |
US200712375792 |
申请日期 |
2007.10.17 |
申请人 |
Sony Computer Entertainment Europe Limited |
发明人 |
Bardino Daniele Giuseppe;Griffiths Richard James |
分类号 |
G10L13/08;G10L13/00;G10L13/10 |
主分类号 |
G10L13/08 |
代理机构 |
Lerner, David, Littenberg, Krumholz & Mentlik, LLP |
代理人 |
Lerner, David, Littenberg, Krumholz & Mentlik, LLP |
主权项 |
1. A method of audio processing comprising the steps of:
composing one or more transformation profiles for transforming audio characteristics of an audio recording; generating, for each of the one or more transformation profiles, a metadata set comprising respective transformation profile data and location data indicative of where in the recording the transformation profile data is to be applied; storing each metadata set in association with the corresponding recording; selecting which associated metadata set is to be read from storage according to a degree of correspondence between emotion data in the metadata sets and one or more current emotion parameters in an application; applying random variations to the emotion data in the selected metadata sets; prior to composing the one or more transformation profiles, identifying locations of speech syllables in the recording; and adjusting one or more of intensity, pitch and duration transformation profiles extracted from the selected metadata. |
地址 |
GB |