Abstract |
A method for the prosodic labelling of speech, including performing a first analysis step using data from an audio file, wherein the audio file is analysed as a plurality of frames positioned at fixed time intervals in said audio file; and performing a second analysis step on said data from said audio file using results of said first analysis step, wherein the second analysis is performed using a plurality of analysis windows and wherein the positions of the analysis windows are determined by segmental information.
|
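The two-pass structure described in the abstract can be illustrated with a minimal sketch. It is not the claimed implementation: the choice of RMS energy as the measured quantity, the function names `frame_rms` and `segment_windows`, and the representation of segmental information as `(label, start, end)` sample indices are all assumptions made for illustration only.

```python
import math


def frame_rms(samples, frame_len, hop):
    """First pass (assumed feature: RMS energy).

    Frames are placed at fixed intervals of `hop` samples,
    independent of the speech content.
    """
    energies = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energies.append(math.sqrt(sum(x * x for x in frame) / frame_len))
    return energies


def segment_windows(samples, segments, frame_energies):
    """Second pass over the same audio data.

    Each analysis window is positioned by segmental information
    (hypothetical format: (label, start_sample, end_sample)), and the
    first-pass result is used as a reference level for normalisation.
    """
    reference = sum(frame_energies) / len(frame_energies)
    results = []
    for label, start, end in segments:
        window = samples[start:end]
        rms = math.sqrt(sum(x * x for x in window) / len(window))
        # Relative prominence of the segment against the first-pass average.
        results.append((label, rms / reference if reference else 0.0))
    return results


# Toy signal: four silent samples followed by four samples at full amplitude.
samples = [0.0] * 4 + [1.0] * 4
energies = frame_rms(samples, frame_len=4, hop=4)
segments = [("a", 0, 4), ("b", 4, 8)]  # assumed segment boundaries
print(segment_windows(samples, segments, energies))
```

The point of the sketch is the difference in window placement: the first pass ignores the content of the signal, while the second pass takes its window positions from the segment boundaries and reuses the first-pass output.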