主权项 |
1. A method for modifying an audio signal, the method comprising:
receiving an audio signal, the received audio signal having an original temporal duration; identifying a word portion of the audio signal, the word portion corresponding to a spoken word; identifying a plurality of phonemes in the word portion, a first phoneme of the plurality of phonemes occupying a temporal position in the word portion, the first phoneme having a first temporal duration in the audio signal; generating a set of alternates, each alternate in the set corresponding to an alternate spoken word satisfying phonetic similarity criteria when compared to the spoken word, the set containing a total number of alternates; identifying a subset of alternates from the set of alternates, the first phoneme occupying the temporal position in each alternate in the subset, the subset containing a subset number of alternates; calculating a first significance factor for the first phoneme, the first significance factor based on a proportion of the subset number of alternates to the total number of alternates; modifying the first temporal duration of the first phoneme based on the first significance factor; and outputting the audio signal, the output audio signal including the word portion, the word portion including the first phoneme with the modified first temporal duration, the output audio signal having a modified temporal duration different from the original temporal duration. |