Abstract |
<p>Timed Language Presentation System. A system for generating a timed language presentation, including: receiving an audio/video input and a script input, the script corresponding to an audio track of the audio/video input; separating the audio/video input into the audio track and a separate video stream, only the audio track being used for processing; conducting a track reading process using a text-to-speech process on the script to produce a description of words and phoneme length timings, analysing the audio track to produce a time location for words in the audio track; and processing the description of words, phoneme length timings and time location to produce a timed word/phoneme list.</p> |
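
The final processing step can be illustrated with a minimal sketch. The abstract does not say how the phoneme length timings and the word time locations are combined, so the proportional scaling used here is an assumption made for illustration only, and every name in the block (`ScriptWord`, `TimedPhoneme`, `TimedWord`, `build_timed_list`) is hypothetical. The sketch takes the text-to-speech phoneme lengths for each word, stretches or compresses them to fit the interval at which the word was located in the audio track, and emits a timed word/phoneme list.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ScriptWord:
    """A word from the script with TTS-predicted phoneme lengths (hypothetical type)."""
    text: str
    phonemes: List[Tuple[str, float]]  # (phoneme symbol, predicted length in seconds)


@dataclass
class TimedPhoneme:
    symbol: str
    start: float
    end: float


@dataclass
class TimedWord:
    text: str
    start: float
    end: float
    phonemes: List[TimedPhoneme]


def build_timed_list(
    script_words: List[ScriptWord],
    word_times: List[Tuple[float, float]],
) -> List[TimedWord]:
    """Combine TTS phoneme lengths with word time locations found in the audio.

    Assumption: phoneme lengths are scaled proportionally so that they exactly
    fill each word's observed (start, end) interval. The source does not
    specify the combination rule.
    """
    if len(script_words) != len(word_times):
        raise ValueError("each script word needs exactly one time location")

    timed: List[TimedWord] = []
    for word, (start, end) in zip(script_words, word_times):
        predicted_total = sum(length for _, length in word.phonemes)
        # Guard against words whose predicted phoneme lengths sum to zero.
        scale = (end - start) / predicted_total if predicted_total > 0 else 0.0

        cursor = start
        phonemes: List[TimedPhoneme] = []
        for symbol, length in word.phonemes:
            duration = length * scale
            phonemes.append(TimedPhoneme(symbol, cursor, cursor + duration))
            cursor += duration

        timed.append(TimedWord(word.text, start, end, phonemes))
    return timed


if __name__ == "__main__":
    # Illustrative values only, not data from the source.
    words = [ScriptWord("hi", [("HH", 0.25), ("AY", 0.75)])]
    for timed_word in build_timed_list(words, [(1.0, 3.0)]):
        print(timed_word)
```

In this sketch the word boundaries found in the audio track are treated as authoritative and the text-to-speech lengths only set the relative proportions inside each word; other combination rules are equally compatible with the abstract.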