摘要 |
One embodiment of the present invention relates to a voice synthesizing method and a voice synthesizing system using facial video recognition, and an external input device, which form a voice source based on a user′s voice, and determine syllables and words based on facial videos of the vocalizing user, to synthesize and output the determined texts via the voice source which is based on the stored user′s voice. A voice synthesizing system using facial video recognition and an external input device according to one embodiment of the present invention, may comprise: a voice source storage part for storing a consonant set and a vowel set that are formed based on a user′s voice; a pronunciation video information storage part where facial videos including the shape of mouth for each vowel during user′s vocalizing are stored; a facial video acquisition device which acquires facial videos including the shape of mouth during user′s vocalizing; and a voice output device, which compares the inputted video acquired from the facial video acquisition device with the facial video stored in the pronunciation video information storage part, to determine the type of vowel that is vocalized, and detects the determined vowel and the consonants inputted by the user from the consonant set and the vowel set of the voice source storage, respectively, to synthesize the detected consonant and vowel and output them as voice. |