摘要 |
Provided is an IPTV system using voice interface which includes a voice input device, a voice processing device, a query processing and content search device, and a content providing device. The voice processing device performs voice recognition to convert voice into a text. The voice processing device includes a voice preprocessing unit, a sound model database, a language model database, and a decoder. The voice preprocessing unit performs preprocessing which includes improving the quality of sound or removing noise for the received voice, and extracts a feature vector. The decoder converts the feature vector into a text by using a sound model and a language model. Moreover, the voice processing device stores the profile and preference of a user to provide personalized service. The result of voice recognition is updated in a sound model database and a user profile database each time service for a user is provided, the performance of voice recognition and the performance of personalized service can continuously be improved.
|