发明名称 Nutzung von Sprachidentifizierung von Mediendateidaten in Sprachdialogsystemen
摘要 The present invention relates to a method for outputting a synthesized speech signal corresponding to an orthographic string stored in a media file comprising audio data, comprising the steps of analyzing the audio data to determine at least one candidate for a language of the orthographic string, estimating a phonetic representation of the orthographic string based on the determined at least one candidate for a language and synthesizing a speech signal based on the estimated phonetic representation of the orthographic string. The invention also relates to a media player incorporating such a method for a estimating phonetic representation for song and album titles as well as artists' names for speech recognition. Furthermore, the invention relates to the choice of an appropriate speech recognizer for automatically transcribing the lyrics of songs by using audio-based language estimates.
申请公布号 DE602006005055(D1) 申请公布日期 2009.03.19
申请号 DE20066005055T 申请日期 2006.10.02
申请人 HARMAN BECKER AUTOMOTIVE SYSTEMS GMBH 发明人 WILLETT, DANIEL;SCHWENNINGER, JOCHEN;HENNECKE, MARCUS;BRUECKNER, RAYMOND
分类号 G10L15/22;G06F17/30;G10L15/26 主分类号 G10L15/22
代理机构 代理人
主权项
地址