Title of invention Voice-controlled selection of media files utilizing phonetic data
Abstract A voice-controlled data system is provided that has a storage medium for storing media files, the media files having associated file identification data for allowing the identification of the media files, the file identification data including phonetic data having phonetic information corresponding to the file identification data. The phonetic data is supplied to a speech recognition unit that compares the phonetic data to a speech command input into the speech recognition unit. The data system further includes a file selecting unit that selects one of the media files based on the comparison result.
Publication number US9153233(B2) Publication date 2015.10.06
Application number US200611360034 Filing date 2006.02.21
Applicant HARMAN BECKER AUTOMOTIVE SYSTEMS GMBH Inventors Hennecke Marcus; Nüßle Gerhard
Classification G10L15/187;G10L15/22;G10L15/26 Main classification G10L15/187
Agency Artegis Law Group, LLP Agent Artegis Law Group, LLP
Main claim 1. A voice-controlled data system, comprising: a storage medium for storing media files, the media files comprising audio files and including associated file identification data for allowing the identification of the media files, where the file identification data associated with each audio file includes first phonetic information that corresponds to first phonetic rules for pronouncing an artist and a song title associated with the audio file and second phonetic information that corresponds to second phonetic rules for pronouncing the artist and the song title associated with the audio file, where the first phonetic information and the second phonetic information included in the file identification data are part of the audio file; a static vocabulary list including phonetic transcriptions of corresponding user commands; a phonetic data extraction unit for extracting the first phonetic information and the second phonetic information from the file identification data; a speech recognition unit for receiving voice data from a user, the voice data including a static vocabulary and a variable vocabulary, the static vocabulary including a user command and the variable vocabulary including the artist and the song title associated with a desired media file, where the speech recognition unit is configured to generate a control command based on comparing the received voice data to the phonetic transcriptions of the static vocabulary list and the first phonetic information and the second phonetic information extracted by the phonetic data extraction unit; and a media player for playing the media files, the media player configured to select a media file based on the control command received from the speech recognition unit.
Address Karlsbad DE
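Illustrative sketch The arrangement recited in the main claim (per-file phonetic data under two sets of phonetic rules, a static command vocabulary, an extraction unit, a recognition unit, and a media player) might be modeled roughly as below. This is not the patented implementation. It assumes the spoken input has already been converted to space-separated phoneme symbols, and it swaps in a plain sequence-similarity score from Python's `difflib` for a real speech recognizer. All names (`MediaFile`, `STATIC_VOCABULARY`, `extract_phonetics`, `recognize`, `MediaPlayer`, `LIBRARY`), the transcriptions, the file paths, and the threshold are hypothetical.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class MediaFile:
    path: str
    artist: str
    title: str
    # Two transcriptions of "artist title", one per set of phonetic rules
    # (assumed here: German-style rules first, English-style rules second).
    phonetics: Tuple[str, str]


# Static vocabulary list: user command -> phonetic transcription (made-up values).
STATIC_VOCABULARY = {"play": "p l eI", "delete": "d I l i: t"}


def extract_phonetics(files: List[MediaFile]) -> List[Tuple[str, MediaFile]]:
    """Extraction unit: pair every stored transcription with its file."""
    return [(transcription, f) for f in files for transcription in f.phonetics]


def similarity(a: str, b: str) -> float:
    """Compare two phoneme strings symbol by symbol (stand-in for acoustic scoring)."""
    return SequenceMatcher(None, a.split(), b.split()).ratio()


def recognize(command_phonemes: str, item_phonemes: str,
              files: List[MediaFile],
              threshold: float = 0.6) -> Optional[Tuple[str, MediaFile]]:
    """Recognition unit: match the command against the static vocabulary and the
    artist/title against both extracted transcriptions of every file."""
    candidates = extract_phonetics(files)
    if not candidates:
        return None
    command, cmd_score = max(
        ((c, similarity(command_phonemes, t)) for c, t in STATIC_VOCABULARY.items()),
        key=lambda pair: pair[1],
    )
    media_file, item_score = max(
        ((f, similarity(item_phonemes, t)) for t, f in candidates),
        key=lambda pair: pair[1],
    )
    if cmd_score < threshold or item_score < threshold:
        return None  # nothing close enough: no control command is generated
    return command, media_file


class MediaPlayer:
    """Media player: selects a file based on the received control command."""

    def __init__(self) -> None:
        self.now_playing: Optional[str] = None

    def handle(self, control: Tuple[str, MediaFile]) -> Optional[str]:
        command, media_file = control
        if command == "play":
            self.now_playing = media_file.path
        return self.now_playing


# Hypothetical library with two transcriptions per file.
LIBRARY = [
    MediaFile("kraftwerk_autobahn.mp3", "Kraftwerk", "Autobahn",
              ("k r a f t v E r k aU t o b a: n", "k r { f t w 3: k O: t @ b A: n")),
    MediaFile("queen_radio_ga_ga.mp3", "Queen", "Radio Ga Ga",
              ("k v i: n r a: d i o g a g a", "k w i: n r eI d i @U g A: g A:")),
]
```

Keeping both transcriptions per file lets a single utterance match whichever pronunciation the speaker used, for example a German artist name spoken with German or with English phonetics, without the recognizer having to guess the language first.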