Title of invention Voice recognition device and method, and semiconductor integrated circuit device
Abstract A semiconductor integrated circuit device for voice recognition includes: a signal processing unit which generates a feature pattern representing a state of distribution of frequency components of an input voice signal; a voice recognition database storage unit which stores a voice recognition database including a standard pattern representing a state of distribution of frequency components of plural phonemes; a conversion list storage unit which stores a conversion list including plural words or sentences to be conversion candidates; a standard pattern extraction unit which extracts a standard pattern corresponding to character data representing the first syllable of each word or sentence included in the conversion list, from the voice recognition database; and a matching detection unit which compares the feature pattern generated from the first syllable of the voice signal with the extracted standard pattern and thus detects the matching of the syllable.
Publication number US9390709(B2) Publication date 2016.07.12
Application number US201314032906 Filing date 2013.09.20
Applicant SEIKO EPSON CORPORATION Inventor Nonaka Tsutomu
Classification G10L15/187;G10L13/08;G10L15/02;G10L15/10 Main classification G10L15/187
Agency Oliff PLC Agent Oliff PLC
Principal claim 1. A semiconductor integrated circuit device comprising: a signal processing unit which extracts frequency components of an inputted voice signal, and generates a feature pattern representing a state of distribution of the frequency components of the voice signal; a voice recognition database storage unit which stores a plurality of voice recognition databases each including a standard pattern representing a state of distribution of frequency components of plural phonemes used in a predetermined language, each of the plurality of voice recognition databases having been generated based on voice signals of a different group of speakers, each different group of speakers having a different age and/or gender than other ones of the groups of speakers; a conversion list storage unit which stores a conversion list expressed by character data and including plural words or sentences to be conversion candidates, the plural words or candidates being expected responses to a question or message; a standard pattern extraction unit which extracts the standard pattern corresponding to the character data representing the first syllable of each word or sentence included in the conversion list, from the voice recognition database; and a matching detection unit which receives an input of age and/or gender of a user, and selects a corresponding one of the plurality of voice recognition databases based on the input, and which compares the feature pattern generated from the first syllable of the voice signal with the standard pattern extracted by the standard pattern extraction unit, thus detects the matching of the syllable, and outputs information specifying a word or sentence that has the matching-detected syllable as the first syllable thereof.
Address Tokyo JP
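
The data flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the speaker-group names, the three-element vectors standing in for standard patterns, the romanized example words, the Euclidean distance measure, and the threshold value are all assumptions chosen for illustration, and the signal processing that would produce a real feature pattern is omitted.

```python
from dataclasses import dataclass
from math import dist
from typing import Dict, List, Optional, Tuple

Pattern = Tuple[float, ...]

# Hypothetical voice recognition databases, one per speaker group.
# Each maps a syllable to a toy "standard pattern" vector.
DATABASES: Dict[str, Dict[str, Pattern]] = {
    "adult_male": {
        "so": (0.9, 0.1, 0.2),
        "u": (0.1, 0.8, 0.3),
        "ka": (0.2, 0.2, 0.9),
    },
    "adult_female": {
        "so": (0.8, 0.2, 0.3),
        "u": (0.2, 0.9, 0.2),
        "ka": (0.3, 0.1, 0.8),
    },
}


@dataclass(frozen=True)
class Candidate:
    """One conversion-list entry plus the character data of its first syllable."""
    text: str
    first_syllable: str


def extract_standard_patterns(
    conversion_list: List[Candidate], database: Dict[str, Pattern]
) -> Dict[str, Pattern]:
    """Pull only the patterns for first syllables that occur in the list."""
    return {c.first_syllable: database[c.first_syllable] for c in conversion_list}


def match_first_syllable(
    feature: Pattern,
    conversion_list: List[Candidate],
    speaker_group: str,
    threshold: float = 0.3,  # assumed acceptance threshold
) -> List[str]:
    """Return every candidate whose first syllable best matches the feature."""
    database = DATABASES[speaker_group]  # database selection by age/gender input
    patterns = extract_standard_patterns(conversion_list, database)

    best_syllable: Optional[str] = None
    best_distance = threshold
    for syllable, pattern in patterns.items():
        d = dist(feature, pattern)  # Euclidean distance as a stand-in metric
        if d < best_distance:
            best_syllable, best_distance = syllable, d

    if best_syllable is None:
        return []  # no syllable matched closely enough
    return [c.text for c in conversion_list if c.first_syllable == best_syllable]


# Hypothetical expected responses to a question such as "What would you like?"
MENU = [
    Candidate("soba", "so"),
    Candidate("somen", "so"),
    Candidate("udon", "u"),
    Candidate("katsudon", "ka"),
]
```

Because only the first syllable is compared, two entries that share it (here the hypothetical "soba" and "somen") are both returned; distinguishing them would need a further comparison step, which this sketch leaves out.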