发明名称 Speech data retrieval apparatus, speech data retrieval method, speech data retrieval program and computer usable medium having computer readable speech data retrieval program embodied therein
摘要 A speech data retrieval apparatus (10) includes a speech database (1), a speech recognition unit (2), a confusion network creation unit (3), an inverted index table creation unit (4), a query input unit (6), a query conversion unit (7) and a label string check unit (8). The speech recognition unit (2) reads speech data from the speech database (1), carries out a speech recognition process with respect to the read speech data, and outputs a result of speech recognition process as a lattice in which a phoneme, a syllable, or a word is a base unit. The confusion network creation unit (3) creates a confusion network based on the output lattice and outputs the result of speech recognition process as the confusion network. The inverted index table creation unit (4) creates an inverted index table based on the output confusion network. The query input unit (6) receives a query input by a user, carries out a speech recognition process with respect to the received query, and outputs a result of speech recognition process as a character string. The query conversion unit (7) converts the output character string into a label string in which a phoneme, a syllable, or a word is a base unit. The label string check unit (8) checks the label string against the inverted index table and retrieves speech data which is included in both of the label string and the speech database (1).
申请公布号 US8386264(B2) 申请公布日期 2013.02.26
申请号 US20080593636 申请日期 2008.04.11
申请人 NIPPON TELEGRAPH AND TELEPHONE CORPORATION;MASSACHUSETTS INSTITUTE OF TECHNOLOGY;HORI TAKAAKI;HETHERINGTON I. LEE;HAZEN TIMOTHY J.;GLASS JAMES R. 发明人 HORI TAKAAKI;HETHERINGTON I. LEE;HAZEN TIMOTHY J.;GLASS JAMES R.
分类号 G10L15/00 主分类号 G10L15/00
代理机构 代理人
主权项
地址