Title: SPEECH SYNTHESIS DICTIONARY CREATION DEVICE, SPEECH SYNTHESIZER, SPEECH SYNTHESIS DICTIONARY CREATION METHOD, AND COMPUTER PROGRAM PRODUCT
Abstract: According to an embodiment, a device includes a table creator, an estimator, and a dictionary creator. The table creator is configured to create a table based on similarity between distributions of nodes of speech synthesis dictionaries of a specific speaker in respective first and second languages. The estimator is configured to estimate a matrix to transform the speech synthesis dictionary of the specific speaker in the first language to a speech synthesis dictionary of a target speaker in the first language, based on speech and a recorded text of the target speaker in the first language and the speech synthesis dictionary of the specific speaker in the first language. The dictionary creator is configured to create a speech synthesis dictionary of the target speaker in the second language, based on the table, the matrix, and the speech synthesis dictionary of the specific speaker in the second language.
Publication number: US2016012035(A1)
Publication date: 2016.01.14
Application number: US201514795080
Filing date: 2015.07.09
Applicant: KABUSHIKI KAISHA TOSHIBA
Inventors: Tachibana Kentaro; Tamura Masatsune; Ohtani Yamato
Classification: G06F17/27; G10L13/00
Main classification: G06F17/27
Principal claim: 1. A speech synthesis dictionary creation device comprising: a mapping table creator configured to create, based on similarity between distribution of nodes of a speech synthesis dictionary of a specific speaker in a first language and distribution of nodes of a speech synthesis dictionary of the specific speaker in a second language, a mapping table in which the distribution of nodes of the speech synthesis dictionary of the specific speaker in the first language is associated with the distribution of nodes of the speech synthesis dictionary of the specific speaker in the second language; an estimator configured to estimate a transformation matrix to transform the speech synthesis dictionary of the specific speaker in the first language to a speech synthesis dictionary of a target speaker in the first language, based on speech and a recorded text of the target speaker in the first language and the speech synthesis dictionary of the specific speaker in the first language; and a dictionary creator configured to create a speech synthesis dictionary of the target speaker in the second language, based on the mapping table, the transformation matrix, and the speech synthesis dictionary of the specific speaker in the second language.
Address: Tokyo, JP
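
Illustrative sketch (not part of the patent record): the data flow in the principal claim can be pictured with a small Python example. Everything concrete here is an assumption made for illustration: each dictionary node is modeled as a diagonal Gaussian `(mean, variance)`, similarity is measured with a symmetric Kullback-Leibler divergence, and the transformation is represented as one hypothetical affine transform `(A, b)` per first-language node that is assumed to have been estimated already from the target speaker's speech and recorded text. The record itself does not specify the similarity measure, the estimation procedure, or any of the function names below.

```python
def sym_kl(p, q):
    """Symmetric KL divergence between two diagonal Gaussians (mean, var)."""
    (m1, v1), (m2, v2) = p, q
    total = 0.0
    for a, s, b, t in zip(m1, v1, m2, v2):
        d2 = (a - b) ** 2
        # KL(p||q) + KL(q||p); the log-variance terms cancel in the sum.
        total += 0.5 * (s / t + t / s - 2.0) + 0.5 * d2 * (1.0 / s + 1.0 / t)
    return total


def build_mapping_table(dict_l1, dict_l2):
    """For each second-language node, index of the most similar first-language node."""
    return [
        min(range(len(dict_l1)), key=lambda i: sym_kl(dict_l1[i], node))
        for node in dict_l2
    ]


def apply_affine(transform, mean):
    """Apply mean' = A @ mean + b using plain lists."""
    A, b = transform
    return [sum(a * x for a, x in zip(row, mean)) + bi for row, bi in zip(A, b)]


def create_target_l2(dict_l2, mapping, transforms):
    """Transform each second-language node with the transform of its mapped
    first-language node. Variances are left unchanged (a simplification)."""
    return [
        (apply_affine(transforms[mapping[j]], mean), list(var))
        for j, (mean, var) in enumerate(dict_l2)
    ]


# Toy dictionaries of the same (specific) speaker in two languages.
dict_l1 = [([0.0, 0.0], [1.0, 1.0]), ([5.0, 5.0], [1.0, 1.0])]
dict_l2 = [([4.5, 5.2], [1.0, 1.2]), ([0.3, -0.1], [0.9, 1.0])]

# Hypothetical transforms, one per first-language node, assumed to be
# already estimated from the target speaker's first-language data.
transforms = [
    ([[1.0, 0.0], [0.0, 1.0]], [1.0, -1.0]),
    ([[2.0, 0.0], [0.0, 2.0]], [0.0, 0.0]),
]

mapping = build_mapping_table(dict_l1, dict_l2)
target_l2 = create_target_l2(dict_l2, mapping, transforms)
print(mapping)  # [1, 0]
```

The point of the sketch is the indirection: the transforms are learned only in the first language, and the mapping table decides which of them each second-language node borrows.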