发明名称 System and method for user-specified pronunciation of words for speech synthesis and recognition
摘要 The method is performed at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors. A first speech input including at least one word is received. A first phonetic representation of the at least one word is determined, the first phonetic representation comprising a first set of phonemes selected from a speech recognition phonetic alphabet. The first set of phonemes is mapped to a second set of phonemes to generate a second phonetic representation, where the second set of phonemes is selected from a speech synthesis phonetic alphabet. The second phonetic representation is stored in association with a text string corresponding to the at least one word.
申请公布号 US9620104(B2) 申请公布日期 2017.04.11
申请号 US201414298690 申请日期 2014.06.06
申请人 Apple Inc. 发明人 Naik Devang K.;Gruber Thomas R.;Weiner Liam;Binder Justin G.;Srisuwananukorn Charles;Evermann Gunnar;Williams Shaun Eric;Chen Hong;Napolitano Lia T.
分类号 G10L15/00;G10L13/027;G10L13/08;G10L15/06;G10L15/26;G10L13/04;G10L15/22 主分类号 G10L15/00
代理机构 Morrison & Foerster LLP 代理人 Morrison & Foerster LLP
主权项 1. A method for learning word pronunciations, comprising: at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors: receiving a first speech input including at least one word; determining a first phonetic representation of the at least one word, the first phonetic representation comprising a first set of phonemes selected from a speech recognition phonetic alphabet; mapping the first set of phonemes to a second set of phonemes to generate a second phonetic representation, the second set of phonemes selected from a speech synthesis phonetic alphabet that is different from the speech recognition phonetic alphabet, wherein the speech recognition phonetic alphabet and the speech synthesis phonetic alphabet are phonetic alphabets of a same language; and storing the second phonetic representation in association with a text string corresponding to the at least one word.
地址 Cupertino CA US