Title of invention WORD AND COMPOUND WORD CLASSIFYING PROCESSING METHOD, COMPOUND WORD EXTRACTING METHOD, WORD AND COMPOUND WORD CLASSIFYING PROCESSOR, SPEECH RECOGNITION SYSTEM, MACHINE TRANSLATING DEVICE, COMPOUND WORD EXTRACTING DEVICE, AND WORD AND COMPOUND WORD STORAGE MEDIUM
Abstract PROBLEM TO BE SOLVED: To improve the accuracy of speech recognition and machine translation by classifying the words and compound words in a text together, generating classes in which words and compound words are mixed. SOLUTION: The word and compound word classifying processor consists of a word classifying means 1, a word class string generating means 2, a word class string extracting means 3, a token giving means 4, a word and token string generating means 5, a word and token classifying means 6, and a compound word substituting means 7. The word classes obtained by classifying the words are mapped onto the linear sequence of words in the text data to generate a linear sequence of word classes. From this sequence, word class strings in which every pair of adjacent word classes has an adherence above a specific value are extracted, and a token is given to each such word class string. The words and tokens are then classified together, after which each word class string corresponding to a token is replaced by the compound words belonging to that word class string. The classifying process can thus be performed automatically without discriminating between words and compound words.
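The extraction and token-giving steps described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not say how adherence is measured, so pointwise mutual information between adjacent word classes is assumed here, and the names `pmi_table`, `extract_chains`, `give_tokens` and the `<T0>` token format are hypothetical.

```python
import math
from collections import Counter


def pmi_table(classes):
    """Assumed adherence measure: PMI of each adjacent word-class pair."""
    unigrams = Counter(classes)
    bigrams = Counter(zip(classes, classes[1:]))
    n_uni = len(classes)
    n_bi = max(len(classes) - 1, 1)
    table = {}
    for (a, b), count in bigrams.items():
        p_ab = count / n_bi
        p_a = unigrams[a] / n_uni
        p_b = unigrams[b] / n_uni
        table[(a, b)] = math.log(p_ab / (p_a * p_b))
    return table


def extract_chains(classes, adherence, threshold):
    """Return (start, end) spans, end exclusive, of maximal runs of two or
    more classes in which every adjacent pair meets the threshold."""
    spans = []
    start = 0
    for i in range(1, len(classes) + 1):
        linked = (
            i < len(classes)
            and adherence.get((classes[i - 1], classes[i]), float("-inf")) >= threshold
        )
        if not linked:
            if i - start >= 2:
                spans.append((start, i))
            start = i
    return spans


def give_tokens(words, classes, spans):
    """Replace each extracted span by a token shared by all spans with the
    same class string; remember which word strings each token stands for."""
    token_ids = {}   # class string -> token
    compounds = {}   # token -> set of compound words
    out = []
    pos = 0
    for start, end in spans:
        out.extend(words[pos:start])
        chain = tuple(classes[start:end])
        token = token_ids.setdefault(chain, f"<T{len(token_ids)}>")
        compounds.setdefault(token, set()).add(" ".join(words[start:end]))
        out.append(token)
        pos = end
    out.extend(words[pos:])
    return out, compounds


# Toy data: the word classes are given by hand here; in the abstract they
# come from a prior word classifying step.
word_class = {"new": "A", "los": "A", "york": "B", "angeles": "B", "is": "C", "big": "D"}
words = ["new", "york", "is", "big", "los", "angeles", "is", "big"]
classes = [word_class[w] for w in words]

# Hand-set adherence values so that only the A-B pair exceeds the threshold.
adherence = {("A", "B"): 2.0, ("B", "C"): 0.0, ("C", "D"): 0.0, ("D", "A"): 0.0}
spans = extract_chains(classes, adherence, threshold=1.0)
mixed, compounds = give_tokens(words, classes, spans)
```

In this toy run, `mixed` is the word and token string that would be passed to the joint classification step, and `compounds` records which word strings each token would be replaced by afterwards. The joint classification of words and tokens itself is not shown.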
Publication number JPH1097286(A) Publication date 1998.04.14
Application number JP19970167243 Filing date 1997.06.24
Applicant FUJITSU LTD Inventor SHIODA AKIRA
Classification G10L15/06;G06F17/28;G10L15/18;G10L15/28 Main classification G10L15/06
Agency Agent
Principal claim
Address