Title of invention: METHOD AND APPARATUS FOR CREATING A LANGUAGE MODEL AND KANA-KANJI CONVERSION
Abstract: A method for creating a language model capable of preventing the deterioration in quality caused by conventional back-off to the unigram. Parts of speech having the same display and reading are obtained from a storage device (206). A cluster (204) is created by combining the obtained parts of speech, and the created cluster (204) is stored in the storage device (206). In addition, when an instruction (214) for dividing the cluster is input, the cluster stored in the storage device (206) is divided (210) in accordance with the input instruction (212). Two of the clusters stored in the storage device are combined (218), and the probability of occurrence of the combined clusters in a text corpus is calculated (222). The combined cluster is associated with a bigram indicating the calculated probability and is stored in the storage device.
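The steps in the abstract (grouping parts of speech that share a display and reading, dividing a cluster on instruction, and estimating a bigram probability for a pair of clusters) can be illustrated with a minimal Python sketch. This is not the patented implementation. The function names, the lexicon layout of `(display, reading, part_of_speech)` tuples, the tag names, and the representation of the corpus as sequences of part-of-speech tags are all assumptions made for illustration, and the probability is a plain relative-frequency estimate without smoothing.

```python
from collections import defaultdict


def build_clusters(lexicon):
    """Group parts of speech that share the same (display, reading) pair.

    `lexicon` is a hypothetical iterable of (display, reading, pos) tuples.
    """
    clusters = defaultdict(set)
    for display, reading, pos in lexicon:
        clusters[(display, reading)].add(pos)
    return {key: frozenset(tags) for key, tags in clusters.items()}


def split_cluster(cluster, subset):
    """Divide a cluster into `subset` and the remainder, per an instruction."""
    subset = frozenset(subset)
    if not subset or not subset < cluster:
        raise ValueError("subset must be a non-empty proper subset of the cluster")
    return subset, cluster - subset


def cluster_bigram_probability(corpus, first, second):
    """Estimate P(second | first) by relative frequency.

    `corpus` is assumed to be a list of sentences, each a list of
    part-of-speech tags; `first` and `second` are clusters (sets of tags).
    """
    first_count = 0
    pair_count = 0
    for sentence in corpus:
        for prev_tag, next_tag in zip(sentence, sentence[1:]):
            if prev_tag in first:
                first_count += 1
                if next_tag in second:
                    pair_count += 1
    return pair_count / first_count if first_count else 0.0


# Hypothetical data, for illustration only.
lexicon = [
    ("空き", "あき", "noun"),
    ("空き", "あき", "verb_continuative"),
    ("秋", "あき", "noun"),
]
clusters = build_clusters(lexicon)
merged = clusters[("空き", "あき")]          # {"noun", "verb_continuative"}
noun_part, rest = split_cluster(merged, {"noun"})

corpus = [
    ["noun", "particle", "verb_continuative"],
    ["verb_continuative", "particle"],
    ["noun", "noun"],
]
probability = cluster_bigram_probability(corpus, merged, frozenset({"particle"}))
```

Storing each cluster as a `frozenset` keeps it hashable, so a pair of clusters could serve as a dictionary key when the estimated bigram probability is stored alongside the combined cluster.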
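Publication number: KR101279676(B1) Publication date: 2013.06.27
Application number: KR20077030209 Filing date: 2006.06.23
Applicant: Inventor:
Classification: G06F17/28 Main classification: G06F17/28
Agency: Agent:
Main claim:
Address: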