发明名称 複合語分割
摘要 <p>Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for decompounding compound words are disclosed. In one aspect, a method includes obtaining a token that includes a sequence of characters, identifying two or more candidate sub-words that are constituents of the token, and one or more morphological operations that are required to transform the sub-words into the token, where at least one of the morphological operations involves a use of a non-dictionary word, and determining a cost associated with each sub-word and a cost associated with each morphological operation.</p>
申请公布号 JP5819860(B2) 申请公布日期 2015.11.24
申请号 JP20120553041 申请日期 2011.02.11
申请人 发明人
分类号 G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项
地址
您可能感兴趣的专利