发明名称 Search-based word segmentation method and device for language without word boundary tag
摘要 The present invention discloses a search-based segmentation method and device for a language without a word boundary tag. The inventive method includes the steps of: a. providing at least one search engine with a segment of a text including at least one segment; b. searching for the segment through the at least one search engine, and returning search results; and c. selecting a word segmentation approach for the segment in accordance with at least part of the returned search results. The invention solves the problems of word segmentation for a language without a word boundary tag, and thus combat the limitations of the prior art in terms of flexibility, dependence upon coverage of dictionaries, available training data corpuses, processing of a new word, etc.
申请公布号 US8131539(B2) 申请公布日期 2012.03.06
申请号 US20080044258 申请日期 2008.03.07
申请人 LIU WEN;QIN YONG;WANG XIN JING;INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 LIU WEN;QIN YONG;WANG XIN JING
分类号 G06F17/27;G06F17/20 主分类号 G06F17/27
代理机构 代理人
主权项
地址