发明名称 WORD SEGMENTATION METHOD AND DEVICE
摘要 A word segmentation method and device. The method comprises the steps of: extracting text information about a webpage in a search resource (101); conducting word segmentation processing on the text information using a feature entry in a word segmentation dictionary, to obtain one or more candidate segmented words (102); when ambiguity occurs during the word segmentation processing, counting the word frequency number of the candidate segmented words characterizing context in the webpage (103); adjusting the weight of the feature entry in the word segmentation dictionary according to the word frequency number (104); and conducting word segmentation processing on the text information according to the feature entry with adjusted weight in the word segmentation dictionary, to determine a target candidate segmented word (105). The present invention assists word segmentation processing based on context, and fully takes account of the characteristics of natural language, thereby effectively reducing the influence of ambiguity on word segmentation processing, and improving the accuracy rate of word segmentation.
申请公布号 WO2015196909(A1) 申请公布日期 2015.12.30
申请号 WO2015CN80675 申请日期 2015.06.03
申请人 BEIJING QIHOO TECHNOLOGY COMPANY LIMITED;QIZHI SOFTWARE (BEIJING) COMPANY LIMITED 发明人 XIANG, BIBO
分类号 G06F17/27;G06F17/30 主分类号 G06F17/27
代理机构 代理人
主权项
地址