Title of invention: METHODS AND DEVICES FOR TRAINING A CLASSIFIER AND RECOGNIZING A TYPE OF INFORMATION
Abstract: Methods and devices for training a classifier and for recognizing a type of information are provided. A method for training the classifier may include extracting, from sample information, sample clauses that each include a target keyword. The method may further include obtaining a sample training set by performing, on each of the sample clauses, binary labeling based on whether the respective sample clause belongs to a target class. The method may further include obtaining a plurality of words by performing word segmentation on each sample clause in the sample training set. The method may further include extracting a specified characteristic set from the plurality of words, the specified characteristic set including at least one characteristic word. The method may further include constructing a classifier based on the at least one characteristic word, and training the classifier based on results of the binary labeling of the sample clauses.
Publication No.: US2017052947(A1) Publication date: 2017.02.23
Application No.: US201615221248 Filing date: 2016.07.27
Applicant: Xiaomi Inc. Inventors: WANG Pingze; LONG Fei; ZHANG Tao
Classification: G06F17/27; G10L15/26 Main classification: G06F17/27
Agency: Agent:
Main claim: 1. A method for training a classifier, comprising: extracting, from sample information, sample clauses containing a target keyword; obtaining a sample training set by performing, on each of the sample clauses, binary labeling based on whether the respective sample clause belongs to a target class; obtaining a plurality of words by performing word segmentation on each sample clause in the sample training set; extracting a specified characteristic set from the plurality of words, the specified characteristic set comprising at least one characteristic word; constructing a classifier based on the at least one characteristic word in the specified characteristic set; and training the classifier based on results of the binary labeling of the sample clauses in the sample training set.
Address: Beijing CN
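The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the applicant's implementation; several choices are assumptions made only for illustration. The record does not name a segmentation method, a feature-selection criterion, or a classifier type, so the sketch substitutes a regex tokenizer for word segmentation, clause frequency for characteristic-word selection, and a Bernoulli naive Bayes model with Laplace smoothing for the classifier. The keyword "birthday", the sample texts, the labels, and all function names are hypothetical.

```python
import math
import re
from collections import Counter


def extract_clauses(text, keyword):
    """Split text into clauses at punctuation; keep those containing the keyword."""
    clauses = [c.strip() for c in re.split(r"[.,;!?]", text)]
    return [c for c in clauses if keyword in c.lower()]


def segment(clause):
    """Stand-in for word segmentation (assumption): lowercase alphabetic tokens."""
    return re.findall(r"[a-z']+", clause.lower())


def select_features(tokenized, k):
    """Assumed criterion: the k words found in the most clauses (ties alphabetical)."""
    df = Counter(w for words in tokenized for w in set(words))
    return sorted(df, key=lambda w: (-df[w], w))[:k]


def train(tokenized, labels, features):
    """Fit an assumed Bernoulli naive Bayes model with Laplace smoothing."""
    model = {"features": features, "prior": {}, "cond": {}}
    for y in (0, 1):
        docs = [set(ws) for ws, lab in zip(tokenized, labels) if lab == y]
        model["prior"][y] = (len(docs) + 1) / (len(labels) + 2)
        model["cond"][y] = {
            w: (sum(w in d for d in docs) + 1) / (len(docs) + 2)
            for w in features
        }
    return model


def predict(model, clause):
    """Return 1 if the clause is scored as the target class, else 0."""
    words = set(segment(clause))
    scores = {}
    for y in (0, 1):
        score = math.log(model["prior"][y])
        for w in model["features"]:
            p = model["cond"][y][w]
            score += math.log(p if w in words else 1 - p)
        scores[y] = score
    return max(scores, key=scores.get)


# Hypothetical sample information; target keyword "birthday".
texts = [
    "Hi Tom, my birthday is on May 5, see you soon.",
    "Her birthday is June 12.",
    "Happy birthday to you!",
    "The birthday party was fun, thanks for coming.",
]
clauses = [c for t in texts for c in extract_clauses(t, "birthday")]

# Binary labels (hypothetical): 1 if the clause states a birthday date, else 0.
labels = [1, 1, 0, 0]

tokenized = [segment(c) for c in clauses]
features = select_features(tokenized, k=6)
model = train(tokenized, labels, features)

print(predict(model, "his birthday is in july"))
print(predict(model, "happy birthday dear friend"))
```

Restricting the training set to clauses that contain the target keyword means the classifier only has to separate uses of that keyword that belong to the target class from those that do not, rather than classify arbitrary text.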