Title of Invention |
METHODS AND DEVICES FOR TRAINING A CLASSIFIER AND RECOGNIZING A TYPE OF INFORMATION |
Abstract |
Methods and devices for training a classifier and for recognizing a type of information are provided. A method for training the classifier may include extracting, from sample information, sample clauses that include a target keyword. The method may further include obtaining a sample training set by performing, on each of the sample clauses, binary labeling based on whether the respective sample clause belongs to a target class; obtaining a plurality of words by performing word segmentation on each sample clause in the sample training set; extracting a specified characteristic set from the plurality of words, the specified characteristic set including at least one characteristic word; constructing a classifier based on the at least one characteristic word; and training the classifier based on results of the binary labeling of the sample clauses. |
Application Publication Number |
US2017052947(A1) |
Application Publication Date |
2017.02.23 |
Application Number |
US201615221248 |
Filing Date |
2016.07.27 |
Applicant |
Xiaomi Inc. |
Inventors |
WANG Pingze;LONG Fei;ZHANG Tao |
Classification Numbers |
G06F17/27;G10L15/26 |
Main Classification Number |
G06F17/27 |
Agency |
|
Agent |
|
Independent Claim |
1. A method for training a classifier, comprising:
extracting, from sample information, sample clauses containing a target keyword; obtaining a sample training set by performing, on each of the sample clauses, binary labeling based on whether the respective sample clause belongs to a target class; obtaining a plurality of words by performing word segmentation on each sample clause in the sample training set; extracting a specified characteristic set from the plurality of words, the specified characteristic set comprising at least one characteristic word; constructing a classifier based on the at least one characteristic word in the specified characteristic set; and training the classifier based on results of the binary labeling of the sample clauses in the sample training set. |
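The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the applicant's implementation: the record does not say how clauses are split, how words are segmented, how characteristic words are selected, or which classifier model is used. The sketch assumes punctuation-based clause splitting, a regular-expression tokenizer as a stand-in for word segmentation, a simple frequency-difference score for selecting characteristic words, and a Bernoulli naive Bayes classifier. All function and class names, the keyword "birthday", the target class "clause states a birth date", and the sample data are hypothetical.

```python
import math
import re
from collections import Counter


def extract_clauses(sample_texts, keyword):
    """Split each text on punctuation and keep clauses containing the keyword."""
    clauses = []
    for text in sample_texts:
        for clause in re.split(r"[.,;!?]", text):
            clause = clause.strip()
            if clause and keyword.lower() in clause.lower():
                clauses.append(clause)
    return clauses


def segment(clause):
    """Stand-in word segmentation (assumption): lowercase alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", clause.lower())


def select_characteristic_words(segmented, labels, top_n):
    """Rank words by how unevenly they occur across the two labels (assumed score)."""
    pos, neg = Counter(), Counter()
    for words, label in zip(segmented, labels):
        (pos if label else neg).update(set(words))
    n_pos = sum(labels) or 1
    n_neg = (len(labels) - sum(labels)) or 1
    score = {w: abs(pos[w] / n_pos - neg[w] / n_neg) for w in set(pos) | set(neg)}
    return sorted(score, key=lambda w: (-score[w], w))[:top_n]


class BernoulliNaiveBayes:
    """Classifier constructed over the characteristic words (assumed model)."""

    def __init__(self, characteristic_words):
        self.words = list(characteristic_words)
        self.prior = {}
        self.cond = {}

    def train(self, segmented, labels):
        """Estimate Laplace-smoothed probabilities from the binary labels."""
        for cls in (True, False):
            docs = [set(ws) for ws, y in zip(segmented, labels) if y == cls]
            self.prior[cls] = (len(docs) + 1) / (len(labels) + 2)
            self.cond[cls] = {
                w: (sum(w in d for d in docs) + 1) / (len(docs) + 2)
                for w in self.words
            }

    def predict(self, clause):
        """Return True if the clause is more likely in the target class."""
        present = set(segment(clause))
        log_p = {}
        for cls in (True, False):
            total = math.log(self.prior[cls])
            for w in self.words:
                p = self.cond[cls][w]
                total += math.log(p if w in present else 1 - p)
            log_p[cls] = total
        return log_p[True] > log_p[False]


# Hypothetical sample information and hand-assigned binary labels.
samples = [
    "My birthday is on May 3, see you there.",
    "Happy birthday to you!",
    "Her birthday is on June 9.",
    "Wish him a happy birthday.",
]
clauses = extract_clauses(samples, "birthday")
labels = [True, False, True, False]  # does the clause state a birth date?

segmented = [segment(c) for c in clauses]
classifier = BernoulliNaiveBayes(select_characteristic_words(segmented, labels, 3))
classifier.train(segmented, labels)
```

In this sketch the binary labels are supplied by hand, one per extracted clause, which mirrors the labeling step of the claim. The trained object can then be queried with `classifier.predict(...)` on a new clause that contains the keyword.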
Address |
Beijing CN |