发明名称 |
GENERATING GOLD QUESTIONS FOR CROWDSOURCING |
摘要 |
A system and method for generating gold questions for labeling tasks are disclosed. The method includes sampling a positive class from a predefined set of classes to be used in labeling documents, based on a computed measure of class popularity. A set of negative classes is identified from the set of classes based on a distance measure between the positive class and other classes in the set of classes. A gold question is generated which includes a document representative of the positive class and a set of candidate answers. The candidate answers include a label for the positive class and a label for each of the negative classes in the identified set of negative classes. A task may be generated which includes the gold question and a plurality of standard questions which each include a document to be labeled. A computer processor may implement all or part of the method. |
申请公布号 |
US2015235160(A1) |
申请公布日期 |
2015.08.20 |
申请号 |
US201414184936 |
申请日期 |
2014.02.20 |
申请人 |
Xerox Corporation |
发明人 |
Larlus-Larrondo Diane;Mishra Vivek Kumar;Kompalli Pramod Sankar;Perronnin Florent C. |
分类号 |
G06Q10/06;G06F21/30 |
主分类号 |
G06Q10/06 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method for generating a gold question for a labeling task comprising:
sampling a positive class from a predefined set of classes to be used in labeling documents, based on a computed measure of class popularity; for the positive class, identifying a set of negative classes from the set of classes based on a distance measure between the positive class and other classes in the set of classes; generating a gold question which includes a document representative of the positive class and a set of candidate answers, the candidate answers including a label for the positive class and a label for each of the negative classes in the identified set of negative classes; and outputting the gold question, wherein at least one of the sampling, identifying, and generating is performed with a computer processor. |
地址 |
Norwalk CT US |