Title: GENERATING GOLD QUESTIONS FOR CROWDSOURCING
Abstract: A system and method for generating gold questions for labeling tasks are disclosed. The method includes sampling a positive class from a predefined set of classes to be used in labeling documents, based on a computed measure of class popularity. A set of negative classes is identified from the set of classes based on a distance measure between the positive class and other classes in the set of classes. A gold question is generated which includes a document representative of the positive class and a set of candidate answers. The candidate answers include a label for the positive class and a label for each of the negative classes in the identified set of negative classes. A task may be generated which includes the gold question and a plurality of standard questions which each include a document to be labeled. A computer processor may implement all or part of the method.
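The task assembly mentioned at the end of the abstract could be sketched roughly as follows. This is an illustrative sketch only, not the method claimed in the application: the function name `build_task`, the dictionary layout, the `is_gold` flag, and the choice to shuffle the gold question among the standard questions are all assumptions made for the example.

```python
import random


def build_task(gold_question, documents, labels, rng):
    """Mix one gold question into a list of standard questions.

    gold_question: dict with "document", "candidates", and "answer" keys
                   (hypothetical layout).
    documents:     unlabeled documents, one per standard question.
    labels:        candidate labels offered for each standard question
                   (assumption: the source does not say how these are chosen).
    rng:           a random.Random instance, passed in for reproducibility.
    """
    standard = [
        {"document": doc, "candidates": list(labels), "is_gold": False}
        for doc in documents
    ]
    # Copy the gold question and mark it; the flag is for the requester's
    # bookkeeping and would not be shown to the worker.
    gold = dict(gold_question, is_gold=True)
    questions = standard + [gold]
    # Assumption: shuffle so the gold question's position is not predictable.
    rng.shuffle(questions)
    return questions
```

Because the known answer travels with the gold question, a requester could later compare a worker's response on that one item against `answer` to estimate reliability.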
Publication number: US2015235160(A1) Publication date: 2015.08.20
Application number: US201414184936 Filing date: 2014.02.20
Applicant: Xerox Corporation Inventors: Larlus-Larrondo Diane; Mishra Vivek Kumar; Kompalli Pramod Sankar; Perronnin Florent C.
Classification: G06Q10/06; G06F21/30 Main classification: G06Q10/06
Agency: Agent:
Main claim: 1. A method for generating a gold question for a labeling task comprising: sampling a positive class from a predefined set of classes to be used in labeling documents, based on a computed measure of class popularity; for the positive class, identifying a set of negative classes from the set of classes based on a distance measure between the positive class and other classes in the set of classes; generating a gold question which includes a document representative of the positive class and a set of candidate answers, the candidate answers including a label for the positive class and a label for each of the negative classes in the identified set of negative classes; and outputting the gold question, wherein at least one of the sampling, identifying, and generating is performed with a computer processor.
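The three steps of the claim (sampling, identifying, generating) might be sketched as below. This is a minimal illustration under stated assumptions, not the implementation described in the application. The record does not say how popularity is computed, which distance measure is used, or whether negatives are the nearest or farthest classes; the sketch assumes popularity is a per-class weight, distances are given as a nested dictionary, and the `k` nearest classes are used as negatives. All function and parameter names are hypothetical.

```python
import random


def sample_positive_class(popularity, rng):
    """Sample one class with probability proportional to its popularity.

    Assumption: popularity maps class label -> non-negative weight.
    """
    classes = list(popularity)
    weights = [popularity[c] for c in classes]
    return rng.choices(classes, weights=weights, k=1)[0]


def identify_negative_classes(positive, distances, k):
    """Pick k negative classes for the positive class.

    Assumption: distances[a][b] is the distance between classes a and b,
    and the k *nearest* classes are chosen; the claim only says the choice
    is "based on a distance measure".
    """
    others = [c for c in distances[positive] if c != positive]
    return sorted(others, key=lambda c: distances[positive][c])[:k]


def generate_gold_question(popularity, distances, exemplars, k, rng):
    """Build a gold question: a representative document plus candidate labels.

    Assumption: exemplars maps class label -> list of documents already
    known to belong to that class.
    """
    positive = sample_positive_class(popularity, rng)
    negatives = identify_negative_classes(positive, distances, k)
    candidates = [positive] + negatives
    rng.shuffle(candidates)  # avoid always listing the correct label first
    return {
        "document": rng.choice(exemplars[positive]),
        "candidates": candidates,
        "answer": positive,
    }
```

Under the nearest-class assumption the distractors resemble the correct label; a different selection rule over the same distances would only change `identify_negative_classes`.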
Address: Norwalk CT US