发明名称 Systems and methods for generating automated evaluation models
摘要 Systems and methods are described for generating a scoring model for responses. A computer-implemented method of calibrating a scoring model using a processing system for scoring examinee responses includes accessing a plurality of training responses for training the scoring model. The plurality of training responses are analyzed to derive values of multiple features (variables) of the training responses. The scoring model is trained based on the values of the multiple features of the training responses and one or more external measures of proficiency for each individual associated with a training response utilized in the training. The one or more external measures are not derived from the training responses. Based on the training, a weight for each of the multiple features is determined. The scoring model is calibrated to include the weights for at least some of the features for scoring examinee responses.
申请公布号 US9443193(B2) 申请公布日期 2016.09.13
申请号 US201414257380 申请日期 2014.04.21
申请人 Educational Testing Service 发明人 Haberman Shelby J.;Zhang Mo;Bridgeman Brent
分类号 G06N5/02;G09B7/00;G06F17/27;G06N99/00 主分类号 G06N5/02
代理机构 Jones Day 代理人 Jones Day
主权项 1. A computer-implemented method of calibrating a scoring model for scoring examinee responses, comprising: accessing a plurality of training responses with a processing system for training a scoring model for scoring examinee responses, the training responses and examinee responses being constructed responses; analyzing the plurality of training responses with the processing system to derive values of multiple features of the training responses, the multiple features corresponding to variables of the scoring model; training the scoring model with the processing system based on the values of the multiple features of the training responses and one or more external measures of proficiency for each individual associated with a training response utilized in the training, the one or more external measures not being derived from the training responses; determining, based on said training, a weight for each of the multiple features; and calibrating the scoring model to include the weights for at least some of the features such that the scoring model is configured to generate scores for examinee responses.
地址 Princeton NJ US