Improving the Statistical Aspects of E-rater: Exploring Alternative Feature Reduction and Combination Rules

Author(s):: Feng, Xin; Dorans, Neil J.; Kaplan, Bruce A.
Publication Year:: 2003
Report Number:: RR-03-15
Source:: ETS Research Report
Document Type:: Report
Page Count:: 49
Subject/Key Words:: Classification, Prediction, Electronic Essay Rater (E-rater), Automated Essay Scoring (AES), Automated Scoring and Natural Language Processing

Abstract

quasi-uniform training sample and then validating these results in a target cross-validation sample. More research is needed in several areas. First, explicit modeling of the part of essay scores that is unrelated to word length is warranted. The POM (Proportional Odds Model) approach should be investigated in greater depth. Also needed is a statistical justification for using essay scores to score CVA variables. Algorithmic approaches to prediction/classification problem, such as boosting, may prove fruitful. Further investigation of quantile regression and ridge regression should be conducted.

Request Copy (specify title and report number, if any)
http://dx.doi.org/10.1002/j.2333-8504.2003.tb01907.x

Improving the Statistical Aspects of E-rater: Exploring Alternative Feature Reduction and Combination Rules

Abstract

Read More