Improving the Statistical Aspects of E-rater: Exploring Alternative Feature Reduction and Combination Rules AES
- Author(s):
- Feng, Xin; Dorans, Neil J.; Kaplan, Bruce A.
- Publication Year:
- 2003
- Report Number:
- RR-03-15
- Source:
- ETS Research Report
- Document Type:
- Report
- Page Count:
- 49
- Subject/Key Words:
- Classification, Prediction, Electronic Essay Rater (E-rater), Automated Essay Scoring (AES), Automated Scoring and Natural Language Processing
Abstract
quasi-uniform training sample and then validating these results in a target cross-validation sample. More research is needed in several areas. First, explicit modeling of the part of essay scores that is unrelated to word length is warranted. The POM (Proportional Odds Model) approach should be investigated in greater depth. Also needed is a statistical justification for using essay scores to score CVA variables. Algorithmic approaches to prediction/classification problem, such as boosting, may prove fruitful. Further investigation of quantile regression and ridge regression should be conducted.
Read More
- Request Copy (specify title and report number, if any)
- http://dx.doi.org/10.1002/j.2333-8504.2003.tb01907.x