Comparability of TOEFL® CBT Writing Prompts for Different Native Language Groups
- Lee, Yong-Won; Breland, Hunter; Muraki, Eiji
- Subject/Key Words: Computer-based writing assessment; essay prompt comparability; fairness; polytomous DIF (differential item functioning); native languages; logistic regression; proportional odds-ratio model
This study investigated the comparability of computer-based testing (CBT) writing prompts in the TOEFL® test (Test of English as a Foreign Language™) for examinees of different native language backgrounds. A total of 81 writing prompts introduced from July 1998 through August 2000 were examined using a three-step logistic regression procedure for ordinal items. An English language ability (ELA) variable was created by summing the standardized TOEFL Reading, Listening, and Structure scale scores, and this variable was used to match examinees from the East Asian (Chinese, Japanese, and Korean) and European (German, French, and Spanish) language groups. Although about one-third of the 81 prompts were initially flagged for statistically significant group effects, the effect sizes were too small for any flagged prompt to be classified as having an important group effect.
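As an illustration of the matching variable described above, the following minimal sketch builds an ELA-style composite by standardizing three section scores and summing them for each examinee. The function names, the sample scores, and the use of the population standard deviation are assumptions made for the example; the abstract does not specify how the standardization was computed.

```python
from statistics import mean, pstdev


def standardize(scores):
    """Return z-scores: (x - mean) / population standard deviation.

    Assumption: population SD is used; the report may have used another convention.
    """
    mu = mean(scores)
    sigma = pstdev(scores)
    if sigma == 0:
        raise ValueError("scores have zero variance; cannot standardize")
    return [(x - mu) / sigma for x in scores]


def ela_composite(reading, listening, structure):
    """Sum the standardized section scores, examinee by examinee."""
    if not (len(reading) == len(listening) == len(structure)):
        raise ValueError("all sections must cover the same examinees")
    z_reading = standardize(reading)
    z_listening = standardize(listening)
    z_structure = standardize(structure)
    return [r + l + s for r, l, s in zip(z_reading, z_listening, z_structure)]


# Hypothetical scale scores for four examinees (illustrative values only).
reading = [20, 24, 28, 16]
listening = [18, 22, 26, 30]
structure = [21, 25, 23, 27]

ela = ela_composite(reading, listening, structure)
```

Because each section is standardized before summing, the three sections contribute on a common scale, and the composite is centered at zero across the examinees in the sample.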