An Analysis of TOEFL®-CBT Writing Prompt Difficulty and Comparability for Different Gender Groups

Author(s):
Breland, Hunter; Lee, Yong-Won; Najarian, Michelle; Muraki, Eiji
Publication Year:
2004
Report Number:
RR-04-05, TOEFL-RR-76
Source:
Document Type:
Subject/Key Words:
Computer-based writing assessment essay prompts comparability fairness polytomous DIF (differential item functioning) gender logistic regression proportional odds ratio models

Abstract

This investigation of the comparability of writing assessment prompts was conducted in two phases. In an exploratory Phase I, 47 writing prompts administered in the TOEFL® computer-based test (CBT) from July through December 1998 were examined. Logistic regression procedures were used to estimate prompt difficulty and gender effects. A panel of experts reviewed selected prompts, and a taxonomy of prompt characteristics was developed and related to prompt difficulty and gender differences. In Phase II, 87 prompts administered from July 1998 through March 2000 were analyzed. All of the prompts used in Phase I, together with 40 new prompts, were analyzed using the larger Phase II database. Recommendations are made for statistical quality control procedures to identify less comparable prompts.

Read More