Evaluating the Comparability of Paper-and-Pencil and Computerized Versions of a Large-Scale Certification Test PPT CBT DIF

Author(s):: Puhan, Gautam; Boughton, Keith; Kim, Sooyeon
Publication Year:: 2005
Report Number:: RR-05-21
Source:: ETS Research Report
Document Type:: Report
Page Count:: 15
Subject/Key Words:: Paper and Pencil Tests (PPT), Computer-Based Testing (CBT), Differential Item Functioning (DIF), Item Impact, Standardized Mean Difference

Abstract

The study evaluated the comparability of two versions of a teacher certification test: a paper-and-pencil test (PPT) and computer-based test (CBT). Standardized mean difference (SMD) and differential item functioning (DIF) analyses were used as measures of comparability at the test and item levels, respectively. Results indicated that effect sizes derived from the SMD were small (d < 0.20) and not statistically significant (p > 0.05), suggesting no substantial difference between the two test versions. Moreover, DIF analysis revealed that reading and mathematics items were comparable for both versions. However, five writing items were flagged for DIF. Substantive reviews failed to identify format differences that could explain the performance differences, so the causes of DIF could not be identified.

Request Copy (specify title and report number, if any)
http://dx.doi.org/10.1002/j.2333-8504.2005.tb01998.x

Evaluating the Comparability of Paper-and-Pencil and Computerized Versions of a Large-Scale Certification Test PPT CBT DIF

Abstract

Read More