Abstract
A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into account, thus estimating DIF magnitude appropriately when a test is composed of testlets. A fully Bayesian estimation method was adopted for parameter estimation. The recovery of parameters was evaluated for the proposed DIF model. Simulation results revealed that the proposed bifactor MIRT DIF model produced better estimates of DIF magnitude and higher DIF detection rates than the traditional IRT DIF model for all simulation conditions. A real data analysis was also conducted by applying the proposed DIF model to a statewide reading assessment data set.
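As a rough illustration of the kind of model the abstract describes, the sketch below computes a response probability under a bifactor testlet model with a uniform DIF term. The slope-intercept form, the function name `p_correct`, and all parameter names and values are assumptions made for illustration. The abstract does not give the parameterization, so this should not be read as the authors' exact model or estimation procedure.

```python
import math

def p_correct(theta, gamma, a_general, a_testlet, d, dif=0.0, focal=False):
    """Probability of a correct response under an illustrative bifactor testlet model.

    Assumed form (not taken from the paper):
        logit = a_general * theta + a_testlet * gamma + d - dif * [focal group]
    theta : general ability of the examinee
    gamma : examinee-specific effect for the testlet containing the item
    dif   : uniform DIF magnitude, applied to the focal group only
    """
    shift = dif if focal else 0.0
    logit = a_general * theta + a_testlet * gamma + d - shift
    return 1.0 / (1.0 + math.exp(-logit))

# Two examinees with the same ability and testlet effect, differing only in group.
ref = p_correct(0.5, -0.2, 1.2, 0.8, 0.1, dif=0.4, focal=False)
foc = p_correct(0.5, -0.2, 1.2, 0.8, 0.1, dif=0.4, focal=True)
```

Under this assumed form, the gap between `ref` and `foc` on the logit scale equals the DIF magnitude once ability and the testlet effect are held fixed. A model that omits `gamma` has no separate term to absorb the testlet effect.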
Related Publications
Modeling Differential Item Functioning Using a Generalization of the Multiple-Group Bifactor Model
The authors present a generalization of the multiple-group bifactor model that extends the classical bifactor model for categorical outcomes by relaxing the typical assumption o...
Exploring the Full-Information Bifactor Model in Vertical Scaling With Construct Shift
To address the lack of attention to construct shift in item response theory (IRT) vertical scaling, a multigroup, bifactor model was proposed to model the common dimension for a...
Bifactor Models and Rotations: Exploring the Extent to Which Multidimensional Data Yield Univocal Scale Scores
The application of psychological measures often results in item response data that arguably are consistent with both unidimensional (a single common factor) and multidimensional...
A General Bayesian Model for Testlets: Theory and Applications
This paper extends earlier work (Bradlow, Wainer, & Wang, 1999; Wainer, Bradlow, & Du, 2000) on the modeling of testlet-based response data to include the situa...
Decisions that make a difference in detecting differential item functioning
There are numerous statistical procedures for detecting items that function differently across subgroups of examinees that take a test or survey. However, in endeavouring to det...
Publication Info
- Year: 2011
- Type: article
- Volume: 35
- Issue: 8
- Pages: 604-622
- Citations: 34
- Access: Closed
Identifiers
- DOI: 10.1177/0146621611428447