An Empirical Investigation of Methods for Assessing Item Fit for Mixed Format Tests.

APPLIED MEASUREMENT IN EDUCATION(2013)

引用 5|浏览14
暂无评分
摘要
Empirical information regarding performance of model-fit procedures has been a persistent need in measurement practice. Statistical procedures for evaluating item fit were applied to real test examples that consist of both dichotomously and polytomously scored items. The item fit statistics used in this study included the PARSCALE's , Orlando and Thissen's (2000) and , and Stone's (2000) and . The results of this study indicated that the fit of an individual item was affected by the choice of model-fit analyses. The performance of fit indices appeared to vary depending on item response theory (IRT) model mixtures used for calibration, sample size, and test length. In terms of consistency among the fit indices, the statistics based on the same approach (e.g., and ) showed considerably higher agreement in detecting misfitting items than the statistics based on different approaches (e.g., and ). Consistent and inconsistent findings compared to previous research are discussed along with practical implications.
更多
查看译文
关键词
sample size,item response theory,statistics,goodness of fit
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要