A Note on the Type I Error Rate of the PARSCALE G² Statistic for Long Tests

Applied Psychological Measurement（2013）

引用 0|浏览0

暂无评分

摘要

The PARSCALE G2 statistic is arguably the most popular item fit statistic in operational testing. For long tests, the Type I error rates of the statistic have often been found to be satisfactory. However, the Type I error rates of the statistic have only been studied for sample sizes of up to several thousands. The authors examined the Type I error rates of the PARSCALE G2 statistic in a simulation study using sample sizes much larger than those considered in the literature. For any fixed test length, the Type I error rate of the PARSCALE G2 statistic is found to increase to 1 as the sample size increases. The findings contradict the claim in the PARSCALE software manual that the PARSCALE G2 statistic leads to a large-sample test and also contradict the common belief that the statistic has reasonable Type I error rates for long tests. Thus, this simulation study conveys the important practical message that the use of the PARSCALE G2 statistic cannot always be recommended even for long tests. The Type I error rates of the item fit statistics of Orlando and Thissen were found to be close to the nominal level for all simulation conditions considered here.

查看译文

关键词

parscale g2 statistic,long tests,error rate

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要

A Note on the Type I Error Rate of the PARSCALE G2 Statistic for Long Tests

A Note on the Type I Error Rate of the PARSCALE G² Statistic for Long Tests