
Using Rasch Testlet Model to Relieve the Perniciousness of Local Item Dependence in Multistage Testing
Abstract: Although multistage testing (MST) retains the advantages of adaptive testing while allowing test developers to construct each module and panel under specified constraints, its results can still be harmed when factors overlooked during test construction introduce local item dependence (LID) among items. To examine the harm that LID does to MST, this study first introduces MST, LID, and related concepts. A simulation study then investigates the problem; the results show that the presence of LID reduces the precision of examinee ability estimates, although the estimation bias remains small, and that this harm is not limited to any particular routing rule. To remove the harm, a testlet response model was then used as the analysis model during MST administration; the results show that this approach removes part of the harm but that its effect is limited. These findings indicate that the harm LID does to the precision of ability estimation in MST deserves attention, and that methods for eliminating LID-induced harm in MST warrant further study.

Multistage testing (MST) is a type of testing in which sets of items are administered adaptively and are scored as a unit. MST has most of the advantages of adaptive testing, with more efficient and precise measurement across the proficiency scale as well as time savings, without many of the disadvantages of item-level adaptive testing (i.e., computerized adaptive testing). Local independence is a basic assumption of most psychometric models, e.g., item response theory (IRT) models. Unfortunately, this assumption can be easily violated in educational or psychological tests. Yen (1984, 1993) listed a number of factors leading to local item dependence (LID): test speed, fatigue or practice, item or response format, passage dependence, and scoring rubrics or raters. Because modules in MST can be treated as a series of mini tests, the assumption of local independence may also be violated by the factors mentioned above or by others. Many studies on traditional linear tests have shown that when standard IRT models (e.g., the Rasch model, the two-parameter logistic model) are used to fit the data, LID results in overestimation of the precision of the test as a whole, spuriously high reliability coefficients, and biased parameter estimates. Typically, MST uses standard IRT models to estimate test-takers' abilities. Thus, we can deduce that LID, if it exists, may affect the results of MST.
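To make the notion of LID in the abstract concrete, the following is a minimal sketch of how dichotomous responses with testlet-induced dependence could be simulated. It assumes the common form of the Rasch testlet model, in which the log-odds of a correct response are the ability minus the item difficulty plus a person-specific random effect shared by all items in the same testlet. The function name `simulate_rasch_testlet`, its parameters, and the normal distribution of the testlet effect are illustrative assumptions; the abstract does not describe the study's actual simulation design.

```python
import math
import random


def simulate_rasch_testlet(thetas, difficulties, testlet_of_item, testlet_sd, seed=0):
    """Simulate 0/1 responses under an assumed Rasch testlet model.

    thetas          -- list of examinee abilities
    difficulties    -- list of item difficulties
    testlet_of_item -- testlet index (0-based) for each item
    testlet_sd      -- SD of the person-by-testlet effect; 0 gives the plain Rasch model
    """
    rng = random.Random(seed)
    n_testlets = max(testlet_of_item) + 1
    responses = []
    for theta in thetas:
        # One random effect per testlet for this examinee. Items in the same
        # testlet share it, which is what makes them locally dependent.
        gammas = [rng.gauss(0.0, testlet_sd) for _ in range(n_testlets)]
        row = []
        for b, d in zip(difficulties, testlet_of_item):
            logit = theta - b + gammas[d]
            p_correct = 1.0 / (1.0 + math.exp(-logit))
            row.append(1 if rng.random() < p_correct else 0)
        responses.append(row)
    return responses
```

Under these assumptions, calling the function with `testlet_sd=0` yields locally independent Rasch data, and increasing `testlet_sd` strengthens the dependence among items within a testlet, so the two settings can serve as a no-LID baseline and an LID condition.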
Source: Journal of Psychological Science (《心理科学》), 2017, No. 1, pp. 216-223 (8 pages). Indexed in CSSCI, CSCD, and the Peking University Core Journals list.
Keywords: multistage testing, local dependence, testlet, item response theory, testlet response model, computerized adaptive testing











