A Comparative Study of Two PETS Computer-Adaptive Sequential Testing Frameworks (cited by: 4)

Comparison Simulation Study on Two PETS-CAST Configurations
Abstract  Drawing on the characteristics of the PETS examination, its future development, and the advantages of computer-adaptive sequential testing (CAST), the authors propose and design two PETS-CAST configurations: a 1-3-5 three-stage test and a 1-2-5-5 four-stage test. A simulation study is conducted to determine which performs better. Ability values are simulated for four candidate groups of 500, 1000, 3000, and 5000 examinees. Using the Monte Carlo method, response data are then generated from these simulated abilities and the real difficulty parameters of PETS items, with item modules delivered to each candidate adaptively according to the path rules of each configuration. The psychometric indices of the two designs are then compared. The results show that as the number of stages increases, the adaptive sequential test yields more test information, the standard error of the ability estimates decreases, and simulated and estimated abilities are highly correlated. The 1-2-5-5 four-stage design performs slightly better than the 1-3-5 three-stage design, giving more accurate ability estimates and classification decisions and more reliable results. This simulation study is a useful theoretical step toward implementing PETS-CAST.
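The simulation procedure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The following are all assumptions made for illustration: a Rasch (one-parameter) response model, EAP scoring with a standard normal prior, routing to the next-stage module whose centre difficulty is closest to the provisional estimate, ten items per module, and randomly generated module difficulties in place of the real PETS item parameters. The names `make_panel`, `administer`, and `simulate` are hypothetical.

```python
import math
import random

GRID = [-4.0 + 0.2 * k for k in range(41)]  # ability grid used for EAP scoring


def p_correct(theta, b):
    """Rasch probability of a correct answer (assumed response model)."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def eap(responses, difficulties):
    """EAP ability estimate and posterior SD under a N(0, 1) prior."""
    weights = []
    for t in GRID:
        w = math.exp(-0.5 * t * t)  # unnormalised prior density
        for u, b in zip(responses, difficulties):
            p = p_correct(t, b)
            w *= p if u else 1.0 - p
        weights.append(w)
    total = sum(weights)
    mean = sum(t * w for t, w in zip(GRID, weights)) / total
    var = sum((t - mean) ** 2 * w for t, w in zip(GRID, weights)) / total
    return mean, math.sqrt(var)


def make_panel(shape, items_per_module, rng):
    """Hypothetical panel: shape (1, 3, 5) means 1, 3, 5 modules per stage."""
    panel = []
    for n in shape:
        centres = [-2.0 + 4.0 * (k + 0.5) / n for k in range(n)]
        panel.append([(c, [c + rng.uniform(-0.5, 0.5)
                           for _ in range(items_per_module)])
                      for c in centres])
    return panel


def administer(theta, panel, rng):
    """Route one simulated examinee through the panel, stage by stage."""
    responses, difficulties, se_by_stage = [], [], []
    estimate = 0.0  # provisional ability before any item is answered
    for stage in panel:
        # Assumed path rule: take the module centred nearest the estimate.
        _, items = min(stage, key=lambda m: abs(m[0] - estimate))
        for b in items:
            responses.append(rng.random() < p_correct(theta, b))
            difficulties.append(b)
        estimate, se = eap(responses, difficulties)
        se_by_stage.append(se)
    return estimate, se_by_stage


def correlation(x, y):
    """Pearson correlation between two equal-length lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simulate(shape, n_examinees, items_per_module=10, seed=0):
    """Return (true/estimated ability correlation, mean SE after each stage)."""
    rng = random.Random(seed)
    panel = make_panel(shape, items_per_module, rng)
    thetas = [rng.gauss(0.0, 1.0) for _ in range(n_examinees)]
    results = [administer(t, panel, rng) for t in thetas]
    estimates = [r[0] for r in results]
    mean_se = [sum(r[1][s] for r in results) / n_examinees
               for s in range(len(shape))]
    return correlation(thetas, estimates), mean_se


if __name__ == "__main__":
    for shape in [(1, 3, 5), (1, 2, 5, 5)]:
        r, mean_se = simulate(shape, 500)
        print(shape, round(r, 3), [round(s, 3) for s in mean_se])
```

With equal module lengths, the four-stage panel in this sketch administers more items than the three-stage panel, so any difference between the two partly reflects test length. The study's actual module sizes and path rules are not given in the abstract and would need to replace these placeholders.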
Affiliation  National Education Examinations Authority, Ministry of Education (教育部考试中心)
Source  China Examinations (《中国考试》), 2013, No. 1, pp. 16-22 (7 pages)
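Funding  One of a series of results from the project "Research and Practice of a Computer-Assisted English Testing System" (No. GFA097014), a 2009 Ministry of Education key project under the National Education Science "Eleventh Five-Year Plan"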
Keywords  computer-adaptive sequential testing (CAST); PETS; test configuration; simulation study