A Comparative Analysis of Traditional Paper-and-Pencil Test and Computerized Adaptive Test Results Based on IRT
Topic: CAT + IRT; Source: Guizhou Normal University, master's thesis, 2014
【Abstract】: Computerized adaptive testing (CAT) based on item response theory (IRT) is a faster and more effective form of testing than traditional methods. Current research on CAT concentrates on three areas: building CAT systems, studying and optimizing CAT algorithms, and combining CAT with other test theories. Few studies have examined the accuracy and efficiency of CAT in general terms, in particular the differences between CAT and traditional paper-and-pencil testing under ideal conditions.
Taking the N scale of the EPQ as its basis, this thesis first uses real data for model fitting and parameter estimation, to study the feasibility of CAT in actual testing and how its results correlate with, and differ from, those of the traditional paper-and-pencil test. It then uses Monte Carlo methods to generate simulated data and analyzes the results of the IRT-based paper-and-pencil test and of CAT. With interfering factors removed, this allows the inherent characteristics of the two approaches, in theory and in model, to be studied under ideal conditions. The results are as follows:
(1) CAT gains efficiency at the cost of a controlled sacrifice in test accuracy. In essence it balances test efficiency against test accuracy: every item removed costs some accuracy, so how to select items in a planned way is the primary problem for CAT.
(2) For the examinee group as a whole, enlarging the item bank increases the accuracy of both the traditional paper-and-pencil test and CAT. In the paper-and-pencil test the gain in accuracy is accompanied by a lower standard error. In CAT, because this thesis fixes the standard error used as the termination criterion, the standard error of the final ability estimate does not change as accuracy improves. This shows that the standard error of the ability estimate obtained from either a paper-and-pencil test or CAT cannot by itself indicate the accuracy of the test.
(3) For examinees at different ability levels, enlarging the item bank may improve test accuracy, and it may also improve test efficiency.
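The comparison described in the abstract can be illustrated with a small Monte Carlo sketch. It is not the thesis's procedure: the abstract does not state the IRT model, the estimator, the item-selection rule, or the termination threshold, so everything below is an assumption made for illustration. The sketch assumes a 2PL model, EAP ability estimation under a standard normal prior, maximum-information item selection, a stopping rule of SE <= 0.4, and randomly drawn item parameters. All function names are hypothetical.

```python
import math
import random

# Quadrature grid for the latent trait theta, from -4 to 4 in steps of 0.1.
GRID = [-4.0 + 0.1 * k for k in range(81)]


def p_endorse(theta, a, b):
    """2PL probability of a keyed response (a: discrimination, b: difficulty)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of one 2PL item at theta."""
    p = p_endorse(theta, a, b)
    return a * a * p * (1.0 - p)


def eap(responses, items):
    """EAP estimate and posterior SD (used as the SE) under a N(0, 1) prior."""
    weights = []
    for t in GRID:
        log_w = -0.5 * t * t  # log of the unnormalized normal prior
        for u, (a, b) in zip(responses, items):
            p = p_endorse(t, a, b)
            log_w += math.log(p if u else 1.0 - p)
        weights.append(math.exp(log_w))
    total = sum(weights)
    mean = sum(w * t for w, t in zip(weights, GRID)) / total
    var = sum(w * (t - mean) ** 2 for w, t in zip(weights, GRID)) / total
    return mean, math.sqrt(var)


def run_full_test(true_theta, bank, rng):
    """Fixed-form test: the simulated examinee answers every item in the bank."""
    responses = [rng.random() < p_endorse(true_theta, a, b) for a, b in bank]
    theta_hat, se = eap(responses, bank)
    return theta_hat, se, len(bank)


def run_cat(true_theta, bank, se_target, rng):
    """CAT: pick the most informative unused item until SE <= se_target."""
    remaining = list(range(len(bank)))
    used, responses = [], []
    theta_hat, se = 0.0, float("inf")
    while remaining and se > se_target:
        nxt = max(remaining, key=lambda i: item_information(theta_hat, *bank[i]))
        remaining.remove(nxt)
        used.append(bank[nxt])
        responses.append(rng.random() < p_endorse(true_theta, *bank[nxt]))
        theta_hat, se = eap(responses, used)
    return theta_hat, se, len(used)


def simulate(n_examinees=100, bank_size=40, se_target=0.4, seed=0):
    """Compare both modes on simulated examinees; item parameters are made up."""
    rng = random.Random(seed)
    bank = [(rng.uniform(0.8, 2.0), rng.gauss(0.0, 1.0)) for _ in range(bank_size)]
    results = {"full": [], "cat": []}
    for _ in range(n_examinees):
        theta = rng.gauss(0.0, 1.0)
        results["full"].append((theta,) + run_full_test(theta, bank, rng))
        results["cat"].append((theta,) + run_cat(theta, bank, se_target, rng))
    summary = {}
    for mode, rows in results.items():
        n = len(rows)
        summary[mode] = {
            # RMSE against the known true theta measures accuracy directly.
            "rmse": math.sqrt(sum((est - th) ** 2 for th, est, _, _ in rows) / n),
            "mean_se": sum(se for _, _, se, _ in rows) / n,
            "mean_items": sum(k for _, _, _, k in rows) / n,
        }
    return summary


if __name__ == "__main__":
    for mode, stats in simulate().items():
        print(mode, stats)
```

Calling `simulate` with a larger `bank_size` while holding `se_target` fixed is one way to explore result (2): the RMSE against the true ability is computed separately from the reported SE, so the two can be compared as the bank grows.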
【Degree-granting institution】: Guizhou Normal University
【Degree level】: Master's
【Year conferred】: 2014
【Classification number】: B842