Agus Santoso


Universitas Terbuka (UT) applied online examination system (sistem ujian online SUO) for end of semester examination (ujian akhir semester-UAS), beside the paper and pencil test (P & P test). In order to improve efficiency, adaptive test application should be analyzed, as an alternative to present UAS system. The aim of the research was to compare the efficiency and accuracy level of the computerized adaptive testing (CAT) design and conventional test using both P & P test and SUO. The research was conducted by simulation procedure. The item bank for the simulation used calibrated 404 test items using item response theory model. In the research, CAT and P & P test algorithm was developed. To measure efficiency, the required number of the CAT design was analyzed, while to measure accuracy of the estimation, the bias and standard error of measurement of both design were compared. The simulation result showed that (1) CAT design was more efficient, since it required only half of the number of item which was used in P & P test, to estimate the ability of examinee, (2) CAT design was more accurate in estimating ability of examinee, compared to P & P test design, since it resulted lower bias and standard error of measurement compared to conventional test design. Therefore, CAT design could be applied in UTs UAS system, while considering the balance of content for each modules.


computerized adaptive testing, item response theory, paper and pencil test.

Full Text:



Baker, F.B. (1992). Item response theory: Parameter estimation techniques. New York: Marcel Dekker, Inc.

Bond, T.G. & Fox, C.M. (2007). Applying the rasch model: Fundamental measurement in the human sciences (2nd ed). Mahwah, NJ: Lawrence Erlbaum Associates, Publishers.

Dodd, B.G. (1990). The effect of item selection procedure and stepsize on computerized adaptive attitude measurement using the rating scale model. Applied psychological measurement, 4, 355 366.

Hambleton, R.K. Swaminathan, H. & Rogers, H.J. (1991). Fundamentals of item response theory. Newbury Park, CA: Sage Publications, Inc.

Lord, F.M. (1980). Applications of item response theory to practical testing problems. Hillsdale, NJ : Lawrence Erlbaum Associates.

Thissen, D. (1990). Reliability and measurement precision. Dalam H. Wainer (Eds.), Computerized adaptive testing: A primer (2nd ed., pp. 161186). Hillsdale, NJ: Lawrence Erlbaum Associates.

Wainer, H., Dorans, N.J., Flaugher, R., Green, B.F., Mislery, R.J., Steinberg, L. et al. (1990). Computerized adaptive testing: A primer (2nd ed.). Hillsdale, NJ: Lawrence Erlbaum Associates.

Weiss, D.J. & Schleisman, J.L. (1999). Adaptive testing. Dalam G. N. Masters & J. P. Keeves (Eds.), Edvances in measurement in educational research and assessment (pp. 129137). Pergamon, NY: Elsevier Science Ltd.

Vispoel, W.P. (1999). Creating computerized adaptive test of music aptitude: Problem, solusions, and future directions. Dalam F. Drasgow, & J. B. Olson-Buchanan (Eds.), Innovations in computerized assessment (pp. 151 176). Mahwah, NJ: Lawrence Erlbaum Associates Publishers.


  • There are currently no refbacks.

Copyright (c) 2015 Jurnal Pendidikan

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.