How does Rasch modeling reveal difficulty and suitability level the fraction test question?


  • Karlimah Karlimah Elementary School Education Study Program, Universitas Pendidikan Indonesia, West Java



difficulty analysis, fractions, item fitness, Rasch modeling, test questions


This article explains how to analyze test items in arithmetic operation with fractions to obtain the items' level of difficulty and fitness. Data were collected by using multiple-choice questions given to 50 fourth-grade students of an elementary school in Tasikmalaya city. The answers were then analyzed using the Rasch model and Winsteps 3.75 application, a combination of standard deviation (SD) and logit mean values (Mean). The score data of each person and question were used to estimate the pure score in the logit scale, indicating the level of difficulty of the test items. The categories were difficult (logit value > +1 SD); very difficult (0.0 logit +1 SD); easy (0.0 logit -1 SD); very easy (logit value < –SD). Three criteria were used to determine the level of difficulty and fitness of the questions: the Outfit Z-Standard/ZSTD value; Outfit Mean Square/MNSQ; and Point Measure Correlation. It resulted in a collection of test items suitable for use with several levels of difficulties, namely, difficult, very difficult, easy, and very easy, from the previous items, which had difficult, medium, and easy categories. Rasch model can help categorize questions and students' ability levels.

Author Biography

Karlimah Karlimah, Elementary School Education Study Program, Universitas Pendidikan Indonesia, West Java

Pendidikan Guru Sekolah Dasar Terakreditasi A


Alwasilah, e. a. (1996). Glossary of educational assesment term. Ministry of Education and Culture.

Andrich, D. (1981). Book review: Probabilistic models for some intelligence and attainment test (expanded edition: Georg Rasch Chicago: The University of Chicago Press, 1980, 199 pp., 15 hardcover, 7 paperback). Applied Psychological Measurement, 5(4), 545-550.

Bond, T., & Fox, C. (2015). Applying the Rasch model: Fundamental measurement in the human sciences, third edition. Routledge.

Brodgen, H. E. (1977). The Rasch model, the law of comparative judgment and additive conjoint measurement. Psychometrika, 42(2), 631-634.

Doran, R. L. (1980). Basic measurement and evaluation of science instruction. National Science Teachers Association, 1742 Connecticut ave., NW, Washington, D.C 20009.

Fahmina, S., Masykuri, M., Ramadhani, D., & Yamtinah, S. (2019). Content validity uses Rasch model on computerized testlet instrument to measure chemical literacy capabilities. AIP Conference Proceeding, 2194(1), 1-8, Article 020023.

Goddard, R. D., Hoy, W. K., & Hoy, A. W. (2000). Collective teacher efficacy: its meaning, measure, and impact on student achievement. American education research journal, 37(2), 479-507.

Goodwin, L. D., & Leech, N. (2017). The meaning of validity in the new standards for educational and psychological testing. Measurement and Evaluation in Counseling and Development, 36(3), 181-191.

Hamdu, G., Fuadi, F., Yulianto, A., & Akhirani, Y. (2020). Items quality analysis using Rasch model to measure elementary school students’ critical thinking skill on STEM learning. JPI (Jurnal Pendidikan Indonesia), 9(1), 61-74.

Harris, L. R., & Brown, G. T. (2010). Mixing interview and questionnaire methods: practical problems in aligning data. Practical Assesment, Research, and Evaluation, 15(1), 1-15.

Jackson, T., Draugalis, J., Slack, M., Zachry, W., & D'Agostino, J. (2002). Validation of authentic performance assessment: A process suited for Rasch modeling. American Journal of Pharmaceutical Education, 66(3), 233-242.

Linacre, J. (1999). Investigating rating scale category utility. Journal of outcome measurement, 3(2), 103-122.

Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149-174.

Nur, L., Nurani, L. A., Suryana, D., & Ahmad, A. (2020). Rasch model application on character development instrument elementary school students. International Journal of Learning, Teaching and Education Research, 19(3), 437-459.

Parker, C. (2000). Performance measurement. Emerald Insight, 49(2), 63-66.

Ratna, I., Yamtinah, S., Ashadi, Masykuri, M., & Shidiq, A. (2017). The implementation of testlet assessment instrument in solubility and solubility product material for measuring students’ generic science skills. Advances in Social Science, Education and Humanities Research (ASSEHR), 158, 958-963.

Smith, R. M. (1986). Person fit in the Rasch model. Educational and Psychological Measurement, 46(2)359-372.

Sumintono, B. (2018). Rasch model measurements as tools in assesment for learning. Proceedings of the 1st International Conference on Education Innovation (ICEI 2017) Atlantis Press, 173, 38-42.

Sumintono, B., & Widhiarso, W. (2014). Aplikasi Rasch model untuk penelitian ilmu-ilmu sosial edisi revisi [Rasch model app for social science research revised edition]. Trim Komunikata Publishing House.

Sumintono, B., & Widhiarso, W. (2015). Aplikasi pemodelan Rasch pada asesmen pendidikan [Rasch modeling application in educational assessment]. Trim Komunikata Publishing House.

Zainul, A. (2001). Alternative assesment. Dirjen Dikti.




How to Cite

Karlimah, K. (2022). How does Rasch modeling reveal difficulty and suitability level the fraction test question?. Jurnal Elemen, 8(1), 66–76.




Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.