Evaluation of pharmacy mathematics assessment items using the Rasch model

Authors

  • Alifa Sabrina Sekolah Tinggi Ilmu Kesehatan IKIFA
  • Fitri Savitri Sekolah Tinggi Ilmu Kesehatan IKIFA
  • Farida Tuahuns Sekolah Tinggi Ilmu Kesehatan IKIFA

DOI:

https://doi.org/10.29408/jel.v12i2.33850

Keywords:

assessment instrument, mathematics learning outcomes, Rasch model analysis

Abstract

This study aims to assess the psychometric quality of mathematics evaluation tools using the Rasch Model. The analysis was conducted on 15 test items administered to university students, item difficulty, model fit, reliability, and test information function. The findings of the analysis show that the item difficulty level ranges from −1.50 to 1.75 logits, indicating sufficient variation in difficulty to measure abilities from low to high. Most items show good model fit with Mean Square INFIT and OUTFIT values within an acceptable range 0.5–1.5. Item reliability was very high (0.99), indicating stability in the item difficulty hierarchy, while individual reliability was in the moderate category (0.60), reflecting the homogeneity of the respondents' abilities. The test's information function peaks in the ability range of approximately θ = 0 to θ = +1, where the lowest measurement error occurs in that range, making this instrument most accurate in assessing abilities that are average to slightly average. These findings are consistent with Rasch theory and previous research. Overall, the results of this study reinforce that the Rasch Model is effective for assessing and improving mathematics evaluation tools in higher education and provide for the development of more accurate instruments in future research

Author Biographies

Alifa Sabrina, Sekolah Tinggi Ilmu Kesehatan IKIFA

This study aims to assess the psychometric quality of mathematics evaluation tools using the Rasch Model. The analysis was conducted on 15 test items administered to university students, item difficulty, model fit, reliability, and test information function. The findings of the analysis show that the item difficulty level ranges from −1.50 to 1.75 logits, indicating sufficient variation in difficulty to measure abilities from low to high. Most items show good model fit with Mean Square INFIT and OUTFIT values within an acceptable range 0.5–1.5. Item reliability was very high (0.99), indicating stability in the item difficulty hierarchy, while individual reliability was in the moderate category (0.60), reflecting the homogeneity of the respondents' abilities. The test's information function peaks in the ability range of approximately θ = 0 to θ = +1, where the lowest measurement error occurs in that range, making this instrument most accurate in assessing abilities that are average to slightly average. These findings are consistent with Rasch theory and previous research. Overall, the results of this study reinforce that the Rasch Model is effective for assessing and improving mathematics evaluation tools in higher education and provide for the development of more accurate instruments in future research

Fitri Savitri, Sekolah Tinggi Ilmu Kesehatan IKIFA

This study aims to assess the psychometric quality of mathematics evaluation tools using the Rasch Model. The analysis was conducted on 15 test items administered to university students, item difficulty, model fit, reliability, and test information function. The findings of the analysis show that the item difficulty level ranges from −1.50 to 1.75 logits, indicating sufficient variation in difficulty to measure abilities from low to high. Most items show good model fit with Mean Square INFIT and OUTFIT values within an acceptable range 0.5–1.5. Item reliability was very high (0.99), indicating stability in the item difficulty hierarchy, while individual reliability was in the moderate category (0.60), reflecting the homogeneity of the respondents' abilities. The test's information function peaks in the ability range of approximately θ = 0 to θ = +1, where the lowest measurement error occurs in that range, making this instrument most accurate in assessing abilities that are average to slightly average. These findings are consistent with Rasch theory and previous research. Overall, the results of this study reinforce that the Rasch Model is effective for assessing and improving mathematics evaluation tools in higher education and provide for the development of more accurate instruments in future research

Farida Tuahuns, Sekolah Tinggi Ilmu Kesehatan IKIFA

This study aims to assess the psychometric quality of mathematics evaluation tools using the Rasch Model. The analysis was conducted on 15 test items administered to university students, item difficulty, model fit, reliability, and test information function. The findings of the analysis show that the item difficulty level ranges from −1.50 to 1.75 logits, indicating sufficient variation in difficulty to measure abilities from low to high. Most items show good model fit with Mean Square INFIT and OUTFIT values within an acceptable range 0.5–1.5. Item reliability was very high (0.99), indicating stability in the item difficulty hierarchy, while individual reliability was in the moderate category (0.60), reflecting the homogeneity of the respondents' abilities. The test's information function peaks in the ability range of approximately θ = 0 to θ = +1, where the lowest measurement error occurs in that range, making this instrument most accurate in assessing abilities that are average to slightly average. These findings are consistent with Rasch theory and previous research. Overall, the results of this study reinforce that the Rasch Model is effective for assessing and improving mathematics evaluation tools in higher education and provide for the development of more accurate instruments in future research

References

Bond, T. G. (2015). This is the preface from the first edition of Bond & Fox (2001). It outlines the principles and practices which drove us to write the book. The third edition (2015). www.routledge.com/9780415833424.2001.

Bond, T. G., & Fox, C. M. (2007). Applying the rasch model : fundamental measurement in the human sciences second edition university of toledo.

Boone, W. J. (2016). Rasch analysis for instrument development : why , when , and how ? https://doi.org/10.1187/cbe.16-04-0148

Boone, W. J. (2021). Rasch measurement: applications in health professions education. Health Professions Education, 7(3), 275–284.

Cook, D. A., & Beckman, T. J. (2006). Current concepts in validity and reliability for psychometric instruments: theory and application. The American Journal Of Medicine, 119(2), 521–529. https://doi.org/10.1016/j.amjmed.2005.10.036

Crocker, & Algina. (2008). Introduction to classical and modern test theory. Cengage Learning. Cengage Learning.

De Champlain, A. F. (2010). A Primer on classical test theory and item response theory for assessments in medical education. Medical Education, 44(1), 109–117. https://doi.org/10.1111/j.1365-2923.2009.03425.x

Derendorf, H., & Schmidt, S. (2020). Clinical pharmacokinetics and pharmacodynamics rowland and tozer’s.

Dwijayanti, I., Utami, R. E., & Nugroho, A. A. (2021). Peningkatan kemampuan tpack calon guru: analisis kemungkinan kesalahan pemahaman konsep aljabar serta strategi pencegahannya [Improving prospective teachers' tpack competence: analysis of possible misunderstandings of algebraic concepts and prevention strategies]. Aksioma, 10(4), 2283-2290. https://doi.org/10.24127/ajpm.v10i4.4119

Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. Lawrence Erlbaum Associates, Inc., Publishers.

Hambleton, R. K., & Swaminathan, H. (1985). Item response theory: principles and applications. In Applied Psychological Measurement (Vol. 9, Issue 3). Springer Science. https://doi.org/10.1177/014662168500900315

Haute, A. de S. (2021). Medicinal product dose calculation. 1–3.

Hermansyah, E., Rizkyanti, C. A., & Hakim, L. N. (2024). Rasch calibration and differential item functioning (DIF) analysis of the school well-being scale for students Eka. Jurnal Kependidikan, 10(1), 1–11.

Linacre, M. (2012). A user’s guidelines winstep. https://www.winsteps.com/winman/index.html

Mcneish, D., & Wolf, M. G. (2020). Thinking twice about sum scores. 2020(April), 2287–2305.

Muhtarom, M. (2024). Developing an instruments to measure prospective teacher beliefs about mathematical problem-solving using the Rasch model. Jurnal Elemen, 10(2), 274–288. https://doi.org/10.29408/jel.v10i2.25040

Mulac, A., Hagesaether, E., & Granas, A. G. (2022). Medication dose calculation errors and other numeracy mishaps in hospitals: Analysis of the nature and enablers of incident reports. Journal of Advanced Nursing, 78(1), 224–238. https://doi.org/10.1111/jan.15072

Prasetya, W. A. (2023). Item quality analysis using the rasch model to measure …. JPEP. https://scholarhub.uny.ac.id/jpep/vol27/iss1/7/

Salafiya, T. T., Kamila, F. I., Kimnu, K. P., & Risqiyani, L. (2025). Kesulitan mahasiswa pendidikan matematika Universitas Negeri Semarang dalam memahami konsep kalkulus integral [Difficulties faced by mathematics education students at Semarang State University in understanding the concept of integral calculus]. Jurnal Angka, 2(1), 93–108.

Smith, S. K., Benbenek, M. M., Bakker, C. J., & Bockwoldt, D. (2022). Scoping review: Diagnostic reasoning as a component of clinical reasoning in the U . S . Primary Care Nurse Practitioner Education, June, 3869–3896. https://doi.org/10.1111/jan.15414

Sugiyono. (2014). Metode penelitian bisnis: Pendekatan kuantitatif, kualitatif, kombinasi, dan R&D [Business research methods: Quantitative, qualitative, mixed-methods, and R&D approaches]. Alfabeta.

Wagner, J. G. (2020). Pharmacokinetics for the pharmaceutical scientist. In pharmacokinetics for the pharmaceutical scientist. Springer. https://doi.org/10.1201/9780203743652

Wai, L., Yim, K., Lye, C. Y., & Koh, P. W. (2024). A psychometric evaluation of an item bank for an English reading comprehension tool using Rasch analysis. 10(1), 18–34.

Wright, B. D., & Masters, G. N. (1982). Rating scale analysis. MESA Press.

Zenisky, A. L., Hambleton, R. K., & Sireci, S. G. (2003). Effects of local item dependence on the validity of irt item, test, and ability statistics. University of Massachusetts at Amherst. Medical College Admission Test.

Downloads

Published

30-04-2026

How to Cite

Sabrina, A., Savitri, F., & Tuahuns, F. (2026). Evaluation of pharmacy mathematics assessment items using the Rasch model. Jurnal Elemen, 12(2), 537–553. https://doi.org/10.29408/jel.v12i2.33850

Issue

Section

Articles

Similar Articles

<< < 30 31 32 33 34 35 

You may also start an advanced similarity search for this article.