Development of a Mathematics Test with a COVID-19 Context for Grade VIII SMP/MTs Students

Anggit Prabowo, Jarnawi Afgani Dahlan


This study aims to develop a mathematics test with a COVID-19 context to measure the skill competencies of grade VIII SMP/MTs students. It is a research and development study that followed these test development procedures: compiling the blueprint, writing items, reviewing items, trying out the test, analyzing the tryout results, and revising. The test consists of multiple-choice items with four options each. The items were content-validated by three experts (a teacher, a lecturer, and a mathematics trainer) and tried out with 86 students. The study produced a COVID-19-context mathematics test of ten items, all of which the experts declared valid. The tryout identified two poor items (numbers 6 and 7): item 7 was too easy, and item 6 had a poor discrimination index. In addition, two distractors in item 7 did not function. The estimated reliability coefficient of the measurement results was 0.658, which the study considers quite high. The poor items were revised to improve their difficulty and discrimination indices. The good and revised items were then assembled into a mathematics test set for grade VIII students.
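The item statistics named in the abstract (difficulty index, discrimination index, distractor function, and reliability) can be illustrated with a minimal sketch from classical test theory. The abstract does not state which formulas or cut-offs the study used, so everything below is an assumption for illustration only: the upper-lower 27% grouping for discrimination, KR-20 as the reliability estimate, the 5% selection threshold for a functioning distractor, and all function names.

```python
from collections import Counter
from statistics import pvariance


def item_difficulty(scores):
    """Proportion of students answering each item correctly.

    scores: list of rows, one per student, each a list of 0/1 item scores.
    """
    n_students = len(scores)
    n_items = len(scores[0])
    return [sum(row[j] for row in scores) / n_students for j in range(n_items)]


def item_discrimination(scores, fraction=0.27):
    """Upper-lower discrimination index per item (assumed 27% groups)."""
    ranked = sorted(scores, key=sum, reverse=True)  # highest totals first
    g = max(1, round(len(ranked) * fraction))       # size of each group
    upper, lower = ranked[:g], ranked[-g:]
    n_items = len(scores[0])
    return [
        (sum(r[j] for r in upper) - sum(r[j] for r in lower)) / g
        for j in range(n_items)
    ]


def kr20(scores):
    """Kuder-Richardson 20 reliability for dichotomous (0/1) items."""
    k = len(scores[0])
    p = item_difficulty(scores)
    total_variance = pvariance([sum(row) for row in scores])
    sum_pq = sum(pj * (1 - pj) for pj in p)
    return (k / (k - 1)) * (1 - sum_pq / total_variance)


def functioning_distractors(responses, key, options="ABCD", threshold=0.05):
    """Flag each distractor as functioning if chosen by >= threshold of students.

    responses: list of chosen options for ONE item; key: the correct option.
    The 5% threshold is an assumed convention, not taken from the study.
    """
    counts = Counter(responses)
    n = len(responses)
    return {
        opt: counts.get(opt, 0) / n >= threshold
        for opt in options
        if opt != key
    }


# Hypothetical 0/1 score matrix: 6 students x 4 items (not the study's data).
sample = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
print(item_difficulty(sample))
print(item_discrimination(sample))
print(round(kr20(sample), 3))
```

Under these assumptions, an item with a difficulty index near 1 would be flagged as too easy, and an item with a discrimination index near 0 would fail to separate stronger from weaker students, which matches the kinds of problems the abstract reports for items 7 and 6.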


COVID-19; mathematics; test



Copyright (c) 2020 Jurnal Elemen

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
