Generative AI in mathematics education: Considerations for academic integrity and assessment strategies
DOI:
https://doi.org/10.29408/jel.v12i2.33851Keywords:
academic integrity, assessment redesign, ChatGPT, educational assessment, generative AIAbstract
The rapid advancement of generative artificial intelligence (GenAI), particularly tools like ChatGPT, has introduced both opportunities and challenges for academic assessment in higher education. This systematic review explores how GenAI has influenced academic integrity concerns and highlights the assessment redesign strategies proposed or implemented in response. Drawing from 18 peer-reviewed articles published between 2022 and 2025, the review identifies seven key thematic areas: integration of GenAI in educational settings, pedagogical opportunities, integrity-related challenges, impacts on critical thinking and originality, educator and student perspectives, practical implementation outcomes, and strategic recommendations. While GenAI offers personalized feedback, improved access, and scaffolding for learning, it also raises critical issues including plagiarism, superficial engagement, and the erosion of authorship. The review further reveals a lack of institutional policy, inconsistent ethical guidelines, and disparities in GenAI access among students. In response, researchers advocate for AI-resilient assessment models, ethical literacy, and adaptive institutional frameworks. The findings underscore the need for a proactive, pedagogically informed approach to redesigning assessments that not only embrace the potential of GenAI but also safeguard academic standards and educational integrity
References
Acopio, M. K. M. G. (2025). Technological proficiency and online resource utilization in mathematics education: A study of higher education instructors in the Philippines. Jurnal Elemen, 11(4), 845-859. https://doi.org/10.29408/jel.v11i4.32134
Ali, O., Murray, P. A., Momin, M., Dwivedi, Y. K., & Malik, T. (2024). The effects of artificial intelligence applications in educational settings: Challenges and strategies. Technological Forecasting and Social Change, 199, 123076. https://doi.org/10.1016/j.techfore.2023.123076
Almpanis, T., Conroy, D., & Joseph-Richard, P. (2025). Practical implications of generative AI on assessment: Snapshot of early reactions to assessment redesign in an HRM and a psychology course. Electronic Journal of E-Learning, 23(3), 19–29. https://doi.org/10.34190/ejel.23.3.3971
Ateeq, A., Alzoraiki, M., Milhem, M., & Ateeq, R. A. (2024). Artificial intelligence in education: implications for academic integrity and the shift toward holistic assessment. Frontiers in Education, 9. https://doi.org/10.3389/feduc.2024.1470979
Bellido-García, R. S., Venturo-Orbegoso, C. O., Cruzata-Martínez, A., Sarmiento-Villanueva, E. B., Corro-Quispe, J., & Rejas-Borjas, L. G. (2024). Involvement of the student in their learning: Effects of formative assessment on competency development. Eurasia Journal of Mathematics, Science and Technology Education, 20(5), em2440. https://doi.org/10.29333/ejmste/14453
Bernal, M. E. (2024). Revolutionizing elearning assessments: The role of GPT in crafting dynamic content and feedback. Journal of Artificial Intelligence and Technology, 4(3), 188–199. https://doi.org/10.37965/jait.2024.0513
Beynen, T. (2024). The role of students. In Assessment Literacies in Navigating University Assessment, GenAI, and Academic Integrity A journal of educational research and practice (Vol. 33, Issue 3). https://journals.library.brocku.ca/brocked
Carbonel, H., Belardi, A., Ross, J., & Jullien, J. M. (2025). Integrity and motivation in remote assessment. Online Learning Journal, 29(2), 25–46. https://doi.org/10.24059/olj.v29i2.4309
Chaudhry, I. S., Sarwary, S. A. M., El Refae, G. A., & Chabchoub, H. (2023). Time to revisit existing student’s performance evaluation approach in higher education sector in a new era of ChatGPT — A case study. Cogent Education, 10(1). https://doi.org/10.1080/2331186X.2023.2210461
Darling-Hammond, L., Flook, L., Cook-Harvey, C., Barron, B., & Osher, D. (2020). Implications for educational practice of the science of learning and development. Applied developmental science, 24(2), 97-140. https://doi.org/10.1080/10888691.2018.1537791
Farag, W. A., Nadeem, M., & Helal, M. (2024). Assessment transformation in the age of AI: Moving beyond the influence of generative tools. 2024 Mediterranean Smart Cities Conference (MSCC), 1–6. https://doi.org/10.1109/MSCC62288.2024.10697011
Findell, B., Swafford, J., & Kilpatrick, J. (Eds.). (2001). Adding it up: Helping children learn mathematics. National Academies Press. https://doi.org/10.17226/9822
Gander, T., & Harris, G. (2024). Understanding AI literacy for higher education students: Implications for assessment. He Rourou, 8. https://doi.org/10.54474/herourou.v1i1.10579
Gruenhagen, J. H., Sinclair, P. M., Carroll, J. A., Baker, P. R. A., Wilson, A., & Demant, D. (2024). The rapid rise of generative AI and its implications for academic integrity: Students’ perceptions and use of chatbots for assistance with assessments. Computers and Education: Artificial Intelligence, 7. https://doi.org/10.1016/j.caeai.2024.100273
Hsiao, Y. P., Klijn, N., & Chiu, M. S. (2023). Developing a framework to re-design writing assignment assessment for the era of Large Language Models. Learning: Research and Practice, 9(2), 148–158. https://doi.org/10.1080/23735082.2023.2257234
Ilieva, G., Yankova, T., Ruseva, M., & Kabaivanov, S. (2025). A framework for generative AI-driven assessment in higher education. Information, 16(6), 472. https://doi.org/10.3390/info16060472
Jiang, Y., Hao, J., Fauss, M., & Li, C. (2024). Detecting ChatGPT-generated essays in a large-scale writing assessment: Is there a bias against non-native English speakers? Computers & Education, 217, 105070. https://doi.org/10.1016/j.compedu.2024.105070
Johnson, S., Owens, E., Menendez, H., & Kim, D. (2024). Using ChatGPT-generated essays in library instruction. The Journal of Academic Librarianship, 50(2), 102863. https://doi.org/10.1016/j.acalib.2024.102863
Kasneci, E., Seßler, K., Küchemann, S., Bannert, M., Dementieva, D., Fischer, F., ... & Kasneci, G. (2023). ChatGPT for good? On opportunities and challenges of large language models for education. Learning and individual differences, 103, 102274. https://doi.org/10.1016/j.lindif.2023.102274
Khan, M. M., Dong, Y., & Manesh, N. A. (2023). Authentic assessment design for meeting the challenges of generative artificial intelligence. Proceedings - Frontiers in Education Conference, FIE. https://doi.org/10.1109/FIE58773.2023.10343376
Khlaif, Z. N., Alkouk, W. A., Salama, N., & Abu Eideh, B. (2025). Redesigning assessments for AI-enhanced learning: A framework for educators in the generative AI era. Education Sciences, 15(2). https://doi.org/10.3390/educsci15020174
Kofinas, A. K., Tsay, C. H. H., & Pike, D. (2025). The impact of generative AI on academic integrity of authentic assessments within a higher education context. British Journal of Educational Technology. https://doi.org/10.1111/bjet.13585
Kouam, A. W. F., & Muchowe, R. M. (2024). Exploring graduate students’ perception and adoption of AI chatbots in Zimbabwe: Balancing pedagogical innovation and development of higher-order cognitive skills. Journal of Applied Learning & Teaching, 7(1). https://doi.org/10.37074/jalt.2024.7.1.12
Lehane, S., & Wright, A. (2024). Designing authentic assessment to improve academic integrity. International Conference on Higher Education Advances, 564–571. https://doi.org/10.4995/HEAd24.2024.17136
Lithner, J. (2008). A research framework for creative and imitative reasoning. Educational Studies in mathematics, 67(3), 255-276. https://doi.org/10.1007/s10649-007-9104-2
Liu, X. (2025). Navigating uncharted waters: Teachers’ perceptions of and reactions to AI-induced challenges to assessment. Asia-Pacific Education Researcher, 34(2), 711–722. https://doi.org/10.1007/s40299-024-00890-x
Lukianenko, V., & Kornieva, Z. (2024). Generative AI in student essays: English teachers’ perspectives on effective assessment methods. XLinguae, 17(4), 235–250. https://doi.org/10.18355/XL.2024.17.04.14
Makridakis, S., Petropoulos, F., & Kang, Y. (2023). Large language models: Their success and impact. Forecasting, 5(3), 536–549. https://doi.org/10.3390/forecast5030030
Mao, J., Chen, B., & Liu, J. C. (2024). Generative artificial intelligence in education and its implications for assessment. TechTrends, 68(1), 58–66. https://doi.org/10.1007/s11528-023-00911-4
Maulana, A., Murtafiah, W.,Handhika, J.,&, Alvares, J. I. (2025). Integrating augmented reality with the e-IM3 structured thinking model to enhance problem-solving skills and learning interest in elementary spatial geometry. Jurnal Elemen, 11(4), 1030-1049. https://doi.org/10.29408/jel.v11i4.32139
Nikolic, S., Sandison, C., Haque, R., Daniel, S., Grundy, S., Belkina, M., Lyden, S., Hassan, G. M., & Neal, P. (2024). ChatGPT, Copilot, Gemini, SciSpace and Wolfram versus higher education assessments: an updated multi-institutional study of the academic integrity impacts of Generative Artificial Intelligence (GenAI) on assessment, teaching and learning in engineering. Australasian Journal of Engineering Education, 29(2), 126–153. https://doi.org/10.1080/22054952.2024.2372154
Niss, M., & Højgaard, T. (2019). Mathematical competencies revisited. Educational studies in mathematics, 102(1), 9-28. https://doi.org/10.1007/s10649-019-09903-9
Page, M. J., Moher, D., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., Shamseer, L., Tetzlaff, J. M., Akl, E. A., Brennan, S. E., Chou, R., Glanville, J., Grimshaw, J. M., Hróbjartsson, A., Lalu, M. M., Li, T., Loder, E. W., Mayo-Wilson, E., McDonald, S., … McKenzie, J. E. (2021). PRISMA 2020 explanation and elaboration: Updated guidance and exemplars for reporting systematic reviews. BMJ, n160. https://doi.org/10.1136/bmj.n160
Perkins, C., Furze, M., Roe, L., & Macvaugh, J. (2024). The artificial intelligence assessment scale (AIAS): A framework for ethical integration of generative AI in educational assessment. In Journal of University Teaching and Learning Practice (Issue 6).
Pratiwi, H., Suherman, Hasruddin, & Ridha, M. (2025). Between shortcut and ethics: Navigating the use of artificial intelligence in academic writing among Indonesian doctoral students. European Journal of Education, 60(2). https://doi.org/10.1111/ejed.70083
Saher, A. S., Ali, A. M. J., Amani, D., & Najwan, F. (2022). Traditional versus authentic assessments in higher education. Pegem Egitim ve Ogretim Dergisi, 12(1), 283–291. https://doi.org/10.47750/pegegog.12.01.29
Schoenfeld, A. H. (2016). Learning to think mathematically: Problem solving, metacognition, and sense making in mathematics (Reprint). Journal of education, 196(2), 1-38. https://doi.org/10.1177/00220574161960
Schultz, M., Young, K., K. Gunning, T., & Harvey, M. L. (2022). Defining and measuring authentic assessment: a case study in the context of tertiary science. Assessment & Evaluation in Higher Education, 47(1), 77–94. https://doi.org/10.1080/02602938.2021.1887811
Stack, M. (2023). Investigating an assessment design that prevents students from using ChatGPT as the sole basis to pass assessment at the tertiary level. E-Journal of Humanities, Arts and Social Sciences, 64–77. https://doi.org/10.38159/ehass.20234127
Stylianides, G. J. (2009). Reasoning-and-proving in school mathematics textbooks. Mathematical thinking and learning, 11(4), 258-288. https://doi.org/10.1080/10986060903253954
Tenakwah, E. S., Boadu, G., Tenakwah, E. J., Parzakonis, M., Brady, M., Kansiime, P., Said, S., Ayilu, R., Radavoi, C., & Berman, A. (2023). Generative AI and higher education assessments: A competency-based analysis. https://doi.org/10.21203/rs.3.rs-2968456/v2
Teng, M. F., Mizumoto, A., & Takeuchi, O. (2024). Understanding growth mindset, self-regulated vocabulary learning, and vocabulary knowledge. System, 122, 103255. https://doi.org/10.1016/j.system.2024.103255
Usher, M. (2025). Generative AI vs. instructor vs. peer assessments: a comparison of grading and feedback in higher education. Assessment & Evaluation in Higher Education, 1–16. https://doi.org/10.1080/02602938.2025.2487495
Vlachopoulos, D., & Makri, A. (2024). A systematic literature review on authentic assessment in higher education: Best practices for the development of 21st century skills, and policy considerations. Studies in Educational Evaluation, 83, 101425. https://doi.org/10.1016/j.stueduc.2024.101425
Wang, T. (2023, August). Navigating generative AI (ChatGPT) in higher education: Opportunities and challenges. In International Conference on Smart Learning Environments (pp. 215-225). Singapore: Springer Nature Singapore. https://doi.org/10.1007/978-981-99-5961-7_28
Weinhandl, R., Baldinger, S., & Riegler, V. (2025). Design characteristics for discovery learning within digital mathematics learning environments from students’ perspectives. International Journal of Science and Mathematics Education, 1-29. https://doi.org/10.1007/s10763-025-10619-x
Zhai, X. (2023). ChatGPT for next generation science learning. XRDS: Crossroads, The ACM Magazine for Students, 29(3), 42-46. https://doi.org/10.1145/3589649
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Kunti Robiatul Mahmudah, Nur Robiah Nofikusumawati Peni, Faida Musa'ad, Soth Chea, Sommay Shingphachanh

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with the Jurnal Elemen agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA 4.0).
- Authors are able to enter into separate, additional contractual arrangements for the distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
Jurnal Elemen is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License



