Generative AI in mathematics education: Considerations for academic integrity and assessment strategies

Authors

  • Kunti Robiatul Mahmudah Universitas Ahmad Dahlan
  • Nur Robiah Nofikusumawati Peni Universitas Ahmad Dahlan
  • Faida Musa'ad Universitas Muhammadiyah Sorong
  • Soth Chea Phnom Penh Teacher Education College
  • Sommay Shingphachanh Khangkhay Teacher Training College

DOI:

https://doi.org/10.29408/jel.v12i2.33851

Keywords:

academic integrity, assessment redesign, ChatGPT, educational assessment, generative AI

Abstract

The rapid advancement of generative artificial intelligence (GenAI), particularly tools like ChatGPT, has introduced both opportunities and challenges for academic assessment in higher education. This systematic review explores how GenAI has influenced academic integrity concerns and highlights the assessment redesign strategies proposed or implemented in response. Drawing from 18 peer-reviewed articles published between 2022 and 2025, the review identifies seven key thematic areas: integration of GenAI in educational settings, pedagogical opportunities, integrity-related challenges, impacts on critical thinking and originality, educator and student perspectives, practical implementation outcomes, and strategic recommendations. While GenAI offers personalized feedback, improved access, and scaffolding for learning, it also raises critical issues including plagiarism, superficial engagement, and the erosion of authorship. The review further reveals a lack of institutional policy, inconsistent ethical guidelines, and disparities in GenAI access among students. In response, researchers advocate for AI-resilient assessment models, ethical literacy, and adaptive institutional frameworks. The findings underscore the need for a proactive, pedagogically informed approach to redesigning assessments that not only embrace the potential of GenAI but also safeguard academic standards and educational integrity

Author Biographies

Kunti Robiatul Mahmudah, Universitas Ahmad Dahlan

The rapid advancement of generative artificial intelligence (GenAI), particularly tools like ChatGPT, has introduced both opportunities and challenges for academic assessment in higher education. This systematic review explores how GenAI has influenced academic integrity concerns and highlights the assessment redesign strategies proposed or implemented in response. Drawing from 18 peer-reviewed articles published between 2022 and 2025, the review identifies seven key thematic areas: integration of GenAI in educational settings, pedagogical opportunities, integrity-related challenges, impacts on critical thinking and originality, educator and student perspectives, practical implementation outcomes, and strategic recommendations. While GenAI offers personalized feedback, improved access, and scaffolding for learning, it also raises critical issues including plagiarism, superficial engagement, and the erosion of authorship. The review further reveals a lack of institutional policy, inconsistent ethical guidelines, and disparities in GenAI access among students. In response, researchers advocate for AI-resilient assessment models, ethical literacy, and adaptive institutional frameworks. The findings underscore the need for a proactive, pedagogically informed approach to redesigning assessments that not only embrace the potential of GenAI but also safeguard academic standards and educational integrity

Nur Robiah Nofikusumawati Peni, Universitas Ahmad Dahlan

The rapid advancement of generative artificial intelligence (GenAI), particularly tools like ChatGPT, has introduced both opportunities and challenges for academic assessment in higher education. This systematic review explores how GenAI has influenced academic integrity concerns and highlights the assessment redesign strategies proposed or implemented in response. Drawing from 18 peer-reviewed articles published between 2022 and 2025, the review identifies seven key thematic areas: integration of GenAI in educational settings, pedagogical opportunities, integrity-related challenges, impacts on critical thinking and originality, educator and student perspectives, practical implementation outcomes, and strategic recommendations. While GenAI offers personalized feedback, improved access, and scaffolding for learning, it also raises critical issues including plagiarism, superficial engagement, and the erosion of authorship. The review further reveals a lack of institutional policy, inconsistent ethical guidelines, and disparities in GenAI access among students. In response, researchers advocate for AI-resilient assessment models, ethical literacy, and adaptive institutional frameworks. The findings underscore the need for a proactive, pedagogically informed approach to redesigning assessments that not only embrace the potential of GenAI but also safeguard academic standards and educational integrity

Faida Musa'ad, Universitas Muhammadiyah Sorong

The rapid advancement of generative artificial intelligence (GenAI), particularly tools like ChatGPT, has introduced both opportunities and challenges for academic assessment in higher education. This systematic review explores how GenAI has influenced academic integrity concerns and highlights the assessment redesign strategies proposed or implemented in response. Drawing from 18 peer-reviewed articles published between 2022 and 2025, the review identifies seven key thematic areas: integration of GenAI in educational settings, pedagogical opportunities, integrity-related challenges, impacts on critical thinking and originality, educator and student perspectives, practical implementation outcomes, and strategic recommendations. While GenAI offers personalized feedback, improved access, and scaffolding for learning, it also raises critical issues including plagiarism, superficial engagement, and the erosion of authorship. The review further reveals a lack of institutional policy, inconsistent ethical guidelines, and disparities in GenAI access among students. In response, researchers advocate for AI-resilient assessment models, ethical literacy, and adaptive institutional frameworks. The findings underscore the need for a proactive, pedagogically informed approach to redesigning assessments that not only embrace the potential of GenAI but also safeguard academic standards and educational integrity

Soth Chea, Phnom Penh Teacher Education College

The rapid advancement of generative artificial intelligence (GenAI), particularly tools like ChatGPT, has introduced both opportunities and challenges for academic assessment in higher education. This systematic review explores how GenAI has influenced academic integrity concerns and highlights the assessment redesign strategies proposed or implemented in response. Drawing from 18 peer-reviewed articles published between 2022 and 2025, the review identifies seven key thematic areas: integration of GenAI in educational settings, pedagogical opportunities, integrity-related challenges, impacts on critical thinking and originality, educator and student perspectives, practical implementation outcomes, and strategic recommendations. While GenAI offers personalized feedback, improved access, and scaffolding for learning, it also raises critical issues including plagiarism, superficial engagement, and the erosion of authorship. The review further reveals a lack of institutional policy, inconsistent ethical guidelines, and disparities in GenAI access among students. In response, researchers advocate for AI-resilient assessment models, ethical literacy, and adaptive institutional frameworks. The findings underscore the need for a proactive, pedagogically informed approach to redesigning assessments that not only embrace the potential of GenAI but also safeguard academic standards and educational integrity

Sommay Shingphachanh, Khangkhay Teacher Training College

The rapid advancement of generative artificial intelligence (GenAI), particularly tools like ChatGPT, has introduced both opportunities and challenges for academic assessment in higher education. This systematic review explores how GenAI has influenced academic integrity concerns and highlights the assessment redesign strategies proposed or implemented in response. Drawing from 18 peer-reviewed articles published between 2022 and 2025, the review identifies seven key thematic areas: integration of GenAI in educational settings, pedagogical opportunities, integrity-related challenges, impacts on critical thinking and originality, educator and student perspectives, practical implementation outcomes, and strategic recommendations. While GenAI offers personalized feedback, improved access, and scaffolding for learning, it also raises critical issues including plagiarism, superficial engagement, and the erosion of authorship. The review further reveals a lack of institutional policy, inconsistent ethical guidelines, and disparities in GenAI access among students. In response, researchers advocate for AI-resilient assessment models, ethical literacy, and adaptive institutional frameworks. The findings underscore the need for a proactive, pedagogically informed approach to redesigning assessments that not only embrace the potential of GenAI but also safeguard academic standards and educational integrity

References

Acopio, M. K. M. G. (2025). Technological proficiency and online resource utilization in mathematics education: A study of higher education instructors in the Philippines. Jurnal Elemen, 11(4), 845-859. https://doi.org/10.29408/jel.v11i4.32134

Ali, O., Murray, P. A., Momin, M., Dwivedi, Y. K., & Malik, T. (2024). The effects of artificial intelligence applications in educational settings: Challenges and strategies. Technological Forecasting and Social Change, 199, 123076. https://doi.org/10.1016/j.techfore.2023.123076

Almpanis, T., Conroy, D., & Joseph-Richard, P. (2025). Practical implications of generative AI on assessment: Snapshot of early reactions to assessment redesign in an HRM and a psychology course. Electronic Journal of E-Learning, 23(3), 19–29. https://doi.org/10.34190/ejel.23.3.3971

Ateeq, A., Alzoraiki, M., Milhem, M., & Ateeq, R. A. (2024). Artificial intelligence in education: implications for academic integrity and the shift toward holistic assessment. Frontiers in Education, 9. https://doi.org/10.3389/feduc.2024.1470979

Bellido-García, R. S., Venturo-Orbegoso, C. O., Cruzata-Martínez, A., Sarmiento-Villanueva, E. B., Corro-Quispe, J., & Rejas-Borjas, L. G. (2024). Involvement of the student in their learning: Effects of formative assessment on competency development. Eurasia Journal of Mathematics, Science and Technology Education, 20(5), em2440. https://doi.org/10.29333/ejmste/14453

Bernal, M. E. (2024). Revolutionizing elearning assessments: The role of GPT in crafting dynamic content and feedback. Journal of Artificial Intelligence and Technology, 4(3), 188–199. https://doi.org/10.37965/jait.2024.0513

Beynen, T. (2024). The role of students. In Assessment Literacies in Navigating University Assessment, GenAI, and Academic Integrity A journal of educational research and practice (Vol. 33, Issue 3). https://journals.library.brocku.ca/brocked

Carbonel, H., Belardi, A., Ross, J., & Jullien, J. M. (2025). Integrity and motivation in remote assessment. Online Learning Journal, 29(2), 25–46. https://doi.org/10.24059/olj.v29i2.4309

Chaudhry, I. S., Sarwary, S. A. M., El Refae, G. A., & Chabchoub, H. (2023). Time to revisit existing student’s performance evaluation approach in higher education sector in a new era of ChatGPT — A case study. Cogent Education, 10(1). https://doi.org/10.1080/2331186X.2023.2210461

Darling-Hammond, L., Flook, L., Cook-Harvey, C., Barron, B., & Osher, D. (2020). Implications for educational practice of the science of learning and development. Applied developmental science, 24(2), 97-140. https://doi.org/10.1080/10888691.2018.1537791

Farag, W. A., Nadeem, M., & Helal, M. (2024). Assessment transformation in the age of AI: Moving beyond the influence of generative tools. 2024 Mediterranean Smart Cities Conference (MSCC), 1–6. https://doi.org/10.1109/MSCC62288.2024.10697011

Findell, B., Swafford, J., & Kilpatrick, J. (Eds.). (2001). Adding it up: Helping children learn mathematics. National Academies Press. https://doi.org/10.17226/9822

Gander, T., & Harris, G. (2024). Understanding AI literacy for higher education students: Implications for assessment. He Rourou, 8. https://doi.org/10.54474/herourou.v1i1.10579

Gruenhagen, J. H., Sinclair, P. M., Carroll, J. A., Baker, P. R. A., Wilson, A., & Demant, D. (2024). The rapid rise of generative AI and its implications for academic integrity: Students’ perceptions and use of chatbots for assistance with assessments. Computers and Education: Artificial Intelligence, 7. https://doi.org/10.1016/j.caeai.2024.100273

Hsiao, Y. P., Klijn, N., & Chiu, M. S. (2023). Developing a framework to re-design writing assignment assessment for the era of Large Language Models. Learning: Research and Practice, 9(2), 148–158. https://doi.org/10.1080/23735082.2023.2257234

Ilieva, G., Yankova, T., Ruseva, M., & Kabaivanov, S. (2025). A framework for generative AI-driven assessment in higher education. Information, 16(6), 472. https://doi.org/10.3390/info16060472

Jiang, Y., Hao, J., Fauss, M., & Li, C. (2024). Detecting ChatGPT-generated essays in a large-scale writing assessment: Is there a bias against non-native English speakers? Computers & Education, 217, 105070. https://doi.org/10.1016/j.compedu.2024.105070

Johnson, S., Owens, E., Menendez, H., & Kim, D. (2024). Using ChatGPT-generated essays in library instruction. The Journal of Academic Librarianship, 50(2), 102863. https://doi.org/10.1016/j.acalib.2024.102863

Kasneci, E., Seßler, K., Küchemann, S., Bannert, M., Dementieva, D., Fischer, F., ... & Kasneci, G. (2023). ChatGPT for good? On opportunities and challenges of large language models for education. Learning and individual differences, 103, 102274. https://doi.org/10.1016/j.lindif.2023.102274

Khan, M. M., Dong, Y., & Manesh, N. A. (2023). Authentic assessment design for meeting the challenges of generative artificial intelligence. Proceedings - Frontiers in Education Conference, FIE. https://doi.org/10.1109/FIE58773.2023.10343376

Khlaif, Z. N., Alkouk, W. A., Salama, N., & Abu Eideh, B. (2025). Redesigning assessments for AI-enhanced learning: A framework for educators in the generative AI era. Education Sciences, 15(2). https://doi.org/10.3390/educsci15020174

Kofinas, A. K., Tsay, C. H. H., & Pike, D. (2025). The impact of generative AI on academic integrity of authentic assessments within a higher education context. British Journal of Educational Technology. https://doi.org/10.1111/bjet.13585

Kouam, A. W. F., & Muchowe, R. M. (2024). Exploring graduate students’ perception and adoption of AI chatbots in Zimbabwe: Balancing pedagogical innovation and development of higher-order cognitive skills. Journal of Applied Learning & Teaching, 7(1). https://doi.org/10.37074/jalt.2024.7.1.12

Lehane, S., & Wright, A. (2024). Designing authentic assessment to improve academic integrity. International Conference on Higher Education Advances, 564–571. https://doi.org/10.4995/HEAd24.2024.17136

Lithner, J. (2008). A research framework for creative and imitative reasoning. Educational Studies in mathematics, 67(3), 255-276. https://doi.org/10.1007/s10649-007-9104-2

Liu, X. (2025). Navigating uncharted waters: Teachers’ perceptions of and reactions to AI-induced challenges to assessment. Asia-Pacific Education Researcher, 34(2), 711–722. https://doi.org/10.1007/s40299-024-00890-x

Lukianenko, V., & Kornieva, Z. (2024). Generative AI in student essays: English teachers’ perspectives on effective assessment methods. XLinguae, 17(4), 235–250. https://doi.org/10.18355/XL.2024.17.04.14

Makridakis, S., Petropoulos, F., & Kang, Y. (2023). Large language models: Their success and impact. Forecasting, 5(3), 536–549. https://doi.org/10.3390/forecast5030030

Mao, J., Chen, B., & Liu, J. C. (2024). Generative artificial intelligence in education and its implications for assessment. TechTrends, 68(1), 58–66. https://doi.org/10.1007/s11528-023-00911-4

Maulana, A., Murtafiah, W.,Handhika, J.,&, Alvares, J. I. (2025). Integrating augmented reality with the e-IM3 structured thinking model to enhance problem-solving skills and learning interest in elementary spatial geometry. Jurnal Elemen, 11(4), 1030-1049. https://doi.org/10.29408/jel.v11i4.32139

Nikolic, S., Sandison, C., Haque, R., Daniel, S., Grundy, S., Belkina, M., Lyden, S., Hassan, G. M., & Neal, P. (2024). ChatGPT, Copilot, Gemini, SciSpace and Wolfram versus higher education assessments: an updated multi-institutional study of the academic integrity impacts of Generative Artificial Intelligence (GenAI) on assessment, teaching and learning in engineering. Australasian Journal of Engineering Education, 29(2), 126–153. https://doi.org/10.1080/22054952.2024.2372154

Niss, M., & Højgaard, T. (2019). Mathematical competencies revisited. Educational studies in mathematics, 102(1), 9-28. https://doi.org/10.1007/s10649-019-09903-9

Page, M. J., Moher, D., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., Shamseer, L., Tetzlaff, J. M., Akl, E. A., Brennan, S. E., Chou, R., Glanville, J., Grimshaw, J. M., Hróbjartsson, A., Lalu, M. M., Li, T., Loder, E. W., Mayo-Wilson, E., McDonald, S., … McKenzie, J. E. (2021). PRISMA 2020 explanation and elaboration: Updated guidance and exemplars for reporting systematic reviews. BMJ, n160. https://doi.org/10.1136/bmj.n160

Perkins, C., Furze, M., Roe, L., & Macvaugh, J. (2024). The artificial intelligence assessment scale (AIAS): A framework for ethical integration of generative AI in educational assessment. In Journal of University Teaching and Learning Practice (Issue 6).

Pratiwi, H., Suherman, Hasruddin, & Ridha, M. (2025). Between shortcut and ethics: Navigating the use of artificial intelligence in academic writing among Indonesian doctoral students. European Journal of Education, 60(2). https://doi.org/10.1111/ejed.70083

Saher, A. S., Ali, A. M. J., Amani, D., & Najwan, F. (2022). Traditional versus authentic assessments in higher education. Pegem Egitim ve Ogretim Dergisi, 12(1), 283–291. https://doi.org/10.47750/pegegog.12.01.29

Schoenfeld, A. H. (2016). Learning to think mathematically: Problem solving, metacognition, and sense making in mathematics (Reprint). Journal of education, 196(2), 1-38. https://doi.org/10.1177/00220574161960

Schultz, M., Young, K., K. Gunning, T., & Harvey, M. L. (2022). Defining and measuring authentic assessment: a case study in the context of tertiary science. Assessment & Evaluation in Higher Education, 47(1), 77–94. https://doi.org/10.1080/02602938.2021.1887811

Stack, M. (2023). Investigating an assessment design that prevents students from using ChatGPT as the sole basis to pass assessment at the tertiary level. E-Journal of Humanities, Arts and Social Sciences, 64–77. https://doi.org/10.38159/ehass.20234127

Stylianides, G. J. (2009). Reasoning-and-proving in school mathematics textbooks. Mathematical thinking and learning, 11(4), 258-288. https://doi.org/10.1080/10986060903253954

Tenakwah, E. S., Boadu, G., Tenakwah, E. J., Parzakonis, M., Brady, M., Kansiime, P., Said, S., Ayilu, R., Radavoi, C., & Berman, A. (2023). Generative AI and higher education assessments: A competency-based analysis. https://doi.org/10.21203/rs.3.rs-2968456/v2

Teng, M. F., Mizumoto, A., & Takeuchi, O. (2024). Understanding growth mindset, self-regulated vocabulary learning, and vocabulary knowledge. System, 122, 103255. https://doi.org/10.1016/j.system.2024.103255

Usher, M. (2025). Generative AI vs. instructor vs. peer assessments: a comparison of grading and feedback in higher education. Assessment & Evaluation in Higher Education, 1–16. https://doi.org/10.1080/02602938.2025.2487495

Vlachopoulos, D., & Makri, A. (2024). A systematic literature review on authentic assessment in higher education: Best practices for the development of 21st century skills, and policy considerations. Studies in Educational Evaluation, 83, 101425. https://doi.org/10.1016/j.stueduc.2024.101425

Wang, T. (2023, August). Navigating generative AI (ChatGPT) in higher education: Opportunities and challenges. In International Conference on Smart Learning Environments (pp. 215-225). Singapore: Springer Nature Singapore. https://doi.org/10.1007/978-981-99-5961-7_28

Weinhandl, R., Baldinger, S., & Riegler, V. (2025). Design characteristics for discovery learning within digital mathematics learning environments from students’ perspectives. International Journal of Science and Mathematics Education, 1-29. https://doi.org/10.1007/s10763-025-10619-x

Zhai, X. (2023). ChatGPT for next generation science learning. XRDS: Crossroads, The ACM Magazine for Students, 29(3), 42-46. https://doi.org/10.1145/3589649

Downloads

Published

07-05-2026

How to Cite

Mahmudah, K. R., Peni, N. R. N., Musa’ad, F., Chea, S., & Shingphachanh, S. (2026). Generative AI in mathematics education: Considerations for academic integrity and assessment strategies. Jurnal Elemen, 12(2), 554–576. https://doi.org/10.29408/jel.v12i2.33851

Issue

Section

Articles

Similar Articles

<< < 2 3 4 5 6 7 8 > >> 

You may also start an advanced similarity search for this article.