Comparing Accuracy, Language, and Pedagogy in AI-Generated Exponential Equation Solutions
DOI:
https://doi.org/10.17509/j-mer.v7i1.959Keywords:
ChatGPT, MathGPT, DeepSeek, Gemini, Generative AI, Exponential EquationAbstract
The growth of generative AI has transformed how mathematics is learned by providing instant solutions to a wide range of problems. This study evaluates four popular AI models, namely MathGPT, ChatGPT, DeepSeek, and Gemini AI, on three exponential equation problems. The assessment is based on three aspects: accuracy, language, and pedagogy. Accuracy ensures the answers are correct, consistent, and directly verified. Language assesses the narrative style in procedural steps, the use of appropriate notation, and the use of formal terminology. Pedagogy examines the scaffolding components, how they are communicated, and their suitability for different types of learners. The results show that all four AI models can provide correct answers, although their explanations vary. DeepSeek explains in detail. MathGPT excels at using formal notation and terminology; ChatGPT offers concise solutions with step-by-step guidance, while Gemini AI constructs intelligent narrative scaffolding. These findings indicate that AI answer quality depends not only on correctness but also on how the explanations are delivered and their suitability for learners' needs. Educators and developers can leverage each AI's strengths to create more adaptive and targeted learning tools.
Downloads
References
Anua, S. M., Chong, J., Nasir, J. N. M., Zamri, M. M. M., Lihat, N., & Gulpi, N. A. (2025). Isu Keselamatan dan Etika Penggunaan Kecerdasan Buatan. Journal of Energy and Safety Technology (JEST). https://doi.org/10.11113/jest.v8.195
Arifin, M., Zulkarnain, I., & Ansori, H. (2025). The influence of artificial intelligence on critical thinking ability in mathematics: A systematic literature review. Indonesian Journal of Science and Mathematics Education. https://doi.org/10.24042/ijsme.v8i1.24352
Benvenuti, M., Cangelosi, A., Weinberger, A., Mazzoni, E., Benassi, M., Barbaresi, M., & Orsoni, M. (2023). Artificial intelligence and human behavioral development: A perspective on new skills and competences acquisition for the educational context. Computers in Human Behavior, 148, 107903. https://doi.org/10.1016/j.chb.2023.107903
Canonigo, A. M. (2024). Levering AI to enhance students’ conceptual understanding and confidence in mathematics. Journal of Computer Assisted Learning, 40(6), 3215–3229. https://doi.org/10.1111/jcal.13065
Copur-Gencturk, Y., Li, J., & Atabas, S. (2024). Improving Teaching at Scale: Can AI Be Incorporated Into Professional Development to Create Interactive, Personalized Learning for Teachers? American Educational Research Journal, 61(4), 767–802. https://doi.org/10.3102/00028312241248514
Hubert, T., Mehta, R. S., Sartran, L., Horváth, M. Z., Žužić, G., Wieser, E., Huang, A., Schrittwieser, J., Schroecker, Y., Masoom, H., Bertolli, O., Zahavy, T., Mandhane, A., Yung, J., Beloshapka, I., Ibarz, B., Veeriah, V., Yu, L., Nash, O., . . . Silver, D. (2025). Olympiad-level formal mathematical reasoning with reinforcement learning. Nature, 651, 607 - 613. https://doi.org/10.1038/s41586-025-09833-y
Hwang, S. (2022). Examining the Effects of Artificial Intelligence on Elementary Students’ Mathematics Achievement: A Meta-Analysis. Sustainability, 14(20), 13185. https://doi.org/10.3390/su142013185
Islam, K., Saidah, N. L., Yahya, S., & Syaikhuddin, M. M. (2026). Evaluating AI Tutor Interaction with Ambiguous Junior High Mathematics Questions Using Black-Box. RIGGS: Journal of Artificial Intelligence and Digital Business. https://doi.org/10.31004/riggs.v5i1.6363
Macina, J., Daheim, N., Hakimi, I., Kapur, M., Gurevych, I., & Sachan, M. (2025). MathTutorBench: A Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors. ArXiv, abs/2502.18940. https://doi.org/10.48550/arxiv.2502.18940
Maulida, L., Nurossobah, P., Aura, B. A., Nengsih, E. D., & Rasilah, R. (2024). Improving The Effectiveness of Mathematics Learning Through Artificial Intelligence: Literature Review. Journal of General Education and Humanities. https://doi.org/10.58421/gehu.v3i4.267
Moon, I. H. (2025). Benchmarking Large Language Models for Calculus Problem-Solving: A Comparative Analysis. ArXiv, abs/2504.13187. https://doi.org/10.48550/arxiv.2504.13187
Opesemowo, O. A. G., & Ndlovu, M. (2024). Artificial intelligence in mathematics education: The good, the bad, and the ugly. Journal of Pedagogical Research. https://doi.org/10.33902/JPR.202426428
Rahmatika, W., Mursalin, M., Saputra, E., & Fonna, M. (2025). The Influence of the Use of AI in Mathematics Learning at the High School Level: A Systematics Literature Review. Electronic Journal of Education, Social Economics and Technology. https://doi.org/10.33122/ejeset.v6i2.1068
Sa'adah, N., Faizah, S., Sa'dijah, C., Khabibah, S., & Kurniati, D. (2023). Students' Mathematical Thinking Process in Algebraic Verification Based on Crystalline Concept. Mathematics Teaching Research Journal, 15(1), 90-107.
Scarlatos, A., & Lan, A. S. (2023). Tree-Based Representation and Generation of Natural and Mathematical Language. 3714-3730. https://doi.org/10.48550/arxiv.2302.07974
Wardat, Y., Tashtoush, M. A., AlAli, R., & Jarrah, A. M. (2023). ChatGPT: A revolutionary tool for teaching and learning mathematics. Eurasia Journal of Mathematics, Science and Technology Education, 19(7). https://doi.org/10.29333/ejmste/13272
Xia, S., Li, X., Liu, Y., Wu, T., & Liu, P. (2024). Evaluating Mathematical Reasoning Beyond Accuracy. 27723-27730. https://doi.org/10.48550/arxiv.2404.05692
Zhang, J., & Zhang, Z. (2024). AI in teacher education: Unlocking new dimensions in teaching support, inclusive learning, and digital literacy. Journal of Computer Assisted Learning, 40(4), 1871–1885. https://doi.org/10.1111/jcal.12988

