Web-Based Automatic Code Evaluation System Using Claude AI for Programming Education

Authors

  • Paulus Lucky Tirma Irawan Universitas Ma Chung
  • Windra Swastika Universitas Ma Chung

DOI:

https://doi.org/10.37802/joti.v8i1.1305

Keywords:

Artificial Intelligence, Automatic Evaluation, Claude AI, Feedback System, Programming Learning

Abstract

Algorithm and Programming learning face challenges in providing fast and personalized feedback to students. The manual evaluation process conducted by lecturers requires considerable time, hindering students' iterative learning process. This study aims to develop a web submission platform prototype with automatic feedback based on Claude AI to support and enhance the programming learning process. The research method employs a Research and Development (R&D) approach with four stages: needs analysis, system design and planning, platform implementation, and testing and evaluation. The platform was developed using a PHP backend, MySQL database, and Claude AI integration through RESTful web services with a cascading AI evaluation strategy. Evaluation was conducted on 9 students with 39 submissions for three Java assignments with different difficulty levels. Results show the system successfully provides high-quality feedback with an average response time of 2.8 seconds and 100% evaluation success rate. Score distribution shows average improvement from the first assignment (82.3) to the third assignment (87.1), indicating a positive trend in iterative learning. A satisfaction survey of 8 respondents shows the system interface is user-friendly, and AI feedback helps identify syntax and program logic errors. Students made an average of 3.2 attempts per assignment, demonstrating high engagement in the learning process.

Downloads

Download data is not yet available.

References

Hao, Q., Smith IV, D.H., Ding, L., Ko, A., Ottaway, C., Wilson, J., Arakawa, K.H., Turcan, A., Poehlman, T. and Greer, T., "Towards understanding the effective design of automated formative feedback for programming assignments," Computer Science Education, vol. 32, no. 1, pp. 105-127, 2022.

M. M. Fakhri, et. al., "Barriers to Effective Learning: Examining the Influence of Delayed Feedback on Student Engagement and Problem Solving Skills in Ubiquitous Learning Programming," J. Appl. Sci. Eng. Tech. Educ., vol. 6, no. 1, pp. 69-79, Jul. 2024.

Z. Zhou, S. Wang, dan Y. Qian, "Learning From Errors: Exploring the Effectiveness of Enhanced Error Messages in Learning to Program," Front. Psychol., vol. 12, Nov. 2021

Barra, E., López-Pernas, S., Alonso, Á., Sánchez-Rada, J.F., Gordillo, A. and Quemada, J., "Automated assessment in programming courses: A case study during the COVID-19 era," Sustainability, vol. 12, no. 18, Art. no. 7451, 2020.

Azaiz, I., Kiesler, N. and Strickroth, S., "Feedback-generation for programming exercises with gpt-4," in Proc. 2024 Innovation and Technology in Computer Science Education V. 1, pp. 31-37, 2024.

H. Atun dan E. Usta, "The effects of programming education planned with TPACK framework on learning outcomes," Participatory Educational Research, vol. 10, no. 3, pp. 67-82, 2023.

M. Beege, et. al., "Learning programming from erroneous worked-examples. Which type of error is beneficial for learning?", Learning and Instruction, vol. 75, p. 101497, 2021.

H.-C. Ling dan H.-S. Chiang, "Learning Performance in Adaptive Learning Systems: A Case Study of Web Programming Learning Recommendations," Frontiers in Psychology, vol. 13, p. 770637, Jan. 2022.

M. Kwak, J. Jenkins, dan J. Kim, "Adaptive programming language learning system based on generative AI," International Journal of Intelligent Systems, vol. 24, no. 3, pp. 222-231, 2023.

Croft, D. and England, M., "Computing with CodeRunner at Coventry University: Automated summative assessment of Python and C++ code," in Proc. 4th Conference on Computing Education Practice, pp. 1-4, Jan. 2020.

Vetrivel, S.C., Arun, V.P., Ambikapathi, R. and Saravanan, T.P., "Automated Grading Systems: Enhancing Efficiency and Consistency in Student Assessments," in Adopting Artificial Intelligence Tools in Higher Education, CRC Press, pp. 41-61, 2025.

N. Scholz, et.al., "Partnering with AI: A Pedagogical Feedback System for LLM Integration into Programming Education," arXiv preprint arXiv:2507.00406, Jul. 2025.

Messer, M., Brown, N.C., Kölling, M. and Shi, M., 2024. Automated grading and feedback tools for programming education: A systematic review. ACM Transactions on Computing Education, 24(1), pp.1-43.

Souza, D.M., Felizardo, K.R. and Barbosa, E.F., "A systematic literature review of assessment tools for programming assignments," Computer Applications in Engineering Education, vol. 24, no. 6, pp. 752-774, 2016.

Paiva, J.C., Leal, J.P. and Figueira, Á., "Automated assessment in computer science education: A state-of-the-art review," ACM Transactions on Computing Education, vol. 22, no. 3, pp. 1-40, 2022.

Mertler, C.A., Introduction to educational research. Sage publications, 2024.

L. Afriani, "Understanding the design of research and development methods in the field of education," IJESS Int. J. Educ. Soc. Sci., vol. 6, no. 1, pp. 1-5, 2025.

Ahmad, S.I., Rana, T. and Maqbool, A., "A model-driven framework for the development of MVC-based (Web) application," Arabian Journal for Science and Engineering, vol. 47, no. 2, pp. 1733-1747, 2022.

S. Suarman, M. M. Afandi, dan M. K. Sari, "A Needs Analysis for Research-Based TPACK Models," Alishlah J. Stud. Islam., vol. 12, no. 1, pp. 45-60, Jan. 2025.

A. Ahmed et al., "Design and implementation of a responsive web-based system for controlling the financial budget of universities," Journal of Technology and Informatics (JoTI), vol. 5, no. 1, pp. 1-7, 2023.

A. Nugroho et al., "UI/UX design of a web-based student organizations system using the design thinking method approach," Journal of Technology and Informatics (JoTI), vol. 7, no. 1, pp. 24-38, 2025

Downloads