AI-Mediated Speaking Practice for Aptis ESOL Preparation: An Explanatory Sequential Mixed-Methods Case Study of Vietnamese EFL University Students
Abstract:
Artificial intelligence is increasingly used in English language learning, yet empirical evidence remains uneven regarding how AI-mediated speaking practice supports performance in standardised speaking-test preparation. This explanatory sequential mixed-methods case study examined how Vietnamese EFL university students preparing for Aptis ESOL Speaking engaged with an AI-supported speaking programme. Forty-eight students participated in an eight-week intervention integrating Aptis-style speaking tasks, ChatGPT voice interaction, automatic speech recognition (ASR) feedback, teacher scaffolding, and reflective journals. Quantitative data included pre/post Aptis-style speaking scores, speaking anxiety ratings, willingness-to-communicate scores, and AI-use logs. Qualitative data were collected from semi-structured interviews, learner journals, and teacher field notes. The reported findings indicated a statistically significant increase in speaking performance from pre-test to post-test, alongside reduced speaking anxiety and increased willingness to communicate. Qualitative findings suggested that AI created a low-stakes rehearsal space, increased opportunities for output, supported lexical and pronunciation noticing, and enhanced learner autonomy. However, students also reported over-reliance on AI-generated scripts, uneven feedback quality, and difficulties transferring rehearsed fluency to spontaneous test performance. The study argues that AI is most pedagogically valuable when integrated as a scaffolded rehearsal partner rather than as a substitute teacher, examiner, or answer generator.
KeyWords:
artificial intelligence, Aptis ESOL, EFL speaking, ChatGPT, automatic speech recognition, mixed methods, speaking anxiety, case study.
References:
- Ali, J. K. M., Shamsan, M. A., Hezam, T. A., & Mohammed, A. A. Q. (2023). An exploratory study of EFL learners’ use of ChatGPT for language learning tasks. Languages, 8(3), 212. https://doi.org/10.3390/languages8030212
- Braun, V., & Clarke, V. (2006). Using thematic analysis in psychology. Qualitative Research in Psychology, 3(2), 77–101. https://doi.org/10.1191/1478088706qp063oa
- British Council. (2024a). Aptis ESOL General.
- British Council. (2024b). Aptis Speaking test: Practice and video.
- British Council. (2024c). Aptis scoring system.
- Bygate, M. (2001). Speaking. In R. Carter & D. Nunan (Eds.), The Cambridge guide to teaching English to speakers of other languages (pp. 14–20). Cambridge University Press.
- Chen, K. T. C. (2022). Speech-to-text recognition in university EFL learning. Education and Information Technologies, 27, 9857–9875.
- Chen, X., Xie, H., Zou, D., & Hwang, G. J. (2020). Application and theory gaps during the rise of artificial intelligence in education. Computers and Education: Artificial Intelligence, 1, 100002. https://doi.org/10.1016/j.caeai.2020.100002
- Chen, Y., Wang, Y., & Liu, C. (2024). The effectiveness of artificial intelligence on English language learning achievement: A meta-analysis. System, 125, 103428. https://doi.org/10.1016/j.system.2024.103428
- Cislowska, K., & Pena-Acuna, B. (2024). Learning English as a second language with artificial intelligence. Frontiers in Education, 9, 1490067. https://doi.org/10.3389/feduc.2024.1490067
- Council of Europe. (2020). Common European Framework of Reference for Languages: Learning, teaching, assessment – Companion volume. Council of Europe Publishing.
- Creswell, J. W., & Plano Clark, V. L. (2018). Designing and conducting mixed methods research (3rd ed.). SAGE.
- Derwing, T. M., & Munro, M. J. (2015). Pronunciation fundamentals: Evidence-based perspectives for L2 teaching and research. John Benjamins.
- Evers, K., & Chen, S. (2022). Effects of automatic speech recognition software on pronunciation for adults with different learning styles. Journal of Educational Computing Research, 60(3), 669–685.
- Fryer, L. K., Nakao, K., & Thompson, A. (2023). Chatbot-assisted language learning: A meta-analysis. Education and Information Technologies, 28, 11251–11275.
- Goh, C. C. M., & Burns, A. (2012). Teaching speaking: A holistic approach. Cambridge University Press.
- Horwitz, E. K., Horwitz, M. B., & Cope, J. A. (1986). Foreign language classroom anxiety. The Modern Language Journal, 70(2), 125–132. https://doi.org/10.1111/j.1540-4781.1986.tb05256.x
- Kohnke, L., Moorhouse, B. L., & Zou, D. (2023). ChatGPT for language teaching and learning. RELC Journal, 54(2), 537–550.
- Liu, J., Liu, X., & Yang, C. (2022). A study of college students’ perceptions of utilizing automatic speech recognition technology to assist English oral proficiency. Frontiers in Psychology, 13, 1049139. https://doi.org/10.3389/fpsyg.2022.1049139
- Lo, C. K., Yu, P. L. H., Xu, S., Ng, D. T. K., & Jong, M. S. Y. (2024). Exploring the application of ChatGPT in ESL/EFL education and related research issues: A systematic review of empirical studies. Smart Learning Environments, 11, 50.
- Long, M. H. (1996). The role of the linguistic environment in second language acquisition. In W. Ritchie & T. Bhatia (Eds.), Handbook of second language acquisition (pp. 413–468). Academic Press.
- MacIntyre, P. D., Clement, R., Dornyei, Z., & Noels, K. A. (1998). Conceptualizing willingness to communicate in a L2. The Modern Language Journal, 82(4), 545–562. https://doi.org/10.1111/j.1540-4781.1998.tb01286.x
- Meniado, J. C. (2023). The impact of ChatGPT on English language teaching, learning, and assessment: A rapid review of literature. Arab World English Journal, 14(4), 3–18.
- Schmidt, R. (1990). The role of consciousness in second language learning. Applied Linguistics, 11(2), 129–158. https://doi.org/10.1093/applin/11.2.129
- Shadiev, R., & Liu, J. (2022). Review of research on applications of speech recognition technology to assist language learning. ReCALL, 34(1), 1–16.
- Sun, W. (2023). The impact of automatic speech recognition technology on second language pronunciation and speaking skills of EFL learners: A mixed-methods investigation. Frontiers in Psychology, 14, 1210187. https://doi.org/10.3389/fpsyg.2023.1210187
- Swain, M. (2005). The output hypothesis: Theory and research. In E. Hinkel (Ed.), Handbook of research in second language teaching and learning (pp. 471–483). Lawrence Erlbaum.
- Tai, T. Y. (2022). Effects of intelligent personal assistants on EFL learners’ oral proficiency outside the classroom. Computer Assisted Language Learning, 37(5), 1054–1077.
- Tai, T. Y., & Chen, H. H. J. (2023). Comparing the effects of intelligent personal assistant-human and human-human interactions on EFL learners’ willingness to communicate. Computers & Education, 201, 104845. https://doi.org/10.1016/j.compedu.2023.104845
- Thanh, V. T. (2026). The relationship between procrastination behavior and students’ English learning outcomes. Journal of Psychology and Education, 32(3), 295–299.
- Vo .T. T (2026). Theoretical foundations of factors influencing grammatical accuracy in university students’ English writing in Ho Chi Minh City. Journal of Education and Society, 32(2), 87–97.
- Vygotsky, L. S. (1978). Mind in society: The development of higher psychological processes. Harvard University Press.
- Woodrow, L. (2006). Anxiety and speaking English as a second language. RELC Journal, 37(3), 308–328. https://doi.org/10.1177/0033688206071315
- Wray, A. (2002). Formulaic language and the lexicon. Cambridge University Press.
- Yin, R. K. (2018). Case study research and applications: Design and methods (6th ed.). SAGE.
- Zhang, C., Meng, Y., & Ma, X. (2024). Artificial intelligence in EFL speaking: Impact on enjoyment, anxiety, and willingness to communicate. System, 121, 103259. https://doi.org/10.1016/j.system.2023.103259