Web-based speech-to-text tools to improve English language pronunciation: a descriptive study

  • Xavier Sulca Guale Technical University of Ambato
  • Adriana Nicole Lozano Celleri Technical University of Ambato
  • Marbella Cumanda Escalante Gamazo Technical University of Ambato
  • Rafael Ricardo Arias Huertas ISTE Higher University Technological Institute
Keywords: Web-based speech-to-text tools, Benefits, Pronunciation features, Pronunciation strategies.

Abstract

This study determined the importance of web-based speech-to-text tools for enhancing English pronunciation. A total of 73 university students (33 males and 40 females) participated in this descriptive, non-experimental research. Data were gathered through a survey comprising 29 Likert-scale items and 3 open-ended questions. The instrument was validated by experts and yielded a Cronbach's alpha coefficient of 0.856. The study was guided by three research questions. The results revealed that web-based speech-to-text tools are a good means of practicing pronunciation because they are free to access, adapt smoothly to the speaker's voice, and convert spoken words and phrases into text. Learners pointed out that these tools offer several benefits: they encourage learners to improve their pronunciation, speaking, and oral communication skills; they help students develop pronunciation features; and they promote autonomous work. Furthermore, learners use a range of strategies to improve their pronunciation. Most students preferred media-based practice, such as watching videos or listening to music or podcasts in English, and they usually repeat the pronunciation of sounds and words. However, pronunciation rules are not regarded as a principal strategy because learners do not know them well enough and because they are not included in the curriculum or lesson planning.
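
For readers unfamiliar with the reliability coefficient reported above, the following is a minimal sketch of how Cronbach's alpha is commonly computed from a respondents-by-items score matrix. The function name `cronbach_alpha` and the toy responses are illustrative assumptions; they are not the study's instrument or data, and the abstract does not say which software the authors used.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of rows (one row of item scores per respondent).

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    Sample variances (n - 1 denominator) are used throughout.
    """
    k = len(responses[0])  # number of items
    if k < 2 or len(responses) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    items = list(zip(*responses))  # transpose: one tuple of scores per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical 5-point Likert answers: 5 respondents x 3 items (invented data).
toy = [
    [4, 5, 4],
    [2, 3, 2],
    [5, 5, 4],
    [1, 2, 1],
    [3, 3, 3],
]
alpha = cronbach_alpha(toy)
print(round(alpha, 3))
```

Values closer to 1 indicate that the items vary together; the 0.856 reported for the 29-item survey would come from the same formula applied to the real responses.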

Published
2023-10-31