Developing and implementing an English-Spanish literary parallel audio-textual corpus for data-driven ESL learning

Authors

Keywords:

Data-Driven Learning (DDL), parallel corpus, audio-textual corpus, Second Language Acquisition (SLA)

Abstract

This paper presents the LITTERA corpus, an English-Spanish literary parallel speech corpus created for language learning, and sketches a few pedagogical applications for the study of English phonology by Spanish-speaking learners. The corpus comprises 25 literary texts aligned with their Spanish translations and accompanied by audio from the corresponding audiobooks. We describe its conception, composition and features in detail, and give a few examples of how LITTERA can be applied in language learning, particularly in oral comprehension and speech production.

Author Biography

Xavier Gómez Guinovart, SLI/TALG - Universidade de Vigo http://sli.uvigo.gal

Xavier Gómez Guinovart received his PhD in Computational Linguistics from the University of Santiago de Compostela in 1996. He is currently a professor at the University of Vigo. His research focuses mainly on language technologies and resources, machine translation, corpus linguistics and computational lexicography.

Published

2022-02-25

How to Cite

Lang, M., & Gómez Guinovart, X. (2022). Developing and implementing an English-Spanish literary parallel audio-textual corpus for data-driven ESL learning. DELTA: Documentação E Estudos Em Linguística Teórica E Aplicada, 37(1). Retrieved from https://revistas.pucsp.br/index.php/delta/article/view/46421

Issue

Section

Articles