Back

Building the CELTEC Corpus: Assessing Lexical Complexity in Chilean Higher Education L2 English Learners.

  • Published In: Pakistan Journal of Life & Social Sciences, 2025, v. 23, n. 1. P. 2505 1 of 3

  • Database: Academic Search Ultimate 2 of 3

  • Authored By: Jeldres, Pamela Saavedra; Rosales, Lucía Ubilla; Muñoz, Belén Muñoz 3 of 3

Abstract

This article introduces the development, construction, and potential applications of a learner corpus--Chilean English Language Teacher Education Corpus (CELTEC)--comprising 404 texts written by English as a foreign language (EFL) pre-service teachers enrolled at nine universities in Chile. The study outlines the methodology for creating this pseudo-longitudinal corpus, facilitating replication. It includes three cohorts representing years 3, 4, and 5 of a five-year undergraduate programme. Additionally, the study examines the lexical complexity of the corpus texts, focusing on constructs such as lexical density, diversity, and sophistication. Data were collected using the corpus query language (CQL) in Sketch Engine and analysed with freely available tools, TAALES 2.2 and TAALED 1.4, to calculate indices of lexical complexity within a multidimensional framework. The results reveal a slight developmental trend in lexical complexity across the CELTEC cohorts. Lexical density is moderate, averaging between 40-49%, yet increases incrementally with each academic year. Lexical diversity also shows improvement across cohorts; however, this growth does not consistently correspond to higher lexical sophistication in the texts. These findings have implications for English for General Academic Purposes (EGAP) pedagogy, both within the study's context and in broader educational settings. Specifically, they underscore the critical need for more explicit instruction in advanced academic vocabulary to address learners' specific needs effectively. [ABSTRACT FROM AUTHOR]

Additional Information

  • Source:Pakistan Journal of Life & Social Sciences. 2025/01, Vol. 23, Issue 1, p2505
  • Document Type:Article
  • Subject Area:Language and Linguistics
  • Publication Date:2025
  • ISSN:1727-4915
  • DOI:10.57239/PJLSS-2025-23.1.00196
  • Accession Number:186946382
  • Copyright Statement:Copyright of Pakistan Journal of Life & Social Sciences is the property of Elite Scientific Forum and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)

Looking to go deeper into this topic? Look for more articles on EBSCOhost.