Corpus Linguistics and Sketch Engine: the lexical selection of the São Francisco Settlement Project

Authors

DOI:

https://doi.org/10.5433/1519-5392.2024v24n1p106-130

Keywords:

Corpus Linguistics; Sketch Engine; Lexicon.

Abstract

This article reflects on the importance of Corpus Linguistics (CL) as a methodological contribution to the lexical selection of the oral corpus of the São Francisco Settlement Project (PA São Francisco), carried out with the help of the computer program Sketch Engine. CL is highlighted as a methodology that enables the probabilistic analysis of language data and the identification of patterns or trends in linguistic phenomena. It is also linked to the creation of electronic corpora and to the use of analysis software for reading both oral and written corpora. For example, Sketch Engine allows a study corpus to be compared with a reference corpus in order to highlight keywords through frequency analysis in different contexts. Thus, the methodological proposal of constructing an oral corpus and processing its data for future lexical analyses is aimed at exploring linguistic data in a language, especially in specific linguistic research contexts.
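
The comparison between a study corpus and a reference corpus mentioned in the abstract can be illustrated with a minimal sketch. It assumes the "simple maths" keyness ratio that Sketch Engine documents for keyword extraction (frequency per million in the study corpus plus a smoothing constant, divided by the same quantity in the reference corpus). The function names and the toy token lists are hypothetical; they are not taken from the article or from the PA São Francisco corpus.

```python
from collections import Counter


def per_million(count, total):
    """Normalize a raw count to a frequency per million tokens."""
    return count * 1_000_000 / total


def keyness(study_tokens, reference_tokens, smoothing=1.0):
    """Rank the words of the study corpus by a simple-maths keyness score.

    score = (fpm_study + smoothing) / (fpm_reference + smoothing)
    The smoothing constant avoids division by zero for words that are
    absent from the reference corpus (assumed value: 1.0).
    """
    study = Counter(study_tokens)
    reference = Counter(reference_tokens)
    n_study = len(study_tokens)
    n_reference = len(reference_tokens)

    scores = {}
    for word, count in study.items():
        fpm_study = per_million(count, n_study)
        fpm_reference = per_million(reference.get(word, 0), n_reference)
        scores[word] = (fpm_study + smoothing) / (fpm_reference + smoothing)

    # Highest score first: words most typical of the study corpus.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Invented toy data, for illustration only.
study_tokens = ["the", "field", "the", "flour", "field", "field"]
reference_tokens = ["the", "the", "the", "house", "the", "field"]

ranked = keyness(study_tokens, reference_tokens)
for word, score in ranked:
    print(f"{word}\t{score:.2f}")
```

In this toy example, a word that occurs only in the study corpus ranks first, while a function word that is more frequent in the reference corpus ranks last. A larger smoothing constant would shift the ranking toward more frequent words.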

Author Biographies

Andreza Marcião dos Santos, Federal University of Minas Gerais

PhD in Linguistic Studies from the Federal University of Minas Gerais. Her research interests include Sociolinguistics, Lexicology, and the teaching of Portuguese as a first language.

Maria Cândida Trindade Costa de Seabra, Federal University of Minas Gerais

PhD in Linguistic Studies from the Federal University of Minas Gerais (UFMG). Full Professor at the Faculty of Letters (FALE/UFMG).

Published

2024-06-30

How to Cite

SANTOS, Andreza Marcião dos; SEABRA, Maria Cândida Trindade Costa de. Corpus Linguistics and Sketch Engine: the lexical selection of the São Francisco Settlement Project. Entretextos, Londrina, v. 24, n. 1, p. 106–130, 2024. DOI: 10.5433/1519-5392.2024v24n1p106-130. Available at: https://ojs.uel.br/revistas/uel/index.php/entretextos/article/view/49565. Accessed: 21 Nov. 2024.
