DO WE NEED STATISTICS WHEN WE HAVE LINGUISTICS?

Pacual Cantos Gómez

Autores

Pacual Cantos Gómez Universidad de Murcia

Palavras-chave:

Quantitative analysis, Statistics, Language modelling, Linguistic corpora

Resumo

Statistics is known to be a quantitative approach to research. However, most of the research done in the fields of language and linguistics is of a different kind, namely qualitative. Succinctly, qualitative analysis differs from quantitative analysis is that in the former no attempt is made to assign frequencies, percentages and the like, to the linguistic features found or identified in the data. In quantitative research, linguistic features are classified and counted, and even more complex statistical models are constructed in order to explain these observed facts. In qualitative research, however, we use the data only for identifying and describing features of language usage and for providing real occurrences/examples of particular phenomena. In this paper, we shall try to show how quantitative methods and statistical techniques can supplement qualitative analyses of language. We shall attempt to present some mathematical and statistical properties of natural languages, and introduce some of the quantitative methods which are of the most value in working empirically with texts and corpora, illustrating the various issues with numerous examples and moving from the most basic descriptive techniques (frequency counts and percentages) to decision-taking techniques (chi-square and z-score) and to more sophisticated statistical language models (Type-Token/LemmaToken/Lemma-Type formulae, cluster analysis and discriminant function analysis).

DO WE NEED STATISTICS WHEN WE HAVE LINGUISTICS?

Autores

Palavras-chave:

Resumo

Downloads

Publicado

Como Citar

Edição

Seção

Qualis

Enviar Submissão

Idioma

Palavras-chave