SciELO - Scientific Electronic Library Online

 
vol.40 número1El español hablado por los malecus: caracterización general y reconocimiento como variedad particular"El surrealismo en la obra poética manifiesto olvidado" índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay articulos similaresSimilares en SciELO

Compartir


Káñina

versión On-line ISSN 2215-2636versión impresa ISSN 0378-0473

Resumen

VINOGRADOV, Igor. Linguistic corpora of understudied languages: do they make sense?. Káñina [online]. 2016, vol.40, n.1, pp.116-130. ISSN 2215-2636.  http://dx.doi.org/10.15517/rk.v40i1.24143.

A corpus of an understudied language usually has documentary-linguistic nature and comprises all text material available in a particular language. However, without resorting to text selection, it is impossible to obtain a representative and balanced sample of language use. Lack of these two characteristics makes a corpus almost useless for any kind of quantitative research. Nevertheless, corpora of understudied languages comply with a wide range of language documentation objectives. Furthermore, they can serve as evidence of the existence of word forms or grammatical features in texts that meet specific search criteria. If such corpora have well-elaborated linguistic annotation, they can complement grammatical descriptions and dictionaries, standing out against common text collections due to their digital format. They are especially suitable for typological research, when one has to deal with a huge amount of data in different and unrelated languages.

Palabras clave : corpus linguistics; understudied languages, language documentation; quantitative methods.

        · resumen en Español     · texto en Inglés     · Inglés ( pdf )