Register variation remains stable across 60 languages

Haipeng Li, Jonathan Dunn, Andrea Nini

Research output: Contribution to journal › Article › peer-review


This paper measures the stability of register variation across languages. A register is a variety of a language that is associated with an extra-linguistic context. The relationship between a register and its context is functional: the linguistic features that make up a register are motivated by the needs and constraints of the communicative situation. On this view, register should be universal: we expect a stable relationship between the extra-linguistic context that defines a register and the set of linguistic features that the register contains. In this paper, the universality and robustness of register variation are tested by comparing variation within versus between register-specific corpora in 60 languages, using corpora produced in comparable communicative situations: tweets and Wikipedia articles. Our findings confirm the prediction that register variation is, in fact, universal.
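The within-versus-between comparison described in the abstract can be illustrated with a small sketch. The abstract does not state which linguistic features or distance measure the authors use, so everything below is an assumption made for illustration: character-trigram frequencies stand in for register features, cosine distance stands in for the similarity measure, and the sample texts are invented. The function names are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter
from itertools import combinations


def trigram_profile(text):
    """Relative frequencies of character trigrams (assumed stand-in features)."""
    if len(text) < 3:
        raise ValueError("text must contain at least three characters")
    grams = [text[i:i + 3] for i in range(len(text) - 2)]
    total = len(grams)
    return {gram: count / total for gram, count in Counter(grams).items()}


def cosine_distance(p, q):
    """1 - cosine similarity between two sparse frequency profiles."""
    dot = sum(value * q.get(key, 0.0) for key, value in p.items())
    norm_p = math.sqrt(sum(value * value for value in p.values()))
    norm_q = math.sqrt(sum(value * value for value in q.values()))
    return 1.0 - dot / (norm_p * norm_q)


def within_between(samples_a, samples_b):
    """Mean pairwise distance within each register and between the two registers."""
    profiles_a = [trigram_profile(s) for s in samples_a]
    profiles_b = [trigram_profile(s) for s in samples_b]
    within = [
        cosine_distance(x, y)
        for group in (profiles_a, profiles_b)
        for x, y in combinations(group, 2)
    ]
    between = [cosine_distance(x, y) for x in profiles_a for y in profiles_b]
    return sum(within) / len(within), sum(between) / len(between)


# Invented toy samples, not data from the paper.
tweets = [
    "omg cant believe this lol so good",
    "lol this is so good cant wait",
]
articles = [
    "The city is the capital of the province and was founded in 1820.",
    "The river is located in the northern region of the country.",
]
within, between = within_between(tweets, articles)
print(f"within={within:.3f} between={between:.3f}")
```

Under this reading, a register counts as stable in a given language when samples from the same register are closer to each other than to samples from the other register. Repeating the comparison per language would give one within/between pair for each of the 60 languages.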
Original language: English
Journal: Corpus Linguistics and Linguistic Theory
Early online date: 20 Sept 2022
Publication status: E-pub ahead of print - 20 Sept 2022

