This project documents and explains a substantial number of grammatical universals by demonstrating a link between cross-linguistic patterns of language form and general trends of language use. The claim is that frequently expressed meanings tend to be expressed by short forms, not only at the level of words, but also throughout the grammars of languages around the world (form-frequency correspondences). A simple example is the asymmetry in the coding of present-tense forms and future-tense forms in the world’s languages, as one out of a multitude of analogous cases: Present-tense forms tend to be short or zero-coded, while future-tense forms tend to be longer or to have an overt marker. This corresponds to a usage asymmetry: Present-tense forms are generally more frequent than future-tense forms, in all languages. The proposed explanation is that higher-frequency items are more predictable than lower-frequency items, and predictable content need not be expressed overtly or can be expressed by shorter forms. Form-frequency correspondences thus make language structure more efficient, but it still needs to be shown that there exists a mechanism that creates and maintains these efficient structures: recurrent instances of language change driven by the speakers’ preference for user-friendly utterances.
Understanding linguistic diversity is important for society in a general way, especially at a time when Europe is become more linguistically diverse because of increasing migration.