CrossLingference | Cross-Linguistic statistical inference using hierarchical Bayesian models

Summary
Historical linguistics and linguistic typology share the objective of explaining cross-linguistic variation. Their traditional research agendas have been largely disjoint though since historical linguistics strives for depth and typology for breadth. This tension has been replicated in current statistical and computational renderings of two sub-disciplines. Computational models of language change generally focus on individual language families, while statistical typology pays little attention to diachronic processes. CrossLingference will bridge this gap. Using Bayesian hierarchical models, the reach of modern phylogenetic linguistics will be extended to cross-family models, where each lineage is assumed to follow its own dynamics, but cross-family variation is constrained and data from one family are used to make inference about the processes in other families. At the same time, state-of-the-art generalized linear mixed models will be extended to control both for genealogical history and language contact. These model-based approaches will be complemented by agent-based simulations.
CrossLingference will implement this general programme for the following domains of application, securing a lasting impact both on statistical typology and on computational historical linguistics:
- Sound laws in language change, enabling automatic reconstruction of proto-language vocabulary,
- Causal relationships between typological variables.
- Factoring of universal tendencies, historical contingencies and language contact in explaining variation in
word-order types and inflectional paradigms.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/834050
Start date: 01-10-2019
End date: 30-09-2025
Total budget - Public funding: 2 500 000,00 Euro - 2 500 000,00 Euro
Cordis data

Original description

Historical linguistics and linguistic typology share the objective of explaining cross-linguistic variation. Their traditional research agendas have been largely disjoint though since historical linguistics strives for depth and typology for breadth. This tension has been replicated in current statistical and computational renderings of two sub-disciplines. Computational models of language change generally focus on individual language families, while statistical typology pays little attention to diachronic processes. CrossLingference will bridge this gap. Using Bayesian hierarchical models, the reach of modern phylogenetic linguistics will be extended to cross-family models, where each lineage is assumed to follow its own dynamics, but cross-family variation is constrained and data from one family are used to make inference about the processes in other families. At the same time, state-of-the-art generalized linear mixed models will be extended to control both for genealogical history and language contact. These model-based approaches will be complemented by agent-based simulations.
CrossLingference will implement this general programme for the following domains of application, securing a lasting impact both on statistical typology and on computational historical linguistics:
- Sound laws in language change, enabling automatic reconstruction of proto-language vocabulary,
- Causal relationships between typological variables.
- Factoring of universal tendencies, historical contingencies and language contact in explaining variation in
word-order types and inflectional paradigms.

Status

SIGNED

Call topic

ERC-2018-ADG

Update Date

27-04-2024
Images
No images available.
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
Horizon 2020
H2020-EU.1. EXCELLENT SCIENCE
H2020-EU.1.1. EXCELLENT SCIENCE - European Research Council (ERC)
ERC-2018
ERC-2018-ADG