GeLaTo was created by Chiara Barbieri, Damián Blasi and Robert Forkel.

Chiara Barbieri:
genetic data curation, population genetics analysis, database assembly.
Damián Blasi:
conceptualization, project name.
Robert Forkel:
database structure, data curation and web interface.

The concept of GeLaTo was developed from conversations at the Department of Linguistic and Cultural Evolution at the Max Planck Institute for the Science of Human History, Jena, Germany, and the dataset was assembled at the University of Zurich.

Concept supervision: Russell Gray, Balthasar Bickel, Kentaro Shimizu.

The curation of the dataset for glottocode assignation was performed by Barbieri, Blasi and Forkel with the expertise of Harald Hammarström, and the support of various contributors: Søren Wichmann, Simon J. Greenhill, Balthasar Bickel, Russell Gray, Natalia Chousou-Polydouri, Paul Heggarty, Tom Güldemann, Matthias Urban, Brigitte Pakendorf, Jessica Ivani, Kellen Parker van Dam, Anne-Maria Fehn, Hiba Babiker, Nora Muheim.

Details on the contribution for each genetic population is listed in the dataset curation notes.

Chiara Barbieri was supported by funds from the URPP Evolution in Action program of the University of Zurich and from the NCCR Evolving Language grant of the Swiss National Science Fundation.


Glottolog data is curated in a public repository at https://github.com/gelato-org/gelato-data. We welcome suggestions for:

  • improving the glottocode matches to the genetic populations
  • suggesting genetic publications to include, which satisfy the following criteria: data compatible to the genetic panels considered, and population samples with an anthropological/linguistic background description
  • suggesting new genetic analysis and variables to compute over the dataset


Academic publications which deal with GeLaTo include:

  • Barbieri et al. 2022. A global analysis of matches and mismatches between human genetic and linguistic histories. PNAS. DOI: 10.1073/pnas.2122084119