A Model-Based Approach for Analysis of Spatial Structure in Genetic Data

TitleA Model-Based Approach for Analysis of Spatial Structure in Genetic Data
Publication TypeJournal Article
Year of Publication2012
AuthorsYang, W-Y., Novembre J., Eskin E., & Halperin E.
Published inNature Genetics
Other Numbers3309

Characterizing genetic diversity within and between populations has broad applications in studies of human disease and evolution. We propose a new approach, spatial ancestry analysis, for the modeling of genotypes in two- or three-dimensional space. In spatial ancestry analysis (SPA), we explicitly model the spatial distribution of each SNP by assigning an allele frequency as a continuous function in geographic space. We show that the explicit modeling of the allele frequency allows individuals to be localized on the map on the basis of their genetic information alone. We apply our SPA method to a European and a worldwide population genetic variation data set and identify SNPs showing large gradients in allele frequency, and we suggest these as candidate regions under selection. These regions include SNPs in the well-characterized LCT region, as well as at loci including FOXP2, OCA2 and LRP1B.


W.-Y.Y. and E.E. are supported by grants from the US National Science Foundation (0513612, 0731455, 0729049, 0916676 and 1065276) and the US National Institutes of Health (K25 HL080079, U01 DA024417, P01 HL30568 and PO1 HL28481).J.N. is supported by National Science Foundation grant (0933731) and by the Searle Scholars Program. E.H. is a faculty fellow of the Edmond J. Safra Program at Tel Aviv University and was supported in part by the Israeli Science Foundation(grant 04514831) and by IBM open collaborative research award program.

Bibliographic Notes

Nature Genetics, Vol. 44, pp. 725-731

Abbreviated Authors

W.-Y. Yang, J. Novembre, E. Eskin, and E. Halperin

ICSI Research Group


ICSI Publication Type

Article in journal or magazine