Genetic Ancestry Estimates within Dutch Family Units and Across Genotyping Arrays: Insights from Empirical Analysis Using Two Estimation Methods

Accurate inference of genetic ancestry is crucial for population-based association studies, accounting for population heterogeneity and structure. This study analyzes genome-wide SNP data from the Netherlands Twin Register to compare genetic ancestry estimates. The focus is on the comparison of ancestry estimates between family members and individuals genotyped on multiple arrays (Affymetrix 6.0, Affymetrix Axiom, and Illumina GSA). Two conventional methods, principal component analysis and ADMIXTURE, were implemented to estimate ancestry, each serving its specific purpose, rather than for dir... Mehr ...

Verfasser: Jeffrey J. Beck
Talitha Ahmed
Casey T. Finnicum
Koos Zwinderman
Erik A. Ehli
Dorret I. Boomsma
Jouke Jan Hottenga
Dokumenttyp: Artikel
Erscheinungsdatum: 2023
Reihe/Periodikum: Genes, Vol 14, Iss 7, p 1497 (2023)
Verlag/Hrsg.: MDPI AG
Schlagwörter: within-family analysis / genetic ancestry estimation / population structure / principal components analysis (PCA) / ADMIXTURE / Genetics / QH426-470
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-28988837
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://doi.org/10.3390/genes14071497

Accurate inference of genetic ancestry is crucial for population-based association studies, accounting for population heterogeneity and structure. This study analyzes genome-wide SNP data from the Netherlands Twin Register to compare genetic ancestry estimates. The focus is on the comparison of ancestry estimates between family members and individuals genotyped on multiple arrays (Affymetrix 6.0, Affymetrix Axiom, and Illumina GSA). Two conventional methods, principal component analysis and ADMIXTURE, were implemented to estimate ancestry, each serving its specific purpose, rather than for direct comparison. The results reveal that as the degree of genetic relatedness decreases, the Euclidean distances of genetic ancestry estimates between family members significantly increase (empirical p < 0.001), regardless of the estimation method and genotyping array. Ancestry estimates among individuals genotyped on multiple arrays also show statistically significant differences (empirical p < 0.001). Additionally, this study investigates the relationship between the ancestry estimates of non-identical twin offspring with ancestrally diverse parents and those with ancestrally similar parents. The results indicate a statistically significant weak correlation between the variation in ancestry estimates among offspring and differences in ancestry estimates among parents (Spearman’s rho: 0.07, p = 0.005). This study highlights the utility of current methods in inferring genetic ancestry, emphasizing the importance of reference population composition in determining ancestry estimates.