Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Ochoa A, Storey JD. Estimating FST and kinship for arbitrary population structures. PLoS Genet 2021;17:e1009241. [PMID: 33465078 PMCID: PMC7846127 DOI: 10.1371/journal.pgen.1009241] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Revised: 01/29/2021] [Accepted: 11/02/2020] [Indexed: 12/20/2022] Open

For:	Ochoa A, Storey JD. Estimating FST and kinship for arbitrary population structures. PLoS Genet 2021;17:e1009241. [PMID: 33465078 PMCID: PMC7846127 DOI: 10.1371/journal.pgen.1009241] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Revised: 01/29/2021] [Accepted: 11/02/2020] [Indexed: 12/20/2022] Open

Number

Cited by Other Article(s)

Peláez P, Lorenzana GP, Baesen K, Montes JR, De La Torre AR. Spatially heterogeneous selection and inter-varietal differentiation maintain population structure and local adaptation in a widespread conifer. BMC Ecol Evol 2024;24:117. [PMID: 39227766 PMCID: PMC11373507 DOI: 10.1186/s12862-024-02304-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Accepted: 08/28/2024] [Indexed: 09/05/2024] Open

Abstract

BACKGROUND

Douglas-fir (Pseudotsuga menziesii [Mirb.] Franco) plays a critical role in the ecology and economy of Western North America. This conifer species comprises two distinct varieties: the coastal variety (var. menziesii) along the Pacific coast, and the interior variety (var. glauca) spanning the Rocky Mountains into Mexico, with instances of inter-varietal hybridization in Washington and British Columbia. Recent investigations have focused on assessing environmental pressures shaping Douglas-fir's genomic variation for a better understanding of its evolutionary and adaptive responses. Here, we characterize range-wide population structure, estimate inter-varietal hybridization levels, identify candidate loci for climate adaptation, and forecast shifts in species and variety distribution under future climates.

RESULTS

Using a custom SNP-array, we genotyped 540 trees revealing four distinct clusters with asymmetric admixture patterns in the hybridization zone. Higher genetic diversity observed in coastal and hybrid populations contrasts with lower diversity in inland populations of the southern Rockies and Mexico, exhibiting a significant isolation by distance pattern, with less marked but still significant isolation by environment. For both varieties, we identified candidate loci associated with local adaptation, with hundreds of genes linked to processes such as stimulus response, reactions to chemical compounds, and metabolic functions. Ecological niche modeling revealed contrasting potential distribution shifts among the varieties in the coming decades, with interior populations projected to lose habitat and become more vulnerable, while coastal populations are expected to gain suitable areas.

CONCLUSIONS

Overall, our findings provide crucial insights into the population structure and adaptive potential of Douglas-fir, with the coastal variety being the most likely to preserve its evolutionary path throughout the present century, which carry implications for the conservation and management of this species across their range.

Collapse

Mendoza-Maya E, Giles-Pérez GI, Vargas-Hernández JJ, Sáenz-Romero C, Martínez-Trujillo M, de Los Angeles Beltrán-Nambo M, Hernández-Díaz JC, Prieto-Ruíz JÁ, Jaramillo-Correa JP, Wehenkel C. Evolutionary drivers of reproductive fitness in two endangered forest trees. THE NEW PHYTOLOGIST 2024. [PMID: 39187985 DOI: 10.1111/nph.20073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Accepted: 08/06/2024] [Indexed: 08/28/2024]

Tenhunen S, Thomasen JR, Sørensen LP, Berg P, Kargo M. Genomic analysis of inbreeding and coancestry in Nordic Jersey and Holstein dairy cattle populations. J Dairy Sci 2024;107:5897-5912. [PMID: 38608951 DOI: 10.3168/jds.2023-24553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Accepted: 03/01/2024] [Indexed: 04/14/2024]

Abstract

In recent years, genomic selection (GS) has accelerated genetic gain in dairy cattle breeds worldwide. Despite the evident genetic progress, several dairy populations have also encountered challenges such as heightened inbreeding rates and reduced effective population sizes. The challenge has been to find a balance between achieving substantial genetic gain while managing genetic diversity within the population, thereby mitigating the negative effects of inbreeding depression. This study aims to elucidate the impact of GS on pedigree and genomic rates of inbreeding (ΔF) and coancestry (ΔC) in Nordic Jersey (NJ) and Holstein (NH) cattle populations. Furthermore, key genetic metrics, including the generation interval (L), effective population size (Ne), and future effective population size (FNe) were assessed between 2 time periods, before and after GS, and across distinct animal cohorts in both breeds: females, bulls, and approved semen-producing bulls (AI-sires). Analysis of ΔF and ΔC revealed distinct trends across the studied periods and animal groups. Notably, there was a consistent increase in yearly ΔF for most animal groups in both breeds. An exception was observed in NH AI-sires, which demonstrated a slight decrease in yearly ΔF. Moreover, NJ displayed minimal changes in yearly ΔC between the periods, whereas NH exhibited elevated ΔC values across all animal groups. Particularly striking was the substantial increase in yearly ΔC within the NH female population, surging from 0.02% to 0.39% between the periods. Implementation of GS resulted in a reduction of the generation interval across all animal cohorts in both NJ and NH breeds. However, the extent of reduction was more pronounced in males compared with females. This reduction in generation interval influenced generational changes in ΔF and ΔC. Bulls and AI-sires of both breeds exhibited reduced generational ΔF between periods, in contrast to females that demonstrated an opposing pattern. Between the periods, NJ maintained a relatively stable Ne (29.4 before and 30.3 after GS), whereas NH experienced a notable decline from 54.3 to 42.8. Female groups in both breeds displayed a negative Ne trend, whereas males demonstrated either neutral or positive Ne developments. Regarding FNe, NJ exhibited positive FNe development with an increase from 40.7 to 57.2. The opposite was observed in NH, where FNe decreased from 198.8 to 42.7. In summary, it was evident that the genomic methods could detect differences between the populations and changes in ΔF and ΔC more efficiently than pedigree methods. Implementation of GS yielded positive outcomes within the NJ population regarding the rate of coancestry but the opposite was observed with NH. Moreover, analysis of ΔC data hints at the potential to decrease future ΔF through informed mating strategies. Conversely, NH faces more pressing concerns, even though ΔF remains comparatively modest in contrast to what has been observed in other Holstein populations. These findings underscore the necessity of genomic control of inbreeding and coancestry with strategic changes in the Nordic breeding schemes for dairy to ensure long-term sustainability in the forthcoming years.

Collapse

Zhu Z, Lin R, Zhao B, Shi W, Cai Q, Zhang L, Xin Q, Li L, Miao Z, Zhou S, Huang Z, Huang Q, Zheng N. Whole-genome resequencing revealed the population structure and selection signal of 4 indigenous Chinese laying ducks. Poult Sci 2024;103:103832. [PMID: 38781766 PMCID: PMC11145554 DOI: 10.1016/j.psj.2024.103832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2024] [Revised: 04/20/2024] [Accepted: 05/02/2024] [Indexed: 05/25/2024] Open

Abstract

The assessment of animal genetic structure had significant importance for the preservation and breeding of animal germplasm resources. Selection signals are genotype markers generated during the process of biological evolution, and the detection of selection signals could reveal the direction of species evolution. The aim of this study was to generate a whole-genome resequencing data from Jinding duck, Shanma duck, Youxian Partridge duck, and Taiwan Brown tsaiya duck to reveal their population structure and selection signals. The population structure analysis revealed significant genetic differences among the 4 indigenous laying ducks, indicating their independent lineage. Specifically, Shanma duck and Youxian partridge duck were closely and likely originated from a common ancestor. In addition, selection sweep analysis was performed using the population genetic differentiation coefficient (Fst) and nucleotide diversity ratio (π ratio). The top 5% was used as the threshold for the Fst and π ratio, and the 2 thresholds were combined to identify selected genomic regions. In the selected regions of the 3 comparison groups, 136, 143, and 268 candidate genes were detected. Further screening of all candidate genes revealed that 35 candidate genes appeared simultaneously in 3 comparative groups, with 16 genes annotated. The 16 genes were analyzed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses. The results revealed 5 functional genes (AQP3, PIK3C3, NOL6, RPP25, and DCTN3) that may be related to important economic traits in laying ducks and involved mainly invasopressin-regulated water reabsorption, ribosome biogenesis, and the PI3K signaling pathway. The results provide insights into the protection and exploitation of genetic resources of Chinese indigenous laying ducks.

Collapse

Affiliation(s)

Zhiming Zhu Institute of Animal Husbandry and Veterinary Medicine, Fujian Academy of Agricultural Sciences/ Fujian Key Laboratory of Animal Genetics and Breeding, Fuzhou 350013, China
Ruiyi Lin College of Animal Sciences (College of Bee Science), Fujian Agriculture and Forestry University, Fuzhou 350002, China
Bangzhe Zhao Institute of Animal Husbandry and Veterinary Medicine, Fujian Academy of Agricultural Sciences/ Fujian Key Laboratory of Animal Genetics and Breeding, Fuzhou 350013, China; College of Animal Sciences (College of Bee Science), Fujian Agriculture and Forestry University, Fuzhou 350002, China
Wenli Shi Institute of Animal Husbandry and Veterinary Medicine, Fujian Academy of Agricultural Sciences/ Fujian Key Laboratory of Animal Genetics and Breeding, Fuzhou 350013, China; College of Animal Sciences (College of Bee Science), Fujian Agriculture and Forestry University, Fuzhou 350002, China
Qiannan Cai Institute of Animal Husbandry and Veterinary Medicine, Fujian Academy of Agricultural Sciences/ Fujian Key Laboratory of Animal Genetics and Breeding, Fuzhou 350013, China; College of Animal Sciences (College of Bee Science), Fujian Agriculture and Forestry University, Fuzhou 350002, China
Linli Zhang Institute of Animal Husbandry and Veterinary Medicine, Fujian Academy of Agricultural Sciences/ Fujian Key Laboratory of Animal Genetics and Breeding, Fuzhou 350013, China
Qingwu Xin Institute of Animal Husbandry and Veterinary Medicine, Fujian Academy of Agricultural Sciences/ Fujian Key Laboratory of Animal Genetics and Breeding, Fuzhou 350013, China
Li Li Institute of Animal Husbandry and Veterinary Medicine, Fujian Academy of Agricultural Sciences/ Fujian Key Laboratory of Animal Genetics and Breeding, Fuzhou 350013, China
Zhongwei Miao Institute of Animal Husbandry and Veterinary Medicine, Fujian Academy of Agricultural Sciences/ Fujian Key Laboratory of Animal Genetics and Breeding, Fuzhou 350013, China
Shiyi Zhou Seed Industry Development Center of Shishi, Shishi 362700, China
Zhongbin Huang Seed Industry Development Center of Shishi, Shishi 362700, China
Qinlou Huang Institute of Animal Husbandry and Veterinary Medicine, Fujian Academy of Agricultural Sciences/ Fujian Key Laboratory of Animal Genetics and Breeding, Fuzhou 350013, China
Nenzhu Zheng Institute of Animal Husbandry and Veterinary Medicine, Fujian Academy of Agricultural Sciences/ Fujian Key Laboratory of Animal Genetics and Breeding, Fuzhou 350013, China.

Collapse

Lawson DJ, Howard-McCombe J, Beaumont M, Senn H. How admixed captive breeding populations could be rescued using local ancestry information. Mol Ecol 2024:e17349. [PMID: 38634332 DOI: 10.1111/mec.17349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 12/21/2023] [Accepted: 02/26/2024] [Indexed: 04/19/2024]

Baguma JK, Mukasa SB, Nuwamanya E, Alicai T, Omongo CA, Ochwo-Ssemakula M, Ozimati A, Esuma W, Kanaabi M, Wembabazi E, Baguma Y, Kawuki RS. Identification of Genomic Regions for Traits Associated with Flowering in Cassava (Manihot esculenta Crantz). PLANTS (BASEL, SWITZERLAND) 2024;13:796. [PMID: 38592820 PMCID: PMC10974989 DOI: 10.3390/plants13060796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 01/25/2024] [Accepted: 01/26/2024] [Indexed: 04/11/2024]

Abstract

Flowering in cassava (Manihot esculenta Crantz) is crucial for the generation of botanical seed for breeding. However, genotypes preferred by most farmers are erect and poor at flowering or never flower. To elucidate the genetic basis of flowering, 293 diverse cassava accessions were evaluated for flowering-associated traits at two locations and seasons in Uganda. Genotyping using the Diversity Array Technology Pty Ltd. (DArTseq) platform identified 24,040 single-nucleotide polymorphisms (SNPs) distributed on the 18 cassava chromosomes. Population structure analysis using principal components (PCs) and kinships showed three clusters; the first five PCs accounted for 49.2% of the observed genetic variation. Linkage disequilibrium (LD) estimation averaged 0.32 at a distance of ~2850 kb (kilo base pairs). Polymorphism information content (PIC) and minor allele frequency (MAF) were 0.25 and 0.23, respectively. A genome-wide association study (GWAS) analysis uncovered 53 significant marker-trait associations (MTAs) with flowering-associated traits involving 27 loci. Two loci, SNPs S5_29309724 and S15_11747301, were associated with all the traits. Using five of the 27 SNPs with a Phenotype_Variance_Explained (PVE) ≥ 5%, 44 candidate genes were identified in the peak SNP sites located within 50 kb upstream or downstream, with most associated with branching traits. Eight of the genes, orthologous to Arabidopsis and other plant species, had known functional annotations related to flowering, e.g., eukaryotic translation initiation factor and myb family transcription factor. This study identified genomic regions associated with flowering-associated traits in cassava, and the identified SNPs can be useful in marker-assisted selection to overcome hybridization challenges, like unsynchronized flowering, and candidate gene validation.

Collapse

Affiliation(s)

Julius K. Baguma School of Agricultural Sciences, Makerere University, Kampala P.O. Box 7062, Uganda; (S.B.M.); (E.N.); (M.O.-S.) National Crops Resources Research Institute, Namulonge (NaCRRI), Kampala P.O. Box 7084, Uganda; (T.A.); (C.A.O.); (A.O.); (W.E.); (M.K.); (E.W.); (R.S.K.)
Settumba B. Mukasa School of Agricultural Sciences, Makerere University, Kampala P.O. Box 7062, Uganda; (S.B.M.); (E.N.); (M.O.-S.)
Ephraim Nuwamanya School of Agricultural Sciences, Makerere University, Kampala P.O. Box 7062, Uganda; (S.B.M.); (E.N.); (M.O.-S.) National Crops Resources Research Institute, Namulonge (NaCRRI), Kampala P.O. Box 7084, Uganda; (T.A.); (C.A.O.); (A.O.); (W.E.); (M.K.); (E.W.); (R.S.K.)
Titus Alicai National Crops Resources Research Institute, Namulonge (NaCRRI), Kampala P.O. Box 7084, Uganda; (T.A.); (C.A.O.); (A.O.); (W.E.); (M.K.); (E.W.); (R.S.K.)
Christopher Abu Omongo National Crops Resources Research Institute, Namulonge (NaCRRI), Kampala P.O. Box 7084, Uganda; (T.A.); (C.A.O.); (A.O.); (W.E.); (M.K.); (E.W.); (R.S.K.) National Agricultural Research Organisation (NARO), Entebbe P.O. Box 295, Uganda;
Mildred Ochwo-Ssemakula School of Agricultural Sciences, Makerere University, Kampala P.O. Box 7062, Uganda; (S.B.M.); (E.N.); (M.O.-S.)
Alfred Ozimati National Crops Resources Research Institute, Namulonge (NaCRRI), Kampala P.O. Box 7084, Uganda; (T.A.); (C.A.O.); (A.O.); (W.E.); (M.K.); (E.W.); (R.S.K.) School of Biological Sciences, Makerere University, Kampala P.O. Box 7062, Uganda
Williams Esuma National Crops Resources Research Institute, Namulonge (NaCRRI), Kampala P.O. Box 7084, Uganda; (T.A.); (C.A.O.); (A.O.); (W.E.); (M.K.); (E.W.); (R.S.K.) National Agricultural Research Organisation (NARO), Entebbe P.O. Box 295, Uganda;
Michael Kanaabi National Crops Resources Research Institute, Namulonge (NaCRRI), Kampala P.O. Box 7084, Uganda; (T.A.); (C.A.O.); (A.O.); (W.E.); (M.K.); (E.W.); (R.S.K.)
Enoch Wembabazi National Crops Resources Research Institute, Namulonge (NaCRRI), Kampala P.O. Box 7084, Uganda; (T.A.); (C.A.O.); (A.O.); (W.E.); (M.K.); (E.W.); (R.S.K.)
Yona Baguma National Agricultural Research Organisation (NARO), Entebbe P.O. Box 295, Uganda;
Robert S. Kawuki National Crops Resources Research Institute, Namulonge (NaCRRI), Kampala P.O. Box 7084, Uganda; (T.A.); (C.A.O.); (A.O.); (W.E.); (M.K.); (E.W.); (R.S.K.) National Agricultural Research Organisation (NARO), Entebbe P.O. Box 295, Uganda;

Collapse

Aalbers SE, Weir BS. Sequence-based population structure, relatedness, and inbreeding estimates for forensic autosomal STR markers. Forensic Sci Int Genet 2024;69:103009. [PMID: 38237274 DOI: 10.1016/j.fsigen.2024.103009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 12/11/2023] [Accepted: 01/11/2024] [Indexed: 01/29/2024]

Guan Y, Levy D. Estimation of inbreeding and kinship coefficients via latent identity-by-descent states. Bioinformatics 2024;40:btae082. [PMID: 38364309 PMCID: PMC10902678 DOI: 10.1093/bioinformatics/btae082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 01/15/2024] [Accepted: 02/12/2024] [Indexed: 02/18/2024] Open

Abstract

MOTIVATION

Estimating the individual inbreeding coefficient and pairwise kinship is an important problem in human genetics (e.g. in disease mapping) and in animal and plant genetics (e.g. inbreeding design). Existing methods, such as sample correlation-based genetic relationship matrix, KING, and UKin, are either biased, or not able to estimate inbreeding coefficients, or produce a large proportion of negative estimates that are difficult to interpret. This limitation of existing methods is partly due to failure to explicitly model inbreeding. Since all humans are inbred to various degrees by virtue of shared ancestries, it is prudent to account for inbreeding when inferring kinship between individuals.

RESULTS

We present "Kindred," an approach that estimates inbreeding and kinship by modeling latent identity-by-descent states that accounts for all possible allele sharing-including inbreeding-between two individuals. Kindred used non-negative least squares method to fit the model, which not only increases computation efficiency compared to the maximum likelihood method, but also guarantees non-negativity of the kinship estimates. Through simulation, we demonstrate the high accuracy and non-negativity of kinship estimates by Kindred. By selecting a subset of SNPs that are similar in allele frequencies across different continental populations, Kindred can accurately estimate kinship between admixed samples. In addition, we demonstrate that the realized kinship matrix estimated by Kindred is effective in reducing genomic control values via linear mixed model in genome-wide association studies. Finally, we demonstrate that Kindred produces sensible heritability estimates on an Australian height dataset.

AVAILABILITY AND IMPLEMENTATION

Kindred is implemented in C with multi-threading. It takes vcf file or stream as input and works seamlessly with bcftools. Kindred is freely available at https://github.com/haplotype/kindred.

Collapse

Cui R, Wu J, Yan K, Luo S, Hu Y, Feng W, Lu B, Wang J. Phased genome assemblies reveal haplotype-specific genetic load in the critically endangered Chinese Bahaba (Teleostei, Sciaenidae). Mol Ecol 2024;33:e17250. [PMID: 38179694 DOI: 10.1111/mec.17250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Revised: 12/06/2023] [Accepted: 12/11/2023] [Indexed: 01/06/2024]

Tsouris A, Brach G, Schacherer J, Hou J. Non-additive genetic components contribute significantly to population-wide gene expression variation. CELL GENOMICS 2024;4:100459. [PMID: 38190102 PMCID: PMC10794783 DOI: 10.1016/j.xgen.2023.100459] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 09/19/2023] [Accepted: 11/09/2023] [Indexed: 01/09/2024]

Goudet J, Weir BS. An allele-sharing, moment-based estimator of global, population-specific and population-pair FST under a general model of population structure. PLoS Genet 2023;19:e1010871. [PMID: 38011288 PMCID: PMC10703327 DOI: 10.1371/journal.pgen.1010871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 12/07/2023] [Accepted: 10/31/2023] [Indexed: 11/29/2023] Open

Garcia-Erill G, Hanghøj K, Heller R, Wiuf C, Albrechtsen A. Estimating admixture pedigrees of recent hybrids without a contiguous reference genome. Mol Ecol Resour 2023;23:1604-1619. [PMID: 37400991 DOI: 10.1111/1755-0998.13830] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Revised: 05/30/2023] [Accepted: 06/15/2023] [Indexed: 07/05/2023]

Cavedon M, Neufeld L, Finnegan L, Hervieux D, Michalak A, Pelletier A, Polfus J, Schwantje H, Skinner G, Steenweg R, Thacker C, Poissant J, Musiani M. Genomics of founders for conservation breeding: the Jasper caribou case. CONSERV GENET 2023;24:855-867. [PMID: 37969360 PMCID: PMC10638200 DOI: 10.1007/s10592-023-01540-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 06/07/2023] [Indexed: 11/17/2023]

Abstract

Conservation breeding programs are increasingly used as recovery actions for wild animals; bringing founders into captivity to rear captive populations for future reintroduction into the wild. The International Union for the Conservation of Nature recommends that founders should come from genetically close populations and should have sufficient genetic diversity to avoid mating among relatives. Genomic data are highly informative for evaluating founders due to their high resolution and ability to capture adaptive divergence, yet, their application in that context remains limited. Woodland caribou are federally listed as a Species at Risk in Canada, with several populations facing extirpation, such as those in the Rocky Mountains of Alberta and British Columbia (BC). To prevent local extirpation, Jasper National Park (JNP) is proposing a conservation breeding program. We examined single nucleotide polymorphisms for 144 caribou from 11 populations encompassing a 200,0002 km area surrounding JNP to provide information useful for identifying appropriate founders for this program. We found that this area likely hosts a caribou metapopulation historically characterized by high levels of gene flow, which indicates that multiple sources of founders would be appropriate for initiating a breeding program. However, population structure and adaptive divergence analyses indicate that JNP caribou are closest to populations in the BC Columbia range, which also have suitable genetic diversity for conservation breeding. We suggest that collaboration among jurisdictions would be beneficial to implement the program to promote recovery of JNP caribou and possibly other caribou populations in the surrounding area, which is strategically at the periphery of the distribution of this endangered species.

Supplementary Information

The online version contains supplementary material available at 10.1007/s10592-023-01540-3.

Collapse

LaPierre N, Fu B, Turnbull S, Eskin E, Sankararaman S. Leveraging family data to design Mendelian randomization that is provably robust to population stratification. Genome Res 2023;33:1032-1041. [PMID: 37197991 PMCID: PMC10538495 DOI: 10.1101/gr.277664.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Accepted: 04/16/2023] [Indexed: 05/19/2023]

He L, Luo J, Niu S, Bai D, Chen Y. Population structure analysis to explore genetic diversity and geographical distribution characteristics of wild tea plant in Guizhou Plateau. BMC PLANT BIOLOGY 2023;23:255. [PMID: 37189087 DOI: 10.1186/s12870-023-04239-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Accepted: 04/21/2023] [Indexed: 05/17/2023]

Abstract

BACKGROUND

Tea, the second largest consumer beverage in the world after water, is widely cultivated in tropical and subtropical areas. However, the effect of environmental factors on the distribution of wild tea plants is unclear.

RESULTS

A total of 159 wild tea plants were collected from different altitudes and geological types of the Guizhou Plateau. Using the genotyping-by-sequencing method, 98,241 high-quality single nucleotide polymorphisms were identified. Genetic diversity, population structure analysis, principal component analysis, phylogenetic analysis, and linkage disequilibrium were performed. The genetic diversity of the wild tea plant population from the Silicate Rock Classes of Camellia gymnogyna was higher than that from the Carbonate Rock Classes of Camellia tachangensis. In addition, the genetic diversity of wild tea plants from the second altitude gradient was significantly higher than that of wild tea plants from the third and first altitude gradients. Two inferred pure groups (GP01 and GP02) and one inferred admixture group (GP03) were identified by population structure analysis and were verified by principal component and phylogenetic analyses. The highest differentiation coefficients were determined for GP01 vs. GP02, while the lowest differentiation coefficients were determined for GP01 vs. GP03.

CONCLUSIONS

This study revealed the genetic diversity and geographical distribution characteristics of wild tea plants in the Guizhou Plateau. There are significant differences in genetic diversity and evolutionary direction between Camellia tachangensis with Carbonate Rock Classes at the first altitude gradient and Camellia gymnogyna with Silicate Rock Classes at the third altitude gradient. Geological environment, soil mineral element content, soil pH, and altitude markedly contributed to the genetic differentiation between Camellia tachangensis and Camellia gymnogyna.

Collapse

Hou Z, Ochoa A. Genetic association models are robust to common population kinship estimation biases. Genetics 2023;224:iyad030. [PMID: 36843304 PMCID: PMC10474929 DOI: 10.1093/genetics/iyad030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 11/08/2022] [Accepted: 02/17/2023] [Indexed: 02/28/2023] Open

Solovieva E, Sakai H. PSReliP: an integrated pipeline for analysis and visualization of population structure and relatedness based on genome-wide genetic variant data. BMC Bioinformatics 2023;24:135. [PMID: 37020193 PMCID: PMC10074814 DOI: 10.1186/s12859-023-05169-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 02/02/2023] [Indexed: 04/07/2023] Open

Abstract

BACKGROUND

Population structure and cryptic relatedness between individuals (samples) are two major factors affecting false positives in genome-wide association studies (GWAS). In addition, population stratification and genetic relatedness in genomic selection in animal and plant breeding can affect prediction accuracy. The methods commonly used for solving these problems are principal component analysis (to adjust for population stratification) and marker-based kinship estimates (to correct for the confounding effects of genetic relatedness). Currently, many tools and software are available that analyze genetic variation among individuals to determine population structure and genetic relationships. However, none of these tools or pipelines perform such analyses in a single workflow and visualize all the various results in a single interactive web application.

RESULTS

We developed PSReliP, a standalone, freely available pipeline for the analysis and visualization of population structure and relatedness between individuals in a user-specified genetic variant dataset. The analysis stage of PSReliP is responsible for executing all steps of data filtering and analysis and contains an ordered sequence of commands from PLINK, a whole-genome association analysis toolset, along with in-house shell scripts and Perl programs that support data pipelining. The visualization stage is provided by Shiny apps, an R-based interactive web application. In this study, we describe the characteristics and features of PSReliP and demonstrate how it can be applied to real genome-wide genetic variant data.

CONCLUSIONS

The PSReliP pipeline allows users to quickly analyze genetic variants such as single nucleotide polymorphisms and small insertions or deletions at the genome level to estimate population structure and cryptic relatedness using PLINK software and to visualize the analysis results in interactive tables, plots, and charts using Shiny technology. The analysis and assessment of population stratification and genetic relatedness can aid in choosing an appropriate approach for the statistical analysis of GWAS data and predictions in genomic selection. The various outputs from PLINK can be used for further downstream analysis. The code and manual for PSReliP are available at https://github.com/solelena/PSReliP .

Collapse

St-Pierre J, Oualkacha K, Bhatnagar SR. Efficient penalized generalized linear mixed models for variable selection and genetic risk prediction in high-dimensional data. Bioinformatics 2023;39:7008326. [PMID: 36708013 PMCID: PMC9907224 DOI: 10.1093/bioinformatics/btad063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 01/13/2023] [Accepted: 01/25/2023] [Indexed: 01/29/2023] Open

Abstract

MOTIVATION

Sparse regularized regression methods are now widely used in genome-wide association studies (GWAS) to address the multiple testing burden that limits discovery of potentially important predictors. Linear mixed models (LMMs) have become an attractive alternative to principal components (PCs) adjustment to account for population structure and relatedness in high-dimensional penalized models. However, their use in binary trait GWAS rely on the invalid assumption that the residual variance does not depend on the estimated regression coefficients. Moreover, LMMs use a single spectral decomposition of the covariance matrix of the responses, which is no longer possible in generalized linear mixed models (GLMMs).

RESULTS

We introduce a new method called pglmm, a penalized GLMM that allows to simultaneously select genetic markers and estimate their effects, accounting for between-individual correlations and binary nature of the trait. We develop a computationally efficient algorithm based on penalized quasi-likelihood estimation that allows to scale regularized mixed models on high-dimensional binary trait GWAS. We show through simulations that when the dimensionality of the relatedness matrix is high, penalized LMM and logistic regression with PC adjustment fail to select important predictors, and have inferior prediction accuracy compared to pglmm. Further, we demonstrate through the analysis of two polygenic binary traits in a subset of 6731 related individuals from the UK Biobank data with 320K SNPs that our method can achieve higher predictive performance, while also selecting fewer predictors than a sparse regularized logistic lasso with PC adjustment.

AVAILABILITY AND IMPLEMENTATION

Our Julia package PenalizedGLMM.jl is publicly available on github: https://github.com/julstpierre/PenalizedGLMM.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Mary-Huard T, Balding D. Fast and accurate joint inference of coancestry parameters for populations and/or individuals. PLoS Genet 2023;19:e1010054. [PMID: 36656906 PMCID: PMC9888729 DOI: 10.1371/journal.pgen.1010054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 01/31/2023] [Accepted: 12/01/2022] [Indexed: 01/20/2023] Open

LaPierre N, Fu B, Turnbull S, Eskin E, Sankararaman S. Leveraging family data to design Mendelian Randomization that is provably robust to population stratification. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.05.522936. [PMID: 36711635 PMCID: PMC9881984 DOI: 10.1101/2023.01.05.522936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]

Caliebe A, Tekola‐Ayele F, Darst BF, Wang X, Song YE, Gui J, Sebro RA, Balding DJ, Saad M, Dubé M. Including diverse and admixed populations in genetic epidemiology research. Genet Epidemiol 2022;46:347-371. [PMID: 35842778 PMCID: PMC9452464 DOI: 10.1002/gepi.22492] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 05/31/2022] [Accepted: 06/06/2022] [Indexed: 11/25/2022]

Giles‐Pérez GI, Aguirre‐Planter E, Eguiarte LE, Jaramillo‐Correa JP. Demographic modelling helps track the rapid and recent divergence of a conifer species pair from Central Mexico. Mol Ecol 2022;31:5074-5088. [PMID: 35951172 PMCID: PMC9804182 DOI: 10.1111/mec.16646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Revised: 07/26/2022] [Accepted: 07/28/2022] [Indexed: 01/05/2023]

Alptekin B, Erfatpour M, Mangel D, Pauli D, Blake T, Turner H, Lachowiec J, Sherman J, Fischer A. Selection of favorable alleles of genes controlling flowering and senescence improves malt barley quality. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2022;42:59. [PMID: 37313013 PMCID: PMC10248683 DOI: 10.1007/s11032-022-01331-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 09/14/2022] [Indexed: 06/15/2023]

Parodi L, Barbier M, Jacoupy M, Pujol C, Lejeune FX, Lallemant-Dudek P, Esteves T, Pennings M, Kamsteeg EJ, Guillaud-Bataille M, Banneau G, Coarelli G, Oumoussa BM, Fraidakis MJ, Stevanin G, Depienne C, van de Warrenburg B, Brice A, Durr A. The mitochondrial seryl-tRNA synthetase SARS2 modifies onset in spastic paraplegia type 4. Genet Med 2022;24:2308-2317. [PMID: 36056923 DOI: 10.1016/j.gim.2022.07.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 07/24/2022] [Accepted: 07/25/2022] [Indexed: 11/25/2022] Open

Affiliation(s)

Livia Parodi Paris Brain Institute (Institut du Cerveau, ICM), INSERM, CNRS, Assistance Publique-Hôpitaux de Paris (AP-HP), Sorbonne Université, Paris, France
Mathieu Barbier Paris Brain Institute (Institut du Cerveau, ICM), INSERM, CNRS, Assistance Publique-Hôpitaux de Paris (AP-HP), Sorbonne Université, Paris, France
Maxime Jacoupy Paris Brain Institute (Institut du Cerveau, ICM), INSERM, CNRS, Assistance Publique-Hôpitaux de Paris (AP-HP), Sorbonne Université, Paris, France
Claire Pujol Paris Brain Institute (Institut du Cerveau, ICM), INSERM, CNRS, Assistance Publique-Hôpitaux de Paris (AP-HP), Sorbonne Université, Paris, France; Pasteur Institute, Centre National de la Recherche Scientifique UMR 3691, Paris, France
François-Xavier Lejeune Paris Brain Institute (Institut du Cerveau, ICM), INSERM, CNRS, Assistance Publique-Hôpitaux de Paris (AP-HP), Sorbonne Université, Paris, France
Pauline Lallemant-Dudek Paris Brain Institute (Institut du Cerveau, ICM), INSERM, CNRS, Assistance Publique-Hôpitaux de Paris (AP-HP), Sorbonne Université, Paris, France
Typhaine Esteves Paris Brain Institute (Institut du Cerveau, ICM), INSERM, CNRS, Assistance Publique-Hôpitaux de Paris (AP-HP), Sorbonne Université, Paris, France; Université de Bordeaux, CNRS, EPHE, INCIA, UMR 5287, Bordeaux, France
Maartje Pennings Department of Human Genetics, Radboud University Medical Center, Nijmegen, the Netherlands
Erik-Jan Kamsteeg Department of Human Genetics, Radboud University Medical Center, Nijmegen, the Netherlands
Marine Guillaud-Bataille Département de Génétique, AP-HP, GH Pitié-Salpêtrière, Sorbonne Université, Paris, France
Guillaume Banneau Département de Génétique, AP-HP, GH Pitié-Salpêtrière, Sorbonne Université, Paris, France
Giulia Coarelli Paris Brain Institute (Institut du Cerveau, ICM), INSERM, CNRS, Assistance Publique-Hôpitaux de Paris (AP-HP), Sorbonne Université, Paris, France
Badreddine Mohand Oumoussa Sorbonne Université, Inserm, UMS Production et Analyse des données en Sciences de la vie et en Santé, PASS, Plateforme Post-génomique de la Pitié-Salpêtrière, P3S, Paris, France
Matthew J Fraidakis Rare Neurological Diseases Unit, Department of Neurology, Attikon University Hospital, Medical School of the University of Athens, Athens, Greece
Giovanni Stevanin Paris Brain Institute (Institut du Cerveau, ICM), INSERM, CNRS, Assistance Publique-Hôpitaux de Paris (AP-HP), Sorbonne Université, Paris, France; Université de Bordeaux, CNRS, EPHE, INCIA, UMR 5287, Bordeaux, France
Christel Depienne Paris Brain Institute (Institut du Cerveau, ICM), INSERM, CNRS, Assistance Publique-Hôpitaux de Paris (AP-HP), Sorbonne Université, Paris, France; Institut für Humangenetik, Universitätsklinikum Essen, Essen, Germany
Bart van de Warrenburg Department of Neurology, Donders Institute for Brain, Cognition and Behavior, Radboud University Medical Center, Nijmegen, the Netherlands
Alexis Brice Paris Brain Institute (Institut du Cerveau, ICM), INSERM, CNRS, Assistance Publique-Hôpitaux de Paris (AP-HP), Sorbonne Université, Paris, France
Alexandra Durr Paris Brain Institute (Institut du Cerveau, ICM), INSERM, CNRS, Assistance Publique-Hôpitaux de Paris (AP-HP), Sorbonne Université, Paris, France.

Collapse

Sherwin WB. Bray-Curtis (AFD) differentiation in molecular ecology: Forecasting, an adjustment ( ^A A), and comparative performance in selection detection. Ecol Evol 2022;12:e9176. [PMID: 36110882 PMCID: PMC9465203 DOI: 10.1002/ece3.9176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 07/04/2022] [Accepted: 07/06/2022] [Indexed: 11/07/2022] Open

Abstract

Geographic genetic differentiation measures are used for purposes such as assessing genetic diversity and connectivity, and searching for signals of selection. Confirmation by unrelated measures can minimize false positives. A popular differentiation measure, Bray-Curtis, has been used increasingly in molecular ecology, renamed AFD (hereafter called BCAFD). Critically, BCAFD is expected to be partially independent of the commonly used Hill "Q-profile" measures. BCAFD needs scrutiny for potential biases, by examining limits on its value, and comparing simulations against expectations. BCAFD has two dependencies on within-population (alpha) variation, undesirable for a between-population (beta) measure. The first dependency is derived from similarity toG ST andF ST . The second dependency is that BCAFD cannot be larger than the highest allele proportion in either location (alpha variation), which can be overcome by data-filtering or by a modified statistic A A or "Adjusted AFD". The first dependency does not forestall applications such as assessing connectivity or selection, if we know the measure's null behavior under selective neutrality with specified conditions-which is shown in this article for A A, for equilibrium, and nonequilibrium, for the commonly used data type of single-nucleotide-polymorphisms (SNPs) in two locations. Thus, A A can be used in tandem with mathematically contrasting differentiation measures, with the aim of reducing false inferences. For detecting adaptive loci, the relative performance of A A and other measures was evaluated, showing that it is best to use two mathematically different measures simultaneously, and that A A is in one of the best such pairwise criteria. For any application, using A A, rather than BCAFD, avoids the counterintuitive limitation by maximum allele proportion within localities.

Collapse

Yang CJ, Ladejobi O, Mott R, Powell W, Mackay I. Analysis of historical selection in winter wheat. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022;135:3005-3023. [PMID: 35864201 PMCID: PMC9482581 DOI: 10.1007/s00122-022-04163-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Accepted: 06/22/2022] [Indexed: 06/15/2023]

Whole blood DNA methylation analysis reveals respiratory environmental traits involved in COVID-19 severity following SARS-CoV-2 infection. Nat Commun 2022;13:4597. [PMID: 35933486 PMCID: PMC9357033 DOI: 10.1038/s41467-022-32357-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 07/26/2022] [Indexed: 02/06/2023] Open

Long PN, Cook VJ, Majumder A, Barbour AG, Long AD. The utility of a closed breeding colony of Peromyscus leucopus for dissecting complex traits. Genetics 2022;221:iyac026. [PMID: 35143664 PMCID: PMC9071557 DOI: 10.1093/genetics/iyac026] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 02/01/2022] [Indexed: 11/13/2022] Open

Maróstica AS, Nunes K, Castelli EC, Silva NSB, Weir BS, Goudet J, Meyer D. How HLA diversity is apportioned: influence of selection and relevance to transplantation. Philos Trans R Soc Lond B Biol Sci 2022;377:20200420. [PMID: 35430892 PMCID: PMC9014195 DOI: 10.1098/rstb.2020.0420] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open

Chiu AM, Molloy EK, Tan Z, Talwalkar A, Sankararaman S. Inferring population structure in biobank-scale genomic data. Am J Hum Genet 2022;109:727-737. [PMID: 35298920 PMCID: PMC9069078 DOI: 10.1016/j.ajhg.2022.02.015] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 02/21/2022] [Indexed: 01/07/2023] Open

Lauer E, Holland J, Isik F. Prediction ability of genome-wide markers in Pinus taeda L. within and between population is affected by relatedness to the training population and trait genetic architecture. G3 (BETHESDA, MD.) 2022;12:6440053. [PMID: 34849838 PMCID: PMC9210318 DOI: 10.1093/g3journal/jkab405] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 11/08/2021] [Indexed: 11/26/2022]

Zhang QS, Goudet J, Weir BS. Rank-invariant estimation of inbreeding coefficients. Heredity (Edinb) 2022;128:1-10. [PMID: 34824382 PMCID: PMC8733021 DOI: 10.1038/s41437-021-00471-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 09/05/2021] [Accepted: 09/05/2021] [Indexed: 11/18/2022] Open

Duk M, Kanapin A, Rozhmina T, Bankin M, Surkova S, Samsonova A, Samsonova M. The Genetic Landscape of Fiber Flax. FRONTIERS IN PLANT SCIENCE 2021;12:764612. [PMID: 34950165 PMCID: PMC8691122 DOI: 10.3389/fpls.2021.764612] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 11/03/2021] [Indexed: 06/14/2023]

Laurent FX, Fischer A, Oldt RF, Kanthaswamy S, Buckleton JS, Hitchin S. Streamlining the decision-making process for international DNA kinship matching using Worldwide allele frequencies and tailored cutoff log₁₀LR thresholds. Forensic Sci Int Genet 2021;57:102634. [PMID: 34871915 DOI: 10.1016/j.fsigen.2021.102634] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Revised: 10/13/2021] [Accepted: 11/15/2021] [Indexed: 11/30/2022]

Abstract

The identification of human remains belonging to missing persons is one of the main challenges for forensic genetics. Although other means of identification can be applied to missing person investigations, DNA is often extremely valuable to further support or refute potential associations. When reference DNA samples cannot be collected from personal items belonging to a missing person, a direct DNA identification cannot be carried out. However, identifications can be made indirectly using DNA from the missing person's relatives. The ranking of likelihood ratio (LR) values, which measure the fit of a missing person for any given pedigree, is often the first step in selecting candidates in a DNA database. Although implementing DNA kinship matching in a national environment is feasible, many challenges need to be resolved before applying this method to an international configuration. In this study, we present an innovative and intuitive method to perform international DNA kinship matching and facilitate the comparison of DNA profiles when the ancestry is unknown or unsure and/or when different marker sets are used. This straightforward method, which is based on calculations performed with the DNA matching software BONAPARTE, Worldwide allele frequencies and tailored cutoff log₁₀LR thresholds, allows for the classification of potential candidates according to the strength of the DNA evidence and the predicted proportion of adventitious matches. This is a powerful method for streamlining the decision-making process in missing person investigations and DVI processes, especially when there are low numbers of overlapping typed STRs. Intuitive interpretation tables and a decision tree will help strengthen international data comparison for the identification of reported missing individuals discovered outside their national borders.

Collapse

Calboli FCF, Delahaut V, Deflem I, Hablützel PI, Hellemans B, Kordas A, Raeymaekers JAM, Bervoets L, De Boeck G, Volckaert FAM. Association between Chromosome 4 and mercury accumulation in muscle of the three-spined stickleback (Gasterosteus aculeatus). Evol Appl 2021;14:2553-2567. [PMID: 34745343 PMCID: PMC8549617 DOI: 10.1111/eva.13298] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 08/18/2021] [Accepted: 08/29/2021] [Indexed: 11/29/2022] Open

Genome-Wide SNP Analysis Reveals Multiple Paternity in Burmese Pythons Invasive to the Greater Florida Everglades. J HERPETOL 2021. [DOI: 10.1670/20-104] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

A spectral theory for Wright's inbreeding coefficients and related quantities. PLoS Genet 2021;17:e1009665. [PMID: 34280184 PMCID: PMC8320931 DOI: 10.1371/journal.pgen.1009665] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Revised: 07/29/2021] [Accepted: 06/13/2021] [Indexed: 12/20/2022] Open

Abstract

Wright’s inbreeding coefficient, F_ST, is a fundamental measure in population genetics. Assuming a predefined population subdivision, this statistic is classically used to evaluate population structure at a given genomic locus. With large numbers of loci, unsupervised approaches such as principal component analysis (PCA) have, however, become prominent in recent analyses of population structure. In this study, we describe the relationships between Wright’s inbreeding coefficients and PCA for a model of K discrete populations. Our theory provides an equivalent definition of F_ST based on the decomposition of the genotype matrix into between and within-population matrices. The average value of Wright’s F_ST over all loci included in the genotype matrix can be obtained from the PCA of the between-population matrix. Assuming that a separation condition is fulfilled and for reasonably large data sets, this value of F_ST approximates the proportion of genetic variation explained by the first (K − 1) principal components accurately. The new definition of F_ST is useful for computing inbreeding coefficients from surrogate genotypes, for example, obtained after correction of experimental artifacts or after removing adaptive genetic variation associated with environmental variables. The relationships between inbreeding coefficients and the spectrum of the genotype matrix not only allow interpretations of PCA results in terms of population genetic concepts but extend those concepts to population genetic analyses accounting for temporal, geographical and environmental contexts.

Principal component analysis (PCA) is the most-frequently used approach to describe population genetic structure from large population genomic data sets. In this study, we show that PCA not only estimates ancestries of sampled individuals, but also computes the average value of Wright’s inbreeding coefficient over the loci included in the genotype matrix. Our result shows that inbreeding coefficients and PCA eigenvalues provide equivalent descriptions of population structure. As a consequence, PCA extends the definition of those coefficients beyond the framework of allelic frequencies. We give examples on how F_ST can be computed from ancient DNA samples for which genotypes are corrected for coverage, and in an ecological genomic example where a proportion of genetic variation is explained by environmental variables.

Collapse

Ochoa A, Storey JD. Estimating FST and kinship for arbitrary population structures. PLoS Genet 2021;17:e1009241. [PMID: 33465078 PMCID: PMC7846127 DOI: 10.1371/journal.pgen.1009241] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Revised: 01/29/2021] [Accepted: 11/02/2020] [Indexed: 12/20/2022] Open

Abstract

F_ST and kinship are key parameters often estimated in modern population genetics studies in order to quantitatively characterize structure and relatedness. Kinship matrices have also become a fundamental quantity used in genome-wide association studies and heritability estimation. The most frequently-used estimators of F_ST and kinship are method-of-moments estimators whose accuracies depend strongly on the existence of simple underlying forms of structure, such as the independent subpopulations model of non-overlapping, independently evolving subpopulations. However, modern data sets have revealed that these simple models of structure likely do not hold in many populations, including humans. In this work, we analyze the behavior of these estimators in the presence of arbitrarily-complex population structures, which results in an improved estimation framework specifically designed for arbitrary population structures. After generalizing the definition of F_ST to arbitrary population structures and establishing a framework for assessing bias and consistency of genome-wide estimators, we calculate the accuracy of existing F_ST and kinship estimators under arbitrary population structures, characterizing biases and estimation challenges unobserved under their originally-assumed models of structure. We then present our new approach, which consistently estimates kinship and F_ST when the minimum kinship value in the dataset is estimated consistently. We illustrate our results using simulated genotypes from an admixture model, constructing a one-dimensional geographic scenario that departs nontrivially from the independent subpopulations model. Our simulations reveal the potential for severe biases in estimates of existing approaches that are overcome by our new framework. This work may significantly improve future analyses that rely on accurate kinship and F_ST estimates.

Kinship coefficients and F_ST, which measure relatedness and population structure, respectively, are important quantities needed to accurately perform various analyses on genetic data, including genome-wide association studies and heritability estimation. However, existing estimators require restrictive assumptions of independence that are not met by real human and other datasets. In this work we find that existing estimators can be severely biased under reasonable scenarios, first by theoretically determining their properties, and then using an admixture simulation to illustrate our findings. In particular, we find that existing F_ST estimators are downwardly biased, and that existing kinship matrix estimators have related biases that are on average downward and of similar magnitude but vary for every pair of individuals. These insights led us to a new estimation framework for kinship and F_ST that is practically unbiased for any population structure, as demonstrated by theory and simulations. Our new approaches—available as open-source R packages—are easy to use and are more widely applicable than existing approaches, and they are likely to improve downstream analyses that require accurate kinship and F_ST estimates.

Collapse