26
|
Shirasawa K, Kosugi S, Sasaki K, Ghelfi A, Okazaki K, Toyoda A, Hirakawa H, Isobe S. Genome features of common vetch ( Vicia sativa) in natural habitats. PLANT DIRECT 2021; 5:e352. [PMID: 34646975 PMCID: PMC8496506 DOI: 10.1002/pld3.352] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 09/09/2021] [Accepted: 09/10/2021] [Indexed: 05/30/2023]
Abstract
Wild plants are often tolerant to biotic and abiotic stresses in their natural environments, whereas domesticated plants such as crops frequently lack such resilience. This difference is thought to be due to the high levels of genome heterozygosity in wild plant populations and the low levels of heterozygosity in domesticated crop species. In this study, common vetch (Vicia sativa) was used as a model to examine this hypothesis. The common vetch genome (2n = 14) was estimated as 1.8 Gb in size. Genome sequencing produced a reference assembly that spanned 1.5 Gb, from which 31,146 genes were predicted. Using this sequence as a reference, 24,118 single nucleotide polymorphisms were discovered in 1243 plants from 12 natural common vetch populations in Japan. Common vetch genomes exhibited high heterozygosity at the population level, with lower levels of heterozygosity observed at specific genome regions. Such patterns of heterozygosity are thought to be essential for adaptation to different environments. The resources generated in this study will provide insights into de novo domestication of wild plants and agricultural enhancement.
Collapse
|
27
|
Shafat Z, Ahmed A, Parvez MK, Parveen S. Sequence to structure analysis of the ORF4 protein from Hepatitis E virus. Bioinformation 2021; 17:818-828. [PMID: 35539889 PMCID: PMC9049080 DOI: 10.6026/97320630017818] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2021] [Revised: 09/21/2021] [Accepted: 09/21/2021] [Indexed: 02/06/2023] Open
Abstract
Hepatitis E virus (HEV) is the main cause of acute hepatitis worldwide. HEV accounts for up to 30% mortality rate in pregnant women, with highest incidences reported for genotype 1 (G1) HEV. The contributing factors in adverse cases during pregnancy in women due to HEV infection is still debated. The mechanism underlying the pathogenesis of viral infection is attributed to different genomic component of HEV, i.e., open reading frames (ORFs): ORF1, ORF2, ORF3 and ORF4. Recently, ORF4 has been discovered in enhancing the replication of GI isolates of HEV through regulation of an IRES-like RNA element. However, its characterization through computational methodologies remains unexplored. In this novel study, we provide comprehensive overview of ORF4 protein's genetic and molecular characteristics through analyzing its sequence and different structural levels. A total of three different datasets (Human, Rat and Ferret) of ORF4 genomes were built and comparatively analyzed. Several non-synonymous mutations in conjunction with higher entropy values were observed in rat and ferret datasets, however, limited variation was observed in human ORF4 genomes. Higher transition to tranversion ratio was observed in the ORF4 genomes. Studies have reported the association of intrinsic disordered proteins (IDP) with drug discovery due to its role in several signaling and regulatory processes through protein-protein interactions (PPIs). As PPIs are potent drug target sources, thus the ORF4 protein was explored by analyzing its polypeptide structure in order to shed light on its intrinsic disorder. Pressures that lead towards preponderance of disordered-promoting amino acid residues shaped the evolution of ORF4. The intrinsic disorder propensity analysis revealed ORF4 protein (Human) as a highly disordered protein (IDP). Predominance of coils and lack of secondary structure further substantiated our findings suggesting its involvement in binding to ligand molecules. Thus, ORF4 contributes to cellular signaling processes through protein-protein interactions, as IDPs are targets for regulation to accelerate the process of drug designing strategies against HEV infections.
Collapse
|
28
|
Hardigan MA, Lorant A, Pincot DDA, Feldmann MJ, Famula RA, Acharya CB, Lee S, Verma S, Whitaker VM, Bassil N, Zurn J, Cole GS, Bird K, Edger PP, Knapp SJ. Unraveling the Complex Hybrid Ancestry and Domestication History of Cultivated Strawberry. Mol Biol Evol 2021; 38:2285-2305. [PMID: 33507311 PMCID: PMC8136507 DOI: 10.1093/molbev/msab024] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Cultivated strawberry (Fragaria × ananassa) is one of our youngest domesticates, originating in early eighteenth-century Europe from spontaneous hybrids between wild allo-octoploid species (Fragaria chiloensis and Fragaria virginiana). The improvement of horticultural traits by 300 years of breeding has enabled the global expansion of strawberry production. Here, we describe the genomic history of strawberry domestication from the earliest hybrids to modern cultivars. We observed a significant increase in heterozygosity among interspecific hybrids and a decrease in heterozygosity among domesticated descendants of those hybrids. Selective sweeps were found across the genome in early and modern phases of domestication—59–76% of the selectively swept genes originated in the three less dominant ancestral subgenomes. Contrary to the tenet that genetic diversity is limited in cultivated strawberry, we found that the octoploid species harbor massive allelic diversity and that F. × ananassa harbors as much allelic diversity as either wild founder. We identified 41.8 M subgenome-specific DNA variants among resequenced wild and domesticated individuals. Strikingly, 98% of common alleles and 73% of total alleles were shared between wild and domesticated populations. Moreover, genome-wide estimates of nucleotide diversity were virtually identical in F. chiloensis,F. virginiana, and F. × ananassa (π = 0.0059–0.0060). We found, however, that nucleotide diversity and heterozygosity were significantly lower in modern F. × ananassa populations that have experienced significant genetic gains and have produced numerous agriculturally important cultivars.
Collapse
|
29
|
Cunha DBD, Martins KCP, Júnior JNDS, Moreira ECDO, Sampaio I, Queiroz CDCS, Leite MA, Souza AAD, Santos CAD, Vallinoto M. High genetic diversity detected in the mitochondrial Control Region of the Serra Spanish Mackerel, Scomberomorus brasiliensis (Collette, Russo & Zavala, 1978) along the Brazilian coast. Mitochondrial DNA A DNA Mapp Seq Anal 2021; 32:178-185. [PMID: 37706236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2023]
Abstract
The Serra Spanish mackerel, Scomberomorus brasiliensis, is one of the most important fishery resources in the western tropical Atlantic, including northern and eastern Brazil. Despite its economic importance, few genetic markers have been sequenced in this species, and little is known of its population genetics. The present study evaluated the genetic variability of 110 individuals, representing three distinct Brazilian populations (Macapá, Fortaleza and Paranaguá), based on a segment of the mitochondrial Control Region. The sequences revealed high levels of genetic diversity, and suggested marked connectivity among the studied populations. A variable repeat was also found in the 3' portion of the studied Control Region fragment, which may prove useful as a marker in future genetic population studies of S. brasiliensis.
Collapse
|
30
|
Tigano A, Jacobs A, Wilder AP, Nand A, Zhan Y, Dekker J, Therkildsen NO. Chromosome-Level Assembly of the Atlantic Silverside Genome Reveals Extreme Levels of Sequence Diversity and Structural Genetic Variation. Genome Biol Evol 2021; 13:evab098. [PMID: 33964136 PMCID: PMC8214408 DOI: 10.1093/gbe/evab098] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 04/21/2021] [Accepted: 04/30/2021] [Indexed: 12/12/2022] Open
Abstract
The levels and distribution of standing genetic variation in a genome can provide a wealth of insights about the adaptive potential, demographic history, and genome structure of a population or species. As structural variants are increasingly associated with traits important for adaptation and speciation, investigating both sequence and structural variation is essential for wholly tapping this potential. Using a combination of shotgun sequencing, 10x Genomics linked reads and proximity-ligation data (Chicago and Hi-C), we produced and annotated a chromosome-level genome assembly for the Atlantic silverside (Menidia menidia)-an established ecological model for studying the phenotypic effects of natural and artificial selection-and examined patterns of genomic variation across two individuals sampled from different populations with divergent local adaptations. Levels of diversity varied substantially across each chromosome, consistently being highly elevated near the ends (presumably near telomeric regions) and dipping to near zero around putative centromeres. Overall, our estimate of the genome-wide average heterozygosity in the Atlantic silverside is among the highest reported for a fish, or any vertebrate (1.32-1.76% depending on inference method and sample). Furthermore, we also found extreme levels of structural variation, affecting ∼23% of the total genome sequence, including multiple large inversions (> 1 Mb and up to 12.6 Mb) associated with previously identified haploblocks showing strong differentiation between locally adapted populations. These extreme levels of standing genetic variation are likely associated with large effective population sizes and may help explain the remarkable adaptive divergence among populations of the Atlantic silverside.
Collapse
|
31
|
Tisthammer KH, Dong W, Joy JB, Pennings PS. Comparative Analysis of Within-Host Mutation Patterns and Diversity of Hepatitis C Virus Subtypes 1a, 1b, and 3a. Viruses 2021; 13:511. [PMID: 33808782 PMCID: PMC8003410 DOI: 10.3390/v13030511] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Accepted: 03/17/2021] [Indexed: 12/13/2022] Open
Abstract
Understanding within-host evolution is critical for predicting viral evolutionary outcomes, yet such studies are currently lacking due to difficulty involving human subjects. Hepatitis C virus (HCV) is an RNA virus with high mutation rates. Its complex evolutionary dynamics and extensive genetic diversity are demonstrated in over 67 known subtypes. In this study, we analyzed within-host mutation frequency patterns of three HCV subtypes, using a large number of samples obtained from treatment-naïve participants by next-generation sequencing. We report that overall mutation frequency patterns are similar among subtypes, yet subtype 3a consistently had lower mutation frequencies and nucleotide diversity, while subtype 1a had the highest. We found that about 50% of genomic sites are highly conserved across subtypes, which are likely under strong purifying selection. We also compared within-host and between-host selective pressures, which revealed that Hyper Variable Region 1 within hosts was under positive selection, but was under slightly negative selection between hosts, which indicates that many mutations created within hosts are removed during the transmission bottleneck. Examining the natural prevalence of known resistance-associated variants showed their consistent existence in the treatment-naïve participants. These results provide insights into the differences and similarities among HCV subtypes that may be used to develop and improve HCV therapies.
Collapse
|
32
|
Myers BM, Burns KJ, Clark CJ, Brelsford A. The population genetics of nonmigratory Allen's Hummingbird ( Selasphorus sasin sedentarius) following a recent mainland colonization. Ecol Evol 2021; 11:1850-1865. [PMID: 33614008 PMCID: PMC7882939 DOI: 10.1002/ece3.7174] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Revised: 12/10/2020] [Accepted: 12/17/2020] [Indexed: 02/01/2023] Open
Abstract
Allen's Hummingbird comprises two subspecies, one migratory (Selasphorus sasin sasin) and one nonmigratory (S. s. sedentarius). The nonmigratory subspecies, previously endemic to the California Channel Islands, apparently colonized the California mainland on the Palos Verdes Peninsula some time before 1970 and now breeds throughout coastal southern California. We sequenced and compared populations of mainland nonmigratory Allen's Hummingbird to Channel Island populations from Santa Catalina, San Clemente, and Santa Cruz Island. We found no evidence of founder effects on the mainland population. Values of nucleotide diversity on the mainland were higher than on the Channel Islands. There were low levels of divergence between the Channel Islands and the mainland, and Santa Cruz Island was the most genetically distinct. Ecological niche models showed that rainfall and temperature variables on the Channel Islands are similar in the Los Angeles basin and predicted continued expansion of nonmigratory Allen's Hummingbird north along the coast and inland. We also reviewed previous genetic studies of vertebrate species found on the Channel Islands and mainland and showed that broad conclusions regarding island-mainland patterns remain elusive. Challenges include the idiosyncratic nature of colonization itself as well as the lack of a comprehensive approach that incorporates similar markers and sampling strategies across taxa, which, within the context of a comparative study of island-mainland relationships, may lead to inconsistent results.
Collapse
|
33
|
Burgarella C, Berger A, Glémin S, David J, Terrier N, Deu M, Pot D. The Road to Sorghum Domestication: Evidence From Nucleotide Diversity and Gene Expression Patterns. FRONTIERS IN PLANT SCIENCE 2021; 12:666075. [PMID: 34527004 PMCID: PMC8435843 DOI: 10.3389/fpls.2021.666075] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 07/20/2021] [Indexed: 05/17/2023]
Abstract
Native African cereals (sorghum, millets) ensure food security to millions of low-income people from low fertility and drought-prone regions of Africa and Asia. In spite of their agronomic importance, the genetic bases of their phenotype and adaptations are still not well-understood. Here we focus on Sorghum bicolor, which is the fifth cereal worldwide for grain production and constitutes the staple food for around 500 million people. We leverage transcriptomic resources to address the adaptive consequences of the domestication process. Gene expression and nucleotide variability were analyzed in 11 domesticated and nine wild accessions. We documented a downregulation of expression and a reduction of diversity both in nucleotide polymorphism (30%) and gene expression levels (18%) in domesticated sorghum. These findings at the genome-wide level support the occurrence of a global reduction of diversity during the domestication process, although several genes also showed patterns consistent with the action of selection. Nine hundred and forty-nine genes were significantly differentially expressed between wild and domesticated gene pools. Their functional annotation points to metabolic pathways most likely contributing to the sorghum domestication syndrome, such as photosynthesis and auxin metabolism. Coexpression network analyzes revealed 21 clusters of genes sharing similar expression patterns. Four clusters (totaling 2,449 genes) were significantly enriched in differentially expressed genes between the wild and domesticated pools and two were also enriched in domestication and improvement genes previously identified in sorghum. These findings reinforce the evidence that the combined and intricated effects of the domestication and improvement processes do not only affect the behaviors of a few genes but led to a large rewiring of the transcriptome. Overall, these analyzes pave the way toward the identification of key domestication genes valuable for genetic resources characterization and breeding purposes.
Collapse
|
34
|
Florez-Rueda AM, Scharmann M, Roth M, Städler T. Population Genomics of the "Arcanum" Species Group in Wild Tomatoes: Evidence for Separate Origins of Two Self-Compatible Lineages. FRONTIERS IN PLANT SCIENCE 2021; 12:624442. [PMID: 33815438 PMCID: PMC8018279 DOI: 10.3389/fpls.2021.624442] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Accepted: 02/24/2021] [Indexed: 05/07/2023]
Abstract
Given their diverse mating systems and recent divergence, wild tomatoes (Solanum section Lycopersicon) have become an attractive model system to study ecological divergence, the build-up of reproductive barriers, and the causes and consequences of the breakdown of self-incompatibility. Here we report on a lesser-studied group of species known as the "Arcanum" group, comprising the nominal species Solanum arcanum, Solanum chmielewskii, and Solanum neorickii. The latter two taxa are self-compatible but are thought to self-fertilize at different rates, given their distinct manifestations of the morphological "selfing syndrome." Based on experimental crossings and transcriptome sequencing of a total of 39 different genotypes from as many accessions representing each species' geographic range, we provide compelling evidence for deep genealogical divisions within S. arcanum; only the self-incompatible lineage known as "var. marañón" has close genealogical ties to the two self-compatible species. Moreover, there is evidence under multiple inference schemes for different geographic subsets of S. arcanum var. marañón being closest to S. chmielewskii and S. neorickii, respectively. To broadly characterize the population-genomic consequences of these recent mating-system transitions and their associated speciation events, we fit demographic models indicating strong reductions in effective population size, congruent with reduced nucleotide and S-locus diversity in the two independently derived self-compatible species.
Collapse
|
35
|
Barrow LN, Masiero da Fonseca E, Thompson CEP, Carstens BC. Predicting amphibian intraspecific diversity with machine learning: Challenges and prospects for integrating traits, geography, and genetic data. Mol Ecol Resour 2020; 21:2818-2831. [PMID: 33249725 DOI: 10.1111/1755-0998.13303] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Revised: 11/11/2020] [Accepted: 11/24/2020] [Indexed: 12/19/2022]
Abstract
The growing availability of genetic data sets, in combination with machine learning frameworks, offers great potential to answer long-standing questions in ecology and evolution. One such question has intrigued population geneticists, biogeographers, and conservation biologists: What factors determine intraspecific genetic diversity? This question is challenging to answer because many factors may influence genetic variation, including life history traits, historical influences, and geography, and the relative importance of these factors varies across taxonomic and geographic scales. Furthermore, interpreting the influence of numerous, potentially correlated variables is difficult with traditional statistical approaches. To address these challenges, we analysed repurposed data using machine learning and investigated predictors of genetic diversity, focusing on Nearctic amphibians as a case study. We aggregated species traits, range characteristics, and >42,000 genetic sequences for 299 species using open-access scripts and various databases. After identifying important predictors of nucleotide diversity with random forest regression, we conducted follow-up analyses to examine the roles of phylogenetic history, geography, and demographic processes on intraspecific diversity. Although life history traits were not important predictors for this data set, we found significant phylogenetic signal in genetic diversity within amphibians. We also found that salamander species at northern latitudes contained low genetic diversity. Data repurposing and machine learning provide valuable tools for detecting patterns with relevance for conservation, but concerted efforts are needed to compile meaningful data sets with greater utility for understanding global biodiversity.
Collapse
|
36
|
Rafiepour M, Ebrahimie E, Vahidi MF, Salekdeh GH, Niazi A, Dadpasand M, Liang D, Si J, Ding X, Han J, Zhang Y, Qanbari S. Whole-Genome Resequencing Reveals Adaptation Prior to the Divergence of Buffalo Subspecies. Genome Biol Evol 2020; 13:5976760. [PMID: 33179728 DOI: 10.1093/gbe/evaa231] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/28/2020] [Indexed: 01/30/2023] Open
Abstract
The application of high-throughput genotyping or sequencing data helps us to understand the genomic response to natural and artificial selection. In this study, we scanned the genomes of five indigenous buffalo populations belong to three recognized breeds, adapted to different geographical and agro-ecological zones in Iran, to unravel the extent of genomic diversity and to localize genomic regions and genes underwent past selection. A total of 46 river buffalo whole genomes, from West and East Azerbaijan, Gilan, Mazandaran, and Khuzestan provinces, were resequenced. Our sequencing data reached to a coverage above 99% of the river buffalo reference genome and an average read depth around 9.2× per sample. We identified 20.55 million SNPs, including 63,097 missense, 707 stop-gain, and 159 stop-loss mutations that might have functional consequences. Genomic diversity analyses showed modest structuring among Iranian buffalo populations following frequent gene flow or admixture in the recent past. Evidence of positive selection was investigated using both differentiation (Fst) and fixation (Pi) metrics. Analysis of fixation revealed three genomic regions in all three breeds with aberrant polymorphism contents on BBU2, 20, and 21. Fixation signal on BBU2 overlapped with the OCA2-HERC2 genes, suggestive of adaptation to UV exposure through pigmentation mechanism. Further validation using resequencing data from other five bovine species as well as the Axiom Buffalo Genotyping Array 90K data of river and swamp buffaloes indicated that these fixation signals persisted across river and swamp buffaloes and extended to taurine cattle, implying an ancient evolutionary event occurred before the speciation of buffalo and taurine cattle. These results contributed to our understanding of major genetic switches that took place during the evolution of modern buffaloes.
Collapse
|
37
|
Jaramillo-Correa JP, Bagnoli F, Grivet D, Fady B, Aravanopoulos FA, Vendramin GG, González-Martínez SC. Evolutionary rate and genetic load in an emblematic Mediterranean tree following an ancient and prolonged population collapse. Mol Ecol 2020; 29:4797-4811. [PMID: 33063352 DOI: 10.1111/mec.15684] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2019] [Revised: 09/25/2020] [Accepted: 09/28/2020] [Indexed: 12/18/2022]
Abstract
Severe bottlenecks significantly diminish the amount of genetic diversity and the speed at which it accumulates (i.e., evolutionary rate). They further compromise the efficiency of natural selection to eliminate deleterious variants, which may reach fixation in the surviving populations. Consequently, expanding and adapting to new environments may pose a significant challenge when strong bottlenecks result in genetic pauperization. Herein, we surveyed the patterns of nucleotide diversity, molecular adaptation and genetic load across 177 gene-loci in a circum-Mediterranean conifer (Pinus pinea L.) that represents one of the most extreme cases of genetic pauperization in widespread outbreeding taxa. We found very little genetic variation in both hypervariable nuclear microsatellites (SSRs) and gene-loci, which translated into genetic diversity estimates one order of magnitude lower than those previously reported for pines. Such values were consistent with a strong population decline that began some ~1 Ma. Comparisons with the related and parapatric maritime pine (Pinus pinaster Ait.) revealed reduced rates of adaptive evolution (α and ωa ) and a significant accumulation of genetic load. It is unlikely that these are the result from differences in mutation rate or linkage disequilibrium between the two species; instead they are the presumable outcome of contrasting demographic histories affecting both the speed at which these taxa accumulate genetic diversity, and the global efficacy of selection. Future studies, and programs for conservation and management, should thus start testing for the effects of genetic load on fitness, and integrating such effects into predictive models.
Collapse
|
38
|
Zhang CY, Liu TJ, Mo XL, Huang HR, Yao G, Li JR, Ge XJ, Yan HF. Comparative Analyses of the Chloroplast Genomes of Patchouli Plants and Their Relatives in Pogostemon (Lamiaceae). PLANTS 2020; 9:plants9111497. [PMID: 33167549 PMCID: PMC7694494 DOI: 10.3390/plants9111497] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/20/2020] [Revised: 10/30/2020] [Accepted: 11/02/2020] [Indexed: 01/06/2023]
Abstract
Pogostemon Desf., the largest genus of the tribe Pogostemoneae (Lamiaceae), consists of ca. 80 species distributed mainly from South and Southeast Asia to China. The genus contains many patchouli plants, which are of great economic importance but taxonomically difficult. Therefore, it is necessary to characterize more chloroplast (cp) genomes for infrageneric phylogeny analyses and species identification of Pogostemon, especially for patchouli plants. In this study, we newly generated four cp genomes for three patchouli plants (i.e., Pogostemon plectranthoides Desf., P. septentrionalis C. Y. Wu et Y. C. Huang, and two cultivars of P. cablin (Blanoco) Benth.). Comparison of all samples (including online available cp genomes of P. yatabeanus (Makino) Press and P. stellatus (Lour.) Kuntze) suggested that Pogostemon cp genomes are highly conserved in terms of genome size and gene content, with a typical quadripartite circle structure. Interspecific divergence of cp genomes has been maintained at a relatively low level, though seven divergence hotspot regions were identified by stepwise window analysis. The nucleotide diversity (Pi) value was correlated significantly with gap proportion (indels), but significantly negative with GC content. Our phylogenetic analyses based on 80 protein-coding genes yielded high-resolution backbone topologies for the Lamiaceae and Pogostemon. For the overall mean substitution rates, the synonymous (dS) and nonsynonymous (dN) substitution rate values of protein-coding genes varied approximately threefold, while the dN values among different functional gene groups showed a wider variation range. Overall, the cp genomes of Pogostemon will be useful for phylogenetic reconstruction, species delimitation and identification in the future.
Collapse
|
39
|
Calla B, Demkovich M, Siegel JP, Viana JPG, Walden KKO, Robertson HM, Berenbaum MR. Selective Sweeps in a Nutshell: The Genomic Footprint of Rapid Insecticide Resistance Evolution in the Almond Agroecosystem. Genome Biol Evol 2020; 13:5955766. [PMID: 33146372 PMCID: PMC7850051 DOI: 10.1093/gbe/evaa234] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/01/2020] [Indexed: 11/14/2022] Open
Abstract
Among the most familiar forms of human-driven evolution on ecological time scales is the rapid acquisition of resistance to pesticides by insects. Since the widespread adoption of synthetic organic insecticides in the mid-twentieth century, over 500 arthropod species have evolved resistance to at least one insecticide. Efforts to determine the genetic bases of insecticide resistance have historically focused on individual loci, but the availability of genomic tools has facilitated the screening of genome-wide characteristics. We resequenced three contemporary populations of the navel orangeworm (Amyelois transitella), the principal pest of almond orchards in California, differing in bifenthrin resistance status to examine insecticide-induced changes in the population genomic landscape of this species. We detected an exceptionally large region with virtually no polymorphisms, extending to up to 1.3 Mb in the resistant population. This selective sweep includes genes associated with pyrethroid and DDT resistance, including a cytochrome P450 gene cluster and the gene encoding the voltage-gated sodium channel para. Moreover, the sequence along the sweep is nearly identical in the genome assembled from a population founded in 1966, suggesting that the foundation for insecticide resistance may date back a half-century, when California’s Central Valley experienced massive area-wide applications of DDT for pest control.
Collapse
|
40
|
Zhao W, Sun YQ, Pan J, Sullivan AR, Arnold ML, Mao JF, Wang XR. Effects of landscapes and range expansion on population structure and local adaptation. THE NEW PHYTOLOGIST 2020; 228:330-343. [PMID: 32323335 DOI: 10.1111/nph.16619] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Accepted: 04/15/2020] [Indexed: 05/25/2023]
Abstract
Understanding the origin and distribution of genetic diversity across landscapes is critical for predicting the future of organisms in changing climates. This study investigated how adaptive and demographic forces have shaped diversity and population structure in Pinus densata, a keystone species on Qinghai-Tibetan Plateau (QTP). We examined the distribution of genomic diversity across the range of P. densata using exome capture sequencing. We applied spatially explicit tests to dissect the impacts of allele surfing, geographic isolation and environmental gradients on population differentiation and forecasted how this genetic legacy may limit the persistence of P. densata in future climates. We found that allele surfing from range expansion could explain the distribution of 39% of the c. 48 000 genotyped single nucleotide polymorphisms (SNPs). Uncorrected, these allele frequency clines severely confounded inferences of selection. After controlling for demographic processes, isolation-by-environment explained 9.2-19.5% of the genetic structure, with c. 4.0% of loci being affected by selection. Allele surfing and genotype-environment associations resulted in genomic mismatch under projected climate scenarios. We illustrate that significant local adaptation, when coupled with reduced diversity as a result of demographic history, constrains potential evolutionary response to climate change. The strong signal of genomic vulnerability in P. densata may be representative for other QTP endemics.
Collapse
|
41
|
Lee J, Park J, Xi H, Park J. Comprehensive Analyses of the Complete Mitochondrial Genome of Figulus binodulus (Coleoptera: Lucanidae). JOURNAL OF INSECT SCIENCE (ONLINE) 2020; 20:10. [PMID: 32976575 PMCID: PMC7583265 DOI: 10.1093/jisesa/ieaa090] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2020] [Indexed: 05/06/2023]
Abstract
Figulus binodulus Waterhouse is a small stag beetle distributed in East Asia. We determined the first mitochondrial genome of F. binodulus of which is 16,261-bp long including 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNAs, and a single large noncoding region of 1,717 bp. Gene order of F. binodulus is identical to the ancestral insect mitochondrial gene order as in most other stag beetle species. All of 22 tRNAs could be shaped into typical clover-leaf structure except trnSer1. Comparative analyses of 21 Lucanidae mitochondrial genomes was conducted in aspect of their length and AT-GC ratio. Nucleotide diversities analyses provide that cox1 and cox2 in Lucanidae are less diverse than those of Scarabaeoidea. Fifty simple sequence repeats (SSRs) were identified on F. binodulus mitochondrial genome. Comparative analysis of SSRs among five mitochondrial genomes displayed similar trend along with SSR types. Figulus binodulus was sister to all other available family Lucanidae species in the phylogenetic tree.
Collapse
|
42
|
Salek Ardestani S, Aminafshar M, Zandi Baghche Maryam MB, Banabazi MH, Sargolzaei M, Miar Y. Signatures of selection analysis using whole-genome sequence data reveals novel candidate genes for pony and light horse types. Genome 2020; 63:387-396. [PMID: 32407640 DOI: 10.1139/gen-2020-0001] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Natural selection and domestication have shaped modern horse populations, resulting in a vast range of phenotypically diverse breeds. Horse breeds are classified into three types (pony, light, and draft) generally based on their body type. Understanding the genetic basis of horse type variation and selective pressures related to the evolutionary trend can be particularly important for current selection strategies. Whole-genome sequences were generated for 14 pony and 32 light horses to investigate the genetic signatures of selection of the horse type in pony and light horses. In the overlapping extremes of the fixation index and nucleotide diversity results, we found novel genomic signatures of selective sweeps near key genes previously implicated in body measurements including C4ORF33, CRB1, CPN1, FAM13A, and FGF12 that may influence variation in pony and light horse types. This study contributes to a better understanding of the genetic background of differences between pony and light horse types.
Collapse
|
43
|
Distribution and Nucleotide Diversity of Yr15 in Wild Emmer Populations and Chinese Wheat Germplasm. Pathogens 2020; 9:pathogens9030212. [PMID: 32183066 PMCID: PMC7157507 DOI: 10.3390/pathogens9030212] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Revised: 02/27/2020] [Accepted: 03/09/2020] [Indexed: 12/17/2022] Open
Abstract
Stripe rust, caused by Puccinia striiformis f. sp. tritici (Pst), is a devastating fungal disease of wheat. The wild emmer gene, Yr15 (Wtk1), which confers a strong broad-spectrum resistance to Pst isolates, is composed of kinase and pseudokinase domains. The analysis of 361 wild emmer accessions from a wide range of natural habitats confirms that functional Wtk1 is distributed mainly along a narrow axis from Mt. Carmel to Mt. Hermon regions, in the northern part of Israel, where environmental conditions are favorable to the onset of stripe rust. An analysis of full-length Wtk1 DNA sequences from 49 wild emmer accessions identified three haplotypes and extremely low nucleotide diversity (π = 0.00002). The sequence diversity of Wtk1 is 9.5 times lower than that of broad-spectrum partial resistance gene Yr36 (π = 0.00019), and both are in sharp contrast to the high level of nucleotide diversity previously reported for race-specific resistance genes (e.g., Lr10 and Pm3). However, the nonfunctional wtk1 sequences possess high level of nucleotide diversity (π = 0.07). These results may reflect the different resistance mechanisms and the different evolutionary processes that shaped these resistance genes. Yr15 was absent in 189 Chinese wheat landraces and was present in only 1.02% of the 583 tested modern Chinese wheat cultivars. These results corroborate our previous results showing that Yr15 was absent in 94% of a worldwide collection of 513 wheat cultivars, therefore indicating the importance of Yr15 in wheat stripe rust resistance breeding programs in China and elsewhere around the globe.
Collapse
|
44
|
Ogutu C, Cherono S, Ntini C, Mollah MD, Zhao L, Belal MA, Han Y. Evolutionary rate variation among genes involved in galactomannan biosynthesis in Coffea canephora. Ecol Evol 2020; 10:2559-2569. [PMID: 32185001 PMCID: PMC7069334 DOI: 10.1002/ece3.6084] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Revised: 12/08/2019] [Accepted: 12/09/2019] [Indexed: 12/03/2022] Open
Abstract
The endosperm cell walls of mature coffee seeds accumulate large amounts of mannan storage polysaccharides, which serve as nutrient reserve for embryo and contribute to beverage quality. Our study investigated the evolutionary patterns of key galactomannan (GM) biosynthesis genes using d N/d S ratio, synteny, and phylogenetic analysis and detected heterogeneity in rate of evolution among gene copies. Selection ratio index revealed evidence of positive selection in the branch editing gene Coffea canephora alpha (α) galactosidase (Cc-alpha Gal) at Cc11_g15950 copy (ω = 1.12), whereas strong purifying selection on deleterious mutations was observed in the Coffea canephora uridine diphosphate (UDP)-glucose 4'-epimerase (Cc-UG4E) and Coffea canephora mannose-1P guanylytransferase (Cc-MGT) genes controlling the crucial nucleotide carbon sugar building blocks flux in the pathway. Relatively low sequence diversity and strong syntenic linkages were detected in all GM pathway genes except in Cc-alpha Gal, which suggests a correlation between selection pressure and nucleotide diversity or synteny analysis. In addition, phylogenetic analysis revealed independent evolution or expansion of GM pathway genes in different plant species, with no obvious inferable clustering patterns according to either gene family or congruent with evolutionary plants lineages tested due to high dynamic nature and specific biochemical cell wall modification requirements. Altogether, our study shows a significant high rate of evolutionary variation among GM pathway genes in the diploid C. canephora and demonstrates the inherent variation in evolution of gene copies and their potential role in understanding selection rates in a homogenously connected metabolic pathway.
Collapse
|
45
|
Sun YQ, Zhao W, Xu CQ, Xu Y, El-Kassaby YA, De La Torre AR, Mao JF. Genetic Variation Related to High Elevation Adaptation Revealed by Common Garden Experiments in Pinus yunnanensis. Front Genet 2020; 10:1405. [PMID: 32117429 PMCID: PMC7027398 DOI: 10.3389/fgene.2019.01405] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2019] [Accepted: 12/23/2019] [Indexed: 12/30/2022] Open
Abstract
Local adaptation, adaptation to specialized niches and environmental clines have been extensively reported for forest trees. Investigation of the adaptive genetic variation is crucial for forest resource management and breeding, especially in the context of global climate change. Here, we utilized a Pinus yunnanensis common garden experiments established at high and low elevation sites to assess the differences in growth and survival among populations and between the two common garden sites. The studied traits showed significant variation between the two test sites and among populations, suggesting adaptive divergence. To detect genetic variation related to environment, we captured 103,608 high quality SNPs based on RNA sequencing, and used them to assess the genetic diversity and population structure. We identified 321 outlier SNPs from 131 genes showing significant divergence in allelic frequency between survival populations of two sites. Functional categories associated with adaptation to high elevation were found to be related to flavonoid biosynthesis, response to UV, DNA repair, response to reactive oxygen species, and membrane lipid metabolic process. Further investigation of the outlier genes showed overrepresentation of the flavonoid biosynthesis pathway, suggesting that this pathway may play a key role in P. yunnanensis adaptation to high elevation environments. The outlier genes identified, and their variants, provide a basic reference for advanced investigations.
Collapse
|
46
|
Inferring the Genomic Landscape of Recombination Rate Variation in European Aspen ( Populus tremula). G3-GENES GENOMES GENETICS 2020; 10:299-309. [PMID: 31744900 PMCID: PMC6945010 DOI: 10.1534/g3.119.400504] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
The rate of meiotic recombination is one of the central factors determining genome-wide levels of linkage disequilibrium which has important consequences for the efficiency of natural selection and for the dissection of quantitative traits. Here we present a new, high-resolution linkage map for Populus tremula that we use to anchor approximately two thirds of the P. tremula draft genome assembly on to the expected 19 chromosomes, providing us with the first chromosome-scale assembly for P. tremula (Table 2). We then use this resource to estimate variation in recombination rates across the P. tremula genome and compare these results to recombination rates based on linkage disequilibrium in a large number of unrelated individuals. We also assess how variation in recombination rates is associated with a number of genomic features, such as gene density, repeat density and methylation levels. We find that recombination rates obtained from the two methods largely agree, although the LD-based method identifies a number of genomic regions with very high recombination rates that the map-based method fails to detect. Linkage map and LD-based estimates of recombination rates are positively correlated and show similar correlations with other genomic features, showing that both methods can accurately infer recombination rate variation across the genome. Recombination rates are positively correlated with gene density and negatively correlated with repeat density and methylation levels, suggesting that recombination is largely directed toward gene regions in P. tremula.
Collapse
|
47
|
Development and Application of InDel Markers for Authentication of the Korean Herbs Zanthoxylum schinifolium and Zanthoxylum piperitum. Foods 2019; 8:foods8120658. [PMID: 31835363 PMCID: PMC6963880 DOI: 10.3390/foods8120658] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Revised: 11/29/2019] [Accepted: 12/04/2019] [Indexed: 12/01/2022] Open
Abstract
Zanthoxylum schinifolium and Zanthoxylum piperitum are the sources of the well-known traditional Korean herbal medicines “sancho” (prickly ash) and “chopi” (Korean pepper), respectively. Sancho and chopi are often indiscriminately mixed due to the similar appearance of the herbal materials when used as spices and herbal medicines. Moreover, commercial sancho and chopi products often contain adulterants, which is insufficient to ensure food efficacy and safety. In this study, we developed hypervariable insertion/deletion (InDel) markers to distinguish between sancho and chopi products by comparing the complete chloroplast genome sequences of four Zanthoxylum species deposited in the National Center for Biotechnology Information (NCBI) GenBank. Comparative analyses of the nucleotide diversity (Pi) of these Zanthoxylum genomes revealed four hypervariable divergent sites (trnH-psbA, psbZ-trnG, trnfM-rps14, and trnF-ndhK) with Pi > 0.025 among 520 windows. Of these four regions, including two genic and two intergenic regions, only psbZ-trnG yielded accurate PCR amplification results between commercial sancho and chopi products from the Korean herbal medicine market. We therefore selected psbZ-trnG, an InDel-variable locus with high discriminatory powers, as a candidate DNA barcode locus. This InDel marker could be used as a valuable, simple, and efficient tool for identifying these medicinal herbs, thereby increasing the safety of these spices and herbal materials in the food market.
Collapse
|
48
|
Carvalho SB, Torres J, Tarroso P, Velo-Antón G. Genes on the edge: A framework to detect genetic diversity imperiled by climate change. GLOBAL CHANGE BIOLOGY 2019; 25:4034-4047. [PMID: 31230387 DOI: 10.1111/gcb.14740] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Revised: 05/10/2019] [Accepted: 06/13/2019] [Indexed: 06/09/2023]
Abstract
Ongoing global warming is disrupting several ecological and evolutionary processes, spanning different levels of biological organization. Species are expected to shift their ranges as a response to climate change, with relevant implications to peripheral populations at the trailing and leading edges. Several studies have analyzed the exposure of species to climate change but few have explored exposure at the intraspecific level. We introduce a framework to forecast exposure to climate change at the intraspecific level. We build on existing methods by combining correlative species distribution models, a model of species range dynamics, and a model of phylogeographic interpolation. We demonstrate the framework by applying it to 20 Iberian amphibian and reptile species. Our aims were to: (a) identify which species and intraspecific lineages will be most exposed to future climate change; (b) test if nucleotide diversity at the edges of species ranges are significantly higher or lower than on the overall range; and (c) analyze if areas of higher species gain, loss, and turnover coincide with those predicted for lineages richness and nucleotide diversity. We found that about 80% of the studied species are predicted to contract their range. Within each species, some lineages were predicted to contract their range, while others were predicted to maintain or expand it. Therefore, estimating the impacts of climate change at the species level only can underestimate losses at the intraspecific level. Some species had significant high amount of nucleotide at the trailing or leading edge, or both, but we did not find a consistent pattern across species. Spatial patterns of species richness, gain, loss, and turnover were fairly concurrent with lineages richness and nucleotide diversity. Our results support the need for increased attention to intraspecific diversity regarding monitoring and conservation strategies under climate change.
Collapse
|
49
|
Natural Variation and Domestication Selection of ZmPGP1 Affects Plant Architecture and Yield-Related Traits in Maize. Genes (Basel) 2019; 10:genes10090664. [PMID: 31480272 PMCID: PMC6770335 DOI: 10.3390/genes10090664] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 08/28/2019] [Accepted: 08/28/2019] [Indexed: 12/24/2022] Open
Abstract
ZmPGP1, involved in the polar auxin transport, has been shown to be associated with plant height, leaf angle, yield traits, and root development in maize. To explore natural variation and domestication selection of ZmPGP1, we re-sequenced the ZmPGP1 gene in 349 inbred lines, 68 landraces, and 32 teosintes. Sequence polymorphisms, nucleotide diversity, and neutral tests revealed that ZmPGP1 might be selected during domestication and improvement processes. Marker–trait association analysis in inbred lines identified 11 variants significantly associated with 4 plant architecture and 5 ear traits. SNP1473 was the most significant variant for kernel length and ear grain weight. The frequency of an increased allele T was 40.6% in teosintes, and it was enriched to 60.3% and 89.1% during maize domestication and improvement. This result revealed that ZmPGP1 may be selected in the domestication and improvement process, and significant variants could be used to develop functional markers to improve plant architecture and ear traits in maize.
Collapse
|
50
|
Genetic Diversity and Phylogenetic Relationships of Annual and Perennial Glycine Species. G3-GENES GENOMES GENETICS 2019; 9:2325-2336. [PMID: 31097479 PMCID: PMC6643897 DOI: 10.1534/g3.119.400220] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
We have estimated the average genetic diversity of two Glycine annual and six perennial species based upon 76 orthologous gene sets and performed phylogenetic analysis, divergence analysis and tests for departure from neutrality of the eight species using 52 orthologous gene sets. In addition, 367 orthologous gene sets were used to estimate the relationships of 11 G. canescens accessions. Among the perennials, G. canescens showed the highest nucleotide diversity. The other perennials, except for G. tomentella, had higher nucleotide diversity than the two annuals. Phylogenetic analysis of the Glycine showed a similar genome grouping with the previous report except for G. cyrtoloba and G. stenophita which formed a sister clade in the study. Divergence analysis supported the phylogenetic relationships that G. falcata was the most divergent from G. max, followed by G. cyrtoloba, G. syndetika, G. tomentella D3, G. stenophita and G. canescens. Most genic sequences were homogeneous in the levels of polymorphism and divergence between G. max and other Glycine species based on the HKA test, thus, Glycine perennials may have experienced a very similar evolution as inferred by trans-specific mutation analysis. The greater genetic diversity of most perennial Glycine species and their origins from the warmer and drier climates of Australia suggests the perennials maybe a potential source of heat and drought resistance that will be of value in the face of climate change.
Collapse
|