1
|
Liu Y, Ding K, Liang L, Zhang Z, Chen K, Li H. Comparative study on chloroplast genome of Tamarix species. Ecol Evol 2024; 14:e70353. [PMID: 39360124 PMCID: PMC11445282 DOI: 10.1002/ece3.70353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 09/05/2024] [Accepted: 09/10/2024] [Indexed: 10/04/2024] Open
Abstract
Tamaricaceae comprises about 120 species and has a long evolutionary history, Tamarix Linn accounts for approximately 75% of the total species in this family. It is the most widely distributed and diverse genus in the family. They have important ecological significance for transforming deserts and improving climate conditions. However, Tamarix is the most poorly classified genera among flowering plants owing to its large variability and high susceptibility to interspecific hybridization. In this study, the complete chloroplast genomes of three Tamarix species and one draft chloroplast genome were obtained in this study. Combined with eight chloroplast genomes deposited in GenBank, complete chloroplast sequences of 12 Tamarix species were used for further analysis. There are 176 non-SSR-related indels and 681 non-indel-related SSRs in the 12 Tamarix chloroplast genomes. The mononucleotide SSRs are the most prevalent among all types of SSRs. The mVISTA results indicate high sequence similarities across the chloroplast genome, suggesting that the chloroplast genomes are highly conserved, except for sample Tamarix androssowii (ENC850343). The IR regions and the coding regions are more conserved than the single-copy and noncoding regions. The trnF-ndhJ, ndhC-trnM-CAU, ycf1, and trnL-UAG-ndhF regions are the most variable and have higher variability than those of the universal DNA markers. Finally, the first phylogenetic tree of Tamaricaceae was constructed which confirmed the monophyly of Tamarix in Tamaricaceae. The first phylogenetic tree of Tamarix was based on the complete chloroplast genome to date, the changes in branch length and support rate can potentially help us clarify the phylogenetic relationships of Tamarix. All the obtained genetic resources will facilitate future studies in population genetics, species identification, and conservation biology of Tamarix.
Collapse
Affiliation(s)
- Yanlei Liu
- School of Landscape and Ecological Engineering Hebei University of Engineering Handan China
| | - Kuo Ding
- Bingtuan Xingxin Vocational and Technical College Tiemenguan China
| | - Lixiong Liang
- School of Landscape and Ecological Engineering Hebei University of Engineering Handan China
| | - Zhan Zhang
- Bingtuan Xingxin Vocational and Technical College Tiemenguan China
| | - Kai Chen
- Bingtuan Xingxin Vocational and Technical College Tiemenguan China
| | - Haiwen Li
- College of Life Sciences and Technology Tarim University Alar China
| |
Collapse
|
2
|
Shang C, Li E, Yu Z, Lian M, Chen Z, Liu K, Xu L, Tong Z, Wang M, Dong W. Chloroplast Genomic Resources and Genetic Divergence of Endangered Species Bretschneidera sinensis (Bretschneideraceae). Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.873100] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Bretschneidera sinensis is an endangered woody species found in East and South China. Comprehensive intraspecies chloroplast genome studies have demonstrated novel genetic resources to assess the genetic variation and diversity of this species. Using genome skimming method, we assembled the whole chloroplast genome of 12 genotypes of B. sinensis from different geographical locations, covering most wild populations. The B. sinensis chloroplast genome size ranged from 158,959 to 159,045 base pairs (bp) and displayed a typical circular quadripartite structure. Comparative analyses of 12 B. sinensis chloroplast genome revealed 33 polymorphic simple sequence repeats (SSRs), 105 polymorphic single nucleotide polymorphisms (SNPs), and 55 indels. Phylogenetic analysis showed that the 12 genotypes were grouped into 2 branches, which is consistent with the geographical distribution (Eastern clade and Western clade). Divergence time estimates showed that the two clades were divergent from 0.6 Ma in the late Pleistocene. Ex situ conservation is essential for this species. In this study, we identified SNPs, indels, and microsatellites of B. sinensis by comparative analyses of chloroplast genomes and determined genetic variation between populations using these genomic markers. Chloroplast genomic resources are also important for further domestication, population genetic, and phylogenetic analysis, possibly in combination with molecular markers of mitochondrial and/or nuclear genomes.
Collapse
|
3
|
Xu W, Li Y, Li Y, Liu C, Wang Y, Xia G, Wang M. Asymmetric Somatic Hybridization Affects Synonymous Codon Usage Bias in Wheat. Front Genet 2021; 12:682324. [PMID: 34178040 PMCID: PMC8226224 DOI: 10.3389/fgene.2021.682324] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Accepted: 05/07/2021] [Indexed: 11/24/2022] Open
Abstract
Asymmetric somatic hybridization is an efficient strategy for crop breeding by introducing exogenous chromatin fragments, which leads to whole genomic shock and local chromosomal shock that induces genome-wide genetic variation including indel (insertion and deletion) and nucleotide substitution. Nucleotide substitution causes synonymous codon usage bias (SCUB), an indicator of genomic mutation and natural selection. However, how asymmetric somatic hybridization affects SCUB has not been addressed. Here, we explored this issue by comparing expressed sequence tags of a common wheat cultivar and its asymmetric somatic hybrid line. Asymmetric somatic hybridization affected SCUB and promoted the bias to A- and T-ending synonymous codon (SCs). SCUB frequencies in chromosomes introgressed with exogenous fragments were comparable to those in chromosomes without exogenous fragments, showing that exogenous fragments had no local chromosomal effect. Asymmetric somatic hybridization affected SCUB frequencies in indel-flanking sequences more strongly than in non-flanking sequences, and this stronger effect was present in both chromosomes with and without exogenous fragments. DNA methylation-driven SCUB shift was more pronounced than other SC pairs. SCUB shift was similar among seven groups of allelic chromosomes as well as three sub-genomes. Our work demonstrates that the SCUB shift induced by asymmetric somatic hybridization is attributed to the whole genomic shock, and DNA methylation is a putative force of SCUB shift during asymmetric somatic hybridization. Asymmetric somatic hybridization provides an available method for deepening the nature of SCUB shift and genetic variation induced by genomic shock.
Collapse
Affiliation(s)
- Wenjing Xu
- The Key Laboratory of Plant Development and Environmental Adaption, Ministry of Education, School of Life Science, Shandong University, Jinan, China
| | - Yingchun Li
- The Key Laboratory of Plant Development and Environmental Adaption, Ministry of Education, School of Life Science, Shandong University, Jinan, China
| | - Yajing Li
- The Key Laboratory of Plant Development and Environmental Adaption, Ministry of Education, School of Life Science, Shandong University, Jinan, China
| | - Chun Liu
- The Key Laboratory of Plant Development and Environmental Adaption, Ministry of Education, School of Life Science, Shandong University, Jinan, China
| | - Yanxia Wang
- Shijiazhuang Academy of Agriculture and Forestry Sciences, Shijiazhuang, China
| | - Guangmin Xia
- The Key Laboratory of Plant Development and Environmental Adaption, Ministry of Education, School of Life Science, Shandong University, Jinan, China
| | - Mengcheng Wang
- The Key Laboratory of Plant Development and Environmental Adaption, Ministry of Education, School of Life Science, Shandong University, Jinan, China
| |
Collapse
|
4
|
Sohn HB, Kim SJ, Hong SY, Park SG, Oh DH, Lee S, Nam HY, Nam JH, Kim YH. Development of 50 InDel-based barcode system for genetic identification of tartary buckwheat resources. PLoS One 2021; 16:e0250786. [PMID: 34081692 PMCID: PMC8174720 DOI: 10.1371/journal.pone.0250786] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 04/14/2021] [Indexed: 11/18/2022] Open
Abstract
Tartary buckwheat (Fagopyrum tataricum Gartn.) is a highly functional crop that is poised to be the target of many future breeding efforts. The reliable ex situ conservation of various genetic resources is essential for the modern breeding of tartary buckwheat varieties. We developed PCR-based co-dominant insertion/deletion (InDel) markers to discriminate tartary buckwheat genetic resources. First, we obtained the whole genome from 26 accessions across a superscaffold-scale reference genome of 569.37 Mb for tartary buckwheat cv. "Daegwan 3-7." Next, 171,926 homogeneous and 53,755 heterogeneous InDels were detected by comparing 26 accessions with the "Daegwan 3-7" reference sequence. Of these, 100 candidate InDels ranging from 5-20 bp in length were chosen for validation, and 50 of them revealed polymorphisms between the 26 accessions and "Daegwan 3-7." The validated InDels were further tested through the assessment of their likelihood to give rise to a single or a few PCR products in 50 other accessions, covering most tartary buckwheat genome types. The major allele frequencies ranged from 0.5616 at the TB42 locus to 0.9863 at the TB48 locus, with the average PIC value of 0.1532 with a range of 0.0267-0.3712. To create a user-friendly system, the homology of the genotypes between and among the accessions were visualized in both one- (1D) and two-dimensional (2D) barcode types by comparing amplicon polymorphisms with the reference variety, "Daegwan 3-7." A phylogenetic tree and population structure of the 76 accessions according to amplicon polymorphisms for the 50 InDel markers corresponded to those using non-synonymous single nucleotide polymorphism variants, indicating that the barcode system based on the 50 InDels was a useful tool to improve the reliability of identification of tartary buckwheat accessions in the germplasm stocks.
Collapse
Affiliation(s)
- Hwang-Bae Sohn
- Highland Agriculture Research Institute, National Institute of Crop Science, Pyeongchang, Gangwon-do, Republic of Korea
| | - Su-Jeong Kim
- Highland Agriculture Research Institute, National Institute of Crop Science, Pyeongchang, Gangwon-do, Republic of Korea
| | - Su-Young Hong
- Highland Agriculture Research Institute, National Institute of Crop Science, Pyeongchang, Gangwon-do, Republic of Korea
| | - Sin-Gi Park
- TheragenEtex Bio Institute, TherageneEtex Inc., Suwon, Gyeonggi-do, Republic of Korea
| | - Dong-Ha Oh
- Department of Biological Science, Louisiana State University, Baton Rouge, LA, United States of America
| | - Sunghoon Lee
- EONE-DIAGNOMICS Genome Center Co. Ltd., Incheon, Republic of Korea
| | - Hwa Yeun Nam
- Highland Agriculture Research Institute, National Institute of Crop Science, Pyeongchang, Gangwon-do, Republic of Korea
| | - Jung Hwan Nam
- Highland Agriculture Research Institute, National Institute of Crop Science, Pyeongchang, Gangwon-do, Republic of Korea
| | - Yul-Ho Kim
- Highland Agriculture Research Institute, National Institute of Crop Science, Pyeongchang, Gangwon-do, Republic of Korea
| |
Collapse
|
5
|
Wang Y, Wang S, Liu Y, Yuan Q, Sun J, Guo L. Chloroplast genome variation and phylogenetic relationships of Atractylodes species. BMC Genomics 2021; 22:103. [PMID: 33541261 PMCID: PMC7863269 DOI: 10.1186/s12864-021-07394-8] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 01/19/2021] [Indexed: 12/21/2022] Open
Abstract
Background Atractylodes DC is the basic original plant of the widely used herbal medicines “Baizhu” and “Cangzhu” and an endemic genus in East Asia. Species within the genus have minor morphological differences, and the universal DNA barcodes cannot clearly distinguish the systemic relationship or identify the species of the genus. In order to solve these question, we sequenced the chloroplast genomes of all species of Atractylodes using high-throughput sequencing. Results The results indicate that the chloroplast genome of Atractylodes has a typical quadripartite structure and ranges from 152,294 bp (A. carlinoides) to 153,261 bp (A. macrocephala) in size. The genome of all species contains 113 genes, including 79 protein-coding genes, 30 transfer RNA genes and four ribosomal RNA genes. Four hotspots, rpl22-rps19-rpl2, psbM-trnD, trnR-trnT(GGU), and trnT(UGU)-trnL, and a total of 42–47 simple sequence repeats (SSR) were identified as the most promising potentially variable makers for species delimitation and population genetic studies. Phylogenetic analyses of the whole chloroplast genomes indicate that Atractylodes is a clade within the tribe Cynareae; Atractylodes species form a monophyly that clearly reflects the relationship within the genus. Conclusions Our study included investigations of the sequences and structural genomic variations, phylogenetics and mutation dynamics of Atractylodes chloroplast genomes and will facilitate future studies in population genetics, taxonomy and species identification. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07394-8.
Collapse
Affiliation(s)
- Yiheng Wang
- National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China
| | - Sheng Wang
- National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China
| | - Yanlei Liu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
| | - Qingjun Yuan
- National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China
| | - Jiahui Sun
- National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China.
| | - Lanping Guo
- National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China.
| |
Collapse
|
6
|
Wang Y, Jia HM, Shen YT, Zhao HB, Yang QS, Zhu CQ, Sun DL, Wang GY, Zhou CC, Jiao Y, Chai CY, Yan LJ, Li XW, Jia HJ, Gao ZS. Construction of an anchoring SSR marker genetic linkage map and detection of a sex-linked region in two dioecious populations of red bayberry. HORTICULTURE RESEARCH 2020; 7:53. [PMID: 32257239 PMCID: PMC7109115 DOI: 10.1038/s41438-020-0276-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Revised: 02/02/2020] [Accepted: 02/12/2020] [Indexed: 05/21/2023]
Abstract
Red bayberry (Morella rubra) is an evergreen fruit tree found in southern China whose whole-genome sequence has recently been published. We updated the linkage map of the species by adding 118 SSR markers and the female-specific marker MrFT2_BD-SEX. The integrated map included eight linkage groups and spanned 491 cM. Eleven sex-associated markers were identified, six of which were located in linkage group 8, in agreement with the previously reported location of the sex-determining region. The MrFT2_BD-SEX marker was genotyped in 203 cultivated accessions. Among the females of the accessions, we found two female-specific alleles, designated W-b (151 bp) and W-d (129 bp). We previously found that 'Dongkui', a female cultivar, could produce viable pollen (we refer to such plants 'Dongkui-male') and serve as the paternal parent in crosses. The genotypes of the MrFT2_BD-SEX marker were W-b/Z in 'Biqi' and W-d/Z in 'Dongkui-male'. The progeny of a cross between these parents produced a 3:1 female (W-) to male (ZZ) ratio and the expected 1:1:1:1 ratio of W-b/W-d: W-b/Z: W-d/Z: Z/Z. In addition, the flowering and fruiting phenotypes of all the F1 progeny fit their genotypes. Our results confirm the existence of ZW sex determination and show that the female phenotype is controlled by a single dominant locus (W) in a small genomic region (59 kb and less than 3.3 cM). Furthermore, we have produced a homozygous "super female" (WW) that should produce all-female offspring in the F2 generation, providing a foundation for commercial use and presenting great potential for use in modern breeding programs.
Collapse
Affiliation(s)
- Yan Wang
- Fruit Science Institute, College of Agriculture and Biotechnology, Zhejiang University, 310058 Hangzhou, China
| | - Hui-Min Jia
- Shanghai Center for Plant Stress Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, 201602 Shanghai, China
| | - Yu-Tong Shen
- Fruit Science Institute, College of Agriculture and Biotechnology, Zhejiang University, 310058 Hangzhou, China
| | - Hai-Bo Zhao
- Fruit Science Institute, College of Agriculture and Biotechnology, Zhejiang University, 310058 Hangzhou, China
| | - Qin-Song Yang
- Fruit Science Institute, College of Agriculture and Biotechnology, Zhejiang University, 310058 Hangzhou, China
| | - Chang-Qing Zhu
- Fruit Science Institute, College of Agriculture and Biotechnology, Zhejiang University, 310058 Hangzhou, China
| | - De-Li Sun
- Fruit Science Institute, College of Agriculture and Biotechnology, Zhejiang University, 310058 Hangzhou, China
| | - Guo-Yun Wang
- Yuyao Forestry Technology Extension Center, 315400 Ningbo, China
| | - Chao-Chao Zhou
- Yuyao Forestry Technology Extension Center, 315400 Ningbo, China
| | - Yun Jiao
- Institute of Forestry, Ningbo Academy of Agricultural Science, Ningbo, China
| | - Chun-Yan Chai
- Cixi Forestry Technology Extension Center, 315300 Cixi, China
| | - Li-Ju Yan
- Linhai Forestry Technology Extension Center, 317000 Taizhou, China
| | - Xiong-Wei Li
- Forest & Fruit Tree Institute, Shanghai Academy of Agricultural Sciences, 201403 Shanghai, China
| | - Hui-Juan Jia
- Fruit Science Institute, College of Agriculture and Biotechnology, Zhejiang University, 310058 Hangzhou, China
| | - Zhong-Shan Gao
- Fruit Science Institute, College of Agriculture and Biotechnology, Zhejiang University, 310058 Hangzhou, China
| |
Collapse
|
7
|
Zhang L, Hu J, Han X, Li J, Gao Y, Richards CM, Zhang C, Tian Y, Liu G, Gul H, Wang D, Tian Y, Yang C, Meng M, Yuan G, Kang G, Wu Y, Wang K, Zhang H, Wang D, Cong P. A high-quality apple genome assembly reveals the association of a retrotransposon and red fruit colour. Nat Commun 2019; 10:1494. [PMID: 30940818 PMCID: PMC6445120 DOI: 10.1038/s41467-019-09518-x] [Citation(s) in RCA: 168] [Impact Index Per Article: 33.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Accepted: 03/13/2019] [Indexed: 01/14/2023] Open
Abstract
A complete and accurate genome sequence provides a fundamental tool for functional genomics and DNA-informed breeding. Here, we assemble a high-quality genome (contig N50 of 6.99 Mb) of the apple anther-derived homozygous line HFTH1, including 22 telomere sequences, using a combination of PacBio single-molecule real-time (SMRT) sequencing, chromosome conformation capture (Hi-C) sequencing, and optical mapping. In comparison to the Golden Delicious reference genome, we identify 18,047 deletions, 12,101 insertions and 14 large inversions. We reveal that these extensive genomic variations are largely attributable to activity of transposable elements. Interestingly, we find that a long terminal repeat (LTR) retrotransposon insertion upstream of MdMYB1, a core transcriptional activator of anthocyanin biosynthesis, is associated with red-skinned phenotype. This finding provides insights into the molecular mechanisms underlying red fruit coloration, and highlights the utility of this high-quality genome assembly in deciphering agriculturally important trait in apple.
Collapse
Affiliation(s)
- Liyi Zhang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Jiang Hu
- Nextomics Biosciences Institute, 430000, Wuhan, Hubei, China
| | - Xiaolei Han
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Jingjing Li
- Nextomics Biosciences Institute, 430000, Wuhan, Hubei, China
| | - Yuan Gao
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Christopher M Richards
- USDA-ARS National Center for Genetic Resources Preservation, Fort Collins, CO, 80521, USA
| | - Caixia Zhang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Yi Tian
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Guiming Liu
- Beijing Agro-Biotechnology Research Center, Beijing Academy of Agriculture and Forestry Sciences, 100097, Beijing, China
| | - Hera Gul
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Dajiang Wang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Yu Tian
- Nextomics Biosciences Institute, 430000, Wuhan, Hubei, China
| | - Chuanxin Yang
- Nextomics Biosciences Institute, 430000, Wuhan, Hubei, China
| | - Minghui Meng
- Nextomics Biosciences Institute, 430000, Wuhan, Hubei, China
| | - Gaopeng Yuan
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Guodong Kang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Yonglong Wu
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Kun Wang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Hengtao Zhang
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Science, 450009, Zhengzhou, Henan, China
| | - Depeng Wang
- Nextomics Biosciences Institute, 430000, Wuhan, Hubei, China.
| | - Peihua Cong
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China.
| |
Collapse
|
8
|
Cultrera NGM, Sarri V, Lucentini L, Ceccarelli M, Alagna F, Mariotti R, Mousavi S, Ruiz CG, Baldoni L. High Levels of Variation Within Gene Sequences of Olea europaea L. FRONTIERS IN PLANT SCIENCE 2019; 9:1932. [PMID: 30671076 PMCID: PMC6331486 DOI: 10.3389/fpls.2018.01932] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2018] [Accepted: 12/12/2018] [Indexed: 05/08/2023]
Abstract
Gene sequence variation in cultivated olive (Olea europaea L. subsp. europaea var. europaea), the most important oil tree crop of the Mediterranean basin, has been poorly evaluated up to now. A deep sequence analysis of fragments of four genes, OeACP1, OeACP2, OeLUS and OeSUT1, in 90 cultivars, revealed a wide range of polymorphisms along all recognized allele forms and unexpected allele frequencies and genotype combinations. High linkage values among most polymorphisms were recorded within each gene fragment. The great sequence variability corresponded to a low number of alleles and, surprisingly, to a small fraction of genotype combinations. The distribution, frequency, and combination of the different alleles at each locus is possibly due to natural and human pressures, such as selection, ancestrality, or fitness. Phylogenetic analyses of allele sequences showed distant and complex patterns of relationships among cultivated olives, intermixed with other related forms, highlighting an evolutionary connection between olive cultivars and the O. europaea subspecies cuspidata and cerasiformis. This study demonstrates how a detailed and complete sequence analysis of a few gene portions and a thorough genotyping on a representative set of cultivars can clarify important issues related to sequence polymorphisms, reconstructing the phylogeny of alleles, as well as the genotype combinations. The identification of regions representing blocks of recombination could reveal polymorphisms that represent putatively functional markers. Indeed, specific mutations found on the analyzed OeACP1 and OeACP2 fragments seem to be correlated to the fruit weight.
Collapse
Affiliation(s)
- Nicolò G. M. Cultrera
- Institute of Biosciences and Bioresources, National Research Council, Perugia, Italy
| | - Vania Sarri
- Department of Chemistry, Biology and Biotechnology, University of Perugia, Perugia, Italy
| | - Livia Lucentini
- Department of Chemistry, Biology and Biotechnology, University of Perugia, Perugia, Italy
| | - Marilena Ceccarelli
- Department of Chemistry, Biology and Biotechnology, University of Perugia, Perugia, Italy
| | - Fiammetta Alagna
- ENEA Italian National Agency for New Technologies Energy and Sustainable Economic Development, Trisaia Research Center, Rotondella, Italy
| | - Roberto Mariotti
- Institute of Biosciences and Bioresources, National Research Council, Perugia, Italy
| | - Soraya Mousavi
- Institute of Biosciences and Bioresources, National Research Council, Perugia, Italy
| | | | - Luciana Baldoni
- Institute of Biosciences and Bioresources, National Research Council, Perugia, Italy
| |
Collapse
|
9
|
Abstract
A major current molecular evolution challenge is to link comparative genomic patterns to species' biology and ecology. Breeding systems are pivotal because they affect many population genetic processes and thus genome evolution. We review theoretical predictions and empirical evidence about molecular evolutionary processes under three distinct breeding systems-outcrossing, selfing, and asexuality. Breeding systems may have a profound impact on genome evolution, including molecular evolutionary rates, base composition, genomic conflict, and possibly genome size. We present and discuss the similarities and differences between the effects of selfing and clonality. In reverse, comparative and population genomic data and approaches help revisiting old questions on the long-term evolution of breeding systems.
Collapse
Affiliation(s)
- Sylvain Glémin
- Institut des Sciences de l'Evolution, UMR5554, Université Montpellier II, Montpellier, France
| | - Clémentine M François
- Institut des Sciences de l'Evolution, UMR5554, Université Montpellier II, Montpellier, France
| | - Nicolas Galtier
- Institut des Sciences de l'Evolution, UMR5554, Université Montpellier II, Montpellier, France.
| |
Collapse
|
10
|
Wang M, Ji Y, Feng S, Liu C, Xiao Z, Wang X, Wang Y, Xia G. The non-random patterns of genetic variation induced by asymmetric somatic hybridization in wheat. BMC PLANT BIOLOGY 2018; 18:244. [PMID: 30332989 PMCID: PMC6192298 DOI: 10.1186/s12870-018-1474-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Accepted: 10/05/2018] [Indexed: 06/08/2023]
Abstract
BACKGROUND Asymmetric somatic hybridization is an efficient crop breeding approach by introducing several exogenous chromatin fragments, which leads to genomic shock and therefore induces genome-wide genetic variation. However, the fundamental question concerning the genetic variation such as whether it occurs randomly and suffers from selection pressure remains unknown. RESULTS Here, we explored this issue by comparing expressed sequence tags of a common wheat cultivar and its asymmetric somatic hybrid line. Both nucleotide substitutions and indels (insertions and deletions) had lower frequencies in coding sequences than in un-translated regions. The frequencies of nucleotide substitutions and indels were both comparable between chromosomes with and without introgressed fragments. Nucleotide substitutions distributed unevenly and were preferential to indel-flanking sequences, and the frequency of nucleotide substitutions at 5'-flanking sequences of indels was obviously higher in chromosomes with introgressed fragments than in those without exogenous fragment. Nucleotide substitutions and indels both had various frequencies among seven groups of allelic chromosomes, and the frequencies of nucleotide substitutions were strongly negatively correlative to those of indels. Among three sets of genomes, the frequencies of nucleotide substitutions and indels were both heterogeneous, and the frequencies of nucleotide substitutions exhibited drastically positive correlation to those of indels. CONCLUSIONS Our work demonstrates that the genetic variation induced by asymmetric somatic hybridization is attributed to both whole genomic shock and local chromosomal shock, which is a predetermined and non-random genetic event being closely associated with selection pressure. Asymmetric somatic hybrids provide a worthwhile model to further investigate the nature of genomic shock induced genetic variation.
Collapse
Affiliation(s)
- Mengcheng Wang
- The Key Laboratory of Plant Cell Engineering and Germplasm Innovation, Ministry of Education, School of Life Science, Shandong University, 27 Shandanan Road, Jinan, Shandong 250100 People’s Republic of China
| | - Yujie Ji
- College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, 210095 China
| | - Shiting Feng
- The Key Laboratory of Plant Cell Engineering and Germplasm Innovation, Ministry of Education, School of Life Science, Shandong University, 27 Shandanan Road, Jinan, Shandong 250100 People’s Republic of China
| | - Chun Liu
- The Key Laboratory of Plant Cell Engineering and Germplasm Innovation, Ministry of Education, School of Life Science, Shandong University, 27 Shandanan Road, Jinan, Shandong 250100 People’s Republic of China
| | - Zhen Xiao
- The Key Laboratory of Plant Cell Engineering and Germplasm Innovation, Ministry of Education, School of Life Science, Shandong University, 27 Shandanan Road, Jinan, Shandong 250100 People’s Republic of China
| | - Xiaoping Wang
- The Key Laboratory of Plant Cell Engineering and Germplasm Innovation, Ministry of Education, School of Life Science, Shandong University, 27 Shandanan Road, Jinan, Shandong 250100 People’s Republic of China
| | - Yanxia Wang
- Shijiazhuang Academy of Agriculture and Forestry Sciences, Shijiazhuang, 050041 China
| | - Guangmin Xia
- The Key Laboratory of Plant Cell Engineering and Germplasm Innovation, Ministry of Education, School of Life Science, Shandong University, 27 Shandanan Road, Jinan, Shandong 250100 People’s Republic of China
| |
Collapse
|
11
|
Global analysis of mutations driving microevolution of a heterozygous diploid fungal pathogen. Proc Natl Acad Sci U S A 2018; 115:E8688-E8697. [PMID: 30150418 PMCID: PMC6140516 DOI: 10.1073/pnas.1806002115] [Citation(s) in RCA: 84] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Candida albicans is a heterozygous diploid yeast that is a commensal of the human gastrointestinal tract and a prevalent opportunistic pathogen. Here, whole-genome sequencing was performed on multiple C. albicans isolates passaged both in vitro and in vivo to characterize the complete spectrum of mutations arising in laboratory culture and in the mammalian host. We establish that, independent of culture niche, microevolution is primarily driven by de novo base substitutions and frequent short-tract loss-of-heterozygosity events. An average base-substitution rate of ∼1.2 × 10-10 per base pair per generation was observed in vitro, with higher rates inferred during host infection. Large-scale chromosomal changes were relatively rare, although chromosome 7 trisomies frequently emerged during passaging in a gastrointestinal model and was associated with increased fitness for this niche. Multiple chromosomal features impacted mutational patterns, with mutation rates elevated in repetitive regions, subtelomeric regions, and in gene families encoding cell surface proteins involved in host adhesion. Strikingly, de novo mutation rates were more than 800-fold higher in regions immediately adjacent to emergent loss-of-heterozygosity tracts, indicative of recombination-induced mutagenesis. Furthermore, genomes showed biased patterns of mutations suggestive of extensive purifying selection during passaging. These results reveal how both cell-intrinsic and cell-extrinsic factors influence C. albicans microevolution, and provide a quantitative picture of genome dynamics in this heterozygous diploid species.
Collapse
|
12
|
Guo C, Du J, Wang L, Yang S, Mauricio R, Tian D, Gu T. Insertions/Deletions-Associated Nucleotide Polymorphism in Arabidopsis thaliana. FRONTIERS IN PLANT SCIENCE 2016; 7:1792. [PMID: 27965694 PMCID: PMC5127803 DOI: 10.3389/fpls.2016.01792] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2016] [Accepted: 11/15/2016] [Indexed: 06/06/2023]
Abstract
Although high levels of within-species variation are commonly observed, a general mechanism for the origin of such variation is still lacking. Insertions and deletions (indels) are a widespread feature of genomes and we hypothesize that there might be an association between indels and patterns of nucleotide polymorphism. Here, we investigate flanking sequences around 18 indels (>100 bp) among a large number of accessions of the plant, Arabidopsis thaliana. We found two distinct haplotypes, i.e., a nucleotide dimorphism, present around each of these indels and dimorphic haplotypes always corresponded to the indel-present/-absent patterns. In addition, the peaks of nucleotide diversity between the two divergent alleles were closely associated with these indels. Thus, there exists a close association between indels and dimorphisms. Further analysis suggests that indel-associated substitutions could be an important component of genetic variation shaping nucleotide polymorphism in Arabidopsis. Finally, we suggest a mechanism by which indels might generate these highly divergent haplotypes. This study provides evidence that nucleotide dimorphisms, which are frequently regarded as evidence of frequency-dependent selection, could be explained simply by structural variation in the genome.
Collapse
Affiliation(s)
- Changjiang Guo
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing UniversityNanjing, China
| | - Jianchang Du
- Provincial Key Laboratory of Agrobiology, Institute of Biotechnology, Jiangsu Academy of Agricultural SciencesNanjing, China
| | - Long Wang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing UniversityNanjing, China
| | - Sihai Yang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing UniversityNanjing, China
| | - Rodney Mauricio
- Department of Genetics, University of GeorgiaAthens, GA, USA
| | - Dacheng Tian
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing UniversityNanjing, China
| | - Tingting Gu
- State Key Laboratory of Plant Genetics and Germplasm Enhancement and College of Horticulture, Nanjing Agricultural UniversityNanjing, China
| |
Collapse
|
13
|
Lochovsky L, Zhang J, Fu Y, Khurana E, Gerstein M. LARVA: an integrative framework for large-scale analysis of recurrent variants in noncoding annotations. Nucleic Acids Res 2015; 43:8123-34. [PMID: 26304545 PMCID: PMC4787796 DOI: 10.1093/nar/gkv803] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2015] [Accepted: 07/28/2015] [Indexed: 01/22/2023] Open
Abstract
In cancer research, background models for mutation rates have been extensively calibrated in coding regions, leading to the identification of many driver genes, recurrently mutated more than expected. Noncoding regions are also associated with disease; however, background models for them have not been investigated in as much detail. This is partially due to limited noncoding functional annotation. Also, great mutation heterogeneity and potential correlations between neighboring sites give rise to substantial overdispersion in mutation count, resulting in problematic background rate estimation. Here, we address these issues with a new computational framework called LARVA. It integrates variants with a comprehensive set of noncoding functional elements, modeling the mutation counts of the elements with a β-binomial distribution to handle overdispersion. LARVA, moreover, uses regional genomic features such as replication timing to better estimate local mutation rates and mutational hotspots. We demonstrate LARVA's effectiveness on 760 whole-genome tumor sequences, showing that it identifies well-known noncoding drivers, such as mutations in the TERT promoter. Furthermore, LARVA highlights several novel highly mutated regulatory sites that could potentially be noncoding drivers. We make LARVA available as a software tool and release our highly mutated annotations as an online resource (larva.gersteinlab.org).
Collapse
Affiliation(s)
- Lucas Lochovsky
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
| | - Jing Zhang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
| | - Yao Fu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
| | - Ekta Khurana
- Institute for Computational Biomedicine, Weill Cornell Medical College, New York, NY 10065, USA Department of Physiology and Biophysics, Weill Cornell Medical College, New York, New York 10065
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA Department of Computer Science, Yale University, New Haven, CT 06520, USA
| |
Collapse
|
14
|
Yang S, Wang L, Huang J, Zhang X, Yuan Y, Chen JQ, Hurst LD, Tian D. Parent–progeny sequencing indicates higher mutation rates in heterozygotes. Nature 2015; 523:463-7. [DOI: 10.1038/nature14649] [Citation(s) in RCA: 134] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2014] [Accepted: 06/11/2015] [Indexed: 12/15/2022]
|
15
|
Jovelin R, Cutter AD. Fine-scale signatures of molecular evolution reconcile models of indel-associated mutation. Genome Biol Evol 2013; 5:978-86. [PMID: 23558593 PMCID: PMC3673634 DOI: 10.1093/gbe/evt051] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Genomic structural alterations that vary within species, known as large copy number variants, represent an unanticipated and abundant source of genetic diversity that associates with variation in gene expression and susceptibility to disease. Even short insertions and deletions (indels) can exert important effects on genomes by locally increasing the mutation rate, with multiple mechanisms proposed to account for this pattern. To better understand how indels promote genome evolution, we demonstrate that the single nucleotide mutation rate is elevated in the vicinity of indels, with a resolution of tens of base pairs, for the two closely related nematode species Caenorhabditis remanei and C. sp. 23. In addition to indels being clustered with single nucleotide polymorphisms and fixed differences, we also show that transversion mutations are enriched in sequences that flank indels and that many indels associate with sequence repeats. These observations are compatible with a model that reconciles previously proposed mechanisms of indel-associated mutagenesis, implicating repeat sequences as a common driver of indel errors, which then recruit error-prone polymerases during DNA repair, resulting in a locally elevated single nucleotide mutation rate. The striking influence of indel variants on the molecular evolution of flanking sequences strengthens the emerging general view that mutations can induce further mutations.
Collapse
Affiliation(s)
- Richard Jovelin
- Department of Ecology and Evolutionary Biology, University of Toronto, Ontario, Canada.
| | | |
Collapse
|
16
|
The evolutionary genetics of the genes underlying phenotypic associations for loblolly pine (Pinus taeda, Pinaceae). Genetics 2013; 195:1353-72. [PMID: 24121773 DOI: 10.1534/genetics.113.157198] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
A primary goal of evolutionary genetics is to discover and explain the genetic basis of fitness-related traits and how this genetic basis evolves within natural populations. Unprecedented technological advances have fueled the discovery of genetic variants associated with ecologically relevant phenotypes in many different life forms, as well as the ability to scan genomes for deviations from selectively neutral models of evolution. Theoretically, the degree of overlap between lists of genomic regions identified using each approach is related to the genetic architecture of fitness-related traits and the strength and type of natural selection molding variation at these traits within natural populations. Here we address for the first time in a plant the degree of overlap between these lists, using patterns of nucleotide diversity and divergence for >7000 unique amplicons described from the extensive expressed sequence tag libraries generated for loblolly pine (Pinus taeda L.) in combination with the >1000 published genetic associations. We show that loci associated with phenotypic traits are distinct with regard to neutral expectations. Phenotypes measured at the whole plant level (e.g., disease resistance) exhibit an approximately twofold increase in the proportion of adaptive nonsynonymous substitutions over the genome-wide average. As expected for polygenic traits, these signals were apparent only when loci were considered at the level of functional sets. The ramifications of this result are discussed in light of the continued efforts to dissect the genetic basis of quantitative traits.
Collapse
|
17
|
Subramaniam S, Wang X, Freeling M, Pires JC. The fate of Arabidopsis thaliana homeologous CNSs and their motifs in the Paleohexaploid Brassica rapa. Genome Biol Evol 2013; 5:646-60. [PMID: 23493633 PMCID: PMC3641636 DOI: 10.1093/gbe/evt035] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Following polyploidy, duplicate genes are often deleted, and if they are not, then duplicate regulatory regions are sometimes lost. By what mechanism is this loss and what is the chance that such a loss removes function? To explore these questions, we followed individual Arabidopsis thaliana–A. thaliana conserved noncoding sequences (CNSs) into the Brassica ancestor, through a paleohexaploidy and into Brassica rapa. Thus, a single Brassicaceae CNS has six potential orthologous positions in B. rapa; a single Arabidopsis CNS has three potential homeologous positions. We reasoned that a CNS, if present on a singlet Brassica gene, would be unlikely to lose function compared with a more redundant CNS, and this is the case. Redundant CNSs go nondetectable often. Using this logic, each mechanism of CNS loss was assigned a metric of functionality. By definition, proved deletions do not function as sequence. Our results indicated that CNSs that go nondetectable by base substitution or large insertion are almost certainly still functional (redundancy does not matter much to their detectability frequency), whereas those lost by inferred deletion or indels are approximately 75% likely to be nonfunctional. Overall, an average nondetectable, once-redundant CNS more than 30 bp in length has a 72% chance of being nonfunctional, and that makes sense because 97% of them sort to a molecular mechanism with “deletion” in its description, but base substitutions do cause loss. Similarly, proved-functional G-boxes go undetectable by deletion 82% of the time. Fractionation mutagenesis is a procedure that uses polyploidy as a mutagenic agent to genetically alter RNA expression profiles, and then to construct testable hypotheses as to the function of the lost regulatory site. We show fractionation mutagenesis to be a “deletion machine” in the Brassica lineage.
Collapse
|
18
|
Cutter AD, Jovelin R, Dey A. Molecular hyperdiversity and evolution in very large populations. Mol Ecol 2013; 22:2074-95. [PMID: 23506466 PMCID: PMC4065115 DOI: 10.1111/mec.12281] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2012] [Revised: 01/24/2013] [Accepted: 01/29/2013] [Indexed: 02/06/2023]
Abstract
The genomic density of sequence polymorphisms critically affects the sensitivity of inferences about ongoing sequence evolution, function and demographic history. Most animal and plant genomes have relatively low densities of polymorphisms, but some species are hyperdiverse with neutral nucleotide heterozygosity exceeding 5%. Eukaryotes with extremely large populations, mimicking bacterial and viral populations, present novel opportunities for studying molecular evolution in sexually reproducing taxa with complex development. In particular, hyperdiverse species can help answer controversial questions about the evolution of genome complexity, the limits of natural selection, modes of adaptation and subtleties of the mutation process. However, such systems have some inherent complications and here we identify topics in need of theoretical developments. Close relatives of the model organisms Caenorhabditis elegans and Drosophila melanogaster provide known examples of hyperdiverse eukaryotes, encouraging functional dissection of resulting molecular evolutionary patterns. We recommend how best to exploit hyperdiverse populations for analysis, for example, in quantifying the impact of noncrossover recombination in genomes and for determining the identity and micro-evolutionary selective pressures on noncoding regulatory elements.
Collapse
Affiliation(s)
- Asher D Cutter
- Department of Ecology & Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada.
| | | | | |
Collapse
|
19
|
Turco G, Schnable JC, Pedersen B, Freeling M. Automated conserved non-coding sequence (CNS) discovery reveals differences in gene content and promoter evolution among grasses. FRONTIERS IN PLANT SCIENCE 2013; 4:170. [PMID: 23874343 PMCID: PMC3708275 DOI: 10.3389/fpls.2013.00170] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2013] [Accepted: 05/13/2013] [Indexed: 05/07/2023]
Abstract
Conserved non-coding sequences (CNS) are islands of non-coding sequence that, like protein coding exons, show less divergence in sequence between related species than functionless DNA. Several CNSs have been demonstrated experimentally to function as cis-regulatory regions. However, the specific functions of most CNSs remain unknown. Previous searches for CNS in plants have either anchored on exons and only identified nearby sequences or required years of painstaking manual annotation. Here we present an open source tool that can accurately identify CNSs between any two related species with sequenced genomes, including both those immediately adjacent to exons and distal sequences separated by >12 kb of non-coding sequence. We have used this tool to characterize new motifs, associate CNSs with additional functions, and identify previously undetected genes encoding RNA and protein in the genomes of five grass species. We provide a list of 15,363 orthologous CNSs conserved across all grasses tested. We were also able to identify regulatory sequences present in the common ancestor of grasses that have been lost in one or more extant grass lineages. Lists of orthologous gene pairs and associated CNSs are provided for reference inbred lines of arabidopsis, Japonica rice, foxtail millet, sorghum, brachypodium, and maize.
Collapse
Affiliation(s)
| | - James C. Schnable
- *Correspondence: James C. Schnable and Michael Freeling, Department of Plant and Microbial Biology, University of California, 111 Koshland Hall, Berkeley, CA 94720, USA e-mail: ;
| | | | - Michael Freeling
- *Correspondence: James C. Schnable and Michael Freeling, Department of Plant and Microbial Biology, University of California, 111 Koshland Hall, Berkeley, CA 94720, USA e-mail: ;
| |
Collapse
|
20
|
Abstract
A major current molecular evolution challenge is to link comparative genomic patterns to species' biology and ecology. Breeding systems are pivotal because they affect many population genetic processes, and thus genome evolution. We review theoretical predictions and empirical evidence about molecular evolutionary processes under three distinct breeding systems-outcrossing, selfing, and asexuality. Breeding systems may have a profound impact on genome evolution, including molecular evolutionary rates, base composition, genomic conflict, and possibly genome size. However, while asexual species essentially conform to theoretical predictions, the situation is less simple in selfing species. We discuss the possible reasons to potentially explain this paradox. In reverse, comparative and population genomic data and approaches help revisiting old questions on the long-term evolution of breeding systems.
Collapse
|
21
|
Oliveira HR, Campana MG, Jones H, Hunt HV, Leigh F, Redhouse DI, Lister DL, Jones MK. Tetraploid wheat landraces in the Mediterranean basin: taxonomy, evolution and genetic diversity. PLoS One 2012; 7:e37063. [PMID: 22615891 PMCID: PMC3353906 DOI: 10.1371/journal.pone.0037063] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2012] [Accepted: 04/13/2012] [Indexed: 11/19/2022] Open
Abstract
The geographic distribution of genetic diversity and the population structure of tetraploid wheat landraces in the Mediterranean basin has received relatively little attention. This is complicated by the lack of consensus concerning the taxonomy of tetraploid wheats and by unresolved questions regarding the domestication and spread of naked wheats. These knowledge gaps hinder crop diversity conservation efforts and plant breeding programmes. We investigated genetic diversity and population structure in tetraploid wheats (wild emmer, emmer, rivet and durum) using nuclear and chloroplast simple sequence repeats, functional variations and insertion site-based polymorphisms. Emmer and wild emmer constitute a genetically distinct population from durum and rivet, the latter seeming to share a common gene pool. Our population structure and genetic diversity data suggest a dynamic history of introduction and extinction of genotypes in the Mediterranean fields.
Collapse
Affiliation(s)
- Hugo R Oliveira
- Department of Archaeology, University of Cambridge, Cambridge, United Kingdom.
| | | | | | | | | | | | | | | |
Collapse
|
22
|
Guo B, Zou M, Wagner A. Pervasive indels and their evolutionary dynamics after the fish-specific genome duplication. Mol Biol Evol 2012; 29:3005-22. [PMID: 22490820 DOI: 10.1093/molbev/mss108] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Insertions and deletions (indels) in protein-coding genes are important sources of genetic variation. Their role in creating new proteins may be especially important after gene duplication. However, little is known about how indels affect the divergence of duplicate genes. We here study thousands of duplicate genes in five fish (teleost) species with completely sequenced genomes. The ancestor of these species has been subject to a fish-specific genome duplication (FSGD) event that occurred approximately 350 Ma. We find that duplicate genes contain at least 25% more indels than single-copy genes. These indels accumulated preferentially in the first 40 my after the FSGD. A lack of widespread asymmetric indel accumulation indicates that both members of a duplicate gene pair typically experience relaxed selection. Strikingly, we observe a 30-80% excess of deletions over insertions that is consistent for indels of various lengths and across the five genomes. We also find that indels preferentially accumulate inside loop regions of protein secondary structure and in regions where amino acids are exposed to solvent. We show that duplicate genes with high indel density also show high DNA sequence divergence. Indel density, but not amino acid divergence, can explain a large proportion of the tertiary structure divergence between proteins encoded by duplicate genes. Our observations are consistent across all five fish species. Taken together, they suggest a general pattern of duplicate gene evolution in which indels are important driving forces of evolutionary change.
Collapse
Affiliation(s)
- Baocheng Guo
- Institute of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
| | | | | |
Collapse
|
23
|
Nakayama SI, Shi S, Tateno M, Shimada M, Takahasi KR. Mutation accumulation in a selfing population: consequences of different mutation rates between selfers and outcrossers. PLoS One 2012; 7:e33541. [PMID: 22448251 PMCID: PMC3308984 DOI: 10.1371/journal.pone.0033541] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2011] [Accepted: 02/10/2012] [Indexed: 11/18/2022] Open
Abstract
Currently existing theories predict that because deleterious mutations accumulate at a higher rate, selfing populations suffer from more intense genetic degradation relative to outcrossing populations. This prediction may not always be true when we consider a potential difference in deleterious mutation rate between selfers and outcrossers. By analyzing the evolutionary stability of selfing and outcrossing in an infinite population, we found that the genome-wide deleterious mutation rate would be lower in selfing than in outcrossing organisms. When this difference in mutation rate was included in simulations, we found that in a small population, mutations accumulated more slowly under selfing rather than outcrossing. This result suggests that under frequent and intense bottlenecks, a selfing population may have a lower risk of genetic extinction than an outcrossing population.
Collapse
Affiliation(s)
- Shin-ichiro Nakayama
- Nikko Botanical Gardens, Graduate School of Science, The University of Tokyo, Tokyo, Japan.
| | | | | | | | | |
Collapse
|
24
|
Gaut B, Yang L, Takuno S, Eguiarte LE. The Patterns and Causes of Variation in Plant Nucleotide Substitution Rates. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2011. [DOI: 10.1146/annurev-ecolsys-102710-145119] [Citation(s) in RCA: 114] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Brandon Gaut
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697; , ,
| | - Liang Yang
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697; , ,
| | - Shohei Takuno
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697; , ,
| | - Luis E. Eguiarte
- Instituto de Ecología, Universidad Nacional Autónoma de México, CP 04510 Mexico City, Mexico;
| |
Collapse
|
25
|
Kitamura K, Kawahara T. Estimation of outcrossing rates at small-scale flowering sites of the dwarf bamboo species, Sasa cernua. JOURNAL OF PLANT RESEARCH 2011; 124:683-8. [PMID: 21188457 DOI: 10.1007/s10265-010-0398-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2010] [Accepted: 11/27/2010] [Indexed: 05/16/2023]
Abstract
We estimated the outcrossing rates at small-scale flowering sites of an endemic dwarf bamboo species, Sasa cernua. The multi-locus estimation of the outcrossing rate of the dwarf bamboo population was 0.148 (SD 0.118). Two culms with the highest outcrossing rates had heterozygous genotypes at one locus, but other culms in the locus were homozygotes. Five culms with high outcrossing rates bore 2-17% seeds with homozygous genotypes. Due to predominant selfing, the overall inbreeding coefficient of seeds was high, although it declined in seedlings. This suggests that selection against inbred progenies began early in the establishment process in the natural habitat.
Collapse
Affiliation(s)
- Keiko Kitamura
- Forest Dynamics and Diversity Group, Hokkaido Research Center, Forestry and Forest Products Research Institute, 7 Hitsujigaoka, Toyohira-ku, Sapporo 062-8516, Japan.
| | | |
Collapse
|
26
|
Abstract
It has been known for many years that the mutation rate varies across the genome. However, only with the advent of large genomic data sets is the full extent of this variation becoming apparent. The mutation rate varies over many different scales, from adjacent sites to whole chromosomes, with the strongest variation seen at the smallest scales. Some of these patterns have clear mechanistic bases, but much of the rate variation remains unexplained, and some of it is deeply perplexing. Variation in the mutation rate has important implications in evolutionary biology and underexplored implications for our understanding of hereditary disease and cancer.
Collapse
|
27
|
St Onge KR, Källman T, Slotte T, Lascoux M, Palmé AE. Contrasting demographic history and population structure in Capsella rubella and Capsella grandiflora, two closely related species with different mating systems. Mol Ecol 2011; 20:3306-20. [PMID: 21777317 DOI: 10.1111/j.1365-294x.2011.05189.x] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Both mating system and population history can have large impacts on genetic diversity and population structure. Here, we use multilocus sequence data to investigate how these factors impact two closely related Brassicaceae species: the selfing Capsella rubella and the outcrossing C. grandiflora. To do this, we have sequenced 16 loci in approximately 70 individuals from 7 populations of each species. Patterns of population structure differ strongly between the two species. In C. grandiflora, we observe an isolation-by-distance pattern and identify three clearly delineated genetic groups. In C. rubella, where we estimate the selfing rate to be 0.90-0.94, the pattern is less clear with some sampling populations forming separate genetic clusters while others are highly mixed. The two species also have divergent histories. Our analysis gives support for a bottleneck approximately 73 kya (20-139 kya) in C. rubella, which most likely represents speciation from C. grandiflora. In C. grandiflora, there is moderate support for the standard neutral model in 2 of 3 genetic clusters, while the third cluster and the total data set show evidence of expansion. It is clear that mating system has an impact on these two species, for example affecting the level of genetic variation and the genetic structure. However, our results also clearly show that a combination of past and present processes, some of which are not affected by mating system, is needed to explain the differences between C. rubella and C. grandiflora.
Collapse
Affiliation(s)
- Kate R St Onge
- Department of Plant Ecology and Evolution, Uppsala University, Norbyvägen 18D, 752 36 Uppsala, Sweden
| | | | | | | | | |
Collapse
|
28
|
Abstract
Sequence alignment (the grouping of homologous bases into one column) is fundamental to almost any task in comparative genomics. This translates to positing gaps in the genomic sequences to account for events of insertions and deletions (indels). The interrelationship between sequence alignment and phylogenetic reconstruction has drawn substantial attention recently with works showing the significance of differences in alignments. One of the plausible approaches in this direction is to grade the suitability of a tree to an associated alignment and vice verse. We here present a combinatorial (as opposed to statistical) approach based on the indel history. We show--both by simulations and by using real biological data from the Encyclopedia of DNA Elements (ENCODE)--that this criterion is sound. The novelty of our approach is the distinguishing between insertions and deletions, and augmenting the analysis with a dimension of "depth," extending it from the sequence space to the phylogenetic space. Using this approach, we perform a comprehensive study of indel characteristic behavior among mammals in both coding and non-coding regions. Our results show significant differences in indel patterns between coding and non-coding regions. We also show other characteristic patterns of indel evolution in the depth of the underlying phylogeny.
Collapse
Affiliation(s)
- Sagi Snir
- Department of Evolutionary Biology and the Institute of Evolution, Haifa University, Haifa, Israel.
| | | |
Collapse
|
29
|
McDonald MJ, Wang WC, Huang HD, Leu JY. Clusters of nucleotide substitutions and insertion/deletion mutations are associated with repeat sequences. PLoS Biol 2011; 9:e1000622. [PMID: 21697975 PMCID: PMC3114760 DOI: 10.1371/journal.pbio.1000622] [Citation(s) in RCA: 97] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2010] [Accepted: 04/22/2011] [Indexed: 12/24/2022] Open
Abstract
The genome-sequencing gold rush has facilitated the use of comparative genomics to uncover patterns of genome evolution, although their causal mechanisms remain elusive. One such trend, ubiquitous to prokarya and eukarya, is the association of insertion/deletion mutations (indels) with increases in the nucleotide substitution rate extending over hundreds of base pairs. The prevailing hypothesis is that indels are themselves mutagenic agents. Here, we employ population genomics data from Escherichia coli, Saccharomyces paradoxus, and Drosophila to provide evidence suggesting that it is not the indels per se but the sequence in which indels occur that causes the accumulation of nucleotide substitutions. We found that about two-thirds of indels are closely associated with repeat sequences and that repeat sequence abundance could be used to identify regions of elevated sequence diversity, independently of indels. Moreover, the mutational signature of indel-proximal nucleotide substitutions matches that of error-prone DNA polymerases. We propose that repeat sequences promote an increased probability of replication fork arrest, causing the persistent recruitment of error-prone DNA polymerases to specific sequence regions over evolutionary time scales. Experimental measures of the mutation rates of engineered DNA sequences and analyses of experimentally obtained collections of spontaneous mutations provide molecular evidence supporting our hypothesis. This study uncovers a new role for repeat sequences in genome evolution and provides an explanation of how fine-scale sequence contextual effects influence mutation rates and thereby evolution.
Collapse
|
30
|
Abstract
Uncovering general principles of genome evolution that are time-invariant and that operate in germ and somatic cells has implications for genome-wide association studies (GWAS), gene therapy, and disease genomics. Here we investigate the relationship between structural alterations (e.g., insertions and deletions) and single-nucleotide substitutions by comparing the following genomes that diverged at different times across germ- and somatic-cell lineages: (i) the reference human and chimpanzee genome (in million years), (ii) the reference human and personal genomes (in tens of thousands of years), and (iii) structurally altered regions in cancer and genetically engineered cells (in days). At the species level, genes with structural alteration in nearby regions show increased single-nucleotide changes and tend to evolve faster. In personal genomes, the single-nucleotide substitution rate is higher near sites of structural alteration and decreases with increasing distance. In human cancer cell populations and in cells genetically engineered using zinc-finger nucleases, single-nucleotide changes occur frequently near sites of structural alterations. We present evidence that structural alteration induces single-nucleotide changes in nearby regions and discuss possible molecular mechanisms that contribute to this phenomenon. We propose that the low fidelity of nonreplicative error-prone repair polymerases, which are used during insertion or deletion, result in break-repair-induced single-nucleotide mutations in the vicinity of structural alteration. Thus, in the mutational landscape, structural alterations are linked to single-nucleotide changes across different time scales in both somatic- and germ-cell lineages. We discuss implications for genome evolution, GWAS, disease genomics, and gene therapy and emphasize the need to investigate both types of mutations within a single framework.
Collapse
|