1
|
Chávez Montes RA, Ulloa M, Biniashvili T, Zackay A, Kfir N, Lopez-Arredondo D, Herrera-Estrella L. Assembly and annotation of the Gossypium barbadense L. 'Pima-S6' genome raise questions about the chromosome structure and gene content of Gossypium barbadense genomes. BMC Genomics 2023; 24:11. [PMID: 36627552 PMCID: PMC9830710 DOI: 10.1186/s12864-022-09102-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 12/28/2022] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Gossypium barbadense L. Pima cotton is known for its resistance to Fusarium wilt and for producing fibers of superior quality highly prized in the textile market. We report a high-quality genome assembly and annotation of Pima-S6 cotton and its comparison at the chromosome and protein level to other ten Gossypium published genome assemblies. RESULTS Synteny and orthogroup analyses revealed important differences on chromosome structure and annotated proteins content between our Pima-S6 and other publicly available G. barbadense assemblies, and across Gossypium assemblies in general. Detailed synteny analyses revealed chromosomal rearrangements between Pima-S6 and other Pima genomes on several chromosomes, with three major inversions in chromosomes A09, A13 and D05, raising questions about the true chromosome structure of Gossypium barbadense genomes. CONCLUSION Analyses of the re-assembled and re-annotated genome of the close relative G. barbadense Pima 3-79 using our Pima-S6 assembly suggest that contig placement of some recent G. barbadense assemblies might have been unduly influenced by the use of the G. hirsutum TM-1 genome as the anchoring reference. The Pima-S6 reference genome provides a valuable genomic resource and offers new insights on genomic structure, and can serve as G. barbadense genome reference for future assemblies and further support FOV4-related studies and breeding efforts.
Collapse
Affiliation(s)
- Ricardo A. Chávez Montes
- grid.264784.b0000 0001 2186 7496Institute of Genomics for Crop Abiotic Stress Tolerance, Plant and Soil Science Department, Texas Tech University, 79409 Lubbock, TX USA
| | - Mauricio Ulloa
- grid.508981.dPlant Stress and Germplasm Development Research, USDA-ARS, PA, CSRL, 79415 Lubbock, TX USA
| | | | | | | | - Damar Lopez-Arredondo
- grid.264784.b0000 0001 2186 7496Institute of Genomics for Crop Abiotic Stress Tolerance, Plant and Soil Science Department, Texas Tech University, 79409 Lubbock, TX USA
| | - Luis Herrera-Estrella
- grid.264784.b0000 0001 2186 7496Institute of Genomics for Crop Abiotic Stress Tolerance, Plant and Soil Science Department, Texas Tech University, 79409 Lubbock, TX USA ,grid.512574.0 Unidad de Genómica Avanzada/Langebio, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional, 36821 Irapuato, Mexico
| |
Collapse
|
2
|
Zhou T, Wang N, Wang Y, Zhang XL, Li BG, Li W, Su JJ, Wang CX, Zhang A, Ma XF, Li ZH. Nucleotide Evolution, Domestication Selection, and Genetic Relationships of Chloroplast Genomes in the Economically Important Crop Genus Gossypium. FRONTIERS IN PLANT SCIENCE 2022; 13:873788. [PMID: 35498673 PMCID: PMC9051515 DOI: 10.3389/fpls.2022.873788] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Accepted: 03/24/2022] [Indexed: 06/14/2023]
Abstract
Gossypium hirsutum (upland cotton) is one of the most economically important crops worldwide, which has experienced the long terms of evolution and domestication process from wild species to cultivated accessions. However, nucleotide evolution, domestication selection, and the genetic relationship of cotton species remain largely to be studied. In this study, we used chloroplast genome sequences to determine the evolutionary rate, domestication selection, and genetic relationships of 72 cotton genotypes (36 cultivated cotton accessions, seven semi-wild races of G. hirsutum, and 29 wild species). Evolutionary analysis showed that the cultivated tetraploid cotton genotypes clustered into a single clade, which also formed a larger lineage with the semi-wild races. Substitution rate analysis demonstrated that the rates of nucleotide substitution and indel variation were higher for the wild species than the semi-wild and cultivated tetraploid lineages. Selection pressure analysis showed that the wild species might have experienced greater selection pressure, whereas the cultivated cotton genotypes underwent artificial and domestication selection. Population clustering analysis indicated that the cultivated cotton accessions and semi-wild races have existed the obviously genetic differentiation. The nucleotide diversity was higher in the semi-wild races compared with the cultivated genotypes. In addition, genetic introgression and gene flow occurred between the cultivated tetraploid cotton and semi-wild genotypes, but mainly via historical rather than contemporary gene flow. These results provide novel molecular mechanisms insights into the evolution and domestication of economically important crop cotton species.
Collapse
Affiliation(s)
- Tong Zhou
- Shaanxi Key Laboratory for Animal Conservation, Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi’an, China
| | - Ning Wang
- Shaanxi Key Laboratory for Animal Conservation, Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi’an, China
| | - Yuan Wang
- Shaanxi Key Laboratory for Animal Conservation, Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi’an, China
| | - Xian-Liang Zhang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Bao-Guo Li
- Shaanxi Key Laboratory for Animal Conservation, Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi’an, China
| | - Wei Li
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Jun-Ji Su
- Gansu Provincial Key Laboratory of Aridland Crop Science, College of Life Science and Technology, Gansu Agricultural University, Lanzhou, China
| | - Cai-Xiang Wang
- Gansu Provincial Key Laboratory of Aridland Crop Science, College of Life Science and Technology, Gansu Agricultural University, Lanzhou, China
| | - Ai Zhang
- Gansu Provincial Key Laboratory of Aridland Crop Science, College of Life Science and Technology, Gansu Agricultural University, Lanzhou, China
| | - Xiong-Feng Ma
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Zhong-Hu Li
- Shaanxi Key Laboratory for Animal Conservation, Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi’an, China
| |
Collapse
|
3
|
Enhancing Upland cotton for drought resilience, productivity, and fiber quality: comparative evaluation and genetic dissection. Mol Genet Genomics 2019; 295:155-176. [PMID: 31620883 DOI: 10.1007/s00438-019-01611-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Accepted: 09/22/2019] [Indexed: 01/09/2023]
Abstract
To provision the world sustainably, modern society must increase overall crop production, while conserving and preserving natural resources. Producing more with diminishing water resources is an especially daunting endeavor. Toward the goal of genetically improving drought resilience of cultivated Upland cotton (Gossypium hirsutum L.), this study addresses the genetics of differential yield components referred to as productivity and fiber quality traits under regular-water versus low-water (LW) field conditions. We used ten traits to assess water stress deficit, which included six productivity and four fiber quality traits on two recombinant inbred line (RIL) populations from reciprocally crossed cultivars, Phytogen 72 and Stoneville 474. To facilitate genetic inferences, we genotyped RILs with the CottonSNP63K array, assembled high-density linkage maps of over 7000 SNPs and then analyzed quantitative trait variations. Analysis of variance revealed significant differences for all traits (p < 0.05) in these RIL populations. Although the LW irrigation regime significantly reduced all traits, except lint percent, the RILs exhibited a broad phenotypic spectrum of heritable differences across the water regimes. Transgressive segregation occurred among the RILs, suggesting the possibility of genetic gain through phenotypic selection for drought resilience and perhaps through marker-based selection. Analyses revealed more than 150 quantitative trait loci (QTLs) associated with productivity and fiber quality traits (p < 0.005) on different genomic regions of the cotton genome. The multiple-QTL models analysis with LOD > 3.0 detected 21 QTLs associated with productivity and 22 QTLs associated with fiber quality. For fiber traits, strong clustering and QTL associations occurred in c08 and its homolog c24 as well as c10, c14, and c21. Using contemporary genome sequence assemblies and bioinformatically related information, the identification of genomic regions associated with responses to plant stress/drought elevates the possibility of using marker-assisted and omics-based selection to enhance breeding for drought resilient cultivars and identifying candidate genes and networks. RILs with different responses to drought indicated that it is possible to maintain high fiber quality under LW conditions or reduce the of LW impact on quality. The heritable variation among elite bi-parental RILs for productivity and quality under field drought conditions, and their association of QTLs, and thus specific genomic regions, indicate opportunities for breeding-based gains in water resource conservation, i.e., enhancing cotton's agricultural sustainability.
Collapse
|
4
|
De Novo Genome Sequence Assemblies of Gossypium raimondii and Gossypium turneri. G3-GENES GENOMES GENETICS 2019; 9:3079-3085. [PMID: 31462444 PMCID: PMC6778788 DOI: 10.1534/g3.119.400392] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Cotton is an agriculturally important crop. Because of its importance, a genome sequence of a diploid cotton species (Gossypium raimondii, D-genome) was first assembled using Sanger sequencing data in 2012. Improvements to DNA sequencing technology have improved accuracy and correctness of assembled genome sequences. Here we report a new de novo genome assembly of G. raimondii and its close relative G. turneri. The two genomes were assembled to a chromosome level using PacBio long-read technology, HiC, and Bionano optical mapping. This report corrects some minor assembly errors found in the Sanger assembly of G. raimondii. We also compare the genome sequences of these two species for gene composition, repetitive element composition, and collinearity. Most of the identified structural rearrangements between these two species are due to intra-chromosomal inversions. More inversions were found in the G. turneri genome sequence than the G. raimondii genome sequence. These findings and updates to the D-genome sequence will improve accuracy and translation of genomics to cotton breeding and genetics.
Collapse
|
5
|
Grover CE, Arick MA, Thrash A, Conover JL, Sanders WS, Peterson DG, Frelichowski JE, Scheffler JA, Scheffler BE, Wendel JF. Insights into the Evolution of the New World Diploid Cottons (Gossypium, Subgenus Houzingenia) Based on Genome Sequencing. Genome Biol Evol 2019; 11:53-71. [PMID: 30476109 PMCID: PMC6320677 DOI: 10.1093/gbe/evy256] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/20/2018] [Indexed: 12/24/2022] Open
Abstract
We employed phylogenomic methods to study molecular evolutionary processes and phylogeny in the geographically widely dispersed New World diploid cottons (Gossypium, subg. Houzingenia). Whole genome resequencing data (average of 33× genomic coverage) were generated to reassess the phylogenetic history of the subgenus and provide a temporal framework for its diversification. Phylogenetic analyses indicate that the subgenus likely originated following transoceanic dispersal from Africa about 6.6 Ma, but that nearly all of the biodiversity evolved following rapid diversification in the mid-Pleistocene (0.5-2.0 Ma), with multiple long-distance dispersals required to account for range expansion to Arizona, the Galapagos Islands, and Peru. Comparative analyses of cpDNAversus nuclear data indicate that this history was accompanied by several clear cases of interspecific introgression. Repetitive DNAs contribute roughly half of the total 880 Mb genome, but most transposable element families are relatively old and stable among species. In the genic fraction, pairwise synonymous mutation rates average 1% per Myr, with nonsynonymous changes being about seven times less frequent. Over 1.1 million indels were detected and phylogenetically polarized, revealing a 2-fold bias toward deletions over small insertions. We suggest that this genome down-sizing bias counteracts genome size growth by TE amplification and insertions, and helps explain the relatively small genomes that are restricted to this subgenus. Compared with the rate of nucleotide substitution, the rate of indel occurrence is much lower averaging about 17 nucleotide substitutions per indel event.
Collapse
Affiliation(s)
- Corrinne E Grover
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University
| | - Mark A Arick
- Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University
| | - Adam Thrash
- Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University
| | - Justin L Conover
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University
| | - William S Sanders
- Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University
- Department of Computer Science & Engineering, Mississippi State University
- The Jackson Laboratory, Connecticut
| | - Daniel G Peterson
- Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University
| | | | | | - Brian E Scheffler
- USDA, Genomics and Bioinformatics Research Unit, Stoneville, Mississippi
| | - Jonathan F Wendel
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University
| |
Collapse
|
6
|
Wu Y, Liu F, Yang DG, Li W, Zhou XJ, Pei XY, Liu YG, He KL, Zhang WS, Ren ZY, Zhou KH, Ma XF, Li ZH. Comparative Chloroplast Genomics of Gossypium Species: Insights Into Repeat Sequence Variations and Phylogeny. FRONTIERS IN PLANT SCIENCE 2018; 9:376. [PMID: 29619041 PMCID: PMC5871733 DOI: 10.3389/fpls.2018.00376] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2018] [Accepted: 03/06/2018] [Indexed: 05/10/2023]
Abstract
Cotton is one of the most economically important fiber crop plants worldwide. The genus Gossypium contains a single allotetraploid group (AD) and eight diploid genome groups (A-G and K). However, the evolution of repeat sequences in the chloroplast genomes and the phylogenetic relationships of Gossypium species are unclear. Thus, we determined the variations in the repeat sequences and the evolutionary relationships of 40 cotton chloroplast genomes, which represented the most diverse in the genus, including five newly sequenced diploid species, i.e., G. nandewarense (C1-n), G. armourianum (D2-1), G. lobatum (D7), G. trilobum (D8), and G. schwendimanii (D11), and an important semi-wild race of upland cotton, G. hirsutum race latifolium (AD1). The genome structure, gene order, and GC content of cotton species were similar to those of other higher plant plastid genomes. In total, 2860 long sequence repeats (>10 bp in length) were identified, where the F-genome species had the largest number of repeats (G. longicalyx F1: 108) and E-genome species had the lowest (G. stocksii E1: 53). Large-scale repeat sequences possibly enrich the genetic information and maintain genome stability in cotton species. We also identified 10 divergence hotspot regions, i.e., rpl33-rps18, psbZ-trnG (GCC), rps4-trnT (UGU), trnL (UAG)-rpl32, trnE (UUC)-trnT (GGU), atpE, ndhI, rps2, ycf1, and ndhF, which could be useful molecular genetic markers for future population genetics and phylogenetic studies. Site-specific selection analysis showed that some of the coding sites of 10 chloroplast genes (atpB, atpE, rps2, rps3, petB, petD, ccsA, cemA, ycf1, and rbcL) were under protein sequence evolution. Phylogenetic analysis based on the whole plastomes suggested that the Gossypium species grouped into six previously identified genetic clades. Interestingly, all 13 D-genome species clustered into a strong monophyletic clade. Unexpectedly, the cotton species with C, G, and K-genomes were admixed and nested in a large clade, which could have been due to their recent radiation, incomplete lineage sorting, and introgression hybridization among different cotton lineages. In conclusion, the results of this study provide new insights into the evolution of repeat sequences in chloroplast genomes and interspecific relationships in the genus Gossypium.
Collapse
Affiliation(s)
- Ying Wu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- Key Laboratory of Resource Biology and Biotechnology in Western China, Ministry of Education, College of Life Sciences, Northwest University, Xi’an, China
| | - Fang Liu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Dai-Gang Yang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Wei Li
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Xiao-Jian Zhou
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Xiao-Yu Pei
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Yan-Gai Liu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Kun-Lun He
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Wen-Sheng Zhang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Zhong-Ying Ren
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Ke-Hai Zhou
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Xiong-Feng Ma
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- *Correspondence: Zhong-Hu Li, Xiong-Feng Ma,
| | - Zhong-Hu Li
- Key Laboratory of Resource Biology and Biotechnology in Western China, Ministry of Education, College of Life Sciences, Northwest University, Xi’an, China
- *Correspondence: Zhong-Hu Li, Xiong-Feng Ma,
| |
Collapse
|
7
|
Ulloa M, Hulse-Kemp AM, De Santiago LM, Stelly DM, Burke JJ. Insights Into Upland Cotton ( Gossypium hirsutum L.) Genetic Recombination Based on 3 High-Density Single-Nucleotide Polymorphism and a Consensus Map Developed Independently With Common Parents. GENOMICS INSIGHTS 2017; 10:1178631017735104. [PMID: 29308012 PMCID: PMC5751910 DOI: 10.1177/1178631017735104] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2017] [Accepted: 09/10/2017] [Indexed: 11/17/2022]
Abstract
High-density linkage maps are vital to supporting the correct placement of scaffolds and gene sequences on chromosomes and fundamental to contemporary organismal research and scientific approaches to genetic improvement, especially in paleopolyploids with exceptionally complex genomes, eg, upland cotton (Gossypium hirsutum L., "2n = 52"). Three independently developed intraspecific upland mapping populations were analyzed to generate 3 high-density genetic linkage single-nucleotide polymorphism (SNP) maps and a consensus map using the CottonSNP63K array. The populations consisted of a previously reported F2, a recombinant inbred line (RIL), and reciprocal RIL population, from "Phytogen 72" and "Stoneville 474" cultivars. The cluster file provided 7417 genotyped SNP markers, resulting in 26 linkage groups corresponding to the 26 chromosomes (c) of the allotetraploid upland cotton (AD)1 arisen from the merging of 2 genomes ("A" Old World and "D" New World). Patterns of chromosome-specific recombination were largely consistent across mapping populations. The high-density genetic consensus map included 7244 SNP markers that spanned 3538 cM and comprised 3824 SNP bins, of which 1783 and 2041 were in the At and Dt subgenomes with 1825 and 1713 cM map lengths, respectively. Subgenome average distances were nearly identical, indicating that subgenomic differences in bin number arose due to the high numbers of SNPs on the Dt subgenome. Examination of expected recombination frequency or crossovers (COs) on the chromosomes within each population of the 2 subgenomes revealed that COs were also not affected by the SNPs or SNP bin number in these subgenomes. Comparative alignment analyses identified historical ancestral At-subgenomic translocations of c02 and c03, as well as of c04 and c05. The consensus map SNP sequences aligned with high congruency to the NBI assembly of Gossypium hirsutum. However, the genomic comparisons revealed evidence of additional unconfirmed possible duplications, inversions and translocations, and unbalance SNP sequence homology or SNP sequence/loci genomic dominance, or homeolog loci bias of the upland tetraploid At and Dt subgenomes. The alignments indicated that 364 SNP-associated previously unintegrated scaffolds can be placed in pseudochromosomes of the NBI G hirsutum assembly. This is the first intraspecific SNP genetic linkage consensus map assembled in G hirsutum with a core of reproducible mendelian SNP markers assayed on different populations and it provides further knowledge of chromosome arrangement of genic and nongenic SNPs. Together, the consensus map and RIL populations provide a synergistically useful platform for localizing and identifying agronomically important loci for improvement of the cotton crop.
Collapse
Affiliation(s)
- Mauricio Ulloa
- USDA-ARS, PA, Plant Stress and Germplasm Development Research Unit, Lubbock, TX, USA
| | - Amanda M Hulse-Kemp
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, USA
| | - Luis M De Santiago
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, USA
- Interdisciplinary Program in Genetics, Texas A&M University, College Station, TX, USA
| | - David M Stelly
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, USA
| | - John J Burke
- USDA-ARS, PA, Plant Stress and Germplasm Development Research Unit, Lubbock, TX, USA
| |
Collapse
|
8
|
Chen Z, Grover CE, Li P, Wang Y, Nie H, Zhao Y, Wang M, Liu F, Zhou Z, Wang X, Cai X, Wang K, Wendel JF, Hua J. Molecular evolution of the plastid genome during diversification of the cotton genus. Mol Phylogenet Evol 2017; 112:268-276. [PMID: 28414099 DOI: 10.1016/j.ympev.2017.04.014] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Revised: 04/12/2017] [Accepted: 04/12/2017] [Indexed: 01/08/2023]
Abstract
Cotton (Gossypium spp.) is commonly grouped into eight diploid genomic groups, designated A-G and K, and one tetraploid genomic group, namely AD. To gain insight into the phylogeny of Gossypium and molecular evolution of the chloroplast genome duringdiversification, chloroplast genomes (cpDNA) from 6 D-genome and 2 G-genome species of Gossypium (G. armourianum D2-1, G. harknessii D2-2, G. davidsonii D3-d, G. klotzschianum D3-k, G. aridum D4, G. trilobum D8, and G. australe G2, G. nelsonii G3) were newly reported here. In combination with the 26 previously released cpDNA sequences, we performed comparative phylogenetic analyses of 34 Gossypium chloroplast genomes that collectively represent most of the diversity in the genus. Gossypium chloroplasts span a small range in size that is mostly attributable to indels that occur in the large single copy (LSC) region of the genome. Phylogenetic analysis using a concatenation of all genes provides robust support for six major Gossypium clades, largely supporting earlier inferences but also revealing new information on intrageneric relationships. Using Theobroma cacao as an outgroup, diversification of the genus was dated, yielding results that are in accord with previous estimates of divergence times, but also offering new perspectives on the basal, early radiation of all major clades within the genus as well as gaps in the record indicative of extinctions. Like most higher-plant chloroplast genomes, all cotton species exhibit a conserved quadripartite structure, i.e., two large inverted repeats (IR) containing most of the ribosomal RNA genes, and two unique regions, LSC (large single sequence) and SSC (small single sequence). Within Gossypium, the IR-single copy region junctions are both variable and homoplasious among species. Two genes, accD and psaJ, exhibited greater rates of synonymous and non-synonymous substitutions than did other genes. Most genes exhibited Ka/Ks ratios suggestive of neutral evolution, with 8 exceptions distributed among one to several species. This research provides an overview of the molecular evolution of a single, large non-recombining molecular during the diversification of this important genus.
Collapse
Affiliation(s)
- Zhiwen Chen
- Laboratory of Cotton Genetics, Genomics and Breeding, College of Agronomy and Biotechnology/Key Laboratory of Crop Heterosis and Utilization of Ministry of Education/Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China.
| | - Corrinne E Grover
- Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA 50011, USA.
| | - Pengbo Li
- Laboratory of Cotton Genetics, Genomics and Breeding, College of Agronomy and Biotechnology/Key Laboratory of Crop Heterosis and Utilization of Ministry of Education/Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China.
| | - Yumei Wang
- Institute of Cash Crops, Hubei Academy of Agricultural Sciences, Wuhan 430064, Hubei, China.
| | - Hushuai Nie
- Laboratory of Cotton Genetics, Genomics and Breeding, College of Agronomy and Biotechnology/Key Laboratory of Crop Heterosis and Utilization of Ministry of Education/Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China.
| | - Yanpeng Zhao
- Laboratory of Cotton Genetics, Genomics and Breeding, College of Agronomy and Biotechnology/Key Laboratory of Crop Heterosis and Utilization of Ministry of Education/Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China.
| | - Meiyan Wang
- Laboratory of Cotton Genetics, Genomics and Breeding, College of Agronomy and Biotechnology/Key Laboratory of Crop Heterosis and Utilization of Ministry of Education/Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China.
| | - Fang Liu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang 455000, Henan, China.
| | - Zhongli Zhou
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang 455000, Henan, China.
| | - Xingxing Wang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang 455000, Henan, China.
| | - Xiaoyan Cai
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang 455000, Henan, China.
| | - Kunbo Wang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang 455000, Henan, China.
| | - Jonathan F Wendel
- Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA 50011, USA.
| | - Jinping Hua
- Laboratory of Cotton Genetics, Genomics and Breeding, College of Agronomy and Biotechnology/Key Laboratory of Crop Heterosis and Utilization of Ministry of Education/Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China.
| |
Collapse
|
9
|
Kottapalli P, Ulloa M, Kottapalli KR, Payton P, Burke J. SNP Marker Discovery in Pima Cotton ( Gossypium barbadense L.) Leaf Transcriptomes. GENOMICS INSIGHTS 2016; 9:51-60. [PMID: 27721653 PMCID: PMC5049682 DOI: 10.4137/gei.s40377] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2016] [Revised: 08/22/2016] [Accepted: 08/24/2016] [Indexed: 11/17/2022]
Abstract
The objective of this study was to explore the known narrow genetic diversity and discover single-nucleotide polymorphic (SNP) markers for marker-assisted breeding within Pima cotton (Gossypium barbadense L.) leaf transcriptomes. cDNA from 25-day plants of three diverse cotton genotypes [Pima S6 (PS6), Pima S7 (PS7), and Pima 3-79 (P3-79)] was sequenced on Illumina sequencing platform. A total of 28.9 million reads (average read length of 138 bp) were generated by sequencing cDNA libraries of these three genotypes. The de novo assembly of reads generated transcriptome sets of 26,369 contigs for PS6, 25,870 contigs for PS7, and 24,796 contigs for P3-79. A Pima leaf reference transcriptome was generated consisting of 42,695 contigs. More than 10,000 single-nucleotide polymorphisms (SNPs) were identified between the genotypes, with 100% SNP frequency and a minimum of eight sequencing reads. The most prevalent SNP substitutions were C-T and A-G in these cotton genotypes. The putative SNPs identified can be utilized for characterizing genetic diversity, genotyping, and eventually in Pima cotton breeding through marker-assisted selection.
Collapse
Affiliation(s)
- Pratibha Kottapalli
- Center for Biotechnology and Genomics, Texas Tech University, Lubbock, TX, USA
| | - Mauricio Ulloa
- USDA-ARS, PA, CSRL, Plant Stress and Germplasm Development Research, Lubbock, TX, USA
| | | | - Paxton Payton
- USDA-ARS, PA, CSRL, Plant Stress and Germplasm Development Research, Lubbock, TX, USA
| | - John Burke
- USDA-ARS, PA, CSRL, Plant Stress and Germplasm Development Research, Lubbock, TX, USA
| |
Collapse
|
10
|
Wendel JF, Grover CE. Taxonomy and Evolution of the Cotton Genus, Gossypium. AGRONOMY MONOGRAPHS 2015. [DOI: 10.2134/agronmonogr57.2013.0020] [Citation(s) in RCA: 97] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Affiliation(s)
- Jonathan F. Wendel
- Dep. of Ecology, Evolution and Organismal Biology, Bessey Hall; Iowa State University; Ames IA 50011
| | - Corrinne E. Grover
- Dep. of Ecology, Evolution and Organismal Biology, Bessey Hall; Iowa State University; Ames IA 50011
| |
Collapse
|
11
|
Hulse-Kemp AM, Lemm J, Plieske J, Ashrafi H, Buyyarapu R, Fang DD, Frelichowski J, Giband M, Hague S, Hinze LL, Kochan KJ, Riggs PK, Scheffler JA, Udall JA, Ulloa M, Wang SS, Zhu QH, Bag SK, Bhardwaj A, Burke JJ, Byers RL, Claverie M, Gore MA, Harker DB, Islam MS, Jenkins JN, Jones DC, Lacape JM, Llewellyn DJ, Percy RG, Pepper AE, Poland JA, Mohan Rai K, Sawant SV, Singh SK, Spriggs A, Taylor JM, Wang F, Yourstone SM, Zheng X, Lawley CT, Ganal MW, Van Deynze A, Wilson IW, Stelly DM. Development of a 63K SNP Array for Cotton and High-Density Mapping of Intraspecific and Interspecific Populations of Gossypium spp. G3 (BETHESDA, MD.) 2015; 5:1187-209. [PMID: 25908569 PMCID: PMC4478548 DOI: 10.1534/g3.115.018416] [Citation(s) in RCA: 108] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/16/2015] [Accepted: 04/11/2015] [Indexed: 11/18/2022]
Abstract
High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community.
Collapse
Affiliation(s)
- Amanda M Hulse-Kemp
- Department of Soil & Crop Sciences, Texas A&M University, College Station, Texas 77843 Interdisciplinary Degree Program in Genetics, Texas A&M University, College Station, Texas 77843
| | - Jana Lemm
- TraitGenetics GmbH, 06466 Gatersleben, Germany
| | | | - Hamid Ashrafi
- Department of Plant Sciences and Seed Biotechnology Center, University of California-Davis, Davis, California 95616
| | - Ramesh Buyyarapu
- Dow AgroSciences, Trait Genetics and Technologies, Indianapolis, Indiana 46268
| | - David D Fang
- USDA-ARS-SRRC, Cotton Fiber Bioscience Research Unit, New Orleans, Louisiana 70124
| | - James Frelichowski
- USDA-ARS-SPARC, Crop Germplasm Research Unit, College Station, Texas 77845
| | - Marc Giband
- CIRAD, UMR AGAP, Montpellier, F34398, France EMBRAPA, Algodão, Nucleo Cerrado, 75.375-000 Santo Antônio de Goias, GO, Brazil
| | - Steve Hague
- Department of Soil & Crop Sciences, Texas A&M University, College Station, Texas 77843
| | - Lori L Hinze
- USDA-ARS-SPARC, Crop Germplasm Research Unit, College Station, Texas 77845
| | - Kelli J Kochan
- Department of Animal Science, Texas A&M University, College Station, Texas 77843
| | - Penny K Riggs
- Interdisciplinary Degree Program in Genetics, Texas A&M University, College Station, Texas 77843 Department of Animal Science, Texas A&M University, College Station, Texas 77843
| | - Jodi A Scheffler
- USDA-ARS, Jamie Whitten Delta States Research Center, Stoneville, Mississippi 38776
| | - Joshua A Udall
- Brigham Young University, Plant and Wildlife Science Department, Provo, Utah 84602
| | - Mauricio Ulloa
- USDA-ARS, PA, Plant Stress and Germplasm Development Research Unit, Lubbock, Texas 79415
| | - Shirley S Wang
- USDA-ARS-SPARC, Crop Germplasm Research Unit, College Station, Texas 77845
| | - Qian-Hao Zhu
- CSIRO Agriculture Flagship, Black Mountain Laboratories, ACT 2601, Australia
| | - Sumit K Bag
- CSIR-National Botanical Research Institute, Plant Molecular Biology Division, Lucknow-226001, UP, India
| | - Archana Bhardwaj
- CSIR-National Botanical Research Institute, Plant Molecular Biology Division, Lucknow-226001, UP, India
| | - John J Burke
- USDA-ARS, PA, Plant Stress and Germplasm Development Research Unit, Lubbock, Texas 79415
| | - Robert L Byers
- Brigham Young University, Plant and Wildlife Science Department, Provo, Utah 84602
| | | | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, New York 14853
| | - David B Harker
- Brigham Young University, Plant and Wildlife Science Department, Provo, Utah 84602
| | - Md S Islam
- USDA-ARS-SRRC, Cotton Fiber Bioscience Research Unit, New Orleans, Louisiana 70124
| | - Johnie N Jenkins
- USDA-ARS, Genetics and Precision Agriculture Research, Mississippi State, Mississippi 39762
| | - Don C Jones
- Cotton Incorporated, Agricultural Research, Cary, North Carolina 27513
| | | | - Danny J Llewellyn
- CSIRO Agriculture Flagship, Black Mountain Laboratories, ACT 2601, Australia
| | - Richard G Percy
- USDA-ARS-SPARC, Crop Germplasm Research Unit, College Station, Texas 77845
| | - Alan E Pepper
- Interdisciplinary Degree Program in Genetics, Texas A&M University, College Station, Texas 77843 Department of Biology, Texas A&M University, College Station, Texas 77843
| | - Jesse A Poland
- Wheat Genetics Resource Center, Department of Plant Pathology and Department of Agronomy, Kansas State University, Manhattan, Kansas 66506
| | - Krishan Mohan Rai
- CSIR-National Botanical Research Institute, Plant Molecular Biology Division, Lucknow-226001, UP, India
| | - Samir V Sawant
- CSIR-National Botanical Research Institute, Plant Molecular Biology Division, Lucknow-226001, UP, India
| | - Sunil Kumar Singh
- CSIR-National Botanical Research Institute, Plant Molecular Biology Division, Lucknow-226001, UP, India
| | - Andrew Spriggs
- CSIRO Agriculture Flagship, Black Mountain Laboratories, ACT 2601, Australia
| | - Jen M Taylor
- CSIRO Agriculture Flagship, Black Mountain Laboratories, ACT 2601, Australia
| | - Fei Wang
- Department of Soil & Crop Sciences, Texas A&M University, College Station, Texas 77843
| | - Scott M Yourstone
- Brigham Young University, Plant and Wildlife Science Department, Provo, Utah 84602
| | - Xiuting Zheng
- Department of Soil & Crop Sciences, Texas A&M University, College Station, Texas 77843
| | | | | | - Allen Van Deynze
- Department of Plant Sciences and Seed Biotechnology Center, University of California-Davis, Davis, California 95616
| | - Iain W Wilson
- CSIRO Agriculture Flagship, Black Mountain Laboratories, ACT 2601, Australia
| | - David M Stelly
- Department of Soil & Crop Sciences, Texas A&M University, College Station, Texas 77843 Interdisciplinary Degree Program in Genetics, Texas A&M University, College Station, Texas 77843
| |
Collapse
|