1
|
Kögler A, Seibt KM, Heitkam T, Morgenstern K, Reiche B, Brückner M, Wolf H, Krabel D, Schmidt T. Divergence of 3' ends as a driver of short interspersed nuclear element (SINE) evolution in the Salicaceae. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 103:443-458. [PMID: 32056333 DOI: 10.1111/tpj.14721] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 01/13/2020] [Accepted: 01/29/2020] [Indexed: 06/10/2023]
Abstract
Short interspersed nuclear elements (SINEs) are small, non-autonomous and heterogeneous retrotransposons that are widespread in plants. To explore the amplification dynamics and evolutionary history of SINE populations in representative deciduous tree species, we analyzed the genomes of the six following Salicaceae species: Populus deltoides, Populus euphratica, Populus tremula, Populus tremuloides, Populus trichocarpa, and Salix purpurea. We identified 11 Salicaceae SINE families (SaliS-I to SaliS-XI), comprising 27 077 full-length copies. Most of these families harbor segmental similarities, providing evidence for SINE emergence by reshuffling or heterodimerization. We observed two SINE groups, differing in phylogenetic distribution pattern, similarity and 3' end structure. These groups probably emerged during the 'salicoid duplication' (~65 million years ago) in the Salix-Populus progenitor and during the separation of the genus Salix (45-65 million years ago), respectively. In contrast to conserved 5' start motifs across species and SINE families, the 3' ends are highly variable in sequence and length. This extraordinary 3'-end variability results from mutations in the poly(A) tail, which were fixed by subsequent amplificational bursts. We show that the dissemination of newly evolved 3' ends is accomplished by a displacement of older motifs, leading to various 3'-end subpopulations within the SaliS families.
Collapse
Affiliation(s)
- Anja Kögler
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kathrin M Seibt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Tony Heitkam
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kristin Morgenstern
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Birgit Reiche
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | | | - Heino Wolf
- Staatsbetrieb Sachsenforst, 01796, Pirna, Germany
| | - Doris Krabel
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Thomas Schmidt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|
2
|
Characterization and Development of Genomic SSRs in Pecan (Carya illinoinensis). FORESTS 2020. [DOI: 10.3390/f11010061] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Research Highlights: The distribution of simple sequence repeat (SSR) motifs in two draft genomes of pecan was evaluated. Sixty-six SSR loci were validated by PCR amplification in pecan. Twenty-two new development markers can be used for genetic study in genus Carya. Background and Objectives: Pecan has good nutritional and health benefits and is an important crop worldwide. However, the genetic research in this species is insufficient. One of the main reasons for this is the lack of enough accurate, convenient, and economical molecular markers. Among different marker types, SSR loci are enormously useful in genetic studies. However, the number of SSRs in C. illinoinensis (Wangenh.) K. Koch is limited. Materials and Methods: The distribution of SSR motifs in the pecan genome was analyzed. Then, the primers for each SSR were designed. To evaluate their availability, 74 SSR loci were randomly selected and amplified in pecan. Finally, 22 new SSRs and eight former ones were picked to evaluate the genetic diversity in 60 pecan genotypes and to determine their transferability in other Carya species. Results: 145,714 and 143,041 SSR motifs were obtained from two draft genomes of ‘87MX3-2’ and ‘Pawnee’, respectively. In total, 9145 candidate primers were obtained. Sixty-six (89.19%) primers amplified the target products. Among the 30 SSRs, 29 loci showed polymorphism in 60 pecan genotypes. The polymorphic information content (PIC) values ranged from 0.012 to 0.906. In total, 26, 25, and 22 SSRs can be used in C. cathayensis Sarg., C. dabieshanensis W. C. Cheng & R. H. Chang, and C. hunanensis W.C. Liu, respectively. Finally, the dendrogram of all individuals was constructed. The results agree with the geographic origin of the four species and the pedigree relationships between different pecan cultivars. Conclusions: The characterization of SSRs in the pecan genome and the new SSRs will promote the progress of genetic study and breeding in pecan, as well as other species of genus Carya.
Collapse
|
3
|
Luo X, Liu J. Transcriptome Analysis of Acid-Responsive Genes and Pathways Involved in Polyamine Regulation in Iron Walnut. Genes (Basel) 2019; 10:E605. [PMID: 31405132 PMCID: PMC6723594 DOI: 10.3390/genes10080605] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2019] [Revised: 08/07/2019] [Accepted: 08/07/2019] [Indexed: 02/07/2023] Open
Abstract
We reported changes in the co-regulated mRNA expression in iron walnut (Juglans sigillata) in response to soil pH treatments and identified mRNAs specific to acidic soil conditions. Phenotypic and physiological analyses revealed that iron walnut growth was greater for the pH 4-5 and pH 5-6 treatments than for the pH 3-4 and pH 6-7 treatments. A total of 2768 differentially expressed genes were detected and categorized into 12 clusters by Short Time-series Expression Miner (STEM). The 994 low-expression genes in cluster III and 255 high-expression genes in cluster X were classified as acid-responsive genes on the basis of the relationships between phenotype, physiology, and STEM clustering, and the two gene clusters were analyzed by a maximum likelihood (ML) evolutionary tree with the greatest log likelihood values. No prominent sub-clusters occurred in cluster III, but three occurred in cluster X. The Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis indicated that acid-responsive genes were related primarily to arginine biosynthesis and the arginine/proline metabolism pathway, implying that polyamine accumulation may enhance iron walnut acid stress tolerance. Overall, our results revealed 1249 potentially acid-responsive genes in iron walnut, indicating that its response to acid stress involves different pathways and activated genes.
Collapse
Affiliation(s)
- Xiaomei Luo
- College of Forestry, Sichuan Agricultural University, Huimin Road 211 in Wenjiang District, Chengdu 611130, China.
| | - Juncheng Liu
- College of Forestry, Sichuan Agricultural University, Huimin Road 211 in Wenjiang District, Chengdu 611130, China
| |
Collapse
|
4
|
Si Z, Du B, Huo J, He S, Liu Q, Zhai H. A genome-wide BAC-end sequence survey provides first insights into sweetpotato (Ipomoea batatas (L.) Lam.) genome composition. BMC Genomics 2016; 17:945. [PMID: 27871234 PMCID: PMC5117676 DOI: 10.1186/s12864-016-3302-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2016] [Accepted: 11/15/2016] [Indexed: 11/14/2022] Open
Abstract
Background Sweetpotato, Ipomoea batatas (L.) Lam., is an important food crop widely grown in the world. However, little is known about the genome of this species because it is a highly heterozygous hexaploid. Gaining a more in-depth knowledge of sweetpotato genome is therefore necessary and imperative. In this study, the first bacterial artificial chromosome (BAC) library of sweetpotato was constructed. Clones from the BAC library were end-sequenced and analyzed to provide genome-wide information about this species. Results The BAC library contained 240,384 clones with an average insert size of 101 kb and had a 7.93–10.82 × coverage of the genome, and the probability of isolating any single-copy DNA sequence from the library was more than 99%. Both ends of 8310 BAC clones randomly selected from the library were sequenced to generate 11,542 high-quality BAC-end sequences (BESs), with an accumulative length of 7,595,261 bp and an average length of 658 bp. Analysis of the BESs revealed that 12.17% of the sweetpotato genome were known repetitive DNA, including 7.37% long terminal repeat (LTR) retrotransposons, 1.15% Non-LTR retrotransposons and 1.42% Class II DNA transposons etc., 18.31% of the genome were identified as sweetpotato-unique repetitive DNA and 10.00% of the genome were predicted to be coding regions. In total, 3,846 simple sequences repeats (SSRs) were identified, with a density of one SSR per 1.93 kb, from which 288 SSRs primers were designed and tested for length polymorphism using 20 sweetpotato accessions, 173 (60.07%) of them produced polymorphic bands. Sweetpotato BESs had significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum than those of Vitis vinifera, Theobroma cacao and Arabidopsis thaliana. Conclusions The first BAC library for sweetpotato has been successfully constructed. The high quality BESs provide first insights into sweetpotato genome composition, and have significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum. These resources as a robust platform will be used in high-resolution mapping, gene cloning, assembly of genome sequences, comparative genomics and evolution for sweetpotato. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3302-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Zengzhi Si
- Beijing Key Laboratory of Crop Genetic Improvement/Laboratory of Crop Heterosis and Utilization, Ministry of Education, China Agricultural University, Beijing, 100193, China
| | - Bing Du
- Beijing Key Laboratory of Crop Genetic Improvement/Laboratory of Crop Heterosis and Utilization, Ministry of Education, China Agricultural University, Beijing, 100193, China
| | - Jinxi Huo
- Beijing Key Laboratory of Crop Genetic Improvement/Laboratory of Crop Heterosis and Utilization, Ministry of Education, China Agricultural University, Beijing, 100193, China
| | - Shaozhen He
- Beijing Key Laboratory of Crop Genetic Improvement/Laboratory of Crop Heterosis and Utilization, Ministry of Education, China Agricultural University, Beijing, 100193, China
| | - Qingchang Liu
- Beijing Key Laboratory of Crop Genetic Improvement/Laboratory of Crop Heterosis and Utilization, Ministry of Education, China Agricultural University, Beijing, 100193, China.
| | - Hong Zhai
- Beijing Key Laboratory of Crop Genetic Improvement/Laboratory of Crop Heterosis and Utilization, Ministry of Education, China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
5
|
Martínez-García PJ, Crepeau MW, Puiu D, Gonzalez-Ibeas D, Whalen J, Stevens KA, Paul R, Butterfield TS, Britton MT, Reagan RL, Chakraborty S, Walawage SL, Vasquez-Gross HA, Cardeno C, Famula RA, Pratt K, Kuruganti S, Aradhya MK, Leslie CA, Dandekar AM, Salzberg SL, Wegrzyn JL, Langley CH, Neale DB. The walnut (Juglans regia) genome sequence reveals diversity in genes coding for the biosynthesis of non-structural polyphenols. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 87:507-32. [PMID: 27145194 DOI: 10.1111/tpj.13207] [Citation(s) in RCA: 118] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2015] [Revised: 04/22/2016] [Accepted: 04/27/2016] [Indexed: 05/18/2023]
Abstract
The Persian walnut (Juglans regia L.), a diploid species native to the mountainous regions of Central Asia, is the major walnut species cultivated for nut production and is one of the most widespread tree nut species in the world. The high nutritional value of J. regia nuts is associated with a rich array of polyphenolic compounds, whose complete biosynthetic pathways are still unknown. A J. regia genome sequence was obtained from the cultivar 'Chandler' to discover target genes and additional unknown genes. The 667-Mbp genome was assembled using two different methods (SOAPdenovo2 and MaSuRCA), with an N50 scaffold size of 464 955 bp (based on a genome size of 606 Mbp), 221 640 contigs and a GC content of 37%. Annotation with MAKER-P and other genomic resources yielded 32 498 gene models. Previous studies in walnut relying on tissue-specific methods have only identified a single polyphenol oxidase (PPO) gene (JrPPO1). Enabled by the J. regia genome sequence, a second homolog of PPO (JrPPO2) was discovered. In addition, about 130 genes in the large gallate 1-β-glucosyltransferase (GGT) superfamily were detected. Specifically, two genes, JrGGT1 and JrGGT2, were significantly homologous to the GGT from Quercus robur (QrGGT), which is involved in the synthesis of 1-O-galloyl-β-d-glucose, a precursor for the synthesis of hydrolysable tannins. The reference genome for J. regia provides meaningful insight into the complex pathways required for the synthesis of polyphenols. The walnut genome sequence provides important tools and methods to accelerate breeding and to facilitate the genetic dissection of complex traits.
Collapse
Affiliation(s)
| | - Marc W Crepeau
- Department of Evolution and Ecology, University of California, Davis, CA, 95616, USA
| | - Daniela Puiu
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, MD, 21205, USA
| | - Daniel Gonzalez-Ibeas
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, 06269-3043, USA
| | - Jeanne Whalen
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, 06269-3043, USA
| | - Kristian A Stevens
- Department of Evolution and Ecology, University of California, Davis, CA, 95616, USA
| | - Robin Paul
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, 06269-3043, USA
| | | | | | - Russell L Reagan
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Sandeep Chakraborty
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Sriema L Walawage
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | | | - Charis Cardeno
- Department of Evolution and Ecology, University of California, Davis, CA, 95616, USA
| | - Randi A Famula
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Kevin Pratt
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, 06269-3043, USA
| | - Sowmya Kuruganti
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, 06269-3043, USA
| | | | - Charles A Leslie
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Abhaya M Dandekar
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA
| | - Steven L Salzberg
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, MD, 21205, USA
- Departments of Biomedical Engineering, Computer Science, and Biostatistics, Johns Hopkins University, Baltimore, MD, 21205, USA
| | - Jill L Wegrzyn
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, 06269-3043, USA
| | - Charles H Langley
- Department of Evolution and Ecology, University of California, Davis, CA, 95616, USA
| | - David B Neale
- Department of Plant Sciences, University of California, Davis, CA, 95616, USA.
| |
Collapse
|
6
|
Seibt KM, Wenke T, Muders K, Truberg B, Schmidt T. Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 86:268-285. [PMID: 26996788 DOI: 10.1111/tpj.13170] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2016] [Revised: 03/11/2016] [Accepted: 03/14/2016] [Indexed: 06/05/2023]
Abstract
Short interspersed nuclear elements (SINEs) are highly abundant non-autonomous retrotransposons that are widespread in plants. They are short in size, non-coding, show high sequence diversity, and are therefore mostly not or not correctly annotated in plant genome sequences. Hence, comparative studies on genomic SINE populations are rare. To explore the structural organization and impact of SINEs, we comparatively investigated the genome sequences of the Solanaceae species potato (Solanum tuberosum), tomato (Solanum lycopersicum), wild tomato (Solanum pennellii), and two pepper cultivars (Capsicum annuum). Based on 8.5 Gbp sequence data, we annotated 82 983 SINE copies belonging to 10 families and subfamilies on a base pair level. Solanaceae SINEs are dispersed over all chromosomes with enrichments in distal regions. Depending on the genome assemblies and gene predictions, 30% of all SINE copies are associated with genes, particularly frequent in introns and untranslated regions (UTRs). The close association with genes is family specific. More than 10% of all genes annotated in the Solanaceae species investigated contain at least one SINE insertion, and we found genes harbouring up to 16 SINE copies. We demonstrate the involvement of SINEs in gene and genome evolution including the donation of splice sites, start and stop codons and exons to genes, enlargement of introns and UTRs, generation of tandem-like duplications and transduction of adjacent sequence regions.
Collapse
Affiliation(s)
- Kathrin M Seibt
- Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Torsten Wenke
- Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | | | | | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|
7
|
Luo MC, You FM, Li P, Wang JR, Zhu T, Dandekar AM, Leslie CA, Aradhya M, McGuire PE, Dvorak J. Synteny analysis in Rosids with a walnut physical map reveals slow genome evolution in long-lived woody perennials. BMC Genomics 2015; 16:707. [PMID: 26383694 PMCID: PMC4574618 DOI: 10.1186/s12864-015-1906-5] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2015] [Accepted: 09/09/2015] [Indexed: 01/15/2023] Open
Abstract
BACKGROUND Mutations often accompany DNA replication. Since there may be fewer cell cycles per year in the germlines of long-lived than short-lived angiosperms, the genomes of long-lived angiosperms may be diverging more slowly than those of short-lived angiosperms. Here we test this hypothesis. RESULTS We first constructed a genetic map for walnut, a woody perennial. All linkage groups were short, and recombination rates were greatly reduced in the centromeric regions. We then used the genetic map to construct a walnut bacterial artificial chromosome (BAC) clone-based physical map, which contained 15,203 exonic BAC-end sequences, and quantified with it synteny between the walnut genome and genomes of three long-lived woody perennials, Vitis vinifera, Populus trichocarpa, and Malus domestica, and three short-lived herbs, Cucumis sativus, Medicago truncatula, and Fragaria vesca. Each measure of synteny we used showed that the genomes of woody perennials were less diverged from the walnut genome than those of herbs. We also estimated the nucleotide substitution rate at silent codon positions in the walnut lineage. It was one-fifth and one-sixth of published nucleotide substitution rates in the Medicago and Arabidopsis lineages, respectively. We uncovered a whole-genome duplication in the walnut lineage, dated it to the neighborhood of the Cretaceous-Tertiary boundary, and allocated the 16 walnut chromosomes into eight homoeologous pairs. We pointed out that during polyploidy-dysploidy cycles, the dominant tendency is to reduce the chromosome number. CONCLUSION Slow rates of nucleotide substitution are accompanied by slow rates of synteny erosion during genome divergence in woody perennials.
Collapse
Affiliation(s)
- Ming-Cheng Luo
- Department of Plant Sciences, University of California, Davis, CA, USA.
| | - Frank M You
- Cereal Research Centre, Agriculture and Agri-Food Canada, Morden, Canada.
| | - Pingchuan Li
- Cereal Research Centre, Agriculture and Agri-Food Canada, Morden, Canada.
| | - Ji-Rui Wang
- Department of Plant Sciences, University of California, Davis, CA, USA. .,Triticeae Research Institute, Sichuan Agricultural University, Wenjiang, China.
| | - Tingting Zhu
- Department of Plant Sciences, University of California, Davis, CA, USA.
| | - Abhaya M Dandekar
- Department of Plant Sciences, University of California, Davis, CA, USA.
| | - Charles A Leslie
- Department of Plant Sciences, University of California, Davis, CA, USA.
| | - Mallikarjuna Aradhya
- United States Department of Agriculture-Agricultural Research Service Clonal Repository, Davis, CA, USA.
| | - Patrick E McGuire
- Department of Plant Sciences, University of California, Davis, CA, USA.
| | - Jan Dvorak
- Department of Plant Sciences, University of California, Davis, CA, USA.
| |
Collapse
|
8
|
Zhu Y, Yin Y, Yang K, Li J, Sang Y, Huang L, Fan S. Construction of a high-density genetic map using specific length amplified fragment markers and identification of a quantitative trait locus for anthracnose resistance in walnut (Juglans regia L.). BMC Genomics 2015; 16:614. [PMID: 26283231 PMCID: PMC4539690 DOI: 10.1186/s12864-015-1822-8] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2014] [Accepted: 08/07/2015] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Walnut (Juglans regia, 2n = 32, approximately 606 Mb per 1C genome) is an economically important tree crop. Resistance to anthracnose, caused by Colletotrichum gloeosporioides, is a major objective of walnut genetic improvement in China. The recently developed specific length amplified fragment sequencing (SLAF-seq) is an efficient strategy that can obtain large numbers of markers with sufficient sequence information to construct high-density genetic maps and permits detection of quantitative trait loci (QTLs) for molecular breeding. RESULTS SLAF-seq generated 161.64 M paired-end reads. 153,820 SLAF markers were obtained, of which 49,174 were polymorphic. 13,635 polymorphic markers were sorted into five segregation types and 2,577 markers of them were used to construct genetic linkage maps: 2,395 of these fell into 16 linkage groups (LGs) for the female map, 448 markers for the male map, and 2,577 markers for the integrated map. Taking into account the size of all LGs, the marker coverage was 2,664.36 cM for the female map, 1,305.58 cM for the male map, and 2,457.82 cM for the integrated map. The average intervals between two adjacent mapped markers were 1.11 cM, 2.91 cM and 0.95 cM for three maps, respectively. 'SNP_only' markers accounted for 89.25% of the markers on the integrated map. Mapping markers contained 5,043 single nucleotide polymorphisms (SNPs) loci, which corresponded to two SNP loci per SLAF marker. According to the integrated map, we used interval mapping (Logarithm of odds, LOD > 3.0) to detect our quantitative trait. One QTL was detected for anthracnose resistance. The interval of this QTL ranged from 165.51 cM to 176.33 cM on LG14, and ten markers in this interval that were above the threshold value were considered to be linked markers to the anthracnose resistance trait. The phenotypic variance explained by each marker ranged from 16.2 to 19.9%, and their LOD scores varied from 3.22 to 4.04. CONCLUSIONS High-density genetic maps for walnut containing 16 LGs were constructed using the SLAF-seq method with an F1 population. One QTL for walnut anthracnose resistance was identified based on the map. The results will aid molecular marker-assisted breeding and walnut resistance genes identification.
Collapse
Affiliation(s)
- Yufeng Zhu
- College of Forestry, Shandong Agricultural University, No.61 Daizong Load, Taian, Shandong Provence, 271018, P. R. China.
| | - Yanfei Yin
- College of Forestry, Shandong Agricultural University, No.61 Daizong Load, Taian, Shandong Provence, 271018, P. R. China.
| | - Keqiang Yang
- College of Forestry, Shandong Agricultural University, No.61 Daizong Load, Taian, Shandong Provence, 271018, P. R. China.
| | - Jihong Li
- College of Forestry, Shandong Agricultural University, No.61 Daizong Load, Taian, Shandong Provence, 271018, P. R. China.
| | - Yalin Sang
- College of Forestry, Shandong Agricultural University, No.61 Daizong Load, Taian, Shandong Provence, 271018, P. R. China.
| | - Long Huang
- Biomarker Technologies Corporation, Beijing, P. R. China.
| | - Shu Fan
- Biomarker Technologies Corporation, Beijing, P. R. China.
| |
Collapse
|
9
|
Müller BSDF, Sakamoto T, de Menezes IPP, Prado GS, Martins WS, Brondani C, de Barros EG, Vianello RP. Analysis of BAC-end sequences in common bean (Phaseolus vulgaris L.) towards the development and characterization of long motifs SSRs. PLANT MOLECULAR BIOLOGY 2014; 86:455-470. [PMID: 25164100 DOI: 10.1007/s11103-014-0240-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2013] [Accepted: 08/14/2014] [Indexed: 06/03/2023]
Abstract
The increasing volume of genomic data on the Phaseolus vulgaris species have contributed to its importance as a model genetic species and positively affected the investigation of other legumes of scientific and economic value. To expand and gain a more in-depth knowledge of the common bean genome, the ends of a number of bacterial artificial chromosome (BAC) were sequenced, annotated and the presence of repetitive sequences was determined. In total, 52,270 BESs (BAC-end sequences), equivalent to 32 Mbp (~6 %) of the genome, were processed. In total, 3,789 BES-SSRs were identified, with a distribution of one SSR (simple sequence repeat) per 8.36 kbp and 2,000 were suitable for the development of SSRs, of which 194 were evaluated in low-resolution screening. From 40 BES-SSRs based on long motifs SSRs (≥ trinucleotides) analyzed in high-resolution genotyping, 34 showed an equally good amplification for the Andean and for the Mesoamerican genepools, exhibiting an average gene diversity (H E) of 0.490 and 5.59 alleles/locus, of which six classified as Class I showed a H E ≥ 0.7. The PCoA and structure analysis allowed to discriminate the gene pools (K = 2, FST = 0.733). From the 52,270 BESs, 2 % corresponded to transcription factors and 3 % to transposable elements. Putative functions for 24,321 BESs were identified and for 19,363 were assigned functional categories (gene ontology). This study identified highly polymorphic BES-SSRs containing tri- to hexanucleotides motifs and bringing together relevant genetic characteristics useful for breeding programs. Additionally, the BESs were incorporated into the international genome-sequencing project for the common bean.
Collapse
Affiliation(s)
- Bárbara Salomão de Faria Müller
- Laboratório de Genética Molecular de Plantas, Instituto de Biotecnologia Aplicada à Agropecuária (BIOAGRO), Universidade Federal de Viçosa (UFV), Viçosa, MG, Brazil
| | | | | | | | | | | | | | | |
Collapse
|
10
|
Santos AA, Penha HA, Bellec A, Munhoz CDF, Pedrosa-Harand A, Bergès H, Vieira MLC. Begin at the beginning: A BAC-end view of the passion fruit (Passiflora) genome. BMC Genomics 2014; 15:816. [PMID: 25260959 PMCID: PMC4189760 DOI: 10.1186/1471-2164-15-816] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2014] [Accepted: 09/22/2014] [Indexed: 12/16/2022] Open
Abstract
Background The passion fruit (Passiflora edulis) is a tropical crop of economic importance both for juice production and consumption as fresh fruit. The juice is also used in concentrate blends that are consumed worldwide. However, very little is known about the genome of the species. Therefore, improving our understanding of passion fruit genomics is essential and to some degree a pre-requisite if its genetic resources are to be used more efficiently. In this study, we have constructed a large-insert BAC library and provided the first view on the structure and content of the passion fruit genome, using BAC-end sequence (BES) data as a major resource. Results The library consisted of 82,944 clones and its levels of organellar DNA were very low. The library represents six haploid genome equivalents, and the average insert size was 108 kb. To check its utility for gene isolation, successful macroarray screening experiments were carried out with probes complementary to eight Passiflora gene sequences available in public databases. BACs harbouring those genes were used in fluorescent in situ hybridizations and unique signals were detected for four BACs in three chromosomes (n = 9). Then, we explored 10,000 BES and we identified reads likely to contain repetitive mobile elements (19.6% of all BES), simple sequence repeats and putative proteins, and to estimate the GC content (~42%) of the reads. Around 9.6% of all BES were found to have high levels of similarity to plant genes and ontological terms were assigned to more than half of the sequences analysed (940). The vast majority of the top-hits made by our sequences were to Populus trichocarpa (24.8% of the total occurrences), Theobroma cacao (21.6%), Ricinus communis (14.3%), Vitis vinifera (6.5%) and Prunus persica (3.8%). Conclusions We generated the first large-insert library for a member of Passifloraceae. This BAC library provides a new resource for genetic and genomic studies, as well as it represents a valuable tool for future whole genome study. Remarkably, a number of BAC-end pair sequences could be mapped to intervals of the sequenced Arabidopsis thaliana, V. vinifera and P. trichocarpa chromosomes, and putative collinear microsyntenic regions were identified. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-816) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Maria Lucia Carneiro Vieira
- Departamento de Genética, Universidade de São Paulo, Escola Superior de Agricultura "Luiz de Queiroz", P,O, Box 83, 13400-970 Piracicaba, Brazil.
| |
Collapse
|
11
|
Ferreira de Carvalho J, Chelaifa H, Boutte J, Poulain J, Couloux A, Wincker P, Bellec A, Fourment J, Bergès H, Salmon A, Ainouche M. Exploring the genome of the salt-marsh Spartina maritima (Poaceae, Chloridoideae) through BAC end sequence analysis. PLANT MOLECULAR BIOLOGY 2013; 83:591-606. [PMID: 23877482 DOI: 10.1007/s11103-013-0111-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2013] [Accepted: 07/13/2013] [Indexed: 06/02/2023]
Abstract
Spartina species play an important ecological role on salt marshes. Spartina maritima is an Old-World species distributed along the European and North-African Atlantic coasts. This hexaploid species (2n = 6x = 60, 2C = 3,700 Mb) hybridized with different Spartina species introduced from the American coasts, which resulted in the formation of new invasive hybrids and allopolyploids. Thus, S. maritima raises evolutionary and ecological interests. However, genomic information is dramatically lacking in this genus. In an effort to develop genomic resources, we analysed 40,641 high-quality bacterial artificial chromosome-end sequences (BESs), representing 26.7 Mb of the S. maritima genome. BESs were searched for sequence homology against known databases. A fraction of 16.91% of the BESs represents known repeats including a majority of long terminal repeat (LTR) retrotransposons (13.67%). Non-LTR retrotransposons represent 0.75%, DNA transposons 0.99%, whereas small RNA, simple repeats and low-complexity sequences account for 1.38% of the analysed BESs. In addition, 4,285 simple sequence repeats were detected. Using the coding sequence database of Sorghum bicolor, 6,809 BESs found homology accounting for 17.1% of all BESs. Comparative genomics with related genera reveals that the microsynteny is better conserved with S. bicolor compared to other sequenced Poaceae, where 37.6% of the paired matching BESs are correctly orientated on the chromosomes. We did not observe large macrosyntenic rearrangements using the mapping strategy employed. However, some regions appeared to have experienced rearrangements when comparing Spartina to Sorghum and to Oryza. This work represents the first overview of S. maritima genome regarding the respective coding and repetitive components. The syntenic relationships with other grass genomes examined here help clarifying evolution in Poaceae, S. maritima being a part of the poorly-known Chloridoideae sub-family.
Collapse
Affiliation(s)
- J Ferreira de Carvalho
- UMR CNRS 6553 ECOBIO, OSUR, University of Rennes 1, Bât 14A Campus Scientifique de Beaulieu, 35042, Rennes Cedex, France
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Wegrzyn JL, Lin BY, Zieve JJ, Dougherty WM, Martínez-García PJ, Koriabine M, Holtz-Morris A, deJong P, Crepeau M, Langley CH, Puiu D, Salzberg SL, Neale DB, Stevens KA. Insights into the loblolly pine genome: characterization of BAC and fosmid sequences. PLoS One 2013; 8:e72439. [PMID: 24023741 PMCID: PMC3762812 DOI: 10.1371/journal.pone.0072439] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2013] [Accepted: 07/10/2013] [Indexed: 12/22/2022] Open
Abstract
Despite their prevalence and importance, the genome sequences of loblolly pine, Norway spruce, and white spruce, three ecologically and economically important conifer species, are just becoming available to the research community. Following the completion of these large assemblies, annotation efforts will be undertaken to characterize the reference sequences. Accurate annotation of these ancient genomes would be aided by a comprehensive repeat library; however, few studies have generated enough sequence to fully evaluate and catalog their non-genic content. In this paper, two sets of loblolly pine genomic sequence, 103 previously assembled BACs and 90,954 newly sequenced and assembled fosmid scaffolds, were analyzed. Together, this sequence represents 280 Mbp (roughly 1% of the loblolly pine genome) and one of the most comprehensive studies of repetitive elements and genes in a gymnosperm species. A combination of homology and de novo methodologies were applied to identify both conserved and novel repeats. Similarity analysis estimated a repetitive content of 27% that included both full and partial elements. When combined with the de novo investigation, the estimate increased to almost 86%. Over 60% of the repetitive sequence consists of full or partial LTR (long terminal repeat) retrotransposons. Through de novo approaches, 6,270 novel, full-length transposable element families and 9,415 sub-families were identified. Among those 6,270 families, 82% were annotated as single-copy. Several of the novel, high-copy families are described here, with the largest, PtPiedmont, comprising 133 full-length copies. In addition to repeats, analysis of the coding region reported 23 full-length eukaryotic orthologous proteins (KOGS) and another 29 novel or orthologous genes. These discoveries, along with other genomic resources, will be used to annotate conifer genomes and address long-standing questions about gymnosperm evolution.
Collapse
Affiliation(s)
- Jill L. Wegrzyn
- Department of Plant Sciences, University of California Davis, Davis, California, United States of America
- * E-mail: (JLW); (KAS)
| | - Brian Y. Lin
- Department of Plant Sciences, University of California Davis, Davis, California, United States of America
| | - Jacob J. Zieve
- Department of Plant Sciences, University of California Davis, Davis, California, United States of America
| | - William M. Dougherty
- Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
| | - Pedro J. Martínez-García
- Department of Plant Sciences, University of California Davis, Davis, California, United States of America
| | - Maxim Koriabine
- Children's Hospital Oakland Research Institute, Oakland, California, United States of America
| | - Ann Holtz-Morris
- Children's Hospital Oakland Research Institute, Oakland, California, United States of America
| | - Pieter deJong
- Children's Hospital Oakland Research Institute, Oakland, California, United States of America
| | - Marc Crepeau
- Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
| | - Charles H. Langley
- Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
| | - Daniela Puiu
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Steven L. Salzberg
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - David B. Neale
- Department of Plant Sciences, University of California Davis, Davis, California, United States of America
| | - Kristian A. Stevens
- Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
- * E-mail: (JLW); (KAS)
| |
Collapse
|
13
|
You FM, Deal KR, Wang J, Britton MT, Fass JN, Lin D, Dandekar AM, Leslie CA, Aradhya M, Luo MC, Dvorak J. Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms. BMC Genomics 2012; 13:354. [PMID: 22849334 PMCID: PMC3527177 DOI: 10.1186/1471-2164-13-354] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2012] [Accepted: 07/05/2012] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND A genome-wide set of single nucleotide polymorphisms (SNPs) is a valuable resource in genetic research and breeding and is usually developed by re-sequencing a genome. If a genome sequence is not available, an alternative strategy must be used. We previously reported the development of a pipeline (AGSNP) for genome-wide SNP discovery in coding sequences and other single-copy DNA without a complete genome sequence in self-pollinating (autogamous) plants. Here we updated this pipeline for SNP discovery in outcrossing (allogamous) species and demonstrated its efficacy in SNP discovery in walnut (Juglans regia L.). RESULTS The first step in the original implementation of the AGSNP pipeline was the construction of a reference sequence and the identification of single-copy sequences in it. To identify single-copy sequences, multiple genome equivalents of short SOLiD reads of another individual were mapped to shallow genome coverage of long Sanger or Roche 454 reads making up the reference sequence. The relative depth of SOLiD reads was used to filter out repeated sequences from single-copy sequences in the reference sequence. The second step was a search for SNPs between SOLiD reads and the reference sequence. Polymorphism within the mapped SOLiD reads would have precluded SNP discovery; hence both individuals had to be homozygous. The AGSNP pipeline was updated here for using SOLiD or other type of short reads of a heterozygous individual for these two principal steps. A total of 32.6X walnut genome equivalents of SOLiD reads of vegetatively propagated walnut scion cultivar 'Chandler' were mapped to 48,661 'Chandler' bacterial artificial chromosome (BAC) end sequences (BESs) produced by Sanger sequencing during the construction of a walnut physical map. A total of 22,799 putative SNPs were initially identified. A total of 6,000 Infinium II type SNPs evenly distributed along the walnut physical map were selected for the construction of an Infinium BeadChip, which was used to genotype a walnut mapping population having 'Chandler' as one of the parents. Genotyping results were used to adjust the filtering parameters of the updated AGSNP pipeline. With the adjusted filtering criteria, 69.6% of SNPs discovered with the updated pipeline were real and could be mapped on the walnut genetic map. A total of 13,439 SNPs were discovered by BES re-sequencing. BESs harboring SNPs were in 677 FPC contigs covering 98% of the physical map of the walnut genome. CONCLUSION The updated AGSNP pipeline is a versatile SNP discovery tool for a high-throughput, genome-wide SNP discovery in both autogamous and allogamous species. With this pipeline, a large set of SNPs were identified in a single walnut cultivar.
Collapse
Affiliation(s)
- Frank M You
- Department of Plant Sciences, University of California, Davis, CA 95616, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|