1
|
Repeat turnover meets stable chromosomes: repetitive DNA sequences mark speciation and gene pool boundaries in sugar beet and wild beets. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 118:171-190. [PMID: 38128038 DOI: 10.1111/tpj.16599] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 12/05/2023] [Accepted: 12/08/2023] [Indexed: 12/23/2023]
Abstract
Sugar beet and its wild relatives share a base chromosome number of nine and similar chromosome morphologies. Yet, interspecific breeding is impeded by chromosome and sequence divergence that is still not fully understood. Since repetitive DNAs are among the fastest evolving parts of the genome, we investigated, if repeatome innovations and losses are linked to chromosomal differentiation and speciation. We traced genome and chromosome-wide evolution across 13 beet species comprising all sections of the genera Beta and Patellifolia. For this, we combined short and long read sequencing, flow cytometry, and cytogenetics to build a comprehensive framework that spans the complete scale from DNA to chromosome to genome. Genome sizes and repeat profiles reflect the separation into three gene pools with contrasting evolutionary patterns. Among all repeats, satellite DNAs harbor most genomic variability, leading to fundamentally different centromere architectures, ranging from chromosomal uniformity in Beta and Patellifolia to the formation of patchwork chromosomes in Corollinae/Nanae. We show that repetitive DNAs are causal for the genome expansions and contractions across the beet genera, providing insights into the genomic underpinnings of beet speciation. Satellite DNAs in particular vary considerably between beet genomes, leading to the evolution of distinct chromosomal setups in the three gene pools, likely contributing to the barriers in beet breeding. Thus, with their isokaryotypic chromosome sets, beet genomes present an ideal system for studying the link between repeats, genomic variability, and chromosomal differentiation and provide a theoretical fundament for understanding barriers in any crop breeding effort.
Collapse
|
2
|
Generation and characterisation of an Arabidopsis thaliana f3h/fls1/ans triple mutant that accumulates eriodictyol derivatives. BMC PLANT BIOLOGY 2024; 24:99. [PMID: 38331743 PMCID: PMC10854054 DOI: 10.1186/s12870-024-04787-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Accepted: 01/31/2024] [Indexed: 02/10/2024]
Abstract
BACKGROUND Flavonoids are plant specialised metabolites, which derive from phenylalanine and acetate metabolism. They possess a variety of beneficial characteristics for plants and humans. Several modification steps in the synthesis of tricyclic flavonoids cause for the amazing diversity of flavonoids in plants. The 2-oxoglutarate-dependent dioxygenases (2-ODDs) flavanone 3-hydroxylase (F3H, synonym FHT), flavonol synthase (FLS) and anthocyanidin synthase (ANS, synonym leucoanthocyanidin dioxygenase (LDOX)), catalyse oxidative modifications to the central C ring. They are highly similar and have been shown to catalyse, at least in part, each other's reactions. FLS and ANS have been identified as bifunctional enzymes in many species, including Arabidopsis thaliana, stressing the capability of plants to bypass missing or mutated reaction steps on the way to flavonoid production. However, little is known about such bypass reactions and the flavonoid composition of plants lacking all three central flavonoid 2-ODDs. RESULTS To address this issue, we generated a f3h/fls1/ans mutant, as well as the corresponding double mutants and investigated the flavonoid composition of this mutant collection. The f3h/fls1/ans mutant was further characterised at the genomic level by analysis of a nanopore DNA sequencing generated genome sequence assembly and at the transcriptomic level by RNA-Seq analysis. The mutant collection established, including the novel double mutants f3h/fls1 and f3h/ans, was used to validate and analyse the multifunctionalities of F3H, FLS1, and ANS in planta. Metabolite analyses revealed the accumulation of eriodictyol and additional glycosylated derivatives in mutants carrying the f3h mutant allele, resulting from the conversion of naringenin to eriodictyol by flavonoid 3'-hydroxylase (F3'H) activity. CONCLUSIONS We describe the in planta multifunctionality of the three central flavonoid 2-ODDs from A. thaliana and identify a bypass in the f3h/fls1/ans triple mutant that leads to the formation of eriodictyol derivatives. As (homo-)eriodictyols are known as bitter taste maskers, the annotated eriodictyol (derivatives) and in particular the observations made on their in planta production, could provide valuable insights for the creation of novel food supplements.
Collapse
|
3
|
Genomic characterization of a nematode tolerance locus in sugar beet. BMC Genomics 2023; 24:748. [PMID: 38057719 DOI: 10.1186/s12864-023-09823-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 11/21/2023] [Indexed: 12/08/2023] Open
Abstract
BACKGROUND Infection by beet cyst nematodes (BCN, Heterodera schachtii) causes a serious disease of sugar beet, and climatic change is expected to improve the conditions for BCN infection. Yield and yield stability under adverse conditions are among the main breeding objectives. Breeding of BCN tolerant sugar beet cultivars offering high yield in the presence of the pathogen is therefore of high relevance. RESULTS To identify causal genes providing tolerance against BCN infection, we combined several experimental and bioinformatic approaches. Relevant genomic regions were detected through mapping-by-sequencing using a segregating F2 population. DNA sequencing of contrasting F2 pools and analyses of allele frequencies for variant positions identified a single genomic region which confers nematode tolerance. The genomic interval was confirmed and narrowed down by genotyping with newly developed molecular markers. To pinpoint the causal genes within the potential nematode tolerance locus, we generated long read-based genome sequence assemblies of the tolerant parental breeding line Strube U2Bv and the susceptible reference line 2320Bv. We analyzed continuous sequences of the potential locus with regard to functional gene annotation and differential gene expression upon BCN infection. A cluster of genes with similarity to the Arabidopsis thaliana gene encoding nodule inception protein-like protein 7 (NLP7) was identified. Gene expression analyses confirmed transcriptional activity and revealed clear differences between susceptible and tolerant genotypes. CONCLUSIONS Our findings provide new insights into the genomic basis of plant-nematode interactions that can be used to design and accelerate novel management strategies against BCN.
Collapse
|
4
|
Three R2R3-MYB transcription factors from banana (Musa acuminata) activate structural anthocyanin biosynthesis genes as part of an MBW complex. BMC Res Notes 2023; 16:103. [PMID: 37312204 DOI: 10.1186/s13104-023-06375-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Accepted: 06/02/2023] [Indexed: 06/15/2023] Open
Abstract
OBJECTIVE Bananas are one of the most popular fruits in the world, providing food security and employment opportunities in several developing countries. Increasing the anthocyanin content of banana fruit could improve the health-promoting properties. Anthocyanin biosynthesis is largely regulated at the transcriptional level. However, relatively little is known about the transcriptional activation of anthocyanin biosynthesis in banana. RESULTS We analysed the regulatory activity of three Musa acuminata MYBs that were predicted by bioinformatic analysis to transcriptionally regulate anthocyanin biosynthesis in banana. MaMYBA1, MaMYBA2 and MaMYBPA2 did not complement the anthocyanin-deficient phenotype of the Arabidopsis thaliana pap1/pap2 mutant. However, co-transfection experiments in A. thaliana protoplasts showed that MaMYBA1, MaMYBA2 and MaMYBPA2 function as components of a transcription factor complex with a bHLH and WD40 protein, the so called MBW complex, resulting in the activation of the A. thaliana ANTHOCYANIDIN SYNTHASE and DIHYDROFLAVONOL 4-REDUCTASE promoters. The activation potential of MaMYBA1, MaMYBA2 and MaMYBPA2 was increased when combined with the monocot Zea mays bHLH ZmR instead of the dicot AtEGL3. This work paves the path towards decoding the MBW complex-mediated transcriptional activation of anthocyanin biosynthesis in banana. It will also facilitate research towards increased anthocyanin content in banana and other monocot crops.
Collapse
|
5
|
Phased grapevine genome sequence of an Rpv12 carrier for biotechnological exploration of resistance to Plasmopara viticola. FRONTIERS IN PLANT SCIENCE 2023; 14:1180982. [PMID: 37223784 PMCID: PMC10200900 DOI: 10.3389/fpls.2023.1180982] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 04/11/2023] [Indexed: 05/25/2023]
Abstract
The downy mildew disease caused by the oomycete Plasmopara viticola is a serious threat for grapevine and can cause enormous yield losses in viticulture. The quantitative trait locus Rpv12, mediating resistance against P. viticola, was originally found in Asian Vitis amurensis. This locus and its genes were analyzed here in detail. A haplotype-separated genome sequence of the diploid Rpv12-carrier Gf.99-03 was created and annotated. The defense response against P. viticola was investigated in an infection time-course RNA-seq experiment, revealing approximately 600 upregulated Vitis genes during host-pathogen interaction. The Rpv12 regions of the resistance and the sensitivity encoding Gf.99-03 haplotype were structurally and functionally compared with each other. Two different clusters of resistance-related genes were identified within the Rpv12 locus. One cluster carries a set of four differentially expressed genes with three ACCELERATED CELL DEATH 6-like genes. The other cluster carries a set of six resistance gene analogs related to qualitative pathogen resistance. The Rpv12 locus and its candidate genes for P. viticola resistance provide a precious genetic resource for P. viticola resistance breeding. Newly developed co-segregating simple sequence repeat markers in close proximity to the R-genes enable its improved applicability in marker-assisted grapevine breeding.
Collapse
|
6
|
Mapping‑by‑Sequencing Reveals Genomic Regions Associated with Seed Quality Parameters in Brassica napus. Genes (Basel) 2022; 13:genes13071131. [PMID: 35885914 PMCID: PMC9317104 DOI: 10.3390/genes13071131] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 06/15/2022] [Accepted: 06/22/2022] [Indexed: 11/21/2022] Open
Abstract
Rapeseed (Brassica napus L.) is an important oil crop and has the potential to serve as a highly productive source of protein. This protein exhibits an excellent amino acid composition and has high nutritional value for humans. Seed protein content (SPC) and seed oil content (SOC) are two complex quantitative and polygenic traits which are negatively correlated and assumed to be controlled by additive and epistatic effects. A reduction in seed glucosinolate (GSL) content is desired as GSLs cause a stringent and bitter taste. The goal here was the identification of genomic intervals relevant for seed GSL content and SPC/SOC. Mapping by sequencing (MBS) revealed 30 and 15 new and known genomic intervals associated with seed GSL content and SPC/SOC, respectively. Within these intervals, we identified known but also so far unknown putatively causal genes and sequence variants. A 4 bp insertion in the MYB28 homolog on C09 shows a significant association with a reduction in seed GSL content. This study provides insights into the genetic architecture and potential mechanisms underlying seed quality traits, which will enhance future breeding approaches in B. napus.
Collapse
|
7
|
Complete pan-plastome sequences enable high resolution phylogenetic classification of sugar beet and closely related crop wild relatives. BMC Genomics 2022; 23:113. [PMID: 35139817 PMCID: PMC8830136 DOI: 10.1186/s12864-022-08336-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 01/20/2022] [Indexed: 12/24/2022] Open
Abstract
Background As the major source of sugar in moderate climates, sugar-producing beets (Beta vulgaris subsp. vulgaris) have a high economic value. However, the low genetic diversity within cultivated beets requires introduction of new traits, for example to increase their tolerance and resistance attributes – traits that often reside in the crop wild relatives. For this, genetic information of wild beet relatives and their phylogenetic placements to each other are crucial. To answer this need, we sequenced and assembled the complete plastome sequences from a broad species spectrum across the beet genera Beta and Patellifolia, both embedded in the Betoideae (order Caryophyllales). This pan-plastome dataset was then used to determine the wild beet phylogeny in high-resolution. Results We sequenced the plastomes of 18 closely related accessions representing 11 species of the Betoideae subfamily and provided high-quality plastome assemblies which represent an important resource for further studies of beet wild relatives and the diverse plant order Caryophyllales. Their assembly sizes range from 149,723 bp (Beta vulgaris subsp. vulgaris) to 152,816 bp (Beta nana), with most variability in the intergenic sequences. Combining plastome-derived phylogenies with read-based treatments based on mitochondrial information, we were able to suggest a unified and highly confident phylogenetic placement of the investigated Betoideae species. Our results show that the genus Beta can be divided into the two clearly separated sections Beta and Corollinae. Our analysis confirms the affiliation of B. nana with the other Corollinae species, and we argue against a separate placement in the Nanae section. Within the Patellifolia genus, the two diploid species Patellifolia procumbens and Patellifolia webbiana are, regarding the plastome sequences, genetically more similar to each other than to the tetraploid Patellifolia patellaris. Nevertheless, all three Patellifolia species are clearly separated. Conclusion In conclusion, our wild beet plastome assemblies represent a new resource to understand the molecular base of the beet germplasm. Despite large differences on the phenotypic level, our pan-plastome dataset is highly conserved. For the first time in beets, our whole plastome sequences overcome the low sequence variation in individual genes and provide the molecular backbone for highly resolved beet phylogenomics. Hence, our plastome sequencing strategy can also guide genomic approaches to unravel other closely related taxa. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08336-8.
Collapse
|
8
|
Characterization of the Brassica napus Flavonol Synthase Gene Family Reveals Bifunctional Flavonol Synthases. FRONTIERS IN PLANT SCIENCE 2021; 12:733762. [PMID: 34721462 PMCID: PMC8548573 DOI: 10.3389/fpls.2021.733762] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Accepted: 09/21/2021] [Indexed: 06/13/2023]
Abstract
Flavonol synthase (FLS) is a key enzyme for the formation of flavonols, which are a subclass of the flavonoids. FLS catalyzes the conversion of dihydroflavonols to flavonols. The enzyme belongs to the 2-oxoglutarate-dependent dioxygenases (2-ODD) superfamily. We characterized the FLS gene family of Brassica napus that covers 13 genes, based on the genome sequence of the B. napus cultivar Express 617. The goal was to unravel which BnaFLS genes are relevant for seed flavonol accumulation in the amphidiploid species B. napus. Two BnaFLS1 homeologs were identified and shown to encode bifunctional enzymes. Both exhibit FLS activity as well as flavanone 3-hydroxylase (F3H) activity, which was demonstrated in vivo and in planta. BnaFLS1-1 and -2 are capable of converting flavanones into dihydroflavonols and further into flavonols. Analysis of spatio-temporal transcription patterns revealed similar expression profiles of BnaFLS1 genes. Both are mainly expressed in reproductive organs and co-expressed with the genes encoding early steps of flavonoid biosynthesis. Our results provide novel insights into flavonol biosynthesis in B. napus and contribute information for breeding targets with the aim to modify the flavonol content in rapeseed.
Collapse
|
9
|
Functional Characterisation of Banana ( Musa spp.) 2-Oxoglutarate-Dependent Dioxygenases Involved in Flavonoid Biosynthesis. FRONTIERS IN PLANT SCIENCE 2021; 12:701780. [PMID: 34484266 PMCID: PMC8415913 DOI: 10.3389/fpls.2021.701780] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Accepted: 07/20/2021] [Indexed: 05/27/2023]
Abstract
Bananas (Musa) are non-grass, monocotyledonous, perennial plants that are well known for their edible fruits. Their cultivation provides food security and employment opportunities in many countries. Banana fruits contain high levels of minerals and phytochemicals, including flavonoids, which are beneficial for human nutrition. To broaden the knowledge on flavonoid biosynthesis in this major crop plant, we aimed to identify and functionally characterise selected structural genes encoding 2-oxoglutarate-dependent dioxygenases, involved in the formation of the flavonoid aglycon. Musa candidates genes predicted to encode flavanone 3-hydroxylase (F3H), flavonol synthase (FLS) and anthocyanidin synthase (ANS) were assayed. Enzymatic functionalities of the recombinant proteins were confirmed in vivo using bioconversion assays. Moreover, transgenic analyses in corresponding Arabidopsis thaliana mutants showed that MusaF3H, MusaFLS and MusaANS were able to complement the respective loss-of-function phenotypes, thus verifying functionality of the enzymes in planta. Knowledge gained from this work provides a new aspect for further research towards genetic engineering of flavonoid biosynthesis in banana fruits to increase their antioxidant activity and nutritional value.
Collapse
|
10
|
Large scale genomic rearrangements in selected Arabidopsis thaliana T-DNA lines are caused by T-DNA insertion mutagenesis. BMC Genomics 2021; 22:599. [PMID: 34362298 PMCID: PMC8348815 DOI: 10.1186/s12864-021-07877-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Accepted: 07/06/2021] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND Experimental proof of gene function assignments in plants is based on mutant analyses. T-DNA insertion lines provided an invaluable resource of mutants and enabled systematic reverse genetics-based investigation of the functions of Arabidopsis thaliana genes during the last decades. RESULTS We sequenced the genomes of 14 A. thaliana GABI-Kat T-DNA insertion lines, which eluded flanking sequence tag-based attempts to characterize their insertion loci, with Oxford Nanopore Technologies (ONT) long reads. Complex T-DNA insertions were resolved and 11 previously unknown T-DNA loci identified, resulting in about 2 T-DNA insertions per line and suggesting that this number was previously underestimated. T-DNA mutagenesis caused fusions of chromosomes along with compensating translocations to keep the gene set complete throughout meiosis. Also, an inverted duplication of 800 kbp was detected. About 10 % of GABI-Kat lines might be affected by chromosomal rearrangements, some of which do not involve T-DNA. Local assembly of selected reads was shown to be a computationally effective method to resolve the structure of T-DNA insertion loci. We developed an automated workflow to support investigation of long read data from T-DNA insertion lines. All steps from DNA extraction to assembly of T-DNA loci can be completed within days. CONCLUSIONS Long read sequencing was demonstrated to be an effective way to resolve complex T-DNA insertions and chromosome fusions. Many T-DNA insertions comprise not just a single T-DNA, but complex arrays of multiple T-DNAs. It is becoming obvious that T-DNA insertion alleles must be characterized by exact identification of both T-DNA::genome junctions to generate clear genotype-to-phenotype relations.
Collapse
|
11
|
Transcriptomic analysis of temporal shifts in berry development between two grapevine cultivars of the Pinot family reveals potential genes controlling ripening time. BMC PLANT BIOLOGY 2021; 21:327. [PMID: 34233614 PMCID: PMC8265085 DOI: 10.1186/s12870-021-03110-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Accepted: 06/22/2021] [Indexed: 05/16/2023]
Abstract
BACKGROUND Grapevine cultivars of the Pinot family represent clonally propagated mutants with major phenotypic and physiological differences, such as different colour or shifted ripening time, as well as changes in important viticultural traits. Specifically, the cultivars 'Pinot Noir' (PN) and 'Pinot Noir Precoce' (PNP, early ripening) flower at the same time, but vary in the beginning of berry ripening (veraison) and, consequently, harvest time. In addition to genotype, seasonal climatic conditions (i.e. high temperatures) also affect ripening times. To reveal possible regulatory genes that affect the timing of veraison onset, we investigated differences in gene expression profiles between PN and PNP throughout berry development with a closely meshed time series and over two separate years. RESULTS The difference in the duration of berry formation between PN and PNP was quantified to be approximately two weeks under the growth conditions applied, using plant material with a proven PN and PNP clonal relationship. Clusters of co-expressed genes and differentially expressed genes (DEGs) were detected which reflect the shift in the timing of veraison onset. Functional annotation of these DEGs fit to observed phenotypic and physiological changes during berry development. In total, we observed 3,342 DEGs in 2014 and 2,745 DEGs in 2017 between PN and PNP, with 1,923 DEGs across both years. Among these, 388 DEGs were identified as veraison-specific and 12 were considered as berry ripening time regulatory candidates. The expression profiles revealed two candidate genes for ripening time control which we designated VviRTIC1 and VviRTIC2 (VIT_210s0071g01145 and VIT_200s0366g00020, respectively). These genes likely contribute the phenotypic differences observed between PN and PNP. CONCLUSIONS Many of the 1,923 DEGs show highly similar expression profiles in both cultivars if the patterns are aligned according to developmental stage. In our work, putative genes differentially expressed between PNP and PN which could control ripening time as well as veraison-specific genes were identified. We point out connections of these genes to molecular events during berry development and discuss potential candidate genes which may control ripening time. Two of these candidates were observed to be differentially expressed in the early berry development phase. Several down-regulated genes during berry ripening are annotated as auxin response factors / ARFs. Conceivably, general changes in auxin signaling may cause the earlier ripening phenotype of PNP.
Collapse
|
12
|
A novel, robust and mating-competent Chlamydomonas reinhardtii strain with an enhanced transgene expression capacity for algal biotechnology. ACTA ACUST UNITED AC 2021; 31:e00644. [PMID: 34168966 PMCID: PMC8209186 DOI: 10.1016/j.btre.2021.e00644] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Revised: 05/26/2021] [Accepted: 06/06/2021] [Indexed: 11/03/2022]
Abstract
In the future, algae biotechnology could play an important role in sustainable development, especially with regard to the production of valuable chemicals. Among the established laboratory strains with efficient transgene expression, there are none that have demonstrated the required robustness for industrial applications, which generally require growth at larger scale. Here, we created a robust and mating-competent cell line of the green microalga Chlamydomonas reinhardtii, which also possesses a high transgene expression capacity. This strain shows a comparably high resistance to shear stress by accumulating increased amounts of biomass under these conditions. As a proof-of-concept, a high phototrophic productivity of cadaverine from CO2 and nitrate was demonstrated by efficiently expressing a bacterial l-lysine decarboxylase. In contrast to other established strains, this novel chassis strain for phototrophic production schemes is equipped with the traits required for industrial applications, by combining mating-competence, cell wall-mediated robustness and high level transgene expression.
Collapse
|
13
|
Functional and evolutionary analysis of the Arabidopsis 4R-MYB protein SNAPc4 as part of the SNAP complex. PLANT PHYSIOLOGY 2021; 185:1002-1020. [PMID: 33693812 PMCID: PMC8133616 DOI: 10.1093/plphys/kiaa067] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Accepted: 11/15/2020] [Indexed: 06/12/2023]
Abstract
Transcription initiation of the genes coding for small nuclear RNA (snRNA) has been extensively analyzed in humans and fruit fly, but only a single ortholog of a snRNA-activating protein complex (SNAPc) subunit has so far been characterized in plants. The genome of the model plant Arabidopsis thaliana encodes orthologs of all three core SNAPc subunits, including A. thaliana SNAP complex 4 (AtSNAPc4)-a 4R-MYB-type protein with four-and-a-half adjacent MYB repeat units. We report the conserved role of AtSNAPc4 as subunit of a protein complex involved in snRNA gene transcription and present genetic evidence that AtSNAPc4 is an essential gene in gametophyte and zygote development. We present experimental evidence that the three A. thaliana SNAPc subunits assemble into a SNAP complex and demonstrate the binding of AtSNAPc4 to snRNA promoters. In addition, co-localization studies show a link between AtSNAPc4 accumulation and Cajal bodies, known to aggregate at snRNA gene loci in humans. Moreover, we show the strong evolutionary conservation of single-copy 4R-MYB/SNAPc4 genes in a broad range of eukaryotes and present additional shared protein features besides the MYB domain, suggesting a conservation of the snRNA transcription initiation machinery along the course of the eukaryotic evolution.
Collapse
|
14
|
Reference-based QUantification Of gene Dispensability (QUOD). PLANT METHODS 2021; 17:18. [PMID: 33563309 PMCID: PMC7871624 DOI: 10.1186/s13007-021-00718-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Accepted: 02/03/2021] [Indexed: 05/03/2023]
Abstract
BACKGROUND Dispensability of genes in a phylogenetic lineage, e.g. a species, genus, or higher-level clade, is gaining relevance as most genome sequencing projects move to a pangenome level. Most analyses classify genes as core genes, which are present in all investigated individual genomes, and dispensable genes, which only occur in a single or a few investigated genomes. The binary classification as 'core' or 'dispensable' is often based on arbitrary cutoffs of presence/absence in the analysed genomes. Even when extended to 'conditionally dispensable', this concept still requires the assignment of genes to distinct groups. RESULTS Here, we present a new method which overcomes this distinct classification by quantifying gene dispensability and present a dedicated tool for reference-based QUantification Of gene Dispensability (QUOD). As a proof of concept, sequence data of 966 Arabidopsis thaliana accessions (Ath-966) were processed to calculate a gene-specific dispensability score for each gene based on normalised coverage in read mappings. We validated this score by comparison of highly conserved Benchmarking Universal Single Copy Orthologs (BUSCOs) to all other genes. The average scores of BUSCOs were significantly lower than the scores of non-BUSCOs. Analysis of variation demonstrated lower variation values between replicates of a single accession than between iteratively, randomly selected accessions from the whole dataset Ath-966. Functional investigations revealed defense and antimicrobial response genes among the genes with high-dispensability scores. CONCLUSIONS Instead of classifying a gene as core or dispensable, QUOD assigns a dispensability score to each gene. Hence, QUOD facilitates the identification of candidate dispensable genes, associated with high dispensability scores, which often underlie lineage-specific adaptation to varying environmental conditions.
Collapse
|
15
|
The transition to flowering in winter rapeseed during vernalization. PLANT, CELL & ENVIRONMENT 2021; 44:506-518. [PMID: 33190312 DOI: 10.1111/pce.13946] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Revised: 11/03/2020] [Accepted: 11/04/2020] [Indexed: 06/11/2023]
Abstract
Flowering time is a major determinant of adaptation, fitness and yield in the allopolyploid species rapeseed (Brassica napus). Despite being a close relative to Arabidopsis thaliana, little is known about the timing of floral transition and the genes that govern this process. Winter, semi-winter and spring type plants have important life history characteristics that differ in vernalization requirements for flowering and are important for growing rapeseed in different regions of the world. In this study, we investigated the timing of vernalization-driven floral transition in winter rapeseed and the effect of photoperiod and developmental age on flowering time and vernalization responsiveness. Microscopy and whole transcriptome analyses at the shoot apical meristems of plants grown under controlled conditions showed that floral transition is initiated within few weeks of vernalization. Certain Bna.SOC1 and Bna.SPL5 homeologs were among the induced genes, suggesting that they are regulating the timing of cold-induced floral transition. Moreover, the flowering response of plants with shorter pre-vernalization period correlated with a delayed expression of Bna.SOC1 and Bna.SPL5 genes. In essence, this study presents a detailed analysis of vernalization-driven floral transition and the aspects of juvenility and dormancy and their effect on flowering time in rapeseed.
Collapse
|
16
|
The R2R3-MYB gene family in banana (Musa acuminata): Genome-wide identification, classification and expression patterns. PLoS One 2020; 15:e0239275. [PMID: 33021974 PMCID: PMC7537896 DOI: 10.1371/journal.pone.0239275] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Accepted: 09/03/2020] [Indexed: 11/19/2022] Open
Abstract
The R2R3-MYB genes comprise one of the largest transcription factor gene families in plants, playing regulatory roles in plant-specific developmental processes, defense responses and metabolite accumulation. To date MYB family genes have not yet been comprehensively identified in the major staple fruit crop banana. In this study, we present a comprehensive, genome-wide analysis of the MYB genes from Musa acuminata DH-Pahang (A genome). A total of 285 R2R3-MYB genes as well as genes encoding three other classes of MYB proteins containing multiple MYB repeats were identified and characterised with respect to structure and chromosomal organisation. Organ- and development-specific expression patterns were determined from RNA-Seq data. For 280 M. acuminata MYB genes for which expression was found in at least one of the analysed samples, a variety of expression patterns were detected. The M. acuminata R2R3-MYB genes were functionally categorised, leading to the identification of seven clades containing only M. acuminata R2R3-MYBs. The encoded proteins may have specialised functions that were acquired or expanded in Musa during genome evolution. This functional classification and expression analysis of the MYB gene family in banana establishes a solid foundation for future comprehensive functional analysis of MaMYBs and can be utilized in banana improvement programmes.
Collapse
|
17
|
High Contiguity De Novo Genome Sequence Assembly of Trifoliate Yam ( Dioscorea dumetorum) Using Long Read Sequencing. Genes (Basel) 2020; 11:E274. [PMID: 32143301 PMCID: PMC7140821 DOI: 10.3390/genes11030274] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Revised: 02/25/2020] [Accepted: 02/29/2020] [Indexed: 12/17/2022] Open
Abstract
Trifoliate yam (Dioscorea dumetorum) is one example of an orphan crop, not traded internationally. Post-harvest hardening of the tubers of this species starts within 24 h after harvesting and renders the tubers inedible. Genomic resources are required for D. dumetorum to improve breeding for non-hardening varieties as well as for other traits. We sequenced the D. dumetorum genome and generated the corresponding annotation. The two haplophases of this highly heterozygous genome were separated to a large extent. The assembly represents 485 Mbp of the genome with an N50 of over 3.2 Mbp. A total of 35,269 protein-encoding gene models as well as 9941 non-coding RNA genes were predicted, and functional annotations were assigned.
Collapse
|
18
|
A Partially Phase-Separated Genome Sequence Assembly of the Vitis Rootstock 'Börner' ( Vitis riparia × Vitis cinerea) and Its Exploitation for Marker Development and Targeted Mapping. FRONTIERS IN PLANT SCIENCE 2020; 11:156. [PMID: 32194587 PMCID: PMC7064618 DOI: 10.3389/fpls.2020.00156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Accepted: 01/31/2020] [Indexed: 06/10/2023]
Abstract
Grapevine breeding has become highly relevant due to upcoming challenges like climate change, a decrease in the number of available fungicides, increasing public concern about plant protection, and the demand for a sustainable production. Downy mildew caused by Plasmopara viticola is one of the most devastating diseases worldwide of cultivated Vitis vinifera. In modern breeding programs, therefore, genetic marker technologies and genomic data are used to develop new cultivars with defined and stacked resistance loci. Potential sources of resistance are wild species of American or Asian origin. The interspecific hybrid of Vitis riparia Gm 183 x Vitis cinerea Arnold, available as the rootstock cultivar 'Börner,' carries several relevant resistance loci. We applied next-generation sequencing to enable the reliable identification of simple sequence repeats (SSR), and we also generated a draft genome sequence assembly of 'Börner' to access genome-wide sequence variations in a comprehensive and highly reliable way. These data were used to cover the 'Börner' genome with genetic marker positions. A subset of these marker positions was used for targeted mapping of the P. viticola resistance locus, Rpv14, to validate the marker position list. Based on the reference genome sequence PN40024, the position of this resistance locus can be narrowed down to less than 0.5 Mbp on chromosome 5.
Collapse
|
19
|
Improved detection of familial defective apolipoprotein B-100 by restriction-site-introducing polymerase chain reaction. Clin Chem 2019. [DOI: 10.1093/clinchem/39.9.2026] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
20
|
Twenty-Five Years of Propagation in Suspension Cell Culture Results in Substantial Alterations of the Arabidopsis Thaliana Genome. Genes (Basel) 2019; 10:E671. [PMID: 31480756 PMCID: PMC6770967 DOI: 10.3390/genes10090671] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Revised: 08/23/2019] [Accepted: 08/29/2019] [Indexed: 01/16/2023] Open
Abstract
Arabidopsis thaliana is one of the best studied plant model organisms. Besides cultivation in greenhouses, cells of this plant can also be propagated in suspension cell culture. At7 is one such cell line that was established about 25 years ago. Here, we report the sequencing and the analysis of the At7 genome. Large scale duplications and deletions compared to the Columbia-0 (Col-0) reference sequence were detected. The number of deletions exceeds the number of insertions, thus indicating that a haploid genome size reduction is ongoing. Patterns of small sequence variants differ from the ones observed between A. thaliana accessions, e.g., the number of single nucleotide variants matches the number of insertions/deletions. RNA-Seq analysis reveals that disrupted alleles are less frequent in the transcriptome than the native ones.
Collapse
|
21
|
Characterization of genes and alleles involved in the control of flowering time in grapevine. PLoS One 2019; 14:e0214703. [PMID: 31269026 PMCID: PMC6608932 DOI: 10.1371/journal.pone.0214703] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Accepted: 06/18/2019] [Indexed: 12/30/2022] Open
Abstract
Grapevine (Vitis vinifera) is one of the most important perennial crop plants in worldwide. Understanding of developmental processes like flowering, which impact quality and quantity of yield in this species is therefore of high interest. This gets even more important when considering some of the expected consequences of climate change. Earlier bud burst and flowering, for example, may result in yield loss due to spring frost. Berry ripening under higher temperatures will impact wine quality. Knowledge of interactions between a genotype or allele combination and the environment can be used for the breeding of genotypes that are better adapted to new climatic conditions. To this end, we have generated a list of more than 500 candidate genes that may play a role in the timing of flowering. The grapevine genome was exploited for flowering time control gene homologs on the basis of functional data from model organisms like A. thaliana. In a previous study, a mapping population derived from early flowering GF.GA-47-42 and late flowering 'Villard Blanc' was analyzed for flowering time QTLs. In a second step we have now established a workflow combining amplicon sequencing and bioinformatics to follow alleles of selected candidate genes in the F1 individuals and the parental genotypes. Allele combinations of these genes in individuals of the mapping population were correlated with early or late flowering phenotypes. Specific allele combinations of flowering time candidate genes within and outside of the QTL regions for flowering time on chromosome 1, 4, 14, 17, and 18 were found to be associated with an early flowering phenotype. In addition, expression of many of the flowering candidate genes was analyzed over consecutive stages of bud and inflorescence development indicating functional roles of these genes in the flowering control network.
Collapse
|
22
|
A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set. PLoS One 2019; 14:e0216233. [PMID: 31112551 PMCID: PMC6529160 DOI: 10.1371/journal.pone.0216233] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 04/16/2019] [Indexed: 01/27/2023] Open
Abstract
In addition to the BAC-based reference sequence of the accession Columbia-0 from the year 2000, several short read assemblies of THE plant model organism Arabidopsis thaliana were published during the last years. Also, a SMRT-based assembly of Landsberg erecta has been generated that identified translocation and inversion polymorphisms between two genotypes of the species. Here we provide a chromosome-arm level assembly of the A. thaliana accession Niederzenz-1 (AthNd-1_v2c) based on SMRT sequencing data. The best assembly comprises 69 nucleome sequences and displays a contig length of up to 16 Mbp. Compared to an earlier Illumina short read-based NGS assembly (AthNd-1_v1), a 75 fold increase in contiguity was observed for AthNd-1_v2c. To assign contig locations independent from the Col-0 gold standard reference sequence, we used genetic anchoring to generate a de novo assembly. In addition, we assembled the chondrome and plastome sequences. Detailed analyses of AthNd-1_v2c allowed reliable identification of large genomic rearrangements between A. thaliana accessions contributing to differences in the gene sets that distinguish the genotypes. One of the differences detected identified a gene that is lacking from the Col-0 gold standard sequence. This de novo assembly extends the known proportion of the A. thaliana pan-genome.
Collapse
|
23
|
Abstract
This chapter describes a transient protoplast co-transfection method that can be used to quantitatively study in vivo the activity and function of promoters and promoter elements (reporters), and their induction or repression by transcription factors (effectors), stresses, hormones, or metabolites. A detailed protocol for carrying out transient co-transfection assays with Arabidopsis At7 protoplasts and calculating the promoter activity is provided.
Collapse
|
24
|
Consideration of non-canonical splice sites improves gene prediction on the Arabidopsis thaliana Niederzenz-1 genome sequence. BMC Res Notes 2017; 10:667. [PMID: 29202864 PMCID: PMC5716242 DOI: 10.1186/s13104-017-2985-y] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2017] [Accepted: 11/23/2017] [Indexed: 12/26/2022] Open
Abstract
Objective The Arabidopsis thaliana Niederzenz-1 genome sequence was recently published with an ab initio gene prediction. In depth analysis of the predicted gene set revealed some errors involving genes with non-canonical splice sites in their introns. Since non-canonical splice sites are difficult to predict ab initio, we checked for options to improve the annotation by transferring annotation information from the recently released Columbia-0 reference genome sequence annotation Araport11. Results Incorporation of hints generated from Araport11 enabled the precise prediction of non-canonical splice sites. Manual inspection of RNA-Seq read mapping and RT-PCR were applied to validate the structural annotations of non-canonical splice sites. Predictions of untranslated regions were also updated by harnessing the potential of Araport11’s information, which was generated by using high coverage RNA-Seq data. The improved gene set of the Nd-1 genome assembly (GeneSet_Nd-1_v1.1) was evaluated via comparison to the initial gene prediction (GeneSet_Nd-1_v1.0) as well as against Araport11 for the Col-0 reference genome sequence. GeneSet_Nd-1_v1.1 contains previously missed non-canonical splice sites in 1256 genes. Reciprocal best hits for 24,527 (89.4%) of all nuclear Col-0 genes against the GeneSet_Nd-1_v1.1 indicate a high gene prediction quality. Electronic supplementary material The online version of this article (10.1186/s13104-017-2985-y) contains supplementary material, which is available to authorized users.
Collapse
|
25
|
Crop wild relative populations of Beta vulgaris allow direct mapping of agronomically important genes. Nat Commun 2017; 8:15708. [PMID: 28585529 PMCID: PMC5467160 DOI: 10.1038/ncomms15708] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2016] [Accepted: 04/21/2017] [Indexed: 01/13/2023] Open
Abstract
Rapid identification of agronomically important genes is of pivotal interest for crop breeding. One source of such genes are crop wild relative (CWR) populations. Here we used a CWR population of <200 wild beets (B. vulgaris ssp. maritima), sampled in their natural habitat, to identify the sugar beet (Beta vulgaris ssp. vulgaris) resistance gene Rz2 with a modified version of mapping-by-sequencing (MBS). For that, we generated a draft genome sequence of the wild beet. Our results show the importance of preserving CWR in situ and demonstrate the great potential of CWR for rapid discovery of causal genes relevant for crop improvement. The candidate gene for Rz2 was identified by MBS and subsequently corroborated via RNA interference (RNAi). Rz2 encodes a CC-NB-LRR protein. Access to the DNA sequence of Rz2 opens the path to improvement of resistance towards rhizomania not only by marker-assisted breeding but also by genome editing. Variation among wild relatives of crop plants can be used to identify genes underlying traits of agronomic importance. Here, the authors show that a modified mapping-by-sequencing approach can rapidly identify the genetic basis for viral resistance in sugar beet using wild beet populations in their natural habitat.
Collapse
|
26
|
The AtMYB12 activation domain maps to a short C-terminal region of the transcription factor. ACTA ACUST UNITED AC 2017; 72:251-257. [DOI: 10.1515/znc-2016-0221] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2016] [Accepted: 02/05/2017] [Indexed: 01/14/2023]
Abstract
Abstract
The Arabidopsis thaliana R2R3-MYB transcription factor MYB12 is a light-inducible, flavonol-specific activator of flavonoid biosynthesis. The transactivation activity of the AtMYB12 protein was analyzed using a C-terminal deletion series in a transient A. thaliana protoplast assay with the goal of mapping the activation domain (AD). Although the deletion of the last 46 C-terminal amino acids did not affect the activation capacity, the deletion of the last 98 amino acids almost totally abolished transactivation of two different target promoters. A domain swap experiment using the yeast GAL4 DNA-binding domain revealed that the region from positions 282 to 328 of AtMYB12 was sufficient for transactivation. In contrast to the R2R3-MYB ADs known thus far, that of AtMYB12 is not located at the rearmost C-terminal end of the protein. The AtMYB12 AD is conserved in other experimentally proven R2R3-MYB flavonol regulators from different species.
Collapse
|
27
|
Enhancing the GABI-Kat Arabidopsis thaliana T-DNA Insertion Mutant Database by Incorporating Araport11 Annotation. PLANT & CELL PHYSIOLOGY 2017; 58:e7. [PMID: 28013277 PMCID: PMC5444572 DOI: 10.1093/pcp/pcw205] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 11/11/2016] [Indexed: 05/29/2023]
Abstract
SimpleSearch provides access to a database containing information about T-DNA insertion lines of the GABI-Kat collection of Arabidopsis thaliana mutants. These mutants are an important tool for reverse genetics, and GABI-Kat is the second largest collection of such T-DNA insertion mutants. Insertion sites were deduced from flanking sequence tags (FSTs), and the database contains information about mutant plant lines as well as insertion alleles. Here, we describe improvements within the interface (available at http://www.gabi-kat.de/db/genehits.php) and with regard to the database content that have been realized in the last five years. These improvements include the integration of the Araport11 genome sequence annotation data containing the recently updated A. thaliana structural gene descriptions, an updated visualization component that displays groups of insertions with very similar insertion positions, mapped confirmation sequences, and primers. The visualization component provides a quick way to identify insertions of interest, and access to improved data about the exact structure of confirmed insertion alleles. In addition, the database content has been extended by incorporating additional insertion alleles that were detected during the confirmation process, as well as by adding new FSTs that have been produced during continued efforts to complement gaps in FST availability. Finally, the current database content regarding predicted and confirmed insertion alleles as well as primer sequences has been made available as downloadable flat files.
Collapse
|
28
|
ReadXplorer 2-detailed read mapping analysis and visualization from one single source. Bioinformatics 2016; 32:3702-3708. [PMID: 27540267 PMCID: PMC5167064 DOI: 10.1093/bioinformatics/btw541] [Citation(s) in RCA: 69] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2016] [Revised: 08/02/2016] [Accepted: 08/15/2016] [Indexed: 01/29/2023] Open
Abstract
MOTIVATION The vast amount of already available and currently generated read mapping data requires comprehensive visualization, and should benefit from bioinformatics tools offering a wide spectrum of analysis functionality from just one source. Appropriate handling of multiple mapped reads during mapping analyses remains an issue that demands improvement. RESULTS The capabilities of the read mapping analysis and visualization tool ReadXplorer were vastly enhanced. Here, we present an even finer granulated read mapping classification, improving the level of detail for analyses and visualizations. The spectrum of automatic analysis functions has been broadened to include genome rearrangement detection as well as correlation analysis between two mapping data sets. Existing functions were refined and enhanced, namely the computation of differentially expressed genes, the read count and normalization analysis and the transcription start site detection. Additionally, ReadXplorer 2 features a highly improved support for large eukaryotic data sets and a command line version, enabling its integration into workflows. Finally, the new version is now able to display any kind of tabular results from other bioinformatics tools. AVAILABILITY AND IMPLEMENTATION http://www.readxplorer.org CONTACT: readxplorer@computational.bio.uni-giessen.deSupplementary information: Supplementary data are available at Bioinformatics online.
Collapse
|
29
|
A De Novo Genome Sequence Assembly of the Arabidopsis thaliana Accession Niederzenz-1 Displays Presence/Absence Variation and Strong Synteny. PLoS One 2016; 11:e0164321. [PMID: 27711162 PMCID: PMC5053417 DOI: 10.1371/journal.pone.0164321] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2016] [Accepted: 09/22/2016] [Indexed: 11/23/2022] Open
Abstract
Arabidopsis thaliana is the most important model organism for fundamental plant biology. The genome diversity of different accessions of this species has been intensively studied, for example in the 1001 genome project which led to the identification of many small nucleotide polymorphisms (SNPs) and small insertions and deletions (InDels). In addition, presence/absence variation (PAV), copy number variation (CNV) and mobile genetic elements contribute to genomic differences between A. thaliana accessions. To address larger genome rearrangements between the A. thaliana reference accession Columbia-0 (Col-0) and another accession of about average distance to Col-0, we created a de novo next generation sequencing (NGS)-based assembly from the accession Niederzenz-1 (Nd-1). The result was evaluated with respect to assembly strategy and synteny to Col-0. We provide a high quality genome sequence of the A. thaliana accession (Nd-1, LXSY01000000). The assembly displays an N50 of 0.590 Mbp and covers 99% of the Col-0 reference sequence. Scaffolds from the de novo assembly were positioned on the basis of sequence similarity to the reference. Errors in this automatic scaffold anchoring were manually corrected based on analyzing reciprocal best BLAST hits (RBHs) of genes. Comparison of the final Nd-1 assembly to the reference revealed duplications and deletions (PAV). We identified 826 insertions and 746 deletions in Nd-1. Randomly selected candidates of PAV were experimentally validated. Our Nd-1 de novo assembly allowed reliable identification of larger genic and intergenic variants, which was difficult or error-prone by short read mapping approaches alone. While overall sequence similarity as well as synteny is very high, we detected short and larger (affecting more than 100 bp) differences between Col-0 and Nd-1 based on bi-directional comparisons. The de novo assembly provided here and additional assemblies that will certainly be published in the future will allow to describe the pan-genome of A. thaliana.
Collapse
|
30
|
Chloroplast Genome Sequence of Arabidopsis thaliana Accession Landsberg erecta, Assembled from Single-Molecule, Real-Time Sequencing Data. GENOME ANNOUNCEMENTS 2016; 4:e00975-16. [PMID: 27660776 PMCID: PMC5034127 DOI: 10.1128/genomea.00975-16] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 07/19/2016] [Accepted: 08/08/2016] [Indexed: 11/20/2022]
Abstract
A publicly available data set from Pacific Biosciences was used to create an assembly of the chloroplast genome sequence of the Arabidopsis thaliana genotype Landsberg erecta The assembly is solely based on single-molecule, real-time sequencing data and hence provides high resolution of the two inverted repeat regions typically contained in chloroplast genomes.
Collapse
|
31
|
Evolution of tonoplast P-ATPase transporters involved in vacuolar acidification. THE NEW PHYTOLOGIST 2016; 211:1092-107. [PMID: 27214749 DOI: 10.1111/nph.14008] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2016] [Accepted: 03/07/2016] [Indexed: 05/21/2023]
Abstract
Petunia mutants (Petunia hybrida) with blue flowers defined a novel vacuolar proton pump consisting of two interacting P-ATPases, PH1 and PH5, that hyper-acidify the vacuoles of petal cells. PH5 is similar to plasma membrane H(+) P3A -ATPase, whereas PH1 is the only known eukaryoticP3B -ATPase. As there were no indications that this tonoplast pump is widespread in plants, we investigated the distribution and evolution of PH1 and PH5. We combined database mining and phylogenetic and synteny analyses of PH1- and PH5-like proteins from all kingdoms with functional analyses (mutant complementation and intracellular localization) of homologs from diverse angiosperms. We identified functional PH1 and PH5 homologs in divergent angiosperms. PH5 homologs evolved from plasma membrane P3A -ATPases, acquiring an N-terminal tonoplast-sorting sequence and new cellular function before angiosperms appeared. PH1 is widespread among seed plants and related proteins are found in some groups of bacteria and fungi and in one moss, but is absent in most algae, suggesting that its evolution involved several cases of gene loss and possibly horizontal transfer events. The distribution of PH1 and PH5 in the plant kingdom suggests that vacuolar acidification by P-ATPases appeared in gymnosperms before flowers. This implies that, next to flower color determination, vacuolar hyper-acidification is required for yet unknown processes.
Collapse
|
32
|
Repeat Composition of CenH3-chromatin and H3K9me2-marked heterochromatin in Sugar Beet (Beta vulgaris). BMC PLANT BIOLOGY 2016; 16:120. [PMID: 27230558 PMCID: PMC4881148 DOI: 10.1186/s12870-016-0805-5] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Accepted: 05/17/2016] [Indexed: 05/18/2023]
Abstract
BACKGROUND Sugar beet (Beta vulgaris) is an important crop of temperate climate zones, which provides nearly 30 % of the world's annual sugar needs. From the total genome size of 758 Mb, only 567 Mb were incorporated in the recently published genome sequence, due to the fact that regions with high repetitive DNA contents (e.g. satellite DNAs) are only partially included. Therefore, to fill these gaps and to gain information about the repeat composition of centromeres and heterochromatic regions, we performed chromatin immunoprecipitation followed by sequencing (ChIP-Seq) using antibodies against the centromere-specific histone H3 variant of sugar beet (CenH3) and the heterochromatic mark of dimethylated lysine 9 of histone H3 (H3K9me2). RESULTS ChIP-Seq analysis revealed that active centromeres containing CenH3 consist of the satellite pBV and the Ty3-gypsy retrotransposon Beetle7, while heterochromatin marked by H3K9me2 exhibits heterogeneity in repeat composition. H3K9me2 was mainly associated with the satellite family pEV, the Ty1-copia retrotransposon family Cotzilla and the DNA transposon superfamily of the En/Spm type. In members of the section Beta within the genus Beta, immunostaining using the CenH3 antibody was successful, indicating that orthologous CenH3 proteins are present in closely related species within this section. CONCLUSIONS The identification of repetitive genome portions by ChIP-Seq experiments complemented the sugar beet reference sequence by providing insights into the repeat composition of poorly characterized CenH3-chromatin and H3K9me2-heterochromatin. Therefore, our work provides the basis for future research and application concerning the sugar beet centromere and repeat-rich heterochromatic regions characterized by the presence of H3K9me2.
Collapse
|
33
|
Rapid gene identification in sugar beet using deep sequencing of DNA from phenotypic pools selected from breeding panels. BMC Genomics 2016; 17:236. [PMID: 26980001 PMCID: PMC4791833 DOI: 10.1186/s12864-016-2566-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2015] [Accepted: 03/03/2016] [Indexed: 12/25/2022] Open
Abstract
BACKGROUND The combination of bulk segregant analysis (BSA) and next generation sequencing (NGS), also known as mapping by sequencing (MBS), has been shown to significantly accelerate the identification of causal mutations for species with a reference genome sequence. The usual approach is to cross homozygous parents that differ for the monogenic trait to address, to perform deep sequencing of DNA from F2 plants pooled according to their phenotype, and subsequently to analyze the allele frequency distribution based on a marker table for the parents studied. The method has been successfully applied for EMS induced mutations as well as natural variation. Here, we show that pooling genetically diverse breeding lines according to a contrasting phenotype also allows high resolution mapping of the causal gene in a crop species. The test case was the monogenic locus causing red vs. green hypocotyl color in Beta vulgaris (R locus). RESULTS We determined the allele frequencies of polymorphic sequences using sequence data from two diverging phenotypic pools of 180 B. vulgaris accessions each. A single interval of about 31 kbp among the nine chromosomes was identified which indeed contained the causative mutation. CONCLUSIONS By applying a variation of the mapping by sequencing approach, we demonstrated that phenotype-based pooling of diverse accessions from breeding panels and subsequent direct determination of the allele frequency distribution can be successfully applied for gene identification in a crop species. Our approach made it possible to identify a small interval around the causative gene. Sequencing of parents or individual lines was not necessary. Whenever the appropriate plant material is available, the approach described saves time compared to the generation of an F2 population. In addition, we provide clues for planning similar experiments with regard to pool size and the sequencing depth required.
Collapse
|
34
|
Natural variation in flavonol accumulation in Arabidopsis is determined by the flavonol glucosyltransferase BGLU6. JOURNAL OF EXPERIMENTAL BOTANY 2016; 67:1505-17. [PMID: 26717955 PMCID: PMC4762388 DOI: 10.1093/jxb/erv546] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Flavonols are colourless secondary metabolites, primarily regarded as UV-protection pigments that are deposited in plants in their glycosylated forms. The glycosylation of flavonols is mainly catalysed by UDP-sugar-dependent glycosyltransferases (UGTs). Although the structures of flavonol glycosides accumulating in Arabidopsis thaliana are known, many genes involved in the flavonol glycosylation pathway are yet to be discovered. The flavonol glycoside profiles of seedlings from 81 naturally occurring A. thaliana accessions were screened using high performance thin layer chromatography. A qualitative variation in flavonol 3-O-gentiobioside 7-O-rhamnoside (F3GG7R) content was identified. Ler × Col-0 recombinant inbred line mapping and whole genome association mapping led to the identification of a glycoside hydrolase family 1-type gene, At1g60270/BGLU6, that encodes a homolog of acyl-glucose-dependent glucosyltransferases involved in the glycosylation of anthocyanins, possibly localized in the cytoplasm, and that is co-expressed with genes linked to phenylpropanoid biosynthesis. A causal single nucleotide polymorphism introducing a premature stop codon in non-producer accessions was found to be absent in the producers. Several other naturally occurring loss-of-function alleles were also identified. Two independent bglu6 T-DNA insertion mutants from the producer accessions showed loss of F3GG7R. Furthermore, bglu6 mutant lines complemented with the genomic Ler BGLU6 gene confirmed that BGLU6 is essential for production of F3GGR7. We have thus identified an accession-specific gene that causes a qualitative difference in flavonol glycoside accumulation in A. thaliana strains. This gene encodes a flavonol 3-O-glucoside: 6″-O-glucosyltransferase that does not belong to the large canonical family of flavonol glycosyltransferases that use UDP-conjugates as the activated sugar donor substrate.
Collapse
|
35
|
Diversification, evolution and methylation of short interspersed nuclear element families in sugar beet and related Amaranthaceae species. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 85:229-44. [PMID: 26676716 DOI: 10.1111/tpj.13103] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2015] [Revised: 11/23/2015] [Accepted: 11/26/2015] [Indexed: 05/18/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous non-long terminal repeat retrotransposons which are widely distributed in eukaryotic organisms. While SINEs have been intensively studied in animals, only limited information is available about plant SINEs. We analysed 22 SINE families from seven genomes of the Amaranthaceae family and identified 34 806 SINEs, including 19 549 full-length copies. With the focus on sugar beet (Beta vulgaris), we performed a comparative analysis of the diversity, genomic and chromosomal organization and the methylation of SINEs to provide a detailed insight into the evolution and age of Amaranthaceae SINEs. The lengths of consensus sequences of SINEs range from 113 nucleotides (nt) up to 224 nt. The SINEs show dispersed distribution on all chromosomes but were found with higher incidence in subterminal euchromatic chromosome regions. The methylation of SINEs is increased compared with their flanking regions, and the strongest effect is visible for cytosines in the CHH context, indicating an involvement of asymmetric methylation in the silencing of SINEs.
Collapse
|
36
|
The Structural Features of Thousands of T-DNA Insertion Sites Are Consistent with a Double-Strand Break Repair-Based Insertion Mechanism. MOLECULAR PLANT 2015; 8:1651-64. [PMID: 26343971 DOI: 10.1016/j.molp.2015.08.011] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2015] [Revised: 07/28/2015] [Accepted: 08/13/2015] [Indexed: 05/06/2023]
Abstract
Transformation by Agrobacterium tumefaciens, an important tool in modern plant research, involves the integration of T-DNA initially present on a plasmid in agrobacteria into the genome of plant cells. The process of attachment of the agrobacteria to plant cells and the transport of T-DNA into the cell and further to the nucleus has been well described. However, the exact mechanism of integration into the host's DNA is still unclear, although several models have been proposed. During confirmation of T-DNA insertion alleles from the GABI-Kat collection of Arabidopsis thaliana mutants, we have generated about 34,000 sequences from the junctions between inserted T-DNA and adjacent genome regions. Here, we describe the evaluation of this dataset with regard to existing models for T-DNA integration. The results suggest that integration into the plant genome is mainly mediated by the endogenous plant DNA repair machinery. The observed integration events showed characteristics highly similar to those of repair sites of double-strand breaks with respect to microhomology and deletion sizes. In addition, we describe unexpected integration events, such as large deletions and inversions at the integration site that are relevant for correct interpretation of results from T-DNA insertion mutants in reverse genetics experiments.
Collapse
|
37
|
SMRT sequencing only de novo assembly of the sugar beet (Beta vulgaris) chloroplast genome. BMC Bioinformatics 2015; 16:295. [PMID: 26377912 PMCID: PMC4573686 DOI: 10.1186/s12859-015-0726-6] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2015] [Accepted: 09/06/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Third generation sequencing methods, like SMRT (Single Molecule, Real-Time) sequencing developed by Pacific Biosciences, offer much longer read length in comparison to Next Generation Sequencing (NGS) methods. Hence, they are well suited for de novo- or re-sequencing projects. Sequences generated for these purposes will not only contain reads originating from the nuclear genome, but also a significant amount of reads originating from the organelles of the target organism. These reads are usually discarded but they can also be used for an assembly of organellar replicons. The long read length supports resolution of repetitive regions and repeats within the organelles genome which might be problematic when just using short read data. Additionally, SMRT sequencing is less influenced by GC rich areas and by long stretches of the same base. RESULTS We describe a workflow for a de novo assembly of the sugar beet (Beta vulgaris ssp. vulgaris) chloroplast genome sequence only based on data originating from a SMRT sequencing dataset targeted on its nuclear genome. We show that the data obtained from such an experiment are sufficient to create a high quality assembly with a higher reliability than assemblies derived from e.g. Illumina reads only. The chloroplast genome is especially challenging for de novo assembling as it contains two large inverted repeat (IR) regions. We also describe some limitations that still apply even though long reads are used for the assembly. CONCLUSIONS SMRT sequencing reads extracted from a dataset created for nuclear genome (re)sequencing can be used to obtain a high quality de novo assembly of the chloroplast of the sequenced organism. Even with a relatively small overall coverage for the nuclear genome it is possible to collect more than enough reads to generate a high quality assembly that outperforms short read based assemblies. However, even with long reads it is not always possible to clarify the order of elements of a chloroplast genome sequence reliantly which we could demonstrate with Fosmid End Sequences (FES) generated with Sanger technology. Nevertheless, this limitation also applies to short read sequencing data but is reached in this case at a much earlier stage during finishing.
Collapse
|
38
|
Exploiting single-molecule transcript sequencing for eukaryotic gene prediction. Genome Biol 2015; 16:184. [PMID: 26328666 PMCID: PMC4556409 DOI: 10.1186/s13059-015-0729-7] [Citation(s) in RCA: 94] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2015] [Accepted: 07/22/2015] [Indexed: 12/20/2022] Open
Abstract
We develop a method to predict and validate gene models using PacBio single-molecule, real-time (SMRT) cDNA reads. Ninety-eight percent of full-insert SMRT reads span complete open reading frames. Gene model validation using SMRT reads is developed as automated process. Optimized training and prediction settings and mRNA-seq noise reduction of assisting Illumina reads results in increased gene prediction sensitivity and precision. Additionally, we present an improved gene set for sugar beet (Beta vulgaris) and the first genome-wide gene set for spinach (Spinacia oleracea). The workflow and guidelines are a valuable resource to obtain comprehensive gene sets for newly sequenced genomes of non-model eukaryotes.
Collapse
|
39
|
TRANSPARENT TESTA 13 is a tonoplast P3A -ATPase required for vacuolar deposition of proanthocyanidins in Arabidopsis thaliana seeds. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2015; 82:840-9. [PMID: 25891958 DOI: 10.1111/tpj.12854] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2015] [Revised: 04/05/2015] [Accepted: 04/09/2015] [Indexed: 05/23/2023]
Abstract
Intracellular pH homeostasis is essential for all living cells. In plants, pH is usually maintained by three structurally distinct and differentially localized types of proton pump: P-type H(+) -ATPases in the plasma membrane, and multimeric vacuolar-type H(+) -ATPases (V-ATPases) and vacuolar H(+) -pyrophosphatases (H(+) -PPases) in endomembranes. Here, we show that reduced accumulation of proanthocyanidins (PAs) and hence the diminished brown seed coloration found in the Arabidopsis thaliana mutant transparent testa 13 (tt13) is caused by disruption of the gene encoding the P3A -ATPase AHA10. Identification of the gene encoded by the tt13 locus completes the molecular characterization of the classical set of transparent testa mutants. Cells of the tt13 seed coat endothelium do not contain PA-filled central vacuoles as observed in the wild-type. tt13 phenocopies tt12, a mutant that is defective in vacuolar import of the PA precursor epicatechin. Our data show that vacuolar loading with PA precursors depends on TT13. Consistent with the tt13 phenotype, but in contrast to other isoforms of P-type H(+) -ATPases, TT13 localizes to the tonoplast. PA accumulation in tt13 is partially restored by expression of the tonoplast localized H(+) -PPase VHP1. Our findings indicate that the P3A -ATPase TT13 functions as a proton pump in the tonoplast of seed coat endothelium cells, and generates the driving force for TT12-mediated transport of PA precursors to the vacuole.
Collapse
|
40
|
Fast forward genetics to identify mutations causing a high light tolerant phenotype in Chlamydomonas reinhardtii by whole-genome-sequencing. BMC Genomics 2015; 16:57. [PMID: 25730202 PMCID: PMC4336690 DOI: 10.1186/s12864-015-1232-y] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2014] [Accepted: 01/12/2015] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND High light tolerance of microalgae is a desired phenotype for efficient cultivation in large scale production systems under fluctuating outdoor conditions. Outdoor cultivation requires the use of either wild-type or non-GMO derived mutant strains due to safety concerns. The identification and molecular characterization of such mutants derived from untagged forward genetics approaches was limited previously by the tedious and time-consuming methods involving techniques such as classical meiotic mapping. The combination of mapping with next generation sequencing technologies offers alternative strategies to identify genes involved in high light adaptation in untagged mutants. RESULTS We used the model alga Chlamydomonas reinhardtii in a non-GMO mutation strategy without any preceding crossing step or pooled progeny to identify genes involved in the regulatory processes of high light adaptation. To generate high light tolerant mutants, wildtype cells were mutagenized only to a low extent, followed by a stringent selection. We performed whole-genome sequencing of two independent mutants hit1 and hit2 and the parental wildtype. The availability of a reference genome sequence and the removal of shared bakground variants between the wildtype strain and each mutant, enabled us to identify two single nucleotide polymorphisms within the same gene Cre02.g085050, hereafter called LRS1 (putative Light Response Signaling protein 1). These two independent single amino acid exchanges are both located in the putative WD40 propeller domain of the corresponding protein LRS1. Both mutants exhibited an increased rate of non-photochemical-quenching (NPQ) and an improved resistance against chemically induced reactive oxygen species. In silico analyses revealed homology of LRS1 to the photoregulatory protein COP1 in plants. CONCLUSIONS In this work we identified the nuclear encoded gene LRS1 as an essential factor for high light adaptation in C. reinhardtii. The causative random mutation within this gene was identified by a rapid and efficient method, avoiding any preceding crossing step, meiotic mapping, or pooled progeny. Our results open up new insights into mechanisms of high light adaptation in microalgae and at the same time provide a simplified strategy for non-GMO forward genetics, a crucial precondition that could result in the identification of key factors for economically relevant biological processes within algae.
Collapse
|
41
|
Update on transparent testa mutants from Arabidopsis thaliana: characterisation of new alleles from an isogenic collection. PLANTA 2014; 240:955-70. [PMID: 24903359 DOI: 10.1007/s00425-014-2088-0] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2013] [Accepted: 04/23/2014] [Indexed: 05/23/2023]
Abstract
We present a comprehensive overview on flavonoid-related phenotypes of A. thaliana tt and tds mutants, provide tools for their characterisation, increase the number of available alleles and demonstrate that tds3 is allelic to tt12 and tds5 to aha10. Flavonoid biosynthesis is one of the best-studied secondary metabolite pathways in plants. In the model system Arabidopsis thaliana it leads to the synthesis of three phenolic compound classes: flavonol glycosides, anthocyanins and proanthocyanidins (PAs). PAs appear brown in their oxidised polymeric forms, and most A. thaliana mutants impaired in flavonoid accumulation were identified through screens for lack of this seed coat pigmentation. These mutants are referred to as transparent testa (tt) or tannin-deficient seed (tds). More than 20 mutants of these types have been published, probably representing most of the genes relevant for PA accumulation in A. thaliana. However, data about the genes involved in PA deposition or oxidation are still rather scarce. Also, for some of the known mutants it is unclear if they represent additional loci or if they are allelic to known genes. For the present study, we have performed a systematic phenotypic characterisation of almost all available tt and tds mutants and built a collection of mutants in the genetic background of the accession Columbia to minimise effects arising from ecotype variation. We have identified a novel tt6 allele from a forward genetic screen and demonstrated that tds3 is allelic to tt12 and tds5 to aha10.
Collapse
|
42
|
The Genomics Revolution and its Impact on Future Biotechnology. J Biotechnol 2014; 190:1. [DOI: 10.1016/j.jbiotec.2014.10.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
43
|
Reliable in silico identification of sequence polymorphisms and their application for extending the genetic map of sugar beet (Beta vulgaris). PLoS One 2014; 9:e110113. [PMID: 25302600 PMCID: PMC4193868 DOI: 10.1371/journal.pone.0110113] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Accepted: 09/07/2014] [Indexed: 02/03/2023] Open
Abstract
Molecular markers are a highly valuable tool for creating genetic maps. Like in many other crops, sugar beet (Beta vulgaris L.) breeding is increasingly supported by the application of such genetic markers. Single nucleotide polymorphism (SNP) based markers have a high potential for automated analysis and high-throughput genotyping. We developed a bioinformatics workflow that uses Sanger and 2nd-generation sequence data for detection, evaluation and verification of new transcript-associated SNPs from sugar beet. RNAseq data from one parent of an established mapping population were produced by 454-FLX sequencing and compared to Sanger ESTs derived from the other parent. The workflow established for SNP detection considers the quality values of both types of reads, provides polymorphic alignments as well as selection criteria for reliable SNP detection and allows painless generation of new genetic markers within genes. We obtained a total of 14,323 genic SNPs and InDels. According to empirically optimised settings for the quality parameters, we classified these SNPs into four usability categories. Validation of a subset of the in silico detected SNPs by genotyping the mapping population indicated a high success rate of the SNP detection. Finally, a total of 307 new markers were integrated with existing data into a new genetic map of sugar beet which offers improved resolution and the integration of terminal markers.
Collapse
|
44
|
Genome-wide identification and characterisation of R2R3-MYB genes in sugar beet (Beta vulgaris). BMC PLANT BIOLOGY 2014; 14:249. [PMID: 25249410 PMCID: PMC4180131 DOI: 10.1186/s12870-014-0249-8] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2014] [Accepted: 09/17/2014] [Indexed: 05/18/2023]
Abstract
BACKGROUND The R2R3-MYB genes comprise one of the largest transcription factor gene families in plants, playing regulatory roles in plant-specific developmental processes, metabolite accumulation and defense responses. Although genome-wide analysis of this gene family has been carried out in some species, the R2R3-MYB genes in Beta vulgaris ssp. vulgaris (sugar beet) as the first sequenced member of the order Caryophyllales, have not been analysed heretofore. RESULTS We present a comprehensive, genome-wide analysis of the MYB genes from Beta vulgaris ssp. vulgaris (sugar beet) which is the first species of the order Caryophyllales with a sequenced genome. A total of 70 R2R3-MYB genes as well as genes encoding three other classes of MYB proteins containing multiple MYB repeats were identified and characterised with respect to structure and chromosomal organisation. Also, organ specific expression patterns were determined from RNA-seq data. The R2R3-MYB genes were functionally categorised which led to the identification of a sugar beet-specific clade with an atypical amino acid composition in the R3 domain, putatively encoding betalain regulators. The functional classification was verified by experimental confirmation of the prediction that the R2R3-MYB gene Bv_iogq encodes a flavonol regulator. CONCLUSIONS This study provides the first step towards cloning and functional dissection of the role of MYB transcription factor genes in the nutritionally and evolutionarily interesting species B. vulgaris. In addition, it describes the flavonol regulator BvMYB12, being the first sugar beet R2R3-MYB with an experimentally proven function.
Collapse
|
45
|
QTL analysis of flowering time and ripening traits suggests an impact of a genomic region on linkage group 1 in Vitis. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2014; 127:1857-72. [PMID: 25112201 PMCID: PMC4145202 DOI: 10.1007/s00122-014-2310-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2013] [Accepted: 04/05/2014] [Indexed: 05/21/2023]
Abstract
In the recent past, genetic analyses of grapevine focused mainly on the identification of resistance loci for major diseases such as powdery and downy mildew. Currently, breeding programs make intensive use of these results by applying molecular markers linked to the resistance traits. However, modern genetics also allows to address additional agronomic traits that have considerable impact on the selection of grapevine cultivars. In this study, we have used linkage mapping for the identification and characterization of flowering time and ripening traits in a mapping population from a cross of V3125 ('Schiava Grossa' × 'Riesling') and the interspecific rootstock cultivar 'Börner' (Vitis riparia × Vitis cinerea). Comparison of the flowering time QTL mapping with data derived from a second independent segregating population identified several common QTLs. Especially a large region on linkage group 1 proved to be of special interest given the genetic divergence of the parents of the two populations. The proximity of the QTL region contains two CONSTANS-like genes. In accordance with data from other plants such as Arabidopsis thaliana and Oryza sativa, we hypothesize that these genes are major contributors to control the time of flowering in Vitis.
Collapse
|
46
|
Profiling of extensively diversified plant LINEs reveals distinct plant-specific subclades. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2014; 79:385-97. [PMID: 24862340 DOI: 10.1111/tpj.12565] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2014] [Revised: 05/12/2014] [Accepted: 05/15/2014] [Indexed: 05/03/2023]
Abstract
A large fraction of eukaryotic genomes is made up of long interspersed nuclear elements (LINEs). Due to their capability to create novel copies via error-prone reverse transcription, they generate multiple families and reach high copy numbers. Although mammalian LINEs have been well described, plant LINEs have been only poorly investigated. Here, we present a systematic cross-species survey of LINEs in higher plant genomes shedding light on plant LINE evolution as well as diversity, and facilitating their annotation in genome projects. Applying a Hidden Markov Model (HMM)-based analysis, 59 390 intact LINE reverse transcriptases (RTs) were extracted from 23 plant genomes. These fall in only two out of 28 LINE clades (L1 and RTE) known in eukaryotes. While plant RTE LINEs are highly homogenous and mostly constitute only a single family per genome, plant L1 LINEs are extremely diverse and form numerous families. Despite their heterogeneity, all members across the 23 species fall into only seven L1 subclades, some of them defined here. Exemplarily focusing on the L1 LINEs of a basal reference plant genome (Beta vulgaris), we show that the subclade classification level does not only reflect RT sequence similarity, but also mirrors structural aspects of complete LINE retrotransposons, like element size, position and type of encoded enzymatic domains. Our comprehensive catalogue of plant LINE RTs serves the classification of highly diverse plant LINEs, while the provided subclade-specific HMMs facilitate their annotation.
Collapse
|
47
|
Abstract
Abstract
Pheromone-initiated sexual reproduction of Volvox carteri f. nagariensis is started by a “spontaneous” generation of sexual male spheroids at a rate of 10-4. As previously suggested, these males are mutants (sexc) which become independent of pheromone-controlled sex-induction. Unequivocal evidence for a mutative event now comes from the observation that the sexc trait is transmitted through cell lineages during embryogenesis, thus confirming earlier morphological analysis, Mutation semms to be restricted to cell division, The high frequency of mutation points to a special enchacing mechanism for whick the possibility of a genome rearrangement is discussed.
Collapse
|
48
|
The CHH motif in sugar beet satellite DNA: a modulator for cytosine methylation. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2014; 78:937-50. [PMID: 24661787 DOI: 10.1111/tpj.12519] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2013] [Revised: 03/17/2014] [Accepted: 03/18/2014] [Indexed: 05/03/2023]
Abstract
Methylation of DNA is important for the epigenetic silencing of repetitive DNA in plant genomes. Knowledge about the cytosine methylation status of satellite DNAs, a major class of repetitive DNA, is scarce. One reason for this is that arrays of tandemly arranged sequences are usually collapsed in next-generation sequencing assemblies. We applied strategies to overcome this limitation and quantified the level of cytosine methylation and its pattern in three satellite families of sugar beet (Beta vulgaris) which differ in their abundance, chromosomal localization and monomer size. We visualized methylation levels along pachytene chromosomes with respect to small satellite loci at maximum resolution using chromosome-wide fluorescent in situ hybridization complemented with immunostaining and super-resolution microscopy. Only reduced methylation of many satellite arrays was obtained. To investigate methylation at the nucleotide level we performed bisulfite sequencing of 1569 satellite sequences. We found that the level of methylation of cytosine strongly depends on the sequence context: cytosines in the CHH motif show lower methylation (44-52%), while CG and CHG motifs are more strongly methylated. This affects the overall methylation of satellite sequences because CHH occurs frequently while CG and CHG are rare or even absent in the satellite arrays investigated. Evidently, CHH is the major target for modulation of the cytosine methylation level of adjacent monomers within individual arrays and contributes to their epigenetic function. This strongly indicates that asymmetric cytosine methylation plays a role in the epigenetic modification of satellite repeats in plant genomes.
Collapse
|
49
|
Integrating bioinformatic resources to predict transcription factors interacting with cis-sequences conserved in co-regulated genes. BMC Genomics 2014; 15:317. [PMID: 24773781 PMCID: PMC4234446 DOI: 10.1186/1471-2164-15-317] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2013] [Accepted: 04/16/2014] [Indexed: 11/22/2022] Open
Abstract
Background Using motif detection programs it is fairly straightforward to identify conserved cis-sequences in promoters of co-regulated genes. In contrast, the identification of the transcription factors (TFs) interacting with these cis-sequences is much more elaborate. To facilitate this, we explore the possibility of using several bioinformatic and experimental approaches for TF identification. This starts with the selection of co-regulated gene sets and leads first to the prediction and then to the experimental validation of TFs interacting with cis-sequences conserved in the promoters of these co-regulated genes. Results Using the PathoPlant database, 32 up-regulated gene groups were identified with microarray data for drought-responsive gene expression from Arabidopsis thaliana. Application of the binding site estimation suite of tools (BEST) discovered 179 conserved sequence motifs within the corresponding promoters. Using the STAMP web-server, 49 sequence motifs were classified into 7 motif families for which similarities with known cis-regulatory sequences were identified. All motifs were subjected to a footprintDB analysis to predict interacting DNA binding domains from plant TF families. Predictions were confirmed by using a yeast-one-hybrid approach to select interacting TFs belonging to the predicted TF families. TF-DNA interactions were further experimentally validated in yeast and with a Physcomitrella patens transient expression system, leading to the discovery of several novel TF-DNA interactions. Conclusions The present work demonstrates the successful integration of several bioinformatic resources with experimental approaches to predict and validate TFs interacting with conserved sequence motifs in co-regulated genes.
Collapse
|
50
|
An easy-to-use primer design tool to address paralogous loci and T-DNA insertion sites in the genome of Arabidopsis thaliana. PLANT METHODS 2014; 10:28. [PMID: 25324895 PMCID: PMC4169229 DOI: 10.1186/1746-4811-10-28] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2014] [Accepted: 09/09/2014] [Indexed: 05/22/2023]
Abstract
BACKGROUND More than 90% of the Arabidopsis thaliana genes are members of multigene families. DNA sequence similarities present in such related genes can cause trouble, e.g. when molecularly analysing mutant alleles of these genes. Also, flanking-sequence-tag (FST) based predictions of T-DNA insertion positions are often located within paralogous regions of the genome. In such cases, the prediction of the correct insertion site must include careful sequence analyses on the one hand and a paralog specific primer design for experimental confirmation of the prediction on the other hand. RESULTS GABI-Kat is a large A. thaliana insertion line resource, which uses in-house confirmation to provide highly reliable access to T-DNA insertion alleles. To offer trustworthy mutant alleles of paralogous loci, we considered multiple insertion site predictions for single FSTs and implemented this 1-to-N relation in our database. The resulting paralogous predictions were addressed experimentally and the correct insertion locus was identified in most cases, including cases in which there were multiple predictions with identical prediction scores. A newly developed primer design tool that takes paralogous regions into account was developed to streamline the confirmation process for paralogs. The tool is suitable for all parts of the genome and is freely available at the GABI-Kat website. Although the tool was initially designed for the analysis of T-DNA insertion mutants, it can be used for any experiment that requires locus-specific primers for the A. thaliana genome. It is easy to use and also able to design amplimers with two genome-specific primers as required for genotyping segregating families of insertion mutants when looking for homozygous offspring. CONCLUSIONS The paralog-aware confirmation process significantly improved the reliability of the insertion site assignment when paralogous regions of the genome were affected. An automatic online primer design tool that incorporates experience from the in-house confirmation of T-DNA insertion lines has been made available. It provides easy access to primers for the analysis of T-DNA insertion alleles, but it is also beneficial for other applications as well.
Collapse
|