101
|
De Novo Transcriptome Sequencing of Rough Lemon Leaves ( Citrus jambhiri Lush.) in Response to Plenodomus tracheiphilus Infection. Int J Mol Sci 2021; 22:ijms22020882. [PMID: 33477297 PMCID: PMC7830309 DOI: 10.3390/ijms22020882] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 01/14/2021] [Accepted: 01/15/2021] [Indexed: 12/17/2022] Open
Abstract
Mal secco is one of the most severe diseases of citrus, caused by the necrotrophic fungus Plenodomus tracheiphilus. With the main aim of identifying candidate genes involved in the response of citrus plants to "Mal secco", we performed a de novo transcriptome analysis of rough lemon seedlings subjected to inoculation of P. tracheiphilus. The analysis of differential expressed genes (DEGs) highlighted a sharp response triggered by the pathogen as a total of 4986 significant DEGs (2865 genes up-regulated and 2121 down-regulated) have been revealed. The analysis of the most significantly enriched KEGG pathways indicated that a crucial role is played by genes involved in "Plant hormone signal transduction", "Phenylpropanoid biosynthesis", and "Carbon metabolism". The main findings of this work are that under fungus challenge, the rough lemon genes involved both in the light harvesting and the photosynthetic electron flow were significantly down-regulated, thus probably inducing a shortage of energy for cellular functions. Moreover, the systemic acquired resistance (SAR) was activated through the induced salicylic acid cascade. Interestingly, RPM1 interacting protein 4, an essential positive regulator of plant defense, and BIR2, which is a negative regulator of basal level of immunity, have been identified thus representing useful targets for molecular breeding.
Collapse
|
102
|
An 8.22 Mb Assembly and Annotation of the Alpaca ( Vicugna pacos) Y Chromosome. Genes (Basel) 2021; 12:genes12010105. [PMID: 33467186 PMCID: PMC7830431 DOI: 10.3390/genes12010105] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 01/07/2021] [Accepted: 01/14/2021] [Indexed: 12/26/2022] Open
Abstract
The unique evolutionary dynamics and complex structure make the Y chromosome the most diverse and least understood region in the mammalian genome, despite its undisputable role in sex determination, development, and male fertility. Here we present the first contig-level annotated draft assembly for the alpaca (Vicugna pacos) Y chromosome based on hybrid assembly of short- and long-read sequence data of flow-sorted Y. The latter was also used for cDNA selection providing Y-enriched testis transcriptome for annotation. The final assembly of 8.22 Mb comprised 4.5 Mb of male specific Y (MSY) and 3.7 Mb of the pseudoautosomal region. In MSY, we annotated 15 X-degenerate genes and two novel transcripts, but no transposed sequences. Two MSY genes, HSFY and RBMY, are multicopy. The pseudoautosomal boundary is located between SHROOM2 and HSFY. Comparative analysis shows that the small and cytogenetically distinct alpaca Y shares most of MSY sequences with the larger dromedary and Bactrian camel Y chromosomes. Most of alpaca X-degenerate genes are also shared with other mammalian MSYs, though WWC3Y is Y-specific only in alpaca/camels and the horse. The partial alpaca Y assembly is a starting point for further expansion and will have applications in the study of camelid populations and male biology.
Collapse
|
103
|
Du H, Diao C, Zhao P, Zhou L, Liu JF. Integrated hybrid de novo assembly technologies to obtain high-quality pig genome using short and long reads. Brief Bioinform 2021; 22:6082823. [PMID: 33429431 DOI: 10.1093/bib/bbaa399] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 11/20/2020] [Accepted: 12/08/2020] [Indexed: 11/12/2022] Open
Abstract
With the rapid progress of sequencing technologies, various types of sequencing reads and assembly algorithms have been designed to construct genome assemblies. Although recent studies have attempted to evaluate the appropriate type of sequencing reads and algorithms for assembling high-quality genomes, it is still a challenge to set the correct combination for constructing animal genomes. Here, we present a comparative performance assessment of 14 assembly combinations-9 software programs with different short and long reads of Duroc pig. Based on the results of the optimization process for genome construction, we designed an integrated hybrid de novo assembly pipeline, HSCG, and constructed a draft genome for Duroc pig. Comparison between the new genome and Sus scrofa 11.1 revealed important breakpoints in two S. scrofa 11.1 genes. Our findings may provide new insights into the pan-genome analysis studies of agricultural animals, and the integrated assembly pipeline may serve as a guide for the assembly of other animal genomes.
Collapse
|
104
|
Musilova J, Kourilova X, Bezdicek M, Lengerova M, Obruca S, Skutkova H, Sedlar K. First Complete Genome of the Thermophilic Polyhydroxyalkanoates-Producing Bacterium Schlegelella thermodepolymerans DSM 15344. Genome Biol Evol 2021; 13:6081016. [PMID: 33432323 PMCID: PMC8023429 DOI: 10.1093/gbe/evab007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/06/2021] [Indexed: 11/13/2022] Open
Abstract
Schlegelella thermodepolymerans is a moderately thermophilic bacterium capable of producing polyhydroxyalkanoates-biodegradable polymers representing an alternative to conventional plastics. Here, we present the first complete genome of the type strain S. thermodepolymerans DSM 15344 that was assembled by hybrid approach using both long (Oxford Nanopore) and short (Illumina) reads. The genome consists of a single 3,858,501-bp-long circular chromosome with GC content of 70.3%. Genome annotation identified 3,650 genes in total, whereas 3,598 open reading frames belonged to protein-coding genes. Functional annotation of the genome and division of genes into clusters of orthologous groups revealed a relatively high number of 1,013 genes with unknown function or unknown clusters of orthologous groups, which reflects the fact that only a little is known about thermophilic polyhydroxyalkanoates-producing bacteria on a genome level. On the other hand, 270 genes involved in energy conversion and production were detected. This group covers genes involved in catabolic processes, which suggests capability of S. thermodepolymerans DSM 15344 to utilize and biotechnologically convert various substrates such as lignocellulose-based saccharides, glycerol, or lipids. Based on the knowledge of its genome, it can be stated that S. thermodepolymerans DSM 15344 is a very interesting, metabolically versatile bacterium with great biotechnological potential.
Collapse
|
105
|
Das D, Baruah IK, Panda D, Paswan RR, Acharjee S, Sarmah BK. Bruchid beetle ovipositioning mediated defense responses in black gram pods. BMC PLANT BIOLOGY 2021; 21:38. [PMID: 33430784 PMCID: PMC7802178 DOI: 10.1186/s12870-020-02796-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Accepted: 12/14/2020] [Indexed: 05/20/2023]
Abstract
BACKGROUND Black gram [Vigna mungo (L)] seeds are a rich source of digestible protein and dietary fibre, both for human and animal consumption. However, the quality and quantity of the Vigna seeds are severely affected by bruchid beetles during storage. Therefore, analyses of the expression of the bruchid induced transcript dynamics in black gram pods would be helpful to understand the underlying defense mechanism against bruchid oviposition. RESULTS We used the RNAseq approach to survey the changes in transcript profile in the developing seeds of a moderately resistant cultivar IC-8219 against bruchid oviposition using a susceptible cultivar T-9 as a control. A total of 96,084,600 and 99,532,488 clean reads were generated from eight (4 each) samples of IC-8219 and T-9 cultivar, respectively. Based on the BLASTX search against the NR database, 32,584 CDSs were generated of which 31,817 CDSs were significantly similar to Vigna radiata, a close relative of Vigna mungo. The IC-8219 cultivar had 630 significantly differentially expressed genes (DEGs) of which 304 and 326 genes up and down-regulated, respectively. However, in the T-9 cultivar, only 168 DEGs were identified of which 142 and 26 genes up and down-regulated, respectively. The expression analyses of 10 DEGs by qPCR confirmed the accuracy of the RNA-Seq data. Gene Ontology and KEGG pathway analyses helped us to better understand the role of these DEGs in oviposition mediated defense response of black gram. In both the cultivars, the most significant transcriptomic changes in response to the oviposition were related to the induction of defense response genes, transcription factors, secondary metabolites, enzyme inhibitors, and signal transduction pathways. It appears that the bruchid ovipositioning mediated defense response in black gram is induced by SA signaling pathways and defense genes such as defensin, genes for secondary metabolites, and enzyme inhibitors could be potential candidates for resistance to bruchids. CONCLUSION We generated a transcript profile of immature black gram pods upon bruchid ovipositioning by de novo assembly and studied the underlying defense mechanism of a moderately resistant cultivar.
Collapse
|
106
|
Dong N, Bandura J, Zhang Z, Wang Y, Labadie K, Noel B, Davison A, Koene JM, Sun HS, Coutellec MA, Feng ZP. Ion channel profiling of the Lymnaea stagnalis ganglia via transcriptome analysis. BMC Genomics 2021; 22:18. [PMID: 33407100 PMCID: PMC7789530 DOI: 10.1186/s12864-020-07287-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Accepted: 11/28/2020] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND The pond snail Lymnaea stagnalis (L. stagnalis) has been widely used as a model organism in neurobiology, ecotoxicology, and parasitology due to the relative simplicity of its central nervous system (CNS). However, its usefulness is restricted by a limited availability of transcriptome data. While sequence information for the L. stagnalis CNS transcripts has been obtained from EST libraries and a de novo RNA-seq assembly, the quality of these assemblies is limited by a combination of low coverage of EST libraries, the fragmented nature of de novo assemblies, and lack of reference genome. RESULTS In this study, taking advantage of the recent availability of a preliminary L. stagnalis genome, we generated an RNA-seq library from the adult L. stagnalis CNS, using a combination of genome-guided and de novo assembly programs to identify 17,832 protein-coding L. stagnalis transcripts. We combined our library with existing resources to produce a transcript set with greater sequence length, completeness, and diversity than previously available ones. Using our assembly and functional domain analysis, we profiled L. stagnalis CNS transcripts encoding ion channels and ionotropic receptors, which are key proteins for CNS function, and compared their sequences to other vertebrate and invertebrate model organisms. Interestingly, L. stagnalis transcripts encoding numerous putative Ca2+ channels showed the most sequence similarity to those of Mus musculus, Danio rerio, Xenopus tropicalis, Drosophila melanogaster, and Caenorhabditis elegans, suggesting that many calcium channel-related signaling pathways may be evolutionarily conserved. CONCLUSIONS Our study provides the most thorough characterization to date of the L. stagnalis transcriptome and provides insights into differences between vertebrates and invertebrates in CNS transcript diversity, according to function and protein class. Furthermore, this study provides a complete characterization of the ion channels of Lymnaea stagnalis, opening new avenues for future research on fundamental neurobiological processes in this model system.
Collapse
|
107
|
Villanueva-Corrales S, García-Botero C, Garcés-Cardona F, Ramírez-Ríos V, Villanueva-Mejía DF, Álvarez JC. The Complete Chloroplast Genome of Plukenetia volubilis Provides Insights Into the Organelle Inheritance. FRONTIERS IN PLANT SCIENCE 2021; 12:667060. [PMID: 33968119 PMCID: PMC8103035 DOI: 10.3389/fpls.2021.667060] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Accepted: 03/24/2021] [Indexed: 05/04/2023]
Abstract
Plukenetia volubilis L. (Malpighiales: Euphorbiaceae), also known as Sacha inchi, is considered a promising crop due to its high seed content of unsaturated fatty acids (UFAs), all of them highly valuable for food and cosmetic industries, but the genetic basis of oil biosynthesis of this non-model plant is still insufficient. Here, we sequenced the total DNA of Sacha inchi by using Illumina and Nanopore technologies and approached a de novo reconstruction of the whole nucleotide sequence and the organization of its 164,111 bp length of the chloroplast genome, displaying two copies of an inverted repeat sequence [inverted repeat A (IRA) and inverted repeat B (IRB)] of 28,209 bp, each one separating a small single copy (SSC) region of 17,860 bp and a large single copy (LSC) region of 89,833 bp. We detected two large inversions on the chloroplast genome that were not presented in the previously reported sequence and studied a promising cpDNA marker, useful in phylogenetic approaches. This chloroplast DNA (cpDNA) marker was used on a set of five distinct Colombian cultivars of P. volubilis from different geographical locations to reveal their phylogenetic relationships. Thus, we evaluated if it has enough resolution to genotype cultivars, intending to crossbreed parents and following marker's trace down to the F1 generation. We finally elucidated, by using molecular and cytological methods on cut flower buds, that the inheritance mode of P. volubilis cpDNA is maternally transmitted and proposed that it occurs as long as it is physically excluded during pollen development. This de novo chloroplast genome will provide a valuable resource for studying this promising crop, allowing the determination of the organellar inheritance mechanism of some critical phenotypic traits and enabling the use of genetic engineering in breeding programs to develop new varieties.
Collapse
|
108
|
Torres-Silva G, Correia LNF, Batista DS, Koehler AD, Resende SV, Romanel E, Cassol D, Almeida AMR, Strickler SR, Specht CD, Otoni WC. Transcriptome Analysis of Melocactus glaucescens (Cactaceae) Reveals Metabolic Changes During in vitro Shoot Organogenesis Induction. FRONTIERS IN PLANT SCIENCE 2021; 12:697556. [PMID: 34490003 PMCID: PMC8417902 DOI: 10.3389/fpls.2021.697556] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Accepted: 07/16/2021] [Indexed: 05/16/2023]
Abstract
Melocactus glaucescens is an endangered cactus highly valued for its ornamental properties. In vitro shoot production of this species provides a sustainable alternative to overharvesting from the wild; however, its propagation could be improved if the genetic regulation underlying its developmental processes were known. The present study generated de novo transcriptome data, describing in vitro shoot organogenesis induction in M. glaucescens. Total RNA was extracted from explants before (control) and after shoot organogenesis induction (treated). A total of 14,478 unigenes (average length, 520 bases) were obtained using Illumina HiSeq 3000 (Illumina Inc., San Diego, CA, USA) sequencing and transcriptome assembly. Filtering for differential expression yielded 2,058 unigenes. Pairwise comparison of treated vs. control genes revealed that 1,241 (60.3%) unigenes exhibited no significant change, 226 (11%) were downregulated, and 591 (28.7%) were upregulated. Based on database analysis, more transcription factor families and unigenes appeared to be upregulated in the treated samples than in controls. Expression of WOUND INDUCED DEDIFFERENTIATION 1 (WIND1) and CALMODULIN (CaM) genes, both of which were upregulated in treated samples, was further validated by real-time quantitative PCR (RT-qPCR). Differences in gene expression patterns between control and treated samples indicate substantial changes in the primary and secondary metabolism of M. glaucescens after the induction of shoot organogenesis. These results help to clarify the molecular genetics and functional genomic aspects underlying propagation in the Cactaceae family.
Collapse
|
109
|
Lee HE, Manivannan A, Lee SY, Han K, Yeum JG, Jo J, Kim J, Rho IR, Lee YR, Lee ES, Kang BC, Kim DS. Chromosome Level Assembly of Homozygous Inbred Line 'Wongyo 3115' Facilitates the Construction of a High-Density Linkage Map and Identification of QTLs Associated With Fruit Firmness in Octoploid Strawberry ( Fragaria × ananassa). FRONTIERS IN PLANT SCIENCE 2021; 12:696229. [PMID: 34335662 PMCID: PMC8317996 DOI: 10.3389/fpls.2021.696229] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Accepted: 06/08/2021] [Indexed: 05/02/2023]
Abstract
Strawberry is an allo-octoploid crop with high genome heterozygosity and complexity, which hinders the sequencing and the assembly of the genome. However, in the present study, we have generated a chromosome level assembly of octoploid strawberry sourced from a highly homozygous inbred line 'Wongyo 3115', using long- and short-read sequencing technologies. The assembly of 'Wongyo 3115' produced 805.6 Mb of the genome with 323 contigs scaffolded into 208 scaffolds with an N50 of 27.3 Mb after further gap filling. The whole genome annotation resulted in 151,892 genes with a gene density of 188.52 (genes/Mb) and validation of a genome, using BUSCO analysis resulted in 94.10% complete BUSCOs. Firmness is one of the vital traits in strawberry, which facilitate the postharvest shelf-life qualities. The molecular and genetic mechanisms that contribute the firmness in strawberry remain unclear. We have constructed a high-density genetic map based on the 'Wongyo 3115' reference genome to identify loci associated with firmness in the present study. For the quantitative trait locus (QTL) identification, the 'BS F2' populations developed from two inbred lines were genotyped, using an Axiom 35K strawberry chip, and marker positions were analyzed based on the 'Wongyo 3115' genome. Genetic maps were constructed with 1,049 bin markers, spanning the 3,861 cM. Using firmness data of 'BS F2' obtained from 2 consecutive years, five QTLs were identified on chromosomes 3-3, 5-1, 6-1, and 6-4. Furthermore, we predicted the candidate genes associated with firmness in strawberries by utilizing transcriptome data and QTL information. Overall, we present the chromosome-level assembly and annotation of a homozygous octoploid strawberry inbred line and a linkage map constructed to identify QTLs associated with fruit firmness.
Collapse
|
110
|
Liu Y, Helmann TC, Stodghill P, Filiatrault MJ. Complete Genome Sequence Resource for the Necrotrophic Plant-Pathogenic Bacterium Pectobacterium carotovorum WPP14. PLANT DISEASE 2021; 105:196-198. [PMID: 32720879 DOI: 10.1094/pdis-05-20-1059-a] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Pectobacterium spp. are a major cause of loss in vegetable and ornamental plant production. One of these species, Pectobacterium carotovorum, can cause soft rot disease on many plants, particularly potato. These diseases lead to significant economic loss and pose food security threats by reducing crop yields in the field, in transit, and during storage. The Gram-negative enterobacterium P. carotovorum WPP14 is a particularly virulent strain for which there is no available closed genome, limiting the molecular research for this important pathogen. Here, we report a high-quality complete and annotated genome sequence of P. carotovorum WPP14. The 4,892,225-bp genome was assembled with Nanopore reads and polished with Illumina reads, yielding 394× and 164× coverage, respectively. This closed genome provides a resource for research on improved detection and biology of P. carotovorum, which could translate into improved disease management.
Collapse
|
111
|
Camiolo S, Suárez NM, Chalka A, Venturini C, Breuer J, Davison AJ. GRACy: A tool for analysing human cytomegalovirus sequence data. Virus Evol 2020; 7:veaa099. [PMID: 33505707 PMCID: PMC7816668 DOI: 10.1093/ve/veaa099] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Modern DNA sequencing has instituted a new era in human cytomegalovirus (HCMV) genomics. A key development has been the ability to determine the genome sequences of HCMV strains directly from clinical material. This involves the application of complex and often non-standardized bioinformatics approaches to analysing data of variable quality in a process that requires substantial manual intervention. To relieve this bottleneck, we have developed GRACy (Genome Reconstruction and Annotation of Cytomegalovirus), an easy-to-use toolkit for analysing HCMV sequence data. GRACy automates and integrates modules for read filtering, genotyping, genome assembly, genome annotation, variant analysis, and data submission. These modules were tested extensively on simulated and experimental data and outperformed generic approaches. GRACy is written in Python and is embedded in a graphical user interface with all required dependencies installed by a single command. It runs on the Linux operating system and is designed to allow the future implementation of a cross-platform version. GRACy is distributed under a GPL 3.0 license and is freely available at https://bioinformatics.cvr.ac.uk/software/ with the manual and a test dataset.
Collapse
|
112
|
Wibberg D, Price-Carter M, Rückert C, Blom J, Möbius P. Complete Genome Sequence of Ovine Mycobacterium avium subsp. paratuberculosis Strain JIII-386 (MAP-S/type III) and Its Comparison to MAP-S/type I, MAP-C, and M. avium Complex Genomes. Microorganisms 2020; 9:microorganisms9010070. [PMID: 33383865 PMCID: PMC7823733 DOI: 10.3390/microorganisms9010070] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 12/21/2020] [Accepted: 12/24/2020] [Indexed: 02/07/2023] Open
Abstract
Mycobacterium avium (M. a.) subsp. paratuberculosis (MAP) is a worldwide-distributed obligate pathogen in ruminants causing Johne’s disease. Due to a lack of complete subtype III genome sequences, there is not yet conclusive information about genetic differences between strains of cattle (MAP-C, type II) and sheep (MAP-S) type, and especially between MAP-S subtypes I, and III. Here we present the complete, circular genome of MAP-S/type III strain JIII-386 (DE) closed by Nanopore-technology and its comparison with MAP-S/type I closed genome of strain Telford (AUS), MAP-S/type III draft genome of strain S397 (U.S.), twelve closed MAP-C strains, and eight closed M.-a.-complex-strains. Structural comparative alignments revealed clearly the mosaic nature of MAP, emphasized differences between the subtypes and the higher diversity of MAP-S genomes. The comparison of various genomic elements including transposases and genomic islands provide new insights in MAP genomics. MAP type specific phenotypic features may be attributed to genes of known large sequence polymorphisms (LSPSs) regions I–IV and deletions #1 and #2, confirmed here, but could also result from identified frameshifts or interruptions of various virulence-associated genes (e.g., mbtC in MAP-S). Comprehensive core and pan genome analysis uncovered unique genes (e.g., cytochromes) and genes probably acquired by horizontal gene transfer in different MAP-types and subtypes, but also emphasized the highly conserved and close relationship, and the complex evolution of M.-a.-strains.
Collapse
|
113
|
Lee GLY, Zakaria NN, Convey P, Futamata H, Zulkharnain A, Suzuki K, Abdul Khalil K, Shaharuddin NA, Alias SA, González-Rocha G, Ahmad SA. Statistical Optimisation of Phenol Degradation and Pathway Identification through Whole Genome Sequencing of the Cold-Adapted Antarctic Bacterium, Rhodococcus sp. Strain AQ5-07. Int J Mol Sci 2020; 21:ijms21249363. [PMID: 33316871 PMCID: PMC7764105 DOI: 10.3390/ijms21249363] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Revised: 11/18/2020] [Accepted: 11/19/2020] [Indexed: 11/16/2022] Open
Abstract
Study of the potential of Antarctic microorganisms for use in bioremediation is of increasing interest due to their adaptations to harsh environmental conditions and their metabolic potential in removing a wide variety of organic pollutants at low temperature. In this study, the psychrotolerant bacterium Rhodococcus sp. strain AQ5-07, originally isolated from soil from King George Island (South Shetland Islands, maritime Antarctic), was found to be capable of utilizing phenol as sole carbon and energy source. The bacterium achieved 92.91% degradation of 0.5 g/L phenol under conditions predicted by response surface methodology (RSM) within 84 h at 14.8 °C, pH 7.05, and 0.41 g/L ammonium sulphate. The assembled draft genome sequence (6.75 Mbp) of strain AQ5-07 was obtained through whole genome sequencing (WGS) using the Illumina Hiseq platform. The genome analysis identified a complete gene cluster containing catA, catB, catC, catR, pheR, pheA2, and pheA1. The genome harbours the complete enzyme systems required for phenol and catechol degradation while suggesting phenol degradation occurs via the β-ketoadipate pathway. Enzymatic assay using cell-free crude extract revealed catechol 1,2-dioxygenase activity while no catechol 2,3-dioxygenase activity was detected, supporting this suggestion. The genomic sequence data provide information on gene candidates responsible for phenol and catechol degradation by indigenous Antarctic bacteria and contribute to knowledge of microbial aromatic metabolism and genetic biodiversity in Antarctica.
Collapse
|
114
|
Banerjee A, Singh A, Roychoudhury A. De novo RNA-Seq analysis in sensitive rice cultivar and comparative transcript profiling in contrasting genotypes reveal genetic biomarkers for fluoride-stress response. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2020; 267:115378. [PMID: 33254681 DOI: 10.1016/j.envpol.2020.115378] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Revised: 07/13/2020] [Accepted: 08/04/2020] [Indexed: 06/12/2023]
Abstract
The fluoride-sensitive indica rice cultivar, IR-64 was subjected to NaF-treatment for 25 days, following which RNA-Seq analysis identified significant up and down regulation of 1,303 and 93 transcripts respectively. Gene ontology (GO) enrichment analysis classified transcripts into groups related to 'cellular part', 'membrane', 'catalytic activity', 'transporter activity', 'binding', 'metabolic processes' and 'cellular processes'. Analysis of differentially expressed genes (DEGs) revealed fluoride-mediated suppression of abscisic acid (ABA) biosynthesis and signaling. Instead, the gibberellin-dependent pathway and signaling via ABA-independent transcription factors (TFs) was activated. Comparative profiling of selected DEGs in IR-64 and fluoride-tolerant variety, Khitish revealed significant cytoskeletal and nucleosomal remodelling, accompanied with escalated levels of autophagy in stressed IR-64 (unlike that in stressed Khitish). Genes associated with ion, solute and xenobiotic transport were strongly up regulated in stressed IR-64, indicating potential fluoride entry through these channels. On the contrary, genes associated with xenobiotic mobility were suppressed in the tolerant cultivar, which restricted bioaccumulation and translocation of fluoride. Pairwise expression profile analysis between stressed IR-64 and Khitish, supported by extensive statistical modelling predicted that fluoride susceptibility was associated with high expression of genes like amino acid transporter, ABC transporter2, CLCd, MFS monosaccharide transporter, SulfT2.1 and PotT2 while fluoride tolerance with high expression of Sweet11.
Collapse
|
115
|
Cao LJ, Song W, Yue L, Guo SK, Chen JC, Gong YJ, Hoffmann AA, Wei SJ. Chromosome-level genome of the peach fruit moth Carposina sasakii (Lepidoptera: Carposinidae) provides a resource for evolutionary studies on moths. Mol Ecol Resour 2020; 21:834-848. [PMID: 33098233 DOI: 10.1111/1755-0998.13288] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Revised: 10/05/2020] [Accepted: 10/16/2020] [Indexed: 01/22/2023]
Abstract
The peach fruit moth (PFM), Carposina sasakii Matsumura, is a major phytophagous orchard pest widely distributed across Northeast Asia. Here, we report the chromosome-level genome for the PFM, representing the first genome for the family Carposinidae, from the lepidopteran superfamily Copromorphoidea. The genome was assembled into 404.83 Mb sequences using PacBio long-read and Illumina short-read sequences, including 275 contigs, with a contig N50 length of 2.62 Mb. All contigs were assembled into 31 linkage groups assisted by the Hi-C technique, including 30 autosomes and a Z chromosome. BUSCO analysis showed that 98.3% of genes were complete and 0.4% of genes were fragmented, while 1.3% of genes were missing in the assembled genome. In total, 21,697 protein-coding genes were predicted, of which 84.80% were functionally annotated. Because of the importance of diapause triggered by photoperiod in PFM, five circadian genes in the PFM as well as in the other related species were annotated, and potential genes related to diapause and photoperiodic reaction were also identified from transcriptome sequencing. In addition, manual annotation of detoxification gene families was undertaken and showed a higher number of glutathione S-transferase (GST) gene in PFM than in most other lepidopterans, in contrast to a lower number of uridine diphosphate (UDP)-glycosyltransferase (UGT) gene, carboxyl/cholinesterases (CCE) gene and cytochrome P450 monooxygenase (P450) gene, suggesting different detoxication pathways in this moth. The high-quality genome provides a resource for comparative evolutionary studies of this moth and its relatives within the context of radiations across Lepidoptera.
Collapse
|
116
|
Gong Z, Zhu JW, Li CP, Jiang S, Ma LN, Tang BX, Zou D, Chen ML, Sun YB, Song SH, Zhang Z, Xiao JF, Xue YB, Bao YM, Du ZL, Zhao WM. An online coronavirus analysis platform from the National Genomics Data Center. Zool Res 2020; 41:705-708. [PMID: 33045776 PMCID: PMC7671910 DOI: 10.24272/j.issn.2095-8137.2020.065] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Accepted: 10/12/2020] [Indexed: 11/24/2022] Open
Abstract
Since the first reported severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection in December 2019, coronavirus disease 2019 (COVID-19) has become a global pandemic, spreading to more than 200 countries and regions worldwide. With continued research progress and virus detection, SARS-CoV-2 genomes and sequencing data have been reported and accumulated at an unprecedented rate. To meet the need for fast analysis of these genome sequences, the National Genomics Data Center (NGDC) of the China National Center for Bioinformation (CNCB) has established an online coronavirus analysis platform, which includes de novoassembly, BLAST alignment, genome annotation, variant identification, and variant annotation modules. The online analysis platform can be freely accessed at the 2019 Novel Coronavirus Resource (2019nCoVR) (https://bigd.big.ac.cn/ncov/online/tools).
Collapse
|
117
|
Sheng M, She J, Xu W, Hong Y, Su Z, Zhang X. HpeNet: Co-expression Network Database for de novo Transcriptome Assembly of Paeonia lactiflora Pall. Front Genet 2020; 11:570138. [PMID: 33193666 PMCID: PMC7641121 DOI: 10.3389/fgene.2020.570138] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2020] [Accepted: 08/18/2020] [Indexed: 01/23/2023] Open
Abstract
The herbaceous peony (Paeonia lactiflora Pall.) is a well-known ornamental flowering and pharmaceutical plant found in China. Its high medicinal value has long been recognized by traditional Chinese medicine (as Radix paeoniae Alba and Radix paeoniae Rubra), and it has become economically valued for its oilseed in recent years; like other Paeonia species, it has been identified as a novel resource for the α-linolenic acid used in seed oil production. However, its genome has not yet been sequenced, and little transcriptome data on Paeonia lactiflora are available. To obtain a comprehensive transcriptome for Paeonia lactiflora, RNAs from 10 tissues of the Paeonia lactiflora Pall. cv Shaoyou17C were used for de novo assembly, and 416,062 unigenes were obtained. Using a homology search, it was found that 236,222 (approximately 57%) unigenes had at least one BLAST hit in one or more public data resources. The construction of co-expression networks is a feasible means for improving unigene annotation. Using in-house transcriptome data, we obtained a co-expression network covering 95.13% of the unigenes. Then we integrated co-expression network analyses and lipid-related pathway genes to study lipid metabolism in Paeonia lactiflora cultivars. Finally, we constructed the online database HpeNet (http://bioinformatics.cau.edu.cn/HpeNet) to integrate transcriptome data, gene information, the co-expression network, and so forth. The database can also be searched for gene details, gene functions, orthologous matches, and other data. Our online database may help the research community identify functional genes and perform research on Paeonia lactiflora more conveniently. We hope that de novo transcriptome assembly, combined with co-expression networks, can provide a feasible means to predict the gene function of species that do not have a reference genome.
Collapse
|
118
|
Muliyar RK, Chowdappa P, Behera SK, Kasaragod S, Gangaraj KP, Kotimoole CN, Nekrakalaya B, Mohanty V, Sampgod RB, Banerjee G, Das AJ, Niral V, Karun A, Mahato AK, Gaikwad K, Singh NK, Prasad TSK. Assembly and Annotation of the Nuclear and Organellar Genomes of a Dwarf Coconut (Chowghat Green Dwarf) Possessing Enhanced Disease Resistance. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2020; 24:726-742. [PMID: 33170083 DOI: 10.1089/omi.2020.0147] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Coconut (Cocos nucifera L.), an important source of vegetable oil, nutraceuticals, functional foods, and housing materials, provides raw materials for a repertoire of industries engaged in the manufacture of cosmetics, soaps, detergents, paints, varnishes, and emulsifiers, among other products. The palm plays a vital role in maintaining and promoting the sustainability of farming systems of the fragile ecosystems of islands and coastal regions of the tropics. In this study, we present the genome of a dwarf coconut variety "Chowghat Green Dwarf" (CGD) from India, possessing enhanced resistance to root (wilt) disease. Utilizing short reads from the Illumina HiSeq 4000 platform and long reads from the Pacific Biosciences RSII platform, we have assembled the draft genome assembly of 1.93 Gb. The genome is distributed over 26,855 scaffolds, with ∼81.56% of the assembled genome present in scaffolds of lengths longer than 50 kb. About 77.29% of the genome was composed of transposable elements and repeats. Gene prediction yielded 51,953 genes, which upon stringent filtering, based on Annotation Edit Distance, resulted in 13,707 genes, which coded for 11,181 proteins. Among these, we gathered transcript level evidence for a total of 6828 predicted genes based on the RNA-Seq data from different coconut tissues, since they presented assembled transcripts within the genome annotation coordinates. A total of 112 nucleotide-binding and leucine-rich repeat loci, belonging to six classes, were detected. We have also undertaken the assembly and annotation of the CGD chloroplast and mitochondrial genomes. The availability of the dwarf coconut genome shall prove invaluable for deducing the origin of dwarf coconut cultivars, dissection of genes controlling plant habit and fruit color, and accelerated breeding for improved agronomic traits.
Collapse
|
119
|
Draft Genome Assembly of the Freshwater Apex Predator Wels Catfish ( Silurus glanis) Using Linked-Read Sequencing. G3-GENES GENOMES GENETICS 2020; 10:3897-3906. [PMID: 32917720 PMCID: PMC7642921 DOI: 10.1534/g3.120.401711] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
The wels catfish (Silurus glanis) is one of the largest freshwater fish species in the world. This top predator plays a key role in ecosystem stability, and represents an iconic trophy-fish for recreational fishermen. S. glanis is also a highly valued species for its high-quality boneless flesh, and has been cultivated for over 100 years in Eastern and Central Europe. The interest in rearing S. glanis continues to grow; the aquaculture production of this species has almost doubled during the last decade. However, despite its high ecological, cultural and economic importance, the available genomic resources for S. glanis are very limited. To fulfill this gap we report a de novo assembly and annotation of the whole genome sequence of a female S. glanis. The linked-read based technology with 10X Genomics Chromium chemistry and Supernova assembler produced a highly continuous draft genome of S. glanis: ∼0.8Gb assembly (scaffold N50 = 3.2 Mb; longest individual scaffold = 13.9 Mb; BUSCO completeness = 84.2%), which included 313.3 Mb of putative repeated sequences. In total, 21,316 protein-coding genes were predicted, of which 96% were annotated functionally from either sequence homology or protein signature searches. The highly continuous genome assembly will be an invaluable resource for aquaculture genomics, genetics, conservation, and breeding research of S. glanis.
Collapse
|
120
|
Hamada M, Satoh N, Khalturin K. A Reference Genome from the Symbiotic Hydrozoan, Hydra viridissima. G3 (BETHESDA, MD.) 2020; 10:3883-3895. [PMID: 32900905 PMCID: PMC7642931 DOI: 10.1534/g3.120.401411] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 09/03/2020] [Indexed: 11/18/2022]
Abstract
Various Hydra species have been employed as model organisms since the 18th century. Introduction of transgenic and knock-down technologies made them ideal experimental systems for studying cellular and molecular mechanisms involved in regeneration, body-axis formation, senescence, symbiosis, and holobiosis. In order to provide an important reference for genetic studies, the Hydra magnipapillata genome (species name has been changed to H. vulgaris) was sequenced a decade ago (Chapman et al., 2010) and the updated genome assembly, Hydra 2.0, was made available by the National Human Genome Research Institute in 2017. While H. vulgaris belongs to the non-symbiotic brown hydra lineage, the green hydra, Hydra viridissima, harbors algal symbionts and belongs to an early diverging clade that separated from the common ancestor of brown and green hydra lineages at least 100 million years ago (Schwentner and Bosch 2015; Khalturin et al., 2019). While interspecific interactions between H. viridissima and endosymbiotic unicellular green algae of the genus Chlorella have been a subject of interest for decades, genomic information about green hydras was nonexistent. Here we report a draft 280-Mbp genome assembly for Hydra viridissima strain A99, with a scaffold N50 of 1.1 Mbp. The H. viridissima genome contains an estimated 21,476 protein-coding genes. Comparative analysis of Pfam domains and orthologous proteins highlights characteristic features of H. viridissima, such as diversification of innate immunity genes that are important for host-symbiont interactions. Thus, the H. viridissima assembly provides an important hydrozoan genome reference that will facilitate symbiosis research and better comparisons of metazoan genome architectures.
Collapse
|
121
|
Ma Q, Sun T, Li S, Wen J, Zhu L, Yin T, Yan K, Xu X, Li S, Mao J, Wang Y, Jin S, Zhao X, Li Q. The Acer truncatum genome provides insights into nervonic acid biosynthesis. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 104:662-678. [PMID: 32772482 PMCID: PMC7702125 DOI: 10.1111/tpj.14954] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2020] [Revised: 07/08/2020] [Accepted: 07/21/2020] [Indexed: 05/10/2023]
Abstract
Acer truncatum (purpleblow maple) is a woody tree species that produces seeds with high levels of valuable fatty acids (especially nervonic acid). However, the lack of a complete genome sequence has limited both basic and applied research on A. truncatum. We describe a high-quality draft genome assembly comprising 633.28 Mb (contig N50 = 773.17 kb; scaffold N50 = 46.36 Mb) with at least 28 438 predicted genes. The genome underwent an ancient triplication, similar to the core eudicots, but there have been no recent whole-genome duplication events. Acer yangbiense and A. truncatum are estimated to have diverged about 9.4 million years ago. A combined genomic, transcriptomic, metabonomic, and cell ultrastructural analysis provided new insights into the biosynthesis of very long-chain monounsaturated fatty acids. In addition, three KCS genes were found that may contribute to regulating nervonic acid biosynthesis. The KCS paralogous gene family expanded to 28 members, with 10 genes clustered together and distributed in the 0.27-Mb region of pseudochromosome 4. Our chromosome-scale genomic characterization may facilitate the discovery of agronomically important genes and stimulate functional genetic research on A. truncatum. Furthermore, the data presented also offer important foundations from which to study the molecular mechanisms influencing the production of nervonic acids.
Collapse
|
122
|
Ma X, Fan J, Wu Y, Zhao S, Zheng X, Sun C, Tan L. Whole-genome de novo assemblies reveal extensive structural variations and dynamic organelle-to-nucleus DNA transfers in African and Asian rice. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 104:596-612. [PMID: 32748498 PMCID: PMC7693357 DOI: 10.1111/tpj.14946] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Revised: 07/17/2020] [Accepted: 07/22/2020] [Indexed: 05/05/2023]
Abstract
Asian cultivated rice (Oryza sativa) and African cultivated rice (Oryza glaberrima) originated from the wild rice species Oryza rufipogon and Oryza barthii, respectively. The genomes of both cultivated species have undergone profound changes during domestication. Whole-genome de novo assemblies of O. barthii, O. glaberrima, O. rufipogon and Oryza nivara, produced using PacBio single-molecule real-time (SMRT) and next-generation sequencing (NGS) technologies, showed that Gypsy-like retrotransposons are the major contributors to genome size variation in African and Asian rice. Through the detection of genome-wide structural variations (SVs), we observed that besides 28 shared SV hot spots, another 67 hot spots existed in either the Asian or African rice genomes. Based on gene annotation information of the SVs, we established that organelle-to-nucleus DNA transfers resulted in numerous SVs that participated in the nuclear genome divergence of rice species and subspecies. We detected 52 giant nuclear integrants of organelle DNA (NORGs, defined as >10 kb) in six Oryza AA genomes. In addition, we developed an effective method to genotype giant NORGs, based on genome assembly, and first showed the dynamic change in the distribution of giant NORGs in rice natural population. Interestingly, 16 highly differentiated giant NORGs tended to accumulate in natural populations of Asian rice from higher latitude regions, grown at lower temperatures and light intensities. Our study provides new insight into the genome divergence of African and Asian rice, and establishes that organelle-to-nucleus DNA transfers, as potentially powerful contributors to environmental adaptation during rice evolution, play a major role in producing SVs in rice genomes.
Collapse
|
123
|
Multifactorial Causes of Chronic Mortality in Juvenile Sturgeon ( Huso huso). Animals (Basel) 2020; 10:ani10101866. [PMID: 33066257 PMCID: PMC7602020 DOI: 10.3390/ani10101866] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Revised: 10/08/2020] [Accepted: 10/09/2020] [Indexed: 01/05/2023] Open
Abstract
This investigation focused on an episode of chronic mortality observed in juvenile Huso huso sturgeons. The examined subjects underwent pathological, microbiological, molecular, and chemical investigations. Grossly severe body shape deformities, epaxial muscle softening, and multifocal ulcerative dermatitis were the main observed findings. The more constant histopathologic findings were moderate to severe rarefaction and disorganization of the lymphohematopoietic lymphoid tissues, myofiber degeneration, atrophy and interstitial edema of skeletal epaxial muscles, and degeneration and atrophy of the gangliar neurons close to the myofibers. Chemical investigations showed a lower selenium concentration in affected animals, suggesting nutritional myopathy. Other manifestations were nephrocalcinosis and splenic vessel wall hyalinosis. Septicemia due to bacteria such as Aeromonas veronii, Shewanella putrefaciens, Citrobacter freundii, Chryseobacterium sp., and pigmented hyphae were found. No major sturgeon viral pathogens were detected by classical methods. Next-generation sequencing (NGS) analysis confirmed the absence of viral pathogens, with the exception of herpesvirus, at the order level; also, the presence of Aeromonas veronii and Shewanella putrefaciens was confirmed at the family level by the metagenomic classification of NGS data. In the absence of a primary yet undetected biological cause, it is supposed that environmental stressors, including nutritional imbalances, may have led to immune system impairment, facilitating the entry of opportunistic bacteria and mycotic hyphae.
Collapse
|
124
|
Han B, Jing Y, Dai J, Zheng T, Gu F, Zhao Q, Zhu F, Song X, Deng H, Wei P, Song C, Liu D, Jiang X, Wang F, Chen Y, Sun C, Yao H, Zhang L, Chen N, Chen S, Li X, Wei Y, Ouyang Z, Yan H, Lu J, Wang H, Guo L, Kong L, Zhao J, Li S, Luo L, Kristiansen K, Feng Z, Sun S, Chen C, Yue Z, Chen N. A Chromosome-Level Genome Assembly of Dendrobium Huoshanense Using Long Reads and Hi-C Data. Genome Biol Evol 2020; 12:2486-2490. [PMID: 33045048 PMCID: PMC7846097 DOI: 10.1093/gbe/evaa215] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/07/2020] [Indexed: 01/03/2023] Open
Abstract
Dendrobium huoshanense is used to treat various diseases in traditional Chinese medicine. Recent studies have identified active components. However, the lack of genomic data limits research on the biosynthesis and application of these therapeutic ingredients. To address this issue, we generated the first chromosome-level genome assembly and annotation of D. huoshanense. We integrated PacBio sequencing data, Illumina paired-end sequencing data, and Hi-C sequencing data to assemble a 1.285 Gb genome, with contig and scaffold N50 lengths of 598 kb and 71.79 Mb, respectively. We annotated 21,070 protein-coding genes and 0.96 Gb transposable elements, constituting 74.92% of the whole assembly. In addition, we identified 252 genes responsible for polysaccharide biosynthesis by Kyoto Encyclopedia of Genes and Genomes functional annotation. Our data provide a basis for further functional studies, particularly those focused on genes related to glycan biosynthesis and metabolism, and have implications for both conservation and medicine.
Collapse
|
125
|
Yin J, Jiang L, Wang L, Han X, Guo W, Li C, Zhou Y, Denton M, Zhang P. A high-quality genome of taro (Colocasia esculenta (L.) Schott), one of the world's oldest crops. Mol Ecol Resour 2020; 21:68-77. [PMID: 32790213 DOI: 10.1111/1755-0998.13239] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 07/30/2020] [Accepted: 07/31/2020] [Indexed: 01/04/2023]
Abstract
Taro (Colocasia esculenta (L.), Schott), from the Araceae family, is one of the oldest crops with important edible, medicinal, nutritional and economic value. Taro is a highly polymorphic species including diverse genotypes adapted to a broad range of environments, but the taro genome has rarely been investigated. Here, a high-quality chromosome-level genome of C. esculenta was assembled using data sequenced by Illumina, PacBio and Nanopore platforms. The assembled genome size was 2,405 Mb with a contig N50 of 400.0 kb and a scaffold N50 of 159.4 Mb. In total, 2,311 Mb (96.09%) of the contig sequences was anchored onto 14 chromosomes to form pseudomolecules, and 2,126 Mb (88.43%) was annotated as repetitive sequences. Of the 28,695 predicted protein-coding genes, 26,215 genes (91.4%) could be functionally annotated. On the basis of phylogenetic analysis using 769 genes, C. esculenta and Spirodela polyrhiza were placed on one branch of the tree that diverged approximately 73.23 million years ago. The synteny analyses showed that there have been two whole-genome duplication events in C. esculenta separated by a relatively short gap. According to comparative genome analysis, a larger number (1,189) of distinct gene families and long terminal repeats were enriched in C. esculenta. Our high-quality taro genome will provide valuable resources for further genetic, ecological and evolutionary analyses of taro or other species in the Araceae.
Collapse
|