1
|
Low ETL, Chan KL, Zaki NM, Taranenko E, Ordway JM, Wischmeyer C, Buntjer J, Halim MAA, Sanusi NSNM, Nagappan J, Rosli R, Bondar E, Amiruddin N, Sarpan N, Ting NC, Chan PL, Ong-Abdullah M, Marjuni M, Mustaffa S, Abdullah N, Azizi N, Bacher B, Lakey N, Tatarinova TV, Manaf MAA, Sambanthamurti R, Singh R. Chromosome-scale Elaeis guineensis and E. oleifera assemblies: comparative genomics of oil palm and other Arecaceae. G3 (BETHESDA, MD.) 2024; 14:jkae135. [PMID: 38918881 PMCID: PMC11373658 DOI: 10.1093/g3journal/jkae135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 04/18/2023] [Accepted: 05/16/2024] [Indexed: 06/27/2024]
Abstract
Elaeis guineensis and E. oleifera are the two species of oil palm. E. guineensis is the most widely cultivated commercial species, and introgression of desirable traits from E. oleifera is ongoing. We report an improved E. guineensis genome assembly with substantially increased continuity and completeness, as well as the first chromosome-scale E. oleifera genome assembly. Each assembly was obtained by integration of long-read sequencing, proximity ligation sequencing, optical mapping, and genetic mapping. High interspecific genome conservation is observed between the two species. The study provides the most extensive gene annotation to date, including 46,697 E. guineensis and 38,658 E. oleifera gene predictions. Analyses of repetitive element families further resolve the DNA repeat architecture of both genomes. Comparative genomic analyses identified experimentally validated small structural variants between the oil palm species and resolved the mechanism of chromosomal fusions responsible for the evolutionary descending dysploidy from 18 to 16 chromosomes.
Collapse
Affiliation(s)
- Eng-Ti Leslie Low
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Kuang-Lim Chan
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Noorhariza Mohd Zaki
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | | | - Jared M Ordway
- Orion Genomics, 3730 Foundry Way, St. Louis, MO 63110, USA
| | | | - Jaap Buntjer
- Orion Genomics, 3730 Foundry Way, St. Louis, MO 63110, USA
| | - Mohd Amin Ab Halim
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Nik Shazana Nik Mohd Sanusi
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Jayanthi Nagappan
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Rozana Rosli
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Eugeniya Bondar
- Biology Department, University of La Verne, La Verne, CA 91750, USA
| | - Nadzirah Amiruddin
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Norashikin Sarpan
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Ngoot-Chin Ting
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Pek-Lan Chan
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Meilina Ong-Abdullah
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Marhalil Marjuni
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Suzana Mustaffa
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Norziha Abdullah
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Norazah Azizi
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Blaire Bacher
- Orion Genomics, 3730 Foundry Way, St. Louis, MO 63110, USA
| | - Nathan Lakey
- Orion Genomics, 3730 Foundry Way, St. Louis, MO 63110, USA
| | | | - Mohamad Arif Abd Manaf
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Ravigadevi Sambanthamurti
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Rajinder Singh
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| |
Collapse
|
2
|
Lee J, Kang YJ, Park H, Shim S, Ha J, Lee T, Kim MY, Lee SH. Unraveling the maternal and paternal origins of allotetraploid Vigna reflexo-pilosa. Sci Rep 2023; 13:22951. [PMID: 38135720 PMCID: PMC10746702 DOI: 10.1038/s41598-023-49908-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 12/13/2023] [Indexed: 12/24/2023] Open
Abstract
The genomic structures of Vigna hirtella Ridl. and Vigna trinervia (B.Heyne ex Wight & Arn.) Tateishi & Maxted, key ancestral species of the allotetraploid Vigna reflexo-pilosa var. glabra (Roxb.) N.Tomooka & Maxted, remain poorly understood. This study presents a comprehensive genomic comparison of these species to deepen our knowledge of their evolutionary trajectories. By comparing the genomic profiles of V. hirtella and V. trinervia with those of V. reflexo-pilosa, we investigate the complex genomic mechanisms underlying allopolyploid evolution within the genus Vigna. Comparison of the chloroplast genome revealed that V. trinervia is closely related to V. reflexo-pilosa. De novo assembly of the whole genome, followed by synteny analysis and Ks value calculations, confirms that V. trinervia is closely related to the A genome of V. reflexo-pilosa, and V. hirtella to its B genome. Furthermore, the comparative analyses reveal that V. reflexo-pilosa retains residual signatures of a previous polyploidization event, particularly evident in higher gene family copy numbers. Our research provides genomic evidence for polyploidization within the genus Vigna and identifies potential donor species of allotetraploid species using de novo assembly techniques. Given the Southeast Asian distribution of both V. hirtella and V. trinervia, natural hybridization between these species, with V. trinervia as the maternal ancestor and V. hirtella as the paternal donor, seems plausible.
Collapse
Affiliation(s)
- Jayern Lee
- Department of Agriculture, Forestry and Bioresources and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
- , Macrogen, Seoul, Republic of Korea
| | - Yang Jae Kang
- Division of Bio & Medical Bigdata Department (BK4 Program), Gyeongsang National University, Jinju, Republic of Korea
- Division of Life Science Department at, Gyeongsang National University, Jinju, Republic of Korea
| | - Halim Park
- Division of Bio & Medical Bigdata Department (BK4 Program), Gyeongsang National University, Jinju, Republic of Korea
| | - Sangrea Shim
- Department of Forest Resources, College of Forest and Environmental Sciences, Kangwon National University, Chuncheon, Republic of Korea
| | - Jungmin Ha
- Department of Plant Science, Gangneung-Wonju National University, Gangneung, Republic of Korea
| | | | - Moon Young Kim
- Department of Agriculture, Forestry and Bioresources and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
- Plant Genomics and Breeding Institute, Seoul National University, Seoul, Republic of Korea
| | - Suk-Ha Lee
- Department of Agriculture, Forestry and Bioresources and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea.
- Plant Genomics and Breeding Institute, Seoul National University, Seoul, Republic of Korea.
| |
Collapse
|
3
|
Gaikwad K, Ramakrishna G, Srivastava H, Saxena S, Kaila T, Tyagi A, Sharma P, Sharma S, Sharma R, Mahla HR, Kumar K, Sv AM, Solanke AU, Kalia P, Rao AR, Rai A, Sharma TR, Singh NK. The chromosome-scale genome assembly of cluster bean provides molecular insight into edible gum (galactomannan) biosynthesis family genes. Sci Rep 2023; 13:9941. [PMID: 37336893 DOI: 10.1038/s41598-023-33762-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 04/18/2023] [Indexed: 06/21/2023] Open
Abstract
Cluster bean (Cyamopsis tetragonoloba (L.) Taub 2n = 14, is commonly known as Guar. Apart from being a vegetable crop, it is an abundant source of a natural hetero-polysaccharide called guar gum or galactomannan. Here, we are reporting a chromosome-scale reference genome assembly of a popular cluster bean cultivar RGC-936, by combining sequencing data from Illumina, 10X Genomics, Oxford Nanopore technologies. An initial assembly of 1580 scaffolds with an N50 value of 7.12 Mb was generated and these scaffolds were anchored to a high density SNP linkage map. Finally, a genome assembly of 550.31 Mb (94% of the estimated genome size of ~ 580 Mb (through flow cytometry) with 58 scaffolds was obtained, including 7 super scaffolds with a very high N50 value of 78.27 Mb. Phylogenetic analysis using single copy orthologs among 12 angiosperms showed that cluster bean shared a common ancestor with other legumes 80.6 MYA. No evidence of recent whole genome duplication event in cluster bean was found in our analysis. Further comparative transcriptomics analyses revealed pod-specific up-regulation of genes encoding enzymes involved in galactomannan biosynthesis. The high-quality chromosome-scale cluster bean genome assembly will facilitate understanding of the molecular basis of galactomannan biosynthesis and aid in genomics-assisted improvement of cluster bean.
Collapse
Affiliation(s)
- Kishor Gaikwad
- ICAR-National Institute for Plant Biotechnology, New Delhi, India.
| | | | | | - Swati Saxena
- ICAR-National Institute for Plant Biotechnology, New Delhi, India
| | - Tanvi Kaila
- ICAR-National Institute for Plant Biotechnology, New Delhi, India
| | - Anshika Tyagi
- ICAR-National Institute for Plant Biotechnology, New Delhi, India
| | - Priya Sharma
- ICAR-National Institute for Plant Biotechnology, New Delhi, India
| | - Sandhya Sharma
- ICAR-National Institute for Plant Biotechnology, New Delhi, India
| | - R Sharma
- ICAR-Central Arid Zone Research Institute, Jodhpur, India
| | - H R Mahla
- ICAR-Central Arid Zone Research Institute, Jodhpur, India
| | - Kuldeep Kumar
- ICAR-National Institute for Plant Biotechnology, New Delhi, India
| | - Amitha Mithra Sv
- ICAR-National Institute for Plant Biotechnology, New Delhi, India
| | | | - Pritam Kalia
- Division of Vegetable Sciences, ICAR-Indian Agricultural Research Institute, New Delhi, India
| | - A R Rao
- ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Anil Rai
- ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - T R Sharma
- DDG (CS), Indian Council of Agricultural Research, New Delhi, India
| | - N K Singh
- ICAR-National Institute for Plant Biotechnology, New Delhi, India
| |
Collapse
|
4
|
Kim J, Lee EJ, Lee KE, Nho YH, Ryu J, Kim SY, Yoo JK, Kang S, Seo SW. Docsubty: FLALipid extract derived from newly isolated Rhodotorula toruloides LAB-07 for cosmetic applications. Comput Struct Biotechnol J 2023; 21:2009-2017. [PMID: 36968014 PMCID: PMC10036517 DOI: 10.1016/j.csbj.2023.03.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 03/10/2023] [Accepted: 03/11/2023] [Indexed: 03/15/2023] Open
Abstract
Rhodotorula toruloides is a non-conventional yeast with a natural carotenoid pathway. In particular, R. toruloides is an oleaginous yeast that can accumulate lipids in high content, thereby gaining interest as a promising industrial host. In this study, we isolated and taxonomically identified a new R. toruloides LAB-07 strain. De novo genome assembly using PacBio and Illumina hybrid platforms yielded 27 contigs with a 20.78 Mb genome size. Subsequent genome annotation analysis based on RNA-seq predicted 5296 protein-coding genes, including the fatty acid production pathway. We compared lipid production under different media; it was highest in the yeast extract salt medium with glycerol as a carbon source. Polyunsaturated α-linolenic acid was detected among the fatty acids, and docking phosphatidylcholine as a substrate to modeled Fad2, which annotated as Δ12-fatty acid desaturase showed bifunctional Δ12, 15-desaturation is structurally possible in that the distances between the diiron center and the carbon-carbon bond in which desaturation occurs were similar to those of structurally identified mouse stearoyl-CoA desaturase. Finally, the applicability of the extracted total lipid fraction of R. toruloides was investigated, demonstrating an increase in filaggrin expression and suppression of heat-induced MMP-1 expression when applied to keratinocytes, along with the additional antioxidant activity. This work presents a new R. toruloides LAB-07 strain with genomic and lipidomic data, which would help understand the physiology of R. toruloides. Also, the various skin-related effect of R. toruloides lipid extract indicates its potential usage as a promising cosmetic ingredient.
Collapse
|
5
|
Abdullah-Zawawi MR, Govender N, Harun S, Muhammad NAN, Zainal Z, Mohamed-Hussein ZA. Multi-Omics Approaches and Resources for Systems-Level Gene Function Prediction in the Plant Kingdom. PLANTS (BASEL, SWITZERLAND) 2022; 11:2614. [PMID: 36235479 PMCID: PMC9573505 DOI: 10.3390/plants11192614] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 09/05/2022] [Accepted: 09/13/2022] [Indexed: 06/16/2023]
Abstract
In higher plants, the complexity of a system and the components within and among species are rapidly dissected by omics technologies. Multi-omics datasets are integrated to infer and enable a comprehensive understanding of the life processes of organisms of interest. Further, growing open-source datasets coupled with the emergence of high-performance computing and development of computational tools for biological sciences have assisted in silico functional prediction of unknown genes, proteins and metabolites, otherwise known as uncharacterized. The systems biology approach includes data collection and filtration, system modelling, experimentation and the establishment of new hypotheses for experimental validation. Informatics technologies add meaningful sense to the output generated by complex bioinformatics algorithms, which are now freely available in a user-friendly graphical user interface. These resources accentuate gene function prediction at a relatively minimal cost and effort. Herein, we present a comprehensive view of relevant approaches available for system-level gene function prediction in the plant kingdom. Together, the most recent applications and sought-after principles for gene mining are discussed to benefit the plant research community. A realistic tabulation of plant genomic resources is included for a less laborious and accurate candidate gene discovery in basic plant research and improvement strategies.
Collapse
Affiliation(s)
- Muhammad-Redha Abdullah-Zawawi
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, Kuala Lumpur 56000, Malaysia
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Nisha Govender
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Sarahani Harun
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Nor Azlan Nor Muhammad
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Zamri Zainal
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
- Faculty of Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Zeti-Azura Mohamed-Hussein
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
- Faculty of Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| |
Collapse
|
6
|
Kim MS, Lee T, Baek J, Kim JH, Kim C, Jeong SC. Genome assembly of the popular Korean soybean cultivar Hwangkeum. G3 (BETHESDA, MD.) 2021; 11:jkab272. [PMID: 34568925 PMCID: PMC8496230 DOI: 10.1093/g3journal/jkab272] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/18/2021] [Accepted: 07/27/2021] [Indexed: 01/01/2023]
Abstract
Massive resequencing efforts have been undertaken to catalog allelic variants in major crop species including soybean, but the scope of the information for genetic variation often depends on short sequence reads mapped to the extant reference genome. Additional de novo assembled genome sequences provide a unique opportunity to explore a dispensable genome fraction in the pan-genome of a species. Here, we report the de novo assembly and annotation of Hwangkeum, a popular soybean cultivar in Korea. The assembly was constructed using PromethION nanopore sequencing data and two genetic maps and was then error-corrected using Illumina short-reads and PacBio SMRT reads. The 933.12 Mb assembly was annotated as containing 79,870 transcripts for 58,550 genes using RNA-Seq data and the public soybean annotation set. Comparison of the Hwangkeum assembly with the Williams 82 soybean reference genome sequence (Wm82.a2.v1) revealed 1.8 million single-nucleotide polymorphisms, 0.5 million indels, and 25 thousand putative structural variants. However, there was no natural megabase-scale chromosomal rearrangement. Incidentally, by adding two novel subfamilies, we found that soybean contains four clearly separated subfamilies of centromeric satellite repeats. Analyses of satellite repeats and gene content suggested that the Hwangkeum assembly is a high-quality assembly. This was further supported by comparison of the marker arrangement of anthocyanin biosynthesis genes and of gene arrangement at the Rsv3 locus. Therefore, the results indicate that the de novo assembly of Hwangkeum is a valuable additional reference genome resource for characterizing traits for the improvement of this important crop species.
Collapse
Affiliation(s)
- Myung-Shin Kim
- Bio-Evaluation Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju, Chungbuk 28116, Republic of Korea
- Plant Immunity Research Center, Interdisciplinary Program in Agricultural Genomics, College of Agriculture and Life Sciences, Seoul National University, Seoul 08826, Republic of Korea
| | - Taeyoung Lee
- Bioinformatics Institute, Macrogen Inc., Seoul 08511, Republic of Korea
| | - Jeonghun Baek
- Bioinformatics Institute, Macrogen Inc., Seoul 08511, Republic of Korea
| | - Ji Hong Kim
- Bio-Evaluation Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju, Chungbuk 28116, Republic of Korea
| | - Changhoon Kim
- Bioinformatics Institute, Macrogen Inc., Seoul 08511, Republic of Korea
| | - Soon-Chun Jeong
- Bio-Evaluation Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju, Chungbuk 28116, Republic of Korea
| |
Collapse
|
7
|
Hu XL, Lu H, Hassan MM, Zhang J, Yuan G, Abraham PE, Shrestha HK, Villalobos Solis MI, Chen JG, Tschaplinski TJ, Doktycz MJ, Tuskan GA, Cheng ZMM, Yang X. Advances and perspectives in discovery and functional analysis of small secreted proteins in plants. HORTICULTURE RESEARCH 2021; 8:130. [PMID: 34059650 PMCID: PMC8167165 DOI: 10.1038/s41438-021-00570-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 04/26/2021] [Indexed: 05/02/2023]
Abstract
Small secreted proteins (SSPs) are less than 250 amino acids in length and are actively transported out of cells through conventional protein secretion pathways or unconventional protein secretion pathways. In plants, SSPs have been found to play important roles in various processes, including plant growth and development, plant response to abiotic and biotic stresses, and beneficial plant-microbe interactions. Over the past 10 years, substantial progress has been made in the identification and functional characterization of SSPs in several plant species relevant to agriculture, bioenergy, and horticulture. Yet, there are potentially a lot of SSPs that have not been discovered in plant genomes, which is largely due to limitations of existing computational algorithms. Recent advances in genomics, transcriptomics, and proteomics research, as well as the development of new computational algorithms based on machine learning, provide unprecedented capabilities for genome-wide discovery of novel SSPs in plants. In this review, we summarize known SSPs and their functions in various plant species. Then we provide an update on the computational and experimental approaches that can be used to discover new SSPs. Finally, we discuss strategies for elucidating the biological functions of SSPs in plants.
Collapse
Affiliation(s)
- Xiao-Li Hu
- Department of Plant Sciences, University of Tennessee, Knoxville, TN, USA
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | - Haiwei Lu
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | | | - Jin Zhang
- State Key Laboratory of Subtropical Silviculture, School of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang, China
| | - Guoliang Yuan
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
- The Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | - Paul E Abraham
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | - Him K Shrestha
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
- Department of Genome Science and Technology, University of Tennessee, Knoxville, TN, USA
| | | | - Jin-Gui Chen
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
- The Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | - Timothy J Tschaplinski
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
- The Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | - Mitchel J Doktycz
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | - Gerald A Tuskan
- The Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | - Zong-Ming Max Cheng
- Department of Plant Sciences, University of Tennessee, Knoxville, TN, USA.
- College of Horticulture, Nanjing Agricultural University, Nanjing, Jiangsu, China.
| | - Xiaohan Yang
- Department of Plant Sciences, University of Tennessee, Knoxville, TN, USA.
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA.
- The Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN, USA.
| |
Collapse
|
8
|
Muliyar RK, Chowdappa P, Behera SK, Kasaragod S, Gangaraj KP, Kotimoole CN, Nekrakalaya B, Mohanty V, Sampgod RB, Banerjee G, Das AJ, Niral V, Karun A, Mahato AK, Gaikwad K, Singh NK, Prasad TSK. Assembly and Annotation of the Nuclear and Organellar Genomes of a Dwarf Coconut (Chowghat Green Dwarf) Possessing Enhanced Disease Resistance. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2020; 24:726-742. [PMID: 33170083 DOI: 10.1089/omi.2020.0147] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Coconut (Cocos nucifera L.), an important source of vegetable oil, nutraceuticals, functional foods, and housing materials, provides raw materials for a repertoire of industries engaged in the manufacture of cosmetics, soaps, detergents, paints, varnishes, and emulsifiers, among other products. The palm plays a vital role in maintaining and promoting the sustainability of farming systems of the fragile ecosystems of islands and coastal regions of the tropics. In this study, we present the genome of a dwarf coconut variety "Chowghat Green Dwarf" (CGD) from India, possessing enhanced resistance to root (wilt) disease. Utilizing short reads from the Illumina HiSeq 4000 platform and long reads from the Pacific Biosciences RSII platform, we have assembled the draft genome assembly of 1.93 Gb. The genome is distributed over 26,855 scaffolds, with ∼81.56% of the assembled genome present in scaffolds of lengths longer than 50 kb. About 77.29% of the genome was composed of transposable elements and repeats. Gene prediction yielded 51,953 genes, which upon stringent filtering, based on Annotation Edit Distance, resulted in 13,707 genes, which coded for 11,181 proteins. Among these, we gathered transcript level evidence for a total of 6828 predicted genes based on the RNA-Seq data from different coconut tissues, since they presented assembled transcripts within the genome annotation coordinates. A total of 112 nucleotide-binding and leucine-rich repeat loci, belonging to six classes, were detected. We have also undertaken the assembly and annotation of the CGD chloroplast and mitochondrial genomes. The availability of the dwarf coconut genome shall prove invaluable for deducing the origin of dwarf coconut cultivars, dissection of genes controlling plant habit and fruit color, and accelerated breeding for improved agronomic traits.
Collapse
Affiliation(s)
| | - Pallem Chowdappa
- ICAR-Central Plantation Crops Research Institute (CPCRI), Kasaragod, India
| | - Santosh Kumar Behera
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | - Sandeep Kasaragod
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | | | - Chinmaya Narayana Kotimoole
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | - Bhagya Nekrakalaya
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | - Varshasnata Mohanty
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | | | | | | | - Vittal Niral
- ICAR-Central Plantation Crops Research Institute (CPCRI), Kasaragod, India
| | - Anitha Karun
- ICAR-Central Plantation Crops Research Institute (CPCRI), Kasaragod, India
| | - Ajay Kumar Mahato
- ICAR-National Research Center on Plant Biotechnology, Pusa Campus, New Delhi, India
| | - Kishor Gaikwad
- ICAR-National Research Center on Plant Biotechnology, Pusa Campus, New Delhi, India
| | - Nagendra Kumar Singh
- ICAR-National Research Center on Plant Biotechnology, Pusa Campus, New Delhi, India
| | | |
Collapse
|
9
|
Sarpan N, Taranenko E, Ooi SE, Low ETL, Espinoza A, Tatarinova TV, Ong-Abdullah M. DNA methylation changes in clonally propagated oil palm. PLANT CELL REPORTS 2020; 39:1219-1233. [PMID: 32591850 DOI: 10.1007/s00299-020-02561-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Accepted: 06/17/2020] [Indexed: 06/11/2023]
Abstract
Several hypomethylated sites within the Karma region of EgDEF1 and hotspot regions in chromosomes 1, 2, 3, and 5 may be associated with mantling. One of the main challenges faced by the oil palm industry is fruit abnormalities, such as the "mantled" phenotype that can lead to reduced yields. This clonal abnormality is an epigenetic phenomenon and has been linked to the hypomethylation of a transposable element within the EgDEF1 gene. To understand the epigenome changes in clones, methylomes of clonal oil palms were compared to methylomes of seedling-derived oil palms. Whole-genome bisulfite sequencing data from seedlings, normal, and mantled clones were analyzed to determine and compare the context-specific DNA methylomes. In seedlings, coding and regulatory regions are generally hypomethylated while introns and repeats are extensively methylated. Genes with a low number of guanines and cytosines in the third position of codons (GC3-poor genes) were increasingly methylated towards their 3' region, while GC3-rich genes remain demethylated, similar to patterns in other eukaryotic species. Predicted promoter regions were generally hypomethylated in seedlings. In clones, CG, CHG, and CHH methylation levels generally decreased in functionally important regions, such as promoters, 5' UTRs, and coding regions. Although random regions were found to be hypomethylated in clonal genomes, hypomethylation of certain hotspot regions may be associated with the clonal mantling phenotype. Our findings, therefore, suggest other hypomethylated CHG sites within the Karma of EgDEF1 and hypomethylated hotspot regions in chromosomes 1, 2, 3 and 5, are associated with mantling.
Collapse
Affiliation(s)
- Norashikin Sarpan
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000, Kajang, Selangor, Malaysia
| | - Elizaveta Taranenko
- Department of Biology, University of La Verne, La Verne, CA, USA
- Department of Fundamental Biology and Biotechnology, Siberian Federal University, 660074, Krasnoyarsk, Russia
| | - Siew-Eng Ooi
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000, Kajang, Selangor, Malaysia
| | - Eng-Ti Leslie Low
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000, Kajang, Selangor, Malaysia
| | | | - Tatiana V Tatarinova
- Department of Biology, University of La Verne, La Verne, CA, USA.
- Department of Fundamental Biology and Biotechnology, Siberian Federal University, 660074, Krasnoyarsk, Russia.
- Vavilov Institute for General Genetics, Moscow, Russia.
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia.
| | - Meilina Ong-Abdullah
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000, Kajang, Selangor, Malaysia.
| |
Collapse
|
10
|
Kim JH, Kim HK, Kim H, Chan BKK, Kang S, Kim W. Draft Genome Assembly of a Fouling Barnacle, Amphibalanus amphitrite (Darwin, 1854): The First Reference Genome for Thecostraca. Front Ecol Evol 2019. [DOI: 10.3389/fevo.2019.00465] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
|
11
|
Pachganov S, Murtazalieva K, Zarubin A, Sokolov D, Chartier DR, Tatarinova TV. TransPrise: a novel machine learning approach for eukaryotic promoter prediction. PeerJ 2019; 7:e7990. [PMID: 31695967 PMCID: PMC6827441 DOI: 10.7717/peerj.7990] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2019] [Accepted: 10/04/2019] [Indexed: 02/01/2023] Open
Abstract
As interest in genetic resequencing increases, so does the need for effective mathematical, computational, and statistical approaches. One of the difficult problems in genome annotation is determination of precise positions of transcription start sites. In this paper we present TransPrise-an efficient deep learning tool for prediction of positions of eukaryotic transcription start sites. Our pipeline consists of two parts: the binary classifier operates the first, and if a sequence is classified as TSS-containing the regression step follows, where the precise location of TSS is being identified. TransPrise offers significant improvement over existing promoter-prediction methods. To illustrate this, we compared predictions of TransPrise classification and regression models with the TSSPlant approach for the well annotated genome of Oryza sativa. Using a computer equipped with a graphics processing unit, the run time of TransPrise is 250 minutes on a genome of 374 Mb long. The Matthews correlation coefficient value for TransPrise is 0.79, more than two times larger than the 0.31 for TSSPlant classification models. This represents a high level of prediction accuracy. Additionally, the mean absolute error for the regression model is 29.19 nt, allowing for accurate prediction of TSS location. TransPrise was also tested in Homo sapiens, where mean absolute error of the regression model was 47.986 nt. We provide the full basis for the comparison and encourage users to freely access a set of our computational tools to facilitate and streamline their own analyses. The ready-to-use Docker image with all necessary packages, models, code as well as the source code of the TransPrise algorithm are available at (http://compubioverne.group/). The source code is ready to use and customizable to predict TSS in any eukaryotic organism.
Collapse
Affiliation(s)
- Stepan Pachganov
- Ugra Research Institute of Information Technologies, Khanty-Mansiysk, Russia
| | - Khalimat Murtazalieva
- Vavilov Institute for General Genetics, Moscow, Russia.,Institute of Bioinformatics, Moscow, Russia
| | - Aleksei Zarubin
- Tomsk National Research Medical Center of the Russian Academy of Sciences, Research Institute of Medical Genetics, Tomsk, Russia
| | | | - Duane R Chartier
- International Center for Art Intelligence, Inc., Los Angeles, CA, United States of America
| | - Tatiana V Tatarinova
- Vavilov Institute for General Genetics, Moscow, Russia.,Department of Biology, University of La Verne, La Verne, CA, United States of America.,A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia.,Siberian Federal University, Krasnoyarsk, Russia
| |
Collapse
|
12
|
Kim KU, Kim KM, Choi YH, Hurh BS, Lee I. Whole genome analysis of Aspergillus sojae SMF 134 supports its merits as a starter for soybean fermentation. J Microbiol 2019; 57:874-883. [PMID: 31250400 DOI: 10.1007/s12275-019-9152-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Revised: 05/13/2019] [Accepted: 05/16/2019] [Indexed: 10/26/2022]
Abstract
Aspergillus sojae is a koji (starter) mold that has been applied for food fermentation in Asia. The whole genome of A. sojae SMF 134, which was isolated from meju (Korean soybean fermented brick), was analyzed at the genomic level to evaluate its potential as a starter for soybean fermentation. The genome size was 40.1 Mbp, which was expected to be composed of eight chromosomes with 13,748 ORFs. Strain SMF 134 had a total of 151 protease genes, among which two more leucine aminopeptidase (lap) genes were found in addition to the previously known lap 1, and three γ-glutamyltranspeptidase (ggt) genes were newly identified. Such genomic characteristics of SMF 134 with many protease and flavor-related (lap and ggt) genes support its merits as a starter for soybean fermentation. In addition, this first complete genome of A. sojae will allow for further genetic studies to better understand the production of various enzymes, including proteases, LAPs, and GGTs, as well as other characteristics as a starter mold for soybean fermentation.
Collapse
Affiliation(s)
- Kang Uk Kim
- Department of Bio and Fermentation Convergence Technology, BK21 PLUS Project, Kookmin University, Seoul, 02707, Republic of Korea
| | - Kyung Min Kim
- Department of Bio and Fermentation Convergence Technology, BK21 PLUS Project, Kookmin University, Seoul, 02707, Republic of Korea
| | - Yong-Ho Choi
- Sempio Fermentation Research Center, Sempio Foods Company, Cheongju, 28156, Republic of Korea
| | - Byung-Serk Hurh
- Sempio Fermentation Research Center, Sempio Foods Company, Cheongju, 28156, Republic of Korea
| | - Inhyung Lee
- Department of Bio and Fermentation Convergence Technology, BK21 PLUS Project, Kookmin University, Seoul, 02707, Republic of Korea.
| |
Collapse
|
13
|
Ooi SE, Sarpan N, Abdul Aziz N, Nuraziyan A, Ong-Abdullah M. Differential expression of heat shock and floral regulatory genes in pseudocarpel initials of mantled female inflorescences from Elaeis guineensis Jacq. PLANT REPRODUCTION 2019; 32:167-179. [PMID: 30467592 DOI: 10.1007/s00497-018-0350-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2018] [Accepted: 11/19/2018] [Indexed: 06/09/2023]
Abstract
Transcriptomes generated by laser capture microdissected abnormal staminodes revealed adoption of carpel programming during organ initiation with decreased expression of numerousHSPs,EgDEF1, EgGLO1but increasedLEAFYexpression. The abnormal mantled phenotype in oil palm involves a feminization of the male staminodes into pseudocarpels in pistillate inflorescences. Previous studies on oil palm flowering utilized entire inflorescences or spikelets, which comprised not only the male and female floral organs, but the surrounding tissues as well. Laser capture microdissection coupled with RNA sequencing was conducted to investigate the specific transcriptomes of male and female floral organs from normal and mantled female inflorescences. A higher number of differentially expressed genes (DEGs) were identified in abnormal versus normal male organs compared with abnormal versus normal female organs. In addition, the abnormal male organ transcriptome closely mimics the transcriptome of abnormal female organ. While the transcriptome of abnormal female organ was relatively similar to the normal female organ, a substantial amount of female DEGs encode HEAT SHOCK PROTEIN genes (HSPs). A similar high amount (20%) of male DEGs encode HSPs as well. As these genes exhibited decreased expression in abnormal floral organs, mantled floral organ development may be associated with lower stress indicators. Stamen identity genes EgDEF1 and EgGLO1 were the main floral regulatory genes with decreased expression in abnormal male organs or pseudocarpel initials. Expression of several floral transcription factors was elevated in pseudocarpel initials, notably LEAFY, FIL and DL orthologs, substantiating the carpel specification programming of abnormal staminodes. Specific transcriptomes thus obtained through this approach revealed a host of differentially regulated genes in pseudocarpel initials compared to normal male staminodes.
Collapse
Affiliation(s)
- Siew-Eng Ooi
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, 43000, Kajang, Selangor, Malaysia.
| | - Norashikin Sarpan
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, 43000, Kajang, Selangor, Malaysia
| | - Norazlin Abdul Aziz
- Molecular Pathology Unit, Cancer Research Centre (CaRC), Institute for Medical Research, Jalan Pahang, 50588, Kuala Lumpur, Malaysia
| | - Azimi Nuraziyan
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, 43000, Kajang, Selangor, Malaysia
| | - Meilina Ong-Abdullah
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, 6 Persiaran Institusi, 43000, Kajang, Selangor, Malaysia.
| |
Collapse
|
14
|
Cook DE, Valle-Inclan JE, Pajoro A, Rovenich H, Thomma BP, Faino L. Long-Read Annotation: Automated Eukaryotic Genome Annotation Based on Long-Read cDNA Sequencing. PLANT PHYSIOLOGY 2019; 179:38-54. [PMID: 30401722 PMCID: PMC6324239 DOI: 10.1104/pp.18.00848] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Accepted: 10/19/2018] [Indexed: 05/16/2023]
Abstract
Single-molecule full-length complementary DNA (cDNA) sequencing can aid genome annotation by revealing transcript structure and alternative splice forms, yet current annotation pipelines do not incorporate such information. Here we present long-read annotation (LoReAn) software, an automated annotation pipeline utilizing short- and long-read cDNA sequencing, protein evidence, and ab initio prediction to generate accurate genome annotations. Based on annotations of two fungal genomes (Verticillium dahliae and Plicaturopsis crispa) and two plant genomes (Arabidopsis [Arabidopsis thaliana] and Oryza sativa), we show that LoReAn outperforms popular annotation pipelines by integrating single-molecule cDNA-sequencing data generated from either the Pacific Biosciences or MinION sequencing platforms, correctly predicting gene structure, and capturing genes missed by other annotation pipelines.
Collapse
Affiliation(s)
- David E. Cook
- Laboratory of Phytopathology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB Wageningen, the Netherlands
| | - Jose Espejo Valle-Inclan
- Laboratory of Phytopathology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB Wageningen, the Netherlands
| | - Alice Pajoro
- Laboratory of Molecular Biology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB Wageningen, the Netherlands
| | - Hanna Rovenich
- Laboratory of Phytopathology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB Wageningen, the Netherlands
| | - Bart P.H.J. Thomma
- Laboratory of Phytopathology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB Wageningen, the Netherlands
- Author for contact:
| | - Luigi Faino
- Laboratory of Phytopathology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB Wageningen, the Netherlands
| |
Collapse
|
15
|
Genomic Analysis of the Insect-Killing Fungus Beauveria bassiana JEF-007 as a Biopesticide. Sci Rep 2018; 8:12388. [PMID: 30120392 PMCID: PMC6098154 DOI: 10.1038/s41598-018-30856-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Accepted: 08/07/2018] [Indexed: 02/07/2023] Open
Abstract
Insect-killing fungi have high potential in pest management. A deeper insight into the fungal genes at the whole genome level is necessary to understand the inter-species or intra-species genetic diversity of fungal genes, and to select excellent isolates. In this work, we conducted a whole genome sequencing of Beauveria bassiana (Bb) JEF-007 and characterized pathogenesis-related features and compared with other isolates including Bb ARSEF2860. A large number of Bb JEF-007 genes showed high identity with Bb ARSEF2860, but some genes showed moderate or low identity. The two Bb isolates showed a significant difference in vegetative growth, antibiotic-susceptibility, and virulence against Tenebrio molitor larvae. When highly identical genes between the two Bb isolates were subjected to real-time PCR, their transcription levels were different, particularly in heat shock protein 30 (hsp30) gene which is related to conidial thermotolerance. In several B. bassiana isolates, chitinases and trypsin-like protease genes involved in pathogenesis were highly conserved, but other genes showed noticeable sequence variation within the same species. Given the transcriptional and genetic diversity in B. bassiana, a selection of virulent isolates with industrial advantages is a pre-requisite, and this genetic approach could support the development of excellent biopesticides with intellectual property protection.
Collapse
|
16
|
Sanusi NSNM, Rosli R, Halim MAA, Chan KL, Nagappan J, Azizi N, Amiruddin N, Tatarinova TV, Low ETL. PalmXplore: oil palm gene database. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:5098614. [PMID: 30239681 PMCID: PMC6146135 DOI: 10.1093/database/bay095] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2018] [Accepted: 08/16/2018] [Indexed: 12/20/2022]
Abstract
A set of Elaeis guineensis genes had been generated by combining two gene prediction pipelines: Fgenesh++ developed by Softberry and Seqping by the Malaysian Palm Oil Board. PalmXplore was developed to provide a scalable data repository and a user-friendly search engine system to efficiently store, manage and retrieve the oil palm gene sequences and annotations. Information deposited in PalmXplore includes predicted genes, their genomic coordinates, as well as the annotations derived from external databases, such as Pfam, Gene Ontology and Kyoto Encyclopedia of Genes and Genomes. Information about genes related to important traits, such as those involved in fatty acid biosynthesis (FAB) and disease resistance, is also provided. The system offers Basic Local Alignment Search Tool homology search, where the results can be downloaded or visualized in the oil palm genome browser (MYPalmViewer). PalmXplore is regularly updated offering new features, improvements to genome annotation and new genomic sequences. The system is freely accessible at http://palmxplore.mpob.gov.my.
Collapse
Affiliation(s)
- Nik Shazana Nik Mohd Sanusi
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, Kajang, Selangor, Malaysia
| | - Rozana Rosli
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, Kajang, Selangor, Malaysia.,Genomics and Computational Biology Research Group, University of South Wales,Pontypridd, Wales, UK
| | - Mohd Amin Ab Halim
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, Kajang, Selangor, Malaysia
| | - Kuang-Lim Chan
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, Kajang, Selangor, Malaysia
| | - Jayanthi Nagappan
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, Kajang, Selangor, Malaysia
| | - Norazah Azizi
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, Kajang, Selangor, Malaysia
| | - Nadzirah Amiruddin
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, Kajang, Selangor, Malaysia
| | - Tatiana V Tatarinova
- Department of Biology, University of La Verne,1950 Third Street La Verne, CA, USA
| | - Eng-Ti Leslie Low
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, Kajang, Selangor, Malaysia
| |
Collapse
|
17
|
Chan KL, Tatarinova TV, Rosli R, Amiruddin N, Azizi N, Halim MAA, Sanusi NSNM, Jayanthi N, Ponomarenko P, Triska M, Solovyev V, Firdaus-Raih M, Sambanthamurthi R, Murphy D, Low ETL. Evidence-based gene models for structural and functional annotations of the oil palm genome. Biol Direct 2017; 12:21. [PMID: 28886750 PMCID: PMC5591544 DOI: 10.1186/s13062-017-0191-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2017] [Accepted: 08/07/2017] [Indexed: 11/13/2022] Open
Abstract
Background Oil palm is an important source of edible oil. The importance of the crop, as well as its long breeding cycle (10-12 years) has led to the sequencing of its genome in 2013 to pave the way for genomics-guided breeding. Nevertheless, the first set of gene predictions, although useful, had many fragmented genes. Classification and characterization of genes associated with traits of interest, such as those for fatty acid biosynthesis and disease resistance, were also limited. Lipid-, especially fatty acid (FA)-related genes are of particular interest for the oil palm as they specify oil yields and quality. This paper presents the characterization of the oil palm genome using different gene prediction methods and comparative genomics analysis, identification of FA biosynthesis and disease resistance genes, and the development of an annotation database and bioinformatics tools. Results Using two independent gene-prediction pipelines, Fgenesh++ and Seqping, 26,059 oil palm genes with transcriptome and RefSeq support were identified from the oil palm genome. These coding regions of the genome have a characteristic broad distribution of GC3 (fraction of cytosine and guanine in the third position of a codon) with over half the GC3-rich genes (GC3 ≥ 0.75286) being intronless. In comparison, only one-seventh of the oil palm genes identified are intronless. Using comparative genomics analysis, characterization of conserved domains and active sites, and expression analysis, 42 key genes involved in FA biosynthesis in oil palm were identified. For three of them, namely EgFABF, EgFABH and EgFAD3, segmental duplication events were detected. Our analysis also identified 210 candidate resistance genes in six classes, grouped by their protein domain structures. Conclusions We present an accurate and comprehensive annotation of the oil palm genome, focusing on analysis of important categories of genes (GC3-rich and intronless), as well as those associated with important functions, such as FA biosynthesis and disease resistance. The study demonstrated the advantages of having an integrated approach to gene prediction and developed a computational framework for combining multiple genome annotations. These results, available in the oil palm annotation database (http://palmxplore.mpob.gov.my), will provide important resources for studies on the genomes of oil palm and related crops. Reviewers This article was reviewed by Alexander Kel, Igor Rogozin, and Vladimir A. Kuznetsov. Electronic supplementary material The online version of this article (doi:10.1186/s13062-017-0191-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Kuang-Lim Chan
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia.,Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600, Bangi, Selangor, Malaysia
| | - Tatiana V Tatarinova
- Department of Biology, University of La Verne, La Verne, California, 91750, USA.,Spatial Sciences Institute, University of Southern California, Los Angeles, CA, 90089, USA
| | - Rozana Rosli
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia.,Genomics and Computational Biology Research Group, University of South Wales, Pontypridd, CF371DL, UK
| | - Nadzirah Amiruddin
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Norazah Azizi
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Mohd Amin Ab Halim
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Nik Shazana Nik Mohd Sanusi
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Nagappan Jayanthi
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Petr Ponomarenko
- Spatial Sciences Institute, University of Southern California, Los Angeles, CA, 90089, USA
| | - Martin Triska
- Children's Hospital Los Angeles, University of Southern California, Los Angeles, CA, 90089, USA
| | - Victor Solovyev
- Softberry Inc., 116 Radio Circle, Suite 400, Mount Kisco, NY, 10549, USA
| | - Mohd Firdaus-Raih
- Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600, Bangi, Selangor, Malaysia
| | - Ravigadevi Sambanthamurthi
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Denis Murphy
- Genomics and Computational Biology Research Group, University of South Wales, Pontypridd, CF371DL, UK
| | - Eng-Ti Leslie Low
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, No. 6, Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia.
| |
Collapse
|
18
|
Hong CY, Lee SY, Ryu SH, Kim M. Whole-genome de novo sequencing of wood rot fungus Fomitopsis palustris (ATCC62978) with both a cellulolytic and ligninolytic enzyme system. J Biotechnol 2017; 251:156-159. [DOI: 10.1016/j.jbiotec.2017.04.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2016] [Revised: 04/06/2017] [Accepted: 04/11/2017] [Indexed: 10/19/2022]
|