1
Dai H, Ai H, Wang Y, Shi J, Ren L, Li J, Tao Y, Xu Z, Zheng J. Molecular Characteristics and Expression Patterns of Carotenoid Cleavage Oxygenase Family Genes in Rice (Oryza sativa L.). Int J Mol Sci 2024; 25:10264. [PMID: 39408594 PMCID: PMC11477027 DOI: 10.3390/ijms251910264] [Received: 08/08/2024] [Revised: 09/14/2024] [Accepted: 09/16/2024] [Indexed: 10/20/2024]
Abstract
Carotenoid cleavage oxygenases (CCOs) cleave carotenoid molecules to produce bioactive products that influence the synthesis of hormones such as abscisic acid (ABA) and strigolactones (SL), which regulate plant growth, development, and stress adaptation. Here, to explore the molecular characteristics of all members of the OsCCO family in rice, fourteen OsCCO family genes were identified in the genome-wide study. The results revealed that the OsCCO family included one OsNCED and four OsCCD subfamilies. The OsCCO family was phylogenetically close to members of the maize ZmCCO family and the Sorghum SbCCO family. A collinearity relationship was observed between OsNCED3 and OsNCED5 in rice, as well as OsCCD7 and OsNCED5 between rice and Arabidopsis, Sorghum, and maize. OsCCD4a and OsCCD7 were the key members in the protein interaction network of the OsCCO family, which was involved in the catabolic processes of carotenoids and terpenoid compounds. miRNAs targeting OsCCO family members were mostly involved in the abiotic stress response, and RNA-seq data further confirmed the molecular properties of OsCCO family genes in response to abiotic stress and hormone induction. qRT-PCR analysis showed the differential expression patterns of OsCCO members across various rice organs. Notably, OsCCD1 showed relatively high expression levels in all organs except for ripening seeds and endosperm. OsNCED2a, OsNCED3, OsCCD1, OsCCD4a, OsCCD7, OsCCD8a, and OsCCD8e were potentially involved in plant growth and differentiation. Meanwhile, OsNCED2a, OsNCED2b, OsNCED5, OsCCD8b, and OsCCD8d were associated with reproductive organ development, flowering, and seed formation. OsNCED3, OsCCD4b, OsCCD4c, OsCCD8b, and OsCCD8c were related to assimilate transport and seed maturation. These findings provide a theoretical basis for further functional analysis of the OsCCO family.
Affiliation(s)
- Hanjing Dai
  - College of Agronomy, Anhui Science and Technology University, Chuzhou 233100, China
- Hao Ai
  - College of Agronomy, Anhui Science and Technology University, Chuzhou 233100, China
- Yingrun Wang
  - College of Agronomy, Anhui Science and Technology University, Chuzhou 233100, China
- Jia Shi
  - College of Agronomy, Anhui Science and Technology University, Chuzhou 233100, China
- Lantian Ren
  - College of Agronomy, Anhui Science and Technology University, Chuzhou 233100, China
- Jieqin Li
  - College of Agronomy, Anhui Science and Technology University, Chuzhou 233100, China
- Yulu Tao
  - College of Agronomy, Anhui Science and Technology University, Chuzhou 233100, China
- Zhaoshi Xu
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China
- Jiacheng Zheng
  - College of Agronomy, Anhui Science and Technology University, Chuzhou 233100, China
2
Sierro N, Auberson M, Dulize R, Ivanov NV. Chromosome-level genome assemblies of Nicotiana tabacum, Nicotiana sylvestris, and Nicotiana tomentosiformis. Sci Data 2024; 11:135. [PMID: 38278835 PMCID: PMC10817978 DOI: 10.1038/s41597-024-02965-2] [Received: 11/09/2023] [Accepted: 01/12/2024] [Indexed: 01/28/2024]
Abstract
The Solanaceae species Nicotiana tabacum, an economically important crop plant cultivated worldwide, is an allotetraploid species that appeared about 200,000 years ago as the result of the hybridization of diploid ancestors of Nicotiana sylvestris and Nicotiana tomentosiformis. The previously published genome assemblies for these three species relied primarily on short reads, and the resulting pseudochromosomes only partially covered the genomes. In this study, we generated annotated de novo chromosome-level genomes of N. tabacum, N. sylvestris, and N. tomentosiformis, which contain 3.99 Gb, 2.32 Gb, and 1.74 Gb of sequence data, respectively, with 97.6%, 99.5%, and 95.9% aligned in chromosomes, and represent 99.2%, 98.3%, and 98.5% of the near-universal single-copy orthologous genes of Solanaceae. The completion levels of these chromosome-level genomes for N. tabacum, N. sylvestris, and N. tomentosiformis are comparable to those of other reference Solanaceae genomes, enabling more efficient synteny-based cross-species research.
Affiliation(s)
- Nicolas Sierro
  - PMI R&D, Philip Morris Products S.A., Quai Jeanrenaud 5, CH-2000, Neuchâtel, Switzerland
- Mehdi Auberson
  - PMI R&D, Philip Morris Products S.A., Quai Jeanrenaud 5, CH-2000, Neuchâtel, Switzerland
- Rémi Dulize
  - PMI R&D, Philip Morris Products S.A., Quai Jeanrenaud 5, CH-2000, Neuchâtel, Switzerland
- Nikolai V Ivanov
  - PMI R&D, Philip Morris Products S.A., Quai Jeanrenaud 5, CH-2000, Neuchâtel, Switzerland
3
Hassan AH, Mokhtar MM, El Allali A. Transposable elements: multifunctional players in the plant genome. Front Plant Sci 2024; 14:1330127. [PMID: 38239225 PMCID: PMC10794571 DOI: 10.3389/fpls.2023.1330127] [Received: 10/30/2023] [Accepted: 12/06/2023] [Indexed: 01/22/2024]
Abstract
Transposable elements (TEs) are indispensable components of eukaryotic genomes that play diverse roles in gene regulation, recombination, and environmental adaptation. Their ability to mobilize within the genome leads to gene expression and DNA structure changes. TEs serve as valuable markers for genetic and evolutionary studies and facilitate genetic mapping and phylogenetic analysis. They also provide insight into how organisms adapt to a changing environment by promoting gene rearrangements that lead to new gene combinations. These repetitive sequences significantly impact genome structure, function and evolution. This review takes a comprehensive look at TEs and their applications in biotechnology, particularly in the context of plant biology, where they are now considered "genomic gold" due to their extensive functionalities. The article addresses various aspects of TEs in plant development, including their structure, epigenetic regulation, evolutionary patterns, and their use in gene editing and plant molecular markers. The goal is to systematically understand TEs and shed light on their diverse roles in plant biology.
Affiliation(s)
- Asmaa H. Hassan
  - Bioinformatics Laboratory, College of Computing, Mohammed VI Polytechnic University, Ben Guerir, Morocco
  - Agricultural Genetic Engineering Research Institute, Agriculture Research Center, Giza, Egypt
- Morad M. Mokhtar
  - Bioinformatics Laboratory, College of Computing, Mohammed VI Polytechnic University, Ben Guerir, Morocco
  - Agricultural Genetic Engineering Research Institute, Agriculture Research Center, Giza, Egypt
- Achraf El Allali
  - Bioinformatics Laboratory, College of Computing, Mohammed VI Polytechnic University, Ben Guerir, Morocco
4
Gao D. Introduction of Plant Transposon Annotation for Beginners. Biology 2023; 12:1468. [PMID: 38132293 PMCID: PMC10741241 DOI: 10.3390/biology12121468] [Received: 11/07/2023] [Revised: 11/21/2023] [Accepted: 11/23/2023] [Indexed: 12/23/2023]
Abstract
Transposons are mobile DNA sequences that make up large fractions of many plant genomes. They provide exclusive resources for tracking gene and genome evolution and for developing molecular tools for basic and applied research. Despite extensive efforts, it is still challenging to annotate transposons accurately, especially for beginners, as transposon prediction requires expertise in both transposon biology and bioinformatics. Moreover, the complexity of plant genomes and the dynamic evolution of transposons also make genome-wide transposon discovery difficult. This review summarizes the three major strategies for transposon detection (repeat-based, structure-based, and homology-based annotation) and introduces the transposon superfamilies identified in plants thus far, along with some related bioinformatics resources for detecting plant transposons. Furthermore, it describes transposon classification and explains why the terms 'autonomous' and 'non-autonomous' cannot be used to classify the superfamilies of transposons. Lastly, this review discusses how to identify misannotated transposons and improve the quality of the transposon database. This review provides helpful information about plant transposons and a beginner's guide on annotating these repetitive sequences.
Affiliation(s)
- Dongying Gao
  - Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID 83210, USA
5
Mokhtar MM, El Allali A. MegaLTR: a web server and standalone pipeline for detecting and annotating LTR-retrotransposons in plant genomes. Front Plant Sci 2023; 14:1237426. [PMID: 37810401 PMCID: PMC10552921 DOI: 10.3389/fpls.2023.1237426] [Received: 06/09/2023] [Accepted: 08/21/2023] [Indexed: 10/10/2023]
Abstract
LTR-retrotransposons (LTR-RTs) are a class of RNA-replicating transposon elements (TEs) that can alter genome structure and function by moving positions, repositioning genes, shifting exons, and causing chromosomal rearrangements. LTR-RTs are widespread in many plant genomes and constitute a significant portion of the genome. Their movement and activity in eukaryotic genomes can provide insight into genome evolution and gene function, especially when LTR-RTs are located near or within genes. Building the redundant and non-redundant LTR-RT libraries and their annotations for species lacking this resource requires extensive bioinformatics pipelines and expensive computing power to analyze large amounts of genomic data. This increases the need for online services that provide computational resources with minimal overhead and maximum efficiency. Here, we present MegaLTR as a web server and standalone pipeline that detects intact LTR-RTs at the whole-genome level and integrates multiple tools for structure-based, homology-based, and de novo identification, classification, annotation, insertion time determination, and LTR-RT gene chimera analysis. MegaLTR also provides statistical analysis and visualization with multiple tools and can be used to accelerate plant species discovery and assist breeding programs in their efforts to improve genomic resources. We hope that the development of online services such as MegaLTR, which can analyze large amounts of genomic data, will become increasingly important for the automated detection and annotation of LTR-RT elements.
Affiliation(s)
- Morad M. Mokhtar
  - African Genome Center, Mohammed VI Polytechnic University, Benguerir, Morocco
- Achraf El Allali
  - African Genome Center, Mohammed VI Polytechnic University, Benguerir, Morocco
6
Alsamman AM, Mousa KH, Nassar AE, Faheem MM, Radwan KH, Adly MH, Hussein A, Istanbuli T, Mokhtar MM, Elakkad TA, Kehel Z, Hamwieh A, Abdelsattar M, El Allali A. Identification, characterization, and validation of NBS-encoding genes in grass pea. Front Genet 2023; 14:1187597. [PMID: 37408775 PMCID: PMC10318170 DOI: 10.3389/fgene.2023.1187597] [Received: 03/16/2023] [Accepted: 06/01/2023] [Indexed: 07/07/2023]
Abstract
Grass pea is a promising crop with the potential to provide food and fodder, but its genomics has not been adequately explored. Identifying genes for desirable traits, such as drought tolerance and disease resistance, is critical for improving the plant. Grass pea currently lacks known R-genes, including the nucleotide-binding site-leucine-rich repeat (NBS-LRR) gene family, which plays a key role in protecting the plant from biotic and abiotic stresses. In our study, we used the recently published grass pea genome and available transcriptomic data to identify 274 NBS-LRR genes. The evolutionary relationships between the classified genes on the reported plants and LsNBS revealed that 124 genes have TNL domains, while 150 genes have CNL domains. All genes contained exons, ranging from 1 to 7. Ten conserved motifs with lengths ranging from 16 to 30 amino acids were identified. We found TIR-domain-containing genes in 132 LsNBSs, with 63 TIR-1 and 69 TIR-2, and RX-CCLike in 84 LsNBSs. We also identified several popular motifs, including P-loop, Uup, kinase-GTPase, ABC, ChvD, CDC6, Rnase_H, Smc, CDC48, and SpoVK. According to the gene enrichment analysis, the identified genes undergo several biological processes such as plant defense, innate immunity, hydrolase activity, and DNA binding. In the upstream regions, 103 transcription factors were identified that govern the transcription of nearby genes affecting the plant excretion of salicylic acid, methyl jasmonate, ethylene, and abscisic acid. According to RNA-Seq expression analysis, 85% of the encoded genes have high expression levels. Nine LsNBS genes were selected for qPCR under salt stress conditions. The majority of the genes showed upregulation at 50 and 200 μM NaCl. However, LsNBS-D18, LsNBS-D204, and LsNBS-D180 showed reduced or drastic downregulation compared to their respective expression levels, providing further insights into the potential functions of LsNBSs under salt stress conditions. 
Our findings also shed light on the evolution and classification of NBS-LRR genes in legumes, highlighting the potential of grass pea. Further research could focus on the functional analysis of these genes and their potential use in breeding programs to improve the salinity, drought, and disease resistance of this important crop.
Affiliation(s)
- Alsamman M. Alsamman
  - Agricultural Genetic Engineering Research Institute (AGERI), Agricultural Research Center (ARC), Giza, Egypt
  - International Center for Agricultural Research in the Dry Areas (ICARDA), Giza, Egypt
- Khaled H. Mousa
  - International Center for Agricultural Research in the Dry Areas (ICARDA), Giza, Egypt
- Ahmed E. Nassar
  - International Center for Agricultural Research in the Dry Areas (ICARDA), Giza, Egypt
- Mostafa M. Faheem
  - Agricultural Genetic Engineering Research Institute (AGERI), Agricultural Research Center (ARC), Giza, Egypt
- Khaled H. Radwan
  - Agricultural Genetic Engineering Research Institute (AGERI), Agricultural Research Center (ARC), Giza, Egypt
- Monica H. Adly
  - Agricultural Genetic Engineering Research Institute (AGERI), Agricultural Research Center (ARC), Giza, Egypt
  - International Center for Agricultural Research in the Dry Areas (ICARDA), Giza, Egypt
- Ahmed Hussein
  - Agricultural Genetic Engineering Research Institute (AGERI), Agricultural Research Center (ARC), Giza, Egypt
- Tawffiq Istanbuli
  - International Center for Agricultural Research in the Dry Areas (ICARDA), Terbol, Lebanon
- Morad M. Mokhtar
  - Agricultural Genetic Engineering Research Institute (AGERI), Agricultural Research Center (ARC), Giza, Egypt
  - African Genome Center, Mohammed VI Polytechnic University, Ben Guerir, Morocco
- Tamer Ahmed Elakkad
  - Department of Genetics and Genetic Engineering, Faculty of Agriculture at Moshtohor, Benha University, Benha, Egypt
  - Moshtohor Research Park, Molecular Biology Lab, Benha University, Benha, Egypt
- Zakaria Kehel
  - Biodiversity and Crop Improvement Program, International Center for Agricultural Research in the Dry Areas (ICARDA), Rabat, Morocco
- Aladdin Hamwieh
  - International Center for Agricultural Research in the Dry Areas (ICARDA), Giza, Egypt
- Mohamed Abdelsattar
  - Agricultural Genetic Engineering Research Institute (AGERI), Agricultural Research Center (ARC), Giza, Egypt
- Achraf El Allali
  - African Genome Center, Mohammed VI Polytechnic University, Ben Guerir, Morocco
7
Mokhtar MM, Abd-Elhalim HM, El Allali A. A large-scale assessment of the quality of plant genome assemblies using the LTR assembly index. AoB Plants 2023; 15:plad015. [PMID: 37197714 PMCID: PMC10184434 DOI: 10.1093/aobpla/plad015] [Received: 09/05/2022] [Accepted: 04/01/2023] [Indexed: 05/19/2023]
Abstract
Recent advances in genome sequencing have led to an increase in the number of sequenced genomes. However, the presence of repetitive sequences complicates the assembly of plant genomes. The LTR assembly index (LAI) has recently been widely used to assess the quality of genome assembly, as a higher LAI is associated with a higher quality of assembly. Here, we assessed the quality of 1664 assembled plant and algal genomes using LAI and reported the results as a data repository called PlantLAI (https://bioinformatics.um6p.ma/PlantLAI). A total of 55 117 586 pseudomolecules/scaffolds with a combined length of 988.11 gigabase-pairs were examined using the LAI workflow. A total of 46 583 551 accurate LTR-RTs were discovered, including 2 263 188 Copia, 2 933 052 Gypsy, and 1 387 311 unknown superfamilies. Only 1136 plant genomes were suitable for LAI calculation, with values ranging from 0 to 31.59. Based on the quality classification system, 476 diploid genomes were classified as draft, 472 as reference, and 135 as gold genomes. We also provide a free webtool to calculate the LAI of newly assembled genomes and the ability to save the result in the repository. The data repository is designed to fill in the gaps in the reported LAI of existing genomes, while the webtool is designed to help researchers calculate the LAI of their newly sequenced genomes.
Affiliation(s)
- Morad M Mokhtar
  - African Genome Center, Mohammed VI Polytechnic University, Lot 660 Hay Moulay Rachid, Benguerir 43150, Morocco
- Haytham M Abd-Elhalim
  - Agricultural Genetic Engineering Research Institute, Agricultural Research Center, Giza 12619, Egypt
- Achraf El Allali
  - African Genome Center, Mohammed VI Polytechnic University, Lot 660 Hay Moulay Rachid, Benguerir 43150, Morocco
8
Hassan AH, Mokhtar MM, El Allali A. TEMM: A Curated Data Resource for Transposon Element-Based Molecular Markers in Plants. Methods Mol Biol 2023; 2703:45-57. [PMID: 37646936 DOI: 10.1007/978-1-0716-3389-2_4] [Indexed: 09/01/2023]
Abstract
Transposon elements (TEs) are mobile genetic elements that can insert themselves into new locations and modify the plant genome. In recent years, they have been used as molecular markers in plant breeding programs. TE-based molecular markers (TE-markers) are divided into two categories depending on the transposition mechanism of the TEs. The first category is retrotransposon-based molecular markers, which include RBIP, IRAP, REMAP, and iPBS. The second category is DNA-based TE-markers, which include MITE, TE-junction, and CACTA TE-markers. These markers are a good tool for studying genetic diversity and can provide information on plants' phylogenetic and evolutionary history. They can help breeding programs improve agronomic traits and develop new varieties. Overall, TE-markers play an important role in plant genetics and plant breeding and contribute to a better understanding of plant biology. Here, we present TEMM, a curated data resource for TE-markers in plants. Relevant research articles were screened to collect primer sequences and related information. Only articles containing primer sequences were added to the data resource. TEMM contains 784 primers with their associated PCR reaction programs and their applications in various crops. These include 203 iPBS, 191 RBIP, 140 IRAP, 78 TE-junction, 76 IRAPS, 47 RBIP-IRAP, 16 IRAP-REMAP, 12 REMAP, 12 REMA-IRAP, 6 REMA, and 3 ISBP primers. The data resource is freely available at https://bioinformatics.um6p.ma/TEMM.
Affiliation(s)
- Asmaa H Hassan
  - African Genome Center, Mohammed VI Polytechnic University, Ben Guerir, Morocco
- Morad M Mokhtar
  - African Genome Center, Mohammed VI Polytechnic University, Ben Guerir, Morocco
- Achraf El Allali
  - African Genome Center, Mohammed VI Polytechnic University, Ben Guerir, Morocco