1
|
Yadav DK, Srivastava GP, Singh A, Singh M, Yadav N, Tuteja N. Proteome-wide analysis reveals G protein-coupled receptor-like proteins in rice ( Oryza sativa). PLANT SIGNALING & BEHAVIOR 2024; 19:2365572. [PMID: 38904257 PMCID: PMC11195488 DOI: 10.1080/15592324.2024.2365572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 06/04/2024] [Indexed: 06/22/2024]
Abstract
G protein-coupled receptors (GPCRs) constitute the largest family of transmembrane proteins in metazoans that mediate the regulation of various physiological responses to discrete ligands through heterotrimeric G protein subunits. The existence of GPCRs in plant is contentious, but their comparable crucial role in various signaling pathways necessitates the identification of novel remote GPCR-like proteins that essentially interact with the plant G protein α subunit and facilitate the transduction of various stimuli. In this study, we identified three putative GPCR-like proteins (OsGPCRLPs) (LOC_Os06g09930.1, LOC_Os04g36630.1, and LOC_Os01g54784.1) in the rice proteome using a stringent bioinformatics workflow. The identified OsGPCRLPs exhibited a canonical GPCR 'type I' 7TM topology, patterns, and biologically significant sites for membrane anchorage and desensitization. Cluster-based interactome mapping revealed that the identified proteins interact with the G protein α subunit which is a characteristic feature of GPCRs. Computational results showing the interaction of identified GPCR-like proteins with G protein α subunit and its further validation by the membrane yeast-two-hybrid assay strongly suggest the presence of GPCR-like 7TM proteins in the rice proteome. The absence of a regulator of G protein signaling (RGS) box in the C- terminal domain, and the presence of signature motifs of canonical GPCR in the identified OsGPCRLPs strongly suggest that the rice proteome contains GPCR-like proteins that might be involved in signal transduction.
Collapse
Affiliation(s)
- Dinesh K. Yadav
- Plant Molecular Biology and Genetic Engineering Laboratory, Department of Botany, University of Allahabad, Prayagraj, India
| | - Gyan Prakash Srivastava
- Plant Molecular Biology and Genetic Engineering Laboratory, Department of Botany, University of Allahabad, Prayagraj, India
| | - Ananya Singh
- Plant Molecular Biology and Genetic Engineering Laboratory, Department of Botany, University of Allahabad, Prayagraj, India
| | - Madhavi Singh
- Plant Molecular Biology and Genetic Engineering Laboratory, Department of Botany, University of Allahabad, Prayagraj, India
| | - Neelam Yadav
- Plant Molecular Biology and Genetic Engineering Laboratory, Department of Botany, University of Allahabad, Prayagraj, India
| | - Narendra Tuteja
- Plant Molecular Biology, International Centre for Genetic Engineering and Biotechnology, New Delhi, India
| |
Collapse
|
2
|
Tian R, Nájera-González HR, Nigam D, Khan A, Chen J, Xin Z, Herrera-Estrella L, Jiao Y. Leucine-rich repeat receptor kinase BM41 regulates cuticular wax deposition in sorghum. JOURNAL OF EXPERIMENTAL BOTANY 2024; 75:6331-6345. [PMID: 39041593 DOI: 10.1093/jxb/erae319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Accepted: 07/22/2024] [Indexed: 07/24/2024]
Abstract
Cuticular wax (CW) is the first defensive barrier of plants that forms a waterproof barrier, protects the plant from desiccation, and defends against insects, pathogens, and UV radiation. Sorghum, an important grass crop with high heat and drought tolerance, exhibits a much higher wax load than other grasses and the model plant Arabidopsis. In this study, we explored the regulation of sorghum CW biosynthesis using a bloomless mutant. The CW on leaf sheaths of the bloomless 41 (bm41) mutant showed significantly reduced very long-chain fatty acids (VLCFAs), triterpenoids, alcohols, and other wax components, with an overall 86% decrease in total wax content compared with the wild type. Notably, the 28-carbon and 30-carbon VLCFAs were decreased in the mutants. Using bulk segregant analysis, we identified the causal gene of the bloomless phenotype as a leucine-rich repeat transmembrane protein kinase. Transcriptome analysis of the wild-type and bm41 mutant leaf sheaths revealed BM41 as a positive regulator of lipid biosynthesis and steroid metabolism. BM41 may regulate CW biosynthesis by regulating the expression of the gene encoding 3-ketoacyl-CoA synthase 6. Identification of BM41 as a new regulator of CW biosynthesis provides fundamental knowledge for improving grass crops' heat and drought tolerance by increasing CW.
Collapse
Affiliation(s)
- Ran Tian
- Department of Plant and Soil Science, Institute of Genomics for Crop Abiotic Stress Tolerance (IGCAST), Texas Tech University, Lubbock, TX 79409, USA
| | - Héctor-Rogelio Nájera-González
- Department of Plant and Soil Science, Institute of Genomics for Crop Abiotic Stress Tolerance (IGCAST), Texas Tech University, Lubbock, TX 79409, USA
| | - Deepti Nigam
- Department of Plant and Soil Science, Institute of Genomics for Crop Abiotic Stress Tolerance (IGCAST), Texas Tech University, Lubbock, TX 79409, USA
| | - Adil Khan
- Department of Plant and Soil Science, Institute of Genomics for Crop Abiotic Stress Tolerance (IGCAST), Texas Tech University, Lubbock, TX 79409, USA
| | - Junping Chen
- Plant Stress and Germplasm Development Unit, Crop Systems Research Laboratory, USDA-ARS, 3810, 4th Street, Lubbock, TX 79415, USA
| | - Zhanguo Xin
- Plant Stress and Germplasm Development Unit, Crop Systems Research Laboratory, USDA-ARS, 3810, 4th Street, Lubbock, TX 79415, USA
| | - Luis Herrera-Estrella
- Department of Plant and Soil Science, Institute of Genomics for Crop Abiotic Stress Tolerance (IGCAST), Texas Tech University, Lubbock, TX 79409, USA
| | - Yinping Jiao
- Department of Plant and Soil Science, Institute of Genomics for Crop Abiotic Stress Tolerance (IGCAST), Texas Tech University, Lubbock, TX 79409, USA
| |
Collapse
|
3
|
Senkevich K, Parlar SC, Chantereault C, Yu E, Ahmad J, Ruskey JA, Asayesh F, Spiegelman D, Waters C, Monchi O, Dauvilliers Y, Dupré N, Miliukhina I, Timofeeva A, Emelyanov A, Pchelina S, Greenbaum L, Hassin-Baer S, Alcalay RN, Gan-Or Z. Are rare heterozygous SYNJ1 variants associated with Parkinson's disease? NPJ Parkinsons Dis 2024; 10:201. [PMID: 39455605 PMCID: PMC11512049 DOI: 10.1038/s41531-024-00809-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Accepted: 10/02/2024] [Indexed: 10/28/2024] Open
Abstract
Previous studies have established that rare biallelic SYNJ1 mutations cause autosomal recessive parkinsonism and Parkinson's disease (PD). We analyzed 8165 PD cases, 818 early-onset-PD (EOPD, < 50 years) and 70,363 controls. Burden meta-analysis revealed an association between rare nonsynonymous variants and variants with high Combined Annotation-Dependent Depletion score (> 20) in the Sac1 SYNJ1 domain and PD (Pfdr = 0.040). A meta-analysis of EOPD patients demonstrated an association between all rare heterozygous SYNJ1 variants and PD (Pfdr = 0.029).
Collapse
Affiliation(s)
- Konstantin Senkevich
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada.
- Department of Neurology and neurosurgery, McGill University, Montréal, QC, Canada.
- Department of Human Genetics, McGill University, Montréal, QC, Canada.
| | - Sitki Cem Parlar
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Cloe Chantereault
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Eric Yu
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Jamil Ahmad
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Neurology and neurosurgery, McGill University, Montréal, QC, Canada
| | - Jennifer A Ruskey
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Neurology and neurosurgery, McGill University, Montréal, QC, Canada
| | - Farnaz Asayesh
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Dan Spiegelman
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
| | - Cheryl Waters
- Department of Neurology, College of Physicians and Surgeons, New York, Columbia City, NY, USA
| | - Oury Monchi
- Department of Neurology and neurosurgery, McGill University, Montréal, QC, Canada
- Département de radiologie, radio-oncologie et médecine nucléaire, Université de Montréal, Montréal, QC, Canada
- Centre de recherche de l'Institut universitaire de gériatrie de Montréal, Montréal, QC, Canada
| | - Yves Dauvilliers
- National Reference Center for Narcolepsy, Sleep Unit, Department of Neurology, Gui-de-Chauliac Hospital, CHU Montpellier, University of Montpellier, Montpellier, France
| | - Nicolas Dupré
- Neuroscience axis, CHU de Québec-Université Laval, Québec, QC, Canada
- Department of Medicine, Faculty of Medicine, Université Laval, Quebec City, QC, Canada
| | | | - Alla Timofeeva
- First Pavlov State Medical University of St. Petersburg, Saint-Petersburg, Russia
| | - Anton Emelyanov
- First Pavlov State Medical University of St. Petersburg, Saint-Petersburg, Russia
| | - Sofya Pchelina
- First Pavlov State Medical University of St. Petersburg, Saint-Petersburg, Russia
| | - Lior Greenbaum
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- The Danek Gertner Institute of Human Genetics, Sheba Medical Center, Tel Hashomer, Israel
- The Joseph Sagol Neuroscience Center, Sheba Medical Center, Tel Hashomer, Israel
| | - Sharon Hassin-Baer
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- The Joseph Sagol Neuroscience Center, Sheba Medical Center, Tel Hashomer, Israel
- The Movement Disorders Institute, Department of Neurology, Sheba Medical Center, Tel Hashomer, Israel
| | - Roy N Alcalay
- Department of Neurology, College of Physicians and Surgeons, New York, Columbia City, NY, USA
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Division of Movement Disorders, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
| | - Ziv Gan-Or
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada.
- Department of Neurology and neurosurgery, McGill University, Montréal, QC, Canada.
- Department of Human Genetics, McGill University, Montréal, QC, Canada.
| |
Collapse
|
4
|
Segura-Vega J, González-Herrera A, Molina-Bravo R, Solano-González S. Computational identification and characterization of chitinase 1 and chitinase 2 from neotropical isolates of Beauveria bassiana. FRONTIERS IN BIOINFORMATICS 2024; 4:1434442. [PMID: 39493578 PMCID: PMC11527780 DOI: 10.3389/fbinf.2024.1434442] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Accepted: 09/30/2024] [Indexed: 11/05/2024] Open
Abstract
Background The fungus Beauveria bassiana is widely used for agronomical applications, mainly in biological control. B. bassiana uses chitinase enzymes to degrade chitin, a major chemical component found in insect exoskeletons and fungal cell walls. However, until recently, genomic information on neotropical isolates, as well as their metabolic and biotechnological potential, has been limited. Methods Eight complete B. bassiana genomes of Neotropical origin and three references were studied to identify chitinase genes and its corresponding proteins, which were curated and characterized using manual curation and computational tools. We conducted a computational study to highlight functional differences and similarities for chitinase proteins in these Neotropical isolates. Results Eleven chitinase 1 genes were identified, categorized as chitinase 1.1 and chitinase 1.2. Five chitinase 2 genes were identified but presented a higher sequence conservation across all sequences. Interestingly, physicochemical parameters were more similar between chitinase 1.1 and chitinase 2 than between chitinase 1.1 and 1.2. Conclusion Chitinases 1 and 2 demonstrated variations, especially within chitinase 1, which presented a potential paralog. These differences were observed in their physical parameters. Additionally, CHIT2 completely lacks a signal peptide. This implies that CHIT1 might be associated with infection processes, while CHIT2 could be involved in morphogenesis and cellular growth. Therefore, our work highlights the importance of computational studies on local isolates, providing valuable resources for further experimental validation. Intrinsic changes within local species can significantly impact our understanding of complex pathogen-host interactions and offer practical applications, such as biological control.
Collapse
Affiliation(s)
- Juan Segura-Vega
- Laboratorio de Bioinformática Aplicada, Escuela de Ciencias Biológicas, Universidad Nacional de Costa Rica, Heredia, Costa Rica
| | - Allan González-Herrera
- Laboratorio de Control Biológico, Escuela de Ciencias Agrarias, Universidad Nacional de Costa Rica, Heredia, Costa Rica
| | - Ramón Molina-Bravo
- Programa de Biotecnología Vegetal y Recursos Genéticos para el Fitomejoramiento (BIOVERFI), Escuela de Ciencias Agrarias, Universidad Nacional de Costa Rica, Heredia, Costa Rica
| | - Stefany Solano-González
- Laboratorio de Bioinformática Aplicada, Escuela de Ciencias Biológicas, Universidad Nacional de Costa Rica, Heredia, Costa Rica
| |
Collapse
|
5
|
Salgado JFM, Hervé V, Vera MAG, Tokuda G, Brune A. Unveiling lignocellulolytic potential: a genomic exploration of bacterial lineages within the termite gut. MICROBIOME 2024; 12:201. [PMID: 39407345 PMCID: PMC11481507 DOI: 10.1186/s40168-024-01917-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Accepted: 08/26/2024] [Indexed: 10/19/2024]
Abstract
BACKGROUND The microbial landscape within termite guts varies across termite families. The gut microbiota of lower termites (LT) is dominated by cellulolytic flagellates that sequester wood particles in their digestive vacuoles, whereas in the flagellate-free higher termites (HT), cellulolytic activity has been attributed to fiber-associated bacteria. However, little is known about the role of individual lineages in fiber digestion, particularly in LT. RESULTS We investigated the lignocellulolytic potential of 2223 metagenome-assembled genomes (MAGs) recovered from the gut metagenomes of 51 termite species. In the flagellate-dependent LT, cellulolytic enzymes are restricted to MAGs of Bacteroidota (Dysgonomonadaceae, Tannerellaceae, Bacteroidaceae, Azobacteroidaceae) and Spirochaetota (Breznakiellaceae) and reflect a specialization on cellodextrins, whereas their hemicellulolytic arsenal features activities on xylans and diverse heteropolymers. By contrast, the MAGs derived from flagellate-free HT possess a comprehensive arsenal of exo- and endoglucanases that resembles that of termite gut flagellates, underlining that Fibrobacterota and Spirochaetota occupy the cellulolytic niche that became vacant after the loss of the flagellates. Furthermore, we detected directly or indirectly oxygen-dependent enzymes that oxidize cellulose or modify lignin in MAGs of Pseudomonadota (Burkholderiales, Pseudomonadales) and Actinomycetota (Actinomycetales, Mycobacteriales), representing lineages located at the hindgut wall. CONCLUSIONS The results of this study refine our concept of symbiotic digestion of lignocellulose in termite guts, emphasizing the differential roles of specific bacterial lineages in both flagellate-dependent and flagellate-independent breakdown of cellulose and hemicelluloses, as well as a so far unappreciated role of oxygen in the depolymerization of plant fiber and lignin in the microoxic periphery during gut passage in HT. Video Abstract.
Collapse
Affiliation(s)
- João Felipe M Salgado
- RG Insect Microbiology and Symbiosis, Max Planck Institute for Terrestrial Microbiology, 35043, Marburg, Germany
| | - Vincent Hervé
- RG Insect Microbiology and Symbiosis, Max Planck Institute for Terrestrial Microbiology, 35043, Marburg, Germany
| | - Manuel A G Vera
- RG Insect Microbiology and Symbiosis, Max Planck Institute for Terrestrial Microbiology, 35043, Marburg, Germany
| | - Gaku Tokuda
- Tropical Biosphere Research Center, Center of Molecular Biosciences, University of the Ryukyus, Nishihara, Okinawa, 903-0213, Japan
| | - Andreas Brune
- RG Insect Microbiology and Symbiosis, Max Planck Institute for Terrestrial Microbiology, 35043, Marburg, Germany.
| |
Collapse
|
6
|
Mayo-Pérez S, Gama-Martínez Y, Dávila S, Rivera N, Hernández-Lucas I. LysR-type transcriptional regulators: state of the art. Crit Rev Microbiol 2024; 50:598-630. [PMID: 37635411 DOI: 10.1080/1040841x.2023.2247477] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 08/03/2023] [Accepted: 08/08/2023] [Indexed: 08/29/2023]
Abstract
The LysR-type transcriptional regulators (LTTRs) are DNA-binding proteins present in bacteria, archaea, and in algae. Knowledge about their distribution, abundance, evolution, structural organization, transcriptional regulation, fundamental roles in free life, pathogenesis, and bacteria-plant interaction has been generated. This review focuses on these aspects and provides a current picture of LTTR biology.
Collapse
Affiliation(s)
- S Mayo-Pérez
- Departamento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
| | - Y Gama-Martínez
- Departamento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
| | - S Dávila
- Centro de Investigación en Dinámica Celular, Universidad Autónoma del Estado de Morelos, Cuernavaca, Mexico
| | - N Rivera
- IPN: CICATA, Unidad Morelos del Instituto Politécnico Nacional, Atlacholoaya, Mexico
| | - I Hernández-Lucas
- Departamento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
| |
Collapse
|
7
|
Lian W, Zhang L, Wang C, Wu S, He S, Lei J, Zhang Y, You L, Zheng L, Luo X, Ye Z, Hu Z, Wang G, Zhu Y, Li C, Liu J. Systematic identification and functional analysis of root meristem growth factors (RGFs) reveals role of PgRGF1 in modulation of root development and ginsenoside production in Panax ginseng. Int J Biol Macromol 2024; 274:133446. [PMID: 38945337 DOI: 10.1016/j.ijbiomac.2024.133446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2024] [Revised: 06/10/2024] [Accepted: 06/24/2024] [Indexed: 07/02/2024]
Abstract
Panax ginseng C.A. Mey., known for its medicinal and dietary supplement properties, primarily contains pharmacologically active ginsenosides. However, the regulatory mechanisms linking ginseng root development with ginsenoside biosynthesis are still unclear. Root meristem growth factors (RGFs) are crucial for regulating plant root growth. In our study, we identified five ginseng RGF peptide sequences from the ginseng genome and transcriptome libraries. We treated Arabidopsis and ginseng adventitious roots with exogenous Panax ginseng RGFs (PgRGFs) to assess their activities. Our results demonstrate that PgRGF1 influences gravitropic responses and reduces lateral root formation in Arabidopsis. PgRGF1 has been found to restrict the number and length of ginseng adventitious root branches in ginseng. Given the medicinal properties of ginseng, We determined the ginsenoside content and performed transcriptomic analysis of PgRGF1-treated ginseng adventitious roots. Specifically, the total ginsenoside content in ginseng adventitious roots decreased by 19.98 % and 63.71 % following treatments with 1 μM and 10 μM PgRGF1, respectively, compared to the control. The results revealed that PgRGF1 affects the accumulation of ginsenosides by regulating the expression of genes associated with auxin transportation and ginsenoside biosynthesis. These findings suggest that PgRGF1, as a peptide hormone regulator in ginseng, can modulate adventitious root growth and ginsenoside accumulation.
Collapse
Affiliation(s)
- Weipeng Lian
- School of Pharmacy, Shihezi University, Key Laboratory of Xinjiang Phytomedicine Resource and Utilization, Ministry of Education, Xinjiang, Shihezi 832000, China
| | - Linfan Zhang
- State Key Laboratory for Quality Ensurance and Sustainable Use of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Chenglin Wang
- State Key Laboratory for Quality Ensurance and Sustainable Use of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Shiqi Wu
- Laboratory of Medicinal Plant, Hubei Key Laboratory of Embryonic Stem Cell Research, School of Basic Medicine, Hubei University of Medicine, Shiyan 442000, PR China
| | - Shan He
- Laboratory of Medicinal Plant, Hubei Key Laboratory of Embryonic Stem Cell Research, School of Basic Medicine, Hubei University of Medicine, Shiyan 442000, PR China
| | - Jinlin Lei
- Laboratory of Medicinal Plant, Hubei Key Laboratory of Embryonic Stem Cell Research, School of Basic Medicine, Hubei University of Medicine, Shiyan 442000, PR China
| | - Yonghong Zhang
- Laboratory of Medicinal Plant, Hubei Key Laboratory of Embryonic Stem Cell Research, School of Basic Medicine, Hubei University of Medicine, Shiyan 442000, PR China
| | - Lei You
- Laboratory of Medicinal Plant, Hubei Key Laboratory of Embryonic Stem Cell Research, School of Basic Medicine, Hubei University of Medicine, Shiyan 442000, PR China
| | - Lanlan Zheng
- Laboratory of Medicinal Plant, Hubei Key Laboratory of Embryonic Stem Cell Research, School of Basic Medicine, Hubei University of Medicine, Shiyan 442000, PR China
| | - Xiangyin Luo
- Laboratory of Medicinal Plant, Hubei Key Laboratory of Embryonic Stem Cell Research, School of Basic Medicine, Hubei University of Medicine, Shiyan 442000, PR China
| | - Zhengxiu Ye
- Laboratory of Medicinal Plant, Hubei Key Laboratory of Embryonic Stem Cell Research, School of Basic Medicine, Hubei University of Medicine, Shiyan 442000, PR China
| | - Ziyao Hu
- College of Life Sciences, Shaanxi Normal University, Xi'an 710119, China
| | - Guodong Wang
- College of Life Sciences, Shaanxi Normal University, Xi'an 710119, China
| | - Yun Zhu
- School of Pharmacy, Shihezi University, Key Laboratory of Xinjiang Phytomedicine Resource and Utilization, Ministry of Education, Xinjiang, Shihezi 832000, China.
| | - Chen Li
- Laboratory of Medicinal Plant, Hubei Key Laboratory of Embryonic Stem Cell Research, School of Basic Medicine, Hubei University of Medicine, Shiyan 442000, PR China.
| | - Juan Liu
- State Key Laboratory for Quality Ensurance and Sustainable Use of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China.
| |
Collapse
|
8
|
Koper K, Han SW, Kothadia R, Salamon H, Yoshikuni Y, Maeda HA. Multisubstrate specificity shaped the complex evolution of the aminotransferase family across the tree of life. Proc Natl Acad Sci U S A 2024; 121:e2405524121. [PMID: 38885378 PMCID: PMC11214133 DOI: 10.1073/pnas.2405524121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Accepted: 05/14/2024] [Indexed: 06/20/2024] Open
Abstract
Aminotransferases (ATs) are an ancient enzyme family that play central roles in core nitrogen metabolism, essential to all organisms. However, many of the AT enzyme functions remain poorly defined, limiting our fundamental understanding of the nitrogen metabolic networks that exist in different organisms. Here, we traced the deep evolutionary history of the AT family by analyzing AT enzymes from 90 species spanning the tree of life (ToL). We found that each organism has maintained a relatively small and constant number of ATs. Mapping the distribution of ATs across the ToL uncovered that many essential AT reactions are carried out by taxon-specific AT enzymes due to wide-spread nonorthologous gene displacements. This complex evolutionary history explains the difficulty of homology-based AT functional prediction. Biochemical characterization of diverse aromatic ATs further revealed their broad substrate specificity, unlike other core metabolic enzymes that evolved to catalyze specific reactions today. Interestingly, however, we found that these AT enzymes that diverged over billion years share common signatures of multisubstrate specificity by employing different nonconserved active site residues. These findings illustrate that AT family enzymes had leveraged their inherent substrate promiscuity to maintain a small yet distinct set of multifunctional AT enzymes in different taxa. This evolutionary history of versatile ATs likely contributed to the establishment of robust and diverse nitrogen metabolic networks that exist throughout the ToL. The study provides a critical foundation to systematically determine diverse AT functions and underlying nitrogen metabolic networks across the ToL.
Collapse
Affiliation(s)
- Kaan Koper
- Department of Botany, University of Wisconsin-Madison, Madison, WI53706
| | - Sang-Woo Han
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
- Department of Biotechnology, Konkuk University, Chungju27478, South Korea
| | - Ramani Kothadia
- The US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Hugh Salamon
- The US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Yasuo Yoshikuni
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
- The US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA94720
- Center for Advanced Bioenergy and Bioproducts Innovation, Lawrence Berkeley National Laboratory, Berkeley, CA94720
- Global Center for Food, Land, and Water Resources, Research Faculty of Agriculture, Hokkaido University, Hokkaido, Japan 060-8589
- Institute of Global Innovation Research, Tokyo University of Agriculture and Technology, Tokyo183-8538, Japan
| | - Hiroshi A. Maeda
- Department of Botany, University of Wisconsin-Madison, Madison, WI53706
| |
Collapse
|
9
|
Dan J, Wei W, Ou W, Gao G, Song W, Ye L, Liang H, Guo X, Tan L, Jiang J. Excavation of Biomarker Candidates for the Diagnosis of Talaromyces marneffei Infection via Genome-Wide Prediction and Functional Annotation of Secreted Proteins. ACS OMEGA 2024; 9:27093-27103. [PMID: 38947822 PMCID: PMC11209904 DOI: 10.1021/acsomega.4c00571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 05/28/2024] [Accepted: 06/05/2024] [Indexed: 07/02/2024]
Abstract
Talaromyces marneffei is the third most common infectious pathogen in AIDS patients and leads to the highest death rate in Guangxi, China. The lack of reliable biomarkers is one of the major obstacles in current clinical diagnosis, which largely contributes to this high mortality. Here, we present a study that aimed at identifying diagnostic biomarker candidates through genome-wide prediction and functional annotation of Talaromyces marneffei secreted proteins. A total of 584 secreted proteins then emerged, including 382 classical and 202 nonclassical ones. Among them, there were 87 newly obtained functional annotations in this study. The annotated proteins were further evaluated by combining RNA profiling and a homology comparison. Three proteins were ultimately highlighted as biomarker candidates with robust expression and remarkable specificity. The predicted phosphoinositide phospholipase C and the galactomannoprotein were suggested to play an interactive immune game through metabolism of arachidonic acid. Therefore, they hold promise in developing new tools for clinical diagnosis of Talaromyces marneffei and also possibly serve as molecular targets for future therapy.
Collapse
Affiliation(s)
- Jing Dan
- Collaborative
Innovation Centre of Regenerative Medicine and Medical BioResource
Development and Application Co-constructed by the Province and Ministry, Guangxi Medical University, Nanning, Guangxi 530021, China
- Guangxi
Key Laboratory of AIDS Prevention and Treatment & Biosafety III
Laboratory, Guangxi Medical University, Nanning, Guangxi 530021, China
- Center
for Energy Metabolism and Reproduction, Institute of Biomedicine and
Biotechnology, Shenzhen Institute of Advanced
Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Wudi Wei
- Guangxi
Key Laboratory of AIDS Prevention and Treatment & Biosafety III
Laboratory, Guangxi Medical University, Nanning, Guangxi 530021, China
| | - Weijie Ou
- Center
for Energy Metabolism and Reproduction, Institute of Biomedicine and
Biotechnology, Shenzhen Institute of Advanced
Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Guangshi Gao
- Geekgene
Technology Co. Ltd., Beijing 100091, China
| | - Wanjun Song
- Geekgene
Technology Co. Ltd., Beijing 100091, China
| | - Li Ye
- Guangxi
Key Laboratory of AIDS Prevention and Treatment & Biosafety III
Laboratory, Guangxi Medical University, Nanning, Guangxi 530021, China
| | - Hao Liang
- Collaborative
Innovation Centre of Regenerative Medicine and Medical BioResource
Development and Application Co-constructed by the Province and Ministry, Guangxi Medical University, Nanning, Guangxi 530021, China
- Guangxi
Key Laboratory of AIDS Prevention and Treatment & Biosafety III
Laboratory, Guangxi Medical University, Nanning, Guangxi 530021, China
| | - Xuzhen Guo
- Center
for Energy Metabolism and Reproduction, Institute of Biomedicine and
Biotechnology, Shenzhen Institute of Advanced
Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Lei Tan
- Center
for Energy Metabolism and Reproduction, Institute of Biomedicine and
Biotechnology, Shenzhen Institute of Advanced
Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- College
of Life Sciences, University of Chinese
Academy of Sciences, Beijing 100049, China
- Department
of Cardiology, Shenzhen Guangming District
People’s Hospital, Shenzhen 518055, China
| | - Junjun Jiang
- Collaborative
Innovation Centre of Regenerative Medicine and Medical BioResource
Development and Application Co-constructed by the Province and Ministry, Guangxi Medical University, Nanning, Guangxi 530021, China
- Guangxi
Key Laboratory of AIDS Prevention and Treatment & Biosafety III
Laboratory, Guangxi Medical University, Nanning, Guangxi 530021, China
| |
Collapse
|
10
|
Senkevich K, Parlar SC, Chantereault C, Yu E, Ahmad J, Ruskey JA, Asayesh F, Spiegelman D, Waters C, Monchi O, Dauvilliers Y, Dupré N, Miliukhina I, Timofeeva A, Emelyanov A, Pchelina S, Greenbaum L, Hassin-Baer S, Alcalay RN, Gan-Or Z. Are rare heterozygous SYNJ1 variants associated with Parkinson's disease? MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.05.29.24307986. [PMID: 38853950 PMCID: PMC11160829 DOI: 10.1101/2024.05.29.24307986] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
Previous studies have suggested that rare biallelic SYNJ1 mutations may cause autosomal recessive parkinsonism and Parkinson's disease (PD). Our study explored the impact of rare SYNJ1 variants in non-familial settings, including 8,165 PD cases, 818 early-onset PD (EOPD, <50 years) and 70,363 controls. Burden meta-analysis using optimized sequence Kernel association test (SKAT-O) revealed an association between rare nonsynonymous variants in the Sac1 SYNJ1 domain and PD (Pfdr=0.040). Additionally, a meta-analysis focusing on patients with EOPD demonstrated an association between all rare SYNJ1 variants and PD (Pfdr=0.029). Rare SYNJ1 variants may be associated with sporadic PD, and more specifically with EOPD.
Collapse
Affiliation(s)
- Konstantin Senkevich
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Neurology and neurosurgery, McGill University, Montréal, QC, Canada, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Sitki Cem Parlar
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Cloe Chantereault
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Eric Yu
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Jamil Ahmad
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Neurology and neurosurgery, McGill University, Montréal, QC, Canada, Canada
| | - Jennifer A. Ruskey
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Neurology and neurosurgery, McGill University, Montréal, QC, Canada, Canada
| | - Farnaz Asayesh
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Dan Spiegelman
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
| | - Cheryl Waters
- Department of Neurology, College of Physicians and Surgeons, Columbia University Medical Center, NY, USA
| | - Oury Monchi
- Department of Neurology and neurosurgery, McGill University, Montréal, QC, Canada, Canada
- Département de radiologie, radio-oncologie et médecine nucléaire, Université de Montréal, Montréal, QC, Canada
- Centre de recherche de l’Institut universitaire de gériatrie de Montréal, Montréal, QC, Canada
| | - Yves Dauvilliers
- National Reference Center for Narcolepsy, Sleep Unit, Department of Neurology, Guide-Chauliac Hospital, CHU Montpellier, University of Montpellier, Montpellier, France
| | - Nicolas Dupré
- Neuroscience axis, CHU de Québec-Université Laval, Québec, QC, Canada
- Department of Medicine, Faculty of Medicine, Université Laval, Quebec City, QC, Canada
| | | | - Alla Timofeeva
- First Pavlov State Medical University of St. Petersburg, Saint-Petersburg, Russia
| | - Anton Emelyanov
- First Pavlov State Medical University of St. Petersburg, Saint-Petersburg, Russia
| | - Sofya Pchelina
- First Pavlov State Medical University of St. Petersburg, Saint-Petersburg, Russia
| | - Lior Greenbaum
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- The Danek Gertner Institute of Human Genetics, Sheba Medical Center, Tel Hashomer, Israel
- The Joseph Sagol Neuroscience Center, Sheba Medical Center, Tel Hashomer, Israel
| | - Sharon Hassin-Baer
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- The Joseph Sagol Neuroscience Center, Sheba Medical Center, Tel Hashomer, Israel
- The Movement Disorders Institute, Department of Neurology, Sheba Medical Center, Tel Hashomer, Israel
| | - Roy N. Alcalay
- Department of Neurology, College of Physicians and Surgeons, Columbia University Medical Center, NY, USA
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Division of Movement Disorders, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
| | - Ziv Gan-Or
- The Neuro (Montreal Neurological Institute-Hospital), McGill University, Montreal, Quebec, Canada
- Department of Neurology and neurosurgery, McGill University, Montréal, QC, Canada, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
| |
Collapse
|
11
|
Rodríguez-Mejía JL, Hidalgo-Manzano IA, Muriel-Millán LF, Rivera-Gomez N, Sahonero-Canavesi DX, Castillo E, Pardo-López L. A Novel Thermo-Alkaline Stable GDSL/SGNH Esterase with Broad Substrate Specificity from a Deep-Sea Pseudomonas sp. MARINE BIOTECHNOLOGY (NEW YORK, N.Y.) 2024; 26:447-459. [PMID: 38691271 PMCID: PMC11178605 DOI: 10.1007/s10126-024-10308-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 04/03/2024] [Indexed: 05/03/2024]
Abstract
Marine environments harbor a plethora of microorganisms that represent a valuable source of new biomolecules of biotechnological interest. In particular, enzymes from marine bacteria exhibit unique properties due to their high catalytic activity under various stressful and fluctuating conditions, such as temperature, pH, and salinity, fluctuations which are common during several industrial processes. In this study, we report a new esterase (EstGoM) from a marine Pseudomonas sp. isolated at a depth of 1000 m in the Gulf of Mexico. Bioinformatic analyses revealed that EstGoM is an autotransporter esterase (type Va) and belongs to the lipolytic family II, forming a new subgroup. The purified recombinant EstGoM, with a molecular mass of 67.4 kDa, showed the highest hydrolytic activity with p-nitrophenyl octanoate (p-NP C8), although it was also active against p-NP C4, C5, C10, and C12. The optimum pH and temperature for EstGoM were 9 and 60 °C, respectively, but it retained more than 50% of its activity over the pH range of 7-11 and temperature range of 10-75 °C. In addition, EstGoM was tolerant of up to 1 M NaCl and resistant to the presence of several metal ions, detergents, and chemical reagents, such as EDTA and β-mercaptoethanol. The enzymatic properties of EstGoM make it a potential candidate for several industrial applications.
Collapse
Affiliation(s)
- José Luis Rodríguez-Mejía
- Departamento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Av. Universidad 2001, Col. Chamilpa, Cuernavaca, Morelos, 62210, México
- Edificio Dr. Carlos Méndez, Centro Universitario de Investigaciones Biomédicas, Universidad de Colima, Campus Central Colima; Avenida 25 de Julio #965, Col. V. Sn. Sebastián, C.P. 28045, Colima, Colima, México
| | - Itzel Anahí Hidalgo-Manzano
- Departamento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Av. Universidad 2001, Col. Chamilpa, Cuernavaca, Morelos, 62210, México
| | - Luis Felipe Muriel-Millán
- Departamento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Av. Universidad 2001, Col. Chamilpa, Cuernavaca, Morelos, 62210, México
| | - Nancy Rivera-Gomez
- Departamento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Av. Universidad 2001, Col. Chamilpa, Cuernavaca, Morelos, 62210, México
- IPN: CICATA Unidad Morelos del Instituto Politécnico Nacional, Blvd. de La Tecnologia 1036-P 2/2, 62790, Atlacholoaya, Morelos, México
| | - Diana X Sahonero-Canavesi
- Departamento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Av. Universidad 2001, Col. Chamilpa, Cuernavaca, Morelos, 62210, México
- Department of Marine Microbiology and Biogeochemistry, NIOZ Royal Netherlands Institute for Sea Research, 1797AB Den Burg, P.O. Box 59, Texel, Netherlands
| | - Edmundo Castillo
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Av. Universidad 2001, Col. Chamilpa, Cuernavaca, Morelos, 62210, México.
| | - Liliana Pardo-López
- Departamento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Av. Universidad 2001, Col. Chamilpa, Cuernavaca, Morelos, 62210, México.
| |
Collapse
|
12
|
Ca Ferreira L, de Fa Ferreira Filho L, V Cosate MR, Sakamoto T. Genetic structure and diversity of the rfb locus of pathogenic species of the genus Leptospira. Life Sci Alliance 2024; 7:e202302478. [PMID: 38514188 PMCID: PMC10958091 DOI: 10.26508/lsa.202302478] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 03/08/2024] [Accepted: 03/08/2024] [Indexed: 03/23/2024] Open
Abstract
Leptospirosis is caused by pathogenic strains of the genus Leptospira and is considered the most widespread zoonotic bacterial disease. The genus is characterized by the large number of serology variants, which challenges developing effective serotyping methods and vaccines with a broad spectrum. Because knowledge on the genetic basis of the serological diversity among leptospires is still limited, we aimed to explore the genetic structure and patterns of the rfb locus, which is involved in the biosynthesis of lipopolysaccharides, the major surface antigen that defines the serovar in leptospires. Here, we used genomic data of 722 pathogenic samples and compared the gene composition of their rfb locus by hierarchical clustering. Clustering analysis showed that the rfb locus gene composition is species-independent and strongly associated with the serological classification. The samples were grouped into four well-defined classes, which cluster together samples either belonging to the same serogroup or from different serogroups but sharing serological affinity. Our findings can assist in the development of new strategies based on molecular methods, which can lead to better tools for serological identification in this zoonosis.
Collapse
Affiliation(s)
- Leonardo Ca Ferreira
- https://ror.org/04wn09761 Bioinformatics Multidisciplinary Environment (BioME), Instituto Metrópole Digital (IMD), Universidade Federal do Rio Grande do Norte, Natal, Brazil
| | - Luiz de Fa Ferreira Filho
- https://ror.org/04wn09761 Departamento de Engenharia de Computação e Automação (DCA), Centro de Tecnologia (CT), Universidade Federal do Rio Grande do Norte, Natal, Brazil
| | - Maria Raquel V Cosate
- UMass Chain Medical School, Nonhuman Primates Reagent Resources, Department of Medicine, University of Massachusetts, Worcester, MA, USA
| | - Tetsu Sakamoto
- https://ror.org/04wn09761 Bioinformatics Multidisciplinary Environment (BioME), Instituto Metrópole Digital (IMD), Universidade Federal do Rio Grande do Norte, Natal, Brazil
| |
Collapse
|
13
|
Elrashedy A, Nayel M, Salama A, Salama MM, Hasan ME. Bioinformatics approach for structure modeling, vaccine design, and molecular docking of Brucella candidate proteins BvrR, OMP25, and OMP31. Sci Rep 2024; 14:11951. [PMID: 38789443 PMCID: PMC11126717 DOI: 10.1038/s41598-024-61991-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Accepted: 05/13/2024] [Indexed: 05/26/2024] Open
Abstract
Brucellosis is a zoonotic disease with significant economic and healthcare costs. Despite the eradication efforts, the disease persists. Vaccines prevent disease in animals while antibiotics cure humans with limitations. This study aims to design vaccines and drugs for brucellosis in animals and humans, using protein modeling, epitope prediction, and molecular docking of the target proteins (BvrR, OMP25, and OMP31). Tertiary structure models of three target proteins were constructed and assessed using RMSD, TM-score, C-score, Z-score, and ERRAT. The best models selected from AlphaFold and I-TASSER due to their superior performance according to CASP 12 - CASP 15 were chosen for further analysis. The motif analysis of best models using MotifFinder revealed two, five, and five protein binding motifs, however, the Motif Scan identified seven, six, and eight Post-Translational Modification sites (PTMs) in the BvrR, OMP25, and OMP31 proteins, respectively. Dominant B cell epitopes were predicted at (44-63, 85-93, 126-137, 193-205, and 208-237), (26-46, 52-71, 98-114, 142-155, and 183-200), and (29-45, 58-82, 119-142, 177-198, and 222-251) for the three target proteins. Additionally, cytotoxic T lymphocyte epitopes were detected at (173-181, 189-197, and 202-210), (61-69, 91-99, 159-167, and 181-189), and (3-11, 24-32, 167-175, and 216-224), while T helper lymphocyte epitopes were displayed at (39-53, 57-65, 150-158, 163-171), (79-87, 95-108, 115-123, 128-142, and 189-197), and (39-47, 109-123, 216-224, and 245-253), for the respective target protein. Furthermore, structure-based virtual screening of the ZINC and DrugBank databases using the docking MOE program was followed by ADMET analysis. The best five compounds of the ZINC database revealed docking scores ranged from (- 16.8744 to - 15.1922), (- 16.0424 to - 14.1645), and (- 14.7566 to - 13.3222) for the BvrR, OMP25, and OMP31, respectively. These compounds had good ADMET parameters and no cytotoxicity, while DrugBank compounds didn't meet Lipinski's rule criteria. Therefore, the five selected compounds from the ZINC20 databases may fulfill the pharmacokinetics and could be considered lead molecules for potentially inhibiting Brucella's proteins.
Collapse
Affiliation(s)
- Alyaa Elrashedy
- Department of Animal Medicine and Infectious Diseases (Infectious Diseases), Faculty of Veterinary Medicine, University of Sadat City, Sadat City, Egypt.
| | - Mohamed Nayel
- Department of Animal Medicine and Infectious Diseases (Infectious Diseases), Faculty of Veterinary Medicine, University of Sadat City, Sadat City, Egypt
| | - Akram Salama
- Department of Animal Medicine and Infectious Diseases (Infectious Diseases), Faculty of Veterinary Medicine, University of Sadat City, Sadat City, Egypt
| | - Mohammed M Salama
- Physics Department, Medical Biophysics Division, Faculty of Science, Helwan University, Cairo, Egypt
| | - Mohamed E Hasan
- Bioinformatics Department, Genetic Engineering and Biotechnology Research Institute, University of Sadat City, Sadat City, Egypt
| |
Collapse
|
14
|
Yu R, Huang Z, Lam TYC, Sun Y. Utilizing profile hidden Markov model databases for discovering viruses from metagenomic data: a comprehensive review. Brief Bioinform 2024; 25:bbae292. [PMID: 39003531 PMCID: PMC11246558 DOI: 10.1093/bib/bbae292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2024] [Revised: 05/19/2024] [Accepted: 06/04/2024] [Indexed: 07/15/2024] Open
Abstract
Profile hidden Markov models (pHMMs) are able to achieve high sensitivity in remote homology search, making them popular choices for detecting novel or highly diverged viruses in metagenomic data. However, many existing pHMM databases have different design focuses, making it difficult for users to decide the proper one to use. In this review, we provide a thorough evaluation and comparison for multiple commonly used profile HMM databases for viral sequence discovery in metagenomic data. We characterized the databases by comparing their sizes, their taxonomic coverage, and the properties of their models using quantitative metrics. Subsequently, we assessed their performance in virus identification across multiple application scenarios, utilizing both simulated and real metagenomic data. We aim to offer researchers a thorough and critical assessment of the strengths and limitations of different databases. Furthermore, based on the experimental results obtained from the simulated and real metagenomic data, we provided practical suggestions for users to optimize their use of pHMM databases, thus enhancing the quality and reliability of their findings in the field of viral metagenomics.
Collapse
Affiliation(s)
- Runzhou Yu
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China
| | - Ziyi Huang
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China
| | - Theo Y C Lam
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China
| |
Collapse
|
15
|
Lin B, Luo X, Liu Y, Jin X. A comprehensive review and comparison of existing computational methods for protein function prediction. Brief Bioinform 2024; 25:bbae289. [PMID: 39003530 PMCID: PMC11246557 DOI: 10.1093/bib/bbae289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Revised: 05/18/2024] [Indexed: 07/15/2024] Open
Abstract
Protein function prediction is critical for understanding the cellular physiological and biochemical processes, and it opens up new possibilities for advancements in fields such as disease research and drug discovery. During the past decades, with the exponential growth of protein sequence data, many computational methods for predicting protein function have been proposed. Therefore, a systematic review and comparison of these methods are necessary. In this study, we divide these methods into four different categories, including sequence-based methods, 3D structure-based methods, PPI network-based methods and hybrid information-based methods. Furthermore, their advantages and disadvantages are discussed, and then their performance is comprehensively evaluated and compared. Finally, we discuss the challenges and opportunities present in this field.
Collapse
Affiliation(s)
- Baohui Lin
- College of Big Data and Internet, Shenzhen Technology University, Shenzhen, Guangdong 518118, China
| | - Xiaoling Luo
- Guangdong Provincial Key Laboratory of Novel Security Intelligence Technologies, Shenzhen, Guangdong, China
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, Guangdong 518061, China
| | - Yumeng Liu
- College of Big Data and Internet, Shenzhen Technology University, Shenzhen, Guangdong 518118, China
| | - Xiaopeng Jin
- College of Big Data and Internet, Shenzhen Technology University, Shenzhen, Guangdong 518118, China
| |
Collapse
|
16
|
Xu X, Yin K, Wu R. Systematic Investigation of the Trafficking of Glycoproteins on the Cell Surface. Mol Cell Proteomics 2024; 23:100761. [PMID: 38593903 PMCID: PMC11087972 DOI: 10.1016/j.mcpro.2024.100761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Revised: 03/30/2024] [Accepted: 04/03/2024] [Indexed: 04/11/2024] Open
Abstract
Glycoproteins located on the cell surface play a pivotal role in nearly every extracellular activity. N-glycosylation is one of the most common and important protein modifications in eukaryotic cells, and it often regulates protein folding and trafficking. Glycosylation of cell-surface proteins undergoes meticulous regulation by various enzymes in the endoplasmic reticulum (ER) and the Golgi, ensuring their proper folding and trafficking to the cell surface. However, the impacts of protein N-glycosylation, N-glycan maturity, and protein folding status on the trafficking of cell-surface glycoproteins remain to be explored. In this work, we comprehensively and site-specifically studied the trafficking of cell-surface glycoproteins in human cells. Integrating metabolic labeling, bioorthogonal chemistry, and multiplexed proteomics, we investigated 706 N-glycosylation sites on 396 cell-surface glycoproteins in monocytes, either by inhibiting protein N-glycosylation, disturbing N-glycan maturation, or perturbing protein folding in the ER. The current results reveal their distinct impacts on the trafficking of surface glycoproteins. The inhibition of protein N-glycosylation dramatically suppresses the trafficking of many cell-surface glycoproteins. The N-glycan immaturity has more substantial effects on proteins with high N-glycosylation site densities, while the perturbation of protein folding in the ER exerts a more pronounced impact on surface glycoproteins with larger sizes. Furthermore, for N-glycosylated proteins, their trafficking to the cell surface is related to the secondary structures and adjacent amino acid residues of glycosylation sites. Systematic analysis of surface glycoprotein trafficking advances our understanding of the mechanisms underlying protein secretion and surface presentation.
Collapse
Affiliation(s)
- Xing Xu
- School of Chemistry and Biochemistry and the Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, USA
| | - Kejun Yin
- School of Chemistry and Biochemistry and the Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, USA
| | - Ronghu Wu
- School of Chemistry and Biochemistry and the Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, USA.
| |
Collapse
|
17
|
Álvarez-Campos P, García-Castro H, Emili E, Pérez-Posada A, Del Olmo I, Peron S, Salamanca-Díaz DA, Mason V, Metzger B, Bely AE, Kenny NJ, Özpolat BD, Solana J. Annelid adult cell type diversity and their pluripotent cellular origins. Nat Commun 2024; 15:3194. [PMID: 38609365 PMCID: PMC11014941 DOI: 10.1038/s41467-024-47401-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 03/27/2024] [Indexed: 04/14/2024] Open
Abstract
Many annelids can regenerate missing body parts or reproduce asexually, generating all cell types in adult stages. However, the putative adult stem cell populations involved in these processes, and the diversity of cell types generated by them, are still unknown. To address this, we recover 75,218 single cell transcriptomes of the highly regenerative and asexually-reproducing annelid Pristina leidyi. Our results uncover a rich cell type diversity including annelid specific types as well as novel types. Moreover, we characterise transcription factors and gene networks that are expressed specifically in these populations. Finally, we uncover a broadly abundant cluster of putative stem cells with a pluripotent signature. This population expresses well-known stem cell markers such as vasa, piwi and nanos homologues, but also shows heterogeneous expression of differentiated cell markers and their transcription factors. We find conserved expression of pluripotency regulators, including multiple chromatin remodelling and epigenetic factors, in piwi+ cells. Finally, lineage reconstruction analyses reveal computational differentiation trajectories from piwi+ cells to diverse adult types. Our data reveal the cell type diversity of adult annelids by single cell transcriptomics and suggest that a piwi+ cell population with a pluripotent stem cell signature is associated with adult cell type differentiation.
Collapse
Affiliation(s)
- Patricia Álvarez-Campos
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK.
- Centro de Investigación en Biodiversidad y Cambio Global (CIBC-UAM) & Departamento de Biología (Zoología), Facultad de Ciencias, Universidad Autónoma de Madrid, Madrid, Spain.
| | - Helena García-Castro
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
- Living Systems Institute, University of Exeter, Exeter, UK
| | - Elena Emili
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
| | - Alberto Pérez-Posada
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
- Living Systems Institute, University of Exeter, Exeter, UK
| | - Irene Del Olmo
- Centro de Investigación en Biodiversidad y Cambio Global (CIBC-UAM) & Departamento de Biología (Zoología), Facultad de Ciencias, Universidad Autónoma de Madrid, Madrid, Spain
| | - Sophie Peron
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
- Living Systems Institute, University of Exeter, Exeter, UK
| | - David A Salamanca-Díaz
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
- Living Systems Institute, University of Exeter, Exeter, UK
| | - Vincent Mason
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
| | - Bria Metzger
- Eugene Bell Center for Regenerative Biology and Tissue Engineering, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA, 05432, USA
- Department of Biology, Washington University in St. Louis. 1 Brookings Dr. Saint Louis, Saint Louis, MO, 63130, USA
| | - Alexandra E Bely
- Department of Biology, University of Maryland, College Park, MD, 20742, USA
| | - Nathan J Kenny
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
- Department of Biochemistry, University of Otago, P.O. Box 56, Dunedin, Aotearoa, New Zealand
| | - B Duygu Özpolat
- Eugene Bell Center for Regenerative Biology and Tissue Engineering, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA, 05432, USA.
- Department of Biology, Washington University in St. Louis. 1 Brookings Dr. Saint Louis, Saint Louis, MO, 63130, USA.
| | - Jordi Solana
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK.
- Living Systems Institute, University of Exeter, Exeter, UK.
| |
Collapse
|
18
|
Hasan ME, Samir A, Khalil MM, Shafaa MW. Bioinformatics approach for prediction and analysis of the Non-Structural Protein 4B (NSP4B) of the Zika virus. J Genet Eng Biotechnol 2024; 22:100336. [PMID: 38494248 PMCID: PMC10860876 DOI: 10.1016/j.jgeb.2023.100336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
BACKGROUND The Nonstructural Protein (NSP) 4B of Zika virus of 251 amino acids from (ZIKV/Human/POLG_ZIKVF) with accession number (A0A024B7W1), Induces the production of Endoplasmic Reticulum ER-derived membrane vesicles, which are the sites of viral replication. To understand the physical basis of how proteins fold in nature and to solve the challenge of protein structure prediction, Ab-initio and comparative modeling are crucial tools. RESULTS The systematic in silico technique, ThreaDom, had only predicted one domain (4 - 190) of NSP4B. I-TASSER, and Alphafold were ranked as the best servers for full-length 3-D protein structure predictions of NSP4B, where the predicted models were evaluated quantitatively using benchmarked metrics including C-score (-3.43), TM-score (0.77949), RMSD (2.73), and Z-score (1.561). The functional and protein binding motifs were realized using motif databases, secondary and surface accessibility predictions combined with Post-Translational Modification Sites (PTMs) prediction. Two highly conserved protein-binding motifs (Flavi NS4B and Bacillus papRprotein), together with three (PTMs) (Casein Kinase II, Myristyl site, and ASN-Glycosylation site) were predicted utilizing the Motif scan and Scanprosite servers. These patterns and PTMs were associated with NSP4B's role in triggering the development of the viral replication complex and its participation in the localization of NS3 and NS5 on the membrane. Only one hit from Structural Classification of Protein (SCOP) matched the protein sequence at positions 10 to 397 and was categorized six-hairpin glycosidases superfamily according to CATH (Class, Architecture, Topology, and Homology). Integrating this NSP4B information with the templates' SCOP and CATH annotations achieves it easier to attribute structure-function/evolution links to both previously known and recently discovered protein structures.
Collapse
Affiliation(s)
- Mohamed E Hasan
- Bioinformatics Department, Genetic Engineering and Biotechnology Research Institute, University of Sadat City, Sadat City 32897, Egypt.
| | - Aya Samir
- Physics Department, Medical Biophysics Division, Faculty of Science, Helwan University, Cairo, Egypt
| | - Magdy M Khalil
- Physics Department, Medical Biophysics Division, Faculty of Science, Helwan University, Cairo, Egypt; School of Biotechnology, Badr University in Cairo, Egypt
| | - Medhat W Shafaa
- Physics Department, Medical Biophysics Division, Faculty of Science, Helwan University, Cairo, Egypt
| |
Collapse
|
19
|
Bonello J, Orengo C. FunPredCATH: An ensemble method for predicting protein function using CATH. BIOCHIMICA ET BIOPHYSICA ACTA. PROTEINS AND PROTEOMICS 2024; 1872:140985. [PMID: 38122964 DOI: 10.1016/j.bbapap.2023.140985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 12/05/2023] [Accepted: 12/06/2023] [Indexed: 12/23/2023]
Abstract
MOTIVATION The growth of unannotated proteins in UniProt increases at a very high rate every year due to more efficient sequencing methods. However, the experimental annotation of proteins is a lengthy and expensive process. Using computational techniques to narrow the search can speed up the process by providing highly specific Gene Ontology (GO) terms. METHODOLOGY We propose an ensemble approach that combines three generic base predictors that predict Gene Ontology (BP, CC and MF) terms from sequences across different species. We train our models on UniProtGOA annotation data and use the CATH domain resources to identify the protein families. We then calculate a score based on the prevalence of individual GO terms in the functional families that is then used as an indicator of confidence when assigning the GO term to an uncharacterised protein. METHODS In the ensemble, we use a statistics-based method that scores the occurrence of GO terms in a CATH FunFam against a background set of proteins annotated by the same GO term. We also developed a set-based method that uses Set Intersection and Set Union to score the occurrence of GO terms within the same CATH FunFam. Finally, we also use FunFams-Plus, a predictor method developed by the Orengo Group at UCL to predict GO terms for uncharacterised proteins in the CAFA3 challenge. EVALUATION We evaluated the methods against the CAFA3 benchmark and DomFun. We used the Precision, Recall and Fmax metrics and the benchmark datasets that are used in CAFA3 to evaluate our models and compare them to the CAFA3 results. Our results show that FunPredCATH compares well with top CAFA methods in the different ontologies and benchmarks. CONTRIBUTIONS FunPredCATH compares well with other prediction methods on CAFA3, and the ensemble approach outperforms the base methods. We show that non-IEA models obtain higher Fmax scores than the IEA counterparts, while the models including IEA annotations have higher coverage at the expense of a lower Fmax score.
Collapse
Affiliation(s)
- Joseph Bonello
- Department of Structural and Molecular Biology, University College London, Gower Street, London WC1E 6BT, United Kingdom; Department of Computer Information Systems, University of Malta, Faculty of ICT, Msida, MSD 2080, Malta.
| | - Christine Orengo
- Department of Structural and Molecular Biology, University College London, Gower Street, London WC1E 6BT, United Kingdom
| |
Collapse
|
20
|
Zheng J, Huang L, Yi H, Yan Y, Zhang X, Akresi J, Yin Y. Carbohydrate-active enzyme annotation in microbiomes using dbCAN. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.10.575125. [PMID: 38260309 PMCID: PMC10802576 DOI: 10.1101/2024.01.10.575125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
CAZymes or carbohydrate-active enzymes are critically important for human gut health, lignocellulose degradation, global carbon recycling, soil health, and plant disease. We developed dbCAN as a web server in 2012 and actively maintain it for automated CAZyme annotation. Considering data privacy and scalability, we provide run_dbcan as a standalone software package since 2018 to allow users perform more secure and scalable CAZyme annotation on their local servers. Here, we offer a comprehensive computational protocol on automated CAZyme annotation of microbiome sequencing data, covering everything from short read pre-processing to data visualization of CAZyme and glycan substrate occurrence and abundance in multiple samples. Using a real-world metagenomic sequencing dataset, this protocol describes commands for dataset and software preparation, metagenome assembly, gene prediction, CAZyme prediction, CAZyme gene cluster (CGC) prediction, glycan substrate prediction, and data visualization. The expected results include publication-quality plots for the abundance of CAZymes, CGCs, and substrates from multiple CAZyme annotation routes (individual sample assembly, co-assembly, and assembly-free). For the individual sample assembly route, this protocol takes ∼33h on a Linux computer with 40 CPUs, while other routes will be faster. This protocol does not require programming experience from users, but it does assume a familiarity with the Linux command-line interface and the ability to run Python scripts in the terminal. The target audience includes the tens of thousands of microbiome researchers who routinely use our web server. This protocol will encourage them to perform more secure, rapid, and scalable CAZyme annotation on their local computer servers.
Collapse
|
21
|
Feuermann M, Gaudet P. Interpreting Gene Ontology Annotations Derived from Sequence Homology Methods. Methods Mol Biol 2024; 2836:285-298. [PMID: 38995546 DOI: 10.1007/978-1-0716-4007-4_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2024]
Abstract
The Gene Ontology (GO) project describes the functions of the gene products of organisms from all kingdoms of life in a standardized way, enabling powerful analyses of experiments involving genome-wide analysis. The scientific literature is used to convert experimental results into GO annotations that systematically classify gene products' functions. However, to address the fact that only a minor fraction of all genes has been characterized experimentally, multiple predictive methods to assign GO annotations have been developed since the inception of GO. Sequence homologies between novel genes and genes with known functions help to approximate the roles of these non-characterized genes. Here we describe the main sequence homology methods to produce annotations: pairwise comparison (BLAST), protein profile models (InterPro), and phylogenetic-based annotation (PAINT). Some of these methods can be implemented with genome analysis pipelines (BLAST and InterPro2GO), while PAINT is curated by the GO consortium.
Collapse
Affiliation(s)
- Marc Feuermann
- SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
| | - Pascale Gaudet
- SIB Swiss Institute of Bioinformatics, Geneva, Switzerland.
| |
Collapse
|
22
|
Sepúlveda-Rebolledo P, González-Rosales C, Dopson M, Pérez-Rueda E, Holmes DS, Valdés JH. Comparative genomics sheds light on transcription factor-mediated regulation in the extreme acidophilic Acidithiobacillia representatives. Res Microbiol 2024; 175:104135. [PMID: 37678513 DOI: 10.1016/j.resmic.2023.104135] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 08/28/2023] [Accepted: 08/30/2023] [Indexed: 09/09/2023]
Abstract
Extreme acidophiles thrive in acidic environments, confront a multitude of challenges, and demonstrate remarkable adaptability in their metabolism to cope with the ever-changing environmental fluctuations, which encompass variations in temperature, pH levels, and the availability of electron acceptors and donors. The survival and proliferation of members within the Acidithiobacillia class rely on the deployment of transcriptional regulatory systems linked to essential physiological traits. The study of these transcriptional regulatory systems provides valuable insights into critical processes, such as energy metabolism and nutrient assimilation, and how they integrate into major genetic-metabolic circuits. In this study, we examined the transcriptional regulatory repertoires and potential interactions of forty-three Acidithiobacillia complete and draft genomes, encompassing nine species. To investigate the function and diversity of Transcription Factors (TFs) and their DNA Binding Sites (DBSs), we conducted a genome-wide comparative analysis, which allowed us to identify these regulatory elements in representatives of Acidithiobacillia. We classified TFs into gene families and compared their occurrence among all representatives, revealing conservation patterns across the class. The results identified conserved regulators for several pathways, including iron and sulfur oxidation, the main pathways for energy acquisition, providing new evidence for viable regulatory interactions and branch-specific conservation in Acidithiobacillia. The identification of TFs and DBSs not only corroborates existing experimental information for selected species, but also introduces novel candidates for experimental validation. Moreover, these promising candidates have the potential for further extension to new representatives within the class.
Collapse
Affiliation(s)
- Pedro Sepúlveda-Rebolledo
- Centro de Genómica y Bioinformática and PhD. Program on Integrative Genomics, Facultad de Ciencias, Universidad Mayor, Santiago (8580745), Chile.
| | - Carolina González-Rosales
- Center for Bioinformatics and Genome Biology, Fundación Ciencia & Vida, Santiago (8580638), Chile; Centre for Ecology and Evolution in Microbial Model Systems, Linnaeus University, SE-391 82 Kalmar, Sweden.
| | - Mark Dopson
- Centre for Ecology and Evolution in Microbial Model Systems, Linnaeus University, SE-391 82 Kalmar, Sweden.
| | - Ernesto Pérez-Rueda
- Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Unidad Académica del Estado de Yucatán, Mérida, Yucatán, Mexico.
| | - David S Holmes
- Center for Bioinformatics and Genome Biology, Fundación Ciencia & Vida, Santiago (8580638), Chile; Facultad de Medicina y Ciencia, Universidad San Sebastián, Santiago (7510156), Chile.
| | - Jorge H Valdés
- Center for Bioinformatics and Integrative Biology, Facultad de Ciencias de la Vida, Universidad Andrés Bello, Santiago (8370146), Chile.
| |
Collapse
|
23
|
Xie M, Xie R, Wang H. LPI-IBWA: Predicting lncRNA-protein interactions based on an improved Bi-Random walk algorithm. Methods 2023; 220:98-105. [PMID: 37972912 DOI: 10.1016/j.ymeth.2023.11.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 10/14/2023] [Accepted: 11/06/2023] [Indexed: 11/19/2023] Open
Abstract
Many studies have shown that long-chain noncoding RNAs (lncRNAs) are involved in a variety of biological processes such as post-transcriptional gene regulation, splicing, and translation by combining with corresponding proteins. Predicting lncRNA-protein interactions is an effective approach to infer the functions of lncRNAs. The paper proposes a new computational model named LPI-IBWA. At first, LPI-IBWA uses similarity kernel fusion (SKF) to integrate various types of biological information to construct lncRNA and protein similarity networks. Then, a bounded matrix completion model and a weighted k-nearest known neighbors algorithm are utilized to update the initial sparse lncRNA-protein interaction matrix. Based on the updated lncRNA-protein interaction matrix, the lncRNA similarity network and the protein similarity network are integrated into a heterogeneous network. Finally, an improved Bi-Random walk algorithm is used to predict novel latent lncRNA-protein interactions. 5-fold cross-validation experiments on a benchmark dataset showed that the AUC and AUPR of LPI-IBWA reach 0.920 and 0.736, respectively, which are higher than those of other state-of-the-art methods. Furthermore, the experimental results of case studies on a novel dataset also illustrated that LPI-IBWA could efficiently predict potential lncRNA-protein interactions.
Collapse
Affiliation(s)
- Minzhu Xie
- College of Information Science and Engineering, Hunan Normal University, China.
| | - Ruijie Xie
- College of Information Science and Engineering, Hunan Normal University, China.
| | - Hao Wang
- College of Information Science and Engineering, Hunan Normal University, China.
| |
Collapse
|
24
|
Romei M, Carpentier M, Chomilier J, Lecointre G. Origins and Functional Significance of Eukaryotic Protein Folds. J Mol Evol 2023; 91:854-864. [PMID: 38060007 DOI: 10.1007/s00239-023-10136-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 10/03/2023] [Indexed: 12/08/2023]
Abstract
Folds are the architecture and topology of a protein domain. Categories of folds are very few compared to the astronomical number of sequences. Eukaryotes have more protein folds than Archaea and Bacteria. These folds are of two types: shared with Archaea and/or Bacteria on one hand and specific to eukaryotic clades on the other hand. The first kind of folds is inherited from the first endosymbiosis and confirms the mixed origin of eukaryotes. In a dataset of 1073 folds whose presence or absence has been evidenced among 210 species equally distributed in the three super-kingdoms, we have identified 28 eukaryotic folds unambiguously inherited from Bacteria and 40 eukaryotic folds unambiguously inherited from Archaea. Compared to previous studies, the repartition of informational function is higher than expected for folds originated from Bacteria and as high as expected for folds inherited from Archaea. The second type of folds is specifically eukaryotic and associated with an increase of new folds within eukaryotes distributed in particular clades. Reconstructed ancestral states coupled with dating of each node on the tree of life provided fold appearance rates. The rate is on average twice higher within Eukaryota than within Bacteria or Archaea. The highest rates are found in the origins of eukaryotes, holozoans, metazoans, metazoans stricto sensu, and vertebrates: the roots of these clades correspond to bursts of fold evolution. We could correlate the functions of some of the fold synapomorphies within eukaryotes with significant evolutionary events. Among them, we find evidence for the rise of multicellularity, adaptive immune system, or virus folds which could be linked to an ecological shift made by tetrapods.
Collapse
Affiliation(s)
- Martin Romei
- Institut Systématique Evolution Biodiversité (ISYEB UMR 7205), Sorbonne Université, MNHN, CNRS, EPHE, UA, Paris, France
- IMPMC (UMR 7590), BiBiP, Sorbonne Université, CNRS, MNHN, Paris, France
| | - Mathilde Carpentier
- Institut Systématique Evolution Biodiversité (ISYEB UMR 7205), Sorbonne Université, MNHN, CNRS, EPHE, UA, Paris, France.
| | - Jacques Chomilier
- IMPMC (UMR 7590), BiBiP, Sorbonne Université, CNRS, MNHN, Paris, France
| | - Guillaume Lecointre
- Institut Systématique Evolution Biodiversité (ISYEB UMR 7205), Sorbonne Université, MNHN, CNRS, EPHE, UA, Paris, France
| |
Collapse
|
25
|
Howard SA, Carr CM, Sbahtu HI, Onwukwe U, López MJ, Dobson ADW, McCarthy RR. Enrichment of native plastic-associated biofilm communities to enhance polyester degrading activity. Environ Microbiol 2023; 25:2698-2718. [PMID: 37515381 PMCID: PMC10947123 DOI: 10.1111/1462-2920.16466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 06/30/2023] [Indexed: 07/30/2023]
Abstract
Plastic pollution is an increasing worldwide problem urgently requiring a solution. While recycling rates are increasing globally, only 9% of all plastic waste has been recycled, and with the cost and limited downstream uses of recycled plastic, an alternative is needed. Here, we found that expanded polystyrene (EPS) promoted high levels of bacterial biofilm formation and sought out environmental EPS waste to characterize these native communities. We demonstrated that the EPS attached communities had limited plastic degrading activity. We then performed a long-term enrichment experiment where we placed a robust selection pressure on these communities by limiting carbon availability such that the waste plastic was the only carbon source. Seven of the resulting enriched bacterial communities had increased plastic degrading activity compared to the starting bacterial communities. Pseudomonas stutzeri was predominantly identified in six of the seven enriched communities as the strongest polyester degrader. Sequencing of one isolate of P. stutzeri revealed two putative polyesterases and one putative MHETase. This indicates that waste plastic-associated biofilms are a source for bacteria that have plastic-degrading potential, and that this potential can be unlocked through selective pressure and further in vitro enrichment experiments, resulting in biodegradative communities that are better than nature.
Collapse
Affiliation(s)
- Sophie A. Howard
- Centre for Inflammation Research and Translational Medicine, Division of Biosciences, Department of Life Sciences, College of Health and Life SciencesBrunel University LondonUxbridgeUK
| | - Clodagh M. Carr
- School of MicrobiologyUniversity College CorkCorkIreland
- SSPC‐SFI Research Centre for PharmaceuticalsUniversity College CorkCorkIreland
| | - Habteab Isaack Sbahtu
- Centre for Inflammation Research and Translational Medicine, Division of Biosciences, Department of Life Sciences, College of Health and Life SciencesBrunel University LondonUxbridgeUK
| | - Uchechukwu Onwukwe
- Experimental Techniques Centre, College of Engineering, Design and Physical SciencesBrunel University LondonUxbridgeUK
| | - Maria J. López
- Department of Biology and Geology, CITE II‐BUniversity of Almería, Agrifood Campus of International Excellence ceiA3, CIAIMBITALAlmeriaSpain
| | - Alan D. W. Dobson
- School of MicrobiologyUniversity College CorkCorkIreland
- SSPC‐SFI Research Centre for PharmaceuticalsUniversity College CorkCorkIreland
| | - Ronan R. McCarthy
- Centre for Inflammation Research and Translational Medicine, Division of Biosciences, Department of Life Sciences, College of Health and Life SciencesBrunel University LondonUxbridgeUK
| |
Collapse
|
26
|
Kouba P, Kohout P, Haddadi F, Bushuiev A, Samusevich R, Sedlar J, Damborsky J, Pluskal T, Sivic J, Mazurenko S. Machine Learning-Guided Protein Engineering. ACS Catal 2023; 13:13863-13895. [PMID: 37942269 PMCID: PMC10629210 DOI: 10.1021/acscatal.3c02743] [Citation(s) in RCA: 26] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 09/20/2023] [Indexed: 11/10/2023]
Abstract
Recent progress in engineering highly promising biocatalysts has increasingly involved machine learning methods. These methods leverage existing experimental and simulation data to aid in the discovery and annotation of promising enzymes, as well as in suggesting beneficial mutations for improving known targets. The field of machine learning for protein engineering is gathering steam, driven by recent success stories and notable progress in other areas. It already encompasses ambitious tasks such as understanding and predicting protein structure and function, catalytic efficiency, enantioselectivity, protein dynamics, stability, solubility, aggregation, and more. Nonetheless, the field is still evolving, with many challenges to overcome and questions to address. In this Perspective, we provide an overview of ongoing trends in this domain, highlight recent case studies, and examine the current limitations of machine learning-based methods. We emphasize the crucial importance of thorough experimental validation of emerging models before their use for rational protein design. We present our opinions on the fundamental problems and outline the potential directions for future research.
Collapse
Affiliation(s)
- Petr Kouba
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
- Faculty of
Electrical Engineering, Czech Technical
University in Prague, Technicka 2, 166 27 Prague 6, Czech Republic
| | - Pavel Kohout
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Faraneh Haddadi
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Anton Bushuiev
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Raman Samusevich
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
- Institute
of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
| | - Jiri Sedlar
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Jiri Damborsky
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Tomas Pluskal
- Institute
of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
| | - Josef Sivic
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Stanislav Mazurenko
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| |
Collapse
|
27
|
Lv G, Xia Y, Qi Z, Zhao Z, Tang L, Chen C, Yang S, Wang Q, Gu L. LncRNA-protein interaction prediction with reweighted feature selection. BMC Bioinformatics 2023; 24:410. [PMID: 37904080 PMCID: PMC10617115 DOI: 10.1186/s12859-023-05536-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Accepted: 10/16/2023] [Indexed: 11/01/2023] Open
Abstract
LncRNA-protein interactions are ubiquitous in organisms and play a crucial role in a variety of biological processes and complex diseases. Many computational methods have been reported for lncRNA-protein interaction prediction. However, the experimental techniques to detect lncRNA-protein interactions are laborious and time-consuming. Therefore, to address this challenge, this paper proposes a reweighting boosting feature selection (RBFS) method model to select key features. Specially, a reweighted apporach can adjust the contribution of each observational samples to learning model fitting; let higher weights are given more influence samples than those with lower weights. Feature selection with boosting can efficiently rank to iterate over important features to obtain the optimal feature subset. Besides, in the experiments, the RBFS method is applied to the prediction of lncRNA-protein interactions. The experimental results demonstrate that our method achieves higher accuracy and less redundancy with fewer features.
Collapse
Affiliation(s)
- Guohao Lv
- School of Information and Computer, Anhui Agricultural University, Hefei, 230036, Anhui, China
| | - Yingchun Xia
- School of Information and Computer, Anhui Agricultural University, Hefei, 230036, Anhui, China
| | - Zhao Qi
- School of Information and Computer, Anhui Agricultural University, Hefei, 230036, Anhui, China
| | - Zihao Zhao
- School of Information and Computer, Anhui Agricultural University, Hefei, 230036, Anhui, China
| | - Lianggui Tang
- School of Information and Computer, Anhui Agricultural University, Hefei, 230036, Anhui, China
| | - Cheng Chen
- School of Information and Computer, Anhui Agricultural University, Hefei, 230036, Anhui, China
| | - Shuai Yang
- School of Information and Computer, Anhui Agricultural University, Hefei, 230036, Anhui, China
| | - Qingyong Wang
- School of Information and Computer, Anhui Agricultural University, Hefei, 230036, Anhui, China
| | - Lichuan Gu
- School of Information and Computer, Anhui Agricultural University, Hefei, 230036, Anhui, China.
| |
Collapse
|
28
|
Simpson J, Kozak CA, Boso G. Evolutionary conservation of an ancient retroviral gagpol gene in Artiodactyla. J Virol 2023; 97:e0053523. [PMID: 37668369 PMCID: PMC10537755 DOI: 10.1128/jvi.00535-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Accepted: 06/28/2023] [Indexed: 09/06/2023] Open
Abstract
The genomes of mammals contain fingerprints of past infections by ancient retroviruses that invaded the germline of their ancestors. Most of these endogenous retroviruses (ERVs) contain only remnants of the original retrovirus; however, on rare occasions, ERV genes can be co-opted for a beneficial host function. While most studies of co-opted ERVs have focused on envelope genes, including the syncytins that function in placentation, there are examples of co-opted gag genes including one we recently discovered in simian primates. Here, we searched for other intact gag genes in non-primate mammalian lineages. We began by examining the genomes of extant camel species, which represent a basal lineage in the order Artiodactyla. This identified a gagpol gene with a large open reading frame (ORF) (>3,500 bp) in the same orthologous location in Artiodactyla species but that is absent in other mammals. Thus, this ERV was fixed in the common ancestor of all Artiodactyla at least 64 million years ago. The amino acid sequence of this gene, termed ARTgagpol, contains recognizable matrix, capsid, nucleocapsid, and reverse transcriptase domains in ruminants, with an RNase H domain in camels and pigs. Phylogenetic analysis and structural prediction of its reverse transcriptase and RNase H domains groups ARTgagpol with gammaretroviruses. Transcriptomic analysis shows ARTgagpol expression in multiple tissues suggestive of a co-opted host function. These findings identify the oldest and largest ERV-derived gagpol gene with an intact ORF in mammals, an intriguing milestone in the co-evolution of mammals and retroviruses. IMPORTANCE Retroviruses are unique among viruses that infect animals as they integrate their reverse-transcribed double-stranded DNA into host chromosomes. When this happens in a germline cell, such as sperm, egg, or their precursors, the integrated retroviral copies can be passed on to the next generation as endogenous retroviruses (ERVs). On rare occasions, the genes of these ERVs can be domesticated by the host. In this study we used computational similarity searches to identify an ancient ERV with an intact viral gagpol gene in the genomes of camels that is also found in the same genomic location in other even-toed ungulates suggesting that it is at least 64 million years old. Broad tissue expression and predicted preservation of the reverse transcriptase fold of this protein suggest that it may be domesticated for a host function. This is the oldest known intact gagpol gene of an ancient retrovirus in mammals.
Collapse
Affiliation(s)
- J'Zaria Simpson
- Laboratory of Molecular Microbiology, National Institute of Allergy and Infectious Diseases, Bethesda, Maryland, USA
| | - Christine A. Kozak
- Laboratory of Molecular Microbiology, National Institute of Allergy and Infectious Diseases, Bethesda, Maryland, USA
| | - Guney Boso
- Laboratory of Molecular Microbiology, National Institute of Allergy and Infectious Diseases, Bethesda, Maryland, USA
| |
Collapse
|
29
|
Arikawa K, Hosokawa M. Uncultured prokaryotic genomes in the spotlight: An examination of publicly available data from metagenomics and single-cell genomics. Comput Struct Biotechnol J 2023; 21:4508-4518. [PMID: 37771751 PMCID: PMC10523443 DOI: 10.1016/j.csbj.2023.09.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 09/10/2023] [Accepted: 09/10/2023] [Indexed: 09/30/2023] Open
Abstract
Owing to the ineffectiveness of traditional culture techniques for the vast majority of microbial species, culture-independent analyses utilizing next-generation sequencing and bioinformatics have become essential for gaining insight into microbial ecology and function. This mini-review focuses on two essential methods for obtaining genetic information from uncultured prokaryotes, metagenomics and single-cell genomics. We analyzed the registration status of uncultured prokaryotic genome data from major public databases and assessed the advantages and limitations of both the methods. Metagenomics generates a significant quantity of sequence data and multiple prokaryotic genomes using straightforward experimental procedures. However, in ecosystems with high microbial diversity, such as soil, most genes are presented as brief, disconnected contigs, and lack association of highly conserved genes and mobile genetic elements with individual species genomes. Although technically more challenging, single-cell genomics offers valuable insights into complex ecosystems by providing strain-resolved genomes, addressing issues in metagenomics. Recent technological advancements, such as long-read sequencing, machine learning algorithms, and in silico protein structure prediction, in combination with vast genomic data, have the potential to overcome the current technical challenges and facilitate a deeper understanding of uncultured microbial ecosystems and microbial dark matter genes and proteins. In light of this, it is imperative that continued innovation in both methods and technologies take place to create high-quality reference genome databases that will support future microbial research and industrial applications.
Collapse
Affiliation(s)
- Koji Arikawa
- Department of Life Science and Medical Bioscience, Waseda University, 2-2 Wakamatsu-cho, Shinjuku-ku, Tokyo 162-8480, Japan
- bitBiome, Inc., 513 Wasedatsurumaki-cho, Shinjuku-ku, Tokyo 162-0041, Japan
| | - Masahito Hosokawa
- Department of Life Science and Medical Bioscience, Waseda University, 2-2 Wakamatsu-cho, Shinjuku-ku, Tokyo 162-8480, Japan
- bitBiome, Inc., 513 Wasedatsurumaki-cho, Shinjuku-ku, Tokyo 162-0041, Japan
- Research Organization for Nano and Life Innovation, Waseda University, 513 Wasedatsurumaki-cho, Shinjuku-ku, Tokyo 162-0041, Japan
- Institute for Advanced Research of Biosystem Dynamics, Waseda Research Institute for Science and Engineering, 3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
- Computational Bio Big-Data Open Innovation Laboratory, National Institute of Advanced Industrial Science and Technology, 3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
| |
Collapse
|
30
|
Gofton AW, Popa-Baez A, Takano A, Soennichsen K, Michie M, Short M, Supriyono S, Pascoe J, Cusbert S, Mulley R. Characterisation and comparative genomics of three new Varanus-associated Borrelia spp. from Indonesia and Australia. Parasit Vectors 2023; 16:317. [PMID: 37670353 PMCID: PMC10481545 DOI: 10.1186/s13071-023-05937-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 08/16/2023] [Indexed: 09/07/2023] Open
Abstract
BACKGROUND Borrelia are important disease-causing tick- and louse-borne spirochaetes than can infect a wide variety of vertebrates, including humans and reptiles. Reptile-associated (REP) Borrelia, once considered a peculiarity, are now recognised as a distinct and important evolutionary lineage, and are increasingly being discovered worldwide in association with novel hosts. Numerous novel Borrelia spp. associated with monitor lizards (Varanus spp.) have been recently identified throughout the Indo-Pacific region; however, there is a lack of genomic data on these Borrelia. METHODS We used metagenomic techniques to sequence almost complete genomes of novel Borrelia spp. from Varanus varius and Varanus giganteus from Australia, and used long- and short-read technologies to sequence the complete genomes of two strains of a novel Borrelia sp. previously isolated from ticks infesting Varanus salvator from Indonesia. We investigated intra- and interspecies genomic diversity, including plasmid diversity and relatedness, among Varanus-associated Borrelia and other available REP Borrelia and, based on 712 whole genome orthologues, produced the most complete phylogenetic analysis, to the best of our knowledge, of REP Borrelia to date. RESULTS The genomic architecture of Varanus-associated Borrelia spp. is similar to that of Borrelia spp. that cause relapsing fever (RF), and includes a highly conserved megaplasmid and numerous smaller linear and circular plasmids that lack structural consistency between species. Analysis of PF32 and PF57/62 plasmid partitioning genes indicated that REP Borrelia plasmids fall into at least six distinct plasmid families, some of which are related to previously defined Borrelia plasmid families, whereas the others appear to be unique. REP Borrelia contain immunogenic variable major proteins that are homologous to those found in Borrelia spp. that cause RF, although they are limited in copy number and variability and have low sequence identities to RF variable major proteins. Phylogenetic analyses based on single marker genes and 712 single copy orthologs also definitively demonstrated the monophyly of REP Borrelia as a unique lineage. CONCLUSIONS In this work we present four new genomes from three novel Borrelia, and thus double the number of REP Borrelia genomes publicly available. The genomic characterisation of these Borrelia clearly demonstrates their distinctiveness as species, and we propose the names Borrelia salvatorii, 'Candidatus Borrelia undatumii', and 'Candidatus Borrelia rubricentralis' for them.
Collapse
Affiliation(s)
- Alexander William Gofton
- Health and Biosecurity, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Canberra, Australia
| | - Angel Popa-Baez
- Health and Biosecurity, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Canberra, Australia
| | - Ai Takano
- Department of Veterinary Medicine, Joint Faculty of Veterinary Medicine, Yamaguchi University, Yamaguchi, Japan
| | - Kari Soennichsen
- Institute for Applied Ecology, University of Canberra, Canberra, Australia
| | - Michelle Michie
- Health and Biosecurity, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Canberra, Australia
| | - Makenna Short
- Health and Biosecurity, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Canberra, Australia
| | - Supriyono Supriyono
- Department of Animal Diseases and Veterinary Health, Bogor Agricultural University, Bogor, Indonesia
| | - Jack Pascoe
- School of Agricultural and Ecosystem Sciences, University of Melbourne, Melbourne, Australia
| | - Sue Cusbert
- School of Science and Health, Western Sydney University, Penrith, Australia
| | - Robert Mulley
- School of Science and Health, Western Sydney University, Penrith, Australia
| |
Collapse
|
31
|
Xu S, Suttapitugsakul S, Tong M, Wu R. Systematic analysis of the impact of phosphorylation and O-GlcNAcylation on protein subcellular localization. Cell Rep 2023; 42:112796. [PMID: 37453062 PMCID: PMC10530397 DOI: 10.1016/j.celrep.2023.112796] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 05/02/2023] [Accepted: 06/27/2023] [Indexed: 07/18/2023] Open
Abstract
The subcellular localization of proteins is critical for their functions in eukaryotic cells and is tightly correlated with protein modifications. Here, we comprehensively investigate the nuclear-cytoplasmic distributions of the phosphorylated, O-GlcNAcylated, and non-modified forms of proteins to dissect the correlation between protein distribution and modifications. Phosphorylated and O-GlcNAcylated proteins have overall higher nuclear distributions than non-modified ones. Different distributions among the phosphorylated, O-GlcNAcylated, and non-modified forms of proteins are associated with protein size, structure, and function, as well as local environment and adjacent residues around modification sites. Moreover, we perform site-mutagenesis experiments using phosphomimetic and phospho-null mutants of two proteins to validate the proteomic results. Additionally, the effects of the OGT/OGA inhibition on glycoprotein distribution are systematically investigated, and the distribution changes of glycoproteins are related to their abundance changes under the inhibitions. Systematic investigation of the relationship between protein modification and localization advances our understanding of protein functions.
Collapse
Affiliation(s)
- Senhan Xu
- School of Chemistry and Biochemistry and the Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Suttipong Suttapitugsakul
- School of Chemistry and Biochemistry and the Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Ming Tong
- School of Chemistry and Biochemistry and the Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Ronghu Wu
- School of Chemistry and Biochemistry and the Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA.
| |
Collapse
|
32
|
Fan Z, Wang LY, Xiao L, Tan B, Luo B, Ren TY, Liu N, Zhang ZS, Bai M. Lampshade web spider Ectatosticta davidi chromosome-level genome assembly provides evidence for its phylogenetic position. Commun Biol 2023; 6:748. [PMID: 37463957 PMCID: PMC10354039 DOI: 10.1038/s42003-023-05129-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Accepted: 07/10/2023] [Indexed: 07/20/2023] Open
Abstract
The spider of Ectatosticta davidi, belonging to the lamp-shade web spider family, Hypochilidae, which is closely related to Hypochilidae and Filistatidae and recovered as sister of the rest Araneomorphs spiders. Here we show the final assembled genome of E. davidi with 2.16 Gb in 15 chromosomes. Then we confirm the evolutionary position of Hypochilidae. Moreover, we find that the GMC gene family exhibit high conservation throughout the evolution of true spiders. We also find that the MaSp genes of E. davidi may represent an early stage of MaSp and MiSp genes in other true spiders, while CrSp shares a common origin with AgSp and PySp but differ from MaSp. Altogether, this study contributes to addressing the limited availability of genomic sequences from Hypochilidae spiders, and provides a valuable resource for investigating the genomic evolution of spiders.
Collapse
Affiliation(s)
- Zheng Fan
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, 100101, Beijing, China
- School of Life Sciences, Southwest University, 400700, Chongqing, China
| | - Lu-Yu Wang
- School of Life Sciences, Southwest University, 400700, Chongqing, China
| | - Lin Xiao
- School of Life Sciences, Southwest University, 400700, Chongqing, China
| | - Bing Tan
- School of Life Sciences, Southwest University, 400700, Chongqing, China
| | - Bin Luo
- School of Life Sciences, Southwest University, 400700, Chongqing, China
| | - Tian-Yu Ren
- School of Life Sciences, Southwest University, 400700, Chongqing, China
| | - Ning Liu
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, 100101, Beijing, China.
| | - Zhi-Sheng Zhang
- School of Life Sciences, Southwest University, 400700, Chongqing, China.
| | - Ming Bai
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, 100101, Beijing, China.
- Northeast Asia Biodiversity Research Center, Northeast Forestry University, 150040, Harbin, China.
- University of Chinese Academy of Sciences, 100049, Beijing, China.
| |
Collapse
|
33
|
Feng T, Pucker B, Kuang T, Song B, Yang Y, Lin N, Zhang H, Moore MJ, Brockington SF, Wang Q, Deng T, Wang H, Sun H. The genome of the glasshouse plant noble rhubarb (Rheum nobile) provides a window into alpine adaptation. Commun Biol 2023; 6:706. [PMID: 37429977 DOI: 10.1038/s42003-023-05044-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 06/14/2023] [Indexed: 07/12/2023] Open
Abstract
Glasshouse plants are species that trap warmth via specialized morphology and physiology, mimicking a human glasshouse. In the Himalayan alpine region, the highly specialized glasshouse morphology has independently evolved in distinct lineages to adapt to intensive UV radiation and low temperature. Here we demonstrate that the glasshouse structure - specialized cauline leaves - is highly effective in absorbing UV light but transmitting visible and infrared light, creating an optimal microclimate for the development of reproductive organs. We reveal that this glasshouse syndrome has evolved at least three times independently in the rhubarb genus Rheum. We report the genome sequence of the flagship glasshouse plant Rheum nobile and identify key genetic network modules in association with the morphological transition to specialized glasshouse leaves, including active secondary cell wall biogenesis, upregulated cuticular cutin biosynthesis, and suppression of photosynthesis and terpenoid biosynthesis. The distinct cell wall organization and cuticle development might be important for the specialized optical property of glasshouse leaves. We also find that the expansion of LTRs has likely played an important role in noble rhubarb adaptation to high elevation environments. Our study will enable additional comparative analyses to identify the genetic basis underlying the convergent occurrence of glasshouse syndrome.
Collapse
Affiliation(s)
- Tao Feng
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China
- CAS Key Laboratory for Plant Biodiversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, Hubei, 430074, China
| | - Boas Pucker
- Department of Plant Sciences, University of Cambridge, Tennis Court Road, Cambridge, CB2 3EA, UK
- CeBiTec & Faculty of Biology, Bielefeld University, Universitaetsstrasse, Bielefeld, 33615, Germany
- Institute of Plant Biology & BRICS, TU Braunschweig, 38106, Braunschweig, Germany
| | - Tianhui Kuang
- CAS Key Laboratory for Plant Biodiversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
| | - Bo Song
- CAS Key Laboratory for Plant Biodiversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
| | - Ya Yang
- Department of Plant and Microbial Biology, University of Minnesota, Twin Cities, St. Paul, MN, 55108, USA
| | - Nan Lin
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, Hubei, 430074, China
| | - Huajie Zhang
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, Hubei, 430074, China
| | - Michael J Moore
- Department of Biology, Oberlin College, Oberlin, OH, 44074, USA
| | - Samuel F Brockington
- Department of Plant Sciences, University of Cambridge, Tennis Court Road, Cambridge, CB2 3EA, UK
| | - Qingfeng Wang
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, Hubei, 430074, China
| | - Tao Deng
- CAS Key Laboratory for Plant Biodiversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China.
| | - Hengchang Wang
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China.
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, Hubei, 430074, China.
| | - Hang Sun
- CAS Key Laboratory for Plant Biodiversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China.
| |
Collapse
|
34
|
Wang G, Yin H, Zhao T, Yang D, Jia S, Qiao C. De novo transcriptome assembly of Aureobasidium melanogenum CGMCC18996 to analyze the β-poly(L-malic acid) biosynthesis pathway under the CaCO3 addition. FOOD SCIENCE AND HUMAN WELLNESS 2023. [DOI: 10.1016/j.fshw.2022.10.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
35
|
Wang Y, Zhao D, Zhang W, Wang S, Wu Y, Wang S, Yang Y, Guo B. Four PQQ-Dependent Alcohol Dehydrogenases Responsible for the Oxidative Detoxification of Deoxynivalenol in a Novel Bacterium Ketogulonicigenium vulgare D3_3 Originated from the Feces of Tenebrio molitor Larvae. Toxins (Basel) 2023; 15:367. [PMID: 37368668 PMCID: PMC10301637 DOI: 10.3390/toxins15060367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 05/25/2023] [Accepted: 05/26/2023] [Indexed: 06/29/2023] Open
Abstract
Deoxynivalenol (DON) is frequently detected in cereals and cereal-based products and has a negative impact on human and animal health. In this study, an unprecedented DON-degrading bacterial isolate D3_3 was isolated from a sample of Tenebrio molitor larva feces. A 16S rRNA-based phylogenetic analysis and genome-based average nucleotide identity comparison clearly revealed that strain D3_3 belonged to the species Ketogulonicigenium vulgare. This isolate D3_3 could efficiently degrade 50 mg/L of DON under a broad range of conditions, such as pHs of 7.0-9.0 and temperatures of 18-30 °C, as well as during aerobic or anaerobic cultivation. 3-keto-DON was identified as the sole and finished DON metabolite using mass spectrometry. In vitro toxicity tests revealed that 3-keto-DON had lower cytotoxicity to human gastric epithelial cells and higher phytotoxicity to Lemna minor than its parent mycotoxin DON. Additionally, four genes encoding pyrroloquinoline quinone (PQQ)-dependent alcohol dehydrogenases in the genome of isolate D3_3 were identified as being responsible for the DON oxidation reaction. Overall, as a highly potent DON-degrading microbe, a member of the genus Ketogulonicigenium is reported for the first time in this study. The discovery of this DON-degrading isolate D3_3 and its four dehydrogenases will allow microbial strains and enzyme resources to become available for the future development of DON-detoxifying agents for food and animal feed.
Collapse
Affiliation(s)
- Yang Wang
- Academy of National Food and Strategic Reserves Administration, Beijing 100037, China; (Y.W.)
| | - Donglei Zhao
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
| | - Wei Zhang
- Academy of National Food and Strategic Reserves Administration, Beijing 100037, China; (Y.W.)
| | - Songshan Wang
- Academy of National Food and Strategic Reserves Administration, Beijing 100037, China; (Y.W.)
| | - Yu Wu
- Academy of National Food and Strategic Reserves Administration, Beijing 100037, China; (Y.W.)
| | - Songxue Wang
- Academy of National Food and Strategic Reserves Administration, Beijing 100037, China; (Y.W.)
| | - Yongtan Yang
- Academy of National Food and Strategic Reserves Administration, Beijing 100037, China; (Y.W.)
| | - Baoyuan Guo
- Academy of National Food and Strategic Reserves Administration, Beijing 100037, China; (Y.W.)
| |
Collapse
|
36
|
Álvarez-Campos P, García-Castro H, Emili E, Pérez-Posada A, Salamanca-Díaz DA, Mason V, Metzger B, Bely AE, Kenny N, Özpolat BD, Solana J. Annelid adult cell type diversity and their pluripotent cellular origins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.25.537979. [PMID: 37163014 PMCID: PMC10168269 DOI: 10.1101/2023.04.25.537979] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Annelids are a broadly distributed, highly diverse, economically and environmentally important group of animals. Most species can regenerate missing body parts, and many are able to reproduce asexually. Therefore, many annelids can generate all adult cell types in adult stages. However, the putative adult stem cell populations involved in these processes, as well as the diversity of adult cell types generated by them, are still unknown. Here, we recover 75,218 single cell transcriptomes of Pristina leidyi, a highly regenerative and asexually-reproducing freshwater annelid. We characterise all major annelid adult cell types, and validate many of our observations by HCR in situ hybridisation. Our results uncover complex patterns of regionally expressed genes in the annelid gut, as well as neuronal, muscle and epidermal specific genes. We also characterise annelid-specific cell types such as the chaetal sacs and globin+ cells, and novel cell types of enigmatic affinity, including a vigilin+ cell type, a lumbrokinase+ cell type, and a diverse set of metabolic cells. Moreover, we characterise transcription factors and gene networks that are expressed specifically in these populations. Finally, we uncover a broadly abundant cluster of putative stem cells with a pluripotent signature. This population expresses well-known stem cell markers such as vasa, piwi and nanos homologues, but also shows heterogeneous expression of differentiated cell markers and their transcription factors. In these piwi+ cells, we also find conserved expression of pluripotency regulators, including multiple chromatin remodelling and epigenetic factors. Finally, lineage reconstruction analyses reveal the existence of differentiation trajectories from piwi+ cells to diverse adult types. Our data reveal the cell type diversity of adult annelids for the first time and serve as a resource for studying annelid cell types and their evolution. On the other hand, our characterisation of a piwi+ cell population with a pluripotent stem cell signature will serve as a platform for the study of annelid stem cells and their role in regeneration.
Collapse
Affiliation(s)
- Patricia Álvarez-Campos
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
- Centro de Investigación en Biodiversidad y Cambio Global (CIBC-UAM) & Departamento de Biología (Zoología), Facultad de Ciencias, Universidad Autónoma de Madrid, Madrid, Spain
| | - Helena García-Castro
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
| | - Elena Emili
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
| | - Alberto Pérez-Posada
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
| | | | - Vincent Mason
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
| | - Bria Metzger
- Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA, USA, 05432
- Department of Biology, Washington University in St. Louis. 1 Brookings Dr. Saint Louis, MO, USA, 63130
| | | | - Nathan Kenny
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
- Department of Biochemistry, University of Otago, P.O. Box 56, Dunedin, Aotearoa New Zealand
| | - B Duygu Özpolat
- Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA, USA, 05432
- Department of Biology, Washington University in St. Louis. 1 Brookings Dr. Saint Louis, MO, USA, 63130
| | - Jordi Solana
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
| |
Collapse
|
37
|
Xu S, Yin K, Wu R. Combining Selective Enrichment and a Boosting Approach to Globally and Site-Specifically Characterize Protein Co-translational O-GlcNAcylation. Anal Chem 2023; 95:4371-4380. [PMID: 36802545 PMCID: PMC9996615 DOI: 10.1021/acs.analchem.2c04779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
Protein O-GlcNAcylation plays extremely important roles in mammalian cells, regulating signal transduction and gene expression. This modification can happen during protein translation, and systematic and site-specific analysis of protein co-translational O-GlcNAcylation can advance our understanding of this important modification. However, it is extraordinarily challenging because normally O-GlcNAcylated proteins are very low abundant and the abundances of co-translational ones are even much lower. Here, we developed a method integrating selective enrichment, a boosting approach, and multiplexed proteomics to globally and site-specifically characterize protein co-translational O-GlcNAcylation. The boosting approach using the TMT labeling dramatically enhances the detection of co-translational glycopeptides with low abundance when enriched O-GlcNAcylated peptides from cells with a much longer labeling time was used as a boosting sample. More than 180 co-translational O-GlcNAcylated proteins were site-specifically identified. Further analyses revealed that among co-translational glycoproteins, those related to DNA binding and transcription are highly overrepresented using the total identified O-GlcNAcylated proteins in the same cells as the background. Compared with the glycosylation sites on all glycoproteins, co-translational sites have different local structures and adjacent amino acid residues. Overall, an integrative method was developed to identify protein co-translational O-GlcNAcylation, which is very useful to advance our understanding of this important modification.
Collapse
Affiliation(s)
- Senhan Xu
- School of Chemistry and Biochemistry and the Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Kejun Yin
- School of Chemistry and Biochemistry and the Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Ronghu Wu
- School of Chemistry and Biochemistry and the Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| |
Collapse
|
38
|
Sanderson T, Bileschi ML, Belanger D, Colwell LJ. ProteInfer, deep neural networks for protein functional inference. eLife 2023; 12:e80942. [PMID: 36847334 PMCID: PMC10063232 DOI: 10.7554/elife.80942] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 02/24/2023] [Indexed: 03/01/2023] Open
Abstract
Predicting the function of a protein from its amino acid sequence is a long-standing challenge in bioinformatics. Traditional approaches use sequence alignment to compare a query sequence either to thousands of models of protein families or to large databases of individual protein sequences. Here we introduce ProteInfer, which instead employs deep convolutional neural networks to directly predict a variety of protein functions - Enzyme Commission (EC) numbers and Gene Ontology (GO) terms - directly from an unaligned amino acid sequence. This approach provides precise predictions which complement alignment-based methods, and the computational efficiency of a single neural network permits novel and lightweight software interfaces, which we demonstrate with an in-browser graphical interface for protein function prediction in which all computation is performed on the user's personal computer with no data uploaded to remote servers. Moreover, these models place full-length amino acid sequences into a generalised functional space, facilitating downstream analysis and interpretation. To read the interactive version of this paper, please visit https://google-research.github.io/proteinfer/.
Collapse
Affiliation(s)
| | | | | | - Lucy J Colwell
- Google AIBostonUnited States
- University of CambridgeCambridgeUnited Kingdom
| |
Collapse
|
39
|
Ng TA, Rashid S, Kwoh CK. Virulence network of interacting domains of influenza a and mouse proteins. FRONTIERS IN BIOINFORMATICS 2023; 3:1123993. [PMID: 36875146 PMCID: PMC9982101 DOI: 10.3389/fbinf.2023.1123993] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Accepted: 02/03/2023] [Indexed: 02/19/2023] Open
Abstract
There exist several databases that provide virus-host protein interactions. While most provide curated records of interacting virus-host protein pairs, information on the strain-specific virulence factors or protein domains involved, is lacking. Some databases offer incomplete coverage of influenza strains because of the need to sift through vast amounts of literature (including those of major viruses including HIV and Dengue, besides others). None have offered complete, strain specific protein-protein interaction records for the influenza A group of viruses. In this paper, we present a comprehensive network of predicted domain-domain interaction(s) (DDI) between influenza A virus (IAV) and mouse host proteins, that will allow the systematic study of disease factors by taking the virulence information (lethal dose) into account. From a previously published dataset of lethal dose studies of IAV infection in mice, we constructed an interacting domain network of mouse and viral protein domains as nodes with weighted edges. The edges were scored with the Domain Interaction Statistical Potential (DISPOT) to indicate putative DDI. The virulence network can be easily navigated via a web browser, with the associated virulence information (LD50 values) prominently displayed. The network will aid influenza A disease modeling by providing strain-specific virulence levels with interacting protein domains. It can possibly contribute to computational methods for uncovering influenza infection mechanisms mediated through protein domain interactions between viral and host proteins. It is available at https://iav-ppi.onrender.com/home.
Collapse
Affiliation(s)
| | | | - Chee Keong Kwoh
- School of Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore
| |
Collapse
|
40
|
Paysan-Lafosse T, Blum M, Chuguransky S, Grego T, Pinto BL, Salazar G, Bileschi M, Bork P, Bridge A, Colwell L, Gough J, Haft D, Letunić I, Marchler-Bauer A, Mi H, Natale D, Orengo C, Pandurangan A, Rivoire C, Sigrist CJA, Sillitoe I, Thanki N, Thomas PD, Tosatto SCE, Wu C, Bateman A. InterPro in 2022. Nucleic Acids Res 2023; 51:D418-D427. [PMID: 36350672 PMCID: PMC9825450 DOI: 10.1093/nar/gkac993] [Citation(s) in RCA: 814] [Impact Index Per Article: 814.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 10/12/2022] [Accepted: 10/28/2022] [Indexed: 11/11/2022] Open
Abstract
The InterPro database (https://www.ebi.ac.uk/interpro/) provides an integrative classification of protein sequences into families, and identifies functionally important domains and conserved sites. Here, we report recent developments with InterPro (version 90.0) and its associated software, including updates to data content and to the website. These developments extend and enrich the information provided by InterPro, and provide a more user friendly access to the data. Additionally, we have worked on adding Pfam website features to the InterPro website, as the Pfam website will be retired in late 2022. We also show that InterPro's sequence coverage has kept pace with the growth of UniProtKB. Moreover, we report the development of a card game as a method of engaging the non-scientific community. Finally, we discuss the benefits and challenges brought by the use of artificial intelligence for protein structure prediction.
Collapse
Affiliation(s)
- Typhaine Paysan-Lafosse
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Matthias Blum
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Sara Chuguransky
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Tiago Grego
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Beatriz Lázaro Pinto
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Gustavo A Salazar
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | | | - Peer Bork
- European Molecular Biology Laboratory, Structural and Computational Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
- Yonsei Frontier Lab (YFL), Yonsei University, 03722 Seoul, South Korea
- Department of Bioinformatics, Biocenter, University of Würzburg, 97074 Würzburg, Germany
| | - Alan Bridge
- Swiss-Prot Group, Swiss Institute of Bioinformatics, CMU, 1 rue Michel Servet, CH-1211, Geneva 4, Switzerland
| | - Lucy Colwell
- Google Research, Brain team, Cambridge, MA, USA
- Department of Chemistry, University of Cambridge, Cambridge, UK
| | - Julian Gough
- Medical Research Council Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Ave, Trumpington, Cambridge CB2 0QH, UK
| | - Daniel H Haft
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Ivica Letunić
- Biobyte Solutions GmbH, Bothestr 142, 69126 Heidelberg, Germany
| | - Aron Marchler-Bauer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Huaiyu Mi
- Division of Bioinformatics, Department of Preventive Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Darren A Natale
- Protein Information Resource, Georgetown University Medical Center, Washington, DC 20007, USA
| | - Christine A Orengo
- Department of Structural and Molecular Biology, University College London, Gower St, Bloomsbury, London WC1E 6BT, UK
| | - Arun P Pandurangan
- Medical Research Council Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Ave, Trumpington, Cambridge CB2 0QH, UK
- Department of Biochemistry, Sanger Building, University of Cambridge, Cambridge, UK
| | - Catherine Rivoire
- Swiss-Prot Group, Swiss Institute of Bioinformatics, CMU, 1 rue Michel Servet, CH-1211, Geneva 4, Switzerland
| | - Christian J A Sigrist
- Swiss-Prot Group, Swiss Institute of Bioinformatics, CMU, 1 rue Michel Servet, CH-1211, Geneva 4, Switzerland
| | - Ian Sillitoe
- Department of Structural and Molecular Biology, University College London, Gower St, Bloomsbury, London WC1E 6BT, UK
| | - Narmada Thanki
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Paul D Thomas
- Division of Bioinformatics, Department of Preventive Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padua, via U. Bassi 58/b, 35131 Padua, Italy
| | - Cathy H Wu
- Protein Information Resource, Georgetown University Medical Center, Washington, DC 20007, USA
- Center for Bioinformatics and Computational Biology and Protein Information Resource, University of Delaware, Newark, DE 19711, USA
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| |
Collapse
|
41
|
Bota PM, Oliva B, Fernandez-Fuentes N. Theoretical 3D Modeling of NLRP3 Inflammasome Complex. Methods Mol Biol 2023; 2696:269-280. [PMID: 37578729 DOI: 10.1007/978-1-0716-3350-2_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
The NOD-like receptor pyrin domain containing 3 (NLRP3) is a multidomain protein that plays a key role in innate immune response. Structures of NLRP3 in different conformational states and bound to cognate partners are available. In this chapter we present an approach to model the oligomeric structure of NLRP3 by homology modeling using multiple templates, symmetry, and refinement. The overall process presented here represents advanced exercise in structural modeling that provides unique insights into the biological role and activation of NLRP3 oligomer. Finally, the same approach can be easily adapted to the rest of the members of the NLRP family.
Collapse
Affiliation(s)
- Patricia Mirela Bota
- Structural Bioinformatics Lab (GRIB-IMIM), Department of Experimental and Health Science, Universitat Pompeu Fabra, Barcelona, Catalonia, Spain
| | - Baldo Oliva
- Structural Bioinformatics Lab (GRIB-IMIM), Department of Experimental and Health Science, Universitat Pompeu Fabra, Barcelona, Catalonia, Spain.
| | - Narcis Fernandez-Fuentes
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, UK
| |
Collapse
|
42
|
Patel VK, Das A, Kumari R, Kajla S. In silico Analysis of Diverse Endo-β-1,4-glucanases Reveals Their Molecular Evolution. J EVOL BIOCHEM PHYS+ 2023. [DOI: 10.1134/s0022093023010088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]
|
43
|
Ben Boubaker R, Tiss A, Henrion D, Chabbert M. Homology Modeling in the Twilight Zone: Improved Accuracy by Sequence Space Analysis. Methods Mol Biol 2023; 2627:1-23. [PMID: 36959439 DOI: 10.1007/978-1-0716-2974-1_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/25/2023]
Abstract
The analysis of the relationship between sequence and structure similarities during the evolution of a protein family has revealed a limit of sequence divergence for which structural conservation can be confidently assumed and homology modeling is reliable. Below this limit, the twilight zone corresponds to sequence divergence for which homology modeling becomes increasingly difficult and requires specific methods. Either with conventional threading methods or with recent deep learning methods, such as AlphaFold, the challenge relies on the identification of a template that shares not only a common ancestor (homology) but also a conserved structure with the query. As both homology and structural conservation are transitive properties, mining of sequence databases followed by multidimensional scaling (MDS) of the query sequence space can reveal intermediary sequences to infer homology and structural conservation between the query and the template. Here, as a case study, we studied the plethodontid receptivity factor isoform 1 (PRF1) from Plethodon jordani, a member of a pheromone protein family present only in lungless salamanders and weakly related to cytokines of the IL6 family. A variety of conventional threading methods led to the cytokine CNTF as a template. Sequence mining, followed by phylogenetic and MDS analysis, provided missing links between PRF1 and CNTF and allowed reliable homology modeling. In addition, we compared automated models obtained from web servers to a customized model to show how modeling can be improved by expert information.
Collapse
Affiliation(s)
- Rym Ben Boubaker
- UMR CNRS 6015 - INSERM 1083, Laboratoire MITOVASC, Université d'Angers, Angers, France
| | - Asma Tiss
- UMR CNRS 6015 - INSERM 1083, Laboratoire MITOVASC, Université d'Angers, Angers, France
| | - Daniel Henrion
- UMR CNRS 6015 - INSERM 1083, Laboratoire MITOVASC, Université d'Angers, Angers, France
| | - Marie Chabbert
- UMR CNRS 6015 - INSERM 1083, Laboratoire MITOVASC, Université d'Angers, Angers, France.
| |
Collapse
|
44
|
Miller J, Zimin AV, Gordus A. Chromosome-level genome and the identification of sex chromosomes in Uloborus diversus. Gigascience 2022; 12:giad002. [PMID: 36762707 PMCID: PMC9912274 DOI: 10.1093/gigascience/giad002] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 11/18/2022] [Accepted: 01/03/2023] [Indexed: 02/11/2023] Open
Abstract
The orb web is a remarkable example of animal architecture that is observed in families of spiders that diverged over 200 million years ago. While several genomes exist for araneid orb-weavers, none exist for other orb-weaving families, hampering efforts to investigate the genetic basis of this complex behavior. Here we present a chromosome-level genome assembly for the cribellate orb-weaving spider Uloborus diversus. The assembly reinforces evidence of an ancient arachnid genome duplication and identifies complete open reading frames for every class of spidroin gene, which encode the proteins that are the key structural components of spider silks. We identified the 2 X chromosomes for U. diversus and identify candidate sex-determining loci. This chromosome-level assembly will be a valuable resource for evolutionary research into the origins of orb-weaving, spidroin evolution, chromosomal rearrangement, and chromosomal sex determination in spiders.
Collapse
Affiliation(s)
- Jeremiah Miller
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Aleksey V Zimin
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Andrew Gordus
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218, USA
- Solomon H. Snyder Department of Neuroscience, Johns Hopkins University, Baltimore, MD 21218, USA
| |
Collapse
|
45
|
Chen IMA, Chu K, Palaniappan K, Ratner A, Huang J, Huntemann M, Hajek P, Ritter S, Webb C, Wu D, Varghese N, Reddy TBK, Mukherjee S, Ovchinnikova G, Nolan M, Seshadri R, Roux S, Visel A, Woyke T, Eloe-Fadrosh E, Kyrpides N, Ivanova N. The IMG/M data management and analysis system v.7: content updates and new features. Nucleic Acids Res 2022; 51:D723-D732. [PMID: 36382399 PMCID: PMC9825475 DOI: 10.1093/nar/gkac976] [Citation(s) in RCA: 100] [Impact Index Per Article: 50.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 10/05/2022] [Accepted: 10/17/2022] [Indexed: 11/17/2022] Open
Abstract
The Integrated Microbial Genomes & Microbiomes system (IMG/M: https://img.jgi.doe.gov/m/) at the Department of Energy (DOE) Joint Genome Institute (JGI) continues to provide support for users to perform comparative analysis of isolate and single cell genomes, metagenomes, and metatranscriptomes. In addition to datasets produced by the JGI, IMG v.7 also includes datasets imported from public sources such as NCBI Genbank, SRA, and the DOE National Microbiome Data Collaborative (NMDC), or submitted by external users. In the past couple years, we have continued our effort to help the user community by improving the annotation pipeline, upgrading the contents with new reference database versions, and adding new analysis functionalities such as advanced scaffold search, Average Nucleotide Identity (ANI) for high-quality metagenome bins, new cassette search, improved gene neighborhood display, and improvements to metatranscriptome data display and analysis. We also extended the collaboration and integration efforts with other DOE-funded projects such as NMDC and DOE Biology Knowledgebase (KBase).
Collapse
Affiliation(s)
- I-Min A Chen
- To whom correspondence should be addressed. Tel: +1 510 495 8437;
| | - Ken Chu
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Krishnaveni Palaniappan
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Anna Ratner
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Jinghua Huang
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Marcel Huntemann
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Patrick Hajek
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Stephan J Ritter
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Cody Webb
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Dongying Wu
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Neha J Varghese
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - T B K Reddy
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Supratim Mukherjee
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Galina Ovchinnikova
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Matt Nolan
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Rekha Seshadri
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Simon Roux
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Axel Visel
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Tanja Woyke
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Emiley A Eloe-Fadrosh
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Natalia N Ivanova
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| |
Collapse
|
46
|
Peng L, Wang C, Tian X, Zhou L, Li K. Finding lncRNA-Protein Interactions Based on Deep Learning With Dual-Net Neural Architecture. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:3456-3468. [PMID: 34587091 DOI: 10.1109/tcbb.2021.3116232] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The identification of lncRNA-protein interactions (LPIs) is important to understand the biological functions and molecular mechanisms of lncRNAs. However, most computational models are evaluated on a unique dataset, thereby resulting in prediction bias. Furthermore, previous models have not uncovered potential proteins (or lncRNAs) interacting with a new lncRNA (or protein). Finally, the performance of these models can be improved. In this study, we develop a Deep Learning framework with Dual-net Neural architecture to find potential LPIs (LPI-DLDN). First, five LPI datasets are collected. Second, the features of lncRNAs and proteins are extracted by Pyfeat and BioTriangle, respectively. Third, these features are concatenated as a vector after dimension reduction. Finally, a deep learning model with dual-net neural architecture is designed to classify lncRNA-protein pairs. LPI-DLDN is compared with six state-of-the-art LPI prediction methods (LPI-XGBoost, LPI-HeteSim, LPI-NRLMF, PLIPCOM, LPI-CNNCP, and Capsule-LPI) under four cross validations. The results demonstrate the powerful LPI classification performance of LPI-DLDN. Case study analyses show that there may be interactions between RP11-439E19.10 and Q15717, and between RP11-196G18.22 and Q9NUL5. The novelty of LPI-DLDN remains, integrating various biological features, designing a novel deep learning-based LPI identification framework, and selecting the optimal LPI feature subset based on feature importance ranking.
Collapse
|
47
|
Gundappa MK, Peñaloza C, Regan T, Boutet I, Tanguy A, Houston RD, Bean TP, Macqueen DJ. Chromosome-level reference genome for European flat oyster ( Ostrea edulis L.). Evol Appl 2022; 15:1713-1729. [PMID: 36426132 PMCID: PMC9679249 DOI: 10.1111/eva.13460] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Revised: 07/07/2022] [Accepted: 07/13/2022] [Indexed: 11/30/2022] Open
Abstract
The European flat oyster (Ostrea edulis L.) is a bivalve naturally distributed across Europe, which was an integral part of human diets for centuries, until anthropogenic activities and disease outbreaks severely reduced wild populations. Despite a growing interest in genetic applications to support population management and aquaculture, a reference genome for this species is lacking to date. Here, we report a chromosome-level assembly and annotation for the European Flat oyster genome, generated using Oxford Nanopore, Illumina, Dovetail OmniC™ proximity ligation and RNA sequencing. A contig assembly (N50: 2.38 Mb) was scaffolded into the expected karyotype of 10 pseudochromosomes. The final assembly is 935.13 Mb, with a scaffold-N50 of 95.56 Mb, with a predicted repeat landscape dominated by unclassified elements specific to O. edulis. The assembly was verified for accuracy and completeness using multiple approaches, including a novel linkage map built with ddRAD-Seq technology, comprising 4016 SNPs from four full-sib families (eight parents and 163 F1 offspring). Annotation of the genome integrating multitissue transcriptome data, comparative protein evidence and ab-initio gene prediction identified 35,699 protein-coding genes. Chromosome-level synteny was demonstrated against multiple high-quality bivalve genome assemblies, including an O. edulis genome generated independently for a French O. edulis individual. Comparative genomics was used to characterize gene family expansions during Ostrea evolution that potentially facilitated adaptation. This new reference genome for European flat oyster will enable high-resolution genomics in support of conservation and aquaculture initiatives, and improves our understanding of bivalve genome evolution.
Collapse
Affiliation(s)
- Manu Kumar Gundappa
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesUniversity of EdinburghEdinburghUK
| | - Carolina Peñaloza
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesUniversity of EdinburghEdinburghUK
| | - Tim Regan
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesUniversity of EdinburghEdinburghUK
| | - Isabelle Boutet
- Station Biologique de RoscoffLaboratoire Adaptation et Diversité en Milieu Marin (UMR 7144 AD2M CNRS‐Sorbonne Université)RoscoffFrance
| | - Arnaud Tanguy
- Station Biologique de RoscoffLaboratoire Adaptation et Diversité en Milieu Marin (UMR 7144 AD2M CNRS‐Sorbonne Université)RoscoffFrance
| | - Ross D. Houston
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesUniversity of EdinburghEdinburghUK
| | - Tim P. Bean
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesUniversity of EdinburghEdinburghUK
| | - Daniel J. Macqueen
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesUniversity of EdinburghEdinburghUK
| |
Collapse
|
48
|
Manriquez‐Sandoval E, Fried SD. DomainMapper: Accurate domain structure annotation including those with non-contiguous topologies. Protein Sci 2022; 31:e4465. [PMID: 36208126 PMCID: PMC9601794 DOI: 10.1002/pro.4465] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 09/30/2022] [Accepted: 10/03/2022] [Indexed: 11/11/2022]
Abstract
Automated domain annotation is an important tool for structural informatics. These pipelines typically involve searching query sequences against hidden Markov model (HMM) profiles, yielding matches to profiles for various domains. However, domain annotation can be ambiguous or inaccurate when proteins contain domains with non-contiguous residue ranges, and especially when insertional domains are hosted within them. Here, we present DomainMapper, an algorithm that accurately assigns a unique domain structure annotation to a query sequence, including those with complex topologies. We validate our domain assignments using the AlphaFold database and confirm that non-contiguity is pervasive (10.74% of all domains in yeast and 4.52% in human). Using this resource, we find that certain folds have strong propensities to be non-contiguous or insertional across the Tree of Life. DomainMapper is freely available and can be ran as a single command-line function.
Collapse
Affiliation(s)
| | - Stephen D. Fried
- T. C. Jenkins Department of BiophysicsJohns Hopkins UniversityBaltimoreMDUSA
- Department of ChemistryJohns Hopkins UniversityBaltimoreMDUSA
| |
Collapse
|
49
|
De novo genome assembly and annotation of Holothuria scabra (Jaeger, 1833) from nanopore sequencing reads. Genes Genomics 2022; 44:1487-1498. [DOI: 10.1007/s13258-022-01322-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 09/29/2022] [Indexed: 11/04/2022]
|
50
|
Díaz-Muñoz C, Verce M, De Vuyst L, Weckx S. Phylogenomics of a Saccharomyces cerevisiae cocoa strain reveals adaptation to a West African fermented food population. iScience 2022; 25:105309. [PMID: 36304120 PMCID: PMC9593892 DOI: 10.1016/j.isci.2022.105309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Revised: 07/22/2022] [Accepted: 10/03/2022] [Indexed: 11/27/2022] Open
Abstract
Various yeast strains have been proposed as candidate starter cultures for cocoa fermentation, especially strains of Saccharomyces cerevisiae. In the current study, the genome of the cocoa strain S. cerevisiae IMDO 050523 was unraveled based on a combination of long- and short-read sequencing. It consisted of 16 nuclear chromosomes and a mitochondrial chromosome, which were organized in 20 contigs, with only two small gaps. A phylogenomic analysis of this genome together with another 105 S cerevisiae genomes, among which 20 from cocoa strains showed a geographical distribution of the latter, including S. cerevisiae IMDO 050523. Its genome clustered together with that of a West African fermented food population, indicating a wider adaptation to West African food niches than cocoa. Furthermore, S. cerevisiae IMDO 050523 contained genetic signatures involved in sucrose hydrolysis, pectin degradation, osmotolerance, and conserved amino acid changes in key ester-producing enzymes that could point toward specific niche adaptations.
Collapse
Affiliation(s)
- Cristian Díaz-Muñoz
- Research Group of Industrial Microbiology and Food Biotechnology, Faculty of Sciences and Bioengineering Sciences, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Marko Verce
- Research Group of Industrial Microbiology and Food Biotechnology, Faculty of Sciences and Bioengineering Sciences, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Luc De Vuyst
- Research Group of Industrial Microbiology and Food Biotechnology, Faculty of Sciences and Bioengineering Sciences, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Stefan Weckx
- Research Group of Industrial Microbiology and Food Biotechnology, Faculty of Sciences and Bioengineering Sciences, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium,Corresponding author
| |
Collapse
|