1
|
Basu Mallick S, Das S, Venkatasubramanian A, Kundu S, Datta PP. Comprehensive in silico analyses of fifty-one uncharacterized proteins from Vibrio cholerae. PLoS One 2024; 19:e0311301. [PMID: 39365770 PMCID: PMC11452002 DOI: 10.1371/journal.pone.0311301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2024] [Accepted: 09/17/2024] [Indexed: 10/06/2024] Open
Abstract
Due to the rise of multidrug-resistant strains of Vibrio cholerae and the recent cholera outbreaks in African and Asian nations, it is imperative to identify novel therapeutic targets and possible vaccine candidates. In this regard, this work primarily aims to identify and characterize new antigenic molecules using comparative RNA sequencing data and label-free proteomics data, carried out with essential GTPase cgtA knockdown and wild-type strain of V. cholerae. We identified hitherto 51 characterized proteins from high-throughput RNA-sequencing and proteomics data. This work involved the assessment of their physicochemical characteristics, subcellular localization, solubility, structures, and functional annotations. In addition, the immunoinformatic and reverse vaccinology technique was used to find new vaccine targets with high antigenicity, low allergenicity, and low toxicity profiles. Among the 51 proteins, 24 were selected based on their immunogenic profiles to identify B/T-cell epitopes. In addition, 20 prospective therapeutic targets were identified using virulence predictions and related investigations. Furthermore, two proteins, UniProt ID- Q9KRD2 and Q9KU58, with molecular weight of 92kDa and 12kDa, respectively, were chosen for cloning and expression towards in vitro biochemical characterization based on their range of expression patterns, high antigenic, low allergenic, and low toxicity properties. In conclusion, we believe that this study will reveal new facets and avenues for drug discovery and put us a step forward toward novel therapeutic interventions against the deadly disease of cholera.
Collapse
Affiliation(s)
- Sritapa Basu Mallick
- Department of Biological Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur, WB, India
| | - Sagarika Das
- Department of Biological Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur, WB, India
| | - Aravind Venkatasubramanian
- Department of Biological Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur, WB, India
| | - Sourabh Kundu
- Ramakrishna Mission and Vivekananda Educational and Research Institute, Narendrapur, Kolkata, WB, India
| | - Partha Pratim Datta
- Department of Biological Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur, WB, India
| |
Collapse
|
2
|
Pal DC, Anik TA, Rahman AA, Mahfujur Rahman SM. Identification and Functional Annotation of Hypothetical Proteins of Pan-Drug-Resistant Providencia rettgeri Strain MRSN845308 Toward Designing Antimicrobial Drug Targets. Bioinform Biol Insights 2024; 18:11779322241280580. [PMID: 39372506 PMCID: PMC11452876 DOI: 10.1177/11779322241280580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Accepted: 08/08/2024] [Indexed: 10/08/2024] Open
Abstract
Providencia rettgeri has increasingly been responsible for several infections, including urinary tract, post-burn wounds, neonatal sepsis, and others. The emergence of drug-resistant isolates of P rettgeri, accompanied by intrinsic and acquired antibiotic resistance, has exacerbated the challenge of treating such infections, necessitating the development of novel therapeutics. Hypothetical proteins (HPs) form a major portion of cellular proteins and can be targeted by these novel therapeutics. In this study, 410 HPs from a pan-drug-resistant (PDR) P rettgeri strain (MRSN845308) were functionally annotated and characterized by physicochemical properties, localization, virulence, essentiality, druggability, and functionality. Among 410 HPs, the VirulentPred 2.0 tool and VICMpred combinedly predicted 33 HPs as virulent, whereas 48 HPs were highly interacting proteins based on the STRING v12 database. BlastKOALA and eggNOG-mapper v2.1.12 predicted 13 HPs involved in several metabolic pathways like Riboflavin metabolism and Lipopolysaccharide biosynthesis. Overall, 83 HPs were selected as primary drug targets; however, only 80 remained after nonhomology searching and essentiality analysis. In addition, all were detected as novel drug targets according to DrugBank 5.1.12. Considering the potential of membrane and extracellular proteins, 29 HPs (extracellular, outer, and inner membrane) were selected based on the combined prediction from PSORTb v3.0.3, CELLO v.2.5, BUSCA, SOSUIGramN, and PSLpred. According to the prevalence of those HPs in different strains of P rettgeri sequences in National Center for Biotechnology Information Identical Protein Groups (NCBI-IPG), 5 HPs were selected as final drug targets. In addition, 5 other HPs annotated as transporter proteins were also added to the list. As no crystal structures of our targets are present, 3-dimensional structures of selected HPs were predicted by the AlphaFold Server powered by AlphaFold 3. Our findings might facilitate a better understanding of the mechanism of virulence and pathogenesis, and up-to-date annotations can make uncharacterized HPs easy to identify as targets for novel therapeutics.
Collapse
|
3
|
Mincer TJ, Bos RP, Zettler ER, Zhao S, Asbun AA, Orsi WD, Guzzetta VS, Amaral-Zettler LA. Sargasso Sea Vibrio bacteria: Underexplored potential pathovars in a perturbed habitat. WATER RESEARCH 2023; 242:120033. [PMID: 37244770 DOI: 10.1016/j.watres.2023.120033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 04/25/2023] [Accepted: 05/01/2023] [Indexed: 05/29/2023]
Abstract
We fully sequenced the genomes of 16 Vibrio cultivars isolated from eel larvae, plastic marine debris (PMD), the pelagic brown macroalga Sargassum, and seawater samples collected from the Caribbean and Sargasso Seas of the North Atlantic Ocean. Annotation and mapping of these 16 bacterial genome sequences to a PMD-derived Vibrio metagenome-assembled genome created for this study showcased vertebrate pathogen genes closely-related to cholera and non-cholera pathovars. Phenotype testing of cultivars confirmed rapid biofilm formation, hemolytic, and lipophospholytic activities, consistent with pathogenic potential. Our study illustrates that open ocean vibrios represent a heretofore undescribed group of microbes, some representing potential new species, possessing an amalgam of pathogenic and low nutrient acquisition genes, reflecting their pelagic habitat and the substrates and hosts they colonize.
Collapse
Affiliation(s)
- Tracy J Mincer
- Harbor Branch Oceanographic Institute, Florida Atlantic University, Fort Pierce, FL, USA; Department of Biology, Wilkes Honors College, Florida Atlantic University, Jupiter, FL, USA.
| | - Ryan P Bos
- Harbor Branch Oceanographic Institute, Florida Atlantic University, Fort Pierce, FL, USA
| | - Erik R Zettler
- Department of Marine Microbiology and Biogeochemistry, NIOZ Royal Netherlands Institute for Sea Research, Den Burg, Texel, the Netherlands
| | - Shiye Zhao
- Japan Agency for Marine-Earth Science and Technology, 2-15 Natsushimacho, Yokosuka 237-0061, Japan
| | - Alejandro A Asbun
- Department of Marine Microbiology and Biogeochemistry, NIOZ Royal Netherlands Institute for Sea Research, Den Burg, Texel, the Netherlands
| | - William D Orsi
- Department of Earth and Environmental Sciences, Paleontology and Geobiology,Ludwig-Maximilians-Universität München, 80333 Munich, Germany
| | | | - Linda A Amaral-Zettler
- Department of Marine Microbiology and Biogeochemistry, NIOZ Royal Netherlands Institute for Sea Research, Den Burg, Texel, the Netherlands; Department of Freshwater and Marine Ecology, Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, the Netherlands; Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA, USA.
| |
Collapse
|
4
|
Bowman JP. Genome-wide and constrained ordination-based analyses of EC code data support reclassification of the species of Massilia La Scola et al. 2000 into Telluria Bowman et al. 1993, Mokoshia gen. nov. and Zemynaea gen. nov. Int J Syst Evol Microbiol 2023; 73. [PMID: 37589187 DOI: 10.1099/ijsem.0.005991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2023] Open
Abstract
Based on genome-wide data, Massilia species belonging to the clade including Telluria mixta LMG 11547T should be entirely transferred to the genus Telluria owing to the nomenclatural priority of the type species Telluria mixta. This results in the transfer of 35 Massilia species to the genus Telluria. The presented data also supports the creation of two new genera since peripherally branching Massilia species are distinct from Telluria and other related genera. It is proposed that 13 Massilia species are transferred to Mokoshia gen. nov. with the type species designated Mokoshia eurypsychrophila comb. nov. The species Massilia arenosa is proposed to belong to the genus Zemynaea gen. nov. as the type species Zemynaea arenosa comb. nov. The genome-wide analysis was well supported by canonical ordination analysis of Enzyme Commission (EC) codes annotated from genomes via pannzer2. This new approach was performed to assess the conclusions of the genome-based data and reduce possible ambiguity in the taxonomic decision making. Cross-validation of EC code data compared within canonical plots validated the reclassifications and correctly visualized the expected genus-level taxonomic relationships. The approach is complementary to genome-wide methodology and could be used for testing sequence alignment based data across genetically related genera. In addition to the proposed broader reclassifications, invalidly described species 'Massilia antibiotica', 'Massilia aromaticivorans', 'Massilia cellulosiltytica' and 'Massilia humi' are described as Telluria antibiotica sp. nov., Telluria aromaticivorans sp. nov., Telluria cellulosilytica sp. nov. and Pseudoduganella humi sp. nov., respectively. In addition, Telluria chitinolytica is reclassified as Pseudoduganella chitinolytica comb. nov. The use of combined genome-wide and annotation descriptors compared using canonical ordination clarifies the taxonomy of Telluria and its sibling genera and provides another way to evaluate complex taxonomic data.
Collapse
Affiliation(s)
- John P Bowman
- Tasmanian Institute of Agriculture, University of Tasmania, Sandy Bay, Hobart, Tasmania, 7005, Australia
| |
Collapse
|
5
|
Vengatharajuloo V, Goh HH, Hassan M, Govender N, Sulaiman S, Afiqah-Aleng N, Harun S, Mohamed-Hussein ZA. Gene Co-Expression Network Analysis Reveals Key Regulatory Genes in Metisa plana Hormone Pathways. INSECTS 2023; 14:503. [PMID: 37367319 DOI: 10.3390/insects14060503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 05/09/2023] [Accepted: 05/16/2023] [Indexed: 06/28/2023]
Abstract
Metisa plana Walker (Lepidoptera: Psychidae) is a major oil palm pest species distributed across Southeast Asia. M. plana outbreaks are regarded as serious ongoing threats to the oil palm industry due to their ability to significantly reduce fruit yield and subsequent productivity. Currently, conventional pesticide overuses may harm non-target organisms and severely pollute the environment. This study aims to identify key regulatory genes involved in hormone pathways during the third instar larvae stage of M. plana gene co-expression network analysis. A weighted gene co-expression network analysis (WGCNA) was conducted on the M. plana transcriptomes to construct a gene co-expression network. The transcriptome datasets were obtained from different development stages of M. plana, i.e., egg, third instar larvae, pupa, and adult. The network was clustered using the DPClusO algorithm and validated using Fisher's exact test and receiver operating characteristic (ROC) analysis. The clustering analysis was performed on the network and 20 potential regulatory genes (such as MTA1-like, Nub, Grn, and Usp) were identified from ten top-most significant clusters. Pathway enrichment analysis was performed to identify hormone signalling pathways and these pathways were identified, i.e., hormone-mediated signalling, steroid hormone-mediated signalling, and intracellular steroid hormone receptor signalling as well as six regulatory genes Hnf4, Hr4, MED14, Usp, Tai, and Trr. These key regulatory genes have a potential as important targets in future upstream applications and validation studies in the development of biorational pesticides against M. plana and the RNA interference (RNAi) gene silencing method.
Collapse
Affiliation(s)
| | - Hoe-Han Goh
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
| | - Maizom Hassan
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
| | - Nisha Govender
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
| | - Suhaila Sulaiman
- FGV R&D Sdn Bhd, FGV Innovation Center, PT23417 Lengkuk Teknologi, Bandar Baru Enstek, Nilai 71760, Negeri Sembilan, Malaysia
| | - Nor Afiqah-Aleng
- Institute of Marine Biotechnology, Universiti Malaysia Terengganu, Kuala Nerus 21030, Terengganu, Malaysia
| | - Sarahani Harun
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
| | - Zeti-Azura Mohamed-Hussein
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
- Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
| |
Collapse
|
6
|
Ziegler C, Martin J, Sinner C, Morcos F. Latent generative landscapes as maps of functional diversity in protein sequence space. Nat Commun 2023; 14:2222. [PMID: 37076519 PMCID: PMC10113739 DOI: 10.1038/s41467-023-37958-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Accepted: 04/05/2023] [Indexed: 04/21/2023] Open
Abstract
Variational autoencoders are unsupervised learning models with generative capabilities, when applied to protein data, they classify sequences by phylogeny and generate de novo sequences which preserve statistical properties of protein composition. While previous studies focus on clustering and generative features, here, we evaluate the underlying latent manifold in which sequence information is embedded. To investigate properties of the latent manifold, we utilize direct coupling analysis and a Potts Hamiltonian model to construct a latent generative landscape. We showcase how this landscape captures phylogenetic groupings, functional and fitness properties of several systems including Globins, β-lactamases, ion channels, and transcription factors. We provide support on how the landscape helps us understand the effects of sequence variability observed in experimental data and provides insights on directed and natural protein evolution. We propose that combining generative properties and functional predictive power of variational autoencoders and coevolutionary analysis could be beneficial in applications for protein engineering and design.
Collapse
Affiliation(s)
- Cheyenne Ziegler
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX, 75080, USA
| | - Jonathan Martin
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX, 75080, USA
| | - Claude Sinner
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX, 75080, USA
| | - Faruck Morcos
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX, 75080, USA.
- Department of Bioengineering, University of Texas at Dallas, Richardson, TX, 75080, USA.
- Center for Systems Biology, University of Texas at Dallas, Richardson, TX, 75080, USA.
| |
Collapse
|
7
|
Systems biology's role in leveraging microalgal biomass potential: Current status and future perspectives. ALGAL RES 2022. [DOI: 10.1016/j.algal.2022.102963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
8
|
Analysis of amplification and association polymorphisms in the bovine beta-defensin 129 (BBD129) gene revealed its function in bull fertility. Sci Rep 2022; 12:19042. [PMID: 36352091 PMCID: PMC9646896 DOI: 10.1038/s41598-022-23654-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 11/03/2022] [Indexed: 11/10/2022] Open
Abstract
β-defensins are adsorbable on the sperm surface in the male reproductive tract (MRT) and enhance sperm functional characteristics. The beta-defensin 129 (DEFB129) antimicrobial peptide is involved in sperm maturation, motility, and fertilization. However, its role in bovine fertility has not been well investigated. This study examines the relationship between the bovine BBD129 gene and Bos indicus x Bos taurus bull fertility. The complete coding sequence of BBD129 mRNA was identified by RNA Ligase Mediated-Rapid Amplification of cDNA End (RLM-RACE) and Sanger sequencing methodologies. It consisted of 582 nucleotides (nts) including 5' untranslated region (UTR) (46nts) and 3'UTR (23nts). It conserves all beta-defensin-like features. The expression level of BBD129 was checked by RT-qPCR and maximal expression was detected in the corpus-epididymis region compared to other parts of MRT. Polymorphism in BBD129 was also confirmed by Sanger sequencing of 254 clones from 5 high fertile (HF) and 6 low fertile (LF) bulls at two positions, 169 T > G and 329A > G, which change the S57A and N110S in the protein sequence respectively. These two mutations give rise to four types of BBD129 haplotypes. The non-mutated TA-BBD129 (169 T/329A) haplotype was substantially more prevalent among high-fertile bulls (P < 0.005), while the double-site mutated GG-BBD129 (169 T > G/329A > G) haplotype was significantly more prevalent among low-fertile bulls (P < 0.005). The in silico analysis confirmed that the polymorphism in BBD129 results in changes in mRNA secondary structure, protein conformations, protein stability, extracellular-surface availability, post-translational modifications (O-glycosylation and phosphorylation), and affects antibacterial and immunomodulatory capabilities. In conclusion, the mRNA expression of BBD129 in the MRT indicates its region-specific dynamics in sperm maturation. BBD129 polymorphisms were identified as the deciding elements accountable for the changed proteins with impaired functionality, contributing to cross-bred bulls' poor fertility.
Collapse
|
9
|
Shawan MMAK, Jahan N, Ahamed T, Das A, Khan MA, Hossain S, Sarker SR. <i>In silico</i> subtractive genomics approach characterizes a hypothetical protein (MG_476) from <i>microplasma genitalium</i> G37. JOURNAL OF CLINICAL AND EXPERIMENTAL INVESTIGATIONS 2022. [DOI: 10.29333/jcei/12377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
|
10
|
Hakala K, Kaewphan S, Bjorne J, Mehryary F, Moen H, Tolvanen M, Salakoski T, Ginter F. Neural Network and Random Forest Models in Protein Function Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1772-1781. [PMID: 33306472 DOI: 10.1109/tcbb.2020.3044230] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Over the past decade, the demand for automated protein function prediction has increased due to the volume of newly sequenced proteins. In this paper, we address the function prediction task by developing an ensemble system automatically assigning Gene Ontology (GO) terms to the given input protein sequence. We develop an ensemble system which combines the GO predictions made by random forest (RF) and neural network (NN) classifiers. Both RF and NN models rely on features derived from BLAST sequence alignments, taxonomy and protein signature analysis tools. In addition, we report on experiments with a NN model that directly analyzes the amino acid sequence as its sole input, using a convolutional layer. The Swiss-Prot database is used as the training and evaluation data. In the CAFA3 evaluation, which relies on experimental verification of the functional predictions, our submitted ensemble model demonstrates competitive performance ranking among top-10 best-performing systems out of over 100 submitted systems. In this paper, we evaluate and further improve the CAFA3-submitted system. Our machine learning models together with the data pre-processing and feature generation tools are publicly available as an open source software at https://github.com/TurkuNLP/CAFA3.
Collapse
|
11
|
Ilgisonis EV, Pogodin PV, Kiseleva OI, Tarbeeva SN, Ponomarenko EA. Evolution of Protein Functional Annotation: Text Mining Study. J Pers Med 2022; 12:jpm12030479. [PMID: 35330478 PMCID: PMC8952229 DOI: 10.3390/jpm12030479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Revised: 03/07/2022] [Accepted: 03/08/2022] [Indexed: 11/23/2022] Open
Abstract
Within the Human Proteome Project initiative framework for creating functional annotations of uPE1 proteins, the neXt-CP50 Challenge was launched in 2018. In analogy with the missing-protein challenge, each command deciphers the functional features of the proteins in the chromosome-centric mode. However, the neXt-CP50 Challenge is more complicated than the missing-protein challenge: the approaches and methods for solving the problem are clear, but neither the concept of protein function nor specific experimental and/or bioinformatics protocols have been standardized to address it. We proposed using a retrospective analysis of the key HPP repository, the neXtProt database, to identify the most frequently used experimental and bioinformatic methods for analyzing protein functions, and the dynamics of accumulation of functional annotations. It has been shown that the dynamics of the increase in the number of proteins with known functions are greater than the progress made in the experimental confirmation of the existence of questionable proteins in the framework of the missing-protein challenge. At the same time, the functional annotation is based on the guilty-by-association postulate, according to which, based on large-scale experiments on API-MS and Y2H, proteins with unknown functions are most likely mapped through “handshakes” to biochemical processes.
Collapse
|
12
|
Reijnders MJ. Wei2GO: weighted sequence similarity-based protein function prediction. PeerJ 2022; 10:e12931. [PMID: 35186498 PMCID: PMC8855713 DOI: 10.7717/peerj.12931] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Accepted: 01/21/2022] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND Protein function prediction is an important part of bioinformatics and genomics studies. There are many different predictors available, however most of these are in the form of web-servers instead of open-source locally installable versions. Such local versions are necessary to perform large scale genomics studies due to the presence of limitations imposed by web servers such as queues, prediction speed, and updatability of databases. METHODS This paper describes Wei2GO: a weighted sequence similarity and python-based open-source protein function prediction software. It uses DIAMOND and HMMScan sequence alignment searches against the UniProtKB and Pfam databases respectively, transfers Gene Ontology terms from the reference protein to the query protein, and uses a weighing algorithm to calculate a score for the Gene Ontology annotations. RESULTS Wei2GO is compared against the Argot2 and Argot2.5 web servers, which use a similar concept, and DeepGOPlus which acts as a reference. Wei2GO shows an increase in performance according to precision and recall curves, Fmax scores, and Smin scores for biological process and molecular function ontologies. Computational time compared to Argot2 and Argot2.5 is decreased from several hours to several minutes. AVAILABILITY Wei2GO is written in Python 3, and can be found at https://gitlab.com/mreijnders/Wei2GO.
Collapse
Affiliation(s)
- Maarten J.M.F. Reijnders
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
13
|
Integrated bioinformatics based subtractive genomics approach to decipher the therapeutic function of hypothetical proteins from Salmonella typhi XDR H-58 strain. Biotechnol Lett 2022; 44:279-298. [PMID: 35037232 PMCID: PMC8761513 DOI: 10.1007/s10529-021-03219-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Accepted: 12/12/2021] [Indexed: 11/21/2022]
Abstract
Purpose The efficacy of drugs against Salmonella infection have compromised due to emerging XDR H58 strain. There is a dire need to find novel antimicrobial drug targets as well as drug candidates to cure by the XDR strain of Salmonella. It is observed that the complete genome sequence of the XDR H58 strain contains a large number of hypothetical proteins with unknown cellular and biological functions. Hence, it is indispensable to annotate these proteins functionally as well as structurally to identify novel drug targets. Methods In the current study, a comparative genomics and proteomics based approach was applied to find the novel drug targets in XDR strain while comparing the MDR and NR strains of Salmonella typhi. Results The characterization of ~ 350 hypothetical proteins were performed through determination of their physio-chemical properties, sub-cellular localization, functional annotation, and structure-based studies. As a result, only five proteins were prioritized as essential, druggable, and virulent proteins. Moreover, only one protein i.e. WP_000916613.1 was functionally annotated with high confidence and subjected to further structure-based analysis. Conclusion The current study presents a hypothetical protein from the XDR S. typhi proteome as a potential pharmacological target against which novel therapeutic candidates may be predicted. The outcome of the current study may lead to formulate a general set of pipelines for better understanding of the role of hypothetical proteins in pathogenesis of not only Salmonella but also for other pathogens. Supplementary Information The online version contains supplementary material available at 10.1007/s10529-021-03219-6.
Collapse
|
14
|
Rollano-Peñaloza OM, Mollinedo PA, Widell S, Rasmusson AG. Transcriptomic Analysis of Quinoa Reveals a Group of Germin-Like Proteins Induced by Trichoderma. FRONTIERS IN FUNGAL BIOLOGY 2021; 2:768648. [PMID: 37744129 PMCID: PMC10512214 DOI: 10.3389/ffunb.2021.768648] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 11/04/2021] [Indexed: 09/26/2023]
Abstract
Symbiotic strains of fungi in the genus Trichoderma affect growth and pathogen resistance of many plant species, but the interaction is not known in molecular detail. Here we describe the transcriptomic response of two cultivars of the crop Chenopodium quinoa to axenic co-cultivation with Trichoderma harzianum BOL-12 and Trichoderma afroharzianum T22. The response of C. quinoa roots to BOL-12 and T22 in the early phases of interaction was studied by RNA sequencing and RT-qPCR verification. Interaction with the two fungal strains induced partially overlapping gene expression responses. Comparing the two plant genotypes, a broad spectrum of putative quinoa defense genes were found activated in the cultivar Kurmi but not in the Real cultivar. In cultivar Kurmi, relatively small effects were observed for classical pathogen response pathways but instead a C. quinoa-specific clade of germin-like genes were activated. Germin-like genes were found to be more rapidly induced in cultivar Kurmi as compared to Real. The same germin-like genes were found to also be upregulated systemically in the leaves. No strong correlation was observed between any of the known hormone-mediated defense response pathways and any of the quinoa-Trichoderma interactions. The differences in responses are relevant for the capabilities of applying Trichoderma agents for crop protection of different cultivars of C. quinoa.
Collapse
Affiliation(s)
- Oscar M. Rollano-Peñaloza
- Instituto de Investigaciones Quimicas, Universidad Mayor de San Andrés, La Paz, Bolivia
- Department of Biology, Lund University, Lund, Sweden
| | - Patricia A. Mollinedo
- Instituto de Investigaciones Quimicas, Universidad Mayor de San Andrés, La Paz, Bolivia
| | | | | |
Collapse
|
15
|
Deragon E, Schuler M, Aiese Cigliano R, Dellero Y, Si Larbi G, Falconet D, Jouhet J, Maréchal E, Michaud M, Amato A, Rébeillé F. An Oil Hyper-Accumulator Mutant Highlights Peroxisomal ATP Import as a Regulatory Step for Fatty Acid Metabolism in Aurantiochytrium limacinum. Cells 2021; 10:2680. [PMID: 34685660 PMCID: PMC8534400 DOI: 10.3390/cells10102680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 09/24/2021] [Accepted: 09/24/2021] [Indexed: 11/17/2022] Open
Abstract
Thraustochytrids are marine protists that naturally accumulate triacylglycerol with long chains of polyunsaturated fatty acids, such as ω3-docosahexaenoic acid (DHA). They represent a sustainable response to the increasing demand for these "essential" fatty acids (FAs). Following an attempt to transform a strain of Aurantiochytrium limacinum, we serendipitously isolated a clone that did not incorporate any recombinant DNA but contained two to three times more DHA than the original strain. Metabolic analyses indicated a deficit in FA catabolism. However, whole transcriptome analysis did not show down-regulation of genes involved in FA catabolism. Genome sequencing revealed extensive DNA deletion in one allele encoding a putative peroxisomal adenylate transporter. Phylogenetic analyses and yeast complementation experiments confirmed the gene as a peroxisomal adenylate nucleotide transporter (AlANT1), homologous to yeast ScANT1 and plant peroxisomal adenylate nucleotide carrier AtPNC genes. In yeast and plants, a deletion of the peroxisomal adenylate transporter inhibits FA breakdown and induces FA accumulation, a phenotype similar to that described here. In response to this metabolic event, several compensatory mechanisms were observed. In particular, genes involved in FA biosynthesis were upregulated, also contributing to the high FA accumulation. These results support AlANT1 as a promising target for enhancing DHA production in Thraustochytrids.
Collapse
Affiliation(s)
- Etienne Deragon
- Laboratoire de Physiologie Cellulaire et Végétale, Université Grenoble Alpes, CNRS, CEA, INRAE, CEDEX 9, 38054 Grenoble, France; (E.D.); (M.S.); (Y.D.); (G.S.L.); (D.F.); (J.J.); (E.M.); (M.M.)
| | - Martin Schuler
- Laboratoire de Physiologie Cellulaire et Végétale, Université Grenoble Alpes, CNRS, CEA, INRAE, CEDEX 9, 38054 Grenoble, France; (E.D.); (M.S.); (Y.D.); (G.S.L.); (D.F.); (J.J.); (E.M.); (M.M.)
| | | | - Younès Dellero
- Laboratoire de Physiologie Cellulaire et Végétale, Université Grenoble Alpes, CNRS, CEA, INRAE, CEDEX 9, 38054 Grenoble, France; (E.D.); (M.S.); (Y.D.); (G.S.L.); (D.F.); (J.J.); (E.M.); (M.M.)
- Institute of Genetic, Environment and Plant Protection, UMR 1349 IGEPP INRA, Agrocampus Ouest Rennes, Université Rennes 1, Domaine de la Motte BP35327, CEDEX, 35653 Le Rheu, France
| | - Gregory Si Larbi
- Laboratoire de Physiologie Cellulaire et Végétale, Université Grenoble Alpes, CNRS, CEA, INRAE, CEDEX 9, 38054 Grenoble, France; (E.D.); (M.S.); (Y.D.); (G.S.L.); (D.F.); (J.J.); (E.M.); (M.M.)
| | - Denis Falconet
- Laboratoire de Physiologie Cellulaire et Végétale, Université Grenoble Alpes, CNRS, CEA, INRAE, CEDEX 9, 38054 Grenoble, France; (E.D.); (M.S.); (Y.D.); (G.S.L.); (D.F.); (J.J.); (E.M.); (M.M.)
| | - Juliette Jouhet
- Laboratoire de Physiologie Cellulaire et Végétale, Université Grenoble Alpes, CNRS, CEA, INRAE, CEDEX 9, 38054 Grenoble, France; (E.D.); (M.S.); (Y.D.); (G.S.L.); (D.F.); (J.J.); (E.M.); (M.M.)
| | - Eric Maréchal
- Laboratoire de Physiologie Cellulaire et Végétale, Université Grenoble Alpes, CNRS, CEA, INRAE, CEDEX 9, 38054 Grenoble, France; (E.D.); (M.S.); (Y.D.); (G.S.L.); (D.F.); (J.J.); (E.M.); (M.M.)
| | - Morgane Michaud
- Laboratoire de Physiologie Cellulaire et Végétale, Université Grenoble Alpes, CNRS, CEA, INRAE, CEDEX 9, 38054 Grenoble, France; (E.D.); (M.S.); (Y.D.); (G.S.L.); (D.F.); (J.J.); (E.M.); (M.M.)
| | - Alberto Amato
- Laboratoire de Physiologie Cellulaire et Végétale, Université Grenoble Alpes, CNRS, CEA, INRAE, CEDEX 9, 38054 Grenoble, France; (E.D.); (M.S.); (Y.D.); (G.S.L.); (D.F.); (J.J.); (E.M.); (M.M.)
| | - Fabrice Rébeillé
- Laboratoire de Physiologie Cellulaire et Végétale, Université Grenoble Alpes, CNRS, CEA, INRAE, CEDEX 9, 38054 Grenoble, France; (E.D.); (M.S.); (Y.D.); (G.S.L.); (D.F.); (J.J.); (E.M.); (M.M.)
| |
Collapse
|
16
|
Savini G, Scolari F, Ometto L, Rota-Stabelli O, Carraretto D, Gomulski LM, Gasperi G, Abd-Alla AMM, Aksoy S, Attardo GM, Malacrida AR. Viviparity and habitat restrictions may influence the evolution of male reproductive genes in tsetse fly (Glossina) species. BMC Biol 2021; 19:211. [PMID: 34556101 PMCID: PMC8461966 DOI: 10.1186/s12915-021-01148-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Accepted: 09/06/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Glossina species (tsetse flies), the sole vectors of African trypanosomes, maintained along their long evolutionary history a unique reproductive strategy, adenotrophic viviparity. Viviparity reduces their reproductive rate and, as such, imposes strong selective pressures on males for reproductive success. These species live in sub-Saharan Africa, where the distributions of the main sub-genera Fusca, Morsitans, and Palpalis are restricted to forest, savannah, and riverine habitats, respectively. Here we aim at identifying the evolutionary patterns of the male reproductive genes of six species belonging to these three main sub-genera. We then interpreted the different patterns we found across the species in the light of viviparity and the specific habitat restrictions, which are known to shape reproductive behavior. RESULTS We used a comparative genomic approach to build consensus evolutionary trees that portray the selective pressure acting on the male reproductive genes in these lineages. Such trees reflect the long and divergent demographic history that led to an allopatric distribution of the Fusca, Morsitans, and Palpalis species groups. A dataset of over 1700 male reproductive genes remained conserved over the long evolutionary time scale (estimated at 26.7 million years) across the genomes of the six species. We suggest that this conservation may result from strong functional selective pressure on the male imposed by viviparity. It is noteworthy that more than half of these conserved genes are novel sequences that are unique to the Glossina genus and are candidates for selection in the different lineages. CONCLUSIONS Tsetse flies represent a model to interpret the evolution and differentiation of male reproductive biology under different, but complementary, perspectives. In the light of viviparity, we must take into account that these genes are constrained by a post-fertilization arena for genomic conflicts created by viviparity and absent in ovipositing species. This constraint implies a continuous antagonistic co-evolution between the parental genomes, thus accelerating inter-population post-zygotic isolation and, ultimately, favoring speciation. Ecological restrictions that affect reproductive behavior may further shape such antagonistic co-evolution.
Collapse
Affiliation(s)
- Grazia Savini
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Francesca Scolari
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
- Institute of Molecular Genetics IGM-CNR "Luigi Luca Cavalli-Sforza", Pavia, Italy
| | - Lino Ometto
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Omar Rota-Stabelli
- Research and Innovation Centre, Fondazione Edmund Mach (FEM), San Michele all'Adige, Italy
- Center Agriculture Food Environment (C3A), University of Trento, Trento, Italy
| | - Davide Carraretto
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Ludvik M Gomulski
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Giuliano Gasperi
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Adly M M Abd-Alla
- Insect Pest Control Laboratory, Joint FAO/IAEA Programme of Nuclear Techniques in Food & Agriculture, Vienna, Vienna, Austria.
| | - Serap Aksoy
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT, USA
| | - Geoffrey M Attardo
- Department of Entomology and Nematology, University of California, Davis, Davis, CA, USA
| | - Anna R Malacrida
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy.
| |
Collapse
|
17
|
Chahed A, Lazazzara V, Moretto M, Nesler A, Corneo PE, Barka EA, Pertot I, Puopolo G, Perazzolli M. The Differential Growth Inhibition of Phytophthora spp. Caused by the Rare Sugar Tagatose Is Associated With Species-Specific Metabolic and Transcriptional Changes. Front Microbiol 2021; 12:711545. [PMID: 34305881 PMCID: PMC8292896 DOI: 10.3389/fmicb.2021.711545] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Accepted: 06/16/2021] [Indexed: 12/03/2022] Open
Abstract
Tagatose is a rare sugar with no negative impacts on human health and selective inhibitory effects on plant-associated microorganisms. Tagatose inhibited mycelial growth and negatively affected mitochondrial processes in Phytophthora infestans, but not in Phytophthora cinnamomi. The aim of this study was to elucidate metabolic changes and transcriptional reprogramming activated by P. infestans and P. cinnamomi in response to tagatose, in order to clarify the differential inhibitory mechanisms of tagatose and the species-specific reactions to this rare sugar. Phytophthora infestans and P. cinnamomi activated distinct metabolic and transcriptional changes in response to the rare sugar. Tagatose negatively affected mycelial growth, sugar content and amino acid content in P. infestans with a severe transcriptional reprogramming that included the downregulation of genes involved in transport, sugar metabolism, signal transduction, and growth-related process. Conversely, tagatose incubation upregulated genes related to transport, energy metabolism, sugar metabolism and oxidative stress in P. cinnamomi with no negative effects on mycelial growth, sugar content and amino acid content. Differential inhibitory effects of tagatose on Phytophthora spp. were associated with an attempted reaction of P. infestans, which was not sufficient to attenuate the negative impacts of the rare sugar and with an efficient response of P. cinnamomi with the reprogramming of multiple metabolic processes, such as genes related to glucose transport, pentose metabolism, tricarboxylic acid cycle, reactive oxygen species detoxification, mitochondrial and alternative respiration processes. Knowledge on the differential response of Phytophthora spp. to tagatose represent a step forward in the understanding functional roles of rare sugars.
Collapse
Affiliation(s)
- Abdessalem Chahed
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Italy.,Bi-PA nv, Londerzeel, Belgium.,Department of Induced Resistance and Plant Bioprotection, University of Reims, Reims, France
| | - Valentina Lazazzara
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Italy
| | - Marco Moretto
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Italy
| | - Andrea Nesler
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Italy.,Bi-PA nv, Londerzeel, Belgium
| | - Paola Elisa Corneo
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Italy.,Center Agriculture Food Environment (C3A), University of Trento, San Michele all'Adige, Italy
| | - Essaid Ait Barka
- Department of Induced Resistance and Plant Bioprotection, University of Reims, Reims, France
| | - Ilaria Pertot
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Italy.,Center Agriculture Food Environment (C3A), University of Trento, San Michele all'Adige, Italy
| | - Gerardo Puopolo
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Italy.,Center Agriculture Food Environment (C3A), University of Trento, San Michele all'Adige, Italy
| | - Michele Perazzolli
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Italy.,Center Agriculture Food Environment (C3A), University of Trento, San Michele all'Adige, Italy
| |
Collapse
|
18
|
Wimalanathan K, Lawrence-Dill CJ. Gene Ontology Meta Annotator for Plants (GOMAP). PLANT METHODS 2021; 17:54. [PMID: 34034755 PMCID: PMC8146647 DOI: 10.1186/s13007-021-00754-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 05/10/2021] [Indexed: 05/03/2023]
Abstract
Annotating gene structures and functions to genome assemblies is necessary to make assembly resources useful for biological inference. Gene Ontology (GO) term assignment is the most used functional annotation system, and new methods for GO assignment have improved the quality of GO-based function predictions. The Gene Ontology Meta Annotator for Plants (GOMAP) is an optimized, high-throughput, and reproducible pipeline for genome-scale GO annotation of plants. We containerized GOMAP to increase portability and reproducibility and also optimized its performance for HPC environments. Here we report on the pipeline's availability and performance for annotating large, repetitive plant genomes and describe how GOMAP was used to annotate multiple maize genomes as a test case. Assessment shows that GOMAP expands and improves the number of genes annotated and annotations assigned per gene as well as the quality (based on [Formula: see text]) of GO assignments in maize. GOMAP has been deployed to annotate other species including wheat, rice, barley, cotton, and soy. Instructions and access to the GOMAP Singularity container are freely available online at https://bioinformapping.com/gomap/ . A list of annotated genomes and links to data is maintained at https://dill-picl.org/projects/gomap/ .
Collapse
Affiliation(s)
- Kokulapalan Wimalanathan
- Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA, 50010, USA.
- Department of Genetics Development and Cell Biology, Iowa State University, Ames, IA, 50010, USA.
- Greenlight Biosciences Inc., Medford, MA, 02155, USA.
| | - Carolyn J Lawrence-Dill
- Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA, 50010, USA.
- Department of Genetics Development and Cell Biology, Iowa State University, Ames, IA, 50010, USA.
- Department of Agronomy, Iowa State University, Ames, IA, 50010, USA.
| |
Collapse
|
19
|
Moro G, Masseroli M. Gene function finding through cross-organism ensemble learning. BioData Min 2021; 14:14. [PMID: 33579334 PMCID: PMC7879670 DOI: 10.1186/s13040-021-00239-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Accepted: 01/10/2021] [Indexed: 11/12/2022] Open
Abstract
Background Structured biological information about genes and proteins is a valuable resource to improve discovery and understanding of complex biological processes via machine learning algorithms. Gene Ontology (GO) controlled annotations describe, in a structured form, features and functions of genes and proteins of many organisms. However, such valuable annotations are not always reliable and sometimes are incomplete, especially for rarely studied organisms. Here, we present GeFF (Gene Function Finder), a novel cross-organism ensemble learning method able to reliably predict new GO annotations of a target organism from GO annotations of another source organism evolutionarily related and better studied. Results Using a supervised method, GeFF predicts unknown annotations from random perturbations of existing annotations. The perturbation consists in randomly deleting a fraction of known annotations in order to produce a reduced annotation set. The key idea is to train a supervised machine learning algorithm with the reduced annotation set to predict, namely to rebuild, the original annotations. The resulting prediction model, in addition to accurately rebuilding the original known annotations for an organism from their perturbed version, also effectively predicts new unknown annotations for the organism. Moreover, the prediction model is also able to discover new unknown annotations in different target organisms without retraining.We combined our novel method with different ensemble learning approaches and compared them to each other and to an equivalent single model technique. We tested the method with five different organisms using their GO annotations: Homo sapiens, Mus musculus, Bos taurus, Gallus gallus and Dictyostelium discoideum. The outcomes demonstrate the effectiveness of the cross-organism ensemble approach, which can be customized with a trade-off between the desired number of predicted new annotations and their precision.A Web application to browse both input annotations used and predicted ones, choosing the ensemble prediction method to use, is publicly available at http://tiny.cc/geff/. Conclusions Our novel cross-organism ensemble learning method provides reliable predicted novel gene annotations, i.e., functions, ranked according to an associated likelihood value. They are very valuable both to speed the annotation curation, focusing it on the prioritized new annotations predicted, and to complement known annotations available.
Collapse
Affiliation(s)
- Gianluca Moro
- DISI - University of Bologna, Via dell'Università, Cesena (FC), Italy.
| | - Marco Masseroli
- DEIB, Politecnico di Milano, Piazza L. Da Vinci 32, Milan, 20133, Italy
| |
Collapse
|
20
|
Ke HM, Lee HH, Lin CYI, Liu YC, Lu MR, Hsieh JWA, Chang CC, Wu PH, Lu MJ, Li JY, Shang G, Lu RJH, Nagy LG, Chen PY, Kao HW, Tsai IJ. Mycena genomes resolve the evolution of fungal bioluminescence. Proc Natl Acad Sci U S A 2020; 117:31267-31277. [PMID: 33229585 PMCID: PMC7733832 DOI: 10.1073/pnas.2010761117] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Mushroom-forming fungi in the order Agaricales represent an independent origin of bioluminescence in the tree of life; yet the diversity, evolutionary history, and timing of the origin of fungal luciferases remain elusive. We sequenced the genomes and transcriptomes of five bonnet mushroom species (Mycena spp.), a diverse lineage comprising the majority of bioluminescent fungi. Two species with haploid genome assemblies ∼150 Mb are among the largest in Agaricales, and we found that a variety of repeats between Mycena species were differentially mediated by DNA methylation. We show that bioluminescence evolved in the last common ancestor of mycenoid and the marasmioid clade of Agaricales and was maintained through at least 160 million years of evolution. Analyses of synteny across genomes of bioluminescent species resolved how the luciferase cluster was derived by duplication and translocation, frequently rearranged and lost in most Mycena species, but conserved in the Armillaria lineage. Luciferase cluster members were coexpressed across developmental stages, with the highest expression in fruiting body caps and stipes, suggesting fruiting-related adaptive functions. Our results contribute to understanding a de novo origin of bioluminescence and the corresponding gene cluster in a diverse group of enigmatic fungal species.
Collapse
Affiliation(s)
- Huei-Mien Ke
- Biodiversity Research Center, Academia Sinica, Taipei 115, Taiwan;
| | - Hsin-Han Lee
- Biodiversity Research Center, Academia Sinica, Taipei 115, Taiwan
| | - Chan-Yi Ivy Lin
- Biodiversity Research Center, Academia Sinica, Taipei 115, Taiwan
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06520
| | - Yu-Ching Liu
- Biodiversity Research Center, Academia Sinica, Taipei 115, Taiwan
| | - Min R Lu
- Biodiversity Research Center, Academia Sinica, Taipei 115, Taiwan
- Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei 106, Taiwan
| | - Jo-Wei Allison Hsieh
- Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei 106, Taiwan
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei 115, Taiwan
| | - Chiung-Chih Chang
- Biodiversity Research Center, Academia Sinica, Taipei 115, Taiwan
- Department of Life Sciences, National Chung Hsing University, Taichung 402, Taiwan
| | - Pei-Hsuan Wu
- Master Program for Plant Medicine and Good Agricultural Practice, National Chung Hsing University, Taichung 402, Taiwan
| | - Meiyeh Jade Lu
- Biodiversity Research Center, Academia Sinica, Taipei 115, Taiwan
| | - Jeng-Yi Li
- Biodiversity Research Center, Academia Sinica, Taipei 115, Taiwan
| | - Gaus Shang
- Department of Biotechnology, Ming Chuan University, Taoyuan 333, Taiwan
| | - Rita Jui-Hsien Lu
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei 115, Taiwan
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63110
| | - László G Nagy
- Synthetic and Systems Biology Unit, Biological Research Centre, 6726 Szeged, Hungary
- Department of Plant Anatomy, Institute of Biology, Eötvös Loránd University, Budapest, 1117 Hungary
| | - Pao-Yang Chen
- Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei 106, Taiwan
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei 115, Taiwan
| | - Hsiao-Wei Kao
- Department of Life Sciences, National Chung Hsing University, Taichung 402, Taiwan
| | - Isheng Jason Tsai
- Biodiversity Research Center, Academia Sinica, Taipei 115, Taiwan;
- Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei 106, Taiwan
| |
Collapse
|
21
|
Structural and functional annotation of PR/SET Domain (PRDM) protein family: In-silico study elaborating role of PRDM12 mutation in congenital insensitivity to pain. Comput Biol Chem 2020; 89:107382. [PMID: 33010785 DOI: 10.1016/j.compbiolchem.2020.107382] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Revised: 08/11/2020] [Accepted: 09/18/2020] [Indexed: 11/22/2022]
Abstract
Congenital insensitivity to pain (CIP), classified as a type of hereditary sensory and autonomic neuropathies, is a rare disease in which the affected individuals fail to perceive sensation of pain. One of the PR/SET Domain Proteins, PRDM12, has been identified in recent past as a candidate gene for congenital insensitivity to pain. In the present study, we performed whole exome sequencing in a Pakistani family with CIP phenotype to ascertain the causative mutation. We identified a previously described alanine repeat duplication in PRDM12 (Ala353_Ala359dup) in this family. After this, we performed structural annotations for PR/SET Domain (PRDM) containing protein family to prognosticate the potential hypothetical structure of PRDM proteins with physical and chemical parameters. Out of nineteen members of this family, four members (PRDM5, PRDM8, PRDM12 and PRDM13) were specially focused because of their role in neurological disorders. Predictions about structure and interactions of these proteins revealed novel interacting molecules and pathways. Detailed in silico analysis of PRDM12 was performed to elaborate importance of its domain structure in interaction with other proteins and its role in pain insensitivity phenotype. These results have substantially enhanced our understanding regarding the etiology of congenital pain insensitivity and would stimulate further research on therapy and prevention.
Collapse
|
22
|
Liu W. SemanticGO: a tool for gene functional similarity analysis in Arabidopsis thaliana and rice. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2020; 297:110527. [PMID: 32563466 DOI: 10.1016/j.plantsci.2020.110527] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2020] [Revised: 05/07/2020] [Accepted: 05/08/2020] [Indexed: 06/11/2023]
Abstract
Gene or pathway functional similarities are important information for researchers. However, these similarities are often described sparsely and qualitatively. The latent semantic analysis of Arabidopsis thaliana (Arabidopsis) Gene Ontology (GO) data produced a set of 200-dimension feature vectors for each gene. Pathways were represented by summing the vectors of the pathway member genes. Thus, the similarities between genes and pathways were assessed. Additionally, the gene feature vectors were correlated with external gene data, including gene expression and gene network connectivity, to elucidate the associated functions. The gene feature vectors were decoded, and their applications were demonstrated. A simple online tool, SemanticGO (http://bioinformatics.fafu.edu.cn/semanticGO/), is herein provided to enable researchers to explore the similarities between genes and pathways in both Arabidopsis and rice.
Collapse
Affiliation(s)
- Wei Liu
- Department of Bioinformatics, School of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou 350002, China.
| |
Collapse
|
23
|
Morabito C, Aiese Cigliano R, Maréchal E, Rébeillé F, Amato A. Illumina and PacBio DNA sequencing data, de novo assembly and annotation of the genome of Aurantiochytrium limacinum strain CCAP_4062/1. Data Brief 2020; 31:105729. [PMID: 32490088 PMCID: PMC7262427 DOI: 10.1016/j.dib.2020.105729] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 05/07/2020] [Accepted: 05/13/2020] [Indexed: 11/25/2022] Open
Abstract
The complete genome of the thraustochytrid Aurantiochytrium limacinum strain CCAP_4062/1 was sequenced using both Illumina Novaseq 6000 and third generation sequencing technology PacBio RSII in order to obtain trustworthy assembly and annotation. The reads from both platforms were combined at multiple levels in order to obtain a reliable assembly, then compared to the A. limacinum ATCCⓇ MYA1381™ reference genome. The final assembly was annotated with the help of strain CCAP_4062/1 RNAseq data. A. limacinum strain CCAP_4062/1 is an industrial strain used for the production of very long chain polyunsaturated fatty acids, like the docosahexaenoic acid that is an essential fatty acid synthesised only at very low pace in humans and vertebrates . Thraustochytrids in general and Aurantiochytrium more specifically, are used for carotenoid and squalene production as well. Beside their biotechnological interest, thraustochytrids play a crucial role in both inshore and oceanic basins ecosystems. Genome sequences will foster biotechnological as well as ecological studies.
Collapse
Affiliation(s)
- Christian Morabito
- Laboratoire de Physiologie Cellulaire Végétale, Université Grenoble Alpes, CEA, CNRS, INRAE, IRIG-LPCV, 38054 Grenoble Cedex 9, France
| | | | - Eric Maréchal
- Laboratoire de Physiologie Cellulaire Végétale, Université Grenoble Alpes, CEA, CNRS, INRAE, IRIG-LPCV, 38054 Grenoble Cedex 9, France
| | - Fabrice Rébeillé
- Laboratoire de Physiologie Cellulaire Végétale, Université Grenoble Alpes, CEA, CNRS, INRAE, IRIG-LPCV, 38054 Grenoble Cedex 9, France
| | - Alberto Amato
- Laboratoire de Physiologie Cellulaire Végétale, Université Grenoble Alpes, CEA, CNRS, INRAE, IRIG-LPCV, 38054 Grenoble Cedex 9, France
| |
Collapse
|
24
|
Genomic Analysis of Intrinsically Disordered Proteins in the Genus Camelus. Int J Mol Sci 2020; 21:ijms21114010. [PMID: 32503351 PMCID: PMC7312968 DOI: 10.3390/ijms21114010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Revised: 05/14/2020] [Accepted: 05/18/2020] [Indexed: 12/11/2022] Open
Abstract
Intrinsically disordered proteins/regions (IDPs/IDRs) fail to fold completely into 3D structures, but have major roles in determining protein function. While natively disordered proteins/regions have been found to fulfill a wide variety of primary cellular roles, the functions of many disordered proteins in numerous species remain to be uncovered. Here, we perform the first large-scale study of IDPs/IDRs in the genus Camelus, one of the most important mammalians in Asia and North Africa, in order to explore the biological roles of these proteins. The study includes the prediction of disordered proteins/regions in Camelus species and in humans using multiple state-of-the-art prediction tools. Additionally, we provide a comparative analysis of Camelus and Homo sapiens IDPs/IDRs for the sake of highlighting the distinctive use of disorder in each genus. Our findings indicate that the human proteome is more disordered than the Camelus proteome. Gene Ontology analysis also revealed that Camelus IDPs are enriched in glutathione catabolism and lactose biosynthesis.
Collapse
|
25
|
Clues of in vivo nuclear gene regulation by mitochondrial short non-coding RNAs. Sci Rep 2020; 10:8219. [PMID: 32427953 PMCID: PMC7237437 DOI: 10.1038/s41598-020-65084-z] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2019] [Accepted: 04/22/2020] [Indexed: 12/15/2022] Open
Abstract
Gene expression involves multiple processes, from transcription to translation to the mature, functional peptide, and it is regulated at multiple levels. Small RNA molecules are known to bind RNA messengers affecting their fate in the cytoplasm (a process generically termed ‘RNA interference’). Such small regulatory RNAs are well-known to be originated from the nuclear genome, while the role of mitochondrial genome in RNA interference was largely overlooked. However, evidence is growing that mitochondrial DNA does provide the cell a source of interfering RNAs. Small mitochondrial highly transcribed RNAs (smithRNAs) have been proposed to be transcribed from the mitochondrion and predicted to regulate nuclear genes. Here, for the first time, we show in vivo clues of the activity of two smithRNAs in the Manila clam, Ruditapes philippinarum. Moreover, we show that smithRNAs are present and can be annotated in representatives of the three main bilaterian lineages; in some cases, they were already described and assigned to a small RNA category (e.g., piRNAs) given their biogenesis, while in other cases their biogenesis remains unclear. If mitochondria may affect nuclear gene expression through RNA interference, this opens a plethora of new possibilities for them to interact with the nucleus and makes metazoan mitochondrial DNA a much more complex genome than previously thought.
Collapse
|
26
|
Makrodimitris S, van Ham RCHJ, Reinders MJT. Improving protein function prediction using protein sequence and GO-term similarities. Bioinformatics 2020; 35:1116-1124. [PMID: 30169569 PMCID: PMC6449755 DOI: 10.1093/bioinformatics/bty751] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2017] [Revised: 07/04/2018] [Accepted: 08/28/2018] [Indexed: 12/26/2022] Open
Abstract
MOTIVATION Most automatic functional annotation methods assign Gene Ontology (GO) terms to proteins based on annotations of highly similar proteins. We advocate that proteins that are less similar are still informative. Also, despite their simplicity and structure, GO terms seem to be hard for computers to learn, in particular the Biological Process ontology, which has the most terms (>29 000). We propose to use Label-Space Dimensionality Reduction (LSDR) techniques to exploit the redundancy of GO terms and transform them into a more compact latent representation that is easier to predict. RESULTS We compare proteins using a sequence similarity profile (SSP) to a set of annotated training proteins. We introduce two new LSDR methods, one based on the structure of the GO, and one based on semantic similarity of terms. We show that these LSDR methods, as well as three existing ones, improve the Critical Assessment of Functional Annotation performance of several function prediction algorithms. Cross-validation experiments on Arabidopsis thaliana proteins pinpoint the superiority of our GO-aware LSDR over generic LSDR. Our experiments on A.thaliana proteins show that the SSP representation in combination with a kNN classifier outperforms state-of-the-art and baseline methods in terms of cross-validated F-measure. AVAILABILITY AND IMPLEMENTATION Source code for the experiments is available at https://github.com/stamakro/SSP-LSDR. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Stavros Makrodimitris
- Department of Intelligent Systems, Delft Bioinformatics Lab, Delft University of Technology, Delft, The Netherlands.,Department of Bioinformatics, Keygene N.V., Wageningen, The Netherlands
| | - Roeland C H J van Ham
- Department of Intelligent Systems, Delft Bioinformatics Lab, Delft University of Technology, Delft, The Netherlands.,Department of Bioinformatics, Keygene N.V., Wageningen, The Netherlands
| | - Marcel J T Reinders
- Department of Intelligent Systems, Delft Bioinformatics Lab, Delft University of Technology, Delft, The Netherlands
| |
Collapse
|
27
|
Hart AJ, Ginzburg S, Xu MS, Fisher CR, Rahmatpour N, Mitton JB, Paul R, Wegrzyn JL. EnTAP: Bringing faster and smarter functional annotation to non-model eukaryotic transcriptomes. Mol Ecol Resour 2019; 20:591-604. [PMID: 31628884 DOI: 10.1111/1755-0998.13106] [Citation(s) in RCA: 97] [Impact Index Per Article: 19.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2018] [Revised: 09/18/2019] [Accepted: 09/24/2019] [Indexed: 11/28/2022]
Abstract
EnTAP (Eukaryotic Non-Model Transcriptome Annotation Pipeline) was designed to improve the accuracy, speed, and flexibility of functional gene annotation for de novo assembled transcriptomes in non-model eukaryotes. This software package addresses the fragmentation and related assembly issues that result in inflated transcript estimates and poor annotation rates of protein-coding transcripts. Following filters applied through assessment of true expression and frame selection, open-source tools are leveraged to functionally annotate the reduced set of translated proteins. Downstream features include fast similarity search across five repositories, protein domain assignment, orthologous gene family assessment, and Gene Ontology (GO) term assignment. The final annotation integrates across multiple databases and selects an optimal assignment from a combination of weighted metrics describing similarity search score, taxonomic relationship, and informativeness. Researchers have the option to include additional filters to identify and remove contaminants, identify associated pathways, and prepare the transcripts for enrichment analysis. This fully featured pipeline is easy to install, configure, and runs significantly faster than comparable annotation packages. EnTAP is optimized to generate extensive functional information for the gene space of organisms with limited or poorly characterized genomic resources.
Collapse
Affiliation(s)
- Alexander J Hart
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | - Samuel Ginzburg
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | - Muyang Sam Xu
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | - Cera R Fisher
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | - Nasim Rahmatpour
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | - Jeffry B Mitton
- Department of Ecology and Evolutionary Biology, University of Colorado Boulder, Boulder, CO, USA
| | - Robin Paul
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | - Jill L Wegrzyn
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| |
Collapse
|
28
|
Affiliation(s)
- Stanislav Mazurenko
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
| | - Zbynek Prokop
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
- International Centre for Clinical Research, St. Ann’s Hospital, 602 00 Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
- International Centre for Clinical Research, St. Ann’s Hospital, 602 00 Brno, Czech Republic
| |
Collapse
|
29
|
Vicente R, Bolger AM, Martínez-Carrasco R, Pérez P, Gutiérrez E, Usadel B, Morcuende R. De Novo Transcriptome Analysis of Durum Wheat Flag Leaves Provides New Insights Into the Regulatory Response to Elevated CO 2 and High Temperature. FRONTIERS IN PLANT SCIENCE 2019; 10:1605. [PMID: 31921252 PMCID: PMC6915051 DOI: 10.3389/fpls.2019.01605] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 11/14/2019] [Indexed: 05/08/2023]
Abstract
Global warming is becoming a significant problem for food security, particularly in the Mediterranean basin. The use of molecular techniques to study gene-level responses to environmental changes in non-model organisms is increasing and may help to improve the mechanistic understanding of durum wheat response to elevated CO2 and high temperature. With this purpose, we performed transcriptome RNA sequencing (RNA-Seq) analyses combined with physiological and biochemical studies in the flag leaf of plants grown in field chambers at ear emergence. Enhanced photosynthesis by elevated CO2 was accompanied by an increase in biomass and starch and fructan content, and a decrease in N compounds, as chlorophyll, soluble proteins, and Rubisco content, in association with a decline of nitrate reductase and initial and total Rubisco activities. While high temperature led to a decline of chlorophyll, Rubisco activity, and protein content, the glucose content increased and starch decreased. Furthermore, elevated CO2 induced several genes involved in mitochondrial electron transport, a few genes for photosynthesis and fructan synthesis, and most of the genes involved in secondary metabolism and gibberellin and jasmonate metabolism, whereas those related to light harvesting, N assimilation, and other hormone pathways were repressed. High temperature repressed genes for C, energy, N, lipid, secondary, and hormone metabolisms. Under the combined increases in atmospheric CO2 and temperature, the transcript profile resembled that previously reported for high temperature, although elevated CO2 partly alleviated the downregulation of primary and secondary metabolism genes. The results suggest that there was a reprogramming of primary and secondary metabolism under the future climatic scenario, leading to coordinated regulation of C-N metabolism towards C-rich metabolites at elevated CO2 and a shift away from C-rich secondary metabolites at high temperature. Several candidate genes differentially expressed were identified, including protein kinases, receptor kinases, and transcription factors.
Collapse
Affiliation(s)
- Rubén Vicente
- Institute of Natural Resources and Agrobiology of Salamanca (IRNASA), Consejo Superior de Investigaciones Científicas (CSIC), Salamanca, Spain
| | | | - Rafael Martínez-Carrasco
- Institute of Natural Resources and Agrobiology of Salamanca (IRNASA), Consejo Superior de Investigaciones Científicas (CSIC), Salamanca, Spain
| | - Pilar Pérez
- Institute of Natural Resources and Agrobiology of Salamanca (IRNASA), Consejo Superior de Investigaciones Científicas (CSIC), Salamanca, Spain
| | - Elena Gutiérrez
- Institute of Natural Resources and Agrobiology of Salamanca (IRNASA), Consejo Superior de Investigaciones Científicas (CSIC), Salamanca, Spain
| | - Björn Usadel
- Institute for Biology 1, RWTH Aachen University, Aachen, Germany
- Institute of Bio- and Geosciences, IBG-2: Plant Sciences, Forschungszentrum Jülich, Jülich, Germany
| | - Rosa Morcuende
- Institute of Natural Resources and Agrobiology of Salamanca (IRNASA), Consejo Superior de Investigaciones Científicas (CSIC), Salamanca, Spain
| |
Collapse
|
30
|
Buchan DWA, Jones DT. Learning a functional grammar of protein domains using natural language word embedding techniques. Proteins 2019; 88:616-624. [PMID: 31703152 DOI: 10.1002/prot.25842] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Revised: 10/08/2019] [Accepted: 11/03/2019] [Indexed: 11/10/2022]
Abstract
In this paper, using Word2vec, a widely-used natural language processing method, we demonstrate that protein domains may have a learnable implicit semantic "meaning" in the context of their functional contributions to the multi-domain proteins in which they are found. Word2vec is a group of models which can be used to produce semantically meaningful embeddings of words or tokens in a fixed-dimension vector space. In this work, we treat multi-domain proteins as "sentences" where domain identifiers are tokens which may be considered as "words." Using all InterPro (Finn et al. 2017) pfam domain assignments we observe that the embedding could be used to suggest putative GO assignments for Pfam (Finn et al. 2016) domains of unknown function.
Collapse
Affiliation(s)
- Daniel W A Buchan
- Department of Computer Science, University College London, London, UK
| | - David T Jones
- Department of Computer Science, University College London, London, UK
| |
Collapse
|
31
|
Rentzsch R, Deneke C, Nitsche A, Renard BY. Predicting bacterial virulence factors - evaluation of machine learning and negative data strategies. Brief Bioinform 2019; 21:1596-1608. [PMID: 32978619 DOI: 10.1093/bib/bbz076] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Revised: 05/17/2019] [Accepted: 06/01/2019] [Indexed: 11/12/2022] Open
Abstract
Bacterial proteins dubbed virulence factors (VFs) are a highly diverse group of sequences, whose only obvious commonality is the very property of being, more or less directly, involved in virulence. It is therefore tempting to speculate whether their prediction, based on direct sequence similarity (seqsim) to known VFs, could be enhanced or even replaced by using machine-learning methods. Specifically, when trained on a large and diverse set of VFs, such may be able to detect putative, non-trivial characteristics shared by otherwise unrelated VF families and therefore better predict novel VFs with insignificant similarity to each individual family. We therefore first reassess the performance of dimer-based Support Vector Machines, as used in the widely used MP3 method, in light of seqsim-only and seqsim/dimer-hybrid classifiers. We then repeat the analysis with a novel, considerably more diverse data set, also addressing the important problem of negative data selection. Finally, we move on to the real-world use case of proteome-wide VF prediction, outlining different approaches to estimating specificity in this scenario. We find that direct seqsim is of unparalleled importance and therefore should always be exploited. Further, we observe strikingly low correlations between different feature and classifier types when ranking proteins by VF likeness. We therefore propose a 'best of each world' approach to prioritize proteins for experimental testing, focussing on the top predictions of each classifier. Further, classifiers for individual VF families should be developed.
Collapse
Affiliation(s)
- Robert Rentzsch
- Bioinformatics Unit (MF 1), Robert Koch Institute, Berlin.,Institute for Innovation and Technology (IIT), Steinplatz 1, Berlin
| | - Carlus Deneke
- Bioinformatics Unit (MF 1), Robert Koch Institute, Berlin.,Molecular Microbiology and Genome Analysis Unit, German Federal Institute for Risk Assessment, Berlin
| | - Andreas Nitsche
- Centre for Biological Threats and Special Pathogens: Highly Pathogenic Viruses (ZBS 1), Robert Koch Institute, Berlin
| | | |
Collapse
|
32
|
Morkūnaitė-Haimi Š, Vinskiene J, Stanienė G, Haimi P. Differential Chloroplast Proteomics of Temperature Adaptation in Apple (Malus x domestica Borkh.) Microshoots. Proteomics 2019; 19:e1800142. [PMID: 31430045 DOI: 10.1002/pmic.201800142] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2018] [Revised: 08/15/2019] [Indexed: 11/10/2022]
Abstract
Temperature stress is one of the most common external factors that plants have to adapt to. Accordingly, plants have developed several adaptation mechanisms to deal with temperature stress. Chloroplasts are one of the organelles that are responsible for the sensing of the temperature signal and triggering a response. Here, chloroplasts are purified from low temperature (4° C), control (22° C) and high temperature (30° C) grown Malus x domestica microshoots. The purity of the chloroplast fractions is evaluated by marker proteins, as well as by using in silico subcellular localization predictions. The proteins are digested using filter-aided sample processing and analyzed using nano-LC MS/MS. 733 proteins are observed corresponding to published Malus x domestica gene models and 16 chloroplast genome -encoded proteins in the chloroplast preparates. In ANOVA, 56 proteins are found to be significantly differentially abundant (p < 0.01) between chloroplasts isolated from plants grown in different conditions. The differentially abundant proteins are involved in protein digestion, cytoskeleton structure, cellular redox state and photosynthesis, or have protective functions. Additionally, a putative chloroplastic aquaporin is observed. Data are available via ProteomeXchange with identifier PXD014212.
Collapse
Affiliation(s)
- Šarūnė Morkūnaitė-Haimi
- Institute of Horticulture, Lithuanian Research Centre for Agriculture and Forestry, st. 30, Babtai LT-54333, Kaunas, Lithuania
| | - Jurgita Vinskiene
- Institute of Horticulture, Lithuanian Research Centre for Agriculture and Forestry, st. 30, Babtai LT-54333, Kaunas, Lithuania
| | - Gražina Stanienė
- Institute of Horticulture, Lithuanian Research Centre for Agriculture and Forestry, st. 30, Babtai LT-54333, Kaunas, Lithuania
| | - Perttu Haimi
- Institute of Horticulture, Lithuanian Research Centre for Agriculture and Forestry, st. 30, Babtai LT-54333, Kaunas, Lithuania
| |
Collapse
|
33
|
Törönen P, Medlar A, Holm L. PANNZER2: a rapid functional annotation web server. Nucleic Acids Res 2019; 46:W84-W88. [PMID: 29741643 PMCID: PMC6031051 DOI: 10.1093/nar/gky350] [Citation(s) in RCA: 259] [Impact Index Per Article: 51.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2018] [Accepted: 04/27/2018] [Indexed: 11/12/2022] Open
Abstract
The unprecedented growth of high-throughput sequencing has led to an ever-widening annotation gap in protein databases. While computational prediction methods are available to make up the shortfall, a majority of public web servers are hindered by practical limitations and poor performance. Here, we introduce PANNZER2 (Protein ANNotation with Z-scoRE), a fast functional annotation web server that provides both Gene Ontology (GO) annotations and free text description predictions. PANNZER2 uses SANSparallel to perform high-performance homology searches, making bulk annotation based on sequence similarity practical. PANNZER2 can output GO annotations from multiple scoring functions, enabling users to see which predictions are robust across predictors. Finally, PANNZER2 predictions scored within the top 10 methods for molecular function and biological process in the CAFA2 NK-full benchmark. The PANNZER2 web server is updated on a monthly schedule and is accessible at http://ekhidna2.biocenter.helsinki.fi/sanspanz/. The source code is available under the GNU Public Licence v3.
Collapse
Affiliation(s)
- Petri Törönen
- Institute of Biotechnology, HiLife, University of Helsinki, 00014 Helsinki, Finland
| | - Alan Medlar
- Institute of Biotechnology, HiLife, University of Helsinki, 00014 Helsinki, Finland.,School of Informatics, University of Edinburgh, UK
| | - Liisa Holm
- Institute of Biotechnology, HiLife, University of Helsinki, 00014 Helsinki, Finland.,Faculty of Biological and Environmental Sciences, University of Helsinki, 00014 Helsinki, Finland
| |
Collapse
|
34
|
Environmental conditions shape the nature of a minimal bacterial genome. Nat Commun 2019; 10:3100. [PMID: 31308405 PMCID: PMC6629657 DOI: 10.1038/s41467-019-10837-2] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Accepted: 06/04/2019] [Indexed: 12/16/2022] Open
Abstract
Of the 473 genes in the genome of the bacterium with the smallest genome generated to date, 149 genes have unknown function, emphasising a universal problem; less than 1% of proteins have experimentally determined annotations. Here, we combine the results from state-of-the-art in silico methods for functional annotation and assign functions to 66 of the 149 proteins. Proteins that are still not annotated lack orthologues, lack protein domains, and/ or are membrane proteins. Twenty-four likely transporter proteins are identified indicating the importance of nutrient uptake into and waste disposal out of the minimal bacterial cell in a nutrient-rich environment after removal of metabolic enzymes. Hence, the environment shapes the nature of a minimal genome. Our findings also show that the combination of multiple different state-of-the-art in silico methods for annotating proteins is able to predict functions, even for difficult to characterise proteins and identify crucial gaps for further development.
Collapse
|
35
|
|
36
|
Featherston J, Arakaki Y, Hanschen ER, Ferris PJ, Michod RE, Olson BJSC, Nozaki H, Durand PM. The 4-Celled Tetrabaena socialis Nuclear Genome Reveals the Essential Components for Genetic Control of Cell Number at the Origin of Multicellularity in the Volvocine Lineage. Mol Biol Evol 2019; 35:855-870. [PMID: 29294063 DOI: 10.1093/molbev/msx332] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Multicellularity is the premier example of a major evolutionary transition in individuality and was a foundational event in the evolution of macroscopic biodiversity. The volvocine chlorophyte lineage is well suited for studying this process. Extant members span unicellular, simple colonial, and obligate multicellular taxa with germ-soma differentiation. Here, we report the nuclear genome sequence of one of the most morphologically simple organisms in this lineage-the 4-celled colonial Tetrabaena socialis and compare this to the three other complete volvocine nuclear genomes. Using conservative estimates of gene family expansions a minimal set of expanded gene families was identified that associate with the origin of multicellularity. These families are rich in genes related to developmental processes. A subset of these families is lineage specific, which suggests that at a genomic level the evolution of multicellularity also includes lineage-specific molecular developments. Multiple points of evidence associate modifications to the ubiquitin proteasomal pathway (UPP) with the beginning of coloniality. Genes undergoing positive or accelerating selection in the multicellular volvocines were found to be enriched in components of the UPP and gene families gained at the origin of multicellularity include components of the UPP. A defining feature of colonial/multicellular life cycles is the genetic control of cell number. The genomic data presented here, which includes diversification of cell cycle genes and modifications to the UPP, align the genetic components with the evolution of this trait.
Collapse
Affiliation(s)
- Jonathan Featherston
- Evolutionary Studies Institute, University of the Witwatersrand, Johannesburg, South Africa.,Agricultural Research Council, Biotechnology Platform, Pretoria, South Africa
| | - Yoko Arakaki
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Bunkyo-ku, Tokyo, Hongo, Japan
| | - Erik R Hanschen
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ
| | - Patrick J Ferris
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ
| | - Richard E Michod
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ
| | | | - Hisayoshi Nozaki
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Bunkyo-ku, Tokyo, Hongo, Japan
| | - Pierre M Durand
- Evolutionary Studies Institute, University of the Witwatersrand, Johannesburg, South Africa
| |
Collapse
|
37
|
Shibl AA, Ngugi DK, Talarmin A, Thompson LR, Blom J, Stingl U. The genome of a novel isolate of Prochlorococcus from the Red Sea contains transcribed genes for compatible solute biosynthesis. FEMS Microbiol Ecol 2018; 94:5090968. [PMID: 30188995 DOI: 10.1093/femsec/fiy182] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2018] [Accepted: 09/04/2018] [Indexed: 11/14/2022] Open
Abstract
Marine microbes possess genomic and physiological adaptations to cope with varying environmental conditions. So far, the effects of high salinity on the most abundant marine photoautotrophic organism, Prochlorococcus, in marine oligotrophic environments, are mostly unknown. Here, we report the isolation of a new Prochlorococcus strain (RSP50) belonging to high-light (HL) clade II from the Red Sea, one of the warmest and most saline bodies of water in the global oceans. A comparative genomic analysis identified a set of 59 genes that were exclusive to RSP50 relative to currently available Prochlorococcus genomes, the majority of which (70%) encode for hypothetical proteins of unknown function. However, three of the unique genes encode for a complete pathway for the biosynthesis of the compatible solute glucosylglycerol, and are homologous to enzymes found in the sister lineage Synechococcus. Metatranscriptomic analyses of this metabolic pathway in the water column of the Red Sea revealed that the corresponding genes were constitutively transcribed, independent of depth and light, suggesting that osmoregulation using glucosylglycerol is a general feature of HL II Prochlorococcus in the Red Sea.
Collapse
Affiliation(s)
- Ahmed A Shibl
- Marine Microbial Ecology Lab, Red Sea Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia.,Biology Department, New York University Abu Dhabi, P.O. Box 129188, Abu Dhabi, UAE
| | - David K Ngugi
- Marine Microbial Ecology Lab, Red Sea Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia.,Department of Microorganisms, Leibniz Insitute DSMZ, Inhoffenstrasse 7B, 38124 Braunschweig, Germany
| | - Agathe Talarmin
- Marine Microbial Ecology Lab, Red Sea Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia.,Petroleum Geosciences Department, Petroleum Institute, P.O. Box 2533 Abu Dhabi, UAE
| | - Luke R Thompson
- Marine Microbial Ecology Lab, Red Sea Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia.,Department of Biological Sciences and Northern Gulf Institute, University of Southern Mississippi, Hattiesburg, Mississippi, USA.,Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, stationed at Southwest Fisheries Science Center, La Jolla, California, USA
| | - Jochen Blom
- Bioinformatics and Systems Biology, Justus Liebig University, D-35392 Giessen, Germany
| | - Ulrich Stingl
- Marine Microbial Ecology Lab, Red Sea Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia.,University of Florida, UF/IFAS, Department for Microbiology & Cell Science, Fort Lauderdale Research and Education Center, Davie, FL 33314, USA
| |
Collapse
|
38
|
Ghiselli F, Iannello M, Puccio G, Chang PL, Plazzi F, Nuzhdin SV, Passamonti M. Comparative Transcriptomics in Two Bivalve Species Offers Different Perspectives on the Evolution of Sex-Biased Genes. Genome Biol Evol 2018; 10:1389-1402. [PMID: 29897459 PMCID: PMC6007409 DOI: 10.1093/gbe/evy082] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/19/2018] [Indexed: 12/13/2022] Open
Abstract
Comparative genomics has become a central tool for evolutionary biology, and a better knowledge of understudied taxa represents the foundation for future work. In this study, we characterized the transcriptome of male and female mature gonads in the European clam Ruditapes decussatus, compared with that in the Manila clam Ruditapes philippinarum providing, for the first time in bivalves, information about transcription dynamics and sequence evolution of sex-biased genes. In both the species, we found a relatively low number of sex-biased genes (1,284, corresponding to 41.3% of the orthologous genes between the two species), probably due to the absence of sexual dimorphism, and the transcriptional bias is maintained in only 33% of the orthologs. The dN/dS is generally low, indicating purifying selection, with genes where the female-biased transcription is maintained between the two species showing a significantly higher dN/dS. Genes involved in embryo development, cell proliferation, and maintenance of genome stability show a faster sequence evolution. Finally, we report a lack of clear correlation between transcription level and evolutionary rate in these species, in contrast with studies that reported a negative correlation. We discuss such discrepancy and call into question some methodological approaches and rationales generally used in this type of comparative studies.
Collapse
Affiliation(s)
- Fabrizio Ghiselli
- Department of Biological, Geological, and Environmental Sciences, University of Bologna, Italy
| | - Mariangela Iannello
- Department of Biological, Geological, and Environmental Sciences, University of Bologna, Italy
| | - Guglielmo Puccio
- Department of Biological, Geological, and Environmental Sciences, University of Bologna, Italy
| | - Peter L Chang
- Program in Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, USA
| | - Federico Plazzi
- Department of Biological, Geological, and Environmental Sciences, University of Bologna, Italy
| | - Sergey V Nuzhdin
- Program in Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, USA
| | - Marco Passamonti
- Department of Biological, Geological, and Environmental Sciences, University of Bologna, Italy
| |
Collapse
|
39
|
Formentin E, Barizza E, Stevanato P, Falda M, Massa F, Tarkowskà D, Novák O, Lo Schiavo F. Fast Regulation of Hormone Metabolism Contributes to Salt Tolerance in Rice ( Oryzasativa spp. Japonica, L.) by Inducing Specific Morpho-Physiological Responses. PLANTS (BASEL, SWITZERLAND) 2018; 7:E75. [PMID: 30223560 PMCID: PMC6161274 DOI: 10.3390/plants7030075] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/24/2018] [Revised: 09/10/2018] [Accepted: 09/13/2018] [Indexed: 11/16/2022]
Abstract
Clear evidence has highlighted a role for hormones in the plant stress response, including salt stress. Interplay and cross-talk among different hormonal pathways are of vital importance in abiotic stress tolerance. A genome-wide transcriptional analysis was performed on leaves and roots of three-day salt treated and untreated plants of two Italian rice varieties, Baldo and Vialone Nano, which differ in salt sensitivity. Genes correlated with hormonal pathways were identified and analyzed. The contents of abscisic acid, indoleacetic acid, cytokinins, and gibberellins were measured in roots, stems, and leaves of seedlings exposed for one and three days to salt stress. From the transcriptomic analysis, a huge number of genes emerged as being involved in hormone regulation in response to salt stress. The expression profile of genes involved in biosynthesis, signaling, response, catabolism, and conjugation of phytohormones was analyzed and integrated with the measurements of hormones in roots, stems, and leaves of seedlings. Significant changes in the hormone levels, along with differences in morphological responses, emerged between the two varieties. These results support the faster regulation of hormones metabolism in the tolerant variety that allows a prompt growth reprogramming and the setting up of an acclimation program, leading to specific morpho-physiological responses and growth recovery.
Collapse
Affiliation(s)
- Elide Formentin
- Department of Biology, University of Padova, 35131 Padua, Italy.
| | | | - Piergiorgio Stevanato
- Department of Agronomy, Animals, Natural Resources and Environment-DAFNAE, University of Padova, 35020 Legnaro (Padova), Italy.
| | - Marco Falda
- Department of Molecular Medicine, University of Padova, Viale G.Colombo 3, 35121 Padova, Italy.
| | - Federica Massa
- Department of Biology, University of Padova, 35131 Padua, Italy.
| | - Danuše Tarkowskà
- Laboratory of Growth Regulators, Centre of the Region Haná for Biotechnological and Agricultural Research, Institute of Experimental Botany AS CR & Faculty of Science, Palacký University, Šlechtitelů 27, CZ-78371 Olomouc, Czech Republic.
| | - Ondřej Novák
- Laboratory of Growth Regulators, Centre of the Region Haná for Biotechnological and Agricultural Research, Institute of Experimental Botany AS CR & Faculty of Science, Palacký University, Šlechtitelů 27, CZ-78371 Olomouc, Czech Republic.
| | | |
Collapse
|
40
|
Doğan T. HPO2GO: prediction of human phenotype ontology term associations for proteins using cross ontology annotation co-occurrences. PeerJ 2018; 6:e5298. [PMID: 30083448 PMCID: PMC6076985 DOI: 10.7717/peerj.5298] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2018] [Accepted: 07/03/2018] [Indexed: 01/24/2023] Open
Abstract
Analysing the relationships between biomolecules and the genetic diseases is a highly active area of research, where the aim is to identify the genes and their products that cause a particular disease due to functional changes originated from mutations. Biological ontologies are frequently employed in these studies, which provides researchers with extensive opportunities for knowledge discovery through computational data analysis. In this study, a novel approach is proposed for the identification of relationships between biomedical entities by automatically mapping phenotypic abnormality defining HPO terms with biomolecular function defining GO terms, where each association indicates the occurrence of the abnormality due to the loss of the biomolecular function expressed by the corresponding GO term. The proposed HPO2GO mappings were extracted by calculating the frequency of the co-annotations of the terms on the same genes/proteins, using already existing curated HPO and GO annotation sets. This was followed by the filtering of the unreliable mappings that could be observed due to chance, by statistical resampling of the co-occurrence similarity distributions. Furthermore, the biological relevance of the finalized mappings were discussed over selected cases, using the literature. The resulting HPO2GO mappings can be employed in different settings to predict and to analyse novel gene/protein—ontology term—disease relations. As an application of the proposed approach, HPO term—protein associations (i.e., HPO2protein) were predicted. In order to test the predictive performance of the method on a quantitative basis, and to compare it with the state-of-the-art, CAFA2 challenge HPO prediction target protein set was employed. The results of the benchmark indicated the potential of the proposed approach, as HPO2GO performance was among the best (Fmax = 0.35). The automated cross ontology mapping approach developed in this work may be extended to other ontologies as well, to identify unexplored relation patterns at the systemic level. The datasets, results and the source code of HPO2GO are available for download at: https://github.com/cansyl/HPO2GO.
Collapse
Affiliation(s)
- Tunca Doğan
- Department of Health Informatics, Graduate School of Informatics, Middle East Technical University, Ankara, Turkey.,Cancer Systems Biology Laboratory (KanSiL), Graduate School of Informatics, Middle East Technical University, Ankara, Turkey.,European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, UK
| |
Collapse
|
41
|
Zhou R, Macaya-Sanz D, Rodgers-Melnick E, Carlson CH, Gouker FE, Evans LM, Schmutz J, Jenkins JW, Yan J, Tuskan GA, Smart LB, DiFazio SP. Characterization of a large sex determination region in Salix purpurea L. (Salicaceae). Mol Genet Genomics 2018; 293:1437-1452. [PMID: 30022352 DOI: 10.1007/s00438-018-1473-y] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2018] [Accepted: 07/09/2018] [Indexed: 12/30/2022]
Abstract
Dioecy has evolved numerous times in plants, but heteromorphic sex chromosomes are apparently rare. Sex determination has been studied in multiple Salix and Populus (Salicaceae) species, and P. trichocarpa has an XY sex determination system on chromosome 19, while S. suchowensis and S. viminalis have a ZW system on chromosome 15. Here we use whole genome sequencing coupled with quantitative trait locus mapping and a genome-wide association study to characterize the genomic composition of the non-recombining portion of the sex determination region. We demonstrate that Salix purpurea also has a ZW system on chromosome 15. The sex determination region has reduced recombination, high structural polymorphism, an abundance of transposable elements, and contains genes that are involved in sex expression in other plants. We also show that chromosome 19 contains sex-associated markers in this S. purpurea assembly, along with other autosomes. This raises the intriguing possibility of a translocation of the sex determination region within the Salicaceae lineage, suggesting a common evolutionary origin of the Populus and Salix sex determination loci.
Collapse
Affiliation(s)
- Ran Zhou
- Department of Biology, West Virginia University, 53 Campus Drive, Morgantown, WV, 26506-6057, USA
| | - David Macaya-Sanz
- Department of Biology, West Virginia University, 53 Campus Drive, Morgantown, WV, 26506-6057, USA
| | - Eli Rodgers-Melnick
- Department of Biology, West Virginia University, 53 Campus Drive, Morgantown, WV, 26506-6057, USA
| | - Craig H Carlson
- Horticulture Section, School of Integrative Plant Science, Cornell University, New York State Agricultural Experiment Station, Geneva, NY, 14456, USA
| | - Fred E Gouker
- Horticulture Section, School of Integrative Plant Science, Cornell University, New York State Agricultural Experiment Station, Geneva, NY, 14456, USA
| | - Luke M Evans
- Department of Biology, West Virginia University, 53 Campus Drive, Morgantown, WV, 26506-6057, USA
| | - Jeremy Schmutz
- HudsonAlpha Institute of Biotechnology, 601 Genome Way Northwest, Huntsville, AL, 35806, USA.,Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Jerry W Jenkins
- HudsonAlpha Institute of Biotechnology, 601 Genome Way Northwest, Huntsville, AL, 35806, USA
| | - Juying Yan
- Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Gerald A Tuskan
- Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA.,Biosciences Division, Oak Ridge National Lab, Oak Ridge, USA
| | - Lawrence B Smart
- Horticulture Section, School of Integrative Plant Science, Cornell University, New York State Agricultural Experiment Station, Geneva, NY, 14456, USA
| | - Stephen P DiFazio
- Department of Biology, West Virginia University, 53 Campus Drive, Morgantown, WV, 26506-6057, USA.
| |
Collapse
|
42
|
Manivel G, Meyyazhagan A, Durairaj D R, Piramanayagam S. Genome-wide analysis of Excretory/Secretory proteins in Trypanosoma brucei brucei: Insights into functional characteristics and identification of potential targets by immunoinformatics approach. Genomics 2018; 111:1124-1133. [PMID: 30006035 DOI: 10.1016/j.ygeno.2018.07.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2018] [Revised: 06/28/2018] [Accepted: 07/08/2018] [Indexed: 11/28/2022]
Abstract
Trypanosoma brucei brucei (T.b.brucei) is an extra-cellular parasite that causes Animal African Trypanosomiasis (AAT) disease in animals. Till day, this disease is more difficult to treat and control due to lack of efficient vaccines and early diagnosis of the parasite infection. T.b.brucei Excretory/Secretory (ES) proteins were involved in pathogenesis and key for understanding the host-parasite interactions. Functions of T.b.brucei's ES proteins were poorly investigated and experimental identification is expensive and time-consuming. Bioinformatics approaches are cost-effective by facilitating the experimental analysis of potential drug targets for parasitic diseases. Here we applied several bioinformatics tools to predict and functionalize the annotation of 1104 ES proteins and immunoinformatics approaches carried out to predict and evaluate the epitopes in T.b.brucei. Secretory information, functional annotations and potential epitopes of each ES proteins were available at http://tbb.insilico.in. This study provides functional information of T.b.brucei for experimental studies to identify potential targets for diagnosis and therapeutics development.
Collapse
Affiliation(s)
- Gowdham Manivel
- Department of Bioinformatics, Bharathiar University, Coimbatore, India.
| | - Arun Meyyazhagan
- Cytogenetics Department, EuroEspes Biomedical Research Center, Institute of Medical Science and Genomic Medicine, 15165 Bergondo, Corunna, Spain
| | - Ruban Durairaj D
- Department of Bioinformatics, Bharathiar University, Coimbatore, India
| | | |
Collapse
|
43
|
Wimalanathan K, Friedberg I, Andorf CM, Lawrence‐Dill CJ. Maize GO Annotation-Methods, Evaluation, and Review (maize-GAMER). PLANT DIRECT 2018; 2:e00052. [PMID: 31245718 PMCID: PMC6508527 DOI: 10.1002/pld3.52] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2017] [Revised: 01/25/2018] [Accepted: 02/17/2018] [Indexed: 05/20/2023]
Abstract
We created a new high-coverage, robust, and reproducible functional annotation of maize protein-coding genes based on Gene Ontology (GO) term assignments. Whereas the existing Phytozome and Gramene maize GO annotation sets only cover 41% and 56% of maize protein-coding genes, respectively, this study provides annotations for 100% of the genes. We also compared the quality of our newly derived annotations with the existing Gramene and Phytozome functional annotation sets by comparing all three to a manually annotated gold standard set of 1,619 genes where annotations were primarily inferred from direct assay or mutant phenotype. Evaluations based on the gold standard indicate that our new annotation set is measurably more accurate than those from Phytozome and Gramene. To derive this new high-coverage, high-confidence annotation set, we used sequence similarity and protein domain presence methods as well as mixed-method pipelines that were developed for the Critical Assessment of Function Annotation (CAFA) challenge. Our project to improve maize annotations is called maize-GAMER (GO Annotation Method, Evaluation, and Review), and the newly derived annotations are accessible via MaizeGDB (http://download.maizegdb.org/maize-GAMER) and CyVerse (B73 RefGen_v3 5b+ at doi.org/10.7946/P2S62P and B73 RefGen_v4 Zm00001d.2 at doi.org/10.7946/P2M925).
Collapse
Affiliation(s)
- Kokulapalan Wimalanathan
- Bioinformatics and Computational BiologyIowa State UniversityAmesIAUSA
- Department of Genetics Development and Cell BiologyIowa State UniversityAmesIAUSA
| | - Iddo Friedberg
- Bioinformatics and Computational BiologyIowa State UniversityAmesIAUSA
- Department of Veterinary Microbiology and Preventive MedicineIowa State UniversityAmesIAUSA
| | - Carson M. Andorf
- USDA‐ARS Corn Insects and Crop Genetics Research UnitIowa State UniversityAmesIAUSA
- Department of Computer ScienceIowa State UniversityAmesIAUSA
| | - Carolyn J. Lawrence‐Dill
- Bioinformatics and Computational BiologyIowa State UniversityAmesIAUSA
- Department of Genetics Development and Cell BiologyIowa State UniversityAmesIAUSA
- Department of AgronomyIowa State UniversityAmesIAUSA
| |
Collapse
|
44
|
Oostra V, Saastamoinen M, Zwaan BJ, Wheat CW. Strong phenotypic plasticity limits potential for evolutionary responses to climate change. Nat Commun 2018. [PMID: 29520061 PMCID: PMC5843647 DOI: 10.1038/s41467-018-03384-9] [Citation(s) in RCA: 102] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Phenotypic plasticity, the expression of multiple phenotypes from one genome, is a widespread adaptation to short-term environmental fluctuations, but whether it facilitates evolutionary adaptation to climate change remains contentious. Here, we investigate seasonal plasticity and adaptive potential in an Afrotropical butterfly expressing distinct phenotypes in dry and wet seasons. We assess the transcriptional architecture of plasticity in a full-factorial analysis of heritable and environmental effects across 72 individuals, and reveal pervasive gene expression differences between the seasonal phenotypes. Strikingly, intra-population genetic variation for plasticity is largely absent, consistent with specialisation to a particular environmental cue reliably predicting seasonal transitions. Under climate change, deteriorating accuracy of predictive cues will likely aggravate maladaptive phenotype-environment mismatches and increase selective pressures on reaction norms. However, the observed paucity of genetic variation for plasticity limits evolutionary responses, potentially weakening prospects for population persistence. Thus, seasonally plastic species may be especially vulnerable to climate change.
Collapse
Affiliation(s)
- Vicencio Oostra
- Department of Genetics, Evolution and Environment, University College London, The Darwin Building, Gower Street, London, WC1E 6BT, UK. .,Department of Plant Sciences, Laboratory of Genetics, Wageningen University, PO Box 16, 6700AA, Wageningen, The Netherlands.
| | - Marjo Saastamoinen
- Organismal and Evolutionary Biology Research Programme, University of Helsinki, PO Box 65, Helsinki, FI-00014, Finland
| | - Bas J Zwaan
- Department of Plant Sciences, Laboratory of Genetics, Wageningen University, PO Box 16, 6700AA, Wageningen, The Netherlands
| | - Christopher W Wheat
- Department of Zoology, Population Genetics, Stockholm University, S-10691, Stockholm, Sweden
| |
Collapse
|
45
|
Wang Z, Zhao C, Wang Y, Sun Z, Wang N. PANDA: Protein function prediction using domain architecture and affinity propagation. Sci Rep 2018; 8:3484. [PMID: 29472600 PMCID: PMC5823857 DOI: 10.1038/s41598-018-21849-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2017] [Accepted: 02/09/2018] [Indexed: 12/23/2022] Open
Abstract
We developed PANDA (Propagation of Affinity and Domain Architecture) to predict protein functions in the format of Gene Ontology (GO) terms. PANDA at first executes profile-profile alignment algorithm to search against PfamA, KOG, COG, and SwissProt databases, and then launches PSI-BLAST against UniProt for homologue search. PANDA integrates a domain architecture inference algorithm based on the Bayesian statistics that calculates the probability of having a GO term. All the candidate GO terms are pooled and filtered based on Z-score. After that, the remaining GO terms are clustered using an affinity propagation algorithm based on the GO directed acyclic graph, followed by a second round of filtering on the clusters of GO terms. We benchmarked the performance of all the baseline predictors PANDA integrates and also for every pooling and filtering step of PANDA. It can be found that PANDA achieves better performances in terms of area under the curve for precision and recall compared to the baseline predictors. PANDA can be accessed from http://dna.cs.miami.edu/PANDA/ .
Collapse
Affiliation(s)
- Zheng Wang
- Department of Computer Science, University of Miami, 1364 Memorial Drive, P.O. Box 248154, Coral Gables, FL, 33124, USA.
| | - Chenguang Zhao
- School of Computing, University of Southern Mississippi, 118 College Drive #5106, Hattiesburg, MS, 39406, USA
| | - Yiheng Wang
- School of Computing, University of Southern Mississippi, 118 College Drive #5106, Hattiesburg, MS, 39406, USA
| | - Zheng Sun
- Department of Mathematics and Computer Science, The Citadel, 171 Moulrie Street, Charleston, SC, 29409, USA
| | - Nan Wang
- Department of Computer Science, New Jersey City University, 2039 Kennedy Blvd, Jersey City, NJ, 07305, USA
| |
Collapse
|
46
|
Formentin E, Sudiro C, Perin G, Riccadonna S, Barizza E, Baldoni E, Lavezzo E, Stevanato P, Sacchi GA, Fontana P, Toppo S, Morosinotto T, Zottini M, Lo Schiavo F. Transcriptome and Cell Physiological Analyses in Different Rice Cultivars Provide New Insights Into Adaptive and Salinity Stress Responses. FRONTIERS IN PLANT SCIENCE 2018; 9:204. [PMID: 29556243 PMCID: PMC5844958 DOI: 10.3389/fpls.2018.00204] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2017] [Accepted: 02/02/2018] [Indexed: 05/20/2023]
Abstract
Salinity tolerance has been extensively investigated in recent years due to its agricultural importance. Several features, such as the regulation of ionic transporters and metabolic adjustments, have been identified as salt tolerance hallmarks. Nevertheless, due to the complexity of the trait, the results achieved to date have met with limited success in improving the salt tolerance of rice plants when tested in the field, thus suggesting that a better understanding of the tolerance mechanisms is still required. In this work, differences between two varieties of rice with contrasting salt sensitivities were revealed by the imaging of photosynthetic parameters, ion content analysis and a transcriptomic approach. The transcriptomic analysis conducted on tolerant plants supported the setting up of an adaptive program consisting of sodium distribution preferentially limited to the roots and older leaves, and in the activation of regulatory mechanisms of photosynthesis in the new leaves. As a result, plants resumed grow even under prolonged saline stress. In contrast, in the sensitive variety, RNA-seq analysis revealed a misleading response, ending in senescence and cell death. The physiological response at the cellular level was investigated by measuring the intracellular profile of H2O2 in the roots, using a fluorescent probe. In the roots of tolerant plants, a quick response was observed with an increase in H2O2 production within 5 min after salt treatment. The expression analysis of some of the genes involved in perception, signal transduction and salt stress response confirmed their early induction in the roots of tolerant plants compared to sensitive ones. By inhibiting the synthesis of apoplastic H2O2, a reduction in the expression of these genes was detected. Our results indicate that quick H2O2 signaling in the roots is part of a coordinated response that leads to adaptation instead of senescence in salt-treated rice plants.
Collapse
Affiliation(s)
- Elide Formentin
- Department of Biology, University of Padova, Padova, Italy
- *Correspondence: Elide Formentin,
| | | | - Giorgio Perin
- Department of Biology, University of Padova, Padova, Italy
| | - Samantha Riccadonna
- Research and Innovation Centre, Edmund Mach Foundation, San Michele all’Adige, Italy
| | | | - Elena Baldoni
- Department of Agricultural and Environmental Sciences - Production, Landscape, Agroenergy, University of Milan, Milan, Italy
| | - Enrico Lavezzo
- Department of Molecular Medicine, University of Padova, Padova, Italy
| | - Piergiorgio Stevanato
- Department of Agronomy, Food, Natural Resources, Animals and the Environment, University of Padova, Padova, Italy
| | - Gian Attilio Sacchi
- Department of Agricultural and Environmental Sciences - Production, Landscape, Agroenergy, University of Milan, Milan, Italy
| | - Paolo Fontana
- Research and Innovation Centre, Edmund Mach Foundation, San Michele all’Adige, Italy
| | - Stefano Toppo
- Department of Molecular Medicine, University of Padova, Padova, Italy
| | | | | | | |
Collapse
|
47
|
HashGO: hashing gene ontology for protein function prediction. Comput Biol Chem 2017; 71:264-273. [DOI: 10.1016/j.compbiolchem.2017.09.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2017] [Accepted: 09/25/2017] [Indexed: 10/18/2022]
|
48
|
Rifaioglu AS, Doğan T, Saraç ÖS, Ersahin T, Saidi R, Atalay MV, Martin MJ, Cetin-Atalay R. Large-scale automated function prediction of protein sequences and an experimental case study validation on PTEN transcript variants. Proteins 2017; 86:135-151. [PMID: 29098713 DOI: 10.1002/prot.25416] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2017] [Revised: 10/24/2017] [Accepted: 11/01/2017] [Indexed: 12/24/2022]
Abstract
Recent advances in computing power and machine learning empower functional annotation of protein sequences and their transcript variations. Here, we present an automated prediction system UniGOPred, for GO annotations and a database of GO term predictions for proteomes of several organisms in UniProt Knowledgebase (UniProtKB). UniGOPred provides function predictions for 514 molecular function (MF), 2909 biological process (BP), and 438 cellular component (CC) GO terms for each protein sequence. UniGOPred covers nearly the whole functionality spectrum in Gene Ontology system and it can predict both generic and specific GO terms. UniGOPred was run on CAFA2 challenge target protein sequences and it is categorized within the top 10 best performing methods for the molecular function category. In addition, the performance of UniGOPred is higher compared to the baseline BLAST classifier in all categories of GO. UniGOPred predictions are compared with UniProtKB/TrEMBL database annotations as well. Furthermore, the proposed tool's ability to predict negatively associated GO terms that defines the functions that a protein does not possess, is discussed. UniGOPred annotations were also validated by case studies on PTEN protein variants experimentally and on CHD8 protein variants with literature. UniGOPred protein functional annotation system is available as an open access tool at http://cansyl.metu.edu.tr/UniGOPred.html.
Collapse
Affiliation(s)
- Ahmet Sureyya Rifaioglu
- Department of Computer Engineering, Middle East Technical University, Ankara, 06800, Turkey.,Department of Computer Engineering, İskenderun Technical University, Hatay, 31200, Turkey
| | - Tunca Doğan
- Protein Function Development Team, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom.,CanSyL, Graduate School of Informatics, Middle East Technical University, Ankara, 06800, Turkey
| | - Ömer Sinan Saraç
- Department of Computer Engineering, Istanbul Technical University, İstanbul, 34467, Turkey
| | - Tulin Ersahin
- CanSyL, Graduate School of Informatics, Middle East Technical University, Ankara, 06800, Turkey
| | - Rabie Saidi
- Protein Function Development Team, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Mehmet Volkan Atalay
- Department of Computer Engineering, Middle East Technical University, Ankara, 06800, Turkey
| | - Maria Jesus Martin
- Protein Function Development Team, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Rengul Cetin-Atalay
- CanSyL, Graduate School of Informatics, Middle East Technical University, Ankara, 06800, Turkey
| |
Collapse
|
49
|
Ong E, Wong MU, He Y. Identification of New Features from Known Bacterial Protective Vaccine Antigens Enhances Rational Vaccine Design. Front Immunol 2017; 8:1382. [PMID: 29123525 PMCID: PMC5662880 DOI: 10.3389/fimmu.2017.01382] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2017] [Accepted: 10/06/2017] [Indexed: 11/13/2022] Open
Abstract
With many protective vaccine antigens reported in the literature and verified experimentally, how to use the knowledge mined from these antigens to support rational vaccine design and study underlying design mechanism remains unclear. In order to address the problem, a systematic bioinformatics analysis was performed on 291 Gram-positive and Gram-negative bacterial protective antigens with experimental evidence manually curated in the Protegen database. The bioinformatics analyses evaluated included subcellular localization, adhesin probability, peptide signaling, transmembrane α-helix and β-barrel, conserved domain, Clusters of Orthologous Groups, and Gene Ontology functional annotations. Here we showed the critical role of adhesins, along with subcellular localization, peptide signaling, in predicting secreted extracellular or surface-exposed protective antigens, with mechanistic explanations supported by functional analysis. We also found a significant negative correlation of transmembrane α-helix to antigen protectiveness in Gram-positive and Gram-negative pathogens, while a positive correlation of transmembrane β-barrel was observed in Gram-negative pathogens. The commonly less-focused cytoplasmic and cytoplasmic membrane proteins could be potentially predicted with the help of other selection criteria such as adhesin probability and functional analysis. The significant findings in this study can support rational vaccine design and enhance our understanding of vaccine design mechanisms.
Collapse
Affiliation(s)
- Edison Ong
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States
| | - Mei U Wong
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, United States
| | - Yongqun He
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, United States.,Center of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States
| |
Collapse
|
50
|
Navarrete-Perea J, Isasa M, Paulo JA, Corral-Corral R, Flores-Bautista J, Hernández-Téllez B, Bobes RJ, Fragoso G, Sciutto E, Soberón X, Gygi SP, Laclette JP. Quantitative multiplexed proteomics of Taenia solium cysts obtained from the skeletal muscle and central nervous system of pigs. PLoS Negl Trop Dis 2017; 11:e0005962. [PMID: 28945737 PMCID: PMC5634658 DOI: 10.1371/journal.pntd.0005962] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2017] [Revised: 10/10/2017] [Accepted: 09/13/2017] [Indexed: 01/01/2023] Open
Abstract
In human and porcine cysticercosis caused by the tapeworm Taenia solium, the larval stage (cysts) can infest several tissues including the central nervous system (CNS) and the skeletal muscles (SM). The cyst’s proteomics changes associated with the tissue localization in the host tissues have been poorly studied. Quantitative multiplexed proteomics has the power to evaluate global proteome changes in response to different conditions. Here, using a TMT-multiplexed strategy we identified and quantified over 4,200 proteins in cysts obtained from the SM and CNS of pigs, of which 891 were host proteins. To our knowledge, this is the most extensive intermixing of host and parasite proteins reported for tapeworm infections.Several antigens in cysticercosis, i.e., GP50, paramyosin and a calcium-binding protein were enriched in skeletal muscle cysts. Our results suggested the occurrence of tissue-enriched antigen that could be useful in the improvement of the immunodiagnosis for cysticercosis. Using several algorithms for epitope detection, we selected 42 highly antigenic proteins enriched for each tissue localization of the cysts. Taking into account the fold changes and the antigen/epitope contents, we selected 10 proteins and produced synthetic peptides from the best epitopes. Nine peptides were recognized by serum antibodies of cysticercotic pigs, suggesting that those peptides are antigens. Mixtures of peptides derived from SM and CNS cysts yielded better results than mixtures of peptides derived from a single tissue location, however the identification of the ‘optimal’ tissue-enriched antigens remains to be discovered. Through machine learning technologies, we determined that a reliable immunodiagnostic test for porcine cysticercosis required at least five different antigenic determinants. Human and porcine cysticercosis caused by Taenia solium is a parasite disease still endemic in developing countries. The cysts can be located in different host tissues, including different organs of the central nervous system and the skeletal muscles. The molecular mechanisms associated with the tissue localization of the cysts are not well understood. Here, we described the proteome changes of the cysts obtained from different host tissues from infected pigs using quantitative multiplex proteomics. We explored the diversity of host proteins identified in the cyst’s protein extracts and we also explored the immune-localization of several host-related proteins within the cysts, and propose their possible function. We identified several proteins and antigens enriched for a given tissue localization. Several synthetic peptides designed from these tissue-enriched antigens were tested trough ELISA. Using a combination of peptide mixtures and machine learning technologies we were able to distinguish non cysticercotic and cysticercotic pig’s sera. The tissue-enriched proteins/antigens could be useful for the development of improved immuno-diagnostic tests capable of discriminate the tissue-localization of the cysts.
Collapse
Affiliation(s)
- José Navarrete-Perea
- Dept. of Immunology, Institute for Biomedical Research, Universidad Nacional Autónoma de México, Ciudad de México, México
| | - Marta Isasa
- Dept. of Cell Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Joao A Paulo
- Dept. of Cell Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Ricardo Corral-Corral
- Dept. of Biochemistry and Structural Biology, Institute of Cell Physiology, Universidad Nacional Autónoma de México, Ciudad de México, México
| | - Jeanette Flores-Bautista
- Dept. of Immunology, Institute for Biomedical Research, Universidad Nacional Autónoma de México, Ciudad de México, México
| | - Beatriz Hernández-Téllez
- Dept. of Tissue and Cell Biology, School of Medicine, Universidad Nacional Autónoma de México, Ciudad de México, México
| | - Raúl J Bobes
- Dept. of Immunology, Institute for Biomedical Research, Universidad Nacional Autónoma de México, Ciudad de México, México
| | - Gladis Fragoso
- Dept. of Immunology, Institute for Biomedical Research, Universidad Nacional Autónoma de México, Ciudad de México, México
| | - Edda Sciutto
- Dept. of Immunology, Institute for Biomedical Research, Universidad Nacional Autónoma de México, Ciudad de México, México
| | - Xavier Soberón
- Instituto Nacional de Medicina Genómica, Ciudad de México, México.,Dept. of Biocatalysis and Cellular Engineering, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Morelos, México
| | - Steven P Gygi
- Dept. of Cell Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Juan P Laclette
- Dept. of Immunology, Institute for Biomedical Research, Universidad Nacional Autónoma de México, Ciudad de México, México
| |
Collapse
|