1
|
Liu M, Yu J, Yang M, Cao L, Chen C. Adaptive evolution of chloroplast division mechanisms during plant terrestrialization. Cell Rep 2024; 43:113950. [PMID: 38489264 DOI: 10.1016/j.celrep.2024.113950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 01/12/2024] [Accepted: 02/27/2024] [Indexed: 03/17/2024] Open
Abstract
Despite extensive research, the origin and evolution of the chloroplast division machinery remain unclear. Here, we employ recently sequenced genomes and transcriptomes of Archaeplastida clades to identify the core components of chloroplast division and reconstruct their evolutionary histories, respectively. Our findings show that complete division ring structures emerged in Charophytes. We find that Glaucophytes experienced strong selection pressure, generating diverse variants adapted to the changing terrestrial environments. By integrating the functions of chloroplast division genes (CDGs) annotated in a workflow developed using large-scale multi-omics data, we further show that dispersed duplications acquire more species-specific functions under stronger selection pressures. Notably, PARC6, a dispersed duplicate CDG, regulates leaf color and plant growth in Solanum lycopersicum, demonstrating neofunctionalization. Our findings provide an integrated perspective on the functional evolution of chloroplast division machinery and highlight the potential of dispersed duplicate genes as the primary source of adaptive evolution of chloroplast division.
Collapse
Affiliation(s)
- Moyang Liu
- Shanghai Collaborative Innovation Center of Agri-Seeds, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China; Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Jing Yu
- Shanghai Collaborative Innovation Center of Agri-Seeds, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China; Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Ming Yang
- Shanghai Collaborative Innovation Center of Agri-Seeds, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China; Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Lingyan Cao
- Joint International Research Laboratory of Metabolic and Developmental Sciences, State Key Laboratory of Hybrid Rice, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Cheng Chen
- Shanghai Collaborative Innovation Center of Agri-Seeds, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China; Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China.
| |
Collapse
|
2
|
Liu M, Sun W, Ma Z, Guo C, Chen J, Wu Q, Wang X, Chen H. Integrated network analyses identify MYB4R1 neofunctionalization in the UV-B adaptation of Tartary buckwheat. PLANT COMMUNICATIONS 2022; 3:100414. [PMID: 35923114 PMCID: PMC9700134 DOI: 10.1016/j.xplc.2022.100414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 06/20/2022] [Accepted: 07/20/2022] [Indexed: 06/15/2023]
Abstract
A hallmark of adaptive evolution is innovation in gene function, which is associated with the development of distinct roles for genes during plant evolution; however, assessing functional innovation over long periods of time is not trivial. Tartary buckwheat (Fagopyrum tataricum) originated in the Himalayan region and has been exposed to intense UV-B radiation for a long time, making it an ideal species for studying novel UV-B response mechanisms in plants. Here, we developed a workflow to obtain a co-functional network of UV-B responses using data from more than 10,000 samples in more than 80 projects with multi-species and multi-omics data. Dissecting the entire network revealed that flavonoid biosynthesis was most significantly related to the UV-B response. Importantly, we found that the regulatory factor MYB4R1, which resides at the core of the network, has undergone neofunctionalization. In vitro and in vivo experiments demonstrated that MYB4R1 regulates flavonoid and anthocyanin accumulation in response to UV-B in buckwheat by binding to L-box motifs in the FtCHS, FtFLS, and FtUFGT promoters. We used deep learning to develop a visual discrimination model of buckwheat flavonoid content based on natural populations exposed to global UV-B radiation. Our study highlights the critical role of gene neofunctionalization in UV-B adaptation.
Collapse
Affiliation(s)
- Moyang Liu
- College of Life Science, Sichuan Agricultural University, Ya'an 625014, China; Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China; Shanghai Collaborative Innovation Center of Agri-Seeds/School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Wenjun Sun
- College of Life Science, Sichuan Agricultural University, Ya'an 625014, China
| | - Zhaotang Ma
- College of Life Science, Sichuan Agricultural University, Ya'an 625014, China; State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Key Laboratory of Major Crop Diseases and Rice Research Institute, Sichuan Agricultural University, Chengdu 611130, China
| | - Chaocheng Guo
- Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China; Shanghai Collaborative Innovation Center of Agri-Seeds/School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Jiahao Chen
- Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China; Shanghai Collaborative Innovation Center of Agri-Seeds/School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Qi Wu
- College of Life Science, Sichuan Agricultural University, Ya'an 625014, China
| | - Xiyin Wang
- School of Life Science, North China University of Science and Technology, Tangshan 063210, China.
| | - Hui Chen
- College of Life Science, Sichuan Agricultural University, Ya'an 625014, China.
| |
Collapse
|
3
|
Liu M, Guo C, Xie K, Chen K, Chen J, Wang Y, Wang X. A cross-species co-functional gene network underlying leaf senescence. HORTICULTURE RESEARCH 2022; 10:uhac251. [PMID: 36643763 PMCID: PMC9832971 DOI: 10.1093/hr/uhac251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 11/08/2022] [Indexed: 06/17/2023]
Abstract
The complex leaf senescence process is governed by various levels of transcriptional and translational regulation. Several features of the leaf senescence process are similar across species, yet the extent to which the molecular mechanisms underlying the process of leaf senescence are conserved remains unclear. Currently used experimental approaches permit the identification of individual pathways that regulate various physiological and biochemical processes; however, the large-scale regulatory network underpinning intricate processes like leaf senescence cannot be built using these methods. Here, we discovered a series of conserved genes involved in leaf senescence in a common horticultural crop (Solanum lycopersicum), a monocot plant (Oryza sativa), and a eudicot plant (Arabidopsis thaliana) through analyses of the evolutionary relationships and expression patterns among genes. Our analyses revealed that the genetic basis of leaf senescence is largely conserved across species. We also created a multi-omics workflow using data from more than 10 000 samples from 85 projects and constructed a leaf senescence-associated co-functional gene network with 2769 conserved, high-confidence functions. Furthermore, we found that the mitochondrial unfolded protein response (UPRmt) is the central biological process underlying leaf senescence. Specifically, UPRmt responds to leaf senescence by maintaining mitostasis through a few cross-species conserved transcription factors (e.g. NAC13) and metabolites (e.g. ornithine). The co-functional network built in our study indicates that UPRmt figures prominently in cross-species conserved mechanisms. Generally, the results of our study provide new insights that will aid future studies of leaf senescence.
Collapse
Affiliation(s)
- Moyang Liu
- Shanghai Collaborative Innovation Center of Agri-Seeds/School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
- Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Chaocheng Guo
- Shanghai Collaborative Innovation Center of Agri-Seeds/School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
- Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Kexuan Xie
- Shanghai Collaborative Innovation Center of Agri-Seeds/School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
- Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Kai Chen
- Shanghai Collaborative Innovation Center of Agri-Seeds/School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
- Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Jiahao Chen
- Shanghai Collaborative Innovation Center of Agri-Seeds/School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
- Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | | | - Xu Wang
- Corresponding author. E-mail: ,
| |
Collapse
|
4
|
Obayashi T, Kodate S, Hibara H, Kagaya Y, Kinoshita K. COXPRESdb v8: an animal gene coexpression database navigating from a global view to detailed investigations. Nucleic Acids Res 2022; 51:D80-D87. [PMID: 36350658 PMCID: PMC9825429 DOI: 10.1093/nar/gkac983] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 10/12/2022] [Accepted: 10/15/2022] [Indexed: 11/10/2022] Open
Abstract
Gene coexpression is synchronization of gene expression across many cellular and environmental conditions and is widely used to infer the biological function of genes. Gene coexpression information is complex, comprising a complete graph of all genes in the genome, and requires appropriate visualization and analysis tools. Since its initial release in 2007, the animal gene expression database COXPRESdb (https://coxpresdb.jp) has been continuously improved by adding new gene coexpression data and analysis tools. Here, we report COXPRESdb version 8, which has been enhanced with new features for an overview, summary, and individual examination of coexpression relationships: CoexMap to display coexpression on a genome scale, pathway enrichment analysis to summarize the function of coexpressed genes, and CoexPub to bridges coexpression and existing knowledge. COXPRESdb also facilitates downstream analyses such as interspecies comparisons by integrating RNAseq and microarray coexpression data in a union-type gene coexpression. COXPRESdb strongly support users with the new coexpression data and enhanced functionality.
Collapse
Affiliation(s)
- Takeshi Obayashi
- To whom correspondence should be addressed. Tel: +81 22 795 4741; Fax: +81 22 795 4765;
| | - Shun Kodate
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, 980-8573, Japan
| | - Himiko Hibara
- Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8679, Japan
| | - Yuki Kagaya
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
| | | |
Collapse
|
5
|
Ślęczkowska M, Almomani R, Marchi M, de Greef BTA, Sopacua M, Hoeijmakers JGJ, Lindsey P, Salvi E, Bönhof GJ, Ziegler D, Malik RA, Waxman SG, Lauria G, Faber CG, Smeets HJM, Gerrits MM. Peripheral Ion Channel Gene Screening in Painful- and Painless-Diabetic Neuropathy. Int J Mol Sci 2022; 23:ijms23137190. [PMID: 35806193 PMCID: PMC9266298 DOI: 10.3390/ijms23137190] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 06/21/2022] [Accepted: 06/23/2022] [Indexed: 02/01/2023] Open
Abstract
Neuropathic pain is common in diabetic peripheral neuropathy (DN), probably caused by pathogenic ion channel gene variants. Therefore, we performed molecular inversion probes-next generation sequencing of 5 transient receptor potential cation channels, 8 potassium channels and 2 calcium-activated chloride channel genes in 222 painful- and 304 painless-DN patients. Twelve painful-DN (5.4%) patients showed potentially pathogenic variants (five nonsense/frameshift, seven missense, one out-of-frame deletion) in ANO3 (n = 3), HCN1 (n = 1), KCNK18 (n = 2), TRPA1 (n = 3), TRPM8 (n = 3) and TRPV4 (n = 1) and fourteen painless-DN patients (4.6%-three nonsense/frameshift, nine missense, one out-of-frame deletion) in ANO1 (n = 1), KCNK18 (n = 3), KCNQ3 (n = 1), TRPA1 (n = 2), TRPM8 (n = 1), TRPV1 (n = 3) and TRPV4 (n = 3). Missense variants were present in both conditions, presumably with loss- or gain-of-functions. KCNK18 nonsense/frameshift variants were found in painless/painful-DN, making a causal role in pain less likely. Surprisingly, premature stop-codons with likely nonsense-mediated RNA-decay were more frequent in painful-DN. Although limited in number, painful-DN patients with ion channel gene variants reported higher maximal pain during the night and day. Moreover, painful-DN patients with TRP variants had abnormal thermal thresholds and more severe pain during the night and day. Our results suggest a role of ion channel gene variants in neuropathic pain, but functional validation is required.
Collapse
Affiliation(s)
- Milena Ślęczkowska
- Department of Toxicogenomics, Maastricht University, 6229 ER Maastricht, The Netherlands; (R.A.); (P.L.); (H.J.M.S.)
- School of Mental Health and Neuroscience, Maastricht University, 6229 ER Maastricht, The Netherlands
- Correspondence:
| | - Rowida Almomani
- Department of Toxicogenomics, Maastricht University, 6229 ER Maastricht, The Netherlands; (R.A.); (P.L.); (H.J.M.S.)
- Department of Neurology, Maastricht University Medical Centre, 6229 HX Maastricht, The Netherlands; (B.T.A.d.G.); (M.S.); (J.G.J.H.); (C.G.F.)
- Department of Medical Laboratory Sciences, Jordan University of Science and Technology, Irbid 22110, Jordan
| | - Margherita Marchi
- Neuroalgology Unit, IRCCS Foundation “Carlo Besta” Neurological Institute, 20133 Milan, Italy; (M.M.); (E.S.); (G.L.)
| | - Bianca T. A. de Greef
- Department of Neurology, Maastricht University Medical Centre, 6229 HX Maastricht, The Netherlands; (B.T.A.d.G.); (M.S.); (J.G.J.H.); (C.G.F.)
| | - Maurice Sopacua
- Department of Neurology, Maastricht University Medical Centre, 6229 HX Maastricht, The Netherlands; (B.T.A.d.G.); (M.S.); (J.G.J.H.); (C.G.F.)
| | - Janneke G. J. Hoeijmakers
- Department of Neurology, Maastricht University Medical Centre, 6229 HX Maastricht, The Netherlands; (B.T.A.d.G.); (M.S.); (J.G.J.H.); (C.G.F.)
| | - Patrick Lindsey
- Department of Toxicogenomics, Maastricht University, 6229 ER Maastricht, The Netherlands; (R.A.); (P.L.); (H.J.M.S.)
| | - Erika Salvi
- Neuroalgology Unit, IRCCS Foundation “Carlo Besta” Neurological Institute, 20133 Milan, Italy; (M.M.); (E.S.); (G.L.)
| | - Gidon J. Bönhof
- Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany; (G.J.B.); (D.Z.)
- Department of Endocrinology and Diabetology, Medical Faculty, University Hospital Düsseldorf, Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | - Dan Ziegler
- Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany; (G.J.B.); (D.Z.)
| | - Rayaz A. Malik
- Division of Cardiovascular Sciences, University of Manchester, Manchester M13 9PL, UK;
- Department of Medicine, Weill Cornell Medicine-Qatar, Doha P.O. Box 24144, Qatar
| | - Stephen G. Waxman
- Department of Neurology and Center for Neuroscience and Regeneration Research, Yale University School of Medicine, New Haven, CT 06510, USA;
- Center for Neuroscience and Regeneration Research, Veterans Affairs Medical Center, West Haven, CT 06516, USA
| | - Giuseppe Lauria
- Neuroalgology Unit, IRCCS Foundation “Carlo Besta” Neurological Institute, 20133 Milan, Italy; (M.M.); (E.S.); (G.L.)
| | - Catharina G. Faber
- Department of Neurology, Maastricht University Medical Centre, 6229 HX Maastricht, The Netherlands; (B.T.A.d.G.); (M.S.); (J.G.J.H.); (C.G.F.)
| | - Hubert J. M. Smeets
- Department of Toxicogenomics, Maastricht University, 6229 ER Maastricht, The Netherlands; (R.A.); (P.L.); (H.J.M.S.)
- School of Mental Health and Neuroscience, Maastricht University, 6229 ER Maastricht, The Netherlands
| | - Monique M. Gerrits
- Department of Clinical Genetics, Maastricht University Medical Centre, 6229 HX Maastricht, The Netherlands;
| |
Collapse
|
6
|
Obayashi T, Hibara H, Kagaya Y, Aoki Y, Kinoshita K. ATTED-II v11: A Plant Gene Coexpression Database Using a Sample Balancing Technique by Subagging of Principal Components. PLANT & CELL PHYSIOLOGY 2022; 63:869-881. [PMID: 35353884 DOI: 10.1093/pcp/pcac041] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Revised: 02/06/2022] [Accepted: 03/29/2022] [Indexed: 05/25/2023]
Abstract
ATTED-II (https://atted.jp) is a gene coexpression database for nine plant species based on publicly available RNAseq and microarray data. One of the challenges in constructing condition-independent coexpression data based on publicly available gene expression data is managing the inherent sampling bias. Here, we report ATTED-II version 11, wherein we adopted a coexpression calculation methodology to balance the samples using principal component analysis and ensemble calculation. This approach has two advantages. First, omitting principal components with low contribution rates reduces the main contributors of noise. Second, balancing large differences in contribution rates enables considering various sample conditions entirely. In addition, based on RNAseq- and microarray-based coexpression data, we provide species-representative, integrated coexpression information to enhance the efficiency of interspecies comparison of the coexpression data. These coexpression data are provided as a standardized z-score to facilitate integrated analysis with different data sources. We believe that with these improvements, ATTED-II is more valuable and powerful for supporting interspecies comparative studies and integrated analyses using heterogeneous data.
Collapse
Affiliation(s)
- Takeshi Obayashi
- Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8679 Japan
| | - Himiko Hibara
- Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8679 Japan
| | - Yuki Kagaya
- Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8679 Japan
| | - Yuichi Aoki
- Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8679 Japan
- Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, 980-8573 Japan
| | - Kengo Kinoshita
- Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8679 Japan
- Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, 980-8573 Japan
- Institute of Development, Aging, and Cancer, Tohoku University, 4-1 Seiryo-machi, Aoba-ku, Sendai, 980-8575 Japan
| |
Collapse
|
7
|
Peng M, de Vries RP. Machine learning prediction of novel pectinolytic enzymes in Aspergillus niger through integrating heterogeneous (post-) genomics data. Microb Genom 2021; 7. [PMID: 34874247 PMCID: PMC8767319 DOI: 10.1099/mgen.0.000674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Pectinolytic enzymes are a variety of enzymes involved in breaking down pectin, a complex and abundant plant cell-wall polysaccharide. In nature, pectinolytic enzymes play an essential role in allowing bacteria and fungi to depolymerize and utilize pectin. In addition, pectinases have been widely applied in various industries, such as the food, wine, textile, paper and pulp industries. Due to their important biological function and increasing industrial potential, discovery of novel pectinolytic enzymes has received global interest. However, traditional enzyme characterization relies heavily on biochemical experiments, which are time consuming, laborious and expensive. To accelerate identification of novel pectinolytic enzymes, an automatic approach is needed. We developed a machine learning (ML) approach for predicting pectinases in the industrial workhorse fungus, Aspergillus niger. The prediction integrated a diverse range of features, including evolutionary profile, gene expression, transcriptional regulation and biochemical characteristics. Results on both the training and the independent testing dataset showed that our method achieved over 90 % accuracy, and recalled over 60 % of pectinolytic genes. Application of the ML model on the A. niger genome led to the identification of 83 pectinases, covering both previously described pectinases and novel pectinases that do not belong to any known pectinolytic enzyme family. Our study demonstrated the tremendous potential of ML in discovery of new industrial enzymes through integrating heterogeneous (post-) genomimcs data.
Collapse
Affiliation(s)
- Mao Peng
- Fungal Physiology, Westerdijk Fungal Biodiversity Institute, & Fungal Molecular Physiology, Utrecht University, Utrecht, The Netherlands
- *Correspondence: Mao Peng,
| | - Ronald P. de Vries
- Fungal Physiology, Westerdijk Fungal Biodiversity Institute, & Fungal Molecular Physiology, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
8
|
van Esveld SL, Meerstein‐Kessel L, Boshoven C, Baaij JF, Barylyuk K, Coolen JPM, van Strien J, Duim RAJ, Dutilh BE, Garza DR, Letterie M, Proellochs NI, de Ridder MN, Venkatasubramanian PB, de Vries LE, Waller RF, Kooij TWA, Huynen MA. A Prioritized and Validated Resource of Mitochondrial Proteins in Plasmodium Identifies Unique Biology. mSphere 2021; 6:e0061421. [PMID: 34494883 PMCID: PMC8550323 DOI: 10.1128/msphere.00614-21] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 08/23/2021] [Indexed: 11/20/2022] Open
Abstract
Plasmodium species have a single mitochondrion that is essential for their survival and has been successfully targeted by antimalarial drugs. Most mitochondrial proteins are imported into this organelle, and our picture of the Plasmodium mitochondrial proteome remains incomplete. Many data sources contain information about mitochondrial localization, including proteome and gene expression profiles, orthology to mitochondrial proteins from other species, coevolutionary relationships, and amino acid sequences, each with different coverage and reliability. To obtain a comprehensive, prioritized list of Plasmodium falciparum mitochondrial proteins, we rigorously analyzed and integrated eight data sets using Bayesian statistics into a predictive score per protein for mitochondrial localization. At a corrected false discovery rate of 25%, we identified 445 proteins with a sensitivity of 87% and a specificity of 97%. They include proteins that have not been identified as mitochondrial in other eukaryotes but have characterized homologs in bacteria that are involved in metabolism or translation. Mitochondrial localization of seven Plasmodium berghei orthologs was confirmed by epitope labeling and colocalization with a mitochondrial marker protein. One of these belongs to a newly identified apicomplexan mitochondrial protein family that in P. falciparum has four members. With the experimentally validated mitochondrial proteins and the complete ranked P. falciparum proteome, which we have named PlasmoMitoCarta, we present a resource to study unique proteins of Plasmodium mitochondria. IMPORTANCE The unique biology and medical relevance of the mitochondrion of the malaria parasite Plasmodium falciparum have made it the subject of many studies. However, we actually do not have a comprehensive assessment of which proteins reside in this organelle. Many omics data are available that are predictive of mitochondrial localization, such as proteomics data and expression data. Individual data sets are, however, rarely complete and can provide conflicting evidence. We integrated a wide variety of available omics data in a manner that exploits the relative strengths of the data sets. Our analysis gave a predictive score for the mitochondrial localization to each nuclear encoded P. falciparum protein and identified 445 likely mitochondrial proteins. We experimentally validated the mitochondrial localization of seven of the new mitochondrial proteins, confirming the quality of the complete list. These include proteins that have not been observed mitochondria before, adding unique mitochondrial functions to P. falciparum.
Collapse
Affiliation(s)
- Selma L. van Esveld
- Center for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
- Radboud Center for Mitochondrial Medicine, Radboudumc, Nijmegen, the Netherlands
| | - Lisette Meerstein‐Kessel
- Center for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
- Radboud Institute for Health Sciences, Radboudumc, Nijmegen, the Netherlands
| | - Cas Boshoven
- Department of Medical Microbiology, Radboudumc Center for Infectious Diseases, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | - Jochem F. Baaij
- Center for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | - Konstantin Barylyuk
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Jordy P. M. Coolen
- Department of Medical Microbiology, Radboudumc Center for Infectious Diseases, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | - Joeri van Strien
- Center for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | - Ronald A. J. Duim
- Center for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | - Bas E. Dutilh
- Center for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
- Theoretical Biology and Bioinformatics, Science for Life, Utrecht University, Utrecht, the Netherlands
| | - Daniel R. Garza
- Center for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
- Laboratory of Molecular Bacteriology (Rega Institute), Department of Microbiology, Immunology and Transplantation, KU Leuven, Leuven, Belgium
| | - Marijn Letterie
- Department of Medical Microbiology, Radboudumc Center for Infectious Diseases, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | - Nicholas I. Proellochs
- Department of Medical Microbiology, Radboudumc Center for Infectious Diseases, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | - Michelle N. de Ridder
- Department of Medical Microbiology, Radboudumc Center for Infectious Diseases, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | | | - Laura E. de Vries
- Department of Medical Microbiology, Radboudumc Center for Infectious Diseases, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | - Ross F. Waller
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Taco W. A. Kooij
- Department of Medical Microbiology, Radboudumc Center for Infectious Diseases, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | - Martijn A. Huynen
- Center for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
- Radboud Center for Mitochondrial Medicine, Radboudumc, Nijmegen, the Netherlands
| |
Collapse
|
9
|
Guo L, Govindaraj P, Kievit M, de Coo IFM, Gerards M, Hellebrekers DMEI, Stassen APM, Gayathri N, Taly AB, Sankaran BP, Smeets HJM. Whole exome sequencing reveals a homozygous C1QBP deletion as the cause of progressive external ophthalmoplegia and multiple mtDNA deletions. Neuromuscul Disord 2021; 31:859-864. [PMID: 34419324 DOI: 10.1016/j.nmd.2021.06.014] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Revised: 05/31/2021] [Accepted: 06/28/2021] [Indexed: 11/16/2022]
Abstract
Whole exome sequencing (WES), analyzed with GENESIS and WeGET, revealed a homozygous deletion in the C1QBP gene in a patient with progressive external ophthalmoplegia (PEO) and multiple mtDNA deletions. The gene encodes the mitochondria-located complementary 1 Q subcomponent-binding protein, involved in mitochondrial homeostasis. Biallelic mutations in C1QBP cause mitochondrial cardiomyopathy and/or PEO with variable age of onset. Our patient showed only late-onset PEO-plus syndrome without overt cardiac involvement. Available data suggest that early-onset cardiomyopathy variants localize in important structural domains and PEO-plus variants in the coiled-coil region. Our patient demonstrates that C1QBP mutations should be considered in individuals with PEO with or without cardiomyopathy.
Collapse
Affiliation(s)
- Le Guo
- School for Mental Health and Neuroscience (MHeNS), Maastricht University, Maastricht, the Netherlands; Department of Toxicogenomics, Clinical Genomics Unit, Maastricht University, Maastricht, the Netherlands
| | - Periyasamy Govindaraj
- Department of Neuropathology, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India; Neuromuscular Laboratory, Neurobiology Research Center, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India; Center for DNA Fingerprinting and Diagnostics (CDFD), Hyderabad, India
| | - Mariëlle Kievit
- School for Mental Health and Neuroscience (MHeNS), Maastricht University, Maastricht, the Netherlands
| | - Irenaeus F M de Coo
- School for Mental Health and Neuroscience (MHeNS), Maastricht University, Maastricht, the Netherlands; Department of Toxicogenomics, Clinical Genomics Unit, Maastricht University, Maastricht, the Netherlands
| | - Mike Gerards
- Maastricht Center for Systems Biology (MacsBio), Maastricht University, Maastricht, the Netherlands
| | - Debby M E I Hellebrekers
- Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, the Netherlands
| | - Alphons P M Stassen
- Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, the Netherlands
| | - Narayanappa Gayathri
- Department of Neuropathology, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India; Neuromuscular Laboratory, Neurobiology Research Center, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India
| | - Arun B Taly
- Neuromuscular Laboratory, Neurobiology Research Center, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India; Department of Neurology, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India
| | - Bindu Parayil Sankaran
- The Faculty of Medicine and Health, The Children's Hospital at Westmead Clinical School, Sydney Medical School, The University of Sydney, NSW, Australia
| | - Hubert J M Smeets
- School for Mental Health and Neuroscience (MHeNS), Maastricht University, Maastricht, the Netherlands; Department of Toxicogenomics, Clinical Genomics Unit, Maastricht University, Maastricht, the Netherlands; School for Oncology and Developmental Biology (GROW), Maastricht University, Maastricht, the Netherlands.
| |
Collapse
|
10
|
Maind A, Raut S. Mining conditions specific hub genes from RNA-Seq gene-expression data via biclustering and their application to drug discovery. IET Syst Biol 2020; 13:194-203. [PMID: 31318337 PMCID: PMC8687431 DOI: 10.1049/iet-syb.2018.5058] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Gene‐expression data is being widely used for various clinical research. It represents expression levels of thousands of genes across the various experimental conditions simultaneously. Mining conditions specific hub genes from gene‐expression data is a challenging task. Conditions specific hub genes signify the functional behaviour of bicluster across the subset of conditions and can act as prognostic or diagnostic markers of the diseases. In this study, the authors have introduced a new approach for identifying conditions specific hub genes from the RNA‐Seq data using a biclustering algorithm. In the proposed approach, efficient ‘runibic’ biclustering algorithm, the concept of gene co‐expression network and concept of protein–protein interaction network have been used for getting better performance. The result shows that the proposed approach extracts biologically significant conditions specific hub genes which play an important role in various biological processes and pathways. These conditions specific hub genes can be used as prognostic or diagnostic biomarkers. Conditions specific hub genes will be helpful to reduce the analysis time and increase the accuracy of further research. Also, they summarised application of the proposed approach to the drug discovery process.
Collapse
Affiliation(s)
- Ankush Maind
- Department of Computer Science and Engineering, Visvesvaraya National Institute of Technology, Nagpur, Maharashtra, India.
| | - Shital Raut
- Department of Computer Science and Engineering, Visvesvaraya National Institute of Technology, Nagpur, Maharashtra, India
| |
Collapse
|
11
|
Verdonschot JAJ, Robinson EL, James KN, Mohamed MW, Claes GRF, Casas K, Vanhoutte EK, Hazebroek MR, Kringlen G, Pasierb MM, van den Wijngaard A, Glatz JFC, Heymans SRB, Krapels IPC, Nahas S, Brunner HG, Szklarczyk R. Mutations in PDLIM5 are rare in dilated cardiomyopathy but are emerging as potential disease modifiers. Mol Genet Genomic Med 2019; 8:e1049. [PMID: 31880413 PMCID: PMC7005607 DOI: 10.1002/mgg3.1049] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 10/23/2019] [Indexed: 01/15/2023] Open
Abstract
BACKGROUND A causal genetic mutation is found in 40% of families with dilated cardiomyopathy (DCM), leaving a large percentage of families genetically unsolved. This prevents adequate counseling and clear recommendations in these families. We aim to identify novel genes or modifiers associated with DCM. METHODS We performed computational ranking of human genes based on coexpression with a predefined set of genes known to be associated with DCM, which allowed us to prioritize gene candidates for their likelihood of being involved in DCM. Top candidates will be checked for variants in the available whole-exome sequencing data of 142 DCM patients. RNA was isolated from cardiac biopsies to investigate gene expression. RESULTS PDLIM5 was classified as the top candidate. An interesting heterozygous variant (189_190delinsGG) was found in a DCM patient with a known pathogenic truncating TTN-variant. The PDLIM5 loss-of-function (LoF) variant affected all cardiac-specific isoforms of PDLIM5 and no LoF variants were detected in the same region in a control cohort of 26,000 individuals. RNA expression of PDLIM5 and its direct interactors (MYOT, LDB3, and MYOZ2) was increased in cardiac tissue of this patient, indicating a possible compensatory mechanism. The PDLIM5 variant cosegregated with the TTN-variant and the phenotype, leading to a high disease penetrance in this family. A second patient was an infant with a homozygous 10 kb-deletion of exon 2 in PDLIM5 resulting in early-onset cardiac disease, showing the importance of PDLIM5 in cardiac function. CONCLUSIONS Heterozygous PDLIM5 variants are rare and therefore will not have a major contribution in DCM. Although they likely play a role in disease development as this gene plays a major role in contracting cardiomyocytes and homozygous variants lead to early-onset cardiac disease. Other environmental and/or genetic factors are probably necessary to unveil the cardiac phenotype in PDLIM5 mutation carriers.
Collapse
Affiliation(s)
- Job A J Verdonschot
- Department of Cardiology, Maastricht University Medical Centre, Maastricht, The Netherlands.,Department of Clinical Genetics, Maastricht University Medical Centre, Maastricht, The Netherlands
| | - Emma L Robinson
- Department of Cardiology, Maastricht University Medical Centre, Maastricht, The Netherlands
| | - Kiely N James
- Rady Children's Institute for Genomic Medicine, San Diego, CA, USA
| | - Mohamed W Mohamed
- Sanford Children's Hospital, Fargo, ND, USA.,North Dakota University, Fargo, ND, USA
| | - Godelieve R F Claes
- Department of Clinical Genetics, Maastricht University Medical Centre, Maastricht, The Netherlands
| | - Kari Casas
- Sanford Children's Hospital, Fargo, ND, USA.,North Dakota University, Fargo, ND, USA
| | - Els K Vanhoutte
- Department of Clinical Genetics, Maastricht University Medical Centre, Maastricht, The Netherlands
| | - Mark R Hazebroek
- Department of Cardiology, Maastricht University Medical Centre, Maastricht, The Netherlands
| | | | | | - Arthur van den Wijngaard
- Department of Clinical Genetics, Maastricht University Medical Centre, Maastricht, The Netherlands
| | - Jan F C Glatz
- Department of Clinical Genetics, Maastricht University Medical Centre, Maastricht, The Netherlands
| | - Stephane R B Heymans
- Department of Cardiology, Maastricht University Medical Centre, Maastricht, The Netherlands.,Department of Cardiovascular Research, University of Leuven, Leuven, Belgium.,Netherlands Heart Institute (ICIN), Utrecht, The Netherlands
| | - Ingrid P C Krapels
- Department of Clinical Genetics, Maastricht University Medical Centre, Maastricht, The Netherlands
| | - Shareef Nahas
- Rady Children's Institute for Genomic Medicine, San Diego, CA, USA
| | - Han G Brunner
- Department of Clinical Genetics, Maastricht University Medical Centre, Maastricht, The Netherlands.,Department of Human Genetics, Donders Center for Neuroscience, Radboudumc, Nijmegen, The Netherlands.,GROW Institute for Developmental Biology and Cancer, Maastricht University Medical Center, Maastricht, The Netherlands
| | - Radek Szklarczyk
- Department of Clinical Genetics, Maastricht University Medical Centre, Maastricht, The Netherlands
| |
Collapse
|
12
|
Li H, Rukina D, David FPA, Li TY, Oh CM, Gao AW, Katsyuba E, Bou Sleiman M, Komljenovic A, Huang Q, Williams RW, Robinson-Rechavi M, Schoonjans K, Morgenthaler S, Auwerx J. Identifying gene function and module connections by the integration of multispecies expression compendia. Genome Res 2019; 29:2034-2045. [PMID: 31754022 PMCID: PMC6886503 DOI: 10.1101/gr.251983.119] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2019] [Accepted: 10/31/2019] [Indexed: 12/13/2022]
Abstract
The functions of many eukaryotic genes are still poorly understood. Here, we developed and validated a new method, termed GeneBridge, which is based on two linked approaches to impute gene function and bridge genes with biological processes. First, Gene-Module Association Determination (G-MAD) allows the annotation of gene function. Second, Module-Module Association Determination (M-MAD) allows predicting connectivity among modules. We applied the GeneBridge tools to large-scale multispecies expression compendia—1700 data sets with over 300,000 samples from human, mouse, rat, fly, worm, and yeast—collected in this study. G-MAD identifies novel functions of genes—for example, DDT in mitochondrial respiration and WDFY4 in T cell activation—and also suggests novel components for modules, such as for cholesterol biosynthesis. By applying G-MAD on data sets from respective tissues, tissue-specific functions of genes were identified—for instance, the roles of EHHADH in liver and kidney, as well as SLC6A1 in brain and liver. Using M-MAD, we identified a list of module-module associations, such as those between mitochondria and proteasome, mitochondria and histone demethylation, as well as ribosomes and lipid biosynthesis. The GeneBridge tools together with the expression compendia are available as an open resource, which will facilitate the identification of connections linking genes, modules, phenotypes, and diseases.
Collapse
Affiliation(s)
- Hao Li
- Laboratory of Integrative Systems Physiology, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| | - Daria Rukina
- Institute of Mathematics, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| | - Fabrice P A David
- Gene Expression Core Facility, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland.,SV-IT, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Terytty Yang Li
- Laboratory of Integrative Systems Physiology, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| | - Chang-Myung Oh
- Laboratory of Integrative Systems Physiology, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| | - Arwen W Gao
- Laboratory of Integrative Systems Physiology, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| | - Elena Katsyuba
- Laboratory of Integrative Systems Physiology, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| | - Maroun Bou Sleiman
- Laboratory of Integrative Systems Physiology, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| | - Andrea Komljenovic
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland.,Department of Ecology and Evolution, University of Lausanne, Lausanne 1015, Switzerland
| | - Qingyao Huang
- Laboratory of Metabolic Signaling, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| | - Robert W Williams
- Department of Genetics, Genomics and Informatics, University of Tennessee, Memphis, Tennessee 38163, USA
| | - Marc Robinson-Rechavi
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland.,Department of Ecology and Evolution, University of Lausanne, Lausanne 1015, Switzerland
| | - Kristina Schoonjans
- Laboratory of Metabolic Signaling, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| | - Stephan Morgenthaler
- Institute of Mathematics, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| | - Johan Auwerx
- Laboratory of Integrative Systems Physiology, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| |
Collapse
|
13
|
van Esveld SL, Cansız-Arda Ş, Hensen F, van der Lee R, Huynen MA, Spelbrink JN. A Combined Mass Spectrometry and Data Integration Approach to Predict the Mitochondrial Poly(A) RNA Interacting Proteome. Front Cell Dev Biol 2019; 7:283. [PMID: 31803741 PMCID: PMC6873792 DOI: 10.3389/fcell.2019.00283] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Accepted: 11/01/2019] [Indexed: 01/03/2023] Open
Abstract
In order to synthesize the 13 oxidative phosphorylation proteins encoded by mammalian mtDNA, a large assortment of nuclear encoded proteins is required. These include mitoribosomal proteins and various RNA processing, modification and degradation enzymes. RNA crosslinking has been successfully applied to identify whole-cell poly(A) RNA-binding proteomes, but this method has not been adapted to identify mitochondrial poly(A) RNA-binding proteomes. Here we developed and compared two related methods that specifically enrich for mitochondrial poly(A) RNA-binding proteins and analyzed bound proteins using mass spectrometry. To obtain a catalog of the mitochondrial poly(A) RNA interacting proteome, we used Bayesian data integration to combine these two mitochondrial-enriched datasets as well as published whole-cell datasets of RNA-binding proteins with various online resources, such as mitochondrial localization from MitoCarta 2.0 and co-expression analyses. Our integrated analyses ranked the complete human proteome for the likelihood of mtRNA interaction. We show that at a specific, inclusive cut-off of the corrected false discovery rate (cFDR) of 69%, we improve the number of predicted proteins from 185 to 211 with our mass spectrometry data as input for the prediction instead of the published whole-cell datasets. The chosen cut-off determines the cFDR: the less proteins included, the lower the cFDR will be. For the top 100 proteins, inclusion of our data instead of the published whole-cell datasets improve the cFDR from 54% to 31%. We show that the mass spectrometry method most specific for mitochondrial RNA-binding proteins involves ex vivo 4-thiouridine labeling followed by mitochondrial isolation with subsequent in organello UV-crosslinking.
Collapse
Affiliation(s)
- Selma L. van Esveld
- Radboud Center for Mitochondrial Medicine, Center for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Centre, Nijmegen, Netherlands
| | - Şirin Cansız-Arda
- Department of Pediatrics, Radboud Center for Mitochondrial Medicine, Radboud University Medical Centre, Nijmegen, Netherlands
| | - Fenna Hensen
- Department of Pediatrics, Radboud Center for Mitochondrial Medicine, Radboud University Medical Centre, Nijmegen, Netherlands
| | - Robin van der Lee
- Radboud Center for Mitochondrial Medicine, Center for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Centre, Nijmegen, Netherlands
| | - Martijn A. Huynen
- Radboud Center for Mitochondrial Medicine, Center for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Centre, Nijmegen, Netherlands
| | - Johannes N. Spelbrink
- Department of Pediatrics, Radboud Center for Mitochondrial Medicine, Radboud University Medical Centre, Nijmegen, Netherlands
| |
Collapse
|
14
|
Maind A, Raut S. COSCEB: Comprehensive search for column-coherent evolution biclusters and its application to hub gene identification. J Biosci 2019; 44:48. [PMID: 31180061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Biclustering is an increasingly used data mining technique for searching groups of co-expressed genes across the subset of experimental conditions from the gene-expression data. The group of co-expressed genes is present in the form of various patterns called a bicluster. A bicluster provides significant insights related to the functionality of genes and plays an important role in various clinical applications such as drug discovery, biomarker discovery, gene network analysis, gene identification, disease diagnosis, pathway analysis etc. This paper presents a novel unsupervised approach 'COmprehensive Search for Column-Coherent Evolution Biclusters (COSCEB)' for a comprehensive search of biologically significant column-coherent evolution biclusters. The concept of column subspace extraction from each gene pair and Longest Common Contiguous Subsequence (LCCS) is employed to identify significant biclusters. The experiments have been performed on both synthetic as well as real datasets. The performance of COSCEB is evaluated with the help of key issues. The issues are comprehensive search, Deep OPSM bicluster, bicluster types, bicluster accuracy, bicluster size, noise, overlapping, output nature, computational complexity and biologically significant biclusters. The performance of COSCEB is compared with six all-time famous biclustering algorithms SAMBA, OPSM, xMotif, Bimax, Deep OPSM- and UniBic. The result shows that the proposed approach performs effectively on most of the issues and extracts all possible biologically significant column-coherent evolution biclusters which are far more than other biclustering algorithms. Along with the proposed approach, we have also presented the case study which shows the application of significant biclusters for hub gene identification.
Collapse
Affiliation(s)
- Ankush Maind
- Department of Computer Science and Engineering, Visvesvaraya National Institute of Technology, Nagpur, Maharashtra 440 010, India
| | | |
Collapse
|
15
|
COSCEB: Comprehensive search for column-coherent evolution biclusters and its application to hub gene identification. J Biosci 2019. [DOI: 10.1007/s12038-019-9862-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
16
|
CiliaCarta: An integrated and validated compendium of ciliary genes. PLoS One 2019; 14:e0216705. [PMID: 31095607 PMCID: PMC6522010 DOI: 10.1371/journal.pone.0216705] [Citation(s) in RCA: 78] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Accepted: 04/26/2019] [Indexed: 12/25/2022] Open
Abstract
The cilium is an essential organelle at the surface of mammalian cells whose dysfunction causes a wide range of genetic diseases collectively called ciliopathies. The current rate at which new ciliopathy genes are identified suggests that many ciliary components remain undiscovered. We generated and rigorously analyzed genomic, proteomic, transcriptomic and evolutionary data and systematically integrated these using Bayesian statistics into a predictive score for ciliary function. This resulted in 285 candidate ciliary genes. We generated independent experimental evidence of ciliary associations for 24 out of 36 analyzed candidate proteins using multiple cell and animal model systems (mouse, zebrafish and nematode) and techniques. For example, we show that OSCP1, which has previously been implicated in two distinct non-ciliary processes, causes ciliogenic and ciliopathy-associated tissue phenotypes when depleted in zebrafish. The candidate list forms the basis of CiliaCarta, a comprehensive ciliary compendium covering 956 genes. The resource can be used to objectively prioritize candidate genes in whole exome or genome sequencing of ciliopathy patients and can be accessed at http://bioinformatics.bio.uu.nl/john/syscilia/ciliacarta/.
Collapse
|
17
|
Dhifli W, Puig J, Dispot A, Elati M. Latent network-based representations for large-scale gene expression data analysis. BMC Bioinformatics 2019; 19:466. [PMID: 30717663 PMCID: PMC7394327 DOI: 10.1186/s12859-018-2481-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2018] [Accepted: 11/09/2018] [Indexed: 12/12/2022] Open
Abstract
Background With the recent advancements in high-throughput experimental procedures, biologists are gathering huge quantities of data. A main priority in bioinformatics and computational biology is to provide system level analytical tools capable of meeting an ever-growing production of high-throughput biological data while taking into account its biological context. In gene expression data analysis, genes have widely been considered as independent components. However, a systemic view shows that they act synergistically in living cells, forming functional complexes and more generally a biological system. Results In this paper, we propose LatNet, a signal transformation framework that, starting from an initial large-scale gene expression data, allows to generate new representations based on latent network-based relationships between the genes. LatNet aims to leverage system level relations between the genes as an underlying hidden structure to derive the new transformed latent signals. We present a concrete implementation of our framework, based on a gene regulatory network structure and two signal transformation approaches, to quantify latent network-based activity of regulators, as well as gene perturbation signals. The new gene/regulator signals are at the level of each sample of the input data and, thus, could directly be used instead of the initial expression signals for major bioinformatics analysis, including diagnosis and personalized medicine. Conclusion Multiple patterns could be hidden or weakly observed in expression data. LatNet helps in uncovering latent signals that could emphasize hidden patterns based on the relations between the genes and, thus, enhancing the performance of gene expression-based analysis algorithms. We use LatNet for the analysis of real-world gene expression data of bladder cancer and we show the efficiency of our transformation framework as compared to using the initial expression data.
Collapse
Affiliation(s)
- Wajdi Dhifli
- University of Lille, 42, rue Paul Duez, Lille, 59000, France
| | - Julia Puig
- University of Lille, 42, rue Paul Duez, Lille, 59000, France
| | - Aurélien Dispot
- University of Lille, 42, rue Paul Duez, Lille, 59000, France
| | - Mohamed Elati
- University of Lille, 42, rue Paul Duez, Lille, 59000, France. .,UMR 8030 ; Génomique Métabolique / Laboratoire iSSB ; CEA-CNRS-UEVE, Genopole campus 1, 5 rue Henri Desbruères, Évry, 91030 Cedex, France.
| |
Collapse
|
18
|
Régnier M, Gourbeyre P, Pinton P, Napper S, Laffite J, Cossalter AM, Bailly JD, Lippi Y, Bertrand-Michel J, Bracarense APFRL, Guillou H, Loiseau N, Oswald IP. Identification of Signaling Pathways Targeted by the Food Contaminant FB1: Transcriptome and Kinome Analysis of Samples from Pig Liver and Intestine. Mol Nutr Food Res 2017; 61. [PMID: 28875582 DOI: 10.1002/mnfr.201700433] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Revised: 07/20/2017] [Indexed: 11/11/2022]
Abstract
SCOPE Fumonisin B1 (FB1) is a mycotoxin produced by Fusarium species. In mammals, this toxin causes widespread organ-specific damage; it promotes hepatotoxicity, is immunotoxic, alters intestinal functions etc. Despite its inhibitory effect on de novo ceramide synthesis, its molecular mechanism of action and toxicity is not totally elucidated. METHODS AND RESULTS To explore the mechanism of FB1 toxicity, we analyzed the transcriptome and the kinome of two organs targeted by FB1: the liver and the jejunum. Pigs were fed for 4 weeks a control diet or a FB1-contaminated diet (10 mg/kg). As expected, FB1-exposed pigs gained less weight and displayed a higher sphinganine/sphingosine ratio. Comparison of the transcriptomes and the kinomes of treated versus control pigs showed striking differences. Among the disrupted pathways in liver and jejunum, we highlight Protein Kinase B (AKT) / Phosphatase and tensin homolog (PTEN) at the intersection of the FB1-modulated pathways. CONCLUSION Most of the effects of FB1 are mediated by the regulation of ceramide level, which influences protein phosphatase 2 (PP2A) and the phosphoinositide 3-kinase (PI3K)/AKT signaling pathway. This pathway might be a new target to counteract toxic effect of Fumonisin B1, which is one of the most spread food contaminant in the world.
Collapse
Affiliation(s)
- Marion Régnier
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, Toulouse, France
| | - Pascal Gourbeyre
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, Toulouse, France
| | - Philippe Pinton
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, Toulouse, France
| | - Scott Napper
- Vaccine and Infectious Disease Organization - International Vaccine Center, University of Saskatchewan, Saskatoon, Saskatchewan, Canada.,Department of Biochemistry, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Joëlle Laffite
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, Toulouse, France
| | - Anne-Marie Cossalter
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, Toulouse, France
| | - Jean-Denis Bailly
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, Toulouse, France
| | - Yannick Lippi
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, Toulouse, France
| | - Justine Bertrand-Michel
- MetaToul-Lipidomic Facility-MetaboHUB, INSERM UMR1048, Institute of Cardiovascular and Metabolic Diseases, Université Paul Sabatier-Toulouse III, Toulouse, France
| | - Ana Paula F R L Bracarense
- Universidade Estadual de Londrina, Laboratory of Animal Pathology, Campus Universitário, Londrina, Paraná, Brazil
| | - Hervé Guillou
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, Toulouse, France
| | - Nicolas Loiseau
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, Toulouse, France
| | - Isabelle P Oswald
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, Toulouse, France
| |
Collapse
|
19
|
Li Y, Jourdain AA, Calvo SE, Liu JS, Mootha VK. CLIC, a tool for expanding biological pathways based on co-expression across thousands of datasets. PLoS Comput Biol 2017; 13:e1005653. [PMID: 28719601 PMCID: PMC5546725 DOI: 10.1371/journal.pcbi.1005653] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2016] [Revised: 08/07/2017] [Accepted: 06/21/2017] [Indexed: 12/31/2022] Open
Abstract
In recent years, there has been a huge rise in the number of publicly available transcriptional profiling datasets. These massive compendia comprise billions of measurements and provide a special opportunity to predict the function of unstudied genes based on co-expression to well-studied pathways. Such analyses can be very challenging, however, since biological pathways are modular and may exhibit co-expression only in specific contexts. To overcome these challenges we introduce CLIC, CLustering by Inferred Co-expression. CLIC accepts as input a pathway consisting of two or more genes. It then uses a Bayesian partition model to simultaneously partition the input gene set into coherent co-expressed modules (CEMs), while assigning the posterior probability for each dataset in support of each CEM. CLIC then expands each CEM by scanning the transcriptome for additional co-expressed genes, quantified by an integrated log-likelihood ratio (LLR) score weighted for each dataset. As a byproduct, CLIC automatically learns the conditions (datasets) within which a CEM is operative. We implemented CLIC using a compendium of 1774 mouse microarray datasets (28628 microarrays) or 1887 human microarray datasets (45158 microarrays). CLIC analysis reveals that of 910 canonical biological pathways, 30% consist of strongly co-expressed gene modules for which new members are predicted. For example, CLIC predicts a functional connection between protein C7orf55 (FMC1) and the mitochondrial ATP synthase complex that we have experimentally validated. CLIC is freely available at www.gene-clic.org. We anticipate that CLIC will be valuable both for revealing new components of biological pathways as well as the conditions in which they are active.
Collapse
Affiliation(s)
- Yang Li
- Howard Hughes Medical Institute and Department of Molecular Biology and the Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, United States of America and Department of Systems Biology, Harvard Medical School, Boston, MA United States of America
- Department of Statistics, Harvard University, Cambridge, MA, United States of America
| | - Alexis A. Jourdain
- Howard Hughes Medical Institute and Department of Molecular Biology and the Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, United States of America and Department of Systems Biology, Harvard Medical School, Boston, MA United States of America
- Broad Institute, Cambridge, MA, United States of America
| | - Sarah E. Calvo
- Howard Hughes Medical Institute and Department of Molecular Biology and the Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, United States of America and Department of Systems Biology, Harvard Medical School, Boston, MA United States of America
- Broad Institute, Cambridge, MA, United States of America
- * E-mail: (SEC); (JSL); (VKM)
| | - Jun S. Liu
- Department of Statistics, Harvard University, Cambridge, MA, United States of America
- * E-mail: (SEC); (JSL); (VKM)
| | - Vamsi K. Mootha
- Howard Hughes Medical Institute and Department of Molecular Biology and the Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, United States of America and Department of Systems Biology, Harvard Medical School, Boston, MA United States of America
- Broad Institute, Cambridge, MA, United States of America
- * E-mail: (SEC); (JSL); (VKM)
| |
Collapse
|
20
|
Zhang X, Chu Q, Guo G, Dong G, Li X, Zhang Q, Zhang S, Zhang Z, Wang Y. Genome-wide association studies identified multiple genetic loci for body size at four growth stages in Chinese Holstein cattle. PLoS One 2017; 12:e0175971. [PMID: 28426785 PMCID: PMC5398616 DOI: 10.1371/journal.pone.0175971] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2017] [Accepted: 04/03/2017] [Indexed: 12/14/2022] Open
Abstract
The growth and maturity of cattle body size affect not only feed efficiency, but also productivity and longevity. Dissecting the genetic architecture of body size is critical for cattle breeding to improve both efficiency and productivity. The volume and weight of body size are indicated by several measurements. Among them, Heart Girth (HG) and Hip Height (HH) are the most important traits. They are widely used as predictors of body weight (BW). Few association studies have been conducted for HG and HH in cattle focusing on single growth stage. In this study, we extended the Genome-wide association studies to a full spectrum of four growth stages (6-, 12-, 18-, and 24-months after birth) in Chinese Holstein heifers. The whole genomic single nucleotide polymorphisms (SNPs) were obtained from the Illumina BovineSNP50 v2 BeadChip genotyped on 3,325 individuals. Estimated breeding values (EBVs) were derived for both HG and HH at the four different ages and analyzed separately for GWAS by using the Fixed and random model Circuitous Probability Unification (FarmCPU) method. In total, 27 SNPs were identified to be significantly associated with HG and HH at different growth stages. We found 66 candidate genes located nearby the associated SNPs, including nine genes that were known as highly related to development and skeletal and muscular growth. In addition, biological function analysis was performed by Ingenuity Pathway Analysis and an interaction network related to development was obtained, which contained 16 genes out of the 66 candidates. The set of putative genes provided valuable resources and can help elucidate the genomic architecture and mechanisms underlying growth traits in dairy cattle.
Collapse
Affiliation(s)
- Xu Zhang
- Key Laboratory of Agricultural Animal Genetics and Breeding, National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, P.R. China
- Department of Crop and Soil Sciences, Washington State University, Pullman, Washington, United States of America
| | - Qin Chu
- Institute of Animal Husbandry and Veterinary Medicine, Beijing Academy of Agriculture and Forestry Sciences, Beijing, P.R. China
| | - Gang Guo
- Beijing Sunlon Livestock Development Co. Ltd, Beijing, P.R. China
| | - Ganghui Dong
- Beijing Sunlon Livestock Development Co. Ltd, Beijing, P.R. China
| | - Xizhi Li
- Beijing Sunlon Livestock Development Co. Ltd, Beijing, P.R. China
| | - Qin Zhang
- Key Laboratory of Agricultural Animal Genetics and Breeding, National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, P.R. China
| | - Shengli Zhang
- Key Laboratory of Agricultural Animal Genetics and Breeding, National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, P.R. China
| | - Zhiwu Zhang
- Department of Crop and Soil Sciences, Washington State University, Pullman, Washington, United States of America
| | - Yachun Wang
- Key Laboratory of Agricultural Animal Genetics and Breeding, National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, P.R. China
| |
Collapse
|
21
|
Crow M, Paul A, Ballouz S, Huang ZJ, Gillis J. Exploiting single-cell expression to characterize co-expression replicability. Genome Biol 2016; 17:101. [PMID: 27165153 PMCID: PMC4862082 DOI: 10.1186/s13059-016-0964-6] [Citation(s) in RCA: 59] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Accepted: 04/25/2016] [Indexed: 01/25/2023] Open
Abstract
Background Co-expression networks have been a useful tool for functional genomics, providing important clues about the cellular and biochemical mechanisms that are active in normal and disease processes. However, co-expression analysis is often treated as a black box with results being hard to trace to their basis in the data. Here, we use both published and novel single-cell RNA sequencing (RNA-seq) data to understand fundamental drivers of gene-gene connectivity and replicability in co-expression networks. Results We perform the first major analysis of single-cell co-expression, sampling from 31 individual studies. Using neighbor voting in cross-validation, we find that single-cell network connectivity is less likely to overlap with known functions than co-expression derived from bulk data, with functional variation within cell types strongly resembling that also occurring across cell types. To identify features and analysis practices that contribute to this connectivity, we perform our own single-cell RNA-seq experiment of 126 cortical interneurons in an experimental design targeted to co-expression. By assessing network replicability, semantic similarity and overall functional connectivity, we identify technical factors influencing co-expression and suggest how they can be controlled for. Many of the technical effects we identify are expression-level dependent, making expression level itself highly predictive of network topology. We show this occurs generally through re-analysis of the BrainSpan RNA-seq data. Conclusions Technical properties of single-cell RNA-seq data create confounds in co-expression networks which can be identified and explicitly controlled for in any supervised analysis. This is useful both in improving co-expression performance and in characterizing single-cell data in generally applicable terms, permitting cross-laboratory comparison within a common framework. Electronic supplementary material The online version of this article (doi:10.1186/s13059-016-0964-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Megan Crow
- Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring, Harbor, NY, 11724, USA
| | - Anirban Paul
- Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring, Harbor, NY, 11724, USA
| | - Sara Ballouz
- Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring, Harbor, NY, 11724, USA
| | - Z Josh Huang
- Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring, Harbor, NY, 11724, USA
| | - Jesse Gillis
- Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring, Harbor, NY, 11724, USA.
| |
Collapse
|