1
|
Wei Y, Qian H, Zhang X, Wang J, Yan H, Xiao N, Zeng S, Chen B, Yang Q, Lu H, Xie J, Xie Z, Qin D, Li Z. Progress in multi-omics studies of osteoarthritis. Biomark Res 2025; 13:26. [PMID: 39934890 DOI: 10.1186/s40364-025-00732-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2024] [Accepted: 01/15/2025] [Indexed: 02/13/2025] Open
Abstract
Osteoarthritis (OA), a ubiquitous degenerative joint disorder, is marked by pain and disability, profoundly impacting patients' quality of life. As the population ages, the global prevalence of OA is escalating. Omics technologies have become instrumental in investigating complex diseases like OA, offering comprehensive insights into its pathogenesis and progression by uncovering disease-specific alterations across genomics, transcriptomics, proteomics, and metabolomics levels. In this review, we systematically analyzed and summarized the application and recent achievements of omics technologies in OA research by scouring relevant literature in databases such as PubMed. These studies have shed light on new potential therapeutic targets and biomarkers, charting fresh avenues for OA diagnosis and treatment. Furthermore, in our discussion, we highlighted the immense potential of spatial omics technologies in unraveling the molecular mechanisms of OA and in the development of novel therapeutic strategies, proposing future research directions and challenges. Collectively, this study encapsulates the pivotal advances in current OA research and prospects for future investigation, providing invaluable references for a deeper understanding and treatment of OA. This review aims to synthesize the recent progress of omics technologies in the realm of OA, aspiring to furnish theoretical foundations and research orientations for more profound studies of OA in the future.
Collapse
Affiliation(s)
- Yuanyuan Wei
- First Clinical Medical College, Yunnan University of Chinese Medicine, Kunming, Yunnan, China
| | - He Qian
- School of Basic Medical Sciences, Yunnan University of Chinese Medicine, Kunming, Yunnan, China
| | - Xiaoyu Zhang
- First Clinical Medical College, Yunnan University of Chinese Medicine, Kunming, Yunnan, China
| | - Jian Wang
- First Clinical Medical College, Yunnan University of Chinese Medicine, Kunming, Yunnan, China
| | - Heguo Yan
- First Clinical Medical College, Yunnan University of Chinese Medicine, Kunming, Yunnan, China
| | - Niqin Xiao
- First Clinical Medical College, Yunnan University of Chinese Medicine, Kunming, Yunnan, China
| | - Sanjin Zeng
- School of Basic Medical Sciences, Yunnan University of Chinese Medicine, Kunming, Yunnan, China
| | - Bingbing Chen
- School of Basic Medical Sciences, Yunnan University of Chinese Medicine, Kunming, Yunnan, China
| | - Qianqian Yang
- First Clinical Medical College, Yunnan University of Chinese Medicine, Kunming, Yunnan, China
| | - Hongting Lu
- First Clinical Medical College, Yunnan University of Chinese Medicine, Kunming, Yunnan, China
| | - Jing Xie
- School of Basic Medical Sciences, Yunnan University of Chinese Medicine, Kunming, Yunnan, China
| | - Zhaohu Xie
- First Clinical Medical College, Yunnan University of Chinese Medicine, Kunming, Yunnan, China.
- School of Basic Medical Sciences, Yunnan University of Chinese Medicine, Kunming, Yunnan, China.
| | - Dongdong Qin
- School of Basic Medical Sciences, Yunnan University of Chinese Medicine, Kunming, Yunnan, China.
- Key Laboratory of Traditional Chinese Medicine for Prevention and Treatment of Neuropsychiatric Diseases, Yunnan University of Chinese Medicine, Kunming, Yunnan, China.
| | - Zhaofu Li
- First Clinical Medical College, Yunnan University of Chinese Medicine, Kunming, Yunnan, China.
| |
Collapse
|
2
|
Fang Z, Peltz G. Twenty-first century mouse genetics is again at an inflection point. Lab Anim (NY) 2025; 54:9-15. [PMID: 39592878 PMCID: PMC11695262 DOI: 10.1038/s41684-024-01491-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Accepted: 11/12/2024] [Indexed: 11/28/2024]
Abstract
The laboratory mouse has been the premier model organism for biomedical research owing to the availability of multiple well-characterized inbred strains, its mammalian physiology and its homozygous genome, and because experiments can be performed under conditions that control environmental variables. Moreover, its genome can be genetically modified to assess the impact of allelic variation on phenotype. Mouse models have been used to discover or test many therapies that are commonly used today. Mouse genetic discoveries are often made using genome-wide association study methods that compare allelic differences in panels of inbred mouse strains with their phenotypic responses. Here we examine changes in the methods used to analyze mouse genetic models of biomedical traits during the twenty-first century. To do this, we first examine where mouse genetics was before the first inflection point, which was just before the revolution in genome sequencing that occurred 20 years ago, and then describe the factors that have accelerated the pace of mouse genetic discovery. We focus on mouse genetic studies that have generated findings that either were translated to humans or could impact clinical medicine or drug development. We next explore how advances in computational capabilities and in DNA sequencing methodology during the past 20 years could enhance the ability of mouse genetics to produce solutions for twenty-first century public-health problems.
Collapse
Affiliation(s)
- Zhuoqing Fang
- Department of Anesthesia, Pain and Perioperative Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Gary Peltz
- Department of Anesthesia, Pain and Perioperative Medicine, Stanford University School of Medicine, Stanford, CA, USA.
| |
Collapse
|
3
|
Shireen H, Batool F, Khatoon H, Parveen N, Sehar NU, Hussain I, Ali S, Abbasi AA. Predicting genome-wide tissue-specific enhancers via combinatorial transcription factor genomic occupancy analysis. FEBS Lett 2025; 599:100-119. [PMID: 39367524 DOI: 10.1002/1873-3468.15030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2024] [Revised: 08/27/2024] [Accepted: 09/13/2024] [Indexed: 10/06/2024]
Abstract
Enhancers are non-coding cis-regulatory elements crucial for transcriptional regulation. Mutations in enhancers can disrupt gene regulation, leading to disease phenotypes. Identifying enhancers and their tissue-specific activity is challenging due to their lack of stereotyped sequences. This study presents a sequence-based computational model that uses combinatorial transcription factor (TF) genomic occupancy to predict tissue-specific enhancers. Trained on diverse datasets, including ENCODE and Vista enhancer browser data, the model predicted 25 000 forebrain-specific cis-regulatory modules (CRMs) in the human genome. Validation using biochemical features, disease-associated SNPs, and in vivo zebrafish analysis confirmed its effectiveness. This model aids in predicting enhancers lacking well-characterized chromatin features, complementing experimental approaches in tissue-specific enhancer discovery.
Collapse
Affiliation(s)
- Huma Shireen
- National Center for Bioinformatics, Program of Comparative and Evolutionary Genomics, Faculty of Biological Sciences, Quaid-i-Azam University, Islamabad, Pakistan
| | - Fatima Batool
- National Center for Bioinformatics, Program of Comparative and Evolutionary Genomics, Faculty of Biological Sciences, Quaid-i-Azam University, Islamabad, Pakistan
| | - Hizran Khatoon
- National Center for Bioinformatics, Program of Comparative and Evolutionary Genomics, Faculty of Biological Sciences, Quaid-i-Azam University, Islamabad, Pakistan
| | - Nazia Parveen
- National Center for Bioinformatics, Program of Comparative and Evolutionary Genomics, Faculty of Biological Sciences, Quaid-i-Azam University, Islamabad, Pakistan
| | - Noor Us Sehar
- National Center for Bioinformatics, Program of Comparative and Evolutionary Genomics, Faculty of Biological Sciences, Quaid-i-Azam University, Islamabad, Pakistan
| | - Irfan Hussain
- Centre for Regenerative Medicine and Stem Cells Research, Agha Khan University hospital, Karachi, Pakistan
| | - Shahid Ali
- Department of Organismal Biology and Anatomy, The University of Chicago, Chicago, IL, USA
| | - Amir Ali Abbasi
- National Center for Bioinformatics, Program of Comparative and Evolutionary Genomics, Faculty of Biological Sciences, Quaid-i-Azam University, Islamabad, Pakistan
| |
Collapse
|
4
|
Cocoș R, Popescu BO. Scrutinizing neurodegenerative diseases: decoding the complex genetic architectures through a multi-omics lens. Hum Genomics 2024; 18:141. [PMID: 39736681 DOI: 10.1186/s40246-024-00704-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2024] [Accepted: 12/10/2024] [Indexed: 01/01/2025] Open
Abstract
Neurodegenerative diseases present complex genetic architectures, reflecting a continuum from monogenic to oligogenic and polygenic models. Recent advances in multi-omics data, coupled with systems genetics, have significantly refined our understanding of how these data impact neurodegenerative disease mechanisms. To contextualize these genetic discoveries, we provide a comprehensive critical overview of genetic architecture concepts, from Mendelian inheritance to the latest insights from oligogenic and omnigenic models. We explore the roles of common and rare genetic variants, gene-gene and gene-environment interactions, and epigenetic influences in shaping disease phenotypes. Additionally, we emphasize the importance of multi-omics layers including genomic, transcriptomic, proteomic, epigenetic, and metabolomic data in elucidating the molecular mechanisms underlying neurodegeneration. Special attention is given to missing heritability and the contribution of rare variants, particularly in the context of pleiotropy and network pleiotropy. We examine the application of single-cell omics technologies, transcriptome-wide association studies, and epigenome-wide association studies as key approaches for dissecting disease mechanisms at tissue- and cell-type levels. Our review introduces the OmicPeak Disease Trajectory Model, a conceptual framework for understanding the genetic architecture of neurodegenerative disease progression, which integrates multi-omics data across biological layers and time points. This review highlights the critical importance of adopting a systems genetics approach to unravel the complex genetic architecture of neurodegenerative diseases. Finally, this emerging holistic understanding of multi-omics data and the exploration of the intricate genetic landscape aim to provide a foundation for establishing more refined genetic architectures of these diseases, enhancing diagnostic precision, predicting disease progression, elucidating pathogenic mechanisms, and refining therapeutic strategies for neurodegenerative conditions.
Collapse
Affiliation(s)
- Relu Cocoș
- Department of Medical Genetics, 'Carol Davila' University of Medicine and Pharmacy, Bucharest, Romania.
- Genomics Research and Development Institute, Bucharest, Romania.
| | - Bogdan Ovidiu Popescu
- Department of Clinical Neurosciences, 'Carol Davila' University of Medicine and Pharmacy, Bucharest, Romania.
| |
Collapse
|
5
|
Qi X, Ullah A, Yu W, Jin X, Liu H. Estimating the Genetic Risk of First-Degree Relatives for Chronic Diseases Using the Short Tandem Repeat Score as Model of Polygenic Inheritance. Biochem Genet 2024:10.1007/s10528-024-11003-0. [PMID: 39733222 DOI: 10.1007/s10528-024-11003-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Accepted: 12/10/2024] [Indexed: 12/30/2024]
Abstract
This study aims to establish a genetic risk assessment model based on a score of short tandem repeats (STRs) of polygenic inheritance. A total of 396 children and their biological parents were collected for STR genotyping. The numbers of tandem repeats of two alleles in one STR locus were assumed to be a quantitative genetic strength for disease incidence. The sums of 19 STR loci were considered a quantitative genetic strength per individual. Various thresholds of the STRs between paternal, maternal, and childhood data were recorded. As an exemplar, for thresholds of 25%, the first quarter = 1. All other samples = 0. The consistency rate for heredity (CH) was calculated from the difference in the morbidity of children between parents with and without disease groups. The ratio of observed CH to expected CH was defined as the heredity index (HI). Actual Pedigree data (finger-crossing test) confirmed the accuracy of the STR score. The genetic risk of first-degree relatives could be estimated using easily acquired data (incidence in an unrelated population). Our findings can provide a polygenic genetic model for estimating the incidence and genetic risk of chronic disease in first-degree relatives.
Collapse
Affiliation(s)
- Xia Qi
- College of Medical Laboratory, Dalian Medical University, Dalian, 116044, People's Republic of China
| | - Anwar Ullah
- College of Medical Laboratory, Dalian Medical University, Dalian, 116044, People's Republic of China
| | - Weijian Yu
- College of Medical Laboratory, Dalian Medical University, Dalian, 116044, People's Republic of China
| | - Xiaojun Jin
- College of Medical Laboratory, Dalian Medical University, Dalian, 116044, People's Republic of China
| | - Hui Liu
- College of Medical Laboratory, Dalian Medical University, Dalian, 116044, People's Republic of China.
| |
Collapse
|
6
|
Zhang WY, Chang YJ, Shi RH. Artificial intelligence enhances the management of esophageal squamous cell carcinoma in the precision oncology era. World J Gastroenterol 2024; 30:4267-4280. [PMID: 39492825 PMCID: PMC11525855 DOI: 10.3748/wjg.v30.i39.4267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Revised: 08/31/2024] [Accepted: 09/19/2024] [Indexed: 10/12/2024] Open
Abstract
Esophageal squamous cell carcinoma (ESCC) is the most common histological type of esophageal cancer with a poor prognosis. Early diagnosis and prognosis assessment are crucial for improving the survival rate of ESCC patients. With the advancement of artificial intelligence (AI) technology and the proliferation of medical digital information, AI has demonstrated promising sensitivity and accuracy in assisting precise detection, treatment decision-making, and prognosis assessment of ESCC. It has become a unique opportunity to enhance comprehensive clinical management of ESCC in the era of precision oncology. This review examines how AI is applied to the diagnosis, treatment, and prognosis assessment of ESCC in the era of precision oncology, and analyzes the challenges and potential opportunities that AI faces in clinical translation. Through insights into future prospects, it is hoped that this review will contribute to the real-world application of AI in future clinical settings, ultimately alleviating the disease burden caused by ESCC.
Collapse
Affiliation(s)
- Wan-Yue Zhang
- School of Medicine, Southeast University, Nanjing 221000, Jiangsu Province, China
| | - Yong-Jian Chang
- School of Cyber Science and Engineering, Southeast University, Nanjing 210009, Jiangsu Province, China
| | - Rui-Hua Shi
- Department of Gastroenterology, Zhongda Hospital, Southeast University, Nanjing 210009, Jiangsu Province, China
| |
Collapse
|
7
|
Xiong S, Zhang J, Luo H, Zhang Y, Xiao Q. A heterogeneous graph transformer framework for accurate cancer driver gene prediction and downstream analysis. Methods 2024; 232:9-17. [PMID: 39426693 DOI: 10.1016/j.ymeth.2024.09.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Revised: 09/26/2024] [Accepted: 09/29/2024] [Indexed: 10/21/2024] Open
Abstract
Accurately predicting cancer driver genes remains a formidable challenge amidst the burgeoning volume and intricacy of cancer genomic data. In this investigation, we propose HGTDG, an innovative heterogeneous graph transformer framework tailored for precisely predicting cancer driver genes and exploring downstream tasks. A heterogeneous graph construction module is central to the framework, which assembles a gene-protein heterogeneous network leveraging the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways and protein-protein interactions sourced from the STRING (search tool for recurring instances of neighboring genes) database. Moreover, our framework introduces a pioneering heterogeneous graph transformer module, harnessing multi-head attention mechanisms for nuanced node embedding. This transformative module proficiently captures distinct representations for both nodes and edges, thereby enriching the model's predictive capacity. Subsequently, the generated node embeddings are seamlessly integrated into a classification module, facilitating the discrimination between driver and non-driver genes. Our experimental findings evince the superiority of HGTDG over existing methodologies, as evidenced by the enhanced performance metrics, including the area under the receiver operating characteristic curves (AUROC) and the area under the precision-recall curves (AUPRC). Furthermore, the downstream analysis utilizing the newly identified cancer driver genes underscores the efficacy and versatility of our proposed framework.
Collapse
Affiliation(s)
- Shuwen Xiong
- School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China
| | - Junming Zhang
- School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China
| | - Hong Luo
- School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China
| | - Yongqing Zhang
- School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China
| | - Qinyin Xiao
- Sichuan Institute of Computer Sciences, Chengdu, 610041, China.
| |
Collapse
|
8
|
Ge Y, Li T, Feng X, Wu M, Liu H. Structured feature ranking for genomic marker identification accommodating multiple types of networks. Biometrics 2024; 80:ujae158. [PMID: 39745855 DOI: 10.1093/biomtc/ujae158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 10/04/2024] [Accepted: 12/12/2024] [Indexed: 01/04/2025]
Abstract
Numerous statistical methods have been developed to search for genomic markers associated with the development, progression, and response to treatment of complex diseases. Among them, feature ranking plays a vital role due to its intuitive formulation and computational efficiency. However, most of the existing methods are based on the marginal importance of molecular predictors and share the limitation that the dependence (network) structures among predictors are not well accommodated, where a disease phenotype usually reflects various biological processes that interact in a complex network. In this paper, we propose a structured feature ranking method for identifying genomic markers, where such network structures are effectively accommodated using Laplacian regularization. The proposed method innovatively investigates multiple network scenarios, where the networks can be known a priori and data-dependently estimated. In addition, we rigorously explore the noise and uncertainty in the networks and control their impacts with proper selection of tuning parameters. These characteristics make the proposed method enjoy especially broad applicability. Theoretical result of our proposal is rigorously established. Compared to the original marginal measure, the proposed network structured measure can achieve sure screening properties with a faster convergence rate under mild conditions. Extensive simulations and analysis of The Cancer Genome Atlas melanoma data demonstrate the improvement of finite sample performance and practical usefulness of the proposed method.
Collapse
Affiliation(s)
- Yeheng Ge
- School of Statistics and Data Science, Shanghai University of Finance and Economics, 777 Guoding Road, Shanghai 200433, China
| | - Tao Li
- School of Statistics and Data Science, Shanghai University of Finance and Economics, 777 Guoding Road, Shanghai 200433, China
| | - Xingdong Feng
- School of Statistics and Data Science, Shanghai University of Finance and Economics, 777 Guoding Road, Shanghai 200433, China
| | - Mengyun Wu
- School of Statistics and Data Science, Shanghai University of Finance and Economics, 777 Guoding Road, Shanghai 200433, China
| | - Hailong Liu
- Department of Urology, Xinhua Hospital affiliated to Shanghai Jiao Tong University School of Medicine, 1665 Kongjiang Road, Shanghai 200092, China
| |
Collapse
|
9
|
Malakhov MM, Dai B, Shen XT, Pan W. A bootstrap model comparison test for identifying genes with context-specific patterns of genetic regulation. Ann Appl Stat 2024; 18:1840-1857. [PMID: 39421855 PMCID: PMC11484521 DOI: 10.1214/23-aoas1859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
Understanding how genetic variation affects gene expression is essential for a complete picture of the functional pathways that give rise to complex traits. Although numerous studies have established that many genes are differentially expressed in distinct human tissues and cell types, no tools exist for identifying the genes whose expression is differentially regulated. Here we introduce DRAB (differential regulation analysis by bootstrapping), a gene-based method for testing whether patterns of genetic regulation are significantly different between tissues or other biological contexts. DRAB first leverages the elastic net to learn context-specific models of local genetic regulation and then applies a novel bootstrap-based model comparison test to check their equivalency. Unlike previous model comparison tests, our proposed approach can determine whether population-level models have equal predictive performance by accounting for the variability of feature selection and model training. We validated DRAB on mRNA expression data from a variety of human tissues in the Genotype-Tissue Expression (GTEx) Project. DRAB yielded biologically reasonable results and had sufficient power to detect genes with tissue-specific regulatory profiles while effectively controlling false positives. By providing a framework that facilitates the prioritization of differentially regulated genes, our study enables future discoveries on the genetic architecture of molecular phenotypes.
Collapse
Affiliation(s)
| | - Ben Dai
- Department of Statistics, The Chinese University of Hong Kong
| | | | - Wei Pan
- Division of Biostatistics and Health Data Science, University of Minnesota
| |
Collapse
|
10
|
Liang Y, Quan X, Gu R, Meng Z, Gan H, Wu Z, Sun Y, Pan H, Han P, Liu S, Dou G. Repurposing existing drugs for the treatment ofCOVID-19/SARS-CoV-2: A review of pharmacological effects and mechanism of action. Heliyon 2024; 10:e35988. [PMID: 39247343 PMCID: PMC11379597 DOI: 10.1016/j.heliyon.2024.e35988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 08/05/2024] [Accepted: 08/07/2024] [Indexed: 09/10/2024] Open
Abstract
Following the coronavirus disease-2019 outbreak caused by severe acute respiratory syndrome coronavirus type 2 (SARS-CoV-2), there is an ongoing need to seek drugs that target COVID-19. First off, novel drugs have a long development cycle, high investment cost, and are high risk. Second, novel drugs must be evaluated for activity, efficacy, safety, and metabolic performance, contributing to the development cycle, investment cost, and risk. We searched the Cochrane COVID-19 Study Register (including PubMed, Embase, CENTRAL, ClinicalTrials.gov, WHO ICTRP, and medRxiv), Web of Science (Science Citation Index, Emerging Citation Index), and WHO COVID-19 Coronaviral Disease Global Literature to identify completed and ongoing studies as of February 20, 2024. We evaluated the pharmacological effects, in vivo and in vitro data of the 16 candidates in the paper. The difficulty of studying these candidates in clinical trials involving COVID-19 patients, dosage of repurposed drugs, etc. is discussed in detail. Ultimately, Metformin is more suitable for prophylactic administration or mildly ill patients; the combination of Oseltamivir, Tamoxifen, and Dexamethasone is suitable for moderately and severely ill patients; and more clinical trials are needed for Azvudine, Ribavirin, Colchicine, and Cepharanthine to demonstrate efficacy.
Collapse
Affiliation(s)
- Yutong Liang
- Beijing Institute of Radiation Medicine, Beijing, China
| | - Xiaoxiao Quan
- Beijing Institute of Radiation Medicine, Beijing, China
- Scientific Experimental Center of Guangxi University of Chinese Medicine, Nanning, China
| | - Ruolan Gu
- Beijing Institute of Radiation Medicine, Beijing, China
| | - Zhiyun Meng
- Beijing Institute of Radiation Medicine, Beijing, China
| | - Hui Gan
- Beijing Institute of Radiation Medicine, Beijing, China
| | - Zhuona Wu
- Beijing Institute of Radiation Medicine, Beijing, China
| | - Yunbo Sun
- Beijing Institute of Radiation Medicine, Beijing, China
| | - Huajie Pan
- General Internal Medicine Department, Jingnan Medical District, PLA General Hospital, Beijing, China
| | - Peng Han
- Beijing Institute of Radiation Medicine, Beijing, China
| | - Shuchen Liu
- Beijing Institute of Radiation Medicine, Beijing, China
| | - Guifang Dou
- Beijing Institute of Radiation Medicine, Beijing, China
| |
Collapse
|
11
|
Pan Z, Theesfeld CL. Deciphering missense coding variants with AlphaMissense. Kidney Int 2024; 106:175-178. [PMID: 38647510 DOI: 10.1016/j.kint.2024.02.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Accepted: 02/12/2024] [Indexed: 04/25/2024]
Affiliation(s)
- Zhicheng Pan
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, New York, USA
| | - Chandra L Theesfeld
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, USA.
| |
Collapse
|
12
|
Li MM, Huang Y, Sumathipala M, Liang MQ, Valdeolivas A, Ananthakrishnan AN, Liao K, Marbach D, Zitnik M. Contextual AI models for single-cell protein biology. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.07.18.549602. [PMID: 37503080 PMCID: PMC10370131 DOI: 10.1101/2023.07.18.549602] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Understanding protein function and developing molecular therapies require deciphering the cell types in which proteins act as well as the interactions between proteins. However, modeling protein interactions across biological contexts remains challenging for existing algorithms. Here, we introduce Pinnacle, a geometric deep learning approach that generates context-aware protein representations. Leveraging a multi-organ single-cell atlas, Pinnacle learns on contextualized protein interaction networks to produce 394,760 protein representations from 156 cell type contexts across 24 tissues. Pinnacle's embedding space reflects cellular and tissue organization, enabling zero-shot retrieval of the tissue hierarchy. Pretrained protein representations can be adapted for downstream tasks: enhancing 3D structure-based representations for resolving immuno-oncological protein interactions, and investigating drugs' effects across cell types. Pinnacle outperforms state-of-the-art models in nominating therapeutic targets for rheumatoid arthritis and inflammatory bowel diseases, and pinpoints cell type contexts with higher predictive capability than context-free models. Pinnacle's ability to adjust its outputs based on the context in which it operates paves way for large-scale context-specific predictions in biology.
Collapse
Affiliation(s)
- Michelle M. Li
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Yepeng Huang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Marissa Sumathipala
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Man Qing Liang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Alberto Valdeolivas
- Roche Pharma Research and Early Development, Pharmaceutical Sciences, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland
| | - Ashwin N. Ananthakrishnan
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Division of Gastroenterology, Massachusetts General Hospital, Boston, MA, USA
| | - Katherine Liao
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Division of Rheumatology, Inflammation, and Immunity, Brigham and Women’s Hospital, Boston, MA, USA
| | - Daniel Marbach
- Roche Pharma Research and Early Development, Pharmaceutical Sciences, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland
| | - Marinka Zitnik
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Allston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Harvard Data Science Initiative, Cambridge, MA, USA
| |
Collapse
|
13
|
Zhao L, Cheng L, Yang Y, Wang P, Tian P, Yang T, Nian H, Cao L. Biomimetic Hydrogen-Bonded G ⋅ C ⋅ G ⋅ C Quadruplex within a Tetraphenylethene-Based Octacationic Spirobicycle in Water. Angew Chem Int Ed Engl 2024; 63:e202405150. [PMID: 38591857 DOI: 10.1002/anie.202405150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 04/01/2024] [Accepted: 04/09/2024] [Indexed: 04/10/2024]
Abstract
In biological systems, nucleotide quadruplexes (such as G-quadruplexes) in DNA and RNA that are held together by multiple hydrogen bonds play a crucial functional role. The biomimetic formation of these hydrogen-bonded quadruplexes captured by artificial systems in water poses a significant challenge but can offer valuable insights into these complex functional structures. Herein, we report the formation of biomimetic hydrogen-bonded G ⋅ C ⋅ G ⋅ C quadruplex captured by a tetraphenylethene (TPE) based octacationic spirobicycle (1). The spirobicyclic compound possesses a three-dimensional (3D) crossing dual-cavity structure, which enables the encapsulation of four d(GpC) dinucleotide molecules, thereby realizing 1 : 4 host-guest complexation in water. The X-ray structure reveals that four d(GpC) molecules further form a two-layer G ⋅ C ⋅ G ⋅ C quadruplex with Watson-Crick hydrogen bonds, which are stabilized within the dual hydrophobic cavities of 1 through the cooperative non-covalent interactions of hydrogen bonds, CH⋅⋅⋅π interactions, and hydrophobic effect. Due to the dynamically-rotational propeller chirality of TPE units, 1 with adaptive chirality can further serve as a chiroptical sensor to exhibit opposite Cotton effects with mirror-image CD spectra for the pH-dependent hydrogen-bonded assemblies of d(GpC) including the Watson-Crick G ⋅ C ⋅ G ⋅ C (pH 9.22) and Hoogsteen G ⋅ C+ ⋅ G ⋅ C+ (pH 5.74) quartets through the host-guest chirality transfer in water.
Collapse
Affiliation(s)
- Lingyu Zhao
- College of Chemistry and Materials Science, Northwest University, Xi'an, 710069, P. R. China)
| | - Lin Cheng
- College of Chemistry and Materials Science, Northwest University, Xi'an, 710069, P. R. China)
| | - Yanxia Yang
- College of Chemistry and Materials Science, Northwest University, Xi'an, 710069, P. R. China)
| | - Pingxia Wang
- College of Chemistry and Materials Science, Northwest University, Xi'an, 710069, P. R. China)
| | - Ping Tian
- College of Chemistry and Materials Science, Northwest University, Xi'an, 710069, P. R. China)
| | - Ting Yang
- College of Chemistry and Materials Science, Northwest University, Xi'an, 710069, P. R. China)
| | - Hao Nian
- College of Chemistry and Materials Science, Northwest University, Xi'an, 710069, P. R. China)
| | - Liping Cao
- College of Chemistry and Materials Science, Northwest University, Xi'an, 710069, P. R. China)
| |
Collapse
|
14
|
Lin M, Hu L, Shen S, Liu J, Liu Y, Xu Y, Chen H, Sugimoto K, Li J, Kamitsukasa I, Hiwasa T, Wang H, Xu A. Atherosclerosis-related biomarker PABPC1 predicts pan-cancer events. Stroke Vasc Neurol 2024; 9:108-125. [PMID: 37311641 PMCID: PMC11103157 DOI: 10.1136/svn-2022-002246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 05/25/2023] [Indexed: 06/15/2023] Open
Abstract
BACKGROUND Atherosclerosis (AS) and tumours are the leading causes of death worldwide and share common risk factors, detection methods and molecular markers. Therefore, searching for serum markers shared by AS and tumours is beneficial to the early diagnosis of patients. METHODS The sera of 23 patients with AS-related transient ischaemic attack were screened by serological identification of antigens through recombinant cDNA expression cloning (SEREX), and cDNA clones were identified. Pathway function enrichment analysis was performed on cDNA clones to identify their biological pathways and determine whether they were related to AS or tumours. Subsequently, gene-gene and protein-protein interactions were performed and AS-associated markers would be discovered. The expression of AS biomarkers in human normal organs and pan-cancer tumour tissues were explored. Then, immune infiltration level and tumour mutation burden of various immune cells were evaluated. Survival curves analysis could show the expression of AS markers in pan-cancer. RESULTS AS-related sera were screened by SEREX, and 83 cDNA clones with high homology were obtained. Through functional enrichment analysis, it was found that their functions were closely related to AS and tumour functions. After multiple biological information interaction screening and the external cohort validating, poly(A) binding protein cytoplasmic 1 (PABPC1) was found to be a potential AS biomarker. To assess whether PABPC1 was related to pan-cancer, its expression in different tumour pathological stages and ages was screened. Since AS-associated proteins were closely related to cancer immune infiltration, we investigated and found that PABPC1 had the same role in pan-cancer. Finally, analysis of Kaplan-Meier survival curves revealed that high PABPC1 expression in pan-cancer was associated with high risk of death. CONCLUSIONS Through the findings of SEREX and bioinformatics pan-cancer analysis, we concluded that PABPC1 might serve as a potential biomarker for the prediction and diagnosis of AS and pan-cancer.
Collapse
Affiliation(s)
- Miao Lin
- Stroke Center, The First Affiliated Hospital, Jinan University, Guangzhou, China
- Department of Anesthesiology, The First Affiliated Hospital, Jinan University, Guangzhou, China
| | - Liubing Hu
- Stroke Center, The First Affiliated Hospital, Jinan University, Guangzhou, China
- Department of Anesthesiology, The First Affiliated Hospital, Jinan University, Guangzhou, China
- The Biomedical Translational Research Institute,Faculty of Medical Science, Jinan University, Guangzhou, China
| | - Si Shen
- Stroke Center, The First Affiliated Hospital, Jinan University, Guangzhou, China
- Department of Radiology, Medical Imaging Center, The First Affiliated Hospital, Jinan University, Guangzhou, China
| | - Jiyue Liu
- Stroke Center, The First Affiliated Hospital, Jinan University, Guangzhou, China
- Department of Anesthesiology, The First Affiliated Hospital, Jinan University, Guangzhou, China
| | - Yanyan Liu
- The Biomedical Translational Research Institute,Faculty of Medical Science, Jinan University, Guangzhou, China
| | - Yixian Xu
- Stroke Center, The First Affiliated Hospital, Jinan University, Guangzhou, China
- Department of Anesthesiology, The First Affiliated Hospital, Jinan University, Guangzhou, China
| | - Honglin Chen
- Stroke Center, The First Affiliated Hospital, Jinan University, Guangzhou, China
- Department of Radiology, Medical Imaging Center, The First Affiliated Hospital, Jinan University, Guangzhou, China
| | - Kazuo Sugimoto
- Department of Biochemistry and Genetics, Graduate School of Medicine, Chiba University, Chiba, Japan
- Department of Neurology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Jianshuang Li
- The Biomedical Translational Research Institute,Faculty of Medical Science, Jinan University, Guangzhou, China
| | - Ikuo Kamitsukasa
- Department of Neurology, Chiba Rosai Hospital, Chiba, Japan
- Department of Neurology, Chibaken Saiseikai Narashino Hospital, Chiba, Japan
| | - Takaki Hiwasa
- Department of Biochemistry and Genetics, Graduate School of Medicine, Chiba University, Chiba, Japan
- Department of Neurological Surgery, Graduate School of Medicine, Chiba University, Chiba, Japan
| | - Hao Wang
- Stroke Center, The First Affiliated Hospital, Jinan University, Guangzhou, China
- Department of Anesthesiology, The First Affiliated Hospital, Jinan University, Guangzhou, China
- Department of Biochemistry and Genetics, Graduate School of Medicine, Chiba University, Chiba, Japan
| | - Anding Xu
- Stroke Center, The First Affiliated Hospital, Jinan University, Guangzhou, China
| |
Collapse
|
15
|
Lu S, Yang J, Gu Y, He D, Wu H, Sun W, Xu D, Li C, Guo C. Advances in Machine Learning Processing of Big Data from Disease Diagnosis Sensors. ACS Sens 2024; 9:1134-1148. [PMID: 38363978 DOI: 10.1021/acssensors.3c02670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2024]
Abstract
Exploring accurate, noninvasive, and inexpensive disease diagnostic sensors is a critical task in the fields of chemistry, biology, and medicine. The complexity of biological systems and the explosive growth of biomarker data have driven machine learning to become a powerful tool for mining and processing big data from disease diagnosis sensors. With the development of bioinformatics and artificial intelligence (AI), machine learning models formed by data mining have been able to guide more sensitive and accurate molecular computing. This review presents an overview of big data collection approaches and fundamental machine learning algorithms and discusses recent advances in machine learning and molecular computational disease diagnostic sensors. More specifically, we highlight existing modular workflows and key opportunities and challenges for machine learning to achieve disease diagnosis through big data mining.
Collapse
Affiliation(s)
- Shasha Lu
- School of Materials Science and Engineering, Suzhou University of Science and Technology, Suzhou 215011, China
| | - Jianyu Yang
- School of Materials Science and Engineering, Suzhou University of Science and Technology, Suzhou 215011, China
| | - Yu Gu
- School of Materials Science and Engineering, Suzhou University of Science and Technology, Suzhou 215011, China
| | - Dongyuan He
- School of Materials Science and Engineering, Suzhou University of Science and Technology, Suzhou 215011, China
| | - Haocheng Wu
- School of Materials Science and Engineering, Suzhou University of Science and Technology, Suzhou 215011, China
| | - Wei Sun
- College of Chemistry and Chemical Engineering, Hainan Normal University, Haikou 571158, China
| | - Dong Xu
- Department of Diagnostic Ultrasound Imaging & Interventional Therapy, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou 310022, China
| | - Changming Li
- School of Materials Science and Engineering, Suzhou University of Science and Technology, Suzhou 215011, China
| | - Chunxian Guo
- School of Materials Science and Engineering, Suzhou University of Science and Technology, Suzhou 215011, China
| |
Collapse
|
16
|
Chen Y, Wang W, Yang Z, Peng H, Ni Z, Sun Q, Guo W. Innovative computational tools provide new insights into the polyploid wheat genome. ABIOTECH 2024; 5:52-70. [PMID: 38576428 PMCID: PMC10987449 DOI: 10.1007/s42994-023-00131-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 12/14/2023] [Indexed: 04/06/2024]
Abstract
Bread wheat (Triticum aestivum) is an important crop and serves as a significant source of protein and calories for humans, worldwide. Nevertheless, its large and allopolyploid genome poses constraints on genetic improvement. The complex reticulate evolutionary history and the intricacy of genomic resources make the deciphering of the functional genome considerably more challenging. Recently, we have developed a comprehensive list of versatile computational tools with the integration of statistical models for dissecting the polyploid wheat genome. Here, we summarize the methodological innovations and applications of these tools and databases. A series of step-by-step examples illustrates how these tools can be utilized for dissecting wheat germplasm resources and unveiling functional genes associated with important agronomic traits. Furthermore, we outline future perspectives on new advanced tools and databases, taking into consideration the unique features of bread wheat, to accelerate genomic-assisted wheat breeding.
Collapse
Affiliation(s)
- Yongming Chen
- Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100193 China
| | - Wenxi Wang
- Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100193 China
| | - Zhengzhao Yang
- Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100193 China
| | - Huiru Peng
- Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100193 China
| | - Zhongfu Ni
- Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100193 China
| | - Qixin Sun
- Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100193 China
| | - Weilong Guo
- Frontiers Science Center for Molecular Design Breeding, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100193 China
| |
Collapse
|
17
|
Tang S, Peel E, Belov K, Hogg CJ, Farquharson KA. Multi-omics resources for the Australian southern stuttering frog (Mixophyes australis) reveal assorted antimicrobial peptides. Sci Rep 2024; 14:3991. [PMID: 38368484 PMCID: PMC10874372 DOI: 10.1038/s41598-024-54522-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 02/13/2024] [Indexed: 02/19/2024] Open
Abstract
The number of genome-level resources for non-model species continues to rapidly expand. However, frog species remain underrepresented, with up to 90% of frog genera having no genomic or transcriptomic data. Here, we assemble the first genomic and transcriptomic resources for the recently described southern stuttering frog (Mixophyes australis). The southern stuttering frog is ground-dwelling, inhabiting naturally vegetated riverbanks in south-eastern Australia. Using PacBio HiFi long-read sequencing and Hi-C scaffolding, we generated a high-quality genome assembly, with a scaffold N50 of 369.3 Mb and 95.1% of the genome contained in twelve scaffolds. Using this assembly, we identified the mitochondrial genome, and assembled six tissue-specific transcriptomes. We also bioinformatically characterised novel sequences of two families of antimicrobial peptides (AMPs) in the southern stuttering frog, the cathelicidins and β-defensins. While traditional peptidomic approaches to peptide discovery have typically identified one or two AMPs in a frog species from skin secretions, our bioinformatic approach discovered 12 cathelicidins and two β-defensins that were expressed in a range of tissues. We investigated the novelty of the peptides and found diverse predicted activities. Our bioinformatic approach highlights the benefits of multi-omics resources in peptide discovery and contributes valuable genomic resources in an under-represented taxon.
Collapse
Affiliation(s)
- Simon Tang
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW, 2006, Australia
- Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, The University of Sydney, Sydney, NSW, 2006, Australia
| | - Emma Peel
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW, 2006, Australia
- Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, The University of Sydney, Sydney, NSW, 2006, Australia
| | - Katherine Belov
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW, 2006, Australia
- Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, The University of Sydney, Sydney, NSW, 2006, Australia
| | - Carolyn J Hogg
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW, 2006, Australia.
- Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, The University of Sydney, Sydney, NSW, 2006, Australia.
| | - Katherine A Farquharson
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW, 2006, Australia
- Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, The University of Sydney, Sydney, NSW, 2006, Australia
| |
Collapse
|
18
|
Zhang M, Wang J, Wang W, Yang G, Peng J. Predicting cell-type specific disease genes of diabetes with the biological network. Comput Biol Med 2024; 169:107849. [PMID: 38101116 DOI: 10.1016/j.compbiomed.2023.107849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 11/21/2023] [Accepted: 12/11/2023] [Indexed: 12/17/2023]
Abstract
Type 2 diabetes (T2D) is a chronic condition that can lead to significant harm, such as heart disease, kidney disease, nerve damage, and blindness. Although T2D-related genes have been identified through Genome-wide association studies (GWAS) and various computational methods, the biological mechanism of T2D at the cell type level remains unclear. Exploring cell type-specific genes related to T2D is essential to understand the cellular mechanisms underlying the disease. To address this issue, we introduce DiGCellNet (predicting Disease Genes with Cell type specificity based on biological Networks), a model that integrates graph convolutional network (GCN) and multi-task learning (MTL) to predict T2D-associated cell type-specific genes based on the biological network. Our work represents the first attempt to predict cell type-specific disease genes using GCN and MTL. We evaluate our approach by predicting genes specific to four cell types and demonstrate that the proposed DiGCellNet outperforms other models that combine node embeddings with traditional machine learning algorithms. Moreover, DiGCellNet successfully identifies CALM1 as a gene specific to beta cell type in T2D cases, and this association is confirmed using an independent dataset. The code is available at https://github.com/23AIBox/23AIBox-DiGCellNet.
Collapse
Affiliation(s)
- Menghan Zhang
- School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710072, China; The National Engineering Laboratory for Integrated Aerospace-Ground-Ocean Big Data Application Technology, Xi'an, 710072, China
| | - Jingru Wang
- School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710072, China; The National Engineering Laboratory for Integrated Aerospace-Ground-Ocean Big Data Application Technology, Xi'an, 710072, China
| | - Wei Wang
- School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710072, China; The National Engineering Laboratory for Integrated Aerospace-Ground-Ocean Big Data Application Technology, Xi'an, 710072, China
| | - Guang Yang
- School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710072, China; The National Engineering Laboratory for Integrated Aerospace-Ground-Ocean Big Data Application Technology, Xi'an, 710072, China
| | - Jiajie Peng
- School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710072, China; The National Engineering Laboratory for Integrated Aerospace-Ground-Ocean Big Data Application Technology, Xi'an, 710072, China; School of Computer Science, Research and Development Institute of Northwestern Polytechnical University in Shenzhen, Shenzhen, 518000, China.
| |
Collapse
|
19
|
Lin S, Jia P. scGraph2Vec: a deep generative model for gene embedding augmented by graph neural network and single-cell omics data. Gigascience 2024; 13:giae108. [PMID: 39704704 DOI: 10.1093/gigascience/giae108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 10/18/2024] [Accepted: 11/26/2024] [Indexed: 12/21/2024] Open
Abstract
BACKGROUND Exploring the cellular processes of genes from the aspects of biological networks is of great interest to understanding the properties of complex diseases and biological systems. Biological networks, such as protein-protein interaction networks and gene regulatory networks, provide insights into the molecular basis of cellular processes and often form functional clusters in different tissue and disease contexts. RESULTS We present scGraph2Vec, a deep learning framework for generating informative gene embeddings. scGraph2Vec extends the variational graph autoencoder framework and integrates single-cell datasets and gene-gene interaction networks. We demonstrate that the gene embeddings are biologically interpretable and enable the identification of gene clusters representing functional or tissue-specific cellular processes. By comparing similar tools, we showed that scGraph2Vec clearly distinguished different gene clusters and aggregated more biologically functional genes. scGraph2Vec can be widely applied in diverse biological contexts. We illustrated that the embeddings generated by scGraph2Vec can infer disease-associated genes from genome-wide association study data (e.g., COVID-19 and Alzheimer's disease), identify additional driver genes in lung adenocarcinoma, and reveal regulatory genes responsible for maintaining or transitioning melanoma cell states. CONCLUSIONS scGraph2Vec not only reconstructs tissue-specific gene networks but also obtains a latent representation of genes implying their biological functions.
Collapse
Affiliation(s)
- Shiqi Lin
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Peilin Jia
- National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| |
Collapse
|
20
|
Gervasoni S, Manelfi C, Adobati S, Talarico C, Biswas AD, Pedretti A, Vistoli G, Beccari AR. Target Prediction by Multiple Virtual Screenings: Analyzing the SARS-CoV-2 Phenotypic Screening by the Docking Simulations Submitted to the MEDIATE Initiative. Int J Mol Sci 2023; 25:450. [PMID: 38203621 PMCID: PMC10779154 DOI: 10.3390/ijms25010450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 12/15/2023] [Accepted: 12/21/2023] [Indexed: 01/12/2024] Open
Abstract
Phenotypic screenings are usually combined with deconvolution techniques to characterize the mechanism of action for the retrieved hits. These studies can be supported by various computational analyses, although docking simulations are rarely employed. The present study aims to assess if multiple docking calculations can prove successful in target prediction. In detail, the docking simulations submitted to the MEDIATE initiative are utilized to predict the viral targets involved in the hits retrieved by a recently published cytopathic screening. Multiple docking results are combined by the EFO approach to develop target-specific consensus models. The combination of multiple docking simulations enhances the performances of the developed consensus models (average increases in EF1% value of 40% and 25% when combining three and two docking runs, respectively). These models are able to propose reliable targets for about half of the retrieved hits (31 out of 59). Thus, the study emphasizes that docking simulations might be effective in target identification and provide a convincing validation for the collaborative strategies that inspire the MEDIATE initiative. Disappointingly, cross-target and cross-program correlations suggest that common scoring functions are not specific enough for the simulated target.
Collapse
Affiliation(s)
- Silvia Gervasoni
- Dipartimento di Scienze Farmaceutiche, Università Degli Studi di Milano, Via Mangiagalli, 25, I-20133 Milano, Italy; (S.G.); (S.A.); (A.P.)
- Department of Physics, Università di Cagliari, I-09042 Monserrato, Italy
| | - Candida Manelfi
- EXSCALATE, Dompé Farmaceutici S.p.A., Via Tommaso De Amicis, 95, I-80131 Napoli, Italy; (C.M.); (C.T.); (A.D.B.); (A.R.B.)
| | - Sara Adobati
- Dipartimento di Scienze Farmaceutiche, Università Degli Studi di Milano, Via Mangiagalli, 25, I-20133 Milano, Italy; (S.G.); (S.A.); (A.P.)
| | - Carmine Talarico
- EXSCALATE, Dompé Farmaceutici S.p.A., Via Tommaso De Amicis, 95, I-80131 Napoli, Italy; (C.M.); (C.T.); (A.D.B.); (A.R.B.)
| | - Akash Deep Biswas
- EXSCALATE, Dompé Farmaceutici S.p.A., Via Tommaso De Amicis, 95, I-80131 Napoli, Italy; (C.M.); (C.T.); (A.D.B.); (A.R.B.)
| | - Alessandro Pedretti
- Dipartimento di Scienze Farmaceutiche, Università Degli Studi di Milano, Via Mangiagalli, 25, I-20133 Milano, Italy; (S.G.); (S.A.); (A.P.)
| | - Giulio Vistoli
- Dipartimento di Scienze Farmaceutiche, Università Degli Studi di Milano, Via Mangiagalli, 25, I-20133 Milano, Italy; (S.G.); (S.A.); (A.P.)
| | - Andrea R. Beccari
- EXSCALATE, Dompé Farmaceutici S.p.A., Via Tommaso De Amicis, 95, I-80131 Napoli, Italy; (C.M.); (C.T.); (A.D.B.); (A.R.B.)
| |
Collapse
|
21
|
Zhang Z, Lamson AR, Shelley M, Troyanskaya O. Interpretable neural architecture search and transfer learning for understanding CRISPR-Cas9 off-target enzymatic reactions. NATURE COMPUTATIONAL SCIENCE 2023; 3:1056-1066. [PMID: 38177723 DOI: 10.1038/s43588-023-00569-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 11/08/2023] [Indexed: 01/06/2024]
Abstract
Finely tuned enzymatic pathways control cellular processes, and their dysregulation can lead to disease. Developing predictive and interpretable models for these pathways is challenging because of the complexity of the pathways and of the cellular and genomic contexts. Here we introduce Elektrum, a deep learning framework that addresses these challenges with data-driven and biophysically interpretable models for determining the kinetics of biochemical systems. First, it uses in vitro kinetic assays to rapidly hypothesize an ensemble of high-quality kinetically interpretable neural networks (KINNs) that predict reaction rates. It then employs a transfer learning step, where the KINNs are inserted as intermediary layers into deeper convolutional neural networks, fine-tuning the predictions for reaction-dependent in vivo outcomes. We apply Elektrum to predict CRISPR-Cas9 off-target editing probabilities and demonstrate that Elektrum achieves improved performance, regularizes neural network architectures and maintains physical interpretability.
Collapse
Affiliation(s)
- Zijun Zhang
- Division of Artificial Intelligence in Medicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Adam R Lamson
- Center for Computational Biology, Flatiron Institute, New York City, NY, USA
| | - Michael Shelley
- Center for Computational Biology, Flatiron Institute, New York City, NY, USA.
- Courant Institute of Mathematical Sciences, New York University, New York City, NY, USA.
| | - Olga Troyanskaya
- Center for Computational Biology, Flatiron Institute, New York City, NY, USA.
- Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
22
|
Pagliara D, Ciolfi A, Pedace L, Haghshenas S, Ferilli M, Levy MA, Miele E, Nardini C, Cappelletti C, Relator R, Pitisci A, De Vito R, Pizzi S, Kerkhof J, McConkey H, Nazio F, Kant SG, Di Donato M, Agolini E, Matraxia M, Pasini B, Pelle A, Galluccio T, Novelli A, Barakat TS, Andreani M, Rossi F, Mecucci C, Savoia A, Sadikovic B, Locatelli F, Tartaglia M. Identification of a robust DNA methylation signature for Fanconi anemia. Am J Hum Genet 2023; 110:1938-1949. [PMID: 37865086 PMCID: PMC10645556 DOI: 10.1016/j.ajhg.2023.09.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 09/26/2023] [Accepted: 09/27/2023] [Indexed: 10/23/2023] Open
Abstract
Fanconi anemia (FA) is a clinically variable and genetically heterogeneous cancer-predisposing disorder representing the most common bone marrow failure syndrome. It is caused by inactivating predominantly biallelic mutations involving >20 genes encoding proteins with roles in the FA/BRCA DNA repair pathway. Molecular diagnosis of FA is challenging due to the wide spectrum of the contributing gene mutations and structural rearrangements. The assessment of chromosomal fragility after exposure to DNA cross-linking agents is generally required to definitively confirm diagnosis. We assessed peripheral blood genome-wide DNA methylation (DNAm) profiles in 25 subjects with molecularly confirmed clinical diagnosis of FA (FANCA complementation group) using Illumina's Infinium EPIC array. We identified 82 differentially methylated CpG sites that allow to distinguish subjects with FA from healthy individuals and subjects with other genetic disorders, defining an FA-specific DNAm signature. The episignature was validated using a second cohort of subjects with FA involving different complementation groups, documenting broader genetic sensitivity and demonstrating its specificity using the EpiSign Knowledge Database. The episignature properly classified DNA samples obtained from bone marrow aspirates, demonstrating robustness. Using the selected probes, we trained a machine-learning model able to classify EPIC DNAm profiles in molecularly unsolved cases. Finally, we show that the generated episignature includes CpG sites that do not undergo functional selective pressure, allowing diagnosis of FA in individuals with reverted phenotype due to gene conversion. These findings provide a tool to accelerate diagnostic testing in FA and broaden the clinical utility of DNAm profiling in the diagnostic setting.
Collapse
Affiliation(s)
- Daria Pagliara
- Department of Hematology/Oncology and Cell and Gene Therapy, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy
| | - Andrea Ciolfi
- Molecular Genetics and Functional Genomics, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy
| | - Lucia Pedace
- Department of Hematology/Oncology and Cell and Gene Therapy, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy
| | - Sadegheh Haghshenas
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London, ON N6A 5W9, Canada
| | - Marco Ferilli
- Molecular Genetics and Functional Genomics, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy
| | - Michael A Levy
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London, ON N6A 5W9, Canada
| | - Evelina Miele
- Department of Hematology/Oncology and Cell and Gene Therapy, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy
| | - Claudia Nardini
- Department of Hematology/Oncology and Cell and Gene Therapy, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy
| | - Camilla Cappelletti
- Molecular Genetics and Functional Genomics, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy
| | - Raissa Relator
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London, ON N6A 5W9, Canada
| | - Angela Pitisci
- Department of Hematology/Oncology and Cell and Gene Therapy, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy
| | - Rita De Vito
- Department of Laboratories, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy
| | - Simone Pizzi
- Molecular Genetics and Functional Genomics, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy
| | - Jennifer Kerkhof
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London, ON N6A 5W9, Canada
| | - Haley McConkey
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London, ON N6A 5W9, Canada; Department of Pathology and Laboratory Medicine, Western University, London, ON N6A 3K7, Canada
| | - Francesca Nazio
- Department of Hematology/Oncology and Cell and Gene Therapy, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy
| | - Sarina G Kant
- Department of Clinical Genetics, Erasmus MC University Medical Center, 3015 Rotterdam, the Netherlands
| | - Maddalena Di Donato
- Laboratory of Medical Genetics, Translational Cytogenomics Research Unit, Bambino Gesù Children Hospital, IRCCS, 00146 Rome, Italy
| | - Emanuele Agolini
- Laboratory of Medical Genetics, Translational Cytogenomics Research Unit, Bambino Gesù Children Hospital, IRCCS, 00146 Rome, Italy
| | - Marta Matraxia
- Laboratory of Medical Genetics, Translational Cytogenomics Research Unit, Bambino Gesù Children Hospital, IRCCS, 00146 Rome, Italy
| | - Barbara Pasini
- AOU Città della salute e della scienza di Torino, Molinette's Hospital, 10126 Torino, Italy
| | - Alessandra Pelle
- AOU Città della salute e della scienza di Torino, Molinette's Hospital, 10126 Torino, Italy
| | - Tiziana Galluccio
- Laboratory of Transplant Immunogenetics, Department of Hematology/Oncology, Cell and Gene Therapy, IRCCS Bambino Gesù Children's Hospital, 00146 Rome, Italy
| | - Antonio Novelli
- Laboratory of Medical Genetics, Translational Cytogenomics Research Unit, Bambino Gesù Children Hospital, IRCCS, 00146 Rome, Italy
| | - Tahsin Stefan Barakat
- Department of Clinical Genetics, Erasmus MC University Medical Center, 3015 Rotterdam, the Netherlands; ENCORE Expertise Center for Neurodevelopmental Disorders, Erasmus MC University Medical Center, 3015 Rotterdam, the Netherlands
| | - Marco Andreani
- Laboratory of Transplant Immunogenetics, Department of Hematology/Oncology, Cell and Gene Therapy, IRCCS Bambino Gesù Children's Hospital, 00146 Rome, Italy
| | - Francesca Rossi
- Department of Woman, Child and of General and Specialist Surgery, University of Campania "Luigi Vanvitelli," 80138 Naples, Italy
| | - Cristina Mecucci
- Institute of Hematology and Center for Hemato-Oncology Research, University and Hospital of Perugia, 06123 Perugia, Italy
| | - Anna Savoia
- Department of Neurosciences, Biomedicine and Movement Sciences, University of Verona, 37134 Verona, Italy
| | - Bekim Sadikovic
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London, ON N6A 5W9, Canada; Department of Pathology and Laboratory Medicine, Western University, London, ON N6A 3K7, Canada
| | - Franco Locatelli
- Department of Hematology/Oncology and Cell and Gene Therapy, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy; Department of Pediatrics, Catholic University of the Sacred Hearth, 00168 Rome, Italy.
| | - Marco Tartaglia
- Molecular Genetics and Functional Genomics, Bambino Gesù Children's Hospital, IRCCS, 00146 Rome, Italy.
| |
Collapse
|
23
|
Malakhov MM, Dai B, Shen XT, Pan W. A BOOTSTRAP MODEL COMPARISON TEST FOR IDENTIFYING GENES WITH CONTEXT-SPECIFIC PATTERNS OF GENETIC REGULATION. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.06.531446. [PMID: 36945657 PMCID: PMC10028853 DOI: 10.1101/2023.03.06.531446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/09/2023]
Abstract
Understanding how genetic variation affects gene expression is essential for a complete picture of the functional pathways that give rise to complex traits. Although numerous studies have established that many genes are differentially expressed in distinct human tissues and cell types, no tools exist for identifying the genes whose expression is differentially regulated. Here we introduce DRAB (Differential Regulation Analysis by Bootstrapping), a gene-based method for testing whether patterns of genetic regulation are significantly different between tissues or other biological contexts. DRAB first leverages the elastic net to learn context-specific models of local genetic regulation and then applies a novel bootstrap-based model comparison test to check their equivalency. Unlike previous model comparison tests, our proposed approach can determine whether population-level models have equal predictive performance by accounting for the variability of feature selection and model training. We validated DRAB on mRNA expression data from a variety of human tissues in the Genotype-Tissue Expression (GTEx) Project. DRAB yielded biologically reasonable results and had sufficient power to detect genes with tissue-specific regulatory profiles while effectively controlling false positives. By providing a framework that facilitates the prioritization of differentially regulated genes, our study enables future discoveries on the genetic architecture of molecular phenotypes.
Collapse
Affiliation(s)
| | - Ben Dai
- Department of Statistics, The Chinese University of Hong Kong
| | | | - Wei Pan
- Division of Biostatistics, University of Minnesota
| |
Collapse
|
24
|
Li J, Zhao T, Guan D, Pan Z, Bai Z, Teng J, Zhang Z, Zheng Z, Zeng J, Zhou H, Fang L, Cheng H. Learning functional conservation between human and pig to decipher evolutionary mechanisms underlying gene expression and complex traits. CELL GENOMICS 2023; 3:100390. [PMID: 37868039 PMCID: PMC10589632 DOI: 10.1016/j.xgen.2023.100390] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 04/25/2023] [Accepted: 08/02/2023] [Indexed: 10/24/2023]
Abstract
Assessment of genomic conservation between humans and pigs at the functional level can improve the potential of pigs as a human biomedical model. To address this, we developed a deep learning-based approach to learn the genomic conservation at the functional level (DeepGCF) between species by integrating 386 and 374 functional profiles from humans and pigs, respectively. DeepGCF demonstrated better prediction performance compared with the previous method. In addition, the resulting DeepGCF score captures the functional conservation between humans and pigs by examining chromatin states, sequence ontologies, and regulatory variants. We identified a core set of genomic regions as functionally conserved that plays key roles in gene regulation and is enriched for the heritability of complex traits and diseases in humans. Our results highlight the importance of cross-species functional comparison in illustrating the genetic and evolutionary basis of complex phenotypes.
Collapse
Affiliation(s)
- Jinghui Li
- Department of Animal Science, University of California, Davis, Davis, CA 95616, USA
| | - Tianjing Zhao
- Department of Animal Science, University of California, Davis, Davis, CA 95616, USA
| | - Dailu Guan
- Department of Animal Science, University of California, Davis, Davis, CA 95616, USA
| | - Zhangyuan Pan
- Department of Animal Science, University of California, Davis, Davis, CA 95616, USA
| | - Zhonghao Bai
- Center for Quantitative Genetics and Genomics (QGG), Aarhus University, 8000 Aarhus, Denmark
| | - Jinyan Teng
- State Key Laboratory of Swine and Poultry Breeding Industry, National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
| | - Zhe Zhang
- State Key Laboratory of Swine and Poultry Breeding Industry, National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
| | - Zhili Zheng
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - Jian Zeng
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Huaijun Zhou
- Department of Animal Science, University of California, Davis, Davis, CA 95616, USA
| | - Lingzhao Fang
- Center for Quantitative Genetics and Genomics (QGG), Aarhus University, 8000 Aarhus, Denmark
| | - Hao Cheng
- Department of Animal Science, University of California, Davis, Davis, CA 95616, USA
| |
Collapse
|
25
|
Simonovsky E, Sharon M, Ziv M, Mauer O, Hekselman I, Jubran J, Vinogradov E, Argov CM, Basha O, Kerber L, Yogev Y, Segrè AV, Im HK, Birk O, Rokach L, Yeger‐Lotem E. Predicting molecular mechanisms of hereditary diseases by using their tissue-selective manifestation. Mol Syst Biol 2023; 19:e11407. [PMID: 37232043 PMCID: PMC10407743 DOI: 10.15252/msb.202211407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 04/30/2023] [Accepted: 05/10/2023] [Indexed: 05/27/2023] Open
Abstract
How do aberrations in widely expressed genes lead to tissue-selective hereditary diseases? Previous attempts to answer this question were limited to testing a few candidate mechanisms. To answer this question at a larger scale, we developed "Tissue Risk Assessment of Causality by Expression" (TRACE), a machine learning approach to predict genes that underlie tissue-selective diseases and selectivity-related features. TRACE utilized 4,744 biologically interpretable tissue-specific gene features that were inferred from heterogeneous omics datasets. Application of TRACE to 1,031 disease genes uncovered known and novel selectivity-related features, the most common of which was previously overlooked. Next, we created a catalog of tissue-associated risks for 18,927 protein-coding genes (https://netbio.bgu.ac.il/trace/). As proof-of-concept, we prioritized candidate disease genes identified in 48 rare-disease patients. TRACE ranked the verified disease gene among the patient's candidate genes significantly better than gene prioritization methods that rank by gene constraint or tissue expression. Thus, tissue selectivity combined with machine learning enhances genetic and clinical understanding of hereditary diseases.
Collapse
Affiliation(s)
- Eyal Simonovsky
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Moran Sharon
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Maya Ziv
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Omry Mauer
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Idan Hekselman
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Juman Jubran
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Ekaterina Vinogradov
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Chanan M Argov
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Omer Basha
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Lior Kerber
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Yuval Yogev
- Morris Kahn Laboratory of Human Genetics and the Genetics Institute at Soroka Medical Center, Faculty of Health SciencesBen Gurion University of the NegevBeer ShevaIsrael
| | - Ayellet V Segrè
- Ocular Genomics Institute, Massachusetts Eye and EarHarvard Medical SchoolBostonMAUSA
- The Broad Institute of MIT and HarvardCambridgeMAUSA
| | - Hae Kyung Im
- Section of Genetic Medicine, Department of MedicineThe University of ChicagoChicagoILUSA
| | | | - Ohad Birk
- Morris Kahn Laboratory of Human Genetics and the Genetics Institute at Soroka Medical Center, Faculty of Health SciencesBen Gurion University of the NegevBeer ShevaIsrael
- The National Institute for Biotechnology in the NegevBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Lior Rokach
- Department of Software & Information Systems EngineeringBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Esti Yeger‐Lotem
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
- The National Institute for Biotechnology in the NegevBen‐Gurion University of the NegevBeer ShevaIsrael
| |
Collapse
|
26
|
Cutshaw G, Uthaman S, Hassan N, Kothadiya S, Wen X, Bardhan R. The Emerging Role of Raman Spectroscopy as an Omics Approach for Metabolic Profiling and Biomarker Detection toward Precision Medicine. Chem Rev 2023; 123:8297-8346. [PMID: 37318957 PMCID: PMC10626597 DOI: 10.1021/acs.chemrev.2c00897] [Citation(s) in RCA: 44] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Omics technologies have rapidly evolved with the unprecedented potential to shape precision medicine. Novel omics approaches are imperative toallow rapid and accurate data collection and integration with clinical information and enable a new era of healthcare. In this comprehensive review, we highlight the utility of Raman spectroscopy (RS) as an emerging omics technology for clinically relevant applications using clinically significant samples and models. We discuss the use of RS both as a label-free approach for probing the intrinsic metabolites of biological materials, and as a labeled approach where signal from Raman reporters conjugated to nanoparticles (NPs) serve as an indirect measure for tracking protein biomarkers in vivo and for high throughout proteomics. We summarize the use of machine learning algorithms for processing RS data to allow accurate detection and evaluation of treatment response specifically focusing on cancer, cardiac, gastrointestinal, and neurodegenerative diseases. We also highlight the integration of RS with established omics approaches for holistic diagnostic information. Further, we elaborate on metal-free NPs that leverage the biological Raman-silent region overcoming the challenges of traditional metal NPs. We conclude the review with an outlook on future directions that will ultimately allow the adaptation of RS as a clinical approach and revolutionize precision medicine.
Collapse
Affiliation(s)
- Gabriel Cutshaw
- Department of Chemical and Biological Engineering, Iowa State University, Ames, IA 50012, USA
- Nanovaccine Institute, Iowa State University, Ames, IA 50012, USA
| | - Saji Uthaman
- Department of Chemical and Biological Engineering, Iowa State University, Ames, IA 50012, USA
- Nanovaccine Institute, Iowa State University, Ames, IA 50012, USA
| | - Nora Hassan
- Department of Chemical and Biological Engineering, Iowa State University, Ames, IA 50012, USA
- Nanovaccine Institute, Iowa State University, Ames, IA 50012, USA
| | - Siddhant Kothadiya
- Department of Chemical and Biological Engineering, Iowa State University, Ames, IA 50012, USA
- Nanovaccine Institute, Iowa State University, Ames, IA 50012, USA
| | - Xiaona Wen
- Biologics Analytical Research and Development, Merck & Co., Inc., Rahway, NJ, 07065, USA
| | - Rizia Bardhan
- Department of Chemical and Biological Engineering, Iowa State University, Ames, IA 50012, USA
- Nanovaccine Institute, Iowa State University, Ames, IA 50012, USA
| |
Collapse
|
27
|
Xiao Y, Wang J, Li J, Zhang P, Li J, Zhou Y, Zhou Q, Chen M, Sheng X, Liu Z, Han X, Guo G. An analytical framework for decoding cell type-specific genetic variation of gene regulation. Nat Commun 2023; 14:3884. [PMID: 37391400 PMCID: PMC10313894 DOI: 10.1038/s41467-023-39538-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Accepted: 06/16/2023] [Indexed: 07/02/2023] Open
Abstract
A deeper understanding of genetic regulation and functional mechanisms underlying genetic associations with complex traits and diseases is impeded by cellular heterogeneity and linkage disequilibrium. To address these limits, we introduce Huatuo, a framework to decode genetic variation of gene regulation at cell type and single-nucleotide resolutions by integrating deep-learning-based variant predictions with population-based association analyses. We apply Huatuo to generate a comprehensive cell type-specific genetic variation landscape across human tissues and further evaluate their potential roles in complex diseases and traits. Finally, we show that Huatuo's inferences permit prioritizations of driver cell types associated with complex traits and diseases and allow for systematic insights into the mechanisms of phenotype-causal genetic variation.
Collapse
Affiliation(s)
- Yanyu Xiao
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310000, China
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, Zhejiang, 311121, China
| | - Jingjing Wang
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310000, China.
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, Zhejiang, 311121, China.
| | - Jiaqi Li
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310000, China
| | - Peijing Zhang
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310000, China
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, Zhejiang, 311121, China
| | - Jingyu Li
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310000, China
| | - Yincong Zhou
- College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, 310003, China
| | - Qing Zhou
- Life Sciences Institute, Zhejiang University, Hang Zhou, Zhejiang, 310058, China
| | - Ming Chen
- College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, 310003, China
| | - Xin Sheng
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, Zhejiang, 311121, China
| | - Zhihong Liu
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, Zhejiang, 311121, China
| | - Xiaoping Han
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310000, China.
- Zhejiang Provincial Key Lab for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou, Zhejiang, 310058, China.
| | - Guoji Guo
- Center for Stem Cell and Regenerative Medicine, and Bone Marrow Transplantation Center of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310000, China.
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, Zhejiang, 311121, China.
- Zhejiang Provincial Key Lab for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou, Zhejiang, 310058, China.
- Zhejiang University-University of Edinburgh Institute, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 314400, China.
| |
Collapse
|
28
|
Zhang L, Parvin R, Chen M, Hu D, Fan Q, Ye F. High-throughput microfluidic droplets in biomolecular analytical system: A review. Biosens Bioelectron 2023; 228:115213. [PMID: 36906989 DOI: 10.1016/j.bios.2023.115213] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 02/13/2023] [Accepted: 03/06/2023] [Indexed: 03/11/2023]
Abstract
Droplet microfluidic technology has revolutionized biomolecular analytical research, as it has the capability to reserve the genotype-to-phenotype linkage and assist for revealing the heterogeneity. Massive and uniform picolitre droplets feature dividing solution to the level that single cell and single molecule in each droplet can be visualized, barcoded, and analyzed. Then, the droplet assays can unfold intensive genomic data, offer high sensitivity, and screen and sort from a large number of combinations or phenotypes. Based on these unique advantages, this review focuses on up-to-date research concerning diverse screening applications utilizing droplet microfluidic technology. The emerging progress of droplet microfluidic technology is first introduced, including efficient and scaling-up in droplets encapsulation, and prevalent batch operations. Then the new implementations of droplet-based digital detection assays and single-cell muti-omics sequencing are briefly examined, along with related applications such as drug susceptibility testing, multiplexing for cancer subtype identification, interactions of virus-to-host, and multimodal and spatiotemporal analysis. Meanwhile, we specialize in droplet-based large-scale combinational screening regarding desired phenotypes, with an emphasis on sorting for immune cells, antibodies, enzymatic properties, and proteins produced by directed evolution methods. Finally, some challenges, deployment and future perspective of droplet microfluidics technology in practice are also discussed.
Collapse
Affiliation(s)
- Lexiang Zhang
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou, Zhejiang, 325000, China; Zhejiang Engineering Research Center for Tissue Repair Materials, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, Zhejiang, 325000, China
| | - Rokshana Parvin
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou, Zhejiang, 325000, China; Zhejiang Engineering Research Center for Tissue Repair Materials, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, Zhejiang, 325000, China
| | - Mingshuo Chen
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou, Zhejiang, 325000, China; Zhejiang Engineering Research Center for Tissue Repair Materials, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, Zhejiang, 325000, China
| | - Dingmeng Hu
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou, Zhejiang, 325000, China; Zhejiang Engineering Research Center for Tissue Repair Materials, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, Zhejiang, 325000, China
| | - Qihui Fan
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou, Zhejiang, 325000, China; Zhejiang Engineering Research Center for Tissue Repair Materials, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, Zhejiang, 325000, China; Beijing National Laboratory for Condensed Matter Physics, Institute of Physics, Chinese Academy of Sciences, Beijing, 100190, China.
| | - Fangfu Ye
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou, Zhejiang, 325000, China; Zhejiang Engineering Research Center for Tissue Repair Materials, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, Zhejiang, 325000, China; Beijing National Laboratory for Condensed Matter Physics, Institute of Physics, Chinese Academy of Sciences, Beijing, 100190, China.
| |
Collapse
|
29
|
Zhu J, Luo J, Ma Y. Screening of serum exosome markers for colorectal cancer based on Boruta and multi-cluster feature selection algorithms. Mol Cell Toxicol 2023. [DOI: 10.1007/s13273-023-00348-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/28/2023]
|
30
|
Kardynska M, Kogut D, Pacholczyk M, Smieja J. Mathematical modeling of regulatory networks of intracellular processes - Aims and selected methods. Comput Struct Biotechnol J 2023; 21:1523-1532. [PMID: 36851915 PMCID: PMC9958294 DOI: 10.1016/j.csbj.2023.02.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 02/03/2023] [Accepted: 02/03/2023] [Indexed: 02/11/2023] Open
Abstract
Regulatory networks structure and signaling pathways dynamics are uncovered in time- and resource consuming experimental work. However, it is increasingly supported by modeling, analytical and computational techniques as well as discrete mathematics and artificial intelligence applied to to extract knowledge from existing databases. This review is focused on mathematical modeling used to analyze dynamics and robustness of these networks. This paper presents a review of selected modeling methods that facilitate advances in molecular biology.
Collapse
Affiliation(s)
- Malgorzata Kardynska
- Dept. of Biosensors and Processing of Biomedical Signals, Silesian University of Technology, Gliwice, Poland
| | - Daria Kogut
- Dept. of Biosensors and Processing of Biomedical Signals, Silesian University of Technology, Gliwice, Poland.,Dept. of Systems Biology and Engineering, Silesian University of Technology, Gliwice, Poland
| | - Marcin Pacholczyk
- Dept. of Biosensors and Processing of Biomedical Signals, Silesian University of Technology, Gliwice, Poland.,Dept. of Systems Biology and Engineering, Silesian University of Technology, Gliwice, Poland
| | - Jaroslaw Smieja
- Dept. of Biosensors and Processing of Biomedical Signals, Silesian University of Technology, Gliwice, Poland.,Dept. of Systems Biology and Engineering, Silesian University of Technology, Gliwice, Poland
| |
Collapse
|
31
|
Information Security Protection of Internet of Energy Using Ensemble Public Key Algorithm under Big Data. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING 2023. [DOI: 10.1155/2023/6853902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/19/2023]
Abstract
This work aims to solve the specific problem in the Power Internet of Things (PIoT). PIoT is vulnerable to monitoring, tampering, forgery, and other attacks during frequent data interaction under the background of big data, leading to a severe threat to the power grid’s Information Security (ISEC). Cryptosystems can solve ISEC problems, such as confidentiality, data integrity, authentication, identity recognition, data control, and nonrepudiation. Thereupon, this work expounds on cryptography from public-key encryption and digital signature and puts forward the model of network information attack. Then, the security of the two cryptograms is certified against the two cyberattack modes. On this basis, an Identity-based Combined Encryption and Signature (IBCES) ensemble scheme is proposed by combining public-key encryption with the digital signature. Finally, the security of the proposed IBCES’s encryption and the signature schemes is verified, and the results prove their feasibility. The results show that the proposed IBCEs are effective and feasible, fully meeting the information confidentiality requirements. Additionally, smart grid against Information Security (ISEC) algorithms must comprehensively consider network resources and computing power. This work creatively combines the two cryptosystems. The proposal breaks the traditional key segmentation principle by applying the same key to different cryptosystems and ensures the independent security of the two cryptosystems. The conclusion provides technical support for future research on cryptography.
Collapse
|
32
|
Deep learning in regulatory genomics: from identification to design. Curr Opin Biotechnol 2023; 79:102887. [PMID: 36640453 DOI: 10.1016/j.copbio.2022.102887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 11/12/2022] [Accepted: 12/14/2022] [Indexed: 01/14/2023]
Abstract
Genomics and deep learning are a natural match since both are data-driven fields. Regulatory genomics refers to functional noncoding DNA regulating gene expression. In recent years, deep learning applications on regulatory genomics have achieved remarkable advances so-much-so that it has revolutionized the rules of the game of the computational methods in this field. Here, we review two emerging trends: (i) the modeling of very long input sequence (up to 200 kb), which requires self-matched modularization of model architecture; (ii) on the balance of model predictability and model interpretability because the latter is more able to meet biological demands. Finally, we discuss how to employ these two routes to design synthetic regulatory DNA, as a promising strategy for optimizing crop agronomic properties.
Collapse
|
33
|
Yu X, Zhou S, Zou H, Wang Q, Liu C, Zang M, Liu T. Survey of deep learning techniques for disease prediction based on omics data. HUMAN GENE 2023; 35:201140. [DOI: 10.1016/j.humgen.2022.201140] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2025]
|
34
|
Andreassen OA, Hindley GFL, Frei O, Smeland OB. New insights from the last decade of research in psychiatric genetics: discoveries, challenges and clinical implications. World Psychiatry 2023; 22:4-24. [PMID: 36640404 PMCID: PMC9840515 DOI: 10.1002/wps.21034] [Citation(s) in RCA: 73] [Impact Index Per Article: 36.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 10/07/2022] [Indexed: 01/15/2023] Open
Abstract
Psychiatric genetics has made substantial progress in the last decade, providing new insights into the genetic etiology of psychiatric disorders, and paving the way for precision psychiatry, in which individual genetic profiles may be used to personalize risk assessment and inform clinical decision-making. Long recognized to be heritable, recent evidence shows that psychiatric disorders are influenced by thousands of genetic variants acting together. Most of these variants are commonly occurring, meaning that every individual has a genetic risk to each psychiatric disorder, from low to high. A series of large-scale genetic studies have discovered an increasing number of common and rare genetic variants robustly associated with major psychiatric disorders. The most convincing biological interpretation of the genetic findings implicates altered synaptic function in autism spectrum disorder and schizophrenia. However, the mechanistic understanding is still incomplete. In line with their extensive clinical and epidemiological overlap, psychiatric disorders appear to exist on genetic continua and share a large degree of genetic risk with one another. This provides further support to the notion that current psychiatric diagnoses do not represent distinct pathogenic entities, which may inform ongoing attempts to reconceptualize psychiatric nosology. Psychiatric disorders also share genetic influences with a range of behavioral and somatic traits and diseases, including brain structures, cognitive function, immunological phenotypes and cardiovascular disease, suggesting shared genetic etiology of potential clinical importance. Current polygenic risk score tools, which predict individual genetic susceptibility to illness, do not yet provide clinically actionable information. However, their precision is likely to improve in the coming years, and they may eventually become part of clinical practice, stressing the need to educate clinicians and patients about their potential use and misuse. This review discusses key recent insights from psychiatric genetics and their possible clinical applications, and suggests future directions.
Collapse
Affiliation(s)
- Ole A Andreassen
- NORMENT Centre, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
- Division of Mental Health and Addiction, Oslo University Hospital, Oslo, Norway
| | - Guy F L Hindley
- NORMENT Centre, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
- Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| | - Oleksandr Frei
- NORMENT Centre, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
- Centre for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway
| | - Olav B Smeland
- NORMENT Centre, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
- Division of Mental Health and Addiction, Oslo University Hospital, Oslo, Norway
| |
Collapse
|
35
|
Shea A, Bartz J, Zhang L, Dong X. Predicting mutational function using machine learning. MUTATION RESEARCH. REVIEWS IN MUTATION RESEARCH 2023; 791:108457. [PMID: 36965820 PMCID: PMC10239318 DOI: 10.1016/j.mrrev.2023.108457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 03/11/2023] [Accepted: 03/20/2023] [Indexed: 03/27/2023]
Abstract
Genetic variations are one of the major causes of phenotypic variations between human individuals. Although beneficial as being the substrate of evolution, germline mutations may cause diseases, including Mendelian diseases and complex diseases such as diabetes and heart diseases. Mutations occurring in somatic cells are a main cause of cancer and likely cause age-related phenotypes and other age-related diseases. Because of the high abundance of genetic variations in the human genome, i.e., millions of germline variations per human subject and thousands of additional somatic mutations per cell, it is technically challenging to experimentally verify the function of every possible mutation and their interactions. Significant progress has been made to solve this problem using computational approaches, especially machine learning (ML). Here, we review the progress and achievements made in recent years in this field of research. We classify the computational models in two ways: one according to their prediction goals including protein structural alterations, gene expression changes, and disease risks, and the other according to their methodologies, including non-machine learning methods, classical machine learning methods, and deep neural network methods. For models in each category, we discuss their architecture, prediction accuracy, and potential limitations. This review provides new insights into the applications and future directions of computational approaches in understanding the role of mutations in aging and disease.
Collapse
Affiliation(s)
- Anthony Shea
- Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA
| | - Josh Bartz
- Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA; Bioinformatics and Computational Biology Program, University of Minnesota, Minneapolis, MN 55455, USA
| | - Lei Zhang
- Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Xiao Dong
- Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA.
| |
Collapse
|
36
|
Wang Y, Sun Z, He Q, Li J, Ni M, Yang M. Self-supervised graph representation learning integrates multiple molecular networks and decodes gene-disease relationships. PATTERNS (NEW YORK, N.Y.) 2022; 4:100651. [PMID: 36699743 PMCID: PMC9868676 DOI: 10.1016/j.patter.2022.100651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 05/19/2022] [Accepted: 11/07/2022] [Indexed: 12/12/2022]
Abstract
Leveraging molecular networks to discover disease-relevant modules is a long-standing challenge. With the accumulation of interactomes, there is a pressing need for powerful computational approaches to handle the inevitable noise and context-specific nature of biological networks. Here, we introduce Graphene, a two-step self-supervised representation learning framework tailored to concisely integrate multiple molecular networks and adapted to gene functional analysis via downstream re-training. In practice, we first leverage GNN (graph neural network) pre-training techniques to obtain initial node embeddings followed by re-training Graphene using a graph attention architecture, achieving superior performance over competing methods for pathway gene recovery, disease gene reprioritization, and comorbidity prediction. Graphene successfully recapitulates tissue-specific gene expression across disease spectrum and demonstrates shared heritability of common mental disorders. Graphene can be updated with new interactomes or other omics features. Graphene holds promise to decipher gene function under network context and refine GWAS (genome-wide association study) hits and offers mechanistic insights via decoding diseases from genome to networks to phenotypes.
Collapse
Affiliation(s)
- Yi Wang
- MGI, BGI-Shenzhen, Shenzhen, China
| | - Zijun Sun
- Computer Center, Peking University, Beijing, China
| | | | - Jiwei Li
- Department of Computer Science, Zhejiang University, Hangzhou, China
| | - Ming Ni
- MGI, BGI-Shenzhen, Shenzhen, China
- MGI-QingDao, BGI-Shenzhen, Qingdao, China
| | - Meng Yang
- MGI, BGI-Shenzhen, Shenzhen, China
- Corresponding author
| |
Collapse
|
37
|
Barrio-Hernandez I, Beltrao P. Network analysis of genome-wide association studies for drug target prioritisation. Curr Opin Chem Biol 2022; 71:102206. [PMID: 36087372 DOI: 10.1016/j.cbpa.2022.102206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Revised: 07/29/2022] [Accepted: 08/05/2022] [Indexed: 01/27/2023]
Abstract
Over the past decades, genome-wide association studies (GWAS) have led to a dramatic expansion of genetic variants implicated with human traits and diseases. These advances are expected to result in new drug targets but the identification of causal genes and the cell biology underlying human diseases from GWAS remains challenging. Here, we review protein interaction network-based methods to analyse GWAS data. These approaches can rank candidate drug targets at GWAS-associated loci or among interactors of disease genes without direct genetic support. These methods identify the cell biology affected in common across diseases, offering opportunities for drug repurposing, as well as be combined with expression data to identify focal tissues and cell types. Going forward, we expect that these methods will further improve from advances in the characterisation of context specific interaction networks and the joint analysis of rare and common genetic signals.
Collapse
Affiliation(s)
- Inigo Barrio-Hernandez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge, CB10 1SD, UK; Open Targets, Wellcome Genome Campus, Cambridge, CB10 1SA, UK.
| | - Pedro Beltrao
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge, CB10 1SD, UK; Open Targets, Wellcome Genome Campus, Cambridge, CB10 1SA, UK; Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, 8093, Switzerland.
| |
Collapse
|
38
|
Long E, Patel H, Byun J, Amos CI, Choi J. Functional studies of lung cancer GWAS beyond association. Hum Mol Genet 2022; 31:R22-R36. [PMID: 35776125 PMCID: PMC9585683 DOI: 10.1093/hmg/ddac140] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 06/01/2022] [Accepted: 06/16/2022] [Indexed: 11/14/2022] Open
Abstract
Fourteen years after the first genome-wide association study (GWAS) of lung cancer was published, approximately 45 genomic loci have now been significantly associated with lung cancer risk. While functional characterization was performed for several of these loci, a comprehensive summary of the current molecular understanding of lung cancer risk has been lacking. Further, many novel computational and experimental tools now became available to accelerate the functional assessment of disease-associated variants, moving beyond locus-by-locus approaches. In this review, we first highlight the heterogeneity of lung cancer GWAS findings across histological subtypes, ancestries and smoking status, which poses unique challenges to follow-up studies. We then summarize the published lung cancer post-GWAS studies for each risk-associated locus to assess the current understanding of biological mechanisms beyond the initial statistical association. We further summarize strategies for GWAS functional follow-up studies considering cutting-edge functional genomics tools and providing a catalog of available resources relevant to lung cancer. Overall, we aim to highlight the importance of integrating computational and experimental approaches to draw biological insights from the lung cancer GWAS results beyond association.
Collapse
Affiliation(s)
- Erping Long
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Harsh Patel
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Jinyoung Byun
- Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX, 77030, USA
- Section of Epidemiology and Population Sciences, Department of Medicine, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Christopher I Amos
- Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX, 77030, USA
- Section of Epidemiology and Population Sciences, Department of Medicine, Baylor College of Medicine, Houston, TX, 77030, USA
- Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Jiyeon Choi
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| |
Collapse
|
39
|
Newey PJ. Approach to the patient with a variant of uncertain significance on genetic testing. Clin Endocrinol (Oxf) 2022; 97:400-408. [PMID: 35996232 DOI: 10.1111/cen.14818] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Revised: 08/12/2022] [Accepted: 08/13/2022] [Indexed: 11/29/2022]
Abstract
Establishing a genetic diagnosis may lead to major health benefits for the patient and their wider family, but is dependent on the accurate interpretation of test results. The processes of variant interpretation are by their nature imprecise such that the potential for uncertain test results (i.e., variant(s) of uncertain significance [VUS]) are an inevitable consequence of genomic testing. With an increased responsibility for diagnostic testing in the hands of the specialty physician (e.g., endocrinologist) rather than clinical geneticist, it is essential that they are familiar with the possible outcomes of testing including an understanding of the VUS category. While uncertainty is endemic to many aspects of clinical medicine, receiving a VUS result may pose a considerable challenge to both the clinician and the patient. In this article, a framework to support decision-making when confronted with a VUS variant is provided, focusing on the key components of the genetic testing pathway. This highlights the importance of assessing the VUS result in the context of the clinical presentation and genetic testing strategy, the value of multidisciplinary team working and ensuring good communication with the patient.
Collapse
Affiliation(s)
- Paul J Newey
- Division of Molecular and Clinical Medicine, Ninewells Hospital & Medical School, University of Dundee, Dundee, Scotland, UK
| |
Collapse
|
40
|
Grant WB, Boucher BJ, Al Anouti F, Pilz S. Comparing the Evidence from Observational Studies and Randomized Controlled Trials for Nonskeletal Health Effects of Vitamin D. Nutrients 2022; 14:3811. [PMID: 36145186 PMCID: PMC9501276 DOI: 10.3390/nu14183811] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 09/13/2022] [Accepted: 09/14/2022] [Indexed: 12/12/2022] Open
Abstract
Although observational studies of health outcomes generally suggest beneficial effects with, or following, higher serum 25-hydroxyvitamin D [25(OH)D] concentrations, randomized controlled trials (RCTs) have generally not supported those findings. Here we review results from observational studies and RCTs regarding how vitamin D status affects several nonskeletal health outcomes, including Alzheimer's disease and dementia, autoimmune diseases, cancers, cardiovascular disease, COVID-19, major depressive disorder, type 2 diabetes, arterial hypertension, all-cause mortality, respiratory tract infections, and pregnancy outcomes. We also consider relevant findings from ecological, Mendelian randomization, and mechanistic studies. Although clear discrepancies exist between findings of observational studies and RCTs on vitamin D and human health benefits these findings should be interpreted cautiously. Bias and confounding are seen in observational studies and vitamin D RCTs have several limitations, largely due to being designed like RCTs of therapeutic drugs, thereby neglecting vitamin D's being a nutrient with a unique metabolism that requires specific consideration in trial design. Thus, RCTs of vitamin D can fail for several reasons: few participants' having low baseline 25(OH)D concentrations, relatively small vitamin D doses, participants' having other sources of vitamin D, and results being analyzed without consideration of achieved 25(OH)D concentrations. Vitamin D status and its relevance for health outcomes can usefully be examined using Hill's criteria for causality in a biological system from results of observational and other types of studies before further RCTs are considered and those findings would be useful in developing medical and public health policy, as they were for nonsmoking policies. A promising approach for future RCT design is adjustable vitamin D supplementation based on interval serum 25(OH)D concentrations to achieve target 25(OH)D levels suggested by findings from observational studies.
Collapse
Affiliation(s)
- William B. Grant
- Sunlight, Nutrition and Health Research Center, San Francisco, CA 94164-1603, USA
| | - Barbara J. Boucher
- The London School of Medicine and Dentistry, The Blizard Institute, Barts, Queen Mary University of London, London E1 2AT, UK
| | - Fatme Al Anouti
- Department of Health Sciences, College of Natural and Health Sciences, Zayed University, Abu Dhabi 144534, United Arab Emirates
| | - Stefan Pilz
- Division of Endocrinology and Diabetology, Department of Internal Medicine, Medical University of Graz, 8036 Graz, Austria
| |
Collapse
|
41
|
Bai N, Lu X, Jin L, Alimujiang M, Ma J, Hu F, Xu Y, Sun J, Xu J, Zhang R, Han J, Hu C, Yang Y. CLSTN3 gene variant associates with obesity risk and contributes to dysfunction in white adipose tissue. Mol Metab 2022; 63:101531. [PMID: 35753632 PMCID: PMC9254126 DOI: 10.1016/j.molmet.2022.101531] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Revised: 06/13/2022] [Accepted: 06/17/2022] [Indexed: 11/29/2022] Open
Abstract
Objective White adipose tissue (WAT) possesses the remarkable remodeling capacity, and maladaptation of this ability contributes to the development of obesity and associated comorbidities. Calsyntenin-3 (CLSTN3) is a transmembrane protein that promotes synapse development in brain. Even though this gene has been reported to be associated with adipose tissue, its role in the regulation of WAT function is unknown yet. We aim to further assess the expression pattern of CLSTN3 gene in human adipose tissue, and investigate its regulatory impact on WAT function. Methods In our study, we observed the expression pattern of Clstn3/CLSTN3 gene in mouse and human WAT. Genetic association study and expression quantitative trait loci analysis were combined to identify the phenotypic effect of CLSTN3 gene variant in humans. This was followed by mouse experiments using adeno-associated virus-mediated human CLSTN3 overexpression in inguinal WAT. We investigated the effect of CLSTN3 on WAT function and overall metabolic homeostasis, as well as the possible underlying molecular mechanism. Results We observed that CLSTN3 gene was routinely expressed in human WAT and predominantly enriched in adipocyte fraction. Furthermore, we identified that the variant rs7296261 in the CLSTN3 locus was associated with a high risk of obesity, and its risk allele was linked to an increase in CLSTN3 expression in human WAT. Overexpression of CLSTN3 in inguinal WAT of mice resulted in diet-induced local dysfunctional expansion, liver steatosis, and systemic metabolic deficiency. In vivo and ex vivo lipolysis assays demonstrated that CLSTN3 overexpression attenuated catecholamine-stimulated lipolysis. Mechanistically, CLSTN3 could interact with amyloid precursor protein (APP) in WAT and increase APP accumulation in mitochondria, which in turn impaired adipose mitochondrial function and promoted obesity. Conclusion Taken together, we provide the evidence for a novel role of CLSTN3 in modulating WAT function, thereby reinforcing the fact that targeting CLSTN3 may be a potential approach for the treatment of obesity and associated metabolic diseases. CLSTN3 is expressed in the adipocyte fraction of human adipose tissue and mainly localizes to the plasma membrane. SNP rs7296261 in human CLSTN3 locus is associated with obesity risk. Overexpression of CLSTN3 leads to adipose tissue dysfunction in mice. CLSTN3 can attenuate catecholamine-stimulated lipolysis. CLSTN3 overexpression increases mitochondrial APP localization of mouse adipose tissue.
Collapse
Affiliation(s)
- Ningning Bai
- Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; Shanghai Clinical Center for Diabetes, Shanghai Key Clinical Center for Metabolic Disease, Shanghai, China; Shanghai Key Laboratory of Diabetes Mellitus, Shanghai Diabetes Institute, Shanghai, China
| | - Xuhong Lu
- Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; Shanghai Clinical Center for Diabetes, Shanghai Key Clinical Center for Metabolic Disease, Shanghai, China; Shanghai Key Laboratory of Diabetes Mellitus, Shanghai Diabetes Institute, Shanghai, China
| | - Li Jin
- Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; Shanghai Clinical Center for Diabetes, Shanghai Key Clinical Center for Metabolic Disease, Shanghai, China; Shanghai Key Laboratory of Diabetes Mellitus, Shanghai Diabetes Institute, Shanghai, China
| | - Miriayi Alimujiang
- Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; Shanghai Clinical Center for Diabetes, Shanghai Key Clinical Center for Metabolic Disease, Shanghai, China; Shanghai Key Laboratory of Diabetes Mellitus, Shanghai Diabetes Institute, Shanghai, China
| | - Jingyuan Ma
- Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; Shanghai Clinical Center for Diabetes, Shanghai Key Clinical Center for Metabolic Disease, Shanghai, China; Shanghai Key Laboratory of Diabetes Mellitus, Shanghai Diabetes Institute, Shanghai, China
| | - Fan Hu
- Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; Shanghai Clinical Center for Diabetes, Shanghai Key Clinical Center for Metabolic Disease, Shanghai, China; Shanghai Key Laboratory of Diabetes Mellitus, Shanghai Diabetes Institute, Shanghai, China
| | - Yuejie Xu
- Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; Shanghai Clinical Center for Diabetes, Shanghai Key Clinical Center for Metabolic Disease, Shanghai, China; Shanghai Key Laboratory of Diabetes Mellitus, Shanghai Diabetes Institute, Shanghai, China
| | - Jingjing Sun
- Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; Shanghai Clinical Center for Diabetes, Shanghai Key Clinical Center for Metabolic Disease, Shanghai, China; Shanghai Key Laboratory of Diabetes Mellitus, Shanghai Diabetes Institute, Shanghai, China
| | - Jun Xu
- Department of Geriatrics, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China
| | - Rong Zhang
- Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; Shanghai Clinical Center for Diabetes, Shanghai Key Clinical Center for Metabolic Disease, Shanghai, China; Shanghai Key Laboratory of Diabetes Mellitus, Shanghai Diabetes Institute, Shanghai, China
| | - Junfeng Han
- Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; Shanghai Clinical Center for Diabetes, Shanghai Key Clinical Center for Metabolic Disease, Shanghai, China; Shanghai Key Laboratory of Diabetes Mellitus, Shanghai Diabetes Institute, Shanghai, China.
| | - Cheng Hu
- Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; Shanghai Clinical Center for Diabetes, Shanghai Key Clinical Center for Metabolic Disease, Shanghai, China; Shanghai Key Laboratory of Diabetes Mellitus, Shanghai Diabetes Institute, Shanghai, China.
| | - Ying Yang
- Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; Shanghai Clinical Center for Diabetes, Shanghai Key Clinical Center for Metabolic Disease, Shanghai, China; Shanghai Key Laboratory of Diabetes Mellitus, Shanghai Diabetes Institute, Shanghai, China.
| |
Collapse
|
42
|
Hou S, Zhang P, Yang K, Wang L, Ma C, Li Y, Li S. Decoding multilevel relationships with the human tissue-cell-molecule network. Brief Bioinform 2022; 23:6585388. [PMID: 35551347 DOI: 10.1093/bib/bbac170] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 04/09/2022] [Accepted: 04/16/2022] [Indexed: 02/01/2023] Open
Abstract
Understanding the biological functions of molecules in specific human tissues or cell types is crucial for gaining insights into human physiology and disease. To address this issue, it is essential to systematically uncover associations among multilevel elements consisting of disease phenotypes, tissues, cell types and molecules, which could pose a challenge because of their heterogeneity and incompleteness. To address this challenge, we describe a new methodological framework, called Graph Local InfoMax (GLIM), based on a human multilevel network (HMLN) that we established by introducing multiple tissues and cell types on top of molecular networks. GLIM can systematically mine the potential relationships between multilevel elements by embedding the features of the HMLN through contrastive learning. Our simulation results demonstrated that GLIM consistently outperforms other state-of-the-art algorithms in disease gene prediction. Moreover, GLIM was also successfully used to infer cell markers and rewire intercellular and molecular interactions in the context of specific tissues or diseases. As a typical case, the tissue-cell-molecule network underlying gastritis and gastric cancer was first uncovered by GLIM, providing systematic insights into the mechanism underlying the occurrence and development of gastric cancer. Overall, our constructed methodological framework has the potential to systematically uncover complex disease mechanisms and mine high-quality relationships among phenotypical, tissue, cellular and molecular elements.
Collapse
Affiliation(s)
- Siyu Hou
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics, Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, 100084 Beijing, China
| | - Peng Zhang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics, Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, 100084 Beijing, China
| | - Kuo Yang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics, Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, 100084 Beijing, China.,School of Computer and Information Technology, Beijing Jiaotong University, Beijing, 100044, China
| | - Lan Wang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics, Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, 100084 Beijing, China
| | - Changzheng Ma
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics, Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, 100084 Beijing, China
| | - Yanda Li
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics, Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, 100084 Beijing, China
| | - Shao Li
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics, Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, 100084 Beijing, China
| |
Collapse
|
43
|
Zhu X, Tang L, Mao J, Hameed Y, Zhang J, Li N, Wu D, Huang Y, Li C. Decoding the Mechanism behind the Pathogenesis of the Focal Segmental Glomerulosclerosis. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:1941038. [PMID: 35693262 PMCID: PMC9175094 DOI: 10.1155/2022/1941038] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 02/26/2022] [Accepted: 03/07/2022] [Indexed: 12/21/2022]
Abstract
Focal segmental glomerulosclerosis (FSGS) is a chronic glomerular disease associated with podocyte injury which is named after the pathologic features of the kidney. The aim of this study is to decode the key changes in gene expression and regulatory network involved in the formation of FSGS. Integrated network analysis included Gene Expression Omnibus (GEO) datasets to identify differentially expressed genes (DEGs) between FSGS patients and healthy donors. Bioinformatics analysis was used to identify the roles of the DEGs and included the development of protein-protein interaction (PPI) networks, Gene Ontology (GO), and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses, and the key modules were assured. The expression levels of DEGs were validated using the additional dataset. Eventually, transcription factors and ceRNA networks were established to illuminate the regulatory relationships in the formation of FSGS. 1130 DEGs including 475 upregulated genes and 655 downregulated genes with functional enrichment analysis were determined. Further analysis uncovered that the validated hub genes were defined as candidate genes, including Complement C3a Receptor 1 (C3AR1), C-C Motif Chemokine Receptor 1(CCR1), C-X3-C Motif Chemokine Ligand 1 (CX3CL1), Melatonin Receptor 1A (MTNR1A), and Purinergic Receptor P2Y13 (P2RY13). More importantly, we identified transcription factors and mRNA-miRNA-lncRNA regulatory networks associated with the candidate genes. The candidate genes and regulatory networks discovered in this study can help to comprehend the molecular mechanism of FSGS and supply potential targets for the diagnosis and therapy of FSGS.
Collapse
Affiliation(s)
- Xiao Zhu
- School of Laboratory Medicine, Hangzhou Medical College, Hangzhou 310053, China
| | - Liping Tang
- The Eighth Medical Center, Chinese PLA General Hospital, Beijing 100091, China
| | - Jingxin Mao
- College of Pharmaceutical Sciences, Southwest University, Chongqing 400715, China
| | - Yasir Hameed
- Department of Biochemistry and Biotechnology, The Islamia University of Bahawalpur, Bahawalpur 63100, Pakistan
| | - Jingyu Zhang
- Southern Marine Science and Engineering Guangdong Laboratory (Zhanjiang), Guangdong Medical University, Zhanjiang 524024, China
| | - Ning Li
- Southern Marine Science and Engineering Guangdong Laboratory (Zhanjiang), Guangdong Medical University, Zhanjiang 524024, China
| | - Danny Wu
- Southern Marine Science and Engineering Guangdong Laboratory (Zhanjiang), Guangdong Medical University, Zhanjiang 524024, China
| | - Yongmei Huang
- Southern Marine Science and Engineering Guangdong Laboratory (Zhanjiang), Guangdong Medical University, Zhanjiang 524024, China
| | - Chen Li
- Department of Biology, Chemistry, Pharmacy, Free University of Berlin, Berlin 14195, Germany
| |
Collapse
|
44
|
Miglioli C, Bakalli G, Orso S, Karemera M, Molinari R, Guerrier S, Mili N. Evidence of antagonistic predictive effects of miRNAs in breast cancer cohorts through data-driven networks. Sci Rep 2022; 12:5166. [PMID: 35338170 PMCID: PMC8956684 DOI: 10.1038/s41598-022-08737-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 03/07/2022] [Indexed: 12/03/2022] Open
Abstract
Non-coding micro RNAs (miRNAs) dysregulation seems to play an important role in the pathways involved in breast cancer occurrence and progression. In different studies, opposite functions may be assigned to the same miRNA, either promoting the disease or protecting from it. Our research tackles the following issues: (i) why aren’t there any concordant findings in many research studies regarding the role of miRNAs in the progression of breast cancer? (ii) could a miRNA have either an activating effect or an inhibiting one in cancer progression according to the other miRNAs with which it interacts? For this purpose, we analyse the AHUS dataset made available on the ArrayExpress platform by Haakensen et al. The breast tissue specimens were collected over 7 years between 2003 and 2009. miRNA-expression profiling was obtained for 55 invasive carcinomas and 70 normal breast tissue samples. Our statistical analysis is based on a recently developed model and feature selection technique which, instead of selecting a single model (i.e. a unique combination of miRNAs), delivers a set of models with equivalent predictive capabilities that allows to interpret and visualize the interaction of these features. As a result, we discover a set of 112 indistinguishable models (in a predictive sense) each with 4 or 5 miRNAs. Within this set, by comparing the model coefficients, we are able to identify three classes of miRNA: (i) oncogenic miRNAs; (ii) protective miRNAs; (iii) undefined miRNAs which can play both an oncogenic and a protective role according to the network with which they interact. These results shed new light on the biological action of miRNAs in breast cancer and may contribute to explain why, in some cases, different studies attribute opposite functions to the same miRNA.
Collapse
Affiliation(s)
- Cesare Miglioli
- University of Geneva, Geneva School of Economics and Management, Geneva, 1205, Switzerland.
| | - Gaetan Bakalli
- Auburn University, Department of Mathematics and Statistics, Auburn, AL, 36849, USA
| | - Samuel Orso
- University of Geneva, Geneva School of Economics and Management, Geneva, 1205, Switzerland
| | - Mucyo Karemera
- Auburn University, Department of Mathematics and Statistics, Auburn, AL, 36849, USA
| | - Roberto Molinari
- Auburn University, Department of Mathematics and Statistics, Auburn, AL, 36849, USA
| | - Stéphane Guerrier
- University of Geneva, Geneva School of Economics and Management, Geneva, 1205, Switzerland.,University of Geneva, Faculty of Science, Geneva, 1211, Switzerland
| | - Nabil Mili
- University of Lausanne, Lausanne, 1015, Switzerland.
| |
Collapse
|
45
|
Carrasco-Reinado R, Bermudez-Sauco M, Escobar-Niño A, Cantoral JM, Fernández-Acero FJ. Development of the "Applied Proteomics" Concept for Biotechnology Applications in Microalgae: Example of the Proteome Data in Nannochloropsis gaditana. Mar Drugs 2021; 20:38. [PMID: 35049892 PMCID: PMC8780095 DOI: 10.3390/md20010038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 12/19/2021] [Accepted: 12/26/2021] [Indexed: 11/23/2022] Open
Abstract
Most of the marine ecosystems on our planet are still unknown. Among these ecosystems, microalgae act as a baseline due to their role as primary producers. The estimated millions of species of these microorganisms represent an almost infinite source of potentially active biocomponents offering unlimited biotechnology applications. This review considers current research in microalgae using the "omics" approach, which today is probably the most important biotechnology tool. These techniques enable us to obtain a large volume of data from a single experiment. The specific focus of this review is proteomics as a technique capable of generating a large volume of interesting information in a single proteomics assay, and particularly the concept of applied proteomics. As an example, this concept has been applied to the study of Nannochloropsis gaditana, in which proteomics data generated are transformed into information of high commercial value by identifying proteins with direct applications in the biomedical and agri-food fields, such as the protein designated UCA01 which presents antitumor activity, obtained from N. gaditana.
Collapse
Affiliation(s)
- Rafael Carrasco-Reinado
- Microbiology Laboratory, Institute of Viticulture and Agri-Food Research (IVAGRO), Marine and Environmental Sciences Faculty, University of Cadiz (UCA), 11500 Puerto Real, Spain; (R.C.-R.); (M.B.-S.); (A.E.-N.); (J.M.C.)
| | - María Bermudez-Sauco
- Microbiology Laboratory, Institute of Viticulture and Agri-Food Research (IVAGRO), Marine and Environmental Sciences Faculty, University of Cadiz (UCA), 11500 Puerto Real, Spain; (R.C.-R.); (M.B.-S.); (A.E.-N.); (J.M.C.)
| | - Almudena Escobar-Niño
- Microbiology Laboratory, Institute of Viticulture and Agri-Food Research (IVAGRO), Marine and Environmental Sciences Faculty, University of Cadiz (UCA), 11500 Puerto Real, Spain; (R.C.-R.); (M.B.-S.); (A.E.-N.); (J.M.C.)
| | - Jesús M. Cantoral
- Microbiology Laboratory, Institute of Viticulture and Agri-Food Research (IVAGRO), Marine and Environmental Sciences Faculty, University of Cadiz (UCA), 11500 Puerto Real, Spain; (R.C.-R.); (M.B.-S.); (A.E.-N.); (J.M.C.)
| | - Francisco Javier Fernández-Acero
- Microbiology Laboratory, Institute of Viticulture and Agri-Food Research (IVAGRO), Marine and Environmental Sciences Faculty, University of Cadiz (UCA), 11500 Puerto Real, Spain; (R.C.-R.); (M.B.-S.); (A.E.-N.); (J.M.C.)
| |
Collapse
|
46
|
Huminiecki Ł. Virtual Gene Concept and a Corresponding Pragmatic Research Program in Genetical Data Science. ENTROPY (BASEL, SWITZERLAND) 2021; 24:17. [PMID: 35052043 PMCID: PMC8774939 DOI: 10.3390/e24010017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 12/02/2021] [Accepted: 12/14/2021] [Indexed: 06/14/2023]
Abstract
Mendel proposed an experimentally verifiable paradigm of particle-based heredity that has been influential for over 150 years. The historical arguments have been reflected in the near past as Mendel's concept has been diversified by new types of omics data. As an effect of the accumulation of omics data, a virtual gene concept forms, giving rise to genetical data science. The concept integrates genetical, functional, and molecular features of the Mendelian paradigm. I argue that the virtual gene concept should be deployed pragmatically. Indeed, the concept has already inspired a practical research program related to systems genetics. The program includes questions about functionality of structural and categorical gene variants, about regulation of gene expression, and about roles of epigenetic modifications. The methodology of the program includes bioinformatics, machine learning, and deep learning. Education, funding, careers, standards, benchmarks, and tools to monitor research progress should be provided to support the research program.
Collapse
Affiliation(s)
- Łukasz Huminiecki
- Evolutionary, Computational, and Statistical Genetics, Department of Molecula Biology, Institute of Genetics and Animal Biotechnology, Polish Academy of Sciences, Postępu 36A, Jastrzębiec, 05-552 Warsaw, Poland
| |
Collapse
|
47
|
Cortes-Figueiredo F, Carvalho FS, Fonseca AC, Paul F, Ferro JM, Schönherr S, Weissensteiner H, Morais VA. From Forensics to Clinical Research: Expanding the Variant Calling Pipeline for the Precision ID mtDNA Whole Genome Panel. Int J Mol Sci 2021; 22:12031. [PMID: 34769461 PMCID: PMC8584537 DOI: 10.3390/ijms222112031] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2021] [Revised: 10/28/2021] [Accepted: 10/29/2021] [Indexed: 02/06/2023] Open
Abstract
Despite a multitude of methods for the sample preparation, sequencing, and data analysis of mitochondrial DNA (mtDNA), the demand for innovation remains, particularly in comparison with nuclear DNA (nDNA) research. The Applied Biosystems™ Precision ID mtDNA Whole Genome Panel (Thermo Fisher Scientific, USA) is an innovative library preparation kit suitable for degraded samples and low DNA input. However, its bioinformatic processing occurs in the enterprise Ion Torrent Suite™ Software (TSS), yielding BAM files aligned to an unorthodox version of the revised Cambridge Reference Sequence (rCRS), with a heteroplasmy threshold level of 10%. Here, we present an alternative customizable pipeline, the PrecisionCallerPipeline (PCP), for processing samples with the correct rCRS output after Ion Torrent sequencing with the Precision ID library kit. Using 18 samples (3 original samples and 15 mixtures) derived from the 1000 Genomes Project, we achieved overall improved performance metrics in comparison with the proprietary TSS, with optimal performance at a 2.5% heteroplasmy threshold. We further validated our findings with 50 samples from an ongoing independent cohort of stroke patients, with PCP finding 98.31% of TSS's variants (TSS found 57.92% of PCP's variants), with a significant correlation between the variant levels of variants found with both pipelines.
Collapse
Affiliation(s)
- Filipe Cortes-Figueiredo
- VMorais Lab—Mitochondria Biology & Neurodegeneration, Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, 1649-028 Lisbon, Portugal; (F.C.-F.); (F.S.C.)
- NeuroCure Clinical Research Center, Charité—Universitätsmedizin Berlin, 10117 Berlin, Germany;
| | - Filipa S. Carvalho
- VMorais Lab—Mitochondria Biology & Neurodegeneration, Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, 1649-028 Lisbon, Portugal; (F.C.-F.); (F.S.C.)
| | - Ana Catarina Fonseca
- José Ferro Lab—Clinical Research in Non-communicable Neurological Diseases, Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, 1649-028 Lisbon, Portugal; (A.C.F.); (J.M.F.)
- Serviço de Neurologia, Hospital de Santa Maria, Centro Hospitalar Universitário Lisboa Norte, 1649-035 Lisbon, Portugal
| | - Friedemann Paul
- NeuroCure Clinical Research Center, Charité—Universitätsmedizin Berlin, 10117 Berlin, Germany;
- Experimental and Clinical Research Center, Charité—Universitätsmedizin Berlin and Max Delbrück Center for Molecular Medicine, 13125 Berlin, Germany
| | - José M. Ferro
- José Ferro Lab—Clinical Research in Non-communicable Neurological Diseases, Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, 1649-028 Lisbon, Portugal; (A.C.F.); (J.M.F.)
- Serviço de Neurologia, Hospital de Santa Maria, Centro Hospitalar Universitário Lisboa Norte, 1649-035 Lisbon, Portugal
| | - Sebastian Schönherr
- Institute of Genetic Epidemiology, Department of Genetics and Pharmacology, Medical University of Innsbruck, 6020 Innsbruck, Austria;
| | - Hansi Weissensteiner
- Institute of Genetic Epidemiology, Department of Genetics and Pharmacology, Medical University of Innsbruck, 6020 Innsbruck, Austria;
| | - Vanessa A. Morais
- VMorais Lab—Mitochondria Biology & Neurodegeneration, Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, 1649-028 Lisbon, Portugal; (F.C.-F.); (F.S.C.)
| |
Collapse
|