1
|
Zhao J, Yao W, Gao H, Kuang Z, Shi L, Wang H, Dang Z. Degenerative Disease Diagnosis and Analysis Based on Tissue Specificity of DNA Methylation. Int J Mol Sci 2025; 26:452. [PMID: 39859168 PMCID: PMC11765164 DOI: 10.3390/ijms26020452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2024] [Revised: 12/28/2024] [Accepted: 01/06/2025] [Indexed: 01/27/2025] Open
Abstract
The tissue specificity of DNA methylation refers to the significant differences in DNA methylation patterns in different tissues. This specificity regulates gene expression, thereby supporting the specific functions of each tissue and the maintenance of normal physiological activities. Abnormal tissue-specific patterns of DNA methylation are closely related to age-related diseases. This abnormal methylation pattern affects the regulation of gene expression, which may lead to changes in cell function and promote the occurrence of pathological conditions. By analyzing the differences in these methylation patterns, key CpG sites for disease diagnosis can be effectively screened. The main goal of this paper is to use the characteristics associated with tissue-specific abnormal expression and disease to construct an age-related disease diagnosis model. First, we combined chi-square tests and logistic regression to identify tissue-specific and disease-specific CpG sites, laying the foundation for accurate medical diagnosis, and verified the biological relevance of these CpG sites through enrichment analysis. Then we used the Transformer model to fit these CpG sites and realized the automatic diagnosis of age-related diseases. Our work proves that the tissue specificity of DNA methylation has the potential to diagnose age-related diseases, and proves the scientific nature of our proposed diagnostic method from a biological perspective.
Collapse
Affiliation(s)
- Jian Zhao
- School of Computer Science and Technology, Changchun University, Changchun 130022, China; (J.Z.); (W.Y.); (H.G.); (Z.D.)
- Jilin Provincial Key Laboratory of Human Health Status Identification Function & Enhancement, Changchun 130022, China;
- Key Laboratory of Intelligent Rehabilitation and Barrier-Free for the Disabled, Changchun University, Ministry of Education, Changchun 130022, China
| | - Wei Yao
- School of Computer Science and Technology, Changchun University, Changchun 130022, China; (J.Z.); (W.Y.); (H.G.); (Z.D.)
- Jilin Provincial Key Laboratory of Human Health Status Identification Function & Enhancement, Changchun 130022, China;
- Key Laboratory of Intelligent Rehabilitation and Barrier-Free for the Disabled, Changchun University, Ministry of Education, Changchun 130022, China
| | - Hanlin Gao
- School of Computer Science and Technology, Changchun University, Changchun 130022, China; (J.Z.); (W.Y.); (H.G.); (Z.D.)
- Jilin Provincial Key Laboratory of Human Health Status Identification Function & Enhancement, Changchun 130022, China;
- Key Laboratory of Intelligent Rehabilitation and Barrier-Free for the Disabled, Changchun University, Ministry of Education, Changchun 130022, China
| | - Zhejun Kuang
- School of Computer Science and Technology, Changchun University, Changchun 130022, China; (J.Z.); (W.Y.); (H.G.); (Z.D.)
- Jilin Provincial Key Laboratory of Human Health Status Identification Function & Enhancement, Changchun 130022, China;
- Key Laboratory of Intelligent Rehabilitation and Barrier-Free for the Disabled, Changchun University, Ministry of Education, Changchun 130022, China
| | - Lijuan Shi
- Jilin Provincial Key Laboratory of Human Health Status Identification Function & Enhancement, Changchun 130022, China;
- Key Laboratory of Intelligent Rehabilitation and Barrier-Free for the Disabled, Changchun University, Ministry of Education, Changchun 130022, China
- College of Electronic Information Engineering, Changchun University, Changchun 130012, China
| | - Han Wang
- School of Information Science and Technology, Institute of Computational Biology, Northeast Normal University, Changchun 130117, China
| | - Zhuozheng Dang
- School of Computer Science and Technology, Changchun University, Changchun 130022, China; (J.Z.); (W.Y.); (H.G.); (Z.D.)
- Jilin Provincial Key Laboratory of Human Health Status Identification Function & Enhancement, Changchun 130022, China;
- Key Laboratory of Intelligent Rehabilitation and Barrier-Free for the Disabled, Changchun University, Ministry of Education, Changchun 130022, China
| |
Collapse
|
2
|
Karaglani M, Agorastos A, Panagopoulou M, Parlapani E, Athanasis P, Bitsios P, Tzitzikou K, Theodosiou T, Iliopoulos I, Bozikas VP, Chatzaki E. A novel blood-based epigenetic biosignature in first-episode schizophrenia patients through automated machine learning. Transl Psychiatry 2024; 14:257. [PMID: 38886359 PMCID: PMC11183091 DOI: 10.1038/s41398-024-02946-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Revised: 05/15/2024] [Accepted: 05/17/2024] [Indexed: 06/20/2024] Open
Abstract
Schizophrenia (SCZ) is a chronic, severe, and complex psychiatric disorder that affects all aspects of personal functioning. While SCZ has a very strong biological component, there are still no objective diagnostic tests. Lately, special attention has been given to epigenetic biomarkers in SCZ. In this study, we introduce a three-step, automated machine learning (AutoML)-based, data-driven, biomarker discovery pipeline approach, using genome-wide DNA methylation datasets and laboratory validation, to deliver a highly performing, blood-based epigenetic biosignature of diagnostic clinical value in SCZ. Publicly available blood methylomes from SCZ patients and healthy individuals were analyzed via AutoML, to identify SCZ-specific biomarkers. The methylation of the identified genes was then analyzed by targeted qMSP assays in blood gDNA of 30 first-episode drug-naïve SCZ patients and 30 healthy controls (CTRL). Finally, AutoML was used to produce an optimized disease-specific biosignature based on patient methylation data combined with demographics. AutoML identified a SCZ-specific set of novel gene methylation biomarkers including IGF2BP1, CENPI, and PSME4. Functional analysis investigated correlations with SCZ pathology. Methylation levels of IGF2BP1 and PSME4, but not CENPI were found to differ, IGF2BP1 being higher and PSME4 lower in the SCZ group as compared to the CTRL group. Additional AutoML classification analysis of our experimental patient data led to a five-feature biosignature including all three genes, as well as age and sex, that discriminated SCZ patients from healthy individuals [AUC 0.755 (0.636, 0.862) and average precision 0.758 (0.690, 0.825)]. In conclusion, this three-step pipeline enabled the discovery of three novel genes and an epigenetic biosignature bearing potential value as promising SCZ blood-based diagnostics.
Collapse
Affiliation(s)
- Makrina Karaglani
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, GR-68132, Alexandroupolis, Greece
- Institute of Agri-food and Life Sciences, University Research & Innovation Center, H.M.U.R.I.C., Hellenic Mediterranean University, GR-71003, Crete, Greece
| | - Agorastos Agorastos
- Institute of Agri-food and Life Sciences, University Research & Innovation Center, H.M.U.R.I.C., Hellenic Mediterranean University, GR-71003, Crete, Greece
- II. Department of Psychiatry, Faculty of Health Sciences, School of Medicine, Aristotle University of Thessaloniki, GR-56430, Thessaloniki, Greece
| | - Maria Panagopoulou
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, GR-68132, Alexandroupolis, Greece
- Institute of Agri-food and Life Sciences, University Research & Innovation Center, H.M.U.R.I.C., Hellenic Mediterranean University, GR-71003, Crete, Greece
| | - Eleni Parlapani
- Ι. Department of Psychiatry, Faculty of Health Sciences, School of Medicine, Aristotle University of Thessaloniki, GR-56429, Thessaloniki, Greece
| | - Panagiotis Athanasis
- II. Department of Psychiatry, Faculty of Health Sciences, School of Medicine, Aristotle University of Thessaloniki, GR-56430, Thessaloniki, Greece
| | - Panagiotis Bitsios
- Department of Psychiatry and Behavioral Sciences, Faculty of Medicine, University of Crete, GR-71500, Heraklion, Greece
| | - Konstantina Tzitzikou
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, GR-68132, Alexandroupolis, Greece
| | - Theodosis Theodosiou
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, GR-68132, Alexandroupolis, Greece
- ABCureD P.C, GR-68131, Alexandroupolis, Greece
| | - Ioannis Iliopoulos
- Division of Basic Sciences, School of Medicine, University of Crete, GR-71003, Heraklion, Greece
| | - Vasilios-Panteleimon Bozikas
- II. Department of Psychiatry, Faculty of Health Sciences, School of Medicine, Aristotle University of Thessaloniki, GR-56430, Thessaloniki, Greece
| | - Ekaterini Chatzaki
- Laboratory of Pharmacology, Department of Medicine, Democritus University of Thrace, GR-68132, Alexandroupolis, Greece.
- Institute of Agri-food and Life Sciences, University Research & Innovation Center, H.M.U.R.I.C., Hellenic Mediterranean University, GR-71003, Crete, Greece.
- ABCureD P.C, GR-68131, Alexandroupolis, Greece.
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology, 70013, Heraklion, Greece.
| |
Collapse
|
3
|
Lakiotaki K, Papadovasilakis Z, Lagani V, Fafalios S, Charonyktakis P, Tsagris M, Tsamardinos I. Automated machine learning for genome wide association studies. Bioinformatics 2023; 39:btad545. [PMID: 37672022 PMCID: PMC10562960 DOI: 10.1093/bioinformatics/btad545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 06/29/2023] [Accepted: 09/05/2023] [Indexed: 09/07/2023] Open
Abstract
MOTIVATION Genome-wide association studies (GWAS) present several computational and statistical challenges for their data analysis, including knowledge discovery, interpretability, and translation to clinical practice. RESULTS We develop, apply, and comparatively evaluate an automated machine learning (AutoML) approach, customized for genomic data that delivers reliable predictive and diagnostic models, the set of genetic variants that are important for predictions (called a biosignature), and an estimate of the out-of-sample predictive power. This AutoML approach discovers variants with higher predictive performance compared to standard GWAS methods, computes an individual risk prediction score, generalizes to new, unseen data, is shown to better differentiate causal variants from other highly correlated variants, and enhances knowledge discovery and interpretability by reporting multiple equivalent biosignatures. AVAILABILITY AND IMPLEMENTATION Code for this study is available at: https://github.com/mensxmachina/autoML-GWAS. JADBio offers a free version at: https://jadbio.com/sign-up/. SNP data can be downloaded from the EGA repository (https://ega-archive.org/). PRS data are found at: https://www.aicrowd.com/challenges/opensnp-height-prediction. Simulation data to study population structure can be found at: https://easygwas.ethz.ch/data/public/dataset/view/1/.
Collapse
Affiliation(s)
| | - Zaharias Papadovasilakis
- Department of Computer Science, University of Crete, Heraklion, Greece
- JADBio Gnosis DA S.A., Science and Technology Park of Crete, GR-70013 Heraklion, Greece
- Laboratory of Immune Regulation and Tolerance, School of Medicine, University of Crete, Heraklion, Greece
| | - Vincenzo Lagani
- Biological and Environmental Sciences and Engineering Division (BESE), King Abdullah University of Science and Technology KAUST, Thuwal 23952, Saudi Arabia
- SDAIA-KAUST Center of Excellence in Data Science and Artificial Intelligence, Thuwal 23952, Saudi Arabia
- Institute of Chemical Biology, Ilia State University, Tbilisi, Georgia
| | - Stefanos Fafalios
- Department of Computer Science, University of Crete, Heraklion, Greece
- JADBio Gnosis DA S.A., Science and Technology Park of Crete, GR-70013 Heraklion, Greece
| | - Paulos Charonyktakis
- JADBio Gnosis DA S.A., Science and Technology Park of Crete, GR-70013 Heraklion, Greece
| | - Michail Tsagris
- Department of Computer Science, University of Crete, Heraklion, Greece
- Department of Economics, University of Crete, Heraklion, Greece
| | - Ioannis Tsamardinos
- Department of Computer Science, University of Crete, Heraklion, Greece
- JADBio Gnosis DA S.A., Science and Technology Park of Crete, GR-70013 Heraklion, Greece
| |
Collapse
|
4
|
Papadakis VM, Cheimonidi C, Panagopoulou M, Karaglani M, Apalaki P, Katsara K, Kenanakis G, Theodosiou T, Constantinidis TC, Stratigi K, Chatzaki E. Label-Free Human Disease Characterization through Circulating Cell-Free DNA Analysis Using Raman Spectroscopy. Int J Mol Sci 2023; 24:12384. [PMID: 37569759 PMCID: PMC10418917 DOI: 10.3390/ijms241512384] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 07/30/2023] [Accepted: 08/01/2023] [Indexed: 08/13/2023] Open
Abstract
Circulating cell-free DNA (ccfDNA) is a liquid biopsy biomaterial attracting significant attention for the implementation of precision medicine diagnostics. Deeper knowledge related to its structure and biology would enable the development of such applications. In this study, we employed Raman spectroscopy to unravel the biomolecular profile of human ccfDNA in health and disease. We established reference Raman spectra of ccfDNA samples from healthy males and females with different conditions, including cancer and diabetes, extracting information about their chemical composition. Comparative observations showed a distinct spectral pattern in ccfDNA from breast cancer patients taking neoadjuvant therapy. Raman analysis of ccfDNA from healthy, prediabetic, and diabetic males uncovered some differences in their biomolecular fingerprints. We also studied ccfDNA released from human benign and cancer cell lines and compared it to their respective gDNA, confirming it mirrors its cellular origin. Overall, we explored for the first time Raman spectroscopy in the study of ccfDNA and provided spectra of samples from different sources. Our findings introduce Raman spectroscopy as a new approach to implementing liquid biopsy diagnostics worthy of further elaboration.
Collapse
Affiliation(s)
- Vassilis M. Papadakis
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology, 70013 Heraklion, Greece; (V.M.P.); (C.C.); (P.A.); (K.S.)
- Department of Industrial Design and Production Engineering, University of West Attica, 12244 Athens, Greece
- Institute of Agri-Food and Life Sciences, University Research & Innovation Center, Hellenic Mediterranean University, 71410 Heraklion, Greece; (M.P.); (M.K.)
| | - Christina Cheimonidi
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology, 70013 Heraklion, Greece; (V.M.P.); (C.C.); (P.A.); (K.S.)
- Institute of Agri-Food and Life Sciences, University Research & Innovation Center, Hellenic Mediterranean University, 71410 Heraklion, Greece; (M.P.); (M.K.)
| | - Maria Panagopoulou
- Institute of Agri-Food and Life Sciences, University Research & Innovation Center, Hellenic Mediterranean University, 71410 Heraklion, Greece; (M.P.); (M.K.)
- Laboratory of Pharmacology, Medical School, Democritus University of Thrace, 68100 Alexandroupolis, Greece;
| | - Makrina Karaglani
- Institute of Agri-Food and Life Sciences, University Research & Innovation Center, Hellenic Mediterranean University, 71410 Heraklion, Greece; (M.P.); (M.K.)
- Laboratory of Pharmacology, Medical School, Democritus University of Thrace, 68100 Alexandroupolis, Greece;
| | - Paraskevi Apalaki
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology, 70013 Heraklion, Greece; (V.M.P.); (C.C.); (P.A.); (K.S.)
| | - Klytaimnistra Katsara
- Institute of Electronic Structure and Laser, Foundation for Research and Technology-Hellas, N. Plastira 100, Vasilika Vouton, 70013 Heraklion, Greece (G.K.)
- Department of Agriculture, Hellenic Mediterranean University—Hellas, Estavromenos, 71410 Heraklion, Greece
| | - George Kenanakis
- Institute of Electronic Structure and Laser, Foundation for Research and Technology-Hellas, N. Plastira 100, Vasilika Vouton, 70013 Heraklion, Greece (G.K.)
| | - Theodosis Theodosiou
- Laboratory of Pharmacology, Medical School, Democritus University of Thrace, 68100 Alexandroupolis, Greece;
| | - Theodoros C. Constantinidis
- Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupolis, Greece;
| | - Kalliopi Stratigi
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology, 70013 Heraklion, Greece; (V.M.P.); (C.C.); (P.A.); (K.S.)
| | - Ekaterini Chatzaki
- Institute of Agri-Food and Life Sciences, University Research & Innovation Center, Hellenic Mediterranean University, 71410 Heraklion, Greece; (M.P.); (M.K.)
- Laboratory of Pharmacology, Medical School, Democritus University of Thrace, 68100 Alexandroupolis, Greece;
| |
Collapse
|
5
|
Humardani FM, Mulyanata LT, Dwi Putra SE. Adipose cell-free DNA in diabetes. Clin Chim Acta 2023; 539:191-197. [PMID: 36549639 DOI: 10.1016/j.cca.2022.12.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2022] [Revised: 12/08/2022] [Accepted: 12/09/2022] [Indexed: 12/24/2022]
Abstract
Cancer-associated necrosis is a well-known source of cell-free DNA (cfDNA). However, the origins of cfDNA are not strictly limited to cancer. Additionally, dietary exposure induces apoptosis-induced proliferation in adipocytes, leading to the release of cfDNA. The genetic information derived from cfDNA as a result of apoptosis-induced proliferation contains specific methylation patterns in adipose tissue that can be used as a marker to detect the risk of developing Type 2 diabetes Mellitus (T2DM) in the future. cfDNA is superior to peripheral blood leukocytes (PBL) and whole blood samples for reflecting tissue pathology due to the frequent use of PBL and whole blood samples that do not match tissue pathology. The difficulty of demonstrating that cfDNA is derived from adipose tissue. We propose several promising techniques by analyzing cfDNA derived from adipose tissue to detect T2DM risk. First, adipose-specific genes such as ADIPOQ and Leptin were utilized. Second, MCTA-Seq, EpiSCORE, deconvolution, multiplexing, and automated machine learning (AutoML) were used to determine the proportion of total methylation in related genes.
Collapse
Affiliation(s)
| | | | - Sulistyo Emantoko Dwi Putra
- Department of Biology, Faculty of Biotechnology, University of Surabaya, Surabaya, Indonesia; Raya Kalingrungkut Road, Kali Rungkut, State of Rungkut, Surabaya City, East Java 60293, Indonesia.
| |
Collapse
|
6
|
Prediction and Ranking of Biomarkers Using multiple UniReD. Int J Mol Sci 2022; 23:ijms231911112. [PMID: 36232413 PMCID: PMC9569535 DOI: 10.3390/ijms231911112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 09/06/2022] [Accepted: 09/17/2022] [Indexed: 11/23/2022] Open
Abstract
Protein–protein interactions (PPIs) are of key importance for understanding how cells and organisms function. Thus, in recent decades, many approaches have been developed for the identification and discovery of such interactions. These approaches addressed the problem of PPI identification either by an experimental point of view or by a computational one. Here, we present an updated version of UniReD, a computational prediction tool which takes advantage of biomedical literature aiming to extract documented, already published protein associations and predict undocumented ones. The usefulness of this computational tool has been previously evaluated by experimentally validating predicted interactions and by benchmarking it against public databases of experimentally validated PPIs. In its updated form, UniReD allows the user to provide a list of proteins of known implication in, e.g., a particular disease, as well as another list of proteins that are potentially associated with the proteins of the first list. UniReD then automatically analyzes both lists and ranks the proteins of the second list by their association with the proteins of the first list, thus serving as a potential biomarker discovery/validation tool.
Collapse
|
7
|
Oberhofer A, Bronkhorst AJ, Uhlig C, Ungerer V, Holdenrieder S. Tracing the Origin of Cell-Free DNA Molecules through Tissue-Specific Epigenetic Signatures. Diagnostics (Basel) 2022; 12:diagnostics12081834. [PMID: 36010184 PMCID: PMC9406971 DOI: 10.3390/diagnostics12081834] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 07/15/2022] [Accepted: 07/25/2022] [Indexed: 12/11/2022] Open
Abstract
All cell and tissue types constantly release DNA fragments into human body fluids by various mechanisms including programmed cell death, accidental cell degradation and active extrusion. Particularly, cell-free DNA (cfDNA) in plasma or serum has been utilized for minimally invasive molecular diagnostics. Disease onset or pathological conditions that lead to increased cell death alter the contribution of different tissues to the total pool of cfDNA. Because cfDNA molecules retain cell-type specific epigenetic features, it is possible to infer tissue-of-origin from epigenetic characteristics. Recent research efforts demonstrated that analysis of, e.g., methylation patterns, nucleosome occupancy, and fragmentomics determined the cell- or tissue-of-origin of individual cfDNA molecules. This novel tissue-of origin-analysis enables to estimate the contributions of different tissues to the total cfDNA pool in body fluids and find tissues with increased cell death (pathologic condition), expanding the portfolio of liquid biopsies towards a wide range of pathologies and early diagnosis. In this review, we summarize the currently available tissue-of-origin approaches and point out the next steps towards clinical implementation.
Collapse
|