1
|
Park JW, Rhee JK. Integrative Analysis of ATAC-Seq and RNA-Seq through Machine Learning Identifies 10 Signature Genes for Breast Cancer Intrinsic Subtypes. BIOLOGY 2024; 13:799. [PMID: 39452108 PMCID: PMC11505269 DOI: 10.3390/biology13100799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2024] [Revised: 09/28/2024] [Accepted: 10/05/2024] [Indexed: 10/26/2024]
Abstract
Breast cancer is a heterogeneous disease composed of various biologically distinct subtypes, each characterized by unique molecular features. Its formation and progression involve a complex, multistep process that includes the accumulation of numerous genetic and epigenetic alterations. Although integrating RNA-seq transcriptome data with ATAC-seq epigenetic information provides a more comprehensive understanding of gene regulation and its impact across different conditions, no classification model has yet been developed for breast cancer intrinsic subtypes based on such integrative analyses. In this study, we employed machine learning algorithms to predict intrinsic subtypes through the integrative analysis of ATAC-seq and RNA-seq data. We identified 10 signature genes (CDH3, ERBB2, TYMS, GREB1, OSR1, MYBL2, FAM83D, ESR1, FOXC1, and NAT1) using recursive feature elimination with cross-validation (RFECV) and a support vector machine (SVM) based on SHAP (SHapley Additive exPlanations) feature importance. Furthermore, we found that these genes were primarily associated with immune responses, hormone signaling, cancer progression, and cellular proliferation.
Collapse
Affiliation(s)
| | - Je-Keun Rhee
- Department of Bioinformatics & Life Science, Soongsil University, Seoul 06987, Republic of Korea;
| |
Collapse
|
2
|
Ye B, Ji H, Zhu M, Wang A, Tang J, Liang Y, Zhang Q. Single-cell sequencing reveals novel proliferative cell type: a key player in renal cell carcinoma prognosis and therapeutic response. Clin Exp Med 2024; 24:167. [PMID: 39052149 PMCID: PMC11272756 DOI: 10.1007/s10238-024-01424-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Accepted: 07/02/2024] [Indexed: 07/27/2024]
Abstract
Renal cell carcinoma (RCC) is characterized by a variety of subtypes, each defined by unique genetic and morphological features. This study utilizes single-cell RNA sequencing to explore the molecular heterogeneity of RCC. A highly proliferative cell subset, termed as "Prol," was discovered within RCC tumors, and its increased presence was linked to poorer patient outcomes. An artificial intelligence network, encompassing traditional regression, machine learning, and deep learning algorithms, was employed to develop a Prol signature capable of predicting prognosis. The signature demonstrated superior performance in predicting RCC prognosis compared to other signatures and exhibited pan-cancer prognostic capabilities. RCC patients with high Prol signature scores exhibited resistance to targeted therapies and immunotherapies. Furthermore, the key gene CEP55 from the Prol signature was validated by both proteinomics and quantitative real time polymerase chain reaction. Our findings may provide new insights into the molecular and cellular mechanisms of RCC and facilitate the development of novel biomarkers and therapeutic targets.
Collapse
Affiliation(s)
- Bicheng Ye
- School of Clinical Medicine, Yangzhou Polytechnic College, Yangzhou, China
- Department of Urology, Changhai Hospital, Naval Medical University (Second Military Medical University), Shanghai, China
| | - Hongsheng Ji
- Department of Urology, Lianshui People's Hospital of Kangda College Affiliated to Nanjing Medical University, Huai'an, China
| | - Meng Zhu
- Department of Geriatrics, The Affiliated Huaian Hospital of Xuzhou Medical University, Huaian Second People's Hospital, Huaian, China
| | - Anbang Wang
- Department of Urology, Changhai Hospital, Naval Medical University (Second Military Medical University), Shanghai, China
| | - Jingsong Tang
- Department of General Surgery, Northern Jiangsu People's Hospital Affiliated to Yangzhou University, Yangzhou, China.
| | - Yong Liang
- Department of Medical Laboratory, Huai'an Second People's Hospital Affiliated to Xuzhou Medical Universit, Huaian, China.
| | - Qing Zhang
- Department of Hepatology, Huai'an No. 4 People's Hospital, Huai'an, China.
| |
Collapse
|
3
|
Platz JJ, Bryan DS, Naunheim KS, Ferguson MK. Chatbot Reliability in Managing Thoracic Surgical Clinical Scenarios. Ann Thorac Surg 2024; 118:275-281. [PMID: 38574939 DOI: 10.1016/j.athoracsur.2024.03.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 02/02/2024] [Accepted: 03/12/2024] [Indexed: 04/06/2024]
Abstract
BACKGROUND Chatbot use in medicine is growing, and concerns have been raised regarding their accuracy. This study assessed the performance of 4 different chatbots in managing thoracic surgical clinical scenarios. METHODS Topic domains were identified and clinical scenarios were developed within each domain. Each scenario included 3 stems using Key Feature methods related to diagnosis, evaluation, and treatment. Twelve scenarios were presented to ChatGPT-4 (OpenAI), Bard (recently renamed Gemini; Google), Perplexity (Perplexity AI), and Claude 2 (Anthropic) in 3 separate runs. Up to 1 point was awarded for each stem, yielding a potential of 3 points per scenario. Critical failures were identified before scoring; if they occurred, the stem and overall scenario scores were adjusted to 0. We arbitrarily established a threshold of ≥2 points mean adjusted score per scenario as a passing grade and established a critical fail rate of ≥30% as failure to pass. RESULTS The bot performances varied considerably within each run, and their overall performance was a fail on all runs (critical mean scenario fails of 83%, 71%, and 71%). The bots trended toward "learning" from the first to the second run, but without improvement in overall raw (1.24 ± 0.47 vs 1.63 ± 0.76 vs 1.51 ± 0.60; P = .29) and adjusted (0.44 ± 0.54 vs 0.80 ± 0.94 vs 0.76 ± 0.81; P = .48) scenario scores after all runs. CONCLUSIONS Chatbot performance in managing clinical scenarios was insufficient to provide reliable assistance. This is a cautionary note against reliance on the current accuracy of chatbots in complex thoracic surgery medical decision making.
Collapse
Affiliation(s)
- Joseph J Platz
- Division of Thoracic Surgery, Department of Surgery, St. Louis University, St Louis, Missouri.
| | - Darren S Bryan
- Division of Thoracic Surgery, Department of Surgery, University of Chicago, Chicago, Illinois
| | - Keith S Naunheim
- Division of Thoracic Surgery, Department of Surgery, St. Louis University, St Louis, Missouri
| | - Mark K Ferguson
- Division of Thoracic Surgery, Department of Surgery, University of Chicago, Chicago, Illinois
| |
Collapse
|
4
|
Ye B, Li Z, Wang Q. A novel artificial intelligence network to assess the prognosis of gastrointestinal cancer to immunotherapy based on genetic mutation features. Front Immunol 2024; 15:1428529. [PMID: 38994371 PMCID: PMC11236566 DOI: 10.3389/fimmu.2024.1428529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Accepted: 06/14/2024] [Indexed: 07/13/2024] Open
Abstract
Background Immune checkpoint inhibitors (ICIs) have revolutionized gastrointestinal cancer treatment, yet the absence of reliable biomarkers hampers precise patient response prediction. Methods We developed and validated a genomic mutation signature (GMS) employing a novel artificial intelligence network to forecast the prognosis of gastrointestinal cancer patients undergoing ICIs therapy. Subsequently, we explored the underlying immune landscapes across different subtypes using multiomics data. Finally, UMI-77 was pinpointed through the analysis of drug sensitization data from the Genomics of Drug Sensitivity in Cancer (GDSC) database. The sensitivity of UMI-77 to the AGS and MKN45 cell lines was evaluated using the cell counting kit-8 (CCK8) assay and the plate clone formation assay. Results Using the artificial intelligence network, we developed the GMS that independently predicts the prognosis of gastrointestinal cancer patients. The GMS demonstrated consistent performance across three public cohorts and exhibited high sensitivity and specificity for 6, 12, and 24-month overall survival (OS) in receiver operating characteristic (ROC) curve analysis. It outperformed conventional clinical and molecular features. Low-risk samples showed a higher presence of cytolytic immune cells and enhanced immunogenic potential compared to high-risk samples. Additionally, we identified the small molecule compound UMI-77. The half-maximal inhibitory concentration (IC50) of UMI-77 was inversely related to the GMS. Notably, the AGS cell line, classified as high-risk, displayed greater sensitivity to UMI-77, whereas the MKN45 cell line, classified as low-risk, showed less sensitivity. Conclusion The GMS developed here can reliably predict survival benefit for gastrointestinal cancer patients on ICIs therapy.
Collapse
Affiliation(s)
- Bicheng Ye
- School of Clinical Medicine, Yangzhou Polytechnic College, Yangzhou, China
| | - Zhongyan Li
- Department of Geriatric Medicine, Huai'an Hospital Affiliated to Yangzhou University (The Fifth People's Hospital of Huai'an), Huai'an, China
| | - Qiqi Wang
- Department of Gastroenterology, Wenzhou Central Hospital, Wenzhou, China
- Department of Gastroenterology, The Dingli Clinical College of Wenzhou Medical University, Wenzhou, China
- Department of Gastroenterology, The Second Afliated Hospital of Shanghai University, Wenzhou, China
| |
Collapse
|
5
|
Roig-Genoves JV, García-Giménez JL, Mena-Molla S. A miRNA-based epigenetic molecular clock for biological skin-age prediction. Arch Dermatol Res 2024; 316:326. [PMID: 38822910 PMCID: PMC11144124 DOI: 10.1007/s00403-024-03129-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 04/27/2024] [Accepted: 05/02/2024] [Indexed: 06/03/2024]
Abstract
Skin aging is one of the visible characteristics of the aging process in humans. In recent years, different biological clocks have been generated based on protein or epigenetic markers, but few have focused on biological age in the skin. Arrest the aging process or even being able to restore an organism from an older to a younger stage is one of the main challenges in the last 20 years in biomedical research. We have implemented several machine learning models, including regression and classification algorithms, in order to create an epigenetic molecular clock based on miRNA expression profiles of healthy subjects to predict biological age-related to skin. Our best models are capable of classifying skin samples according to age groups (18-28; 29-39; 40-50; 51-60 or 61-83 years old) with an accuracy of 80% or predict age with a mean absolute error of 10.89 years using the expression levels of 1856 unique miRNAs. Our results suggest that this kind of epigenetic clocks arises as a promising tool with several applications in the pharmaco-cosmetic industry.
Collapse
Affiliation(s)
| | - José Luis García-Giménez
- Consortium Center for Biomedical Network Research on Rare Diseases (CIBERER), Institute of Health Carlos III, Valencia, 46010, Spain
- INCLIVA Health Research Institute, INCLIVA, Valencia, 46010, Spain
- EpiDisease S.L (Spin-off from the CIBER-ISCIII), Parc Científic de la Universitat de Valencia, Paterna, 46980, Spain
- Department of Physiology, Faculty of Pharmacy, University of Valencia, Burjassot, 46100, Spain
| | - Salvador Mena-Molla
- INCLIVA Health Research Institute, INCLIVA, Valencia, 46010, Spain.
- EpiDisease S.L (Spin-off from the CIBER-ISCIII), Parc Científic de la Universitat de Valencia, Paterna, 46980, Spain.
- Department of Physiology, Faculty of Pharmacy, University of Valencia, Burjassot, 46100, Spain.
| |
Collapse
|
6
|
Kuraz Abebe B, Wang J, Guo J, Wang H, Li A, Zan L. A review of the role of epigenetic studies for intramuscular fat deposition in beef cattle. Gene 2024; 908:148295. [PMID: 38387707 DOI: 10.1016/j.gene.2024.148295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 01/23/2024] [Accepted: 02/15/2024] [Indexed: 02/24/2024]
Abstract
Intramuscular fat (IMF) deposition profoundly influences meat quality and economic value in beef cattle production. Meanwhile, contemporary developments in epigenetics have opened new outlooks for understanding the molecular basics of IMF regulation, and it has become a key area of research for world scholars. Therefore, the aim of this paper was to provide insight and synthesis into the intricate relationship between epigenetic mechanisms and IMF deposition in beef cattle. The methodology involves a thorough analysis of existing literature, including pertinent books, academic journals, and online resources, to provide a comprehensive overview of the role of epigenetic studies in IMF deposition in beef cattle. This review summarizes the contemporary studies in epigenetic mechanisms in IMF regulation, high-resolution epigenomic mapping, single-cell epigenomics, multi-omics integration, epigenome editing approaches, longitudinal studies in cattle growth, environmental epigenetics, machine learning in epigenetics, ethical and regulatory considerations, and translation to industry practices from perspectives of IMF deposition in beef cattle. Moreover, this paper highlights DNA methylation, histone modifications, acetylation, phosphorylation, ubiquitylation, non-coding RNAs, DNA hydroxymethylation, epigenetic readers, writers, and erasers, chromatin immunoprecipitation followed by sequencing, whole genome bisulfite sequencing, epigenome-wide association studies, and their profound impact on the expression of crucial genes governing adipogenesis and lipid metabolism. Nutrition and stress also have significant influences on epigenetic modifications and IMF deposition. The key findings underscore the pivotal role of epigenetic studies in understanding and enhancing IMF deposition in beef cattle, with implications for precision livestock farming and ethical livestock management. In conclusion, this review highlights the crucial significance of epigenetic pathways and environmental factors in affecting IMF deposition in beef cattle, providing insightful information for improving the economics and meat quality of cattle production.
Collapse
Affiliation(s)
- Belete Kuraz Abebe
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China; Department of Animal Science, Werabe University, P.O. Box 46, Werabe, Ethiopia
| | - Jianfang Wang
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China
| | - Juntao Guo
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China
| | - Hongbao Wang
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China
| | - Anning Li
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China
| | - Linsen Zan
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China; National Beef Cattle Improvement Center, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China.
| |
Collapse
|
7
|
Wu Z, Guo K, Luo E, Wang T, Wang S, Yang Y, Zhu X, Ding R. Medical long-tailed learning for imbalanced data: Bibliometric analysis. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 247:108106. [PMID: 38452661 DOI: 10.1016/j.cmpb.2024.108106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 02/24/2024] [Accepted: 02/26/2024] [Indexed: 03/09/2024]
Abstract
BACKGROUND In the last decade, long-tail learning has become a popular research focus in deep learning applications in medicine. However, no scientometric reports have provided a systematic overview of this scientific field. We utilized bibliometric techniques to identify and analyze the literature on long-tailed learning in deep learning applications in medicine and investigate research trends, core authors, and core journals. We expanded our understanding of the primary components and principal methodologies of long-tail learning research in the medical field. METHODS Web of Science was utilized to collect all articles on long-tailed learning in medicine published until December 2023. The suitability of all retrieved titles and abstracts was evaluated. For bibliometric analysis, all numerical data were extracted. CiteSpace was used to create clustered and visual knowledge graphs based on keywords. RESULTS A total of 579 articles met the evaluation criteria. Over the last decade, the annual number of publications and citation frequency both showed significant growth, following a power-law and exponential trend, respectively. Noteworthy contributors to this field include Husanbir Singh Pannu, Fadi Thabtah, and Talha Mahboob Alam, while leading journals such as IEEE ACCESS, COMPUTERS IN BIOLOGY AND MEDICINE, IEEE TRANSACTIONS ON MEDICAL IMAGING, and COMPUTERIZED MEDICAL IMAGING AND GRAPHICS have emerged as pivotal platforms for disseminating research in this area. The core of long-tailed learning research within the medical domain is encapsulated in six principal themes: deep learning for imbalanced data, model optimization, neural networks in image analysis, data imbalance in health records, CNN in diagnostics and risk assessment, and genetic information in disease mechanisms. CONCLUSION This study summarizes recent advancements in applying long-tail learning to deep learning in medicine through bibliometric analysis and visual knowledge graphs. It explains new trends, sources, core authors, journals, and research hotspots. Although this field has shown great promise in medical deep learning research, our findings will provide pertinent and valuable insights for future research and clinical practice.
Collapse
Affiliation(s)
- Zheng Wu
- School of Information Engineering, Hunan University of Science and Engineering, Yongzhou 425199, China.
| | - Kehua Guo
- School of Computer Science and Engineering, Central South University, Changsha 410083, China.
| | - Entao Luo
- School of Information Engineering, Hunan University of Science and Engineering, Yongzhou 425199, China.
| | - Tian Wang
- BNU-UIC Institute of Artificial Intelligence and Future Networks, Beijing Normal University (BNU Zhuhai), Zhuhai, China.
| | - Shoujin Wang
- Data Science Institute, University of Technology Sydney, Sydney, Australia.
| | - Yi Yang
- Department of Computer Science, Northeastern Illinois University, Chicago, IL 60625, USA.
| | - Xiangyuan Zhu
- School of Computer Science and Engineering, Central South University, Changsha 410083, China.
| | - Rui Ding
- School of Computer Science and Engineering, Central South University, Changsha 410083, China.
| |
Collapse
|
8
|
Li S, Ma L, Cui R. Identification of Novel Diagnostic Biomarkers and Classification Patterns for Osteoarthritis by Analyzing a Specific Set of Genes Related to Inflammation. Inflammation 2023; 46:2193-2208. [PMID: 37462886 DOI: 10.1007/s10753-023-01871-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2023] [Revised: 06/14/2023] [Accepted: 07/03/2023] [Indexed: 11/25/2023]
Abstract
Osteoarthritis (OA) is a prevalent joint disease globally. TNFA is recognized as a crucial inflammatory cytokine that plays a significant role in the pathophysiological mechanisms that occur during the progression of OA. However, the TNFA_SIGNALING_VIA_NFKB (TSVN)-related genes (TRGs) during the progression of OA remain unclear. By conducting a combinatory analysis of OA transcriptome data from three datasets, various differentially expressed TRGs were identified. The logistic regression model was used to mine hub TRGs for OA, and a nomogram prediction model was subsequently constructed using these TRGs. To identify new molecular subgroups, we performed consensus clustering. We then conducted functional analyses, including GO, KEGG, GSVA, and GSEA, to elucidate the underlying mechanisms. To determine the immune microenvironment, we applied xCell. The logistic regression analysis identified three hub TRGs (BHLHE40, BTG2, and CCNL1) as potential biomarkers for OA. Based on these TRGs, we constructed an OA predictive model. This model has demonstrated promising results in enhancing the accuracy of OA diagnosis, as evident from the ROC analysis (AUC merged dataset = 0.937, AUC validating dataset = 0.924). We identified two molecular subtypes, C1 and C2, and found that the C1 subtype showed activation of immune- and inflammation-related pathways. The involvement of TSVN in the development and progression of OA has been established. We identified several hub genes, such as BHLHE40, BTG2, and CCNL1, that may have a significant association with the progression of OA. Furthermore, our logistic regression model based on these genes has shown promising results in accurately diagnosing OA patients.
Collapse
Affiliation(s)
- Songsheng Li
- Orthopaedics Department III (Joint), The Fifth Clinical Medical College of Henan University of Chinese Medicine, Zhengzhou, China.
| | - Lige Ma
- Orthopaedics Department III (Joint), The Fifth Clinical Medical College of Henan University of Chinese Medicine, Zhengzhou, China
| | - Ruikai Cui
- Orthopaedics Department III (Joint), The Fifth Clinical Medical College of Henan University of Chinese Medicine, Zhengzhou, China
| |
Collapse
|
9
|
Mavaie P, Holder L, Skinner MK. Hybrid deep learning approach to improve classification of low-volume high-dimensional data. BMC Bioinformatics 2023; 24:419. [PMID: 37936066 PMCID: PMC10631218 DOI: 10.1186/s12859-023-05557-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 11/01/2023] [Indexed: 11/09/2023] Open
Abstract
BACKGROUND The performance of machine learning classification methods relies heavily on the choice of features. In many domains, feature generation can be labor-intensive and require domain knowledge, and feature selection methods do not scale well in high-dimensional datasets. Deep learning has shown success in feature generation but requires large datasets to achieve high classification accuracy. Biology domains typically exhibit these challenges with numerous handcrafted features (high-dimensional) and small amounts of training data (low volume). METHOD A hybrid learning approach is proposed that first trains a deep network on the training data, extracts features from the deep network, and then uses these features to re-express the data for input to a non-deep learning method, which is trained to perform the final classification. RESULTS The approach is systematically evaluated to determine the best layer of the deep learning network from which to extract features and the threshold on training data volume that prefers this approach. Results from several domains show that this hybrid approach outperforms standalone deep and non-deep learning methods, especially on low-volume, high-dimensional datasets. The diverse collection of datasets further supports the robustness of the approach across different domains. CONCLUSIONS The hybrid approach combines the strengths of deep and non-deep learning paradigms to achieve high performance on high-dimensional, low volume learning tasks that are typical in biology domains.
Collapse
Affiliation(s)
- Pegah Mavaie
- School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA, 99164, USA
| | - Lawrence Holder
- School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA, 99164, USA
| | - Michael K Skinner
- School of Biological Sciences, Center for Reproductive Biology, Washington State University, Pullman, WA, 99164-4236, USA.
| |
Collapse
|
10
|
Feltes BC, Ligabue-Braun R, Dorn M. Editorial: Computational and integrative approaches for developmental biology and molecular evolution. Front Genet 2023; 14:1252328. [PMID: 37519892 PMCID: PMC10382133 DOI: 10.3389/fgene.2023.1252328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Accepted: 07/10/2023] [Indexed: 08/01/2023] Open
Affiliation(s)
- Bruno César Feltes
- Department of Biophysics, Institute of Biosciences, Federal University of Rio Grande do Sul, Porto Alegre, Brazil
| | - Rodrigo Ligabue-Braun
- Department of Pharmacosciences, Federal University of Health Sciences of Porto Alegre, Porto Alegre, Brazil
| | - Márcio Dorn
- Department of Theoretical Informatics, Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, Brazil
- Center of Biotechnology, Federal University of Rio Grande do Sul, Porto Alegre, Brazil
- National Institute of Science and Technology: Forensics, Porto Alegre, Brazil
| |
Collapse
|
11
|
Chen B, Jiao Z, Shen T, Fan R, Chen Y, Xu Z. Early antidepressant treatment response prediction in major depression using clinical and TPH2 DNA methylation features based on machine learning approaches. BMC Psychiatry 2023; 23:299. [PMID: 37127594 PMCID: PMC10150459 DOI: 10.1186/s12888-023-04791-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Accepted: 04/16/2023] [Indexed: 05/03/2023] Open
Abstract
OBJECTIVE To identify DNA methylation and clinical features, and to construct machine learning classifiers to assign the patients with major depressive disorder (MDD) into responders and non-responders after a 2-week treatment into responders and non-responders. METHOD Han Chinese patients (291 in total) with MDD comprised the study population. Datasets contained demographic information, environment stress factors, and the methylation levels of 38 methylated sites of tryptophan hydroxylase 2 (TPH2) genes in peripheral blood samples. Recursive Feature Elimination (RFE) was employed to select features. Five classification algorithms (logistic regression, classification and regression trees, support vector machine, logitboost and random forests) were used to establish the models. Performance metrics (AUC, F-Measure, G-Mean, accuracy, sensitivity, specificity, positive predictive value and negative predictive value) were computed with 5-fold-cross-validation. Variable importance was evaluated by random forest algorithm. RESULT RF with RFE outperformed the other models in our samples based on the demographic information and clinical features (AUC = 61.2%, 95%CI: 60.1-62.4%) / TPH2 CpGs features (AUC = 66.6%, 95%CI: 65.4-67.8%) / both clinical and TPH2 CpGs features (AUC = 72.9%, 95%CI: 71.8-74.0%). CONCLUSION The effects of TPH2 on the early-stage antidepressant response were explored by machine learning algorithms. On the basis of the baseline depression severity and TPH2 CpG sites, machine learning approaches can enhance our ability to predict the early-stage antidepressant response. Some potentially important predictors (e.g., TPH2-10-60 (rs2129575), TPH2-2-163 (rs11178998), age of first onset, age) in early-stage treatment response could be utilized in future fundamental research, drug development and clinical practice.
Collapse
Affiliation(s)
- Bingwei Chen
- Department of Epidemiology and Biostatistics, School of Public health, Southeast University, Nanjing, 210009, China.
| | - Zhigang Jiao
- Department of Epidemiology and Biostatistics, School of Public health, Southeast University, Nanjing, 210009, China.
| | - Tian Shen
- Department of Psychosomatics and Psychiatry, School of Medicine, Zhongda Hospital, Southeast University, Nanjing, 210009, China
| | - Ru Fan
- Department of Epidemiology and Biostatistics, School of Public health, Southeast University, Nanjing, 210009, China
| | - Yuqi Chen
- Department of Epidemiology and Biostatistics, School of Public health, Southeast University, Nanjing, 210009, China
- Department of Occupational Health and Poisoning Control, Shanghai Municipal Center for Disease Control and Prevention, Shanghai, 200336, China
| | - Zhi Xu
- Department of Psychosomatics and Psychiatry, School of Medicine, Zhongda Hospital, Southeast University, Nanjing, 210009, China
| |
Collapse
|
12
|
Marino N, Putignano G, Cappilli S, Chersoni E, Santuccione A, Calabrese G, Bischof E, Vanhaelen Q, Zhavoronkov A, Scarano B, Mazzotta AD, Santus E. Towards AI-driven longevity research: An overview. FRONTIERS IN AGING 2023; 4:1057204. [PMID: 36936271 PMCID: PMC10018490 DOI: 10.3389/fragi.2023.1057204] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 02/06/2023] [Indexed: 03/06/2023]
Abstract
While in the past technology has mostly been utilized to store information about the structural configuration of proteins and molecules for research and medical purposes, Artificial Intelligence is nowadays able to learn from the existing data how to predict and model properties and interactions, revealing important knowledge about complex biological processes, such as aging. Modern technologies, moreover, can rely on a broader set of information, including those derived from the next-generation sequencing (e.g., proteomics, lipidomics, and other omics), to understand the interactions between human body and the external environment. This is especially relevant as external factors have been shown to have a key role in aging. As the field of computational systems biology keeps improving and new biomarkers of aging are being developed, artificial intelligence promises to become a major ally of aging research.
Collapse
Affiliation(s)
- Nicola Marino
- Women’s Brain Project (WBP), Gunterhausen, Switzerland
| | | | - Simone Cappilli
- Dermatology, Catholic University of the Sacred Heart, Rome, Italy
- UOC of Dermatology, Department of Abdominal and Endocrine Metabolic Medical and Surgical Sciences, A. Gemelli University Hospital Foundation-IRCCS, Rome, Italy
| | - Emmanuele Chersoni
- Department of Chinese and Bilingual Studies, The Hong Kong Polytechnic University, Hong Kong, China
| | | | - Giuliana Calabrese
- Department of Translational Medicine and Surgery, CatholicUniversity of the Sacred Heart, Rome, Italy
| | - Evelyne Bischof
- Insilico Medicine Hong Kong Ltd., New Territories, Hong Kong SAR, China
| | - Quentin Vanhaelen
- Insilico Medicine Hong Kong Ltd., New Territories, Hong Kong SAR, China
| | - Alex Zhavoronkov
- Insilico Medicine Hong Kong Ltd., New Territories, Hong Kong SAR, China
| | - Bryan Scarano
- Department of Translational Medicine and Surgery, CatholicUniversity of the Sacred Heart, Rome, Italy
| | - Alessandro D. Mazzotta
- Department of Digestive, Oncological and Metabolic Surgery, Institute Mutualiste Montsouris, Paris, France
- Biorobotics Institute, Scuola Superiore Sant’anna, Pisa, Italy
| | | |
Collapse
|
13
|
Zhang N, Zhang H, Liu Z, Dai Z, Wu W, Zhou R, Li S, Wang Z, Liang X, Wen J, Zhang X, Zhang B, Ouyang S, Zhang J, Luo P, Li X, Cheng Q. An artificial intelligence network-guided signature for predicting outcome and immunotherapy response in lung adenocarcinoma patients based on 26 machine learning algorithms. Cell Prolif 2023; 56:e13409. [PMID: 36822595 PMCID: PMC10068958 DOI: 10.1111/cpr.13409] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 12/27/2022] [Accepted: 01/12/2023] [Indexed: 02/25/2023] Open
Abstract
The immune cells play an increasingly vital role in influencing the proliferation, progression, and metastasis of lung adenocarcinoma (LUAD) cells. However, the potential of immune cells' specific genes-based model remains largely unknown. In the current study, by analysing single-cell RNA sequencing (scRNA-seq) data and bulk RNA sequencing data, the tumour-infiltrating immune cell (TIIC) associated signature was developed based on a total of 26 machine learning (ML) algorithms. As a result, the TIIC signature score could predict survival outcomes of LUAD patients across five independent datasets. The TIIC signature score showed superior performance to 168 previously established signatures in LUAD. Moreover, the TIIC signature score developed by the immunofluorescence staining of the tissue array of LUAD patients showed a prognostic value. Our research revealed a solid connection between TIIC signature score and tumour immunity as well as metabolism. Additionally, it has been discovered that the TIIC signature score can forecast genomic change, chemotherapeutic drug susceptibility, and-most significantly-immunotherapeutic response. As a newly demonstrated biomarker, the TIIC signature score facilitated the selection of the LUAD population who would benefit from future clinical stratification.
Collapse
Affiliation(s)
- Nan Zhang
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China.,College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| | - Hao Zhang
- Department of Neurosurgery, The Second Affiliated Hospital, Chongqing Medical University, Chongqing, China
| | - Zaoqu Liu
- Department of Interventional Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Ziyu Dai
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| | - Wantao Wu
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China.,Department of Oncology, Xiangya Hospital, Central South University, Changsha, China
| | - Ran Zhou
- Division of Neuroscience and Experimental Psychology, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Shuyu Li
- Department of Thyroid and Breast Surgery, Tongji Hospital, Tongji Medical College of Huazhong University of Science and Technology, Wuhan, China
| | - Zeyu Wang
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| | - Xisong Liang
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| | - Jie Wen
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| | - Xun Zhang
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| | - Bo Zhang
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| | - Sirui Ouyang
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| | - Jian Zhang
- Department of Oncology, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Peng Luo
- Department of Oncology, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Xizhe Li
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China.,Department of Thoracic Surgery, Xiangya Hospital, Central South University, Changsha, China.,Hunan Engineering Research Center for Pulmonary Nodules Precise Diagnosis & Treatment, Changsha, China
| | - Quan Cheng
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| |
Collapse
|
14
|
Luo B, Luo Z, Zhang X, Xu M, Shi C. Status of cognitive frailty in elderly patients with chronic kidney disease and construction of a risk prediction model: a cross-sectional study. BMJ Open 2022; 12:e060633. [PMID: 36572488 PMCID: PMC9806025 DOI: 10.1136/bmjopen-2021-060633] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Accepted: 09/02/2022] [Indexed: 12/27/2022] Open
Abstract
OBJECTIVE To investigate the risk factors of cognitive frailty in elderly patients with chronic kidney disease (CKD), and to establish an artificial neural network (ANN) model. DESIGN A cross-sectional design. SETTING Two tertiary hospitals in southern China. PARTICIPANTS 425 elderly patients aged ≥60 years with CKD. METHODS Data were collected via questionnaire investigation, anthropometric measurements, laboratory tests and electronic medical records. The 425 samples were randomly divided into a training set, test set and validation set at a ratio of 5:3:2. Variables were screened by univariate and multivariate logistic regression analyses, then an ANN model was constructed. The accuracy, specificity, sensitivity, receiver operating characteristic (ROC) curve and area under the ROC curve (AUC) were used to evaluate the predictive power of the model. RESULTS Barthel Index (BI) score, albumin, education level, 15-item Geriatric Depression Scale score and Social Support Rating Scale score were the factors influencing the occurrence of cognitive frailty (p<0.05). Among them, BI score was the most important factor determining cognitive frailty, with an importance index of 0.30. The accuracy, specificity and sensitivity of the ANN model were 86.36%, 88.61% and 80.65%, respectively, and the AUC of the constructed ANN model was 0.913. CONCLUSION The ANN model constructed in this study has good predictive ability, and can provide a reference tool for clinical nursing staff in the early prediction of cognitive frailty in a high-risk population.
Collapse
Affiliation(s)
- Baolin Luo
- School of Nursing, Shantou University Medical College, Shantou, Guangdong, China
- Nursing Department, Fujian Medical University Union Hospital, Fuzhou, Fujian, China
| | - Zebing Luo
- School of Nursing, Shantou University Medical College, Shantou, Guangdong, China
- Cancer Department, Cancer Hospital of Shantou University Medical College, Shantou, Guangdong, China
| | - Xiaoyun Zhang
- School of Nursing, Shantou University Medical College, Shantou, Guangdong, China
- Nephrology Department, Shantou Central Hospital, Shantou, Guangdong, China
| | - Meiwan Xu
- Nephrology Department, The First Affiliated Hospital of Shantou University Medical College, Shantou, Guangdong, China
| | - Chujun Shi
- School of Nursing, Shantou University Medical College, Shantou, Guangdong, China
| |
Collapse
|
15
|
Tsimenidis S, Vrochidou E, Papakostas GA. Omics Data and Data Representations for Deep Learning-Based Predictive Modeling. Int J Mol Sci 2022; 23:12272. [PMID: 36293133 PMCID: PMC9603455 DOI: 10.3390/ijms232012272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 10/03/2022] [Accepted: 10/12/2022] [Indexed: 11/25/2022] Open
Abstract
Medical discoveries mainly depend on the capability to process and analyze biological datasets, which inundate the scientific community and are still expanding as the cost of next-generation sequencing technologies is decreasing. Deep learning (DL) is a viable method to exploit this massive data stream since it has advanced quickly with there being successive innovations. However, an obstacle to scientific progress emerges: the difficulty of applying DL to biology, and this because both fields are evolving at a breakneck pace, thus making it hard for an individual to occupy the front lines of both of them. This paper aims to bridge the gap and help computer scientists bring their valuable expertise into the life sciences. This work provides an overview of the most common types of biological data and data representations that are used to train DL models, with additional information on the models themselves and the various tasks that are being tackled. This is the essential information a DL expert with no background in biology needs in order to participate in DL-based research projects in biomedicine, biotechnology, and drug discovery. Alternatively, this study could be also useful to researchers in biology to understand and utilize the power of DL to gain better insights into and extract important information from the omics data.
Collapse
Affiliation(s)
| | | | - George A. Papakostas
- MLV Research Group, Department of Computer Science, International Hellenic University, 65404 Kavala, Greece
| |
Collapse
|
16
|
Zare Harofte S, Soltani M, Siavashy S, Raahemifar K. Recent Advances of Utilizing Artificial Intelligence in Lab on a Chip for Diagnosis and Treatment. SMALL (WEINHEIM AN DER BERGSTRASSE, GERMANY) 2022; 18:e2203169. [PMID: 36026569 DOI: 10.1002/smll.202203169] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2022] [Revised: 07/16/2022] [Indexed: 05/14/2023]
Abstract
Nowadays, artificial intelligence (AI) creates numerous promising opportunities in the life sciences. AI methods can be significantly advantageous for analyzing the massive datasets provided by biotechnology systems for biological and biomedical applications. Microfluidics, with the developments in controlled reaction chambers, high-throughput arrays, and positioning systems, generate big data that is not necessarily analyzed successfully. Integrating AI and microfluidics can pave the way for both experimental and analytical throughputs in biotechnology research. Microfluidics enhances the experimental methods and reduces the cost and scale, while AI methods significantly improve the analysis of huge datasets obtained from high-throughput and multiplexed microfluidics. This review briefly presents a survey of the role of AI and microfluidics in biotechnology. Also, the incorporation of AI with microfluidics is comprehensively investigated. Specifically, recent studies that perform flow cytometry cell classification, cell isolation, and a combination of them by gaining from both AI methods and microfluidic techniques are covered. Despite all current challenges, various fields of biotechnology can be remarkably affected by the combination of AI and microfluidic technologies. Some of these fields include point-of-care systems, precision, personalized medicine, regenerative medicine, prognostics, diagnostics, and treatment of oncology and non-oncology-related diseases.
Collapse
Affiliation(s)
- Samaneh Zare Harofte
- Department of Mechanical Engineering, K. N. Toosi University of Technology, Tehran, 19967-15433, Iran
| | - Madjid Soltani
- Department of Mechanical Engineering, K. N. Toosi University of Technology, Tehran, 19967-15433, Iran
- Department of Electrical and Computer Engineering, Faculty of Engineering, University of Waterloo, Waterloo, ON, N2L 3G1, Canada
- Centre for Biotechnology and Bioengineering (CBB), University of Waterloo, Waterloo, ON, N2L 3G1, Canada
- Advanced Bioengineering Initiative Center, Multidisciplinary International Complex, K. N. Toosi University of Technology, Tehran, 14176-14411, Iran
- Cancer Biology Research Center, Cancer Institute of Iran, Tehran University of Medical Sciences, Tehran, 14197-33141, Iran
| | - Saeed Siavashy
- Department of Mechanical Engineering, K. N. Toosi University of Technology, Tehran, 19967-15433, Iran
| | - Kaamran Raahemifar
- Data Science and Artificial Intelligence Program, College of Information Sciences and Technology (IST), Penn State University, State College, PA, 16801, USA
- School of Optometry and Vision Science, Faculty of Science, University of Waterloo, 200 University Ave. W, Waterloo, ON, N2L 3G1, Canada
- Department of Chemical Engineering, Faculty of Engineering, University of Waterloo, 200 University Ave. W, Waterloo, ON, N2L 3G1, Canada
| |
Collapse
|
17
|
Review on Machine Learning Techniques for Medical Data Classification and Disease Diagnosis. REGENERATIVE ENGINEERING AND TRANSLATIONAL MEDICINE 2022. [DOI: 10.1007/s40883-022-00273-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
18
|
Abstract
The tremendous amount of biological sequence data available, combined with the recent methodological breakthrough in deep learning in domains such as computer vision or natural language processing, is leading today to the transformation of bioinformatics through the emergence of deep genomics, the application of deep learning to genomic sequences. We review here the new applications that the use of deep learning enables in the field, focusing on three aspects: the functional annotation of genomes, the sequence determinants of the genome functions and the possibility to write synthetic genomic sequences.
Collapse
|
19
|
Nguyen TM, Le HL, Hwang KB, Hong YC, Kim JH. Predicting High Blood Pressure Using DNA Methylome-Based Machine Learning Models. Biomedicines 2022; 10:biomedicines10061406. [PMID: 35740428 PMCID: PMC9220060 DOI: 10.3390/biomedicines10061406] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 06/06/2022] [Accepted: 06/10/2022] [Indexed: 12/12/2022] Open
Abstract
DNA methylation modification plays a vital role in the pathophysiology of high blood pressure (BP). Herein, we applied three machine learning (ML) algorithms including deep learning (DL), support vector machine, and random forest for detecting high BP using DNA methylome data. Peripheral blood samples of 50 elderly individuals were collected three times at three visits for DNA methylome profiling. Participants who had a history of hypertension and/or current high BP measure were considered to have high BP. The whole dataset was randomly divided to conduct a nested five-group cross-validation for prediction performance. Data in each outer training set were independently normalized using a min–max scaler, reduced dimensionality using principal component analysis, then fed into three predictive algorithms. Of the three ML algorithms, DL achieved the best performance (AUPRC = 0.65, AUROC = 0.73, accuracy = 0.69, and F1-score = 0.73). To confirm the reliability of using DNA methylome as a biomarker for high BP, we constructed mixed-effects models and found that 61,694 methylation sites located in 15,523 intragenic regions and 16,754 intergenic regions were significantly associated with BP measures. Our proposed models pioneered the methodology of applying ML and DNA methylome data for early detection of high BP in clinical practices.
Collapse
Affiliation(s)
- Thi Mai Nguyen
- Department of Integrative Bioscience & Biotechnology, Sejong University, 209 Neungdong-ro, Gwangjin-gu, Seoul 05006, Korea;
| | - Hoang Long Le
- Department of Computer Science & Engineering, Sejong University, 209 Neungdong-ro, Gwangjin-gu, Seoul 05006, Korea;
| | - Kyu-Baek Hwang
- School of Computer Science & Engineering, Soongsil University, 369 Sangdo-ro, Dongjak-gu, Seoul 06978, Korea;
| | - Yun-Chul Hong
- Department of Preventive Medicine, College of Medicine, Seoul National University, Seoul 03080, Korea;
- Institute of Environmental Medicine, Seoul National University Medical Research Center, Seoul 03080, Korea
| | - Jin Hee Kim
- Department of Integrative Bioscience & Biotechnology, Sejong University, 209 Neungdong-ro, Gwangjin-gu, Seoul 05006, Korea;
- Correspondence: ; Tel.: +82-2-3408-3655
| |
Collapse
|
20
|
Ding W, Nan Y, Wu J, Han C, Xin X, Li S, Liu H, Zhang L. Combining multi-dimensional molecular fingerprints to predict the hERG cardiotoxicity of compounds. Comput Biol Med 2022; 144:105390. [DOI: 10.1016/j.compbiomed.2022.105390] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 03/06/2022] [Accepted: 03/07/2022] [Indexed: 01/28/2023]
|
21
|
de Sire A, Gallelli L, Marotta N, Lippi L, Fusco N, Calafiore D, Cione E, Muraca L, Maconi A, De Sarro G, Ammendolia A, Invernizzi M. Vitamin D Deficiency in Women with Breast Cancer: A Correlation with Osteoporosis? A Machine Learning Approach with Multiple Factor Analysis. Nutrients 2022; 14:1586. [PMID: 35458148 PMCID: PMC9031622 DOI: 10.3390/nu14081586] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 04/08/2022] [Accepted: 04/09/2022] [Indexed: 12/16/2022] Open
Abstract
Breast cancer (BC) is the most frequent malignant tumor in women in Europe and North America, and the use of aromatase inhibitors (AIs) is recommended in women affected by estrogen receptor-positive BCs. AIs, by inhibiting the enzyme that converts androgens into estrogen, cause a decrement in bone mineral density (BMD), with a consequent increased risk of fragility fractures. This study aimed to evaluate the role of vitamin D3 deficiency in women with breast cancer and its correlation with osteoporosis and BMD modifications. This observational cross-sectional study collected the following data regarding bone health: osteoporosis and osteopenia diagnosis, lumbar spine (LS) and femoral neck bone mineral density (BMD), serum levels of 25-hydroxyvitamin D3 (25(OH)D3), calcium and parathyroid hormone. The study included 54 women with BC, mean age 67.3 ± 8.16 years. Given a significantly low correlation with the LS BMD value (r2 = 0.30, p = 0.025), we assessed the role of vitamin D3 via multiple factor analysis and found that BMD and vitamin D3 contributed to the arrangement of clusters, reported as vectors, providing similar trajectories of influence to the construction of the machine learning model. Thus, in a cohort of women with BC undergoing Ais, we identified a very low prevalence (5.6%) of patients with adequate bone health and a normal vitamin D3 status. According to our cluster model, we may conclude that the assessment and management of bone health and vitamin D3 status are crucial in BC survivors.
Collapse
Affiliation(s)
- Alessandro de Sire
- Physical Medicine and Rehabilitation Unit, Department of Medical and Surgical Sciences, University of Catanzaro “Magna Graecia”, 88100 Catanzaro, Italy; (N.M.); (A.A.)
| | - Luca Gallelli
- Operative Unit of Clinical Pharmacology, Mater Domini University Hospital, Department of Health Science, University of Catanzaro “Magna Graecia”, 88100 Catanzaro, Italy; (L.G.); (G.D.S.)
- Research Center FAS@UMG, Department of Health Science, University of Catanzaro “Magna Graecia”, 88100 Catanzaro, Italy
| | - Nicola Marotta
- Physical Medicine and Rehabilitation Unit, Department of Medical and Surgical Sciences, University of Catanzaro “Magna Graecia”, 88100 Catanzaro, Italy; (N.M.); (A.A.)
| | - Lorenzo Lippi
- Translational Medicine, Dipartimento Attività Integrate Ricerca e Innovazione (DAIRI), Azienda Ospedaliera SS. Antonio e Biagio e Cesare Arrigo, 15121 Alessandria, Italy; (L.L.); (A.M.); (M.I.)
- Department of Health Sciences, University of Eastern Piedmont “A. Avogadro”, 28100 Novara, Italy
| | - Nicola Fusco
- Department of Oncology and Hemato-Oncology, University of Milan, 20126 Milan, Italy;
- Division of Pathology, IEO, European Institute of Oncology IRCCS, 20141 Milan, Italy
| | - Dario Calafiore
- Physical Medicine and Rehabilitation Unit, Department of Neurosciences, ASST Carlo Poma, 46100 Mantova, Italy;
| | - Erika Cione
- Department of Pharmacy, Health and Nutritional Sciences, Department of Excellence 2018–2022, University of Calabria, 87036 Rende, Italy;
| | - Lucia Muraca
- Department of General Medicine, ASP 7, 88100 Catanzaro, Italy;
| | - Antonio Maconi
- Translational Medicine, Dipartimento Attività Integrate Ricerca e Innovazione (DAIRI), Azienda Ospedaliera SS. Antonio e Biagio e Cesare Arrigo, 15121 Alessandria, Italy; (L.L.); (A.M.); (M.I.)
| | - Giovambattista De Sarro
- Operative Unit of Clinical Pharmacology, Mater Domini University Hospital, Department of Health Science, University of Catanzaro “Magna Graecia”, 88100 Catanzaro, Italy; (L.G.); (G.D.S.)
- Research Center FAS@UMG, Department of Health Science, University of Catanzaro “Magna Graecia”, 88100 Catanzaro, Italy
| | - Antonio Ammendolia
- Physical Medicine and Rehabilitation Unit, Department of Medical and Surgical Sciences, University of Catanzaro “Magna Graecia”, 88100 Catanzaro, Italy; (N.M.); (A.A.)
| | - Marco Invernizzi
- Translational Medicine, Dipartimento Attività Integrate Ricerca e Innovazione (DAIRI), Azienda Ospedaliera SS. Antonio e Biagio e Cesare Arrigo, 15121 Alessandria, Italy; (L.L.); (A.M.); (M.I.)
- Department of Health Sciences, University of Eastern Piedmont “A. Avogadro”, 28100 Novara, Italy
| |
Collapse
|
22
|
VandenBosch LS, Luu K, Timms AE, Challam S, Wu Y, Lee AY, Cherry TJ. Machine Learning Prediction of Non-Coding Variant Impact in Human Retinal cis-Regulatory Elements. Transl Vis Sci Technol 2022; 11:16. [PMID: 35435921 PMCID: PMC9034719 DOI: 10.1167/tvst.11.4.16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 03/25/2022] [Indexed: 11/24/2022] Open
Abstract
Purpose Prior studies have demonstrated the significance of specific cis-regulatory variants in retinal disease; however, determining the functional impact of regulatory variants remains a major challenge. In this study, we utilized a machine learning approach, trained on epigenomic data from the adult human retina, to systematically quantify the predicted impact of cis-regulatory variants. Methods We used human retinal DNA accessibility data (ATAC-seq) to determine a set of 18.9k high-confidence, putative cis-regulatory elements. Eighty percent of these elements were used to train a machine learning model utilizing a gapped k-mer support vector machine-based approach. In silico saturation mutagenesis and variant scoring was applied to predict the functional impact of all potential single nucleotide variants within cis-regulatory elements. Impact scores were tested in a 20% hold-out dataset and compared to allele population frequency, phylogenetic conservation, transcription factor (TF) binding motifs, and existing massively parallel reporter assay data. Results We generated a model that distinguishes between human retinal regulatory elements and negative test sequences with 95% accuracy. Among a hold-out test set of 3.7k human retinal CREs, all possible single nucleotide variants were scored. Variants with negative impact scores correlated with higher phylogenetic conservation of the reference allele, disruption of predicted TF binding motifs, and massively parallel reporter expression. Conclusions We demonstrated the utility of human retinal epigenomic data to train a machine learning model for the purpose of predicting the impact of non-coding regulatory sequence variants. Our model accurately scored sequences and predicted putative transcription factor binding motifs. This approach has the potential to expedite the characterization of pathogenic non-coding sequence variants in the context of unexplained retinal disease. Translational Relevance This workflow and resulting dataset serve as a promising genomic tool to facilitate the clinical prioritization of functionally disruptive non-coding mutations in the retina.
Collapse
Affiliation(s)
- Leah S. VandenBosch
- Center for Developmental Biology and Regenerative Medicine, Seattle Children's Research Institute, Seattle, WA, USA
| | - Kelsey Luu
- Center for Developmental Biology and Regenerative Medicine, Seattle Children's Research Institute, Seattle, WA, USA
| | - Andrew E. Timms
- Center for Developmental Biology and Regenerative Medicine, Seattle Children's Research Institute, Seattle, WA, USA
| | - Shriya Challam
- Center for Developmental Biology and Regenerative Medicine, Seattle Children's Research Institute, Seattle, WA, USA
| | - Yue Wu
- University of Washington Department of Ophthalmology, Seattle, WA, USA
| | - Aaron Y. Lee
- University of Washington Department of Ophthalmology, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
| | - Timothy J. Cherry
- Center for Developmental Biology and Regenerative Medicine, Seattle Children's Research Institute, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
- University of Washington Department of Pediatrics, Seattle, WA, USA
| |
Collapse
|
23
|
Computational Intelligence Method for Detection of White Blood Cells Using Hybrid of Convolutional Deep Learning and SIFT. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:9934144. [PMID: 35069796 PMCID: PMC8769840 DOI: 10.1155/2022/9934144] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Accepted: 10/25/2021] [Indexed: 12/22/2022]
Abstract
Infection diseases are among the top global issues with negative impacts on health, economy, and society as a whole. One of the most effective ways to detect these diseases is done by analysing the microscopic images of blood cells. Artificial intelligence (AI) techniques are now widely used to detect these blood cells and explore their structures. In recent years, deep learning architectures have been utilized as they are powerful tools for big data analysis. In this work, we are presenting a deep neural network for processing of microscopic images of blood cells. Processing these images is particularly important as white blood cells and their structures are being used to diagnose different diseases. In this research, we design and implement a reliable processing system for blood samples and classify five different types of white blood cells in microscopic images. We use the Gram-Schmidt algorithm for segmentation purposes. For the classification of different types of white blood cells, we combine Scale-Invariant Feature Transform (SIFT) feature detection technique with a deep convolutional neural network. To evaluate our work, we tested our method on LISC and WBCis databases. We achieved 95.84% and 97.33% accuracy of segmentation for these data sets, respectively. Our work illustrates that deep learning models can be promising in designing and developing a reliable system for microscopic image processing.
Collapse
|
24
|
Kanapeckaitė A, Burokienė N, Mažeikienė A, Cottrell GS, Widera D. Biophysics is reshaping our perception of the epigenome: from DNA-level to high-throughput studies. BIOPHYSICAL REPORTS 2021; 1:100028. [PMID: 36425454 PMCID: PMC9680810 DOI: 10.1016/j.bpr.2021.100028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Accepted: 09/24/2021] [Indexed: 06/16/2023]
Abstract
Epigenetic research holds great promise to advance our understanding of biomarkers and regulatory processes in health and disease. An increasing number of new approaches, ranging from molecular to biophysical analyses, enable identifying epigenetic changes on the level of a single gene or the whole epigenome. The aim of this review is to highlight how the field is shifting from completely molecular-biology-driven solutions to multidisciplinary strategies including more reliance on biophysical analysis tools. Biophysics not only offers technical advancements in imaging or structure analysis but also helps to explore regulatory interactions. New computational methods are also being developed to meet the demand of growing data volumes and their processing. Therefore, it is important to capture these new directions in epigenetics from a biophysical perspective and discuss current challenges as well as multiple applications of biophysical methods and tools. Specifically, we gradually introduce different biophysical research methods by first considering the DNA-level information and eventually higher-order chromatin structures. Moreover, we aim to highlight that the incorporation of bioinformatics, machine learning, and artificial intelligence into biophysical analysis allows gaining new insights into complex epigenetic processes. The gained understanding has already proven useful in translational and clinical research providing better patient stratification options or new therapeutic insights. Together, this offers a better readiness to transform bench-top experiments into industrial high-throughput applications with a possibility to employ developed methods in clinical practice and diagnostics.
Collapse
Affiliation(s)
- Austė Kanapeckaitė
- Algorithm379, Laisvės g. 7, LT 12007, Vilnius, Lithuania
- Reading School of Pharmacy, Whiteknights, Reading, UK, RG6 6UB
| | - Neringa Burokienė
- Clinics of Internal Diseases, Family Medicine and Oncology, Institute of Clinical Medicine, Faculty of Medicine, Vilnius University, M. K. Čiurlionio str. 21/27, LT-03101 Vilnius, Lithuania
| | - Asta Mažeikienė
- Department of Physiology, Biochemistry, Microbiology and Laboratory Medicine, Institute of Biomedical Sciences, Faculty of Medicine, M. K. Čiurlionio str. 21/27, LT-03101 Vilnius, Lithuania
| | | | - Darius Widera
- Reading School of Pharmacy, Whiteknights, Reading, UK, RG6 6UB
| |
Collapse
|
25
|
Brochhausen M, Hester DM. Data Properties or Analytical Methodologies: Too Much Attention to the Former Ignores Concerns About the Latter. THE AMERICAN JOURNAL OF BIOETHICS : AJOB 2021; 21:70-72. [PMID: 34806969 DOI: 10.1080/15265161.2021.1991037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
|
26
|
Arslan E, Schulz J, Rai K. Machine Learning in Epigenomics: Insights into Cancer Biology and Medicine. Biochim Biophys Acta Rev Cancer 2021; 1876:188588. [PMID: 34245839 PMCID: PMC8595561 DOI: 10.1016/j.bbcan.2021.188588] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Revised: 05/29/2021] [Accepted: 07/02/2021] [Indexed: 02/01/2023]
Abstract
The recent deluge of genome-wide technologies for the mapping of the epigenome and resulting data in cancer samples has provided the opportunity for gaining insights into and understanding the roles of epigenetic processes in cancer. However, the complexity, high-dimensionality, sparsity, and noise associated with these data pose challenges for extensive integrative analyses. Machine Learning (ML) algorithms are particularly suited for epigenomic data analyses due to their flexibility and ability to learn underlying hidden structures. We will discuss four overlapping but distinct major categories under ML: dimensionality reduction, unsupervised methods, supervised methods, and deep learning (DL). We review the preferred use cases of these algorithms in analyses of cancer epigenomics data with the hope to provide an overview of how ML approaches can be used to explore fundamental questions on the roles of epigenome in cancer biology and medicine.
Collapse
Affiliation(s)
- Emre Arslan
- Department of Genomic Medicine, MD Anderson Cancer Center, Houston, TX 77030, United States of America
| | - Jonathan Schulz
- Department of Genomic Medicine, MD Anderson Cancer Center, Houston, TX 77030, United States of America
| | - Kunal Rai
- Department of Genomic Medicine, MD Anderson Cancer Center, Houston, TX 77030, United States of America.
| |
Collapse
|
27
|
Mavaie P, Holder L, Beck D, Skinner MK. Predicting environmentally responsive transgenerational differential DNA methylated regions (epimutations) in the genome using a hybrid deep-machine learning approach. BMC Bioinformatics 2021; 22:575. [PMID: 34847877 PMCID: PMC8630850 DOI: 10.1186/s12859-021-04491-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Accepted: 11/18/2021] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Deep learning is an active bioinformatics artificial intelligence field that is useful in solving many biological problems, including predicting altered epigenetics such as DNA methylation regions. Deep learning (DL) can learn an informative representation that addresses the need for defining relevant features. However, deep learning models are computationally expensive, and they require large training datasets to achieve good classification performance. RESULTS One approach to addressing these challenges is to use a less complex deep learning network for feature selection and Machine Learning (ML) for classification. In the current study, we introduce a hybrid DL-ML approach that uses a deep neural network for extracting molecular features and a non-DL classifier to predict environmentally responsive transgenerational differential DNA methylated regions (DMRs), termed epimutations, based on the extracted DL-based features. Various environmental toxicant induced epigenetic transgenerational inheritance sperm epimutations were used to train the model on the rat genome DNA sequence and use the model to predict transgenerational DMRs (epimutations) across the entire genome. CONCLUSION The approach was also used to predict potential DMRs in the human genome. Experimental results show that the hybrid DL-ML approach outperforms deep learning and traditional machine learning methods.
Collapse
Affiliation(s)
- Pegah Mavaie
- School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA, 99164-2752, USA
| | - Lawrence Holder
- School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA, 99164-2752, USA.
| | - Daniel Beck
- Center for Reproductive Biology, School of Biological Sciences, Washington State University, Pullman, WA, 99164-4236, USA
| | - Michael K Skinner
- Center for Reproductive Biology, School of Biological Sciences, Washington State University, Pullman, WA, 99164-4236, USA.
| |
Collapse
|
28
|
Deep Learning for Human Disease Detection, Subtype Classification, and Treatment Response Prediction Using Epigenomic Data. Biomedicines 2021; 9:biomedicines9111733. [PMID: 34829962 PMCID: PMC8615388 DOI: 10.3390/biomedicines9111733] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Revised: 10/26/2021] [Accepted: 11/17/2021] [Indexed: 12/25/2022] Open
Abstract
Deep learning (DL) is a distinct class of machine learning that has achieved first-class performance in many fields of study. For epigenomics, the application of DL to assist physicians and scientists in human disease-relevant prediction tasks has been relatively unexplored until very recently. In this article, we critically review published studies that employed DL models to predict disease detection, subtype classification, and treatment responses, using epigenomic data. A comprehensive search on PubMed, Scopus, Web of Science, Google Scholar, and arXiv.org was performed following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. Among 1140 initially identified publications, we included 22 articles in our review. DNA methylation and RNA-sequencing data are most frequently used to train the predictive models. The reviewed models achieved a high accuracy ranged from 88.3% to 100.0% for disease detection tasks, from 69.5% to 97.8% for subtype classification tasks, and from 80.0% to 93.0% for treatment response prediction tasks. We generated a workflow to develop a predictive model that encompasses all steps from first defining human disease-related tasks to finally evaluating model performance. DL holds promise for transforming epigenomic big data into valuable knowledge that will enhance the development of translational epigenomics.
Collapse
|
29
|
Caudai C, Galizia A, Geraci F, Le Pera L, Morea V, Salerno E, Via A, Colombo T. AI applications in functional genomics. Comput Struct Biotechnol J 2021; 19:5762-5790. [PMID: 34765093 PMCID: PMC8566780 DOI: 10.1016/j.csbj.2021.10.009] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 10/05/2021] [Accepted: 10/05/2021] [Indexed: 12/13/2022] Open
Abstract
We review the current applications of artificial intelligence (AI) in functional genomics. The recent explosion of AI follows the remarkable achievements made possible by "deep learning", along with a burst of "big data" that can meet its hunger. Biology is about to overthrow astronomy as the paradigmatic representative of big data producer. This has been made possible by huge advancements in the field of high throughput technologies, applied to determine how the individual components of a biological system work together to accomplish different processes. The disciplines contributing to this bulk of data are collectively known as functional genomics. They consist in studies of: i) the information contained in the DNA (genomics); ii) the modifications that DNA can reversibly undergo (epigenomics); iii) the RNA transcripts originated by a genome (transcriptomics); iv) the ensemble of chemical modifications decorating different types of RNA transcripts (epitranscriptomics); v) the products of protein-coding transcripts (proteomics); and vi) the small molecules produced from cell metabolism (metabolomics) present in an organism or system at a given time, in physiological or pathological conditions. After reviewing main applications of AI in functional genomics, we discuss important accompanying issues, including ethical, legal and economic issues and the importance of explainability.
Collapse
Affiliation(s)
- Claudia Caudai
- CNR, Institute of Information Science and Technologies “A. Faedo” (ISTI), Pisa, Italy
| | - Antonella Galizia
- CNR, Institute of Applied Mathematics and Information Technologies (IMATI), Genoa, Italy
| | - Filippo Geraci
- CNR, Institute for Informatics and Telematics (IIT), Pisa, Italy
| | - Loredana Le Pera
- CNR, Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies (IBIOM), Bari, Italy
- CNR, Institute of Molecular Biology and Pathology (IBPM), Rome, Italy
| | - Veronica Morea
- CNR, Institute of Molecular Biology and Pathology (IBPM), Rome, Italy
| | - Emanuele Salerno
- CNR, Institute of Information Science and Technologies “A. Faedo” (ISTI), Pisa, Italy
| | - Allegra Via
- CNR, Institute of Molecular Biology and Pathology (IBPM), Rome, Italy
| | - Teresa Colombo
- CNR, Institute of Molecular Biology and Pathology (IBPM), Rome, Italy
| |
Collapse
|
30
|
Wang Y, Zhang P, Guo W, Liu H, Li X, Zhang Q, Du Z, Hu G, Han X, Pu L, Tian J, Gu X. A deep learning approach to automate whole-genome prediction of diverse epigenomic modifications in plants. THE NEW PHYTOLOGIST 2021; 232:880-897. [PMID: 34287908 DOI: 10.1111/nph.17630] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Accepted: 07/09/2021] [Indexed: 06/13/2023]
Abstract
Epigenetic modifications function in gene transcription, RNA metabolism, and other biological processes. However, multiple factors currently limit the scientific utility of epigenomic datasets generated for plants. Here, using deep-learning approaches, we developed a Smart Model for Epigenetics in Plants (SMEP) to predict six types of epigenomic modifications: DNA 5-methylcytosine (5mC) and N6-methyladenosine (6mA) methylation, RNA N6-methyladenosine (m6 A) methylation, and three types of histone modification. Using the datasets from the japonica rice Nipponbare, SMEP achieved 95% prediction accuracy for 6mA, and also achieved around 80% for 5mC, m6 A, and the three types of histone modification based on the 10-fold cross-validation. Additionally, > 95% of the 6mA peaks detected after a heat-shock treatment were predicted. We also successfully applied the SMEP for examining epigenomic modifications in indica rice 93-11 and even the B73 maize line. Taken together, we show that the deep-learning-enabled SMEP can reliably mine epigenomic datasets from diverse plants to yield actionable insights about epigenomic sites. Thus, our work opens new avenues for the application of predictive tools to facilitate functional research, and will almost certainly increase the efficiency of genome engineering efforts.
Collapse
Affiliation(s)
- Yifan Wang
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Pingxian Zhang
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Weijun Guo
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Hanqing Liu
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Xiulan Li
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Qian Zhang
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Zhuoying Du
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Guihua Hu
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Xiao Han
- College of Biological Science and Engineering, Fuzhou University, Fuzhou, 350108, China
| | - Li Pu
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Jian Tian
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Xiaofeng Gu
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| |
Collapse
|
31
|
Tamimi I, Ballesteros J, Lara AP, Tat J, Alaqueel M, Schupbach J, Marwan Y, Urdiales C, Gomez-de-Gabriel JM, Burman M, Martineau PA. A Prediction Model for Primary Anterior Cruciate Ligament Injury Using Artificial Intelligence. Orthop J Sports Med 2021; 9:23259671211027543. [PMID: 34568504 PMCID: PMC8461131 DOI: 10.1177/23259671211027543] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Accepted: 03/02/2021] [Indexed: 11/17/2022] Open
Abstract
Background Supervised machine learning models in artificial intelligence (AI) have been increasingly used to predict different types of events. However, their use in orthopaedic surgery has been limited. Hypothesis It was hypothesized that supervised learning techniques could be used to build a mathematical model to predict primary anterior cruciate ligament (ACL) injuries using a set of morphological features of the knee. Study Design Cross-sectional study; Level of evidence, 3. Methods Included were 50 adults who had undergone primary ACL reconstruction between 2008 and 2015. All patients were between 18 and 40 years of age at the time of surgery. Patients with a previous ACL injury, multiligament knee injury, previous ACL reconstruction, history of ACL revision surgery, complete meniscectomy, infection, missing data, and associated fracture were excluded. We also identified 50 sex-matched controls who had not sustained an ACL injury. For all participants, we used the preoperative magnetic resonance images to measure the anteroposterior lengths of the medial and lateral tibial plateaus as well as the lateral and medial bone slope (LBS and MBS), lateral and medial meniscal height (LMH and MMH), and lateral and medial meniscal slope (LMS and MMS). The AI predictor was created using Matlab R2019b. A Gaussian naïve Bayes model was selected to create the predictor. Results Patients in the ACL injury group had a significantly increased posterior LBS (7.0° ± 4.7° vs 3.9° ± 5.4°; P = .008) and LMS (-1.7° ± 4.8° vs -4.0° ± 4.2°; P = .002) and a lower MMH (5.5 ± 0.1 vs 6.1 ± 0.1 mm; P = .006) and LMH (6.9 ± 0.1 vs 7.6 ± 0.1 mm; P = .001). The AI model selected LBS and MBS as the best possible predictive combination, achieving 70% validation accuracy and 92% testing accuracy. Conclusion A prediction model for primary ACL injury, created using machine learning techniques, achieved a >90% testing accuracy. Compared with patients who did not sustain an ACL injury, patients with torn ACLs had an increased posterior LBS and LMS and a lower MMH and LMH.
Collapse
Affiliation(s)
- Iskandar Tamimi
- Knee Division, Hospital Regional Universitario de Málaga, Málaga, Spain
| | | | - Almudena Perez Lara
- Department of Radiology, Hospital Regional Universitario de Málaga, Málaga, Spain
| | - Jimmy Tat
- Department of Surgery, Division of Orthopaedic Surgery, University of Toronto, Toronto, Ontario, Canada
| | - Motaz Alaqueel
- Division of Orthopaedic Surgery, McGill University Health Centre, Montreal, Canada
| | - Justin Schupbach
- Division of Orthopaedic Surgery, McGill University Health Centre, Montreal, Canada
| | - Yousef Marwan
- Division of Orthopaedic Surgery, McGill University Health Centre, Montreal, Canada
| | - Cristina Urdiales
- Electronics Technology Department, Escuela de Ingeniería Telecomunicación, University of Malaga, Málaga, Spain
| | | | - Mark Burman
- Division of Orthopaedic Surgery, McGill University Health Centre, Montreal, Canada
| | - Paul Andre Martineau
- Division of Orthopaedic Surgery, McGill University Health Centre, Montreal, Canada
| |
Collapse
|
32
|
Monaco A, Pantaleo E, Amoroso N, Lacalamita A, Lo Giudice C, Fonzino A, Fosso B, Picardi E, Tangaro S, Pesole G, Bellotti R. A primer on machine learning techniques for genomic applications. Comput Struct Biotechnol J 2021; 19:4345-4359. [PMID: 34429852 PMCID: PMC8365460 DOI: 10.1016/j.csbj.2021.07.021] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Revised: 07/23/2021] [Accepted: 07/23/2021] [Indexed: 11/28/2022] Open
Abstract
High throughput sequencing technologies have enabled the study of complex biological aspects at single nucleotide resolution, opening the big data era. The analysis of large volumes of heterogeneous "omic" data, however, requires novel and efficient computational algorithms based on the paradigm of Artificial Intelligence. In the present review, we introduce and describe the most common machine learning methodologies, and lately deep learning, applied to a variety of genomics tasks, trying to emphasize capabilities, strengths and limitations through a simple and intuitive language. We highlight the power of the machine learning approach in handling big data by means of a real life example, and underline how described methods could be relevant in all cases in which large amounts of multimodal genomic data are available.
Collapse
Affiliation(s)
- Alfonso Monaco
- Istituto Nazionale di Fisica Nucleare (INFN), Sezione di Bari, Via A. Orabona 4, 70125 Bari, Italy
| | - Ester Pantaleo
- Dipartimento Interateneo di Fisica "M. Merlin", Università degli Studi di Bari "Aldo Moro", Via G. Amendola 173, 70125 Bari, Italy
| | - Nicola Amoroso
- Istituto Nazionale di Fisica Nucleare (INFN), Sezione di Bari, Via A. Orabona 4, 70125 Bari, Italy.,Dipartimento di Farmacia - Scienze del Farmaco, Università degli Studi di Bari "Aldo Moro", Via A. Orabona 4, 70125 Bari, Italy
| | - Antonio Lacalamita
- National Institute of Gastroenterology "S. de Bellis", Research Hospital, 70013 Castellana Grotte (Bari), Italy
| | - Claudio Lo Giudice
- Dipartimento di Bioscienze, Biotecnologie e Biofarmaceutica, Università degli Studi di Bari "Aldo Moro", Via A. Orabona 4, 70125 Bari, Italy
| | - Adriano Fonzino
- Dipartimento di Bioscienze, Biotecnologie e Biofarmaceutica, Università degli Studi di Bari "Aldo Moro", Via A. Orabona 4, 70125 Bari, Italy
| | - Bruno Fosso
- Istituto di Biomembrane, Bioenergetica e Biotecnologie Molecolari, Consiglio Nazionale delle Ricerche, Via G. Amendola 122/O, 70126 Bari, Italy
| | - Ernesto Picardi
- Dipartimento di Bioscienze, Biotecnologie e Biofarmaceutica, Università degli Studi di Bari "Aldo Moro", Via A. Orabona 4, 70125 Bari, Italy.,Istituto di Biomembrane, Bioenergetica e Biotecnologie Molecolari, Consiglio Nazionale delle Ricerche, Via G. Amendola 122/O, 70126 Bari, Italy
| | - Sabina Tangaro
- Istituto Nazionale di Fisica Nucleare (INFN), Sezione di Bari, Via A. Orabona 4, 70125 Bari, Italy.,Dipartimento di Scienze del Suolo, della Pianta e degli Alimenti, Università degli Studi di Bari "Aldo Moro", Bari, Via G. Amendola 165, 70125 Bari, Italy
| | - Graziano Pesole
- Dipartimento di Bioscienze, Biotecnologie e Biofarmaceutica, Università degli Studi di Bari "Aldo Moro", Via A. Orabona 4, 70125 Bari, Italy.,Istituto di Biomembrane, Bioenergetica e Biotecnologie Molecolari, Consiglio Nazionale delle Ricerche, Via G. Amendola 122/O, 70126 Bari, Italy
| | - Roberto Bellotti
- Istituto Nazionale di Fisica Nucleare (INFN), Sezione di Bari, Via A. Orabona 4, 70125 Bari, Italy.,Dipartimento Interateneo di Fisica "M. Merlin", Università degli Studi di Bari "Aldo Moro", Via G. Amendola 173, 70125 Bari, Italy
| |
Collapse
|
33
|
Mari T, Henderson J, Maden M, Nevitt S, Duarte R, Fallon N. Systematic Review of the Effectiveness of Machine Learning Algorithms for Classifying Pain Intensity, Phenotype or Treatment Outcomes Using Electroencephalogram Data. THE JOURNAL OF PAIN 2021; 23:349-369. [PMID: 34425248 DOI: 10.1016/j.jpain.2021.07.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 06/25/2021] [Accepted: 07/27/2021] [Indexed: 11/17/2022]
Abstract
Recent attempts to utilize machine learning (ML) to predict pain-related outcomes from Electroencephalogram (EEG) data demonstrate promising results. The primary aim of this review was to evaluate the effectiveness of ML algorithms for predicting pain intensity, phenotypes or treatment response from EEG. Electronic databases MEDLINE, EMBASE, Web of Science, PsycINFO and The Cochrane Library were searched. A total of 44 eligible studies were identified, with 22 presenting attempts to predict pain intensity, 15 investigating the prediction of pain phenotypes and seven assessing the prediction of treatment response. A meta-analysis was not considered appropriate for this review due to heterogenos methods and reporting. Consequently, data were narratively synthesized. The results demonstrate that the best performing model of the individual studies allows for the prediction of pain intensity, phenotypes and treatment response with accuracies ranging between 62 to 100%, 57 to 99% and 65 to 95.24%, respectively. The results suggest that ML has the potential to effectively predict pain outcomes, which may eventually be used to assist clinical care. However, inadequate reporting and potential bias reduce confidence in the results. Future research should improve reporting standards and externally validate models to decrease bias, which would increase the feasibility of clinical translation. PERSPECTIVE: This systematic review explores the state-of-the-art machine learning methods for predicting pain intensity, phenotype or treatmentresponse from EEG data. Results suggest that machine learning may demonstrate clinical utility, pending further research and development. Areas for improvement, including standardized processing, reporting and the need for better methodological assessment tools, are discussed.
Collapse
Affiliation(s)
- Tyler Mari
- Department of Psychology, University of Liverpool, Liverpool, UK.
| | | | - Michelle Maden
- Department of Health Data Science, Liverpool Reviews and Implementation Group, University of Liverpool, Liverpool, UK
| | - Sarah Nevitt
- Department of Health Data Science, Liverpool Reviews and Implementation Group, University of Liverpool, Liverpool, UK
| | - Rui Duarte
- Department of Health Data Science, Liverpool Reviews and Implementation Group, University of Liverpool, Liverpool, UK
| | - Nicholas Fallon
- Department of Psychology, University of Liverpool, Liverpool, UK
| |
Collapse
|
34
|
Gundersen S, Boddu S, Capella-Gutierrez S, Drabløs F, Fernández JM, Kompova R, Taylor K, Titov D, Zerbino D, Hovig E. Recommendations for the FAIRification of genomic track metadata. F1000Res 2021; 10. [PMID: 34249331 PMCID: PMC8226415 DOI: 10.12688/f1000research.28449.1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/17/2021] [Indexed: 01/25/2023] Open
Abstract
Background: Many types of data from genomic analyses can be represented as genomic tracks,
i.e. features linked to the genomic coordinates of a reference genome. Examples of such data are epigenetic DNA methylation data, ChIP-seq peaks, germline or somatic DNA variants, as well as RNA-seq expression levels. Researchers often face difficulties in locating, accessing and combining relevant tracks from external sources, as well as locating the raw data, reducing the value of the generated information. Description of work: We propose to advance the application of FAIR data principles (Findable, Accessible, Interoperable, and Reusable) to produce searchable metadata for genomic tracks. Findability and Accessibility of metadata can then be ensured by a track search service that integrates globally identifiable metadata from various track hubs in the Track Hub Registry and other relevant repositories. Interoperability and Reusability need to be ensured by the specification and implementation of a basic set of recommendations for metadata. We have tested this concept by developing such a specification in a JSON Schema, called FAIRtracks, and have integrated it into a novel track search service, called TrackFind. We demonstrate practical usage by importing datasets through TrackFind into existing examples of relevant analytical tools for genomic tracks: EPICO and the GSuite HyperBrowser. Conclusion: We here provide a first iteration of a draft standard for genomic track metadata, as well as the accompanying software ecosystem. It can easily be adapted or extended to future needs of the research community regarding data, methods and tools, balancing the requirements of both data submitters and analytical end-users.
Collapse
Affiliation(s)
| | - Sanjay Boddu
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | | | - Finn Drabløs
- Department of Clinical and Molecular Medicine, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
| | - José M Fernández
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain
| | - Radmila Kompova
- Center for Bioinformatics, University of Oslo (UiO), Oslo, Norway
| | - Kieron Taylor
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Dmytro Titov
- Center for Bioinformatics, University of Oslo (UiO), Oslo, Norway
| | - Daniel Zerbino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Eivind Hovig
- Center for Bioinformatics, University of Oslo (UiO), Oslo, Norway.,Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital (OUH), Oslo, Norway
| |
Collapse
|
35
|
Arora I, Tollefsbol TO. Computational methods and next-generation sequencing approaches to analyze epigenetics data: Profiling of methods and applications. Methods 2021; 187:92-103. [PMID: 32941995 PMCID: PMC7914156 DOI: 10.1016/j.ymeth.2020.09.008] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 09/08/2020] [Accepted: 09/10/2020] [Indexed: 12/20/2022] Open
Abstract
Epigenetics is mainly comprised of features that regulate genomic interactions thereby playing a crucial role in a vast array of biological processes. Epigenetic mechanisms such as DNA methylation and histone modifications influence gene expression by modulating the packaging of DNA in the nucleus. A plethora of studies have emphasized the importance of analyzing epigenetics data through genome-wide studies and high-throughput approaches, thereby providing key insights towards epigenetics-based diseases such as cancer. Recent advancements have been made towards translating epigenetics research into a high throughput approach such as genome-scale profiling. Amongst all, bioinformatics plays a pivotal role in achieving epigenetics-related computational studies. Despite significant advancements towards epigenomic profiling, it is challenging to understand how various epigenetic modifications such as chromatin modifications and DNA methylation regulate gene expression. Next-generation sequencing (NGS) provides accurate and parallel sequencing thereby allowing researchers to comprehend epigenomic profiling. In this review, we summarize different computational methods such as machine learning and other bioinformatics tools, publicly available databases and resources to identify key modifications associated with epigenetic machinery. Additionally, the review also focuses on understanding recent methodologies related to epigenome profiling using NGS methods ranging from library preparation, different sequencing platforms and analytical techniques to evaluate various epigenetic modifications such as DNA methylation and histone modifications. We also provide detailed information on bioinformatics tools and computational strategies responsible for analyzing large scale data in epigenetics.
Collapse
Affiliation(s)
- Itika Arora
- Department of Biology, University of Alabama at Birmingham, 1300 University Boulevard, Birmingham, AL 35294, USA.
| | - Trygve O Tollefsbol
- Department of Biology, University of Alabama at Birmingham, 1300 University Boulevard, Birmingham, AL 35294, USA; Comprehensive Center for Healthy Aging, University of Alabama Birmingham, 1530 3rd Avenue South, Birmingham, AL 35294, USA; Comprehensive Cancer Center, University of Alabama Birmingham, 1802 6th Avenue South, Birmingham, AL 35294, USA; Nutrition Obesity Research Center, University of Alabama Birmingham, 1675 University Boulevard, Birmingham, AL 35294, USA; Comprehensive Diabetes Center, University of Alabama Birmingham, 1825 University Boulevard, Birmingham, AL 35294, USA.
| |
Collapse
|
36
|
Majnarić LT, Babič F, O’Sullivan S, Holzinger A. AI and Big Data in Healthcare: Towards a More Comprehensive Research Framework for Multimorbidity. J Clin Med 2021; 10:jcm10040766. [PMID: 33672914 PMCID: PMC7918668 DOI: 10.3390/jcm10040766] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Revised: 02/02/2021] [Accepted: 02/11/2021] [Indexed: 12/11/2022] Open
Abstract
Multimorbidity refers to the coexistence of two or more chronic diseases in one person. Therefore, patients with multimorbidity have multiple and special care needs. However, in practice it is difficult to meet these needs because the organizational processes of current healthcare systems tend to be tailored to a single disease. To improve clinical decision making and patient care in multimorbidity, a radical change in the problem-solving approach to medical research and treatment is needed. In addition to the traditional reductionist approach, we propose interactive research supported by artificial intelligence (AI) and advanced big data analytics. Such research approach, when applied to data routinely collected in healthcare settings, provides an integrated platform for research tasks related to multimorbidity. This may include, for example, prediction, correlation, and classification problems based on multiple interaction factors. However, to realize the idea of this paradigm shift in multimorbidity research, the optimization, standardization, and most importantly, the integration of electronic health data into a common national and international research infrastructure is needed. Ultimately, there is a need for the integration and implementation of efficient AI approaches, particularly deep learning, into clinical routine directly within the workflows of the medical professionals.
Collapse
Affiliation(s)
- Ljiljana Trtica Majnarić
- Department of Internal Medicine, Family Medicine and the History of Medicine, Faculty of Medicine, University Josip Juraj Strossmayer, 31000 Osijek, Croatia;
- Department of Public Health, Faculty of Dental Medicine, University Josip Juraj Strossmayer, 31000 Osijek, Croatia
| | - František Babič
- Department of Cybernetics and Artificial Intelligence, Faculty of Electrical Engineering and Informatics, Technical University of Košice, 066 01 Košice, Slovakia
- Correspondence: ; Tel.: +421-55-602-4220
| | - Shane O’Sullivan
- Department of Pathology, Faculdade de Medicina, Universidade de São Paulo, 05508-220 São Paulo, Brazil;
| | - Andreas Holzinger
- Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, 8036 Graz, Austria;
| |
Collapse
|
37
|
Balea-Fernandez FJ, Martinez-Vega B, Ortega S, Fabelo H, Leon R, Callico GM, Bibao-Sieyro C. Analysis of Risk Factors in Dementia Through Machine Learning. J Alzheimers Dis 2021; 79:845-861. [DOI: 10.3233/jad-200955] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Background: Sociodemographic data indicate the progressive increase in life expectancy and the prevalence of Alzheimer’s disease (AD). AD is raised as one of the greatest public health problems. Its etiology is twofold: on the one hand, non-modifiable factors and on the other, modifiable. Objective: This study aims to develop a processing framework based on machine learning (ML) and optimization algorithms to study sociodemographic, clinical, and analytical variables, selecting the best combination among them for an accurate discrimination between controls and subjects with major neurocognitive disorder (MNCD). Methods: This research is based on an observational-analytical design. Two research groups were established: MNCD group (n = 46) and control group (n = 38). ML and optimization algorithms were employed to automatically diagnose MNCD. Results: Twelve out of 37 variables were identified in the validation set as the most relevant for MNCD diagnosis. Sensitivity of 100%and specificity of 71%were achieved using a Random Forest classifier. Conclusion: ML is a potential tool for automatic prediction of MNCD which can be applied to relatively small preclinical and clinical data sets. These results can be interpreted to support the influence of the environment on the development of AD.
Collapse
Affiliation(s)
| | - Beatriz Martinez-Vega
- Research Institute for Applied Microelectronics, Universidad de Las Palmas de Gran Canaria, Las Palmas de Gran Canaria, Spain
| | - Samuel Ortega
- Research Institute for Applied Microelectronics, Universidad de Las Palmas de Gran Canaria, Las Palmas de Gran Canaria, Spain
| | - Himar Fabelo
- Research Institute for Applied Microelectronics, Universidad de Las Palmas de Gran Canaria, Las Palmas de Gran Canaria, Spain
| | - Raquel Leon
- Research Institute for Applied Microelectronics, Universidad de Las Palmas de Gran Canaria, Las Palmas de Gran Canaria, Spain
| | - Gustavo M. Callico
- Research Institute for Applied Microelectronics, Universidad de Las Palmas de Gran Canaria, Las Palmas de Gran Canaria, Spain
| | - Cristina Bibao-Sieyro
- Hospital Universitario de Gran Canaria Dr. Negrín, Las Palmas de Gran Canaria, Spain
| |
Collapse
|
38
|
Abstract
In recent years, mass spectrometry (MS)-based metabolomics has been extensively applied to characterize biochemical mechanisms, and study physiological processes and phenotypic changes associated with disease. Metabolomics has also been important for identifying biomarkers of interest suitable for clinical diagnosis. For the purpose of predictive modeling, in this chapter, we will review various supervised learning algorithms such as random forest (RF), support vector machine (SVM), and partial least squares-discriminant analysis (PLS-DA). In addition, we will also review feature selection methods for identifying the best combination of metabolites for an accurate predictive model. We conclude with best practices for reproducibility by including internal and external replication, reporting metrics to assess performance, and providing guidelines to avoid overfitting and to deal with imbalanced classes. An analysis of an example data will illustrate the use of different machine learning methods and performance metrics.
Collapse
Affiliation(s)
- Tusharkanti Ghosh
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Weiming Zhang
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Debashis Ghosh
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Katerina Kechris
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
| |
Collapse
|
39
|
Mishra R, Li B. The Application of Artificial Intelligence in the Genetic Study of Alzheimer's Disease. Aging Dis 2020; 11:1567-1584. [PMID: 33269107 PMCID: PMC7673858 DOI: 10.14336/ad.2020.0312] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2020] [Accepted: 03/12/2020] [Indexed: 12/13/2022] Open
Abstract
Alzheimer's disease (AD) is a neurodegenerative disease in which genetic factors contribute approximately 70% of etiological effects. Studies have found many significant genetic and environmental factors, but the pathogenesis of AD is still unclear. With the application of microarray and next-generation sequencing technologies, research using genetic data has shown explosive growth. In addition to conventional statistical methods for the processing of these data, artificial intelligence (AI) technology shows obvious advantages in analyzing such complex projects. This article first briefly reviews the application of AI technology in medicine and the current status of genetic research in AD. Then, a comprehensive review is focused on the application of AI in the genetic research of AD, including the diagnosis and prognosis of AD based on genetic data, the analysis of genetic variation, gene expression profile, gene-gene interaction in AD, and genetic analysis of AD based on a knowledge base. Although many studies have yielded some meaningful results, they are still in a preliminary stage. The main shortcomings include the limitations of the databases, failing to take advantage of AI to conduct a systematic biology analysis of multilevel databases, and lack of a theoretical framework for the analysis results. Finally, we outlook the direction of future development. It is crucial to develop high quality, comprehensive, large sample size, data sharing resources; a multi-level system biology AI analysis strategy is one of the development directions, and computational creativity may play a role in theory model building, verification, and designing new intervention protocols for AD.
Collapse
Affiliation(s)
- Rohan Mishra
- Washington Institute for Health Sciences, Arlington, VA 22203, USA
| | - Bin Li
- Washington Institute for Health Sciences, Arlington, VA 22203, USA
- Georgetown University Medical Center, Washington D.C. 20057, USA
| |
Collapse
|
40
|
Shibata M, Okamura K, Yura K, Umezawa A. High-precision multiclass cell classification by supervised machine learning on lectin microarray data. Regen Ther 2020; 15:195-201. [PMID: 33426219 PMCID: PMC7770415 DOI: 10.1016/j.reth.2020.09.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Accepted: 09/25/2020] [Indexed: 11/27/2022] Open
Abstract
INTRODUCTION Establishment of a cell classification platform for evaluation and selection of human pluripotent stem cells (hPSCs) is of great importance to assure the efficacy and safety of cell-based therapy. In our previous work, we introduced a discriminant function that evaluates pluripotency from the cells' glycome. However, it is not yet suitable for general use. METHODS The current study aims to establish a high-precision cell classification platform introducing supervised machine learning and test the platform on glycome analysis as a proof-of-concept study. We employed linear classification and neural network to the lectin microarray data from 1577 human cells and categorized them into five classes including hPSCs. RESULTS The linear-classification-based model and the neural-network-based model successfully predicted the sample type with accuracies of 89% and 97%, respectively. CONCLUSIONS Because of the high recognition accuracies and the small amount of computing resources required for these analyses, our platform can be a high precision conventional cell classification system for hPSCs.
Collapse
Affiliation(s)
- Mayu Shibata
- Department of Reproductive Biology, National Center for Child Health and Development, Tokyo, 157-8535, Japan
- Graduate School of Humanities and Sciences, Ochanomizu University, Tokyo, 112-8610, Japan
| | - Kohji Okamura
- Department of Systems BioMedicine, National Center for Child Health and Development, Tokyo, 157-8535, Japan
| | - Kei Yura
- Graduate School of Humanities and Sciences, Ochanomizu University, Tokyo, 112-8610, Japan
- School of Advanced Science and Engineering, Waseda University, Tokyo, 162-0041, Japan
| | - Akihiro Umezawa
- Department of Reproductive Biology, National Center for Child Health and Development, Tokyo, 157-8535, Japan
| |
Collapse
|
41
|
Farzanegan B, Farzanegan R, Behgam Shadmehr M, Lajevardi S, Niakan Kalhori SR. Prediction of Patient's Adherence to the Post-Intubation Tracheal Stenosis Follow-up Plan in Iran: Application of two Data Mining Techniques. TANAFFOS 2020; 19:330-339. [PMID: 33959170 PMCID: PMC8088141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Accepted: 08/01/2020] [Indexed: 11/12/2022]
Abstract
BACKGROUND Timely diagnosis of post-intubation tracheal stenosis (PITS), which is one of the most serious complications of endotracheal intubation, may change its natural history. To prevent PITS, patients who are discharged from the intensive care unit (ICU) with more than 24 hours of intubation should be actively followed-up for three months after extubation. This study aimed to evaluate the abilities of artificial neural network (ANN) and decision tree (DT) methods in predicting the patients' adherence to the follow-up plan and revealing the knowledge behind PITS screening system development requirements. MATERIALS AND METHODS In this cohort study, conducted in 14 ICUs during 12 months in ten cities of Iran, the data of 203 intubated ICU-discharged patients were collected. Ten influential factors were defined for adherences to the PITS follow-up (P<0.05). A feed-forward multilayer perceptron algorithm was applied using a training set (two-thirds of the entire data) to develop a model for predicting the patients' adherence to the follow-up plan three months after extubation. The same data were used to develop a C5.0 DT in MATLAB 2010a. The remaining one-third of data was used for model testing, based on the holdout method. RESULTS The accuracy, sensitivity, and specificity of the developed ANN classifier were 83.30%, 72.70%, and 89.50%, respectively. The accuracy of the DT model with five nodes, 13 branches, and nine leaves (producing nine rules for active follow-up) was 75.36%. CONCLUSION The developed classifier might aid care providers to identify possible cases of non-adherence to the follow-up and care plans. Overall, active follow-up of these patients may prevent the adverse consequences of PITS after ICU discharge.
Collapse
Affiliation(s)
- Behrooz Farzanegan
- Tracheal Diseases Research Center (TDRC), National Research Institute of Tuberculosis and Lung Diseases (NRITLD), Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Roya Farzanegan
- Tracheal Diseases Research Center (TDRC), National Research Institute of Tuberculosis and Lung Diseases (NRITLD), Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Mohammad Behgam Shadmehr
- Tracheal Diseases Research Center (TDRC), National Research Institute of Tuberculosis and Lung Diseases (NRITLD), Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | | | - Sharareh R. Niakan Kalhori
- Department of Health information Management, School of Allied Medical Sciences, Tehran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
42
|
Shu C, Justice AC, Zhang X, Marconi VC, Hancock DB, Johnson EO, Xu K. DNA methylation biomarker selected by an ensemble machine learning approach predicts mortality risk in an HIV-positive veteran population. Epigenetics 2020; 16:741-753. [PMID: 33092459 PMCID: PMC8216205 DOI: 10.1080/15592294.2020.1824097] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Background: With the improved life expectancy of people living with HIV (PLWH), identifying vulnerable subpopulations at high mortality risk is important. Evidences showed that DNA methylation (DNAm) is associated with mortality in non-HIV populations. Here, we established a panel of DNAm biomarkers that can predict mortality risk among PLWH. Methods: 1,081 HIV-positive participants from the Veterans Ageing Cohort Study (VACS) were divided into training (N = 460), validation (N = 114), and testing (N = 507) sets. VACS index was used as a measure of mortality risk among PLWH. Model training and fine-tuning were conducted using the ensemble method in the training and validation sets and prediction performance was assessed in the testing set. The survival analysis comparing the predicted high and low mortality risk groups and the Gene Ontology enrichment analysis of the predictive CpG sites were performed. Results: We selected a panel of 393 CpGs for the ensemble prediction model that showed excellent performance in predicting high mortality risk with an auROC of 0.809 (95%CI: 0.767,0.851) and a balanced accuracy of 0.653 (95%CI: 0.611, 0.693) in the testing set. The high mortality risk group was significantly associated with 10-year mortality (hazard ratio = 1.79, p = 4E-05) compared with low risk group. These 393 CpGs were located in 280 genes enriched in immune and inflammation response pathways. Conclusions: We identified a panel of DNAm features associated with mortality risk in PLWH. These DNAm features may serve as predictive biomarkers for mortality risk among PLWH. Abbreviations: AUC: Area Under Curve; CI: Confidence interval; DMR: differentially methylated region; DNA: Deoxyribonucleic acid; DNAm: DNA methylation; DAVID: Database for Annotation, Visualization, and Integrated Discovery; EWA: epigenome-wide association; FDR: False discovery rate; FWER: Family-wise error rate; GLMNET: elastic-net-regularized generalized linear models; GO: Gene ontology; HIV: Human immunodeficiency virus; HM450K: Human Methylation 450 K BeadChip; k-NN: k-nearest neighbours; NK: Natural killer; PC: Principal component; PLWH: people living with HIV; QC: Quality control; SVM: Support Vector Machines; VACS: Veterans Ageing Cohort Study; XGBoost: Extreme Gradient Boosting Tree
Collapse
Affiliation(s)
- Chang Shu
- Department of Psychiatry, Yale School of Medicine, New Haven, CT, USA.,Connecticut Veteran Healthcare System, West Haven, CT, USA
| | - Amy C Justice
- Connecticut Veteran Healthcare System, West Haven, CT, USA.,Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
| | - Xinyu Zhang
- Department of Psychiatry, Yale School of Medicine, New Haven, CT, USA.,Connecticut Veteran Healthcare System, West Haven, CT, USA
| | - Vincent C Marconi
- Division of Infectious Disease, Emory University School of Medicine, Atlanta, GA, USA
| | - Dana B Hancock
- GenOmics, Bioinformatics, and Translational Research Center, Biostatistics and Epidemiology Division, RTI International, Research Triangle Park, NC, USA
| | - Eric O Johnson
- GenOmics, Bioinformatics, and Translational Research Center, Biostatistics and Epidemiology Division, RTI International, Research Triangle Park, NC, USA.,Fellow Program, RTI International, Research Triangle Park, NC, USA
| | - Ke Xu
- Department of Psychiatry, Yale School of Medicine, New Haven, CT, USA.,Connecticut Veteran Healthcare System, West Haven, CT, USA
| |
Collapse
|
43
|
Ambrosini S, Mohammed SA, Costantino S, Paneni F. Disentangling the epigenetic landscape in cardiovascular patients: a path toward personalized medicine. Minerva Cardiol Angiol 2020; 69:331-345. [PMID: 32996305 DOI: 10.23736/s2724-5683.20.05326-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Despite significant advances in our understanding of cardiovascular disease (CVD) we are still far from having developed breakthrough strategies to combat coronary atherosclerosis and heart failure, which account for most of CV deaths worldwide. Available cardiovascular therapies have failed to show to be equally effective in all patients, suggesting that inter-individual diversity is an important factor when it comes to conceive and deliver effective personalized treatments. Genome mapping has proved useful in identifying patients who could benefit more from specific drugs depending on genetic variances; however, our genetic make-up determines only a limited part of an individual's risk profile. Recent studies have demonstrated that epigenetic changes - defined as dynamic changes of DNA and histones which do not affect DNA sequence - are key players in the pathophysiology of cardiovascular disease and may participate to delineate cardiovascular risk trajectories over the lifetime. Epigenetic modifications include changes in DNA methylation, histone modifications and non-coding RNAs and these epigenetic signals have shown to cooperate in modulating chromatin accessibility to transcription factors and gene expression. Environmental factors such as air pollution, smoking, psychosocial context, and unhealthy diet regimens have shown to significantly modify the epigenome thus leading to altered transcriptional programs and CVD phenotypes. Therefore, the integration of genetic and epigenetic information might be invaluable to build individual maps of cardiovascular risk and hence, could be employed for the design of customized diagnostic and therapeutic strategies. In the present review, we discuss the growing importance of epigenetic information and its putative implications in cardiovascular precision medicine.
Collapse
Affiliation(s)
- Samuele Ambrosini
- Center for Molecular Cardiology, University of Zürich, Zurich, Switzerland
| | - Shafeeq A Mohammed
- Center for Molecular Cardiology, University of Zürich, Zurich, Switzerland
| | - Sarah Costantino
- Center for Molecular Cardiology, University of Zürich, Zurich, Switzerland
| | - Francesco Paneni
- Center for Molecular Cardiology, University of Zürich, Zurich, Switzerland - .,Department of Cardiology, University Heart Center, University Hospital Zurich, Zurich, Switzerland.,Department of Research and Education, University Hospital Zurich, Zurich, Switzerland
| |
Collapse
|
44
|
Balaji E V, Kumar N, Satarker S, Nampoothiri M. Zinc as a plausible epigenetic modulator of glioblastoma multiforme. Eur J Pharmacol 2020; 887:173549. [PMID: 32926916 DOI: 10.1016/j.ejphar.2020.173549] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Revised: 08/26/2020] [Accepted: 09/09/2020] [Indexed: 01/04/2023]
Abstract
Glioblastoma Multiforme (GBM) is an aggressive brain tumor (WHO grade 4 astrocytoma) with unknown causes and is associated with a reduced life expectancy. The available treatment options namely radiotherapy, surgery and chemotherapy have failed to improve life expectancy. Out of the various therapeutic approaches, epigenetic therapy is one of the most studied. Epigenetic therapy is involved in the effective treatment of GBM by inhibiting DNA methyltransferase, histone deacetylation and non-coding RNA. It also promotes the expression of the tumor suppressor gene and is involved in the suppression of the oncogene. Various targets are being studied to implement proper epigenetic regulation to control GBM effectively. Zinc is one of the micronutrients which is considered to maintain epigenetic regulation by promoting the proper DNA folding, protecting genetic material from the oxidative damage and controlling the enzyme activation involved in the epigenetic regulation. Here, we are discussing the importance of zinc in regulating the epigenetic modifications and assessing its role in glioblastoma research. The discussion also highlights the importance of artificial intelligence using epigenetics for envisaging the glioma progression, diagnosis and its management.
Collapse
Affiliation(s)
- Vignesh Balaji E
- Department of Pharmacology, Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education, Manipal, 576104, India
| | - Nitesh Kumar
- Department of Pharmacology, Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education, Manipal, 576104, India
| | - Sairaj Satarker
- Department of Pharmacology, Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education, Manipal, 576104, India
| | - Madhavan Nampoothiri
- Department of Pharmacology, Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education, Manipal, 576104, India.
| |
Collapse
|
45
|
Togo R, Watanabe H, Ogawa T, Haseyama M. Deep convolutional neural network-based anomaly detection for organ classification in gastric X-ray examination. Comput Biol Med 2020; 123:103903. [PMID: 32658795 DOI: 10.1016/j.compbiomed.2020.103903] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Revised: 06/18/2020] [Accepted: 06/29/2020] [Indexed: 12/01/2022]
Abstract
AIM The aim of this study was to determine whether our deep convolutional neural network-based anomaly detection model can distinguish differences in esophagus images and stomach images obtained from gastric X-ray examinations. METHODS A total of 6012 subjects were analyzed as our study subjects. Since the number of esophagus X-ray images is much smaller than the number of gastric X-ray images taken in X-ray examinations, we took an anomaly detection approach to realize the task of organ classification. We constructed a deep autoencoding gaussian mixture model (DAGMM) with a convolutional autoencoder architecture. The trained model can produce an anomaly score for a given test X-ray image. For comparison, the original DAGMM, AnoGAN, and a One-Class Support Vector Machine (OCSVM) that were trained with features obtained by a pre-trained Inception-v3 network were used. RESULTS Sensitivity, specificity, and the calculated harmonic mean of the proposed method were 0.956, 0.980, and 0.968, respectively. Those of the original DAGMM were 0.932, 0.883, and 0.907, respectively. Those of AnoGAN were 0.835, 0.833, and 0.834, respectively, and those of OCSVM were 0.932, 0.935, and 0.934, respectively. Experimental results showed the effectiveness of the proposed method for an organ classification task. CONCLUSION Our deep convolutional neural network-based anomaly detection model has shown the potential for clinical use in organ classification.
Collapse
Affiliation(s)
- Ren Togo
- Education and Research Center for Mathematical and Data Science, Hokkaido University, N-12, W-7, Kita-ku, Sapporo, 060-0812, Japan.
| | - Haruna Watanabe
- Faculty of Information Science and Technology, Hokkaido University, N-14, W-9, Kita-Ku, Sapporo, 060-0814, Japan.
| | - Takahiro Ogawa
- Faculty of Information Science and Technology, Hokkaido University, N-14, W-9, Kita-Ku, Sapporo, 060-0814, Japan.
| | - Miki Haseyama
- Faculty of Information Science and Technology, Hokkaido University, N-14, W-9, Kita-Ku, Sapporo, 060-0814, Japan.
| |
Collapse
|
46
|
Rauschert S, Raubenheimer K, Melton PE, Huang RC. Machine learning and clinical epigenetics: a review of challenges for diagnosis and classification. Clin Epigenetics 2020; 12:51. [PMID: 32245523 PMCID: PMC7118917 DOI: 10.1186/s13148-020-00842-4] [Citation(s) in RCA: 96] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Accepted: 03/22/2020] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Machine learning is a sub-field of artificial intelligence, which utilises large data sets to make predictions for future events. Although most algorithms used in machine learning were developed as far back as the 1950s, the advent of big data in combination with dramatically increased computing power has spurred renewed interest in this technology over the last two decades. MAIN BODY Within the medical field, machine learning is promising in the development of assistive clinical tools for detection of e.g. cancers and prediction of disease. Recent advances in deep learning technologies, a sub-discipline of machine learning that requires less user input but more data and processing power, has provided even greater promise in assisting physicians to achieve accurate diagnoses. Within the fields of genetics and its sub-field epigenetics, both prime examples of complex data, machine learning methods are on the rise, as the field of personalised medicine is aiming for treatment of the individual based on their genetic and epigenetic profiles. CONCLUSION We now have an ever-growing number of reported epigenetic alterations in disease, and this offers a chance to increase sensitivity and specificity of future diagnostics and therapies. Currently, there are limited studies using machine learning applied to epigenetics. They pertain to a wide variety of disease states and have used mostly supervised machine learning methods.
Collapse
Affiliation(s)
- S Rauschert
- Telethon Kids Institute, University of Western Australia, Nedlands, Perth, Western Australia.
| | - K Raubenheimer
- School of Medicine, Notre Dame University, Fremantle, Western Australia
| | - P E Melton
- Centre for Genetic Origins of Health and Disease, The University of Western Australia and Curtin University, Perth, Western Australia
- School of Pharmacy and Biomedical Sciences, Curtin University, Bentley, Western Australia
- Menzies Institute for Medical Research, University of Tasmania, Hobart, Tasmania, Australia
| | - R C Huang
- Telethon Kids Institute, University of Western Australia, Nedlands, Perth, Western Australia
| |
Collapse
|
47
|
Xu R, Li S, Guo S, Zhao Q, Abramson MJ, Li S, Guo Y. Environmental temperature and human epigenetic modifications: A systematic review. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2020; 259:113840. [PMID: 31884209 DOI: 10.1016/j.envpol.2019.113840] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Revised: 11/26/2019] [Accepted: 12/16/2019] [Indexed: 05/28/2023]
Abstract
The knowledge about the effects of environmental temperature on human epigenome is a potential key to understand the health impacts of temperature and to guide acclimation under climate change. We performed a systematic review on the epidemiological studies that have evaluated the association between environmental temperature and human epigenetic modifications. We identified seven original articles on this topic published between 2009 and 2019, including six cohort studies and one cross-sectional study. They focused on DNA methylation in elderly people (blood sample) or infants (placenta sample), with sample size ranging from 306 to 1798. These studies were conducted in relatively low temperature setting (median/mean temperature: 0.8-13 °C), and linear models were used to evaluate temperature-DNA methylation association over short period (≤28 days). It has been reported that short-term ambient temperature could affect global human DNA methylation. A total of 15 candidate genes (ICAM-1, CRAT, F3, TLR-2, iNOS, ZKSCAN4, ZNF227, ZNF595, ZNF597, ZNF668, CACNA1H, AIRE, MYEOV2, NKX1-2 and CCDC15) with methylation status associated with ambient temperature have been identified. DNA methylation on ZKSCAN4, ICAM-1 partly mediated the effect of short-term cold temperature on high blood pressure and ICAM-1 protein (related to cardiovascular events), respectively. In summary, epidemiological evidence about the impacts of environment temperature on human epigenetics remains scarce and limited to short-term linear effect of cold temperature on DNA methylation in elderly people and infants. More studies are needed to broaden our understanding of temperature related epigenetic changes, especially under a changing climate.
Collapse
Affiliation(s)
- Rongbin Xu
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, 3004, Australia
| | - Shuai Li
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC, 3010, Australia; Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, CB1 8RN, UK
| | - Shuaijun Guo
- Centre for Community Child Health, Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC, 3052, Australia
| | - Qi Zhao
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, 3004, Australia
| | - Michael J Abramson
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, 3004, Australia
| | - Shanshan Li
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, 3004, Australia
| | - Yuming Guo
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, 3004, Australia.
| |
Collapse
|
48
|
Schiano C, Benincasa G, Franzese M, Della Mura N, Pane K, Salvatore M, Napoli C. Epigenetic-sensitive pathways in personalized therapy of major cardiovascular diseases. Pharmacol Ther 2020; 210:107514. [PMID: 32105674 DOI: 10.1016/j.pharmthera.2020.107514] [Citation(s) in RCA: 84] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
The complex pathobiology underlying cardiovascular diseases (CVDs) has yet to be explained. Aberrant epigenetic changes may result from alterations in enzymatic activities, which are responsible for putting in and/or out the covalent groups, altering the epigenome and then modulating gene expression. The identification of novel individual epigenetic-sensitive trajectories at single cell level might provide additional opportunities to establish predictive, diagnostic and prognostic biomarkers as well as drug targets in CVDs. To date, most of studies investigated DNA methylation mechanism and miRNA regulation as epigenetics marks. During atherogenesis, big epigenetic changes in DNA methylation and different ncRNAs, such as miR-93, miR-340, miR-433, miR-765, CHROME, were identified into endothelial cells, smooth muscle cells, and macrophages. During man development, lipid metabolism, inflammation and homocysteine homeostasis, alter vascular transcriptional mechanism of fundamental genes such as ABCA1, SREBP2, NOS, HIF1. At histone level, increased HDAC9 was associated with matrix metalloproteinase 1 (MMP1) and MMP2 expression in pro-inflammatory macrophages of human carotid plaque other than to have a positive effect on toll like receptor signaling and innate immunity. HDAC9 deficiency promoted inflammation resolution and reverse cholesterol transport, which might block atherosclerosis progression and promote lesion regression. Here, we describe main human epigenetic mechanisms involved in atherosclerosis, coronary heart disease, ischemic stroke, peripheral artery disease; cardiomyopathy and heart failure. Different epigenetics mechanisms are activated, such as regulation by circular RNAs, as MICRA, and epitranscriptomics at RNA level. Moreover, in order to open new frontiers for precision medicine and personalized therapy, we offer a panoramic view on the most innovative bioinformatic tools designed to identify putative genes and molecular networks underlying CVDs in man.
Collapse
Affiliation(s)
- Concetta Schiano
- Clinical Department of Internal Medicine and Specialistics, Department of Advanced Clinical and Surgical Sciences, University of Campania "Luigi Vanvitelli", Naples, Italy.
| | - Giuditta Benincasa
- Clinical Department of Internal Medicine and Specialistics, Department of Advanced Clinical and Surgical Sciences, University of Campania "Luigi Vanvitelli", Naples, Italy
| | | | | | | | | | - Claudio Napoli
- Clinical Department of Internal Medicine and Specialistics, Department of Advanced Clinical and Surgical Sciences, University of Campania "Luigi Vanvitelli", Naples, Italy; IRCCS SDN, Naples, Italy
| |
Collapse
|
49
|
Sieg M, Richter G, Schaefer AS, Kruppa J. Detection of suspicious interactions of spiking covariates in methylation data. BMC Bioinformatics 2020; 21:36. [PMID: 32000657 PMCID: PMC6993406 DOI: 10.1186/s12859-020-3364-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 01/14/2020] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND In methylation analyses like epigenome-wide association studies, a high amount of biomarkers is tested for an association between the measured continuous outcome and different covariates. In the case of a continuous covariate like smoking pack years (SPY), a measure of lifetime exposure to tobacco toxins, a spike at zero can occur. Hence, all non-smokers are generating a peak at zero, while the smoking patients are distributed over the other SPY values. Additionally, the spike might also occur on the right side of the covariate distribution, if a category "heavy smoker" is designed. Here, we will focus on methylation data with a spike at the left or the right of the distribution of a continuous covariate. After the methylation data is generated, analysis is usually performed by preprocessing, quality control, and determination of differentially methylated sites, often performed in pipeline fashion. Hence, the data is processed in a string of methods, which are available in one software package. The pipelines can distinguish between categorical covariates, i.e. for group comparisons or continuous covariates, i.e. for linear regression. The differential methylation analysis is often done internally by a linear regression without checking its inherent assumptions. A spike in the continuous covariate is ignored and can cause biased results. RESULTS We have reanalysed five data sets, four freely available from ArrayExpress, including methylation data and smoking habits reported by smoking pack years. Therefore, we generated an algorithm to check for the occurrences of suspicious interactions between the values associated with the spike position and the non-spike positions of the covariate. Our algorithm helps to decide if a suspicious interaction can be found and further investigations should be carried out. This is mostly important, because the information on the differentially methylated sites will be used for post-hoc analyses like pathway analyses. CONCLUSIONS We help to check for the validation of the linear regression assumptions in a methylation analysis pipeline. These assumptions should also be considered for machine learning approaches. In addition, we are able to detect outliers in the continuous covariate. Therefore, more statistical robust results should be produced in methylation analysis using our algorithm as a preprocessing step.
Collapse
Affiliation(s)
- Miriam Sieg
- Charité - University Medicine, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Institute of Biometry and Clinical Epidemiology, Charitéplatz 1, Berlin, 10117 Germany
- Berlin Institute of Health (BIH), Anna-Louisa-Karsch-Strane 2, Berlin, 10178 Germany
| | - Gesa Richter
- Berlin Institute of Health (BIH), Anna-Louisa-Karsch-Strane 2, Berlin, 10178 Germany
- Department of Periodontology and Synoptic Dentistry, Institute of Dental, Oral and Maxillary Medicine, Charité - University Medicine, Charitéplatz 1, Berlin, 10117 Germany
| | - Arne S. Schaefer
- Berlin Institute of Health (BIH), Anna-Louisa-Karsch-Strane 2, Berlin, 10178 Germany
- Department of Periodontology and Synoptic Dentistry, Institute of Dental, Oral and Maxillary Medicine, Charité - University Medicine, Charitéplatz 1, Berlin, 10117 Germany
| | - Jochen Kruppa
- Charité - University Medicine, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Institute of Biometry and Clinical Epidemiology, Charitéplatz 1, Berlin, 10117 Germany
- Berlin Institute of Health (BIH), Anna-Louisa-Karsch-Strane 2, Berlin, 10178 Germany
| |
Collapse
|
50
|
Accurate Identification of Tomograms of Lung Nodules Using CNN: Influence of the Optimizer, Preprocessing and Segmentation. LECTURE NOTES IN COMPUTER SCIENCE 2020. [PMCID: PMC7297567 DOI: 10.1007/978-3-030-49076-8_23] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
The diagnosis of pulmonary nodules plays an important role in the treatment of lung cancer, thus improving the diagnosis is the primary concern. This article shows a comparison of the results in the identification of computed tomography scans with pulmonary nodules, through the use of different optimizers (Adam and Nadam); the effect of the use of pre-processing and segmentation techniques using CNNs is also thoroughly explored. The dataset employed was Lung TIME which is publicly available. When no preprocessing or segmentation was applied, training accuracy above 90.24% and test accuracy above 86.8% were obtained. In contrast, when segmentation was applied without preprocessing, a training accuracy above 97.19% and test accuracy above 95.07% were reached. On the other hand, when preprocessing and segmentation was applied, a training accuracy above 96.41% and test accuracy above 94.71% were achieved. On average, the Adam optimizer scored a training accuracy of 96.17% and a test accuracy of 95.23%. Whereas, the Nadam optimizer obtained 96.25% and 95.2%, respectively. It is concluded that CNN has a good performance even when working with images with noise. The performance of the network was similar when working with preprocessing and segmentation than when using only segmentation. Also, it can be inferred that, the application of preprocessing and segmentation is an excellent option when it is required to improve accuracy in CNNs.
Collapse
|