1
|
Gómez-Valadés A, Martínez-Tomás R, García-Herranz S, Bjørnerud A, Rincón M. Early detection of mild cognitive impairment through neuropsychological tests in population screenings: a decision support system integrating ontologies and machine learning. Front Neuroinform 2024; 18:1378281. [PMID: 39478874 PMCID: PMC11522961 DOI: 10.3389/fninf.2024.1378281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 10/04/2024] [Indexed: 11/02/2024] Open
Abstract
Machine learning (ML) methodologies for detecting Mild Cognitive Impairment (MCI) are progressively gaining prevalence to manage the vast volume of processed information. Nevertheless, the black-box nature of ML algorithms and the heterogeneity within the data may result in varied interpretations across distinct studies. To avoid this, in this proposal, we present the design of a decision support system that integrates a machine learning model represented using the Semantic Web Rule Language (SWRL) in an ontology with specialized knowledge in neuropsychological tests, the NIO ontology. The system's ability to detect MCI subjects was evaluated on a database of 520 neuropsychological assessments conducted in Spanish and compared with other well-established ML methods. Using the F2 coefficient to minimize false negatives, results indicate that the system performs similarly to other well-established ML methods (F2TE2 = 0.830, only below bagging, F2BAG = 0.832) while exhibiting other significant attributes such as explanation capability and data standardization to a common framework thanks to the ontological part. On the other hand, the system's versatility and ease of use were demonstrated with three additional use cases: evaluation of new cases even if the acquisition stage is incomplete (the case records have missing values), incorporation of a new database into the integrated system, and use of the ontology capabilities to relate different domains. This makes it a useful tool to support physicians and neuropsychologists in population-based screenings for early detection of MCI.
Collapse
Affiliation(s)
- Alba Gómez-Valadés
- Department of Artificial Intelligence, Universidad Nacional de Educación a Distancia (UNED), Madrid, Spain
| | - Rafael Martínez-Tomás
- Department of Artificial Intelligence, Universidad Nacional de Educación a Distancia (UNED), Madrid, Spain
| | | | - Atle Bjørnerud
- Computational Radiology and Artificial Intelligence Unit, Department of Physics and Computational Radiology, Clinic for Radiology and Nuclear Medicine, Oslo University Hospital, Oslo, Norway
- Department of Physics, University of Oslo, Oslo, Norway
| | - Mariano Rincón
- Department of Artificial Intelligence, Universidad Nacional de Educación a Distancia (UNED), Madrid, Spain
| |
Collapse
|
2
|
Badwal AK, Singh S. A comprehensive review on the current status of CRISPR based clinical trials for rare diseases. Int J Biol Macromol 2024; 277:134097. [PMID: 39059527 DOI: 10.1016/j.ijbiomac.2024.134097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2024] [Revised: 07/03/2024] [Accepted: 07/20/2024] [Indexed: 07/28/2024]
Abstract
A considerable fraction of population in the world suffers from rare diseases. Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) and its related Cas proteins offer a modern form of curative gene therapy for treating the rare diseases. Hereditary transthyretin amyloidosis, hereditary angioedema, duchenne muscular dystrophy and Rett syndrome are a few examples of such rare diseases. CRISPR/Cas9, for example, has been used in the treatment of β-thalassemia and sickle cell disease (Frangoul et al., 2021; Pavani et al., 2021) [1,2]. Neurological diseases such as Huntington's have also been focused in some studies involving CRISPR/Cas (Yang et al., 2017; Yan et al., 2023) [3,4]. Delivery of these biologicals via vector and non vector mediated methods depends on the type of target cells, characteristics of expression, time duration of expression, size of foreign genetic material etc. For instance, retroviruses find their applicability in case of ex vivo delivery in somatic cells due to their ability to integrate in the host genome. These have been successfully used in gene therapy involving X-SCID patients although, incidence of inappropriate activation has been reported. On the other hand, ex vivo gene therapy for β-thalassemia involved use of BB305 lentiviral vector for high level expression of CRISPR biological in HSCs. The efficacy and safety of these biologicals will decide their future application as efficient genome editing tools as they go forward in further stages of human clinical trials. This review focuses on CRISPR/Cas based therapies which are at various stages of clinical trials for treatment of rare diseases and the constraints and ethical issues associated with them.
Collapse
Affiliation(s)
- Amneet Kaur Badwal
- Department of Biotechnology, National Institute of Pharmaceutical Education and Research, S.A.S. Nagar, Mohali 160062, Punjab, India
| | - Sushma Singh
- Department of Biotechnology, National Institute of Pharmaceutical Education and Research, S.A.S. Nagar, Mohali 160062, Punjab, India.
| |
Collapse
|
3
|
Muhunzi D, Kitambala L, Mashauri HL. Big data analytics in the healthcare sector: Opportunities and challenges in developing countries. A literature review. Health Informatics J 2024; 30:14604582241294217. [PMID: 39434249 DOI: 10.1177/14604582241294217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2024]
Abstract
Background: Despite the ongoing efforts to digitalize the healthcare sector in developing countries, the full adoption of big data analytics in healthcare settings is yet to be attained Exploring opportunities and challenges encountered is essential for designing and implementing effective interventional strategies. Objective: Exploring opportunities and challenges towards integrating big data analytics technologies in the healthcare industry in developing countries. Methodology: This was a narrative review study design. A literature search on different databases was conducted including PubMed, ScienceDirect, MEDLINE, Scopus, and Google Scholar. Articles with predetermined keywords and written in English were included. Results: Big data analytics finds its application in population health management and clinical decision-support systems even in developing countries. The major challenges towards the integration of big data analytics in the healthcare sector in developing countries include fragmentation of healthcare data and lack of interoperability, data security, privacy and confidentiality concerns, limited resources and inadequate regulatory and policy frameworks for governing big data analytics technologies and limited reliable power and internet infrastructures. Conclusion: Digitalization of healthcare delivery in developing countries faces several significant challenges. However, the integration of big data analytics can potentially open new avenues for enhancing healthcare delivery with cost-effective benefits.
Collapse
Affiliation(s)
- David Muhunzi
- Department of Internal Medicine, Muhimbili University of Health and Allied Sciences(MUHAS), Dar es Salaam, Tanzania
| | - Lucy Kitambala
- Department of Internal Medicine, Muhimbili University of Health and Allied Sciences(MUHAS), Dar es Salaam, Tanzania
| | - Harold L Mashauri
- Department of Epidemiology, Institute of Public Health, Kilimanjaro Christian Medical University College, Moshi, Tanzania
- Department of Internal Medicine, Kilimanjaro Christian Medical University College, Moshi, Tanzania
| |
Collapse
|
4
|
Guermazi D, Shah A, Yumeen S, Vance T, Saliba E. Skinformatics: Navigating the big data landscape of dermatology. J Eur Acad Dermatol Venereol 2024. [PMID: 39254192 DOI: 10.1111/jdv.20319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Accepted: 08/20/2024] [Indexed: 09/11/2024]
Abstract
Big data and associated approaches to analyse it are on the rise, especially in healthcare settings. This growth is also seen with unique applications in the field of dermatology. While big data offer a plethora of opportunity for improving our current understanding of disease and ability to deliver care, as with any technology innovation, the potential pitfalls should be addressed. In this piece, we highlight opportunities and challenges associated with big data in dermatology. Opportunities include large and novel data sources that may offer a wealth of information, automated detection, classification and diagnostics and improved public health monitoring. Challenges include data quality, issues of interpretability and disparities within artificial intelligence (AI) training data sets. Clinicians and researchers in the field should be aware of these developments within the field of big data to understand how best it may be used toward improving patient care and health outcomes, particularly in the field of dermatology.
Collapse
Affiliation(s)
- Dorra Guermazi
- Brown University, Division of Biology and Medicine, Providence, Rhode Island, USA
| | - Asghar Shah
- Brown University, Division of Biology and Medicine, Providence, Rhode Island, USA
| | - Sara Yumeen
- Department of Dermatology, Warren Alpert Medical School of Brown University, Providence, Rhode Island, USA
| | - Terrence Vance
- Department of Dermatology, Warren Alpert Medical School of Brown University, Providence, Rhode Island, USA
| | - Elie Saliba
- Department of Dermatology, Warren Alpert Medical School of Brown University, Providence, Rhode Island, USA
- Department of Dermatology, Gilbert and Rose-Marie Chagoury School of Medicine, Lebanese American University, Beirut, Lebanon
| |
Collapse
|
5
|
Badr Y, Abdul Kader L, Shamayleh A. The Use of Big Data in Personalized Healthcare to Reduce Inventory Waste and Optimize Patient Treatment. J Pers Med 2024; 14:383. [PMID: 38673011 PMCID: PMC11051308 DOI: 10.3390/jpm14040383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2024] [Revised: 03/18/2024] [Accepted: 03/27/2024] [Indexed: 04/28/2024] Open
Abstract
Precision medicine is emerging as an integral component in delivering care in the health system leading to better diagnosis and optimizing the treatment of patients. This growth is due to the new technologies in the data science field that have led to the ability to model complex diseases. Precision medicine is based on genomics and omics facilities that provide information about molecular proteins and biomarkers that could lead to discoveries for the treatment of patients suffering from various diseases. However, the main problems related to precision medicine are the ability to analyze, interpret, and integrate data. Hence, there is a lack of smooth transition from conventional to precision medicine. Therefore, this work reviews the limitations and discusses the benefits of overcoming them if big data tools are utilized and merged with precision medicine. The results from this review indicate that most of the literature focuses on the challenges rather than providing flexible solutions to adapt big data to precision medicine. As a result, this paper adds to the literature by proposing potential technical, educational, and infrastructural solutions in big data for a better transition to precision medicine.
Collapse
Affiliation(s)
- Yara Badr
- Department of Biomedical Engineering, American University of Sharjah, Sharjah 26666, United Arab Emirates; (Y.B.); (L.A.K.)
| | - Lamis Abdul Kader
- Department of Biomedical Engineering, American University of Sharjah, Sharjah 26666, United Arab Emirates; (Y.B.); (L.A.K.)
| | - Abdulrahim Shamayleh
- Department of Industrial Engineering, American University of Sharjah, Sharjah 26666, United Arab Emirates
| |
Collapse
|
6
|
Irrera O, Marchesin S, Silvello G. MetaTron: advancing biomedical annotation empowering relation annotation and collaboration. BMC Bioinformatics 2024; 25:112. [PMID: 38486137 PMCID: PMC10941452 DOI: 10.1186/s12859-024-05730-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 03/04/2024] [Indexed: 03/17/2024] Open
Abstract
BACKGROUND The constant growth of biomedical data is accompanied by the need for new methodologies to effectively and efficiently extract machine-readable knowledge for training and testing purposes. A crucial aspect in this regard is creating large, often manually or semi-manually, annotated corpora vital for developing effective and efficient methods for tasks like relation extraction, topic recognition, and entity linking. However, manual annotation is expensive and time-consuming especially if not assisted by interactive, intuitive, and collaborative computer-aided tools. To support healthcare experts in the annotation process and foster annotated corpora creation, we present MetaTron. MetaTron is an open-source and free-to-use web-based annotation tool to annotate biomedical data interactively and collaboratively; it supports both mention-level and document-level annotations also integrating automatic built-in predictions. Moreover, MetaTron enables relation annotation with the support of ontologies, functionalities often overlooked by off-the-shelf annotation tools. RESULTS We conducted a qualitative analysis to compare MetaTron with a set of manual annotation tools including TeamTat, INCEpTION, LightTag, MedTAG, and brat, on three sets of criteria: technical, data, and functional. A quantitative evaluation allowed us to assess MetaTron performances in terms of time and number of clicks to annotate a set of documents. The results indicated that MetaTron fulfills almost all the selected criteria and achieves the best performances. CONCLUSIONS MetaTron stands out as one of the few annotation tools targeting the biomedical domain supporting the annotation of relations, and fully customizable with documents in several formats-PDF included, as well as abstracts retrieved from PubMed, Semantic Scholar, and OpenAIRE. To meet any user need, we released MetaTron both as an online instance and as a Docker image locally deployable.
Collapse
Affiliation(s)
- Ornella Irrera
- Department of Information Engineering, University of Padova, Padua, Italy.
| | - Stefano Marchesin
- Department of Information Engineering, University of Padova, Padua, Italy
| | - Gianmaria Silvello
- Department of Information Engineering, University of Padova, Padua, Italy
| |
Collapse
|
7
|
Kumar P, Paul RK, Roy HS, Yeasin M, Ajit, Paul AK. Big Data Analysis in Computational Biology and Bioinformatics. Methods Mol Biol 2024; 2719:181-197. [PMID: 37803119 DOI: 10.1007/978-1-0716-3461-5_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/08/2023]
Abstract
Advancements in high-throughput technologies, genomics, transcriptomics, and metabolomics play an important role in obtaining biological information about living organisms. The field of computational biology and bioinformatics has experienced significant growth with the advent of high-throughput sequencing technologies and other high-throughput techniques. The resulting large amounts of data present both opportunities and challenges for data analysis. Big data analysis has become essential for extracting meaningful insights from the massive amount of data. In this chapter, we provide an overview of the current status of big data analysis in computational biology and bioinformatics. We discuss the various aspects of big data analysis, including data acquisition, storage, processing, and analysis. We also highlight some of the challenges and opportunities of big data analysis in this area of research. Despite the challenges, big data analysis presents significant opportunities like development of efficient and fast computing algorithms for advancing our understanding of biological processes, identifying novel biomarkers for breeding research and developments, predicting disease, and identifying potential drug targets for drug development programs.
Collapse
Affiliation(s)
- Prakash Kumar
- ICAR-Indian Agricultural Statistics Research Institute, Pusa, New Delhi, India
| | - Ranjit Kumar Paul
- ICAR-Indian Agricultural Statistics Research Institute, Pusa, New Delhi, India
| | - Himadri Shekhar Roy
- ICAR-Indian Agricultural Statistics Research Institute, Pusa, New Delhi, India
| | - Md Yeasin
- ICAR-Indian Agricultural Statistics Research Institute, Pusa, New Delhi, India
| | - Ajit
- ICAR-Indian Agricultural Statistics Research Institute, Pusa, New Delhi, India
| | - Amrit Kumar Paul
- ICAR-Indian Agricultural Statistics Research Institute, Pusa, New Delhi, India
| |
Collapse
|
8
|
Pinzi L, Rastelli G. Trends and Applications in Computationally Driven Drug Repurposing. Int J Mol Sci 2023; 24:16511. [PMID: 38003701 PMCID: PMC10671888 DOI: 10.3390/ijms242216511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Accepted: 11/06/2023] [Indexed: 11/26/2023] Open
Abstract
Drug repurposing is a widely used approach originally developed to aid in the identification of new uses of already existing drugs outside the scope of the original medical indication [...].
Collapse
Affiliation(s)
| | - Giulio Rastelli
- Department of Life Sciences, University of Modena and Reggio Emilia, Via Giuseppe Campi 103, 41125 Modena, Italy;
| |
Collapse
|
9
|
Mao J, Meng F, Wang G. Editorial: Big data for biomedical research of inflammatory diseases. Front Pharmacol 2023; 14:1287616. [PMID: 37799974 PMCID: PMC10548542 DOI: 10.3389/fphar.2023.1287616] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2023] [Accepted: 09/08/2023] [Indexed: 10/07/2023] Open
Affiliation(s)
- Jingxin Mao
- Chongqing Medical and Pharmaceutical College, Chongqing, China
- College of Pharmaceutical Sciences, Southwest University, Chongqing, China
| | - Fancheng Meng
- College of Pharmaceutical Sciences, Southwest University, Chongqing, China
| | - Guoze Wang
- The Key Laboratory of Environmental Pollution Monitoring and Disease Control, School of Public Health, Ministry of Education, Guizhou Medical University, Guiyang, China
| |
Collapse
|
10
|
Yuan Q, Zhao WL, Qin B. Big data and variceal rebleeding prediction in cirrhosis patients. Artif Intell Gastroenterol 2023; 4:1-9. [DOI: 10.35712/aig.v4.i1.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/08/2023] [Revised: 02/03/2023] [Accepted: 03/10/2023] [Indexed: 06/08/2023] Open
Abstract
Big data has convincing merits in developing risk stratification strategies for diseases. The 6 “V”s of big data, namely, volume, velocity, variety, veracity, value, and variability, have shown promise for real-world scenarios. Big data can be applied to analyze health data and advance research in preclinical biology, medicine, and especially disease initiation, development, and control. A study design comprises data selection, inclusion and exclusion criteria, standard confirmation and cohort establishment, follow-up strategy, and events of interest. The development and efficiency verification of a prognosis model consists of deciding the data source, taking previous models as references while selecting candidate predictors, assessing model performance, choosing appropriate statistical methods, and model optimization. The model should be able to inform disease development and outcomes, such as predicting variceal rebleeding in patients with cirrhosis. Our work has merits beyond those of other colleagues with respect to cirrhosis patient screening and data source regarding variceal bleeding.
Collapse
Affiliation(s)
- Quan Yuan
- Department of Gastroenterology, The First Affiliated Hospital of Chongqing Medical University, Chongqing 400042, China
| | - Wen-Long Zhao
- College of Medical Informatics, Chongqing Medical University, Chongqing 400016, China
- Medical Data Science Academy, Chongqing 400016, China
- Chongqing Engineering Research Centre for Clinical Big-data and Drug Evaluation, Chongqing 400016, China
| | - Bo Qin
- Department of Infectious Diseases, The First Affiliated Hospital of Chongqing Medical University, Chongqing 400042, China
| |
Collapse
|
11
|
Safarlou CW, Jongsma KR, Vermeulen R, Bredenoord AL. The ethical aspects of exposome research: a systematic review. EXPOSOME 2023; 3:osad004. [PMID: 37745046 PMCID: PMC7615114 DOI: 10.1093/exposome/osad004] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
In recent years, exposome research has been put forward as the next frontier for the study of human health and disease. Exposome research entails the analysis of the totality of environmental exposures and their corresponding biological responses within the human body. Increasingly, this is operationalized by big-data approaches to map the effects of internal as well as external exposures using smart sensors and multiomics technologies. However, the ethical implications of exposome research are still only rarely discussed in the literature. Therefore, we conducted a systematic review of the academic literature regarding both the exposome and underlying research fields and approaches, to map the ethical aspects that are relevant to exposome research. We identify five ethical themes that are prominent in ethics discussions: the goals of exposome research, its standards, its tools, how it relates to study participants, and the consequences of its products. Furthermore, we provide a number of general principles for how future ethics research can best make use of our comprehensive overview of the ethical aspects of exposome research. Lastly, we highlight three aspects of exposome research that are most in need of ethical reflection: the actionability of its findings, the epidemiological or clinical norms applicable to exposome research, and the meaning and action-implications of bias.
Collapse
Affiliation(s)
- Caspar W. Safarlou
- Department of Global Public Health and Bioethics, Julius Center for
Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The
Netherlands
| | - Karin R. Jongsma
- Department of Global Public Health and Bioethics, Julius Center for
Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The
Netherlands
| | - Roel Vermeulen
- Department of Global Public Health and Bioethics, Julius Center for
Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The
Netherlands
- Department of Population Health Sciences, Utrecht University,
Utrecht, The Netherlands
| | - Annelien L. Bredenoord
- Department of Global Public Health and Bioethics, Julius Center for
Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The
Netherlands
- Erasmus School of Philosophy, Erasmus University Rotterdam,
Rotterdam, The Netherlands
| |
Collapse
|
12
|
Trostle AJ, Li L, Kim SY, Wang J, Al-Ouran R, Yalamanchili HK, Liu Z, Wan YW. A Comprehensive and Integrative Approach to MeCP2 Disease Transcriptomics. Int J Mol Sci 2023; 24:ijms24065122. [PMID: 36982190 PMCID: PMC10049497 DOI: 10.3390/ijms24065122] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Revised: 02/15/2023] [Accepted: 02/16/2023] [Indexed: 03/11/2023] Open
Abstract
Mutations in MeCP2 result in a crippling neurological disease, but we lack a lucid picture of MeCP2′s molecular role. Individual transcriptomic studies yield inconsistent differentially expressed genes. To overcome these issues, we demonstrate a methodology to analyze all modern public data. We obtained relevant raw public transcriptomic data from GEO and ENA, then homogeneously processed it (QC, alignment to reference, differential expression analysis). We present a web portal to interactively access the mouse data, and we discovered a commonly perturbed core set of genes that transcends the limitations of any individual study. We then found functionally distinct, consistently up- and downregulated subsets within these genes and some bias to their location. We present this common core of genes as well as focused cores for up, down, cell fraction models, and some tissues. We observed enrichment for this mouse core in other species MeCP2 models and observed overlap with ASD models. By integrating and examining transcriptomic data at scale, we have uncovered the true picture of this dysregulation. The vast scale of these data enables us to analyze signal-to-noise, evaluate a molecular signature in an unbiased manner, and demonstrate a framework for future disease focused informatics work.
Collapse
Affiliation(s)
- Alexander J. Trostle
- Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital, Houston, TX 77030, USA
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Lucian Li
- Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital, Houston, TX 77030, USA
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Seon-Young Kim
- Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Howard Hughes Medical Institute, Houston, TX 77030, USA
| | - Jiasheng Wang
- Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital, Houston, TX 77030, USA
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
- Graduate Program in Quantitative and Computational Biosciences, Baylor College of Medicine, Houston, TX 77030, USA
| | - Rami Al-Ouran
- Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital, Houston, TX 77030, USA
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Hari Krishna Yalamanchili
- Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital, Houston, TX 77030, USA
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
- USDA/ARS Children’s Nutrition Research Center, Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Zhandong Liu
- Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital, Houston, TX 77030, USA
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
- Graduate Program in Quantitative and Computational Biosciences, Baylor College of Medicine, Houston, TX 77030, USA
- Correspondence: (Z.L.); (Y.-W.W.)
| | - Ying-Wooi Wan
- Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Howard Hughes Medical Institute, Houston, TX 77030, USA
- Correspondence: (Z.L.); (Y.-W.W.)
| |
Collapse
|
13
|
Bernal L, Pinzi L, Rastelli G. Identification of Promising Drug Candidates against Prostate Cancer through Computationally-Driven Drug Repurposing. Int J Mol Sci 2023; 24:ijms24043135. [PMID: 36834548 PMCID: PMC9964599 DOI: 10.3390/ijms24043135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 02/02/2023] [Accepted: 02/03/2023] [Indexed: 02/08/2023] Open
Abstract
Prostate cancer (PC) is one of the most common types of cancer in males. Although early stages of PC are generally associated with favorable outcomes, advanced phases of the disease present a significantly poorer prognosis. Moreover, currently available therapeutic options for the treatment of PC are still limited, being mainly focused on androgen deprivation therapies and being characterized by low efficacy in patients. As a consequence, there is a pressing need to identify alternative and more effective therapeutics. In this study, we performed large-scale 2D and 3D similarity analyses between compounds reported in the DrugBank database and ChEMBL molecules with reported anti-proliferative activity on various PC cell lines. The analyses included also the identification of biological targets of ligands with potent activity on PC cells, as well as investigations on the activity annotations and clinical data associated with the more relevant compounds emerging from the ligand-based similarity results. The results led to the prioritization of a set of drugs and/or clinically tested candidates potentially useful in drug repurposing against PC.
Collapse
Affiliation(s)
- Leonardo Bernal
- Department of Life Sciences, University of Modena and Reggio Emilia, Via Giuseppe Campi 103, 41125 Modena, Italy
- Clinical and Experimental Medicine PhD Program, University of Modena and Reggio Emilia, 41125 Modena, Italy
| | - Luca Pinzi
- Department of Life Sciences, University of Modena and Reggio Emilia, Via Giuseppe Campi 103, 41125 Modena, Italy
| | - Giulio Rastelli
- Department of Life Sciences, University of Modena and Reggio Emilia, Via Giuseppe Campi 103, 41125 Modena, Italy
- Correspondence: ; Tel.: +39-059-2058564
| |
Collapse
|
14
|
Pham TD, Ravi V, Fan C, Luo B, Sun XF. Tensor Decomposition of Largest Convolutional Eigenvalues Reveals Pathologic Predictive Power of RhoB in Rectal Cancer Biopsy. THE AMERICAN JOURNAL OF PATHOLOGY 2023; 193:579-590. [PMID: 36740183 DOI: 10.1016/j.ajpath.2023.01.007] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 12/29/2022] [Accepted: 01/06/2023] [Indexed: 02/05/2023]
Abstract
RhoB protein belongs to the Rho GTPase family, which plays an important role in governing cell signaling and tissue morphology. RhoB expression is known to have implications in pathologic processes of diseases. Investigation in the regulation and communication of this protein, detected by immunohistochemical staining on the microscope, is worth exploring to gain insightful information that may lead to identifying optimal disease treatment options. In particular, the role of RhoB in rectal cancer is not well discovered. Here, we report that methods of deep learning-based image analysis and the decomposition of multiway arrays discover the predictive factor of RhoB in two cohorts of patients with rectal cancer having survival rates of <5 and >5 years. The analysis results show distinctions between the tensor decomposition factors of the two cohorts.
Collapse
Affiliation(s)
- Tuan D Pham
- Center for Artificial Intelligence, Prince Mohammad Bin Fahd University, Khobar, Saudi Arabia.
| | - Vinayakumar Ravi
- Center for Artificial Intelligence, Prince Mohammad Bin Fahd University, Khobar, Saudi Arabia
| | - Chuanwen Fan
- Department of Clinical and Experimental Medicine, Linkoping University, Linkoping, Sweden
| | - Bin Luo
- Department of Clinical and Experimental Medicine, Linkoping University, Linkoping, Sweden; Department of Gastrointestinal Surgery, Sichuan Provincial People's Hospital, Chengdu, China
| | - Xiao-Feng Sun
- Department of Clinical and Experimental Medicine, Linkoping University, Linkoping, Sweden
| |
Collapse
|
15
|
Vinceti A, Trastulla L, Perron U, Raiconi A, Iorio F. A heuristic algorithm solving the mutual-exclusivity-sorting problem. Bioinformatics 2023; 39:btad016. [PMID: 36669133 PMCID: PMC9857977 DOI: 10.1093/bioinformatics/btad016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Revised: 11/14/2022] [Accepted: 01/11/2023] [Indexed: 01/13/2023] Open
Abstract
MOTIVATION Binary (or Boolean) matrices provide a common effective data representation adopted in several domains of computational biology, especially for investigating cancer and other human diseases. For instance, they are used to summarize genetic aberrations-copy number alterations or mutations-observed in cancer patient cohorts, effectively highlighting combinatorial relations among them. One of these is the tendency for two or more genes not to be co-mutated in the same sample or patient, i.e. a mutual-exclusivity trend. Exploiting this principle has allowed identifying new cancer driver protein-interaction networks and has been proposed to design effective combinatorial anti-cancer therapies rationally. Several tools exist to identify and statistically assess mutual-exclusive cancer-driver genomic events. However, these tools need to be equipped with robust/efficient methods to sort rows and columns of a binary matrix to visually highlight possible mutual-exclusivity trends. RESULTS Here, we formalize the mutual-exclusivity-sorting problem and present MutExMatSorting: an R package implementing a computationally efficient algorithm able to sort rows and columns of a binary matrix to highlight mutual-exclusivity patterns. Particularly, our algorithm minimizes the extent of collective vertical overlap between consecutive non-zero entries across rows while maximizing the number of adjacent non-zero entries in the same row. Here, we demonstrate that existing tools for mutual-exclusivity analysis are suboptimal according to these criteria and are outperformed by MutExMatSorting. AVAILABILITY AND IMPLEMENTATION https://github.com/AleVin1995/MutExMatSorting. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Alessandro Vinceti
- Computational Biology Research Centre, Human Technopole, 20157 Milano, Italy
| | - Lucia Trastulla
- Computational Biology Research Centre, Human Technopole, 20157 Milano, Italy
| | - Umberto Perron
- Computational Biology Research Centre, Human Technopole, 20157 Milano, Italy
| | - Andrea Raiconi
- Institute for Applied Mathematics “Mauro Picone”, National Research Council (IAC-CNR), 80131 Napoli, Italy
| | - Francesco Iorio
- Computational Biology Research Centre, Human Technopole, 20157 Milano, Italy
| |
Collapse
|
16
|
Mahajan HB, Rashid AS, Junnarkar AA, Uke N, Deshpande SD, Futane PR, Alkhayyat A, Alhayani B. Integration of Healthcare 4.0 and blockchain into secure cloud-based electronic health records systems. APPLIED NANOSCIENCE 2023; 13:2329-2342. [PMID: 35136707 PMCID: PMC8813573 DOI: 10.1007/s13204-021-02164-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Accepted: 10/09/2021] [Indexed: 12/23/2022]
Abstract
Since the last decade, cloud-based electronic health records (EHRs) have gained significant attention to enable remote patient monitoring. The recent development of Healthcare 4.0 using the Internet of Things (IoT) components and cloud computing to access medical operations remotely has gained the researcher's attention from a smart city perspective. Healthcare 4.0 mainly consisted of periodic medical data sensing, aggregation, data transmission, data sharing, and data storage. The sensitive and personal data of patients lead to several challenges while protecting it from hackers. Therefore storing, accessing, and sharing the patient medical information on the cloud needs security attention that data should not be compromised by the authorized user's components of E-healthcare systems. To achieve secure medical data storage, sharing, and accessing in cloud service provider, several cryptography algorithms are designed so far. However, such conventional solutions failed to achieve the trade-off between the requirements of EHR security solutions such as computational efficiency, service side verification, user side verifications, without the trusted third party, and strong security. Blockchain-based security solutions gained significant attention in the recent past due to the ability to provide strong security for data storage and sharing with the minimum computation efforts. The blockchain made focused on bitcoin technology among the researchers. Utilizing the blockchain which secure healthcare records management has been of recent interest. This paper presents the systematic study of modern blockchain-based solutions for securing medical data with or without cloud computing. We implement and evaluate the different methods using blockchain in this paper. According to the research studies, the research gaps, challenges, and future roadmap are the outcomes of this paper that boost emerging Healthcare 4.0 technology.
Collapse
Affiliation(s)
| | - Ameer Sardar Rashid
- grid.440843.fBusiness Information Technology, College of Administration and Economics, University of Sulaimani, Sulaimaniya, Iraq
| | - Aparna A. Junnarkar
- grid.32056.320000 0001 2190 9326PES Modern College of Engineering, Pune, India
| | - Nilesh Uke
- Trinity Academy of Engineering, Pune, India
| | - Sarita D. Deshpande
- grid.32056.320000 0001 2190 9326PES Modern College of Engineering, Pune, India
| | - Pravin R. Futane
- grid.32056.320000 0001 2190 9326Vishwakarma Institute of Information Technology (VIIT), Pune, India
| | - Ahmed Alkhayyat
- grid.444971.b0000 0004 6023 831XTechnical Engineering College, The Islamic University, Najaf, Iraq
| | - Bilal Alhayani
- grid.38575.3c0000 0001 2337 3561Department Electronics and Communication, Yildiz Technical University, Istanbul, Turkey
| |
Collapse
|
17
|
Liu J, Lang K, Tan S, Jie W, Zhu Y, Huang S, Huang W. A web-based database server using 43,710 public RNA-seq samples for the analysis of gene expression and alternative splicing in livestock animals. BMC Genomics 2022; 23:706. [PMID: 36253723 PMCID: PMC9575303 DOI: 10.1186/s12864-022-08881-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Accepted: 09/09/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Livestock animals is of great significance in agricultural production. However, the role of specific gene expression, especially alternative splicing in determining phenotype, is not well understood. The livestock research community needs a gene expression and alternative splicing database contributing to livestock genetic improvement. DESCRIPTION We report the construction of LivestockExp ( https://bioinfo.njau.edu.cn/livestockExp ), a web-based database server for the exploration of gene expression and alternative splicing using 43,710 uniformly processed RNA-seq samples from livestock animals and several relative species across six orders. The database is equipped with basic querying functions and multiple online analysis modules including differential/specific expression analysis, co-expression network analysis, and cross-species gene expression conservation analysis. In addition to the re-analysis of public datasets, users can upload personal datasets to perform co-analysis with public datasets. The database also offers a wide range of visualization tools and diverse links to external databases enabling users to efficiently explore the results and to gain additional insights. CONCLUSION LivestockExp covers by far the largest number of livestock animal species and RNA-seq samples and provides a valuable data resource and analysis platform for the convenient utilization of public RNA-seq datasets.
Collapse
Affiliation(s)
- Jinding Liu
- Bioinformatics Center, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing, 210095, Jiangsu, China. .,Department of Animal Science, Michigan State University, East Lansing, MI, 48824, USA.
| | - Kun Lang
- College of Information Management, Nanjing Agricultural University, Nanjing, 210095, Jiangsu, China
| | - Suxu Tan
- Department of Animal Science, Michigan State University, East Lansing, MI, 48824, USA
| | - Wencai Jie
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, 210023, Jiangsu, China
| | - Yihua Zhu
- College of Information Management, Nanjing Agricultural University, Nanjing, 210095, Jiangsu, China
| | - Shiqing Huang
- College of Information Management, Nanjing Agricultural University, Nanjing, 210095, Jiangsu, China.
| | - Wen Huang
- Department of Animal Science, Michigan State University, East Lansing, MI, 48824, USA.
| |
Collapse
|
18
|
Artificial Intelligence in Biological Sciences. Life (Basel) 2022; 12:life12091430. [PMID: 36143468 PMCID: PMC9505413 DOI: 10.3390/life12091430] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Revised: 08/25/2022] [Accepted: 09/10/2022] [Indexed: 12/03/2022] Open
Abstract
Artificial intelligence (AI), currently a cutting-edge concept, has the potential to improve the quality of life of human beings. The fields of AI and biological research are becoming more intertwined, and methods for extracting and applying the information stored in live organisms are constantly being refined. As the field of AI matures with more trained algorithms, the potential of its application in epidemiology, the study of host–pathogen interactions and drug designing widens. AI is now being applied in several fields of drug discovery, customized medicine, gene editing, radiography, image processing and medication management. More precise diagnosis and cost-effective treatment will be possible in the near future due to the application of AI-based technologies. In the field of agriculture, farmers have reduced waste, increased output and decreased the amount of time it takes to bring their goods to market due to the application of advanced AI-based approaches. Moreover, with the use of AI through machine learning (ML) and deep-learning-based smart programs, one can modify the metabolic pathways of living systems to obtain the best possible outputs with the minimal inputs. Such efforts can improve the industrial strains of microbial species to maximize the yield in the bio-based industrial setup. This article summarizes the potentials of AI and their application to several fields of biology, such as medicine, agriculture, and bio-based industry.
Collapse
|
19
|
Rico-González M, Illa J, Nakamura FY, Pino-Ortega J. Reducing Big Data to Principal Components for Position-Specific Futsal Training. Percept Mot Skills 2022; 129:1546-1562. [PMID: 35830493 DOI: 10.1177/00315125221115014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Since training/competition loads must be quickly assessed and interpreted to inform exercise prescription, big data should be simplified through multivariate data analysis. Our aim in the present research was to highlight which variables from big data analyses provided the most relevant information for describing the behavior of top-level futsal players in their different playing positions (i.e., goalkeeper, defenders, wingers, and forwards). We collected data from four top-level Spanish teams that participated in the final rounds of a national tournament. Through principal component analysis (PCA) we grouped 6-9 variables in 3-4 PCs that explained 62-81% of total variance, depending on playing positions. The most relevant variables explaining goalkeepers' performance were accelerations per minute, maximum acceleration (m/s2), 5-8 impacts per minute, and < 3 takeoffs per minute. Defenders' behavior was best explained by absolute distance covered from 6-12 km/h (m/min) and from 18-21 km/h (m/min), from 5-8 landings per minute, and > 8 landings per minute. Wingers' and pivots' performances were mainly explained by accelerations and decelerations, together with a high level of aerobic endurance (especially for wingers). These findings allow for individualized training and game analysis.
Collapse
Affiliation(s)
- Markel Rico-González
- Department of Didactics of Musical, Plastic and Corporal Expression, University of the Basque Country, Leioa, Spain.,BIOVETMED & SPORTSCI Research group, Department of Physical activity and Sport, Faculty of Sport Sciences, University of Murcia, San Javier, Murcia
| | - Jordi Illa
- Sports Performance Area, 523641Fútbol Club Barcelona, Barcelona, Spain
| | - Fabio Yuzo Nakamura
- Associate Graduate Program in Physical Education UPE/UFPB, João Pessoa, Brazil.,Research Centre in Sports Sciences, Health Sciences and Human Development, CIDESD, University Institute of Maia, Maia, Portugal
| | - José Pino-Ortega
- BIOVETMED & SPORTSCI Research group, Department of Physical activity and Sport, Faculty of Sport Sciences, University of Murcia, San Javier, Murcia.,Faculty of Sports Sciences, 567872University of Murcia, San Javier, Spain
| |
Collapse
|
20
|
Towards the Use of Big Data in Healthcare: A Literature Review. Healthcare (Basel) 2022; 10:healthcare10071232. [PMID: 35885759 PMCID: PMC9322051 DOI: 10.3390/healthcare10071232] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2022] [Revised: 06/23/2022] [Accepted: 06/29/2022] [Indexed: 12/13/2022] Open
Abstract
The interest in new and more advanced technological solutions is paving the way for the diffusion of innovative and revolutionary applications in healthcare organizations. The application of an artificial intelligence system to medical research has the potential to move toward highly advanced e-Health. This analysis aims to explore the main areas of application of big data in healthcare, as well as the restructuring of the technological infrastructure and the integration of traditional data analytical tools and techniques with an elaborate computational technology that is able to enhance and extract useful information for decision-making. We conducted a literature review using the Scopus database over the period 2010–2020. The article selection process involved five steps: the planning and identification of studies, the evaluation of articles, the extraction of results, the summary, and the dissemination of the audit results. We included 93 documents. Our results suggest that effective and patient-centered care cannot disregard the acquisition, management, and analysis of a huge volume and variety of health data. In this way, an immediate and more effective diagnosis could be possible while maximizing healthcare resources. Deriving the benefits associated with digitization and technological innovation, however, requires the restructuring of traditional operational and strategic processes, and the acquisition of new skills.
Collapse
|
21
|
Viberg Johansson J, Bentzen HB, Mascalzoni D. What ethical approaches are used by scientists when sharing health data? An interview study. BMC Med Ethics 2022; 23:41. [PMID: 35410285 PMCID: PMC9004072 DOI: 10.1186/s12910-022-00779-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Accepted: 04/01/2022] [Indexed: 11/27/2022] Open
Abstract
BACKGROUND Health data-driven activities have become central in diverse fields (research, AI development, wearables, etc.), and new ethical challenges have arisen with regards to privacy, integrity, and appropriateness of use. To ensure the protection of individuals' fundamental rights and freedoms in a changing environment, including their right to the protection of personal data, we aim to identify the ethical approaches adopted by scientists during intensive data exploitation when collecting, using, or sharing peoples' health data. METHODS Twelve scientists who were collecting, using, or sharing health data in different contexts in Sweden, were interviewed. We used systematic expert interviews to access these scientists' specialist knowledge, and analysed the interviews with thematic analysis. Phrases, sentences, or paragraphs through which ethical values and norms were expressed, were identified and coded. Codes that reflected similar concepts were grouped, subcategories were formulated, and categories were connected to traditional ethical approaches. RESULTS Through several examples, the respondents expressed four different ethical approaches, which formed the main conceptual categories: consideration of consequences, respect for rights, procedural compliance, and being professional. CONCLUSIONS To a large extent, the scientists' ethical approaches were consistent with ethical and legal principles. Data sharing was considered important and worth pursuing, even though it is difficult. An awareness of the complex issues involved in data sharing was reflected from different perspectives, and the respondents commonly perceived a general lack of practical procedures that would by default ensure ethical and legally compliant data collection and sharing. We suggest that it is an opportune time to move on from policy discussions to practical technological ethics-by-design solutions that integrate these principles into practice.
Collapse
Affiliation(s)
- Jennifer Viberg Johansson
- Centre for Research Ethics and Bioethics, Department of Public Health and Caring Sciences, Uppsala University, Box 564, 751 22, Uppsala, Sweden.
- Institute for Futures Studies, Stockholm, Sweden.
| | - Heidi Beate Bentzen
- Norwegian Research Center for Computers and Law, Faculty of Law, University of Oslo, Oslo, Norway
| | - Deborah Mascalzoni
- Centre for Research Ethics and Bioethics, Department of Public Health and Caring Sciences, Uppsala University, Box 564, 751 22, Uppsala, Sweden
- Institute of Biomedicine, Eurac Research, Bolzano, Italy
| |
Collapse
|
22
|
Bradley EA, Khan A, McNeal DM, Bravo‐Jaimes K, Khanna A, Cook S, Opotowsky AR, John A, Lee M, Pasquali S, Daniels CJ, Pernick M, Kirkpatrick JN, Gurvitz M. Operational and Ethical Considerations for a National Adult Congenital Heart Disease Database. J Am Heart Assoc 2022; 11:e022338. [PMID: 35301853 PMCID: PMC9075495 DOI: 10.1161/jaha.121.022338] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Accepted: 02/17/2022] [Indexed: 11/18/2022]
Abstract
As more adults survive with congenital heart disease, the need to better understand the long-term complications, and comorbid disease will become increasingly important. Improved care and survival into the early and late adult years for all patients equitably requires accurate, timely, and comprehensive data to support research and quality-based initiatives. National data collection in adult congenital heart disease will require a sound foundation emphasizing core ethical principles that acknowledge patient and clinician perspectives and promote national collaboration. In this document we examine these foundational principles and offer suggestions for developing an ethically responsible and inclusive framework for national ACHD data collection.
Collapse
Affiliation(s)
- Elisa A. Bradley
- The Ohio State University Wexner Medical CenterDorothy M. Davis Heart and Lung Research InstituteColumbusOH
- Division of Cardiovascular MedicineHeart and Vascular InstitutePenn State University College of MedicineHersheyPA
| | - Abigail Khan
- Adult Congenital Heart ProgramKnight Cardiovascular InstituteOregon Health and Science UniversityPortlandOR
- Department of MedicineUniversity of Colorado Anschutz Medical CampusAuroraCO
| | - Demetria M. McNeal
- Department of MedicineUniversity of Colorado Anschutz Medical CampusAuroraCO
| | - Katia Bravo‐Jaimes
- Division of Cardiovascular MedicineUniversity of CaliforniaLos AngelesCA
| | - Amber Khanna
- Department of MedicineUniversity of Colorado Anschutz Medical CampusAuroraCO
| | - Stephen Cook
- Indiana University Health and Riley Children's HospitalIndianapolisIN
| | - Alexander R. Opotowsky
- Department of PediatricsThe Heart InstituteCincinnati Children's HospitalUniversity of Cincinnati College of MedicineCincinnatiOH
| | - Anitha John
- Division of CardiologyChildren's National Health SystemWashingtonDC
| | - Marc Lee
- The Heart Center, Nationwide Children's HospitalColumbusOH
| | - Sara Pasquali
- Department of Pediatric CardiologyUniversity of Michigan and Mott Children's HospitalAnn ArborMI
| | - Curt J. Daniels
- Division of Cardiovascular Medicine & Nationwide Children’s HospitalThe Ohio State University Department of Internal MedicineColumbusOH
| | - Michael Pernick
- Board of Directors MemberAdult Congenital Heart AssociationMediaPA
| | - James N. Kirkpatrick
- University of Washington Heart Institute and Department of Bioethics and HumanitiesSeattleWA
| | | |
Collapse
|
23
|
Data-driven machine learning: A new approach to process and utilize biomedical data. PREDICTIVE MODELING IN BIOMEDICAL DATA MINING AND ANALYSIS 2022. [PMCID: PMC9464259 DOI: 10.1016/b978-0-323-99864-2.00017-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
24
|
Ventura S, Soda P, González AR. EDITORIAL. Comput Intell 2021. [DOI: 10.1111/coin.12493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
| | - Paolo Soda
- University Campus Bio‐Medico di Roma Italy
| | | |
Collapse
|
25
|
Naffaa ME, Hassan F, Golan-Cohen A, Merzon E, Green I, Saab A, Paz Z. Factors associated with drug survival on first biologic therapy in patients with rheumatoid arthritis: a population-based cohort study. Rheumatol Int 2021; 41:1905-1913. [PMID: 34529109 DOI: 10.1007/s00296-021-04989-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Accepted: 09/02/2021] [Indexed: 11/24/2022]
Abstract
Lack of sufficient head-to-head trials comparing biologic disease-modifying antirheumatic drugs (bDMARDs) in rheumatoid arthritis (RA), makes the choice of the first bDMARD a matter of rheumatologist's preference. Longer drug survival on the first bDMARD usually correlates with early remission. We aimed to identify factors associated with longer drug survival. We conducted a population-based retrospective longitudinal cohort study. We identified RA patients using the relevant International Classification of Disease 9th codes. "True" RA patients were defined as patients fulfilling, additionally, at least one of the following: receiving conventional DMARDs (cDMARDs), being positive for rheumatoid factor or anti-cyclic citrullinated peptide, or being diagnosed by a rheumatologist. We compared drug survival times and identified factors associated with longer drug survival. We identified 4268 true RA patients between the years of 2000-2017. 820 patients (19.2%) received at least one bDMARD. The most commonly prescribed bDMARDs were etanercept (352, 42.9%), adalimumab (143, 17.4%), infliximab (142, 17.3%) and tocilizumab (58, 7.1%). Infliximab was associated with the longest drug survival (47.1 months ± 46.3) while golimumab was associated with the shortest drug survival (14.9 months ± 15.1). Male gender [hazard ratio (HR) = 0.76, 95% confidence interval (CI), 0.63-0.86, p = 0.001], concurrent conventional DMARDs use (HR = 0.79, 95% CI 0.68 - 0.98, p = .031) and initiating bDMARD therapy in earlier calendric years (HR = 1.12, 95% CI 1.10 -1.18, p = 0.0001) were associated with longer drug survival. Male gender, concomitant cDMARDs and initiating biologic therapy at earlier calendric years are associated with longer drug survival.
Collapse
Affiliation(s)
- Mohammad E Naffaa
- Rheumatology Unit, Galilee Medical Center, Road 89, Naharyia, Israel. .,Azrieli Faculty of Medicine, Bar-Ilan University, Zefat, Israel.
| | - Fadi Hassan
- Azrieli Faculty of Medicine, Bar-Ilan University, Zefat, Israel.,Internal Medicine "E", Galilee Medical Center, Naharyia, Israel
| | - Avivit Golan-Cohen
- Leumit Health Services, Tel Aviv-Yafo, Israel.,Faculty of Medicine, Tel Aviv University, Tel Aviv-Yafo, Israel
| | | | - Ilan Green
- Leumit Health Services, Tel Aviv-Yafo, Israel.,Faculty of Medicine, Tel Aviv University, Tel Aviv-Yafo, Israel
| | - Amir Saab
- Azrieli Faculty of Medicine, Bar-Ilan University, Zefat, Israel.,Internal Medicine "E", Galilee Medical Center, Naharyia, Israel
| | - Ziv Paz
- Rheumatology Unit, Galilee Medical Center, Road 89, Naharyia, Israel.,Azrieli Faculty of Medicine, Bar-Ilan University, Zefat, Israel
| |
Collapse
|
26
|
Big Data for Biomedical Education with a Focus on the COVID-19 Era: An Integrative Review of the Literature. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18178989. [PMID: 34501581 PMCID: PMC8430694 DOI: 10.3390/ijerph18178989] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2021] [Revised: 08/19/2021] [Accepted: 08/21/2021] [Indexed: 12/02/2022]
Abstract
Medical education refers to education and training delivered to medical students in order to become a practitioner. In recent decades, medicine has been radically transformed by scientific and computational/digital advances—including the introduction of new information and communication technologies, the discovery of DNA, and the birth of genomics and post-genomics super-specialties (transcriptomics, proteomics, interactomics, and metabolomics/metabonomics, among others)—which contribute to the generation of an unprecedented amount of data, so-called ‘big data’. While these are well-studied in fields such as medical research and methodology, translational medicine, and clinical practice, they remain overlooked and understudied in the field of medical education. For this purpose, we carried out an integrative review of the literature. Twenty-nine studies were retrieved and synthesized in the present review. Included studies were published between 2012 and 2021. Eleven studies were performed in North America: specifically, nine were conducted in the USA and two studies in Canada. Six studies were carried out in Europe: two in France, two in Germany, one in Italy, and one in several European countries. One additional study was conducted in China. Eight papers were commentaries/theoretical or perspective articles, while five were designed as a case study. Five investigations exploited large databases and datasets, while five additional studies were surveys. Two papers employed visual data analytical/data mining techniques. Finally, other two papers were technical papers, describing the development of software, computational tools and/or learning environments/platforms, while two additional studies were literature reviews (one of which being systematic and bibliometric).The following nine sub-topics could be identified: (I) knowledge and awareness of big data among medical students; (II) difficulties and challenges in integrating and implementing big data teaching into the medical syllabus; (III) exploiting big data to review, improve and enhance medical school curriculum; (IV) exploiting big data to monitor the effectiveness of web-based learning environments among medical students; (V) exploiting big data to capture the determinants and signatures of successful academic performance and counteract/prevent drop-out; (VI) exploiting big data to promote equity, inclusion, and diversity; (VII) exploiting big data to enhance integrity and ethics, avoiding plagiarism and duplication rate; (VIII) empowering medical students, improving and enhancing medical practice; and, (IX) exploiting big data in continuous medical education and learning. These sub-themes were subsequently grouped in the following four major themes/topics: namely, (I) big data and medical curricula; (II) big data and medical academic performance; (III) big data and societal/bioethical issues in biomedical education; and (IV) big data and medical career. Despite the increasing importance of big data in biomedicine, current medical curricula and syllabuses appear inadequate to prepare future medical professionals and practitioners that can leverage on big data in their daily clinical practice. Challenges in integrating, incorporating, and implementing big data teaching into medical school need to be overcome to facilitate the training of the next generation of medical professionals. Finally, in the present integrative review, state-of-art and future potential uses of big data in the field of biomedical discussion are envisaged, with a focus on the still ongoing “Coronavirus Disease 2019” (COVID-19) pandemic, which has been acting as a catalyst for innovation and digitalization.
Collapse
|
27
|
Muller SHA, Kalkman S, van Thiel GJMW, Mostert M, van Delden JJM. The social licence for data-intensive health research: towards co-creation, public value and trust. BMC Med Ethics 2021; 22:110. [PMID: 34376204 PMCID: PMC8353823 DOI: 10.1186/s12910-021-00677-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Accepted: 08/03/2021] [Indexed: 11/10/2022] Open
Abstract
Background The rise of Big Data-driven health research challenges the assumed contribution of medical research to the public good, raising questions about whether the status of such research as a common good should be taken for granted, and how public trust can be preserved. Scandals arising out of sharing data during medical research have pointed out that going beyond the requirements of law may be necessary for sustaining trust in data-intensive health research. We propose building upon the use of a social licence for achieving such ethical governance. Main text We performed a narrative review of the social licence as presented in the biomedical literature. We used a systematic search and selection process, followed by a critical conceptual analysis. The systematic search resulted in nine publications. Our conceptual analysis aims to clarify how societal permission can be granted to health research projects which rely upon the reuse and/or linkage of health data. These activities may be morally demanding. For these types of activities, a moral legitimation, beyond the limits of law, may need to be sought in order to preserve trust. Our analysis indicates that a social licence encourages us to recognise a broad range of stakeholder interests and perspectives in data-intensive health research. This is especially true for patients contributing data. Incorporating such a practice paves the way towards an ethical governance, based upon trust. Public engagement that involves patients from the start is called for to strengthen this social licence. Conclusions There are several merits to using the concept of social licence as a guideline for ethical governance. Firstly, it fits the novel scale of data-related risks; secondly, it focuses attention on trustworthiness; and finally, it offers co-creation as a way forward. Greater trust can be achieved in the governance of data-intensive health research by highlighting strategic dialogue with both patients contributing the data, and the public in general. This should ultimately contribute to a more ethical practice of governance. Supplementary Information The online version contains supplementary material available at 10.1186/s12910-021-00677-5.
Collapse
Affiliation(s)
- Sam H A Muller
- Department of Medical Humanities, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CX, Utrecht, The Netherlands.
| | - Shona Kalkman
- Department of Medical Humanities, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CX, Utrecht, The Netherlands
| | - Ghislaine J M W van Thiel
- Department of Medical Humanities, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CX, Utrecht, The Netherlands
| | - Menno Mostert
- Department of Medical Humanities, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CX, Utrecht, The Netherlands
| | - Johannes J M van Delden
- Department of Medical Humanities, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CX, Utrecht, The Netherlands
| |
Collapse
|
28
|
Pinzi L, Tinivella A, Gagliardelli L, Beneventano D, Rastelli G. LigAdvisor: a versatile and user-friendly web-platform for drug design. Nucleic Acids Res 2021; 49:W326-W335. [PMID: 34023895 PMCID: PMC8262749 DOI: 10.1093/nar/gkab385] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Revised: 04/19/2021] [Accepted: 04/27/2021] [Indexed: 12/17/2022] Open
Abstract
Although several tools facilitating in silico drug design are available, their results are usually difficult to integrate with publicly available information or require further processing to be fully exploited. The rational design of multi-target ligands (polypharmacology) and the repositioning of known drugs towards unmet therapeutic needs (drug repurposing) have raised increasing attention in drug discovery, although they usually require careful planning of tailored drug design strategies. Computational tools and data-driven approaches can help to reveal novel valuable opportunities in these contexts, as they enable to efficiently mine publicly available chemical, biological, clinical, and disease-related data. Based on these premises, we developed LigAdvisor, a data-driven webserver which integrates information reported in DrugBank, Protein Data Bank, UniProt, Clinical Trials and Therapeutic Target Database into an intuitive platform, to facilitate drug discovery tasks as drug repurposing, polypharmacology, target fishing and profiling. As designed, LigAdvisor enables easy integration of similarity estimation results with clinical data, thereby allowing a more efficient exploitation of information in different drug discovery contexts. Users can also develop customizable drug design tasks on their own molecules, by means of ligand- and target-based search modes, and download their results. LigAdvisor is publicly available at https://ligadvisor.unimore.it/.
Collapse
Affiliation(s)
- Luca Pinzi
- Department of Life Sciences, University of Modena and Reggio Emilia, Modena 41125, Italy
| | - Annachiara Tinivella
- Department of Life Sciences, University of Modena and Reggio Emilia, Modena 41125, Italy.,Clinical and Experimental Medicine, PhD Program, University of Modena and Reggio Emilia, Modena 41125, Italy
| | - Luca Gagliardelli
- Department of Engineering "Enzo Ferrari", University of Modena and Reggio Emilia, Modena 41125, Italy
| | - Domenico Beneventano
- Department of Engineering "Enzo Ferrari", University of Modena and Reggio Emilia, Modena 41125, Italy
| | - Giulio Rastelli
- Department of Life Sciences, University of Modena and Reggio Emilia, Modena 41125, Italy
| |
Collapse
|
29
|
Leite ML, de Loiola Costa LS, Cunha VA, Kreniski V, de Oliveira Braga Filho M, da Cunha NB, Costa FF. Artificial intelligence and the future of life sciences. Drug Discov Today 2021; 26:2515-2526. [PMID: 34245910 DOI: 10.1016/j.drudis.2021.07.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 05/12/2021] [Accepted: 07/01/2021] [Indexed: 12/23/2022]
Abstract
Over the past few decades, the number of health and 'omics-related data' generated and stored has grown exponentially. Patient information can be collected in real time and explored using various artificial intelligence (AI) tools in clinical trials; mobile devices can also be used to improve aspects of both the diagnosis and treatment of diseases. In addition, AI can be used in the development of new drugs or for drug repurposing, in faster diagnosis and more efficient treatment for various diseases, as well as to identify data-driven hypotheses for scientists. In this review, we discuss how AI is starting to revolutionize the life sciences sector.
Collapse
Affiliation(s)
- Michel L Leite
- Genomic Sciences and Biotechnology Program, Universidade Católica de Brasília SGAN 916 Modulo B, Bloco C, 70.790-160, Brasília, DF, Brazil; Department of Molecular Biology, Biological Sciences Institute, University of Brasília, Campus Darcy Ribeiro, Block K, 70.790-900, Brasilia, Federal District, Brazil
| | - Lorena S de Loiola Costa
- Genomic Sciences and Biotechnology Program, Universidade Católica de Brasília SGAN 916 Modulo B, Bloco C, 70.790-160, Brasília, DF, Brazil
| | - Victor A Cunha
- Genomic Sciences and Biotechnology Program, Universidade Católica de Brasília SGAN 916 Modulo B, Bloco C, 70.790-160, Brasília, DF, Brazil
| | - Victor Kreniski
- Apple Developer Academy, Universidade Católica de Brasília, Brasilia, Brazil
| | | | - Nicolau B da Cunha
- Genomic Sciences and Biotechnology Program, Universidade Católica de Brasília SGAN 916 Modulo B, Bloco C, 70.790-160, Brasília, DF, Brazil
| | - Fabricio F Costa
- Genomic Sciences and Biotechnology Program, Universidade Católica de Brasília SGAN 916 Modulo B, Bloco C, 70.790-160, Brasília, DF, Brazil; Apple Developer Academy, Universidade Católica de Brasília, Brasilia, Brazil; Cancer Biology and Epigenomics Program, Ann & Robert H Lurie Children's Hospital of Chicago Research Center and Northwestern University's Feinberg School of Medicine, 2430 N. Halsted St, Box 220, Chicago, IL 60614, USA; MATTER Chicago, 222 W. Merchandise Mart Plaza, Suite 12th Floor, Chicago, IL 60654, USA; Genomic Enterprise, San Diego, CA 92008, USA; Genomic Enterprise, New York, NY 11581, USA.
| |
Collapse
|
30
|
Training Design, Performance Analysis, and Talent Identification-A Systematic Review about the Most Relevant Variables through the Principal Component Analysis in Soccer, Basketball, and Rugby. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18052642. [PMID: 33807971 PMCID: PMC7967544 DOI: 10.3390/ijerph18052642] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 03/01/2021] [Accepted: 03/02/2021] [Indexed: 12/19/2022]
Abstract
Since the accelerating development of technology applied to team sports and its subsequent high amount of information available, the need for data mining leads to the use of data reduction techniques such as Principal Component Analysis (PCA). This systematic review aims to identify determinant variables in soccer, basketball and rugby using exploratory factor analysis for, training design, performance analysis and talent identification. Three electronic databases (PubMed, Web of Science, SPORTDiscus) were systematically searched and 34 studies were finally included in the qualitative synthesis. Through PCA, data sets were reduced by 75.07%, and 3.9 ± 2.53 factors were retained that explained 80 ± 0.14% of the total variance. All team sports should be analyzed or trained based on the high level of aerobic capacity combined with adequate levels of power and strength to perform repeated high-intensity actions in a very short time, which differ between team sports. Accelerations and decelerations are mainly significant in soccer, jumps and landings are crucial in basketball, and impacts are primarily identified in rugby. Besides, from these team sports, primary information about different technical/tactical variables was extracted such as (a) soccer: occupied space, ball controls, passes, and shots; (b) basketball: throws, rebounds, and turnovers; or (c) rugby: possession game pace and team formation. Regarding talent identification, both anthropometrics and some physical capacity measures are relevant in soccer and basketball. Although overall, since these variables have been identified in different investigations, further studies should perform PCA on data sets that involve variables from different dimensions (technical, tactical, conditional).
Collapse
|
31
|
Gomez-Valades A, Martinez-Tomas R, Rincon M. Integrative Base Ontology for the Research Analysis of Alzheimer's Disease-Related Mild Cognitive Impairment. Front Neuroinform 2021; 15:561691. [PMID: 33613222 PMCID: PMC7889797 DOI: 10.3389/fninf.2021.561691] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Accepted: 01/12/2021] [Indexed: 11/13/2022] Open
Abstract
Early detection of mild cognitive impairment (MCI) has become a priority in Alzheimer's disease (AD) research, as it is a transitional phase between normal aging and dementia. However, information on MCI and AD is scattered across different formats and standards generated by different technologies, making it difficult to work with them manually. Ontologies have emerged as a solution to this problem due to their capacity for homogenization and consensus in the representation and reuse of data. In this context, an ontology that integrates the four main domains of neurodegenerative diseases, diagnostic tests, cognitive functions, and brain areas will be of great use in research. Here, we introduce the first approach to this ontology, the Neurocognitive Integrated Ontology (NIO), which integrates the knowledge regarding neuropsychological tests (NT), AD, cognitive functions, and brain areas. This ontology enables interoperability and facilitates access to data by integrating dispersed knowledge across different disciplines, rendering it useful for other research groups. To ensure the stability and reusability of NIO, the ontology was developed following the ontology-building life cycle, integrating and expanding terms from four different reference ontologies. The usefulness of this ontology was validated through use-case scenarios.
Collapse
Affiliation(s)
- Alba Gomez-Valades
- Department of Artificial Intelligence, Universidad Nacional de Educación a Distancia (UNED) Madrid, Spain
| | - Rafael Martinez-Tomas
- Department of Artificial Intelligence, Universidad Nacional de Educación a Distancia (UNED) Madrid, Spain
| | - Mariano Rincon
- Department of Artificial Intelligence, Universidad Nacional de Educación a Distancia (UNED) Madrid, Spain
| |
Collapse
|
32
|
Adaptive Enterprise Architecture for the Digital Healthcare Industry: A Digital Platform for Drug Development. INFORMATION 2021. [DOI: 10.3390/info12020067] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Enterprise architecture (EA) is useful for effectively structuring digital platforms with digital transformation in information societies. Moreover, digital platforms in the healthcare industry accelerate and increase the efficiency of drug discovery and development processes. However, there is the lack of knowledge concerning relationships between EA and digital platforms, in spite of the needs of it. In this paper, we investigated and analyzed the process of drug design and development within the healthcare industry, together with related work in using an enterprise architecture framework for the digital era named the Adaptive Integrated Digital Architecture Framework (AIDAF), specifically supporting the design of digital platforms there. Based on this analysis, we evaluate a method and propose a new reference architecture for promoting digital platforms in the healthcare industry, with future specific aspects of them making effective use of Artificial Intelligence (AI). The practical and theoretical contributions include: (1) Streamlined processes through digital platforms in organizations. (2) Informal knowledge supply and sharing among organizational members through digital platforms. (3) Efficiency and effectiveness in planning production and business for drug development. The findings indicate that EA with digital platforms using the AIDAF contribute to digital transformation with effectiveness for new drugs in the healthcare industry.
Collapse
|
33
|
Anđelić N, Baressi Šegota S, Lorencin I, Mrzljak V, Car Z. Estimation of COVID-19 epidemic curves using genetic programming algorithm. Health Informatics J 2021; 27:1460458220976728. [PMID: 33459107 DOI: 10.1177/1460458220976728] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
This paper investigates the possibility of the implementation of Genetic Programming (GP) algorithm on a publicly available COVID-19 data set, in order to obtain mathematical models which could be used for estimation of confirmed, deceased, and recovered cases and the estimation of epidemiology curve for specific countries, with a high number of cases, such as China, Italy, Spain, and USA and as well as on the global scale. The conducted investigation shows that the best mathematical models produced for estimating confirmed and deceased cases achieved R2 scores of 0.999, while the models developed for estimation of recovered cases achieved the R2 score of 0.998. The equations generated for confirmed, deceased, and recovered cases were combined in order to estimate the epidemiology curve of specific countries and on the global scale. The estimated epidemiology curve for each country obtained from these equations is almost identical to the real data contained within the data set.
Collapse
Affiliation(s)
- Nikola Anđelić
- University of Rijeka, Faculty of Engineering, Rijeka, Croatia
| | | | - Ivan Lorencin
- University of Rijeka, Faculty of Engineering, Rijeka, Croatia
| | - Vedran Mrzljak
- University of Rijeka, Faculty of Engineering, Rijeka, Croatia
| | - Zlatan Car
- University of Rijeka, Faculty of Engineering, Rijeka, Croatia
| |
Collapse
|
34
|
Sumon TA, Hussain MA, Hasan MT, Hasan M, Jang WJ, Bhuiya EH, Chowdhury AAM, Sharifuzzaman SM, Brown CL, Kwon HJ, Lee EW. A Revisit to the Research Updates of Drugs, Vaccines, and Bioinformatics Approaches in Combating COVID-19 Pandemic. Front Mol Biosci 2021; 7:585899. [PMID: 33569389 PMCID: PMC7868442 DOI: 10.3389/fmolb.2020.585899] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Accepted: 12/17/2020] [Indexed: 12/19/2022] Open
Abstract
A new strain of coronavirus severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) responsible for the coronavirus disease 2019 (COVID-19) pandemic was first detected in the city of Wuhan in Hubei province, China in late December 2019. To date, more than 1 million deaths and nearly 57 million confirmed cases have been recorded across 220 countries due to COVID-19, which is the greatest threat to global public health in our time. Although SARS-CoV-2 is genetically similar to other coronaviruses, i.e., SARS and Middle East respiratory syndrome coronavirus (MERS-CoV), no confirmed therapeutics are yet available against COVID-19, and governments, scientists, and pharmaceutical companies worldwide are working together in search for effective drugs and vaccines. Repurposing of relevant therapies, developing vaccines, and using bioinformatics to identify potential drug targets are strongly in focus to combat COVID-19. This review deals with the pathogenesis of COVID-19 and its clinical symptoms in humans including the most recent updates on candidate drugs and vaccines. Potential drugs (remdesivir, hydroxychloroquine, azithromycin, dexamethasone) and vaccines [mRNA-1273; measles, mumps and rubella (MMR), bacille Calmette-Guérin (BCG)] in human clinical trials are discussed with their composition, dosage, mode of action, and possible release dates according to the trial register of US National Library of Medicines (clinicaltrials.gov), European Union (clinicaltrialsregister.eu), and Chinese Clinical Trial Registry (chictr.org.cn) website. Moreover, recent reports on in silico approaches like molecular docking, molecular dynamics simulations, network-based identification, and homology modeling are included, toward repurposing strategies for the use of already approved drugs against newly emerged pathogens. Limitations of effectiveness, side effects, and safety issues of each approach are also highlighted. This review should be useful for the researchers working to find out an effective strategy for defeating SARS-CoV-2.
Collapse
Affiliation(s)
- Tofael Ahmed Sumon
- Department of Fish Health Management, Sylhet Agricultural University, Sylhet, Bangladesh
| | - Md. Ashraf Hussain
- Department of Fisheries Technology and Quality Control, Sylhet Agricultural University, Sylhet, Bangladesh
| | - Md. Tawheed Hasan
- Department of Aquaculture, Sylhet Agricultural University, Sylhet, Bangladesh
| | - Mahmudul Hasan
- Department of Pharmaceuticals and Industrial Biotechnology, Sylhet Agricultural University, Sylhet, Bangladesh
| | - Won Je Jang
- Department of Biotechnology, Pukyong National University, Busan, South Korea
| | | | | | - S. M. Sharifuzzaman
- Institute of Marine Sciences, University of Chittagong, Chittagong, Bangladesh
| | - Christopher Lyon Brown
- World Fisheries University Pilot Programme, Pukyong National University, Busan, South Korea
| | - Hyun-Ju Kwon
- Biopharmaceutical Engineering Major, Division of Applied Bioengineering, Dong-Eui University, Busan, South Korea
| | - Eun-Woo Lee
- Biopharmaceutical Engineering Major, Division of Applied Bioengineering, Dong-Eui University, Busan, South Korea
| |
Collapse
|
35
|
|
36
|
Adeowo FY, Lawal MM, Kumalo HM. Design and Development of Cholinesterase Dual Inhibitors towards Alzheimer's Disease Treatment: A Focus on Recent Contributions from Computational and Theoretical Perspective. ChemistrySelect 2020. [DOI: 10.1002/slct.202003573] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Fatima Y. Adeowo
- Discipline of Medical Biochemistry School of Laboratory Medicine and Medical Science University of KwaZulu-Natal Durban 4001 South Africa
| | - Monsurat M. Lawal
- Discipline of Medical Biochemistry School of Laboratory Medicine and Medical Science University of KwaZulu-Natal Durban 4001 South Africa
| | - Hezekiel M. Kumalo
- Discipline of Medical Biochemistry School of Laboratory Medicine and Medical Science University of KwaZulu-Natal Durban 4001 South Africa
| |
Collapse
|
37
|
Rojas-Valverde D, Pino-Ortega J, Gómez-Carmona CD, Rico-González M. A Systematic Review of Methods and Criteria Standard Proposal for the Use of Principal Component Analysis in Team's Sports Science. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:ijerph17238712. [PMID: 33255212 PMCID: PMC7727687 DOI: 10.3390/ijerph17238712] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Revised: 11/19/2020] [Accepted: 11/21/2020] [Indexed: 12/11/2022]
Abstract
The availability of critical information about training and competition is fundamental on performance. Principal components analysis (PCA) is widely used in sports as a multivariate technique to manage big data from different technological assessments. This systematic review aimed to explore the methods reported and statistical criteria used in team's sports science and to propose a criteria standard to report PCA in further applications. A systematic electronic search was developed through four electronic databases and a total of 45 studies were included in the review for final analysis. Inclusion criteria: (i) of the studies we looked at, 22.22% performed factorability processes with different retention criteria (r > 0.4-0.7); (ii) 21 studies confirmed sample adequacy using Kaiser-Meyer-Olkim (KMO > 5-8) and 22 reported Bartlett's sphericity; (iii) factor retention was considered if eigenvalues >1-1.5 (n = 29); (iv) 23 studies reported loading retention (>0.4-0.7); and (v) used VariMax as the rotation method (48.9%). A lack of consistency and serious voids in reporting of essential methodological information was found. Twenty-one items were selected to provide a standard quality criterion to report methods sections when using PCA. These evidence-based criteria will lead to a better understanding and applicability of the results and future study replications.
Collapse
Affiliation(s)
- Daniel Rojas-Valverde
- Centro de Investigación y Diagnóstico en Salud y Deporte (CIDISAD), Escuela de Ciencias del Movimiento Humano y Calidad de Vida (CIEMHCAVI), Universidad Nacional, Heredia 86-3000, Costa Rica
- Grupo de Avances en el Entrenamiento Deportivo y Acondicionamiento Físico (GAEDAF), Facultad Ciencias del Deporte, Universidad de Extremadura, 10071 Cáceres, Spain
- Correspondence: (D.R.-V.); (J.P.-O.); or (M.R.-G.)
| | - José Pino-Ortega
- Department of Physical Activity and Sport Sciences, International Excellence Campus “Mare Nostrum”, Faculty of Sports Sciences, University of Murcia, 30720 San Javier, Spain
- Biovetmed & Sportsci Research Group, University of Murcia, 30100 Murcia, Spain
- Correspondence: (D.R.-V.); (J.P.-O.); or (M.R.-G.)
| | - Carlos D. Gómez-Carmona
- Research Group in Optimization of Training and Sports Performance (GOERD), Department of Didactics of Music, Plastic and Body Expression, Sports Science Faculty, University of Extremadura, 10071 Caceres, Spain;
| | - Markel Rico-González
- Biovetmed & Sportsci Research Group, University of Murcia, 30100 Murcia, Spain
- Departament of Physical Education and Sport, University of the Basque Country, UPV-EHU, Lasarte 71, 01007 Vitoria-Gasteiz, Spain
- Correspondence: (D.R.-V.); (J.P.-O.); or (M.R.-G.)
| |
Collapse
|
38
|
DMAKit: A user-friendly web platform for bringing state-of-the-art data analysis techniques to non-specific users. INFORM SYST 2020. [DOI: 10.1016/j.is.2020.101557] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
39
|
Abuín JM, Lopes N, Ferreira L, Pena TF, Schmidt B. Big Data in metagenomics: Apache Spark vs MPI. PLoS One 2020; 15:e0239741. [PMID: 33022000 PMCID: PMC7537910 DOI: 10.1371/journal.pone.0239741] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Accepted: 09/14/2020] [Indexed: 11/23/2022] Open
Abstract
The progress of next-generation sequencing has lead to the availability of massive data sets used by a wide range of applications in biology and medicine. This has sparked significant interest in using modern Big Data technologies to process this large amount of information in distributed memory clusters of commodity hardware. Several approaches based on solutions such as Apache Hadoop or Apache Spark, have been proposed. These solutions allow developers to focus on the problem while the need to deal with low level details, such as data distribution schemes or communication patterns among processing nodes, can be ignored. However, performance and scalability are also of high importance when dealing with increasing problems sizes, making in this way the usage of High Performance Computing (HPC) technologies such as the message passing interface (MPI) a promising alternative. Recently, MetaCacheSpark, an Apache Spark based software for detection and quantification of species composition in food samples has been proposed. This tool can be used to analyze high throughput sequencing data sets of metagenomic DNA and allows for dealing with large-scale collections of complex eukaryotic and bacterial reference genome. In this work, we propose MetaCache-MPI, a fast and memory efficient solution for computing clusters which is based on MPI instead of Apache Spark. In order to evaluate its performance a comparison is performed between the original single CPU version of MetaCache, the Spark version and the MPI version we are introducing. Results show that for 32 processes, MetaCache-MPI is 1.65× faster while consuming 48.12% of the RAM memory used by Spark for building a metagenomics database. For querying this database, also with 32 processes, the MPI version is 3.11× faster, while using 55.56% of the memory used by Spark. We conclude that the new MetaCache-MPI version is faster in both building and querying the database and uses less RAM memory, when compared with MetaCacheSpark, while keeping the accuracy of the original implementation.
Collapse
Affiliation(s)
- José M. Abuín
- 2Ai—School of Technology, IPCA, Barcelos, Portugal
- CiTIUS, Universidade de Santiago de Compostela, Santiago de Compostela, Spain
- * E-mail:
| | - Nuno Lopes
- 2Ai—School of Technology, IPCA, Barcelos, Portugal
| | | | - Tomás F. Pena
- CiTIUS, Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - Bertil Schmidt
- Department of Computer Science, Johannes Gutenberg University, Mainz, Germany
| |
Collapse
|
40
|
Six Dijkstra MWMC, Siebrand E, Dorrestijn S, Salomons EL, Reneman MF, Oosterveld FGJ, Soer R, Gross DP, Bieleman HJ. Ethical Considerations of Using Machine Learning for Decision Support in Occupational Health: An Example Involving Periodic Workers' Health Assessments. JOURNAL OF OCCUPATIONAL REHABILITATION 2020; 30:343-353. [PMID: 32500471 PMCID: PMC7406529 DOI: 10.1007/s10926-020-09895-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Purpose Computer algorithms and Machine Learning (ML) will be integrated into clinical decision support within occupational health care. This will change the interaction between health care professionals and their clients, with unknown consequences. The aim of this study was to explore ethical considerations and potential consequences of using ML based decision support tools (DSTs) in the context of occupational health. Methods We conducted an ethical deliberation. This was supported by a narrative literature review of publications about ML and DSTs in occupational health and by an assessment of the potential impact of ML-DSTs according to frameworks from medical ethics and philosophy of technology. We introduce a hypothetical clinical scenario from a workers' health assessment to reflect on biomedical ethical principles: respect for autonomy, beneficence, non-maleficence and justice. Results Respect for autonomy is affected by uncertainty about what future consequences the worker is consenting to as a result of the fluctuating nature of ML-DSTs and validity evidence used to inform the worker. A beneficent advisory process is influenced because the three elements of evidence based practice are affected through use of a ML-DST. The principle of non-maleficence is challenged by the balance between group-level benefits and individual harm, the vulnerability of the worker in the occupational context, and the possibility of function creep. Justice might be empowered when the ML-DST is valid, but profiling and discrimination are potential risks. Conclusions Implications of ethical considerations have been described for the socially responsible design of ML-DSTs. Three recommendations were provided to minimize undesirable adverse effects of the development and implementation of ML-DSTs.
Collapse
Affiliation(s)
- Marianne W M C Six Dijkstra
- School of Health, Saxion University of Applied Sciences/AGZ, M.H. Tromplaan 28, 7500 KB, Enschede, The Netherlands.
- Department of Rehabilitation Medicine, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands.
- University of Groningen, Groningen, The Netherlands.
| | - Egbert Siebrand
- Research Group Ethics & Technology, Saxion University of Applied Sciences, Enschede, The Netherlands
| | - Steven Dorrestijn
- Research Group Ethics & Technology, Saxion University of Applied Sciences, Enschede, The Netherlands
| | - Etto L Salomons
- School of Ambient Intelligence, Saxion University of Applied Sciences, Enschede, The Netherlands
| | - Michiel F Reneman
- Department of Rehabilitation Medicine, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Frits G J Oosterveld
- School of Health, Saxion University of Applied Sciences/AGZ, M.H. Tromplaan 28, 7500 KB, Enschede, The Netherlands
| | - Remko Soer
- School of Health, Saxion University of Applied Sciences/AGZ, M.H. Tromplaan 28, 7500 KB, Enschede, The Netherlands
- University Medical Center Groningen, Pain Centre, University of Groningen, Groningen, The Netherlands
| | - Douglas P Gross
- Department of Physical Therapy, University of Alberta, Edmonton, Canada
| | - Hendrik J Bieleman
- School of Health, Saxion University of Applied Sciences/AGZ, M.H. Tromplaan 28, 7500 KB, Enschede, The Netherlands
| |
Collapse
|
41
|
Mirchev M, Mircheva I, Kerekovska A. The Academic Viewpoint on Patient Data Ownership in the Context of Big Data: Scoping Review. J Med Internet Res 2020; 22:e22214. [PMID: 32808934 PMCID: PMC7463395 DOI: 10.2196/22214] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 07/24/2020] [Accepted: 07/26/2020] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND The ownership of patient information in the context of big data is a relatively new problem, which is not yet fully recognized by the medical academic community. The problem is interdisciplinary, incorporating legal, ethical, medical, and aspects of information and communication technologies, requiring a sophisticated analysis. However, no previous scoping review has mapped existing studies on the subject. OBJECTIVE This study aims to map and assess published studies on patient data ownership in the context of big data as viewed by the academic community. METHODS A scoping review was conducted based on the 5-stage framework outlined by Arksey and O'Malley and further developed by Levac, Colquhoun, and O'Brien. The organization and reporting of results of the scoping review were conducted according to PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses and its extensions for Scoping Reviews). A systematic and comprehensive search of 4 scientific information databases, PubMed, ScienceDirect, Scopus, and Springer, was performed for studies published between January 2000 and October 2019. Two authors independently assessed the eligibility of the studies and the extracted data. RESULTS The review included 32 eligible articles authored by academicians that correspond to 3 focus areas: problem (ownership), area (health care), and context (big data). Five major aspects were studied: the scientific area of publications, aspects and academicians' perception of ownership in the context of big data, proposed solutions, and practical applications for data ownership issues in the context of big data. The aspects in which publications consider ownership of medical data are not clearly distinguished but can be summarized as ethical, legal, political, and managerial. The ownership of patient data is perceived primarily as a challenge fundamental to conducting medical research, including data sales and sharing, and to a lesser degree as a means of control, problem, threat, and opportunity also in view of medical research. Although numerous solutions falling into 3 categories, technology, law, and policy, were proposed, only 3 real applications were discussed. CONCLUSIONS The issue of ownership of patient information in the context of big data is poorly researched; it is not addressed consistently and in its integrity, and there is no consensus on policy decisions and the necessary legal regulations. Future research should investigate the issue of ownership as a core research question and not as a minor fragment among other topics. More research is needed to increase the body of knowledge regarding the development of adequate policies and relevant legal frameworks in compliance with ethical standards. The combined efforts of multidisciplinary academic teams are needed to overcome existing gaps in the perception of ownership, the aspects of ownership, and the possible solutions to patient data ownership issues in the reality of big data.
Collapse
Affiliation(s)
- Martin Mirchev
- Department of Social Medicine and Healthcare Organization, Faculty of Public Health, Medical University of Varna, Varna, Bulgaria
| | - Iskra Mircheva
- Department of Social Medicine and Healthcare Organization, Faculty of Public Health, Medical University of Varna, Varna, Bulgaria
| | - Albena Kerekovska
- Department of Social Medicine and Healthcare Organization, Faculty of Public Health, Medical University of Varna, Varna, Bulgaria
| |
Collapse
|
42
|
Leite ML, Oliveira KBS, Cunha VA, Dias SC, da Cunha NB, Costa FF. Epigenetic Therapies in the Precision Medicine Era. ADVANCED THERAPEUTICS 2020. [DOI: 10.1002/adtp.201900184] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Affiliation(s)
- Michel Lopes Leite
- Genomic Sciences and Biotechnology Program UCB ‐ Brasilia, SgAN 916, Modulo B, Bloco C, 70790‐160 Brasília DF Brazil
| | | | - Victor Albuquerque Cunha
- Genomic Sciences and Biotechnology Program UCB ‐ Brasilia, SgAN 916, Modulo B, Bloco C, 70790‐160 Brasília DF Brazil
| | - Simoni Campos Dias
- Genomic Sciences and Biotechnology Program UCB ‐ Brasilia, SgAN 916, Modulo B, Bloco C, 70790‐160 Brasília DF Brazil
- Animal Biology DepartmentUniversidade de Brasília UnB, Campus Darcy Ribeiro. Brasilia DF 70910‐900 Brazil
| | - Nicolau Brito da Cunha
- Genomic Sciences and Biotechnology Program UCB ‐ Brasilia, SgAN 916, Modulo B, Bloco C, 70790‐160 Brasília DF Brazil
| | - Fabricio F. Costa
- Cancer Biology and Epigenomics ProgramAnn & Robert H Lurie Children's Hospital of Chicago Research Center, Northwestern University's Feinberg School of Medicine 2430 N. Halsted St., Box 220 Chicago IL 60611 USA
- Northwestern University's Feinberg School of Medicine 2430 N. Halsted St., Box 220 Chicago IL 60611 USA
- MATTER Chicago 222 W. Merchandise Mart Plaza, Suite 12th Floor Chicago IL 60654 USA
- Genomic Enterprise (www.genomicenterprise.com) San Diego, CA 92008 and New York NY 11581 USA
| |
Collapse
|
43
|
Medina-Ortiz D, Contreras S, Quiroz C, Olivera-Nappa Á. Development of Supervised Learning Predictive Models for Highly Non-linear Biological, Biomedical, and General Datasets. Front Mol Biosci 2020; 7:13. [PMID: 32118039 PMCID: PMC7031350 DOI: 10.3389/fmolb.2020.00013] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 01/22/2020] [Indexed: 11/13/2022] Open
Abstract
In highly non-linear datasets, attributes or features do not allow readily finding visual patterns for identifying common underlying behaviors. Therefore, it is not possible to achieve classification or regression using linear or mildly non-linear hyperspace partition functions. Hence, supervised learning models based on the application of most existing algorithms are limited, and their performance metrics are low. Linear transformations of variables, such as principal components analysis, cannot avoid the problem, and even models based on artificial neural networks and deep learning are unable to improve the metrics. Sometimes, even when features allow classification or regression in reported cases, performance metrics of supervised learning algorithms remain unsatisfyingly low. This problem is recurrent in many areas of study as, per example, the clinical, biotechnological, and protein engineering areas, where many of the attributes are correlated in an unknown and very non-linear fashion or are categorical and difficult to relate to a target response variable. In such areas, being able to create predictive models would dramatically impact the quality of their outcomes, generating an immediate added value for both the scientific and general public. In this manuscript, we present RV-Clustering, a library of unsupervised learning algorithms, and a new methodology designed to find optimum partitions within highly non-linear datasets that allow deconvoluting variables and notoriously improving performance metrics in supervised learning classification or regression models. The partitions obtained are statistically cross-validated, ensuring correct representativity and no over-fitting. We have successfully tested RV-Clustering in several highly non-linear datasets with different origins. The approach herein proposed has generated classification and regression models with high-performance metrics, which further supports its ability to generate predictive models for highly non-linear datasets. Advantageously, the method does not require significant human input, which guarantees a higher usability in the biological, biomedical, and protein engineering community with no specific knowledge in the machine learning area.
Collapse
Affiliation(s)
- David Medina-Ortiz
- Departamento de Ingeniería Química, Biotecnología y Materiales, Facultad de Ciencias Físicas y Matemáticas, Universidad de Chile, Santiago, Chile.,Centre for Biotechnology and Bioengineering, Universidad de Chile, Santiago, Chile
| | - Sebastián Contreras
- Centre for Biotechnology and Bioengineering, Universidad de Chile, Santiago, Chile
| | - Cristofer Quiroz
- Facultad de Ingeniería, Universidad Autónoma de Chile, Talca, Chile
| | - Álvaro Olivera-Nappa
- Departamento de Ingeniería Química, Biotecnología y Materiales, Facultad de Ciencias Físicas y Matemáticas, Universidad de Chile, Santiago, Chile.,Centre for Biotechnology and Bioengineering, Universidad de Chile, Santiago, Chile
| |
Collapse
|
44
|
Probst D, Reymond JL. Visualization of very large high-dimensional data sets as minimum spanning trees. J Cheminform 2020; 12:12. [PMID: 33431043 PMCID: PMC7015965 DOI: 10.1186/s13321-020-0416-x] [Citation(s) in RCA: 128] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 02/04/2020] [Indexed: 01/10/2023] Open
Abstract
The chemical sciences are producing an unprecedented amount of large, high-dimensional data sets containing chemical structures and associated properties. However, there are currently no algorithms to visualize such data while preserving both global and local features with a sufficient level of detail to allow for human inspection and interpretation. Here, we propose a solution to this problem with a new data visualization method, TMAP, capable of representing data sets of up to millions of data points and arbitrary high dimensionality as a two-dimensional tree (http://tmap.gdb.tools). Visualizations based on TMAP are better suited than t-SNE or UMAP for the exploration and interpretation of large data sets due to their tree-like nature, increased local and global neighborhood and structure preservation, and the transparency of the methods the algorithm is based on. We apply TMAP to the most used chemistry data sets including databases of molecules such as ChEMBL, FDB17, the Natural Products Atlas, DSSTox, as well as to the MoleculeNet benchmark collection of data sets. We also show its broad applicability with further examples from biology, particle physics, and literature.![]()
Collapse
Affiliation(s)
- Daniel Probst
- Department of Chemistry and Biochemistry, University of Bern, Freiestrasse 3, 3012, Bern, Switzerland.
| | - Jean-Louis Reymond
- Department of Chemistry and Biochemistry, University of Bern, Freiestrasse 3, 3012, Bern, Switzerland.
| |
Collapse
|
45
|
Picache J, May JC, McLean JA. Crowd-Sourced Chemistry: Considerations for Building a Standardized Database to Improve Omic Analyses. ACS OMEGA 2020; 5:980-985. [PMID: 31984253 PMCID: PMC6977078 DOI: 10.1021/acsomega.9b03708] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Accepted: 12/24/2019] [Indexed: 05/09/2023]
Abstract
Mass spectrometry (MS) is used in multiple omics disciplines to generate large collections of data. This data enables advancements in biomedical research by providing global profiles of a given system. One of the main barriers to generating these profiles is the inability to accurately annotate omics data, especially small molecules. To complement pre-existing large databases that are not quite complete, research groups devote efforts to generating personal libraries to annotate their data. Scientific progress is impeded during the generation of these personal libraries because the data contained within them is often redundant and/or incompatible with other databases. To overcome these redundancies and incompatibilities, we propose that communal, crowd-sourced databases be curated in a standardized fashion. A small number of groups have shown this model is feasible and successful. While the needs of a specific field will dictate the functionality of a communal database, we discuss some features to consider during database development. Special emphasis is made on standardization of terminology, documentation, format, reference materials, and quality assurance practices. These standardization procedures enable a field to have higher confidence in the quality of the data within a given database. We also discuss the three conceptual pillars of database design as well as how crowd-sourcing is practiced. Generating open-source databases requires front-end effort, but the result is a well curated, high quality data set that all can use. Having a resource such as this fosters collaboration and scientific advancement.
Collapse
Affiliation(s)
- Jaqueline
A. Picache
- Department of Chemistry,
Center for Innovative Technology, Vanderbilt Institute of Chemical
Biology, Vanderbilt Institute for Integrative Biosystems Research
and Education, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Jody C. May
- Department of Chemistry,
Center for Innovative Technology, Vanderbilt Institute of Chemical
Biology, Vanderbilt Institute for Integrative Biosystems Research
and Education, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - John A. McLean
- Department of Chemistry,
Center for Innovative Technology, Vanderbilt Institute of Chemical
Biology, Vanderbilt Institute for Integrative Biosystems Research
and Education, Vanderbilt University, Nashville, Tennessee 37235, United States
| |
Collapse
|
46
|
Evolution of mHealth Eco-System: A Step Towards Personalized Medicine. ADVANCES IN INTELLIGENT SYSTEMS AND COMPUTING 2020. [DOI: 10.1007/978-981-15-1286-5_30] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
47
|
Martani A, Geneviève LD, Pauli-Magnus C, McLennan S, Elger BS. Regulating the Secondary Use of Data for Research: Arguments Against Genetic Exceptionalism. Front Genet 2019; 10:1254. [PMID: 31956328 PMCID: PMC6951399 DOI: 10.3389/fgene.2019.01254] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Accepted: 11/14/2019] [Indexed: 12/02/2022] Open
Abstract
As accessing, collecting, and storing personal information become increasingly easier, the secondary use of data has the potential to make healthcare research more cost and time effective. The widespread reuse of data, however, raises important ethical and policy issues, especially because of the sensitive nature of genetic and health-related information. Regulation is thus crucial to determine the conditions upon which data can be reused. In this respect, the question emerges whether it is appropriate to endorse genetic exceptionalism and grant genetic data an exceptional status with respect to secondary use requirements. Using Swiss law as a case study, it is argued that genetic exceptionalism in secondary use regulation is not justified for three reasons. First, although genetic data have particular features, also other non-genetic data can be extremely sensitive. Second, having different regulatory requirements depending on the nature of data hinders the creation of comprehensible consent forms. Third, empirical evidence about public preferences concerning data reuse suggests that exceptional protection for genetic data alone is not justified. In this sense, it is claimed that regulation concerning data reuse should treat genetic data as important, but not exceptional.
Collapse
Affiliation(s)
- Andrea Martani
- Institute for Biomedical Ethics, University of Basel, Basel, Switzerland
| | | | - Christiane Pauli-Magnus
- Department of Clinical Research, University and University Hospital of Basel, Basel, Switzerland
| | - Stuart McLennan
- Institute for Biomedical Ethics, University of Basel, Basel, Switzerland
- Institute of History and Ethics in Medicine, Technical University of Munich, Munich, Germany
| | - Bernice Simone Elger
- Institute for Biomedical Ethics, University of Basel, Basel, Switzerland
- University Center of Legal Medicine, University of Geneva, Geneva, Switzerland
| |
Collapse
|
48
|
Abstract
Exercise is a well-known non-pharmacologic agent used to prevent and treat a wide range of pathologic conditions such as metabolic and cardiovascular disease. In this sense, the classic field of exercise physiology has determined the main theoretical and practical bases of physiologic adaptations in response to exercise. However, the last decades were marked by significant advances in analytical laboratory techniques, where the field of biochemistry, genetics and molecular biology promoted exercise science to enter a new era. Regardless of its application, whether in the field of disease prevention or performance, the association of molecular biology with exercise physiology has been fundamental for unveiling knowledge of the molecular mechanisms related to the adaptation to exercise. This chapter will address the natural evolution of exercise physiology toward genetics and molecular biology, emphasizing the collection of integrated analytical approaches that composes the OMICS and their contribution to the field of molecular exercise physiology.
Collapse
|
49
|
Djulbegovic MB, Uversky VN. Expanding the understanding of the heterogeneous nature of melanoma with bioinformatics and disorder-based proteomics. Int J Biol Macromol 2019; 150:1281-1293. [PMID: 31743721 DOI: 10.1016/j.ijbiomac.2019.10.139] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 09/19/2019] [Accepted: 10/15/2019] [Indexed: 01/07/2023]
Abstract
The past few decades show that incidences of melanoma are on the rise. The risk associated with this disease is an interplay between genetic and host factors and sun exposure. While scientific progress in the treatment of melanoma is remarkable, additional research is needed to improve patient outcomes and to better understand the heterogenous nature of this disease. Fortunately, as the clinical community enters the era of "big data" and personalized medicine, the rise of bioinformatics that stems from recent advances in high throughout profiling of biological information offers potential for innovative treatment options. This study aims to provide an example of the usefulness of bioinformatics and disorder-based proteomics to identify the molecular pathway in melanoma, garner information on selected proteins from this pathway and uncover their intrinsically disordered proteins regions (IDPRs) and investigate functionality implicated in these IDPRs. The present study provides a new look at the melanoma heterogeneity and suggests that, in addition to the well-established genetic heterogeneity of melanoma, there is another level of heterogeneity that lies within the conformational ensembles that stem from intrinsic disorder in melanoma-related proteins. The hope is that these insights will inspire future drug discovery campaigns.
Collapse
Affiliation(s)
- Mak B Djulbegovic
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA; USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA; Protein Research Group, Institute for Biological Instrumentation of the Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia.
| |
Collapse
|
50
|
Marín M, Esteban FJ, Ramírez-Rodrigo H, Ros E, Sáez-Lara MJ. An integrative methodology based on protein-protein interaction networks for identification and functional annotation of disease-relevant genes applied to channelopathies. BMC Bioinformatics 2019; 20:565. [PMID: 31718537 PMCID: PMC6849233 DOI: 10.1186/s12859-019-3162-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 10/15/2019] [Indexed: 12/19/2022] Open
Abstract
Background Biologically data-driven networks have become powerful analytical tools that handle massive, heterogeneous datasets generated from biomedical fields. Protein-protein interaction networks can identify the most relevant structures directly tied to biological functions. Functional enrichments can then be performed based on these structural aspects of gene relationships for the study of channelopathies. Channelopathies refer to a complex group of disorders resulting from dysfunctional ion channels with distinct polygenic manifestations. This study presents a semi-automatic workflow using protein-protein interaction networks that can identify the most relevant genes and their biological processes and pathways in channelopathies to better understand their etiopathogenesis. In addition, the clinical manifestations that are strongly associated with these genes are also identified as the most characteristic in this complex group of diseases. Results In particular, a set of nine representative disease-related genes was detected, these being the most significant genes in relation to their roles in channelopathies. In this way we attested the implication of some voltage-gated sodium (SCN1A, SCN2A, SCN4A, SCN4B, SCN5A, SCN9A) and potassium (KCNQ2, KCNH2) channels in cardiovascular diseases, epilepsies, febrile seizures, headache disorders, neuromuscular, neurodegenerative diseases or neurobehavioral manifestations. We also revealed the role of Ankyrin-G (ANK3) in the neurodegenerative and neurobehavioral disorders as well as the implication of these genes in other systems, such as the immunological or endocrine systems. Conclusions This research provides a systems biology approach to extract information from interaction networks of gene expression. We show how large-scale computational integration of heterogeneous datasets, PPI network analyses, functional databases and published literature may support the detection and assessment of possible potential therapeutic targets in the disease. Applying our workflow makes it feasible to spot the most relevant genes and unknown relationships in channelopathies and shows its potential as a first-step approach to identify both genes and functional interactions in clinical-knowledge scenarios of target diseases. Methods An initial gene pool is previously defined by searching general databases under a specific semantic framework. From the resulting interaction network, a subset of genes are identified as the most relevant through the workflow that includes centrality measures and other filtering and enrichment databases.
Collapse
Affiliation(s)
- Milagros Marín
- Department of Computer Architecture and Technology - CITIC, University of Granada, Granada, Spain.,Department of Biochemistry and Molecular Biology I, University of Granada, Granada, Spain
| | - Francisco J Esteban
- Systems Biology Unit, Department of Experimental Biology, University of Jaén, Jaén, Spain.
| | | | - Eduardo Ros
- Department of Computer Architecture and Technology - CITIC, University of Granada, Granada, Spain
| | - María José Sáez-Lara
- Department of Biochemistry and Molecular Biology I, University of Granada, Granada, Spain.
| |
Collapse
|