1
|
Aziz T, Naveed M, Shabbir MA, Sarwar A, Naseeb J, Zhao L, Yang Z, Cui H, Lin L, Albekairi TH. Unveiling the whole genomic features and potential probiotic characteristics of novel Lactiplantibacillus plantarum HMX2. Front Microbiol 2024; 15:1504625. [PMID: 39611087 PMCID: PMC11602494 DOI: 10.3389/fmicb.2024.1504625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2024] [Accepted: 10/24/2024] [Indexed: 11/30/2024] Open
Abstract
This study investigates the genomic features and probiotic potential of Lactiplantibacillus plantarum HMX2, isolated from Chinese Sauerkraut, using whole-genome sequencing (WGS) and bioinformatics for the first time. This study also aims to find genetic diversity, antibiotic resistance genes, and functional capabilities to help us better understand its food safety applications and potential as a probiotic. L. plantarum HMX2 was cultured, and DNA was extracted for WGS. Genomic analysis comprised average nucleotide identity (ANI) prediction, genome annotation, pangenome, and synteny analysis. Bioinformatics techniques were used to identify CoDing Sequences (CDSs), transfer RNA (tRNA) and ribosomal RNA (rRNA) genes, and antibiotic resistance genes, as well as to conduct phylogenetic analysis to establish genetic diversity and evolution. The study found a significant genetic similarity (99.17% ANI) between L. plantarum HMX2 and the reference strain. Genome annotation revealed 3,242 coding sequences, 65 tRNA genes, and 16 rRNA genes. Significant genetic variety was found, including 25 antibiotic resistance genes. A phylogenetic study placed L. plantarum HMX2 among closely related bacteria, emphasizing its potential for probiotic and food safety applications. The genomic investigation of L. plantarum showed essential genes, including plnJK and plnEF, which contribute to antibacterial action against foodborne pathogens. Furthermore, genes such as MurA, Alr, and MprF improve food safety and probiotic potential by promoting bacterial survival under stress conditions in food and the gastrointestinal tract. This study introduces the new genomic features of L. plantarum HMX2 about specific genetics and its possibility of relevant uses in food security and technologies. These findings of specific genes involved in antimicrobial activity provide fresh possibilities for exploiting this strain in forming probiotic preparations and food preservation methods. The future research should focus on the experimental validation of antibiotic resistance genes, comparative genomics to investigate functional diversity, and the development of novel antimicrobial therapies that take advantage of L. plantarum's capabilities.
Collapse
Affiliation(s)
- Tariq Aziz
- Department of Food Science and Technology, College of Chemistry and Environmental Engineering, Shenzhen University, Shenzhen, Guangdong, China
- School of Biomedical Engineering, Shenzhen University, Shenzhen, Guangdong, China
- Key Laboratory of Geriatric Nutrition and Health of Ministry of Education, Beijing Advanced Innovation Center for Food Nutrition and Human Health, Beijing Engineering and Technology Research Center of Food Additives, Beijing Technology and Business University, Beijing, China
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, China
| | - Muhammad Naveed
- Department of Biotechnology, Faculty of Science and Technology, University of Central Punjab, Lahore, Pakistan
| | - Muhammad Aqib Shabbir
- Department of Biotechnology, Faculty of Biological Sciences, Lahore University of Biological and Applied Sciences, Lahore, Pakistan
| | - Abid Sarwar
- Key Laboratory of Geriatric Nutrition and Health of Ministry of Education, Beijing Advanced Innovation Center for Food Nutrition and Human Health, Beijing Engineering and Technology Research Center of Food Additives, Beijing Technology and Business University, Beijing, China
| | - Jasra Naseeb
- Key Laboratory of Geriatric Nutrition and Health of Ministry of Education, Beijing Advanced Innovation Center for Food Nutrition and Human Health, Beijing Engineering and Technology Research Center of Food Additives, Beijing Technology and Business University, Beijing, China
| | - Liqing Zhao
- Department of Food Science and Technology, College of Chemistry and Environmental Engineering, Shenzhen University, Shenzhen, Guangdong, China
| | - Zhennai Yang
- Key Laboratory of Geriatric Nutrition and Health of Ministry of Education, Beijing Advanced Innovation Center for Food Nutrition and Human Health, Beijing Engineering and Technology Research Center of Food Additives, Beijing Technology and Business University, Beijing, China
| | - Haiying Cui
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, China
| | - Lin Lin
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, China
| | - Thamer H. Albekairi
- Department of Pharmacology and Toxicology, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
| |
Collapse
|
2
|
Hsiao YC, Dutta A. Network Modeling and Control of Dynamic Disease Pathways, Review and Perspectives. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:1211-1230. [PMID: 38498762 DOI: 10.1109/tcbb.2024.3378155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/20/2024]
Abstract
Dynamic disease pathways are a combination of complex dynamical processes among bio-molecules in a cell that leads to diseases. Network modeling of disease pathways considers disease-related bio-molecules (e.g. DNA, RNA, transcription factors, enzymes, proteins, and metabolites) and their interaction (e.g. DNA methylation, histone modification, alternative splicing, and protein modification) to study disease progression and predict therapeutic responses. These bio-molecules and their interactions are the basic elements in the study of the misregulation in the disease-related gene expression that lead to abnormal cellular responses. Gene regulatory networks, cell signaling networks, and metabolic networks are the three major types of intracellular networks for the study of the cellular responses elicited from extracellular signals. The disease-related cellular responses can be prevented or regulated by designing control strategies to manipulate these extracellular or other intracellular signals. The paper reviews the regulatory mechanisms, the dynamic models, and the control strategies for each intracellular network. The applications, limitations and the prospective for modeling and control are also discussed.
Collapse
|
3
|
Karimzadeh F, Soltani Fard E, Nadi A, Malekzadeh R, Elahian F, Mirzaei SA. Advances in skin gene therapy: utilizing innovative dressing scaffolds for wound healing, a comprehensive review. J Mater Chem B 2024; 12:6033-6062. [PMID: 38887828 DOI: 10.1039/d4tb00966e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/20/2024]
Abstract
The skin, serving as the body's outermost layer, boasts a vast area and intricate structure, functioning as the primary barrier against external threats. Disruptions in the composition and functionality of the skin can lead to a diverse array of skin conditions, such as wounds, burns, and diabetic ulcers, along with inflammatory disorders, infections, and various types of skin cancer. These disorders not only exacerbate concerns regarding skin health and beauty but also have a significant impact on mental well-being. Due to the complexity of these disorders, conventional treatments often prove insufficient, necessitating the exploration of new therapeutic approaches. Researchers develop new therapies by deciphering these intricacies and gaining a thorough understanding of the protein networks and molecular processes in skin. A new window of opportunity has opened up for improving wound healing processes because of recent advancements in skin gene therapy. To enhance skin regeneration and healing, this extensive review investigates the use of novel dressing scaffolds in conjunction with gene therapy approaches. Scaffolds that do double duty as wound protectors and vectors for therapeutic gene delivery are being developed using innovative biomaterials. To improve cellular responses and speed healing, these state-of-the-art scaffolds allow for the targeted delivery and sustained release of genetic material. The most recent developments in gene therapy techniques include RNA interference, CRISPR-based gene editing, and the utilization of viral and non-viral vectors in conjunction with scaffolds, which were reviewed here to overcome skin disorders and wound complications. In the future, there will be rare chances to develop custom methods for skin health care thanks to the combination of modern technology and collaboration among disciplines.
Collapse
Affiliation(s)
- Fatemeh Karimzadeh
- Student Research Committee, Shahrekord University of Medical Sciences, Shahrekord, Iran
- Department of Medical Biotechnology, School of Advanced Technologies, Shahrekord University of Medical Sciences, Shahrekord, Iran.
| | - Elahe Soltani Fard
- Student Research Committee, Shahrekord University of Medical Sciences, Shahrekord, Iran
- Department of Molecular Medicine, School of Advanced Technologies, Shahrekord University of Medical Sciences, Shahrekord, Iran
| | - Akram Nadi
- Stem Cell Biology Research Center, Yazd Reproductive Sciences Institute, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
| | - Rahim Malekzadeh
- Student Research Committee, Shahrekord University of Medical Sciences, Shahrekord, Iran
- Department of Medical Biotechnology, School of Advanced Technologies, Shahrekord University of Medical Sciences, Shahrekord, Iran.
| | - Fatemeh Elahian
- Advanced Technology Cores, Baylor College of Medicine, Houston, Texas, USA
| | - Seyed Abbas Mirzaei
- Department of Medical Biotechnology, School of Advanced Technologies, Shahrekord University of Medical Sciences, Shahrekord, Iran.
- Cellular and Molecular Research Center, Basic Health Sciences Institute, Shahrekord University of Medical Sciences, Shahrekord, Iran
| |
Collapse
|
4
|
Oba GM, Nakato R. Clover: An unbiased method for prioritizing differentially expressed genes using a data-driven approach. Genes Cells 2024; 29:456-470. [PMID: 38602264 PMCID: PMC11163938 DOI: 10.1111/gtc.13119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Revised: 03/12/2024] [Accepted: 03/20/2024] [Indexed: 04/12/2024]
Abstract
Identifying key genes from a list of differentially expressed genes (DEGs) is a critical step in transcriptome analysis. However, current methods, including Gene Ontology analysis and manual annotation, essentially rely on existing knowledge, which is highly biased depending on the extent of the literature. As a result, understudied genes, some of which may be associated with important molecular mechanisms, are often ignored or remain obscure. To address this problem, we propose Clover, a data-driven scoring method to specifically highlight understudied genes. Clover aims to prioritize genes associated with important molecular mechanisms by integrating three metrics: the likelihood of appearing in the DEG list, tissue specificity, and number of publications. We applied Clover to Alzheimer's disease data and confirmed that it successfully detected known associated genes. Moreover, Clover effectively prioritized understudied but potentially druggable genes. Overall, our method offers a novel approach to gene characterization and has the potential to expand our understanding of gene functions. Clover is an open-source software written in Python3 and available on GitHub at https://github.com/G708/Clover.
Collapse
Affiliation(s)
- Gina Miku Oba
- Laboratory of Computational Genomics, Institute for Quantitative BiosciencesUniversity of TokyoTokyoJapan
- Department of Computational Biology and Medical Science, Graduate School of Frontier ScienceUniversity of TokyoTokyoJapan
| | - Ryuichiro Nakato
- Laboratory of Computational Genomics, Institute for Quantitative BiosciencesUniversity of TokyoTokyoJapan
- Department of Computational Biology and Medical Science, Graduate School of Frontier ScienceUniversity of TokyoTokyoJapan
| |
Collapse
|
5
|
Zou Y, Xiao Z, Wang L, Wang Y, Yin H, Li Y. Prevalence of antibiotic resistance genes and virulence factors in the sediment of WWTP effluent-dominated rivers. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 897:165441. [PMID: 37437635 DOI: 10.1016/j.scitotenv.2023.165441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 07/02/2023] [Accepted: 07/08/2023] [Indexed: 07/14/2023]
Abstract
In the context of increasing aridity due to climate changes, effluent from wastewater treatment plants (WWTPs) became dominant in some rivers. However, the prevalence of antibiotic resistance genes (ARGs) and virulence factors (VFs) in effluent-dominated rivers was rarely investigated. In this study, the profiles of ARGs and VFs in the sediment of two effluent-dominated rivers were revealed through the metagenomic sequencing technique. In each river, samples from the effluent discharge point (P site) and approximately 500 m downstream (D site) were collected. Results showed that the abundances of ARGs and VFs were both higher in D sites than those in P sites, indicating higher risks in the downstream areas. The compositions of ARGs were similar in the P sites of two rivers while being distinct in the D sites. The same was true for changes in the VFs compositions. Microbial community structure variations were the main driver for the changes in ARGs and VFs. Network analysis revealed that the interaction of ARGs and VF genes (VFGs) in sediment was intense. Two VFGs and eleven ARGs were identified to play important roles in the network. Metagenome-assembled genomes (MAGs) were generated to evaluate the coexistence of ARGs and VFGs at the single genome level. It was found that 38.4 % of the MAGs contained both ARGs and VFGs, and two MAGs were from pathogenic genera. These results suggested that high microbiological risks existed in effluent-dominated rivers, and necessary measures should be taken to prevent the potential threat to public health.
Collapse
Affiliation(s)
- Yina Zou
- The National Key Laboratory of Water Disaster Prevention, Yangtze Institute for Conservation and Development, Hohai University, Nanjing 210098, PR China
| | - Zijian Xiao
- The National Key Laboratory of Water Disaster Prevention, Dayu College, Hohai University, Nanjing 210098, PR China
| | - Longfei Wang
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing 210098, PR China
| | - Yutao Wang
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing 210098, PR China
| | - Haojie Yin
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing 210098, PR China
| | - Yi Li
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing 210098, PR China.
| |
Collapse
|
6
|
Nicholson DN, Himmelstein DS, Greene CS. Expanding a database-derived biomedical knowledge graph via multi-relation extraction from biomedical abstracts. BioData Min 2022; 15:26. [PMID: 36258252 PMCID: PMC9578183 DOI: 10.1186/s13040-022-00311-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Accepted: 09/17/2022] [Indexed: 02/04/2023] Open
Abstract
BACKGROUND Knowledge graphs support biomedical research efforts by providing contextual information for biomedical entities, constructing networks, and supporting the interpretation of high-throughput analyses. These databases are populated via manual curation, which is challenging to scale with an exponentially rising publication rate. Data programming is a paradigm that circumvents this arduous manual process by combining databases with simple rules and heuristics written as label functions, which are programs designed to annotate textual data automatically. Unfortunately, writing a useful label function requires substantial error analysis and is a nontrivial task that takes multiple days per function. This bottleneck makes populating a knowledge graph with multiple nodes and edge types practically infeasible. Thus, we sought to accelerate the label function creation process by evaluating how label functions can be re-used across multiple edge types. RESULTS We obtained entity-tagged abstracts and subsetted these entities to only contain compounds, genes, and disease mentions. We extracted sentences containing co-mentions of certain biomedical entities contained in a previously described knowledge graph, Hetionet v1. We trained a baseline model that used database-only label functions and then used a sampling approach to measure how well adding edge-specific or edge-mismatch label function combinations improved over our baseline. Next, we trained a discriminator model to detect sentences that indicated a biomedical relationship and then estimated the number of edge types that could be recalled and added to Hetionet v1. We found that adding edge-mismatch label functions rarely improved relationship extraction, while control edge-specific label functions did. There were two exceptions to this trend, Compound-binds-Gene and Gene-interacts-Gene, which both indicated physical relationships and showed signs of transferability. Across the scenarios tested, discriminative model performance strongly depends on generated annotations. Using the best discriminative model for each edge type, we recalled close to 30% of established edges within Hetionet v1. CONCLUSIONS Our results show that this framework can incorporate novel edges into our source knowledge graph. However, results with label function transfer were mixed. Only label functions describing very similar edge types supported improved performance when transferred. We expect that the continued development of this strategy may provide essential building blocks to populating biomedical knowledge graphs with discoveries, ensuring that these resources include cutting-edge results.
Collapse
Affiliation(s)
- David N. Nicholson
- grid.25879.310000 0004 1936 8972Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania, Philadelphia, PA USA
| | - Daniel S. Himmelstein
- grid.25879.310000 0004 1936 8972Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania, Philadelphia, PA USA
| | - Casey S. Greene
- grid.430503.10000 0001 0703 675XDepartment of Biomedical Informatics, University of Colorado School of Medicine and Center for Health Artificial Intellegence (CHAI), University of Colorado School of Medicine, Aurora, USA
| |
Collapse
|
7
|
Al-Anzi BF, Khajah M, Fakhraldeen SA. Predicting and explaining the impact of genetic disruptions and interactions on organismal viability. Bioinformatics 2022; 38:4088-4099. [PMID: 35861390 PMCID: PMC9438956 DOI: 10.1093/bioinformatics/btac519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 06/30/2022] [Accepted: 07/20/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Existing computational models can predict single- and double-mutant fitness but they do have limitations. First, they are often tested via evaluation metrics that are inappropriate for imbalanced datasets. Second, all of them only predict a binary outcome (viable or not, and negatively interacting or not). Third, most are uninterpretable black box machine learning models. RESULTS Budding yeast datasets were used to develop high-performance Multinomial Regression (MN) models capable of predicting the impact of single, double and triple genetic disruptions on viability. These models are interpretable and give realistic non-binary predictions and can predict negative genetic interactions (GIs) in triple-gene knockouts. They are based on a limited set of gene features and their predictions are influenced by the probability of target gene participating in molecular complexes or pathways. Furthermore, the MN models have utility in other organisms such as fission yeast, fruit flies and humans, with the single gene fitness MN model being able to distinguish essential genes necessary for cell-autonomous viability from those required for multicellular survival. Finally, our models exceed the performance of previous models, without sacrificing interpretability. AVAILABILITY AND IMPLEMENTATION All code and processed datasets used to generate results and figures in this manuscript are available at our Github repository at https://github.com/KISRDevelopment/cell_viability_paper. The repository also contains a link to the GI prediction website that lets users search for GIs using the MN models. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | - Saja A Fakhraldeen
- Ecosystem-based Management of Marine Resources Program, Kuwait Institute for Scientific Research, Safat, 13109, Kuwait
| |
Collapse
|
8
|
Venkata Subbiah H, Ramesh Babu P, Subbiah U. Determination of deleterious single-nucleotide polymorphisms of human LYZ C gene: an in silico study. J Genet Eng Biotechnol 2022; 20:92. [PMID: 35776277 PMCID: PMC9247897 DOI: 10.1186/s43141-022-00383-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Accepted: 06/14/2022] [Indexed: 11/26/2022]
Abstract
Background Single-nucleotide polymorphisms (SNPs) have a crucial function in affecting the susceptibility of individuals to diseases and also determine how an individual responds to different treatment options. The present study aimed to predict and characterize deleterious missense nonsynonymous SNPs (nsSNPs) of lysozyme C (LYZ C) gene using different computational methods. Lyz C is an important antimicrobial peptide capable of damaging the peptidoglycan layer of bacteria leading to osmotic shock and cell death. The nsSNPs were first analyzed by SIFT and PolyPhen v2 tools. The nsSNPs predicted as deleterious were then assessed by other in silico tools — SNAP, PROVEAN, PhD-SNP, and SNPs & GO. These SNPs were further examined by I-Mutant 3.0 and ConSurf. GeneMANIA and STRING tools were used to study the interaction network of the LYZ C gene. NetSurfP 2.0 was used to predict the secondary structure of Lyz C protein. The impact of variations on the structural characteristics of the protein was studied by HOPE analysis. The structures of wild type and variants were predicted by SWISS-MODEL web server, and energy minimization was carried out using XenoPlot software. TM-align tool was used to predict root-mean-square deviation (RMSD) and template modeling (TM) scores. Results Eight missense nsSNPs (T88N, I74T, F75I, D67H, W82R, D85H, R80C, and R116S) were found to be potentially deleterious. I-Mutant 3.0 determined that the variants decreased the stability of the protein. ConSurf predicted rs121913547, rs121913549, and rs387906536 nsSNPs to be conserved. Interaction network tools showed that LYZ C protein interacted with lactoferrin (LTF). HOPE tool analyzed differences in physicochemical properties between wild type and variants. TM-align tool predicted the alignment score, and the protein folding was found to be identical. PyMOL was used to visualize the superimposition of variants over wild type. Conclusion This study ascertained the deleterious missense nsSNPs of the LYZ C gene and could be used in further experimental analysis. These high-risk nsSNPs could be used as molecular targets for diagnostic and therapeutic interventions.
Collapse
Affiliation(s)
- Harini Venkata Subbiah
- Human Genetics Research Centre, Sree Balaji Dental College & Hospital, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, India
| | - Polani Ramesh Babu
- Center for Materials Engineering and Regenerative Medicine, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, India
| | - Usha Subbiah
- Human Genetics Research Centre, Sree Balaji Dental College & Hospital, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, India.
| |
Collapse
|
9
|
You Y, Lai X, Pan Y, Zheng H, Vera J, Liu S, Deng S, Zhang L. Artificial intelligence in cancer target identification and drug discovery. Signal Transduct Target Ther 2022; 7:156. [PMID: 35538061 PMCID: PMC9090746 DOI: 10.1038/s41392-022-00994-0] [Citation(s) in RCA: 102] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2021] [Revised: 03/14/2022] [Accepted: 04/05/2022] [Indexed: 02/08/2023] Open
Abstract
Artificial intelligence is an advanced method to identify novel anticancer targets and discover novel drugs from biology networks because the networks can effectively preserve and quantify the interaction between components of cell systems underlying human diseases such as cancer. Here, we review and discuss how to employ artificial intelligence approaches to identify novel anticancer targets and discover drugs. First, we describe the scope of artificial intelligence biology analysis for novel anticancer target investigations. Second, we review and discuss the basic principles and theory of commonly used network-based and machine learning-based artificial intelligence algorithms. Finally, we showcase the applications of artificial intelligence approaches in cancer target identification and drug discovery. Taken together, the artificial intelligence models have provided us with a quantitative framework to study the relationship between network characteristics and cancer, thereby leading to the identification of potential anticancer targets and the discovery of novel drug candidates.
Collapse
Affiliation(s)
- Yujie You
- College of Computer Science, Sichuan University, Chengdu, 610065, China
| | - Xin Lai
- Laboratory of Systems Tumor Immunology, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU) and Universitätsklinikum Erlangen, Erlangen, 91052, Germany
| | - Yi Pan
- Faculty of Computer Science and Control Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Room D513, 1068 Xueyuan Avenue, Shenzhen University Town, Shenzhen, 518055, China
| | - Huiru Zheng
- School of Computing, Ulster University, Belfast, BT15 1ED, UK
| | - Julio Vera
- Laboratory of Systems Tumor Immunology, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU) and Universitätsklinikum Erlangen, Erlangen, 91052, Germany
| | - Suran Liu
- College of Computer Science, Sichuan University, Chengdu, 610065, China
| | - Senyi Deng
- Institute of Thoracic Oncology, Department of Thoracic Surgery, West China Hospital, Sichuan University, Chengdu, 610065, China.
| | - Le Zhang
- College of Computer Science, Sichuan University, Chengdu, 610065, China.
- Key Laboratory of Systems Biology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Hangzhou, 310024, China.
- Key Laboratory of Systems Health Science of Zhejiang Province, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China.
| |
Collapse
|
10
|
Bhatnagar R, Sardar S, Beheshti M, Podichetty JT. How can natural language processing help model informed drug development?: a review. JAMIA Open 2022; 5:ooac043. [PMID: 35702625 PMCID: PMC9188322 DOI: 10.1093/jamiaopen/ooac043] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 04/28/2022] [Accepted: 05/26/2022] [Indexed: 01/20/2023] Open
Abstract
Objective To summarize applications of natural language processing (NLP) in model informed drug development (MIDD) and identify potential areas of improvement. Materials and Methods Publications found on PubMed and Google Scholar, websites and GitHub repositories for NLP libraries and models. Publications describing applications of NLP in MIDD were reviewed. The applications were stratified into 3 stages: drug discovery, clinical trials, and pharmacovigilance. Key NLP functionalities used for these applications were assessed. Programming libraries and open-source resources for the implementation of NLP functionalities in MIDD were identified. Results NLP has been utilized to aid various processes in drug development lifecycle such as gene-disease mapping, biomarker discovery, patient-trial matching, adverse drug events detection, etc. These applications commonly use NLP functionalities of named entity recognition, word embeddings, entity resolution, assertion status detection, relation extraction, and topic modeling. The current state-of-the-art for implementing these functionalities in MIDD applications are transformer models that utilize transfer learning for enhanced performance. Various libraries in python, R, and Java like huggingface, sparkNLP, and KoRpus as well as open-source platforms such as DisGeNet, DeepEnroll, and Transmol have enabled convenient implementation of NLP models to MIDD applications. Discussion Challenges such as reproducibility, explainability, fairness, limited data, limited language-support, and security need to be overcome to ensure wider adoption of NLP in MIDD landscape. There are opportunities to improve the performance of existing models and expand the use of NLP in newer areas of MIDD. Conclusions This review provides an overview of the potential and pitfalls of current NLP approaches in MIDD.
Collapse
Affiliation(s)
- Roopal Bhatnagar
- Data Science, Data Collaboration Center, Critical Path Institute , Tucson, Arizona, USA
| | - Sakshi Sardar
- Quantitative Medicine, Critical Path Institute , Tucson, Arizona, USA
| | - Maedeh Beheshti
- Quantitative Medicine, Critical Path Institute , Tucson, Arizona, USA
| | | |
Collapse
|
11
|
Weiskittel TM, Ung CY, Correia C, Zhang C, Li H. De novo individualized disease modules reveal the synthetic penetrance of genes and inform personalized treatment regimens. Genome Res 2021; 32:124-134. [PMID: 34876496 PMCID: PMC8744682 DOI: 10.1101/gr.275889.121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Accepted: 11/30/2021] [Indexed: 12/04/2022]
Abstract
Current understandings of individual disease etiology and therapeutics are limited despite great need. To fill the gap, we propose a novel computational pipeline that collects potent disease gene cooperative pathways to envision individualized disease etiology and therapies. Our algorithm constructs individualized disease modules de novo, which enables us to elucidate the importance of mutated genes in specific patients and to understand the synthetic penetrance of these genes across patients. We reveal that importance of the notorious cancer drivers TP53 and PIK3CA fluctuate widely across breast cancers and peak in tumors with distinct numbers of mutations and that rarely mutated genes such as XPO1 and PLEKHA1 have high disease module importance in specific individuals. Furthermore, individualized module disruption enables us to devise customized singular and combinatorial target therapies that were highly varied across patients, showing the need for precision therapeutics pipelines. As the first analysis of de novo individualized disease modules, we illustrate the power of individualized disease modules for precision medicine by providing deep novel insights on the activity of diseased genes in individuals.
Collapse
Affiliation(s)
- Taylor M Weiskittel
- Center for Individualized Medicine, Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, Minnesota 55905, USA
| | - Choong Y Ung
- Center for Individualized Medicine, Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, Minnesota 55905, USA
| | - Cristina Correia
- Center for Individualized Medicine, Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, Minnesota 55905, USA
| | - Cheng Zhang
- Center for Individualized Medicine, Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, Minnesota 55905, USA
| | - Hu Li
- Center for Individualized Medicine, Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, Minnesota 55905, USA
| |
Collapse
|
12
|
Xin S, Zhang W. Construction and analysis of the protein-protein interaction network for the detoxification enzymes of the silkworm, Bombyx mori. ARCHIVES OF INSECT BIOCHEMISTRY AND PHYSIOLOGY 2021; 108:e21850. [PMID: 34750851 DOI: 10.1002/arch.21850] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 09/27/2021] [Accepted: 10/15/2021] [Indexed: 06/13/2023]
Abstract
Detoxification enzymes are necessary for insects to metabolize toxic substances and maintain physiological activities. Cytochromes P450 (CYPs), glutathione S-transferases (GSTs), and carboxylesterase (CarEs) are the main detoxification enzymes in insects. In addition, UDP-glucosyltransferase and ATP-binding cassette transporter also participate in the process of material metabolism. This study collected proteins related to detoxification in the silkworm, Bombyx mori (Lepidoptera: Bombycidae). And we performed Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis on these proteins to understand their biological function. We constructed the protein-protein interaction network for the silkworm's detoxification enzymes and analyzed the network's topological properties. We found that BGIBMGA014046-TA, BGIBMGA003221-TA, BGIBMGA011092-TA, BGIBMGA000074-TA, and LOC732976 are the essential proteins in the network. These proteins are primarily involved in the process of ribosome biogenesis and may be related to protein synthesis. We integrated GO, KEGG, and network analysis and found that ribosome-associated protein and GSTs played a vital role in the detoxification process.
Collapse
Affiliation(s)
- ShangHong Xin
- School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - WenJun Zhang
- School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
13
|
de Oliveira KC, Camilo C, Gastaldi VD, Sant'Anna Feltrin A, Lisboa BCG, de Jesus Rodrigues de Paula V, Moretto AC, Lafer B, Hoexter MQ, Miguel EC, Maschietto M, Brentani H. Brain areas involved with obsessive-compulsive disorder present different DNA methylation modulation. BMC Genom Data 2021; 22:45. [PMID: 34717534 PMCID: PMC8557022 DOI: 10.1186/s12863-021-00993-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2021] [Accepted: 08/29/2021] [Indexed: 12/13/2022] Open
Abstract
Background Obsessive-compulsive disorder (OCD) is characterized by intrusive thoughts and repetitive actions, that presents the involvement of the cortico-striatal areas. The contribution of environmental risk factors to OCD development suggests that epigenetic mechanisms may contribute to its pathophysiology. DNA methylation changes and gene expression were evaluated in post-mortem brain tissues of the cortical (anterior cingulate gyrus and orbitofrontal cortex) and ventral striatum (nucleus accumbens, caudate nucleus and putamen) areas from eight OCD patients and eight matched controls. Results There were no differentially methylated CpG (cytosine-phosphate-guanine) sites (DMSs) in any brain area, nevertheless gene modules generated from CpG sites and protein-protein-interaction (PPI) showed enriched gene modules for all brain areas between OCD cases and controls. All brain areas but nucleus accumbens presented a predominantly hypomethylation pattern for the differentially methylated regions (DMRs). Although there were common transcriptional factors that targeted these DMRs, their targeted differentially expressed genes were different among all brain areas. The protein-protein interaction network based on methylation and gene expression data reported that all brain areas were enriched for G-protein signaling pathway, immune response, apoptosis and synapse biological processes but each brain area also presented enrichment of specific signaling pathways. Finally, OCD patients and controls did not present significant DNA methylation age differences. Conclusions DNA methylation changes in brain areas involved with OCD, especially those involved with genes related to synaptic plasticity and the immune system could mediate the action of genetic and environmental factors associated with OCD. Supplementary Information The online version contains supplementary material available at 10.1186/s12863-021-00993-0.
Collapse
Affiliation(s)
- Kátia Cristina de Oliveira
- Departamento & Instituto de Psiquiatria, Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Rua Dr. Ovídio Pires de Campos, 785 - LIM23 (Térreo), São Paulo, 05403-010, Brazil.,Center of Mathematics, Computation and Cognition, Federal University of ABC, São Bernardo do Campo, Brazil.,Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Sao Paulo, Brazil
| | - Caroline Camilo
- Departamento & Instituto de Psiquiatria, Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Rua Dr. Ovídio Pires de Campos, 785 - LIM23 (Térreo), São Paulo, 05403-010, Brazil.
| | - Vinícius Daguano Gastaldi
- Departamento & Instituto de Psiquiatria, Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Rua Dr. Ovídio Pires de Campos, 785 - LIM23 (Térreo), São Paulo, 05403-010, Brazil
| | - Arthur Sant'Anna Feltrin
- Center of Mathematics, Computation and Cognition, Federal University of ABC, São Bernardo do Campo, Brazil
| | - Bianca Cristina Garcia Lisboa
- Departamento & Instituto de Psiquiatria, Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Rua Dr. Ovídio Pires de Campos, 785 - LIM23 (Térreo), São Paulo, 05403-010, Brazil
| | - Vanessa de Jesus Rodrigues de Paula
- Departamento & Instituto de Psiquiatria, Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Rua Dr. Ovídio Pires de Campos, 785 - LIM23 (Térreo), São Paulo, 05403-010, Brazil
| | | | - Beny Lafer
- Departamento & Instituto de Psiquiatria, Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Rua Dr. Ovídio Pires de Campos, 785 - LIM23 (Térreo), São Paulo, 05403-010, Brazil
| | - Marcelo Queiroz Hoexter
- Departamento & Instituto de Psiquiatria, Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Rua Dr. Ovídio Pires de Campos, 785 - LIM23 (Térreo), São Paulo, 05403-010, Brazil.,Laboratório de Psicopatologia e Terapêutica Psiquiátrica (LIM23), Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Sao Paulo, Brazil
| | - Euripedes Constantino Miguel
- Departamento & Instituto de Psiquiatria, Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Rua Dr. Ovídio Pires de Campos, 785 - LIM23 (Térreo), São Paulo, 05403-010, Brazil.,Laboratório de Psicopatologia e Terapêutica Psiquiátrica (LIM23), Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Sao Paulo, Brazil
| | | | | | - Helena Brentani
- Departamento & Instituto de Psiquiatria, Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Rua Dr. Ovídio Pires de Campos, 785 - LIM23 (Térreo), São Paulo, 05403-010, Brazil.,Laboratório de Psicopatologia e Terapêutica Psiquiátrica (LIM23), Faculdade de Medicina FMUSP, Universidade de Sao Paulo, Sao Paulo, Brazil
| |
Collapse
|
14
|
Singh N, Bhatnagar S. Machine Learning for Prediction of Drug Targets in Microbe Associated Cardiovascular Diseases by Incorporating Host-pathogen Interaction Network Parameters. Mol Inform 2021; 41:e2100115. [PMID: 34676983 DOI: 10.1002/minf.202100115] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 10/01/2021] [Indexed: 12/20/2022]
Abstract
Host-pathogen interactions play a crucial role in invasion, infection, and induction of immune response in humans. In this work, four machine learning algorithms, namely Logistic regression, K-nearest neighbor, Support Vector Machine, and Random Forest were implemented for the classification of drug targets. The algorithms were trained using 3400 hosts and 3800 pathogen drug and non-drug target proteins as learning instances. For each protein, 68 pathogen and 73 host features were computed that included sequence, structure, biological and host-pathogen network centrality characteristics. The Random Forest classifier model achieved the best accuracy after 10-fold cross-validation. 99 % accuracy was achieved with a ROC-AUC score of 0.99±0.01 for both pathogen and host training sets. The Eigenvector Centrality of host-pathogen interactions and host-host interactions was the top feature in performing classification of pathogen and host targets respectively. Other features important for classification were the presence of catalytic and binding sites, low instability/aliphatic index, and cellular location. The Random Forest classifier was then used for prediction of drug targets involved in Microbe Associated Cardiovascular Diseases. 331 host and 743 pathogen proteins were predicted as drug targets by the random forest model and can be validated experimentally for therapeutic intervention in Microbe Associated Cardiovascular Diseases.
Collapse
Affiliation(s)
- Nirupma Singh
- Department of Biotechnology, Netaji Subhas Institute of Technology, Dwarka, New Delhi, 110078, India
| | - Sonika Bhatnagar
- Department of Biotechnology, Netaji Subhas Institute of Technology, Dwarka, New Delhi, 110078, India.,Computational and Structural Biology Laboratory, Department of Biological Sciences and Engineering, Netaji Subhas University of Technology Dwarka, New Delhi, 110078, India
| |
Collapse
|
15
|
Choi W, Lee H. Identifying disease-gene associations using a convolutional neural network-based model by embedding a biological knowledge graph with entity descriptions. PLoS One 2021; 16:e0258626. [PMID: 34653225 PMCID: PMC8519444 DOI: 10.1371/journal.pone.0258626] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 10/01/2021] [Indexed: 12/09/2022] Open
Abstract
Understanding the role of genes in human disease is of high importance. However, identifying genes associated with human diseases requires laborious experiments that involve considerable effort and time. Therefore, a computational approach to predict candidate genes related to complex diseases including cancer has been extensively studied. In this study, we propose a convolutional neural network-based knowledge graph-embedding model (KGED), which is based on a biological knowledge graph with entity descriptions to infer relationships between biological entities. As an application demonstration, we generated gene-interaction networks for each cancer type using gene-gene relationships inferred by KGED. We then analyzed the constructed gene networks using network centrality measures, including betweenness, closeness, degree, and eigenvector centrality metrics, to rank the central genes of the network and identify highly correlated cancer genes. Furthermore, we evaluated our proposed approach for prostate, breast, and lung cancers by comparing the performance with that of existing approaches. The KGED model showed improved performance in predicting cancer-related genes using the inferred gene-gene interactions. Thus, we conclude that gene-gene interactions inferred by KGED can be helpful for future research, such as that aimed at future research on pathogenic mechanisms of human diseases, and contribute to the field of disease treatment discovery.
Collapse
Affiliation(s)
- Wonjun Choi
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Buk-gu, Gwangju, Republic of Korea
| | - Hyunju Lee
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Buk-gu, Gwangju, Republic of Korea
| |
Collapse
|
16
|
Gopal J, Prakash Sinnarasan VS, Venkatesan A. Identification of Repurpose Drugs by Computational Analysis of Disease-Gene-Drug Associations. J Comput Biol 2021; 28:975-984. [PMID: 34242526 DOI: 10.1089/cmb.2020.0356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Repurposing of marketed drugs to find new indications has become an alternative to circumvent the risk of traditional drug development by its productivity quality. Despite many approaches, computational analysis has great potential to fuel the development of all-rounder drugs to find new classes of medicine for neglected and rare disease. The genes that can explain variations in drug response associated to disease are more important and significant in drug therapeutics necessitate elucidating the relationships of a gene, drug, and disease. The proposed computational analysis facilitates the discovery of knowledge on both target and disease-based relationships from large sources of biomedical literature spread over different platforms. It uses the utility of text mining for automatic extraction of valuable aggregated biomedical entities (disease, gene, and drug) from PubMed to serves as an input to the analysis of association prediction. The top-ranked associations considered for identification of repurposing drugs and also the hidden associations identified using concurrence principle to extrapolate the new relationships. Such findings are reported as novel and contribute to the knowledge base for pharmacogenomics, would immensely support the discovery and progress of novel therapeutic pathways and patient segment biomarkers.
Collapse
Affiliation(s)
- Jeyakodi Gopal
- Centre for Bioinformatics, School of Life Sciences, Pondicherry University, Puducherry, India
| | | | - Amouda Venkatesan
- Centre for Bioinformatics, School of Life Sciences, Pondicherry University, Puducherry, India
| |
Collapse
|
17
|
Mining Proteome Research Reports: A Bird's Eye View. Proteomes 2021; 9:proteomes9020029. [PMID: 34200663 PMCID: PMC8293458 DOI: 10.3390/proteomes9020029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Revised: 05/27/2021] [Accepted: 06/08/2021] [Indexed: 01/25/2023] Open
Abstract
The complexity of data has burgeoned to such an extent that scientists of every realm are encountering the incessant challenge of data management. Modern-day analytical approaches with the help of free source tools and programming languages have facilitated access to the context of the various domains as well as specific works reported. Here, with this article, an attempt has been made to provide a systematic analysis of all the available reports at PubMed on Proteome using text mining. The work is comprised of scientometrics as well as information extraction to provide the publication trends as well as frequent keywords, bioconcepts and most importantly gene–gene co-occurrence network. Out of 33,028 PMIDs collected initially, the segregation of 24,350 articles under 28 Medical Subject Headings (MeSH) was analyzed and plotted. Keyword link network and density visualizations were provided for the top 1000 frequent Mesh keywords. PubTator was used, and 322,026 bioconcepts were able to extracted under 10 classes (such as Gene, Disease, CellLine, etc.). Co-occurrence networks were constructed for PMID-bioconcept as well as bioconcept–bioconcept associations. Further, for creation of subnetwork with respect to gene–gene co-occurrence, a total of 11,100 unique genes participated with mTOR and AKT showing the highest (64) number of connections. The gene p53 was the most popular one in the network in accordance with both the degree and weighted degree centrality, which were 425 and 1414, respectively. The present piece of study is an amalgam of bibliometrics and scientific data mining methods looking deeper into the whole scale analysis of available literature on proteome.
Collapse
|
18
|
Janyasupab P, Suratanee A, Plaimas K. Network diffusion with centrality measures to identify disease-related genes. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2021; 18:2909-2929. [PMID: 33892577 DOI: 10.3934/mbe.2021147] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Disease-related gene prioritization is one of the most well-established pharmaceutical techniques used to identify genes that are important to a biological process relevant to a disease. In identifying these essential genes, the network diffusion (ND) approach is a widely used technique applied in gene prioritization. However, there is still a large number of candidate genes that need to be evaluated experimentally. Therefore, it would be of great value to develop a new strategy to improve the precision of the prioritization. Given the efficiency and simplicity of centrality measures in capturing a gene that might be important to the network structure, herein, we propose a technique that extends the scope of ND through a centrality measure to identify new disease-related genes. Five common centrality measures with different aspects were examined for integration in the traditional ND model. A total of 40 diseases were used to test our developed approach and to find new genes that might be related to a disease. Results indicated that the best measure to combine with the diffusion is closeness centrality. The novel candidate genes identified by the model for all 40 diseases were provided along with supporting evidence. In conclusion, the integration of network centrality in ND is a simple but effective technique to discover more precise disease-related genes, which is extremely useful for biomedical science.
Collapse
Affiliation(s)
- Panisa Janyasupab
- Advanced Virtual and Intelligent Computing (AVIC) Center, Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Bangkok, 10330, Thailand
| | - Apichat Suratanee
- Intelligent and Nonlinear Dynamic Innovations Research Center, Department of Mathematics, Faculty of Applied Science, King Mongkut's University of Technology North Bangkok, Bangkok 10800, Thailand
| | - Kitiporn Plaimas
- Advanced Virtual and Intelligent Computing (AVIC) Center, Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Bangkok, 10330, Thailand
- Omics Science and Bioinformatics Center, Faculty of Science, Chulalongkorn University, Bangkok, 10330, Thailand
| |
Collapse
|
19
|
Morenikeji OB, Wallace M, Strutton E, Bernard K, Yip E, Thomas BN. Integrative Network Analysis of Predicted miRNA-Targets Regulating Expression of Immune Response Genes in Bovine Coronavirus Infection. Front Genet 2020; 11:584392. [PMID: 33193717 PMCID: PMC7554596 DOI: 10.3389/fgene.2020.584392] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Accepted: 09/04/2020] [Indexed: 12/11/2022] Open
Abstract
Bovine coronavirus (BCoV) infection that causes disease outbreaks among farm animals, resulting in significant economic losses particularly in the cattle industry, has the potential to become zoonotic. miRNAs, which are short non-coding segments of RNA that inhibits the expression of their target genes, have been identified as potential biomarkers and drug targets, though this potential in BCoV remains largely unknown. We hypothesize that certain miRNAs could simultaneously target multiple genes, are significantly conserved across many species, thereby demonstrating the potential to serve as diagnostic or therapeutic tools for bovine coronavirus infection. To this end, we utilized different existing and publicly available computational tools to conduct system analysis predicting important miRNAs that could affect BCoV pathogenesis. Eleven genes including CEBPD, IRF1, TLR9, SRC, and RHOA, significantly indicated in immune-related pathways, were identified to be associated with BCoV, and implicated in other coronaviruses. Of the 70 miRNAs predicted to target the identified genes, four concomitant miRNAs (bta-miR-11975, bta-miR-11976, bta-miR-22-3p, and bta-miR-2325c) were found. Examining the gene interaction network suggests IL-6, IRF1, and TP53 as key drivers. Phylogenetic analysis revealed that miR-22 was completely conserved across all 14 species it was searched against, suggesting a shared and important functional role. Functional annotation and associated pathways of target genes, such as positive regulation of cytokine production, IL-6 signaling pathway, and regulation of leukocyte differentiation, indicate the miRNAs are major participants in multiple aspects of both innate and adaptive immune response. Examination of variants evinced a potentially deleterious SNP in bta-miR-22-3p and an advantageous SNP in bta-miR-2325c. Conclusively, this study provides new insight into miRNAs regulating genes responding to BCoV infection, with bta-miR-22-3p particularly indicated as a potential drug target or diagnostic marker for bovine coronavirus.
Collapse
Affiliation(s)
| | | | - Ellis Strutton
- Department of Biology, Hamilton College, Clinton, NY, United States
| | - Kahleel Bernard
- Department of Biology, Hamilton College, Clinton, NY, United States
| | - Elaine Yip
- Department of Biology, Hamilton College, Clinton, NY, United States
| | - Bolaji N Thomas
- Department of Biomedical Sciences, College of Health Sciences and Technology, Rochester Institute of Technology, Rochester, NY, United States
| |
Collapse
|
20
|
Srivastava S, Dafale NA, Purohit HJ. Functional genomics assessment of lytic polysaccharide mono-oxygenase with glycoside hydrolases in Paenibacillus dendritiformis CRN18. Int J Biol Macromol 2020; 164:3729-3738. [PMID: 32835796 DOI: 10.1016/j.ijbiomac.2020.08.147] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 08/06/2020] [Accepted: 08/19/2020] [Indexed: 11/25/2022]
Abstract
Recently discovered Lytic Polysaccharide Mono-Oxygenase (LPMO) enhances the enzymatic deconstruction of complex polysaccharide by oxidation. The present study demonstrates the agricultural waste hydrolyzing capabilities of Paenibacillus dendritiformis CRN18, which exhibits the enzyme activity of exo-glucanase, β-glucosidase, β-glucuronidase, endo-1, 4 β-xylanases, arabinosidase, and α-galactosidase as 0.1U/ml, 0.3U/ml, 0.09U/ml, 0.1U/ml, 0.05U/ml, and 0.41U/ml, respectively. The genome analysis of strain reveals the presence of four LPMO genes, along with lignocellulolytic genes. The gene structure of LPMO and its phylogenetic analysis shows the evolutionary relatedness with the Bacillus LPMO gene. Gene position of LPMOs in the genome of strains shows the close association of two LPMOs with chitin active enzyme GH18, and the other two are associated with hemicellulases (GH39, GH23). Protein-protein interaction and gene networking of LPMO sheds light on the co-occurrence, neighborhood, and interaction of LPMOs with chitinase and xylanase enzymes. Structural prediction of LPMOs unravels the information of the LPMO's binding site. Although the LPMO has been explored for its oxidative mechanism, a little light has been shed on its gene structure. This study provides insights into the LPMO gene structure in P. dendritiformis CRN18 and its potential in lignocellulose hydrolysis.
Collapse
Affiliation(s)
- Shweta Srivastava
- Environmental Biotechnology & Genomics Division, CSIR-National Environmental Engineering Research Institute (NEERI), Nagpur 440 020, India; AcSIR-Academy for Scientific and Innovative Research, Ghaziabad 201 002, India
| | - Nishant A Dafale
- Environmental Biotechnology & Genomics Division, CSIR-National Environmental Engineering Research Institute (NEERI), Nagpur 440 020, India; AcSIR-Academy for Scientific and Innovative Research, Ghaziabad 201 002, India.
| | - Hemant J Purohit
- Environmental Biotechnology & Genomics Division, CSIR-National Environmental Engineering Research Institute (NEERI), Nagpur 440 020, India
| |
Collapse
|
21
|
Taha K, Yoo PD. An Effective Disease Risk Indicator Tool. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2020; 2020:5284-5287. [PMID: 33019176 DOI: 10.1109/embc44109.2020.9175994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Each mixture of deficient molecular families of a specific disease induces the disease at a different time frame in the future. Based on this, we propose a novel methodology for personalizing a person's level of future susceptibility to a specific disease by inferring the mixture of his/her molecular families, whose combined deficiencies is likely to induce the disease. We implemented the methodology in a working system called DRIT, which consists of the following components: logic inferencer, information extractor, risk indicator, and interrelationship between molecular families modeler. The information extractor takes advantage of the exponential increase of biomedical literature to extract the common biomarkers that test positive among most patients with a specific disease. The logic inferencer transforms the hierarchical interrelationships between the molecular families of a disease into rule-based specifications. The interrelationship between molecular families modeler models the hierarchical interrelationships between the molecular families, whose biomarkers were extracted by the information extractor. It employs the specification rules and the inference rules for predicate logic to infer as many as possible probable deficient molecular families for a person based on his/her few molecular families, whose biomarkers tested positive by medical screening. The risk indicator outputs a risk indicator value that reflects a person's level of future susceptibility to the disease. We evaluated DRIT by comparing it experimentally with a comparable method. Results revealed marked improvement.
Collapse
|
22
|
Abstract
Cardiovascular diseases are the leading cause of death worldwide. Complex diseases with highly heterogenous disease progression among patient populations, cardiovascular diseases feature multifactorial contributions from both genetic and environmental stressors. Despite significant effort utilizing multiple approaches from molecular biology to genome-wide association studies, the genetic landscape of cardiovascular diseases, particularly for the nonfamilial forms of heart failure, is still poorly understood. In the past decade, systems-level approaches based on omics technologies have become an important approach for the study of complex traits in large populations. These advances create opportunities to integrate genetic variation with other biological layers to identify and prioritize candidate genes, understand pathogenic pathways, and elucidate gene-gene and gene-environment interactions. In this review, we will highlight some of the recent progress made using systems genetics approaches to uncover novel mechanisms and molecular bases of cardiovascular pathophysiological manifestations. The key technology and data analysis platforms necessary to implement systems genetics will be described, and the current major challenges and future directions will also be discussed. For complex cardiovascular diseases, such as heart failure, systems genetics represents a powerful strategy to obtain mechanistic insights and to develop individualized diagnostic and therapeutic regiments, paving the way for precision cardiovascular medicine.
Collapse
Affiliation(s)
- Christoph D. Rau
- Departments of Anesthesiology, Medicine, Physiology
- Current address: Department of Genetics, University of North Carolina School of Medicine, Chapel Hill, NC 27599
| | - Aldons J. Lusis
- Department of Human Genetics and Medicine, David Geffen School of Medicine, University of California, Los Angeles, CA 90095
| | - Yibin Wang
- Departments of Anesthesiology, Medicine, Physiology
| |
Collapse
|
23
|
Liu C, Ma Y, Zhao J, Nussinov R, Zhang YC, Cheng F, Zhang ZK. Computational network biology: Data, models, and applications. PHYSICS REPORTS 2020; 846:1-66. [DOI: 10.1016/j.physrep.2019.12.004] [Citation(s) in RCA: 86] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|