1
|
Xie K, Hou Y, Zhou X. Deep centroid: a general deep cascade classifier for biomedical omics data classification. Bioinformatics 2024; 40:btae039. [PMID: 38305432 PMCID: PMC10868341 DOI: 10.1093/bioinformatics/btae039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 01/13/2024] [Accepted: 01/30/2024] [Indexed: 02/03/2024] Open
Abstract
MOTIVATION Classification of samples using biomedical omics data is a widely used method in biomedical research. However, these datasets often possess challenging characteristics, including high dimensionality, limited sample sizes, and inherent biases across diverse sources. These factors limit the performance of traditional machine learning models, particularly when applied to independent datasets. RESULTS To address these challenges, we propose a novel classifier, Deep Centroid, which combines the stability of the nearest centroid classifier and the strong fitting ability of the deep cascade strategy. Deep Centroid is an ensemble learning method with a multi-layer cascade structure, consisting of feature scanning and cascade learning stages that can dynamically adjust the training scale. We apply Deep Centroid to three precision medicine applications-cancer early diagnosis, cancer prognosis, and drug sensitivity prediction-using cell-free DNA fragmentations, gene expression profiles, and DNA methylation data. Experimental results demonstrate that Deep Centroid outperforms six traditional machine learning models in all three applications, showcasing its potential in biological omics data classification. Furthermore, functional annotations reveal that the features scanned by the model exhibit biological significance, indicating its interpretability from a biological perspective. Our findings underscore the promising application of Deep Centroid in the classification of biomedical omics data, particularly in the field of precision medicine. AVAILABILITY AND IMPLEMENTATION Deep Centroid is available at both github (github.com/xiexiexiekuan/DeepCentroid) and Figshare (https://figshare.com/articles/software/Deep_Centroid_A_General_Deep_Cascade_Classifier_for_Biomedical_Omics_Data_Classification/24993516).
Collapse
Affiliation(s)
- Kuan Xie
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, People’s Republic of China
| | - Yuying Hou
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, People’s Republic of China
| | - Xionghui Zhou
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, People’s Republic of China
- Key Laboratory of Smart Farming for Agricultural Animals, Huazhong Agricultural University, Wuhan 430070, People’s Republic of China
| |
Collapse
|
2
|
Gong L, Zhang R, Han M, Hu QN. CCIBP: a comprehensive cosmetic ingredients bioinformatics platform. Bioinformatics 2023; 39:btad416. [PMID: 37399096 PMCID: PMC10345691 DOI: 10.1093/bioinformatics/btad416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 05/30/2023] [Accepted: 07/03/2023] [Indexed: 07/05/2023] Open
Abstract
SUMMARY Cosmetics form an important part of our daily lives, and it is therefore important to understand the basic physicochemical properties, metabolic pathways, and toxicological and safe concentrations of these cosmetics molecules. Therefore, comprehensive cosmetic ingredients bioinformatics platform (CCIBP) was developed here, which is a unique comprehensive cosmetic database providing information on regulations, physicochemical properties, and human metabolic pathways for cosmetic molecules from major regions of the world, whilst also correlating plant information in natural products. CCIBP supports formulation analysis, efficacy component analysis, and also combines knowledge of synthetic biology to facilitate access to natural molecules and biosynthetic production. CCIBP, empowered with chemoinformatics, bioinformatics, and synthetic biology data and tools, presents a very helpful platform for cosmetic research and development of ingredients. AVAILABILITY AND IMPLEMENTATION CCIBP is available at: http://design.rxnfinder.org/cosing/.
Collapse
Affiliation(s)
- Linlin Gong
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, P.R. China
| | - Rui Zhang
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, P.R. China
| | - Mengying Han
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, P.R. China
| | - Qian-Nan Hu
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, P.R. China
| |
Collapse
|
3
|
Dai X, Shen L. Advances and Trends in Omics Technology Development. Front Med (Lausanne) 2022; 9:911861. [PMID: 35860739 PMCID: PMC9289742 DOI: 10.3389/fmed.2022.911861] [Citation(s) in RCA: 99] [Impact Index Per Article: 49.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Accepted: 05/09/2022] [Indexed: 12/11/2022] Open
Abstract
The human history has witnessed the rapid development of technologies such as high-throughput sequencing and mass spectrometry that led to the concept of “omics” and methodological advancement in systematically interrogating a cellular system. Yet, the ever-growing types of molecules and regulatory mechanisms being discovered have been persistently transforming our understandings on the cellular machinery. This renders cell omics seemingly, like the universe, expand with no limit and our goal toward the complete harness of the cellular system merely impossible. Therefore, it is imperative to review what has been done and is being done to predict what can be done toward the translation of omics information to disease control with minimal cell perturbation. With a focus on the “four big omics,” i.e., genomics, transcriptomics, proteomics, metabolomics, we delineate hierarchies of these omics together with their epiomics and interactomics, and review technologies developed for interrogation. We predict, among others, redoxomics as an emerging omics layer that views cell decision toward the physiological or pathological state as a fine-tuned redox balance.
Collapse
|
4
|
CoronaPep: An Anti-Coronavirus Peptide Generation Tool. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1299-1304. [PMID: 33687847 PMCID: PMC8769015 DOI: 10.1109/tcbb.2021.3064630] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The novel coronavirus (COVID-19) infections have adopted the shape of a global pandemic now, demanding an urgent vaccine design. The current work reports contriving an anti-coronavirus peptide scanner tool to discern anti-coronavirus targets in the embodiment of peptides. The proffered CoronaPep tool features the fast fingerprinting of the anti-coronavirus target serving supreme prominence in the current bioinformatics research. The anti-coronavirus target protein sequences reported from the current outbreak are scanned against the anti-coronavirus target data-sets via CORONAPEP which provides precision-based anti-coronavirus peptides. This tool is specifically for the coronavirus data, which can predict peptides from the whole genome, or a gene or protein's list. Besides it is relatively fast, accurate, userfriendly and can generate maximum output from the limited information. The availability of tools like CORONAPEP will immeasurably perquisite researchers in the discipline of oncology and structure-based drug design.
Collapse
|
5
|
Khan T, Khan A, Nasir SN, Ahmad S, Ali SS, Wei DQ. CytomegaloVirusDb: Multi-omics knowledge database for cytomegaloviruses. Comput Biol Med 2021; 135:104563. [PMID: 34256256 DOI: 10.1016/j.compbiomed.2021.104563] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 06/06/2021] [Accepted: 06/06/2021] [Indexed: 11/16/2022]
Abstract
Cytomegalovirus infection is a significant health concern and need further exploration in immunologic response mechanisms during primary and reactivated CMV infection. In this work, we evaluated the whole genomes and proteomes of different CMV species and developed an integrated open-access platform, CytomegaloVirusDb, a multi-Omics knowledge database for Cytomegaloviruses. The resource is categorized into the main sections "Genomics," "Proteomics," "Immune response," and "Therapeutics,". The database is annotated with the list of all CMV species included in the study, and available information is freely accessible at http://www.cmvdb.dqweilab-sjtu.com/index.php. Various parameters used in the analysis for each section were primarily based on the whole genome or proteome of each specie. The platform provided datasets are open to access for researchers to obtain CMV species-specific information. This will help further to explore the dynamics of CMV-specific immune response and therapeutics. This platform is a useful resource to aid in advancing research against Cytomegaloviruses.
Collapse
Affiliation(s)
- Taimoor Khan
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China
| | - Abbas Khan
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China
| | - Syed Nouman Nasir
- National Center for Bioinformatics, Quaid-i-Azam University, Islamabad, Pakistan
| | - Sajjad Ahmad
- Department of Health and Biological Sciences, Abasyn University, Peshawar, 25000, Pakistan
| | - Syed Shujait Ali
- Center for Biotechnology and Microbiology, University of Swat, Swat, KP, Pakistan
| | - Dong-Qing Wei
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China; State Key Laboratory of Microbial Metabolism, Shanghai-Islamabad-Belgrade Joint Innovation Center on Antibacterial Resistances, Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200030, PR China; Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nashan District, Shenzhen, Guangdong, 518055, PR China.
| |
Collapse
|
6
|
Mehmood A, Kaushik AC, Wang Q, Li CD, Wei DQ. Bringing Structural Implications and Deep Learning-Based Drug Identification for KRAS Mutants. J Chem Inf Model 2021; 61:571-586. [PMID: 33513018 DOI: 10.1021/acs.jcim.0c00488] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Colorectal cancer is considered one of the leading causes of death that is linked with the Kirsten Rat Sarcoma (KRAS) harboring codons 13 and 61 mutations. The objective for this study is to search for clinically important codon 61 mutations and analyze how they affect the protein structural dynamics. Additionally, a deep-learning approach is used to carry out a similarity search for potential compounds that might have a comparatively better affinity. Public databases like The Cancer Genome Atlas and Genomic Data Commons were accessed for obtaining the data regarding mutations that are associated with colon cancer. Multiple analysis such as genomic alteration landscape, survival analysis, and systems biology-based kinetic simulations were carried out to predict dynamic changes for the selected mutations. Additionally, a molecular dynamics simulation of 100 ns for all the seven shortlisted codon 61 mutations have been conducted, which revealed noticeable deviations. Finally, the deep learning-based predicted compounds were docked with the KRAS 3D conformer, showing better affinity and good docking scores as compared to the already existing drugs. Taking together the outcomes of systems biology and molecular dynamics, it is observed that the reported mutations in the SII region are highly detrimental as they have an immense impact on the protein sensitive sites' native conformation and overall stability. The drugs reported in this study show increased performance and are encouraged to be used for further evaluation regarding the situation that ascends as a result of KRAS mutations.
Collapse
Affiliation(s)
- Aamir Mehmood
- Department of Bioinformatics and Biostatistics, State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Minhang District, Shanghai 200240, China.,Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nanshan District, Shenzhen, Guangdong, 518055, P. R. China
| | - Aman Chandra Kaushik
- Wuxi School of Medicine, Jiangnan University, Li Lake Avenue, Wuxi, Jiangsu 214122, China
| | - Qiankun Wang
- Department of Bioinformatics and Biostatistics, State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Minhang District, Shanghai 200240, China
| | - Cheng-Dong Li
- Department of Bioinformatics and Biostatistics, State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Minhang District, Shanghai 200240, China
| | - Dong-Qing Wei
- Department of Bioinformatics and Biostatistics, State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Minhang District, Shanghai 200240, China.,Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nanshan District, Shenzhen, Guangdong, 518055, P. R. China
| |
Collapse
|
7
|
Yu C, Qi X, Lin Y, Li Y, Shen B. iODA: An integrated tool for analysis of cancer pathway consistency from heterogeneous multi-omics data. J Biomed Inform 2020; 112:103605. [PMID: 33096244 DOI: 10.1016/j.jbi.2020.103605] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2020] [Revised: 09/07/2020] [Accepted: 10/15/2020] [Indexed: 02/05/2023]
Abstract
The latest advances in the next generation sequencing technology have greatly facilitated the extensive research of genomics and transcriptomics, thereby promoting the decoding of carcinogenesis with unprecedented resolution. Considering the contribution of analyzing high-throughput multi-omics data to the exploration of cancer molecular mechanisms, an integrated tool for heterogeneous multi-omics data analysis (iODA) is proposed for the systems-level interpretation of multi-omics data, i.e., transcriptomic profiles (mRNA or miRNA expression data) and protein-DNA interactions (ChIP-Seq data). Considering the data heterogeneity, iODA can compare six statistical algorithms in differential analysis for the selected sample data and assist users in choosing the globally optimal one for dysfunctional mRNA or miRNA identification. Since molecular signatures are more consistent at the pathway level than at the gene level, the tool is able to enrich the identified dysfunctional molecules onto the KEGG pathways and extracted the consistent items as key components for further pathogenesis investigation. Compared with other tools, iODA is multi-functional for the systematic analysis of different level of omics data, and its analytical power was demonstrated through case studies of single and cross-level prostate cancer omics data. iODA is open source under GNU GPL and can be downloaded from http://www.sysbio.org.cn/iODA.
Collapse
Affiliation(s)
- Chunjiang Yu
- School of Biotechnology, Suzhou Industrial Park Institute of Service Outsourcing, Suzhou, China; Center for Systems Biology, Soochow University, Suzhou, China
| | - Xin Qi
- School of Chemistry, Biology and Material Engineering, Suzhou University of Science and Technology, Suzhou, China; Center for Systems Biology, Soochow University, Suzhou, China
| | - Yuxin Lin
- Center for Systems Biology, Soochow University, Suzhou, China
| | - Yin Li
- Center for Systems Biology, Soochow University, Suzhou, China
| | - Bairong Shen
- Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China.
| |
Collapse
|
8
|
Abdurakhmanova ER, Brusnakov MY, Golovchenko OV, Pilyo SG, Velychko NV, Harden EA, Prichard MN, James SH, Zhirnov VV, Brovarets VS. Synthesis and in vitro anticytomegalovirus activity of 5-hydroxyalkylamino-1,3-oxazoles derivatives. Med Chem Res 2020. [DOI: 10.1007/s00044-020-02593-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
9
|
Robson B. COVID-19 Coronavirus spike protein analysis for synthetic vaccines, a peptidomimetic antagonist, and therapeutic drugs, and analysis of a proposed achilles' heel conserved region to minimize probability of escape mutations and drug resistance. Comput Biol Med 2020; 121:103749. [PMID: 32568687 PMCID: PMC7151553 DOI: 10.1016/j.compbiomed.2020.103749] [Citation(s) in RCA: 91] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2020] [Revised: 04/03/2020] [Accepted: 04/03/2020] [Indexed: 12/17/2022]
Abstract
This paper continues a recent study of the spike protein sequence of the COVID-19 virus (SARS-CoV-2). It is also in part an introductory review to relevant computational techniques for tackling viral threats, using COVID-19 as an example. Q-UEL tools for facilitating access to knowledge and bioinformatics tools were again used for efficiency, but the focus in this paper is even more on the virus. Subsequence KRSFIEDLLFNKV of the S2′ spike glycoprotein proteolytic cleavage site continues to appear important. Here it is shown to be recognizable in the common cold coronaviruses, avian coronaviruses and possibly as traces in the nidoviruses of reptiles and fish. Its function or functions thus seem important to the coronaviruses. It might represent SARS-CoV-2 Achilles’ heel, less likely to acquire resistance by mutation, as has happened in some early SARS vaccine studies discussed in the previous paper. Preliminary conformational analysis of the receptor (ACE2) binding site of the spike protein is carried out suggesting that while it is somewhat conserved, it appears to be more variable than KRSFIEDLLFNKV. However compounds like emodin that inhibit SARS entry, apparently by binding ACE2, might also have functions at several different human protein binding sites. The enzyme 11β-hydroxysteroid dehydrogenase type 1 is again argued to be a convenient model pharmacophore perhaps representing an ensemble of targets, and it is noted that it occurs both in lung and alimentary tract. Perhaps it benefits the virus to block an inflammatory response by inhibiting the dehydrogenase, but a fairly complex web involves several possible targets. This paper “drills down” into the studies of the author's previous COVID-19 paper. Designing vaccine and drugs must seek to avoid escape mutations. Subsequence KRSFIEDLLFNKV seems recognizable across many coronaviruses. The ACE2 binding domain is a target, but shows variation. A steroid dehydrogenase is argued to remain an interesting model pharmacophore.
Collapse
Affiliation(s)
- B Robson
- Ingine Inc. Cleveland Ohio USA, The Dirac Foundation, Oxfordshire, UK.
| |
Collapse
|
10
|
Robson B. Computers and viral diseases. Preliminary bioinformatics studies on the design of a synthetic vaccine and a preventative peptidomimetic antagonist against the SARS-CoV-2 (2019-nCoV, COVID-19) coronavirus. Comput Biol Med 2020; 119:103670. [PMID: 32209231 PMCID: PMC7094376 DOI: 10.1016/j.compbiomed.2020.103670] [Citation(s) in RCA: 126] [Impact Index Per Article: 31.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2020] [Revised: 02/17/2020] [Accepted: 02/17/2020] [Indexed: 12/19/2022]
Abstract
This paper concerns study of the genome of the Wuhan Seafood Market isolate believed to represent the causative agent of the disease COVID-19. This is to find a short section or sections of viral protein sequence suitable for preliminary design proposal for a peptide synthetic vaccine and a peptidomimetic therapeutic, and to explore some design possibilities. The project was originally directed towards a use case for the Q-UEL language and its implementation in a knowledge management and automated inference system for medicine called the BioIngine, but focus here remains mostly on the virus itself. However, using Q-UEL systems to access relevant and emerging literature, and to interact with standard publically available bioinformatics tools on the Internet, did help quickly identify sequences of amino acids that are well conserved across many coronaviruses including 2019-nCoV. KRSFIEDLLFNKV was found to be particularly well conserved in this study and corresponds to the region around one of the known cleavage sites of the SARS virus that are believed to be required for virus activation for cell entry. This sequence motif and surrounding variations formed the basis for proposing a specific synthetic vaccine epitope and peptidomimetic agent. The work can, nonetheless, be described in traditional bioinformatics terms, and readily reproduced by others, albeit with the caveat that new data and research into 2019-nCoV is emerging and evolving at an explosive pace. Preliminary studies using molecular modeling and docking, and in that context the potential value of certain known herbal extracts, are also described. Bioinformatics studies are carried out on the COVID-19 virus. A sequence motif KRSFIEDLLFNKV is of particular interest. Based on the above, synthetic peptides are designed. Preliminary considerations are also given to non-peptide organic molecules.
Collapse
Affiliation(s)
- B Robson
- Ingine Inc., Cleveland, Ohio, USA; The Dirac Foundation, Oxfordshire, UK.
| |
Collapse
|
11
|
Kaushik AC, Mehmood A, Khan MT, Kumar A, Dai X, Wei DQ. RETRACTED ARTICLE: Protein blueprint and their interactions while approachability struggle for amino acids. J Biomol Struct Dyn 2020; 39:i-ix. [PMID: 31914855 DOI: 10.1080/07391102.2020.1713894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
| | - Aamir Mehmood
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Muhammad Tahir Khan
- Department of Bioinformatics and Biosciences, Capital University of Science and Technology, Islamabad, Pakistan
| | - Ajay Kumar
- Institute of Biomedical Sciences, National Sun Yat-Sen University, Kaohsiung City, Taiwan
| | - Xiaofeng Dai
- Wuxi School of Medicine, Jiangnan University, Wuxi, China
| | - Dong-Qing Wei
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|
12
|
Kaushik AC, Mehmood A, Peng S, Zhang YJ, Dai X, Wei DQ. A-CaMP: a tool for anti-cancer and antimicrobial peptide generation. J Biomol Struct Dyn 2020; 39:285-293. [PMID: 31870207 DOI: 10.1080/07391102.2019.1708796] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Anti-cancer peptides (ACPs) play a vital role in the cell signaling process. Antimicrobial peptides (AMPs) provide immunity against pathogenic microbes, AMPs present activity against pathogenic microbes. Some of them are known to possess both anticancer and antimicrobial activity. However, so far, no tools have been developed that could predict potential ACPs from wild and mutated cancerous protein sequences in the numerous public databases. In the present study, we developed a A-CaMP tool that allows rapid fingerprinting of the anti-cancer and antimicrobial peptides, which play a crucial role in current bioinformatics research. Besides, we compared the performance and functionality of our A-CaMP tool with those of other methods available online. A-CaMP scans the target protein sequences provided by the user against the datasets. It possesses a robust coding architecture, has been developed in PERL language and is scalable of therefore has extensive applications in bioinformatics. It was observed to achieve a prediction accuracy of 93.4%, which is much higher than that of any of the existing tools. Sequence alignment studies also highlight the potential use of A-CaMP as a tool for the identification of AMPs. A-CaMP is the first open source tool that uses clinical data and proposes final peptides along with the necessary information; this includes wild and mutant sequence and peptides, which lays the foundation for its application in therapies for cancer and bacterial infections. Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Aman Chandra Kaushik
- Wuxi School of Medicine, Jiangnan University, Wuxi, China.,State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Aamir Mehmood
- State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Shaoliang Peng
- School of Computer Science, National University of Defense Technology, Changsha, China
| | - Yu-Juan Zhang
- College of Life Science, Chongqing Normal University, Chongqing, China
| | - Xiaofeng Dai
- Wuxi School of Medicine, Jiangnan University, Wuxi, China
| | - Dong-Qing Wei
- State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|