1
|
Ullah F, Salam A, Nadeem M, Amin F, AlSalman H, Abrar M, Alfakih T. Extended dipeptide composition framework for accurate identification of anticancer peptides. Sci Rep 2024; 14:17381. [PMID: 39075193 PMCID: PMC11286958 DOI: 10.1038/s41598-024-68475-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Accepted: 07/24/2024] [Indexed: 07/31/2024] Open
Abstract
The identification of anticancer peptides (ACPs) is crucial, especially in the development of peptide-based cancer therapy. The classical models such as Split Amino Acid Composition (SAAC) and Pseudo Amino Acid Composition (PseAAC) lack the incorporation of feature representation. These advancements improve the predictive accuracy and efficiency of ACP identification. Thus, the effort of this research is to propose and develop an advanced framework based on feature extraction. Thus, to achieve this objective herein we propose an Extended Dipeptide Composition (EDPC) framework. The proposed EDPC framework extends the dipeptide composition by considering the local sequence environment information and reforming the CD-HIT framework to remove noise and redundancy. To measure the accuracy, we have performed several experiments. These experiments were employed using four famous machine learning (ML) algorithms named; Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and K Nearest Neighbor (KNN). For comparisons, we have used accuracy, specificity, sensitivity, precision, recall, and F1-Score as evaluation criteria. The reliability of the proposed framework is further evaluated using statistical significance tests. As a result, the proposed EDPC framework exhibited enhanced performance than SAAC and PseAAC, where the SVM model delivered the highest accuracy of 96. 6% and significant enhancements in specificity, sensitivity, precision, and F1-score over multiple datasets. Due to the incorporation of enhanced feature representation and the incorporation of local and global sequence profiles proposed EDPC achieves higher classification performance. The proposed frameworks can deal with noise and also duplicating features. These are accompanied by a wide range of feature representations. Finally, our proposed framework can be used for clinical applications where ACP identification is essential. Future works will include extending to a larger variety of datasets, incorporating tertiary structural information, and using deep learning techniques to improve the proposed EDPC.
Collapse
Affiliation(s)
- Faizan Ullah
- Department of Computer Science, Bacha Khan University, Charsadda, 24420, Pakistan
| | - Abdu Salam
- Department of Computer Science, Abdul Wali Khan University, Mardan, 23200, Pakistan
| | - Muhammad Nadeem
- Department of Computer Science and Software Engineering, International Islamic University, Islamabad, 44000, Pakistan
| | - Farhan Amin
- School of Computer Science and Engineering, Yeungnam University, Gyeongsan, 38541, Korea.
| | - Hussain AlSalman
- Department of Computer Science, College of Computer and Information Sciences, King Saud University, 11543, Riyadh, Saudi Arabia.
| | - Mohammad Abrar
- Faculty of Computer Studies, Arab Open University, Muscat, Oman
| | - Taha Alfakih
- Department of Information Systems, College of Computer and Information Sciences, King Saud University, 11543, Riyadh, Saudi Arabia
| |
Collapse
|
2
|
Azad H, Akbar MY, Sarfraz J, Haider W, Riaz MN, Ali GM, Ghazanfar S. G-ACP: a machine learning approach to the prediction of therapeutic peptides for gastric cancer. J Biomol Struct Dyn 2024:1-14. [PMID: 38450672 DOI: 10.1080/07391102.2024.2323141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 02/15/2024] [Indexed: 03/08/2024]
Abstract
Conventional Gastrointestinal (GI) cancer treatments are quite expensive and have major hazards. Nowadays, a different strategy places more emphasis on creating tiny biologically active peptides that do not cause severe poisoning. Anticancer peptides (ACPs) are found through experimental screening, which is time-dependent and frequently fraught with difficulties. Gastric ACPs are emerging as a promising GI cancer treatment in the current day. It is crucial to identify novel gastric ACPs to have an improved knowledge of their functioning processes and treatment of gastric cancer. As a result of the post-genomic era's massive production of peptide sequences, rapid and effective ACPs using a computational method are essential. Several adaptive statistical techniques for distinguishing ACPs and non-ACPs have recently been developed. A variety of adapted statistically significant methods have been developed to differentiate between ACPs and non-ACPs. Despite significant progress, there is no specific model for the prediction of gastric ACPs because the specific model will predict a particular type of peptide more accurately and quickly. To overcome this, an initiative is taken for the creation of a reliable framework for the accurate identification of gastric ACPs. The current technique in particular contains four possible features along with one hybrid feature encoding mechanisms which are the target-class motif previously indicated by Amino Acid Composition, Dipeptide Composition, Tripeptide Composition (TPC), Pseudo Amino Acid Composition (PAAC), and their Hybrid. Machine Learning algorithms make high-performance and accurate prediction tools. Moreover, highly variable and ideal deep feature selection is done using an ANOVA-based F score for feature pruning. Experiments on a range of algorithms are carried out to identify the optimal operating strategy due to the diverse nature of learning. Following analysis of the empirical results, Naïve Bayes with TPC and Hybrid feature space outperforms other methods with 0.99 accuracy score on the testing dataset. To find the model generalization an external validation is carried out. In external datasets, the Extra Trees with PAAC features outperforms with the accuracy of 0.94. The comparison study shows that our suggested model will predict gastric ACPs more accurately and will be useful in drug development and gastric cancer. The predictive model can be freely accessed at https://github.com/humeraazad10/G-ACP.git.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Humera Azad
- Department of Biosciences (Bioinformatics) Islamabad, Comsats University Islamabad, Pakistan
| | - Muhammad Yasir Akbar
- National Institute for Genomics and Advanced Biotechnology (NIGAB), National Agricultural Research Center (NARC), Pakistan
| | | | - Waseem Haider
- Department of Biosciences (Bioinformatics) Islamabad, Comsats University Islamabad, Pakistan
| | - Muhammad Naeem Riaz
- National Institute for Genomics and Advanced Biotechnology (NIGAB), National Agricultural Research Center (NARC), Pakistan
| | - Ghulam Muhammad Ali
- Department of Biosciences (Bioinformatics) Islamabad, Comsats University Islamabad, Pakistan
| | - Shakira Ghazanfar
- National Institute for Genomics and Advanced Biotechnology (NIGAB), National Agricultural Research Center (NARC), Pakistan
| |
Collapse
|
3
|
La Paglia L, Vazzana M, Mauro M, Urso A, Arizza V, Vizzini A. Bioactive Molecules from the Innate Immunity of Ascidians and Innovative Methods of Drug Discovery: A Computational Approach Based on Artificial Intelligence. Mar Drugs 2023; 22:6. [PMID: 38276644 PMCID: PMC10817596 DOI: 10.3390/md22010006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 12/12/2023] [Accepted: 12/17/2023] [Indexed: 01/27/2024] Open
Abstract
The study of bioactive molecules of marine origin has created an important bridge between biological knowledge and its applications in biotechnology and biomedicine. Current studies in different research fields, such as biomedicine, aim to discover marine molecules characterized by biological activities that can be used to produce potential drugs for human use. In recent decades, increasing attention has been paid to a particular group of marine invertebrates, the Ascidians, as they are a source of bioactive products. We describe omics data and computational methods relevant to identifying the mechanisms and processes of innate immunity underlying the biosynthesis of bioactive molecules, focusing on innovative computational approaches based on Artificial Intelligence. Since there is increasing attention on finding new solutions for a sustainable supply of bioactive compounds, we propose that a possible improvement in the biodiscovery pipeline might also come from the study and utilization of marine invertebrates' innate immunity.
Collapse
Affiliation(s)
- Laura La Paglia
- Istituto di Calcolo e Reti ad Alte Prestazioni–Consiglio Nazionale delle Ricerche, Via Ugo La Malfa 153, 90146 Palermo, Italy; (L.L.P.); (A.U.)
| | - Mirella Vazzana
- Dipartimento di Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche–Università di Palermo, Via Archirafi 18, 90100 Palermo, Italy; (M.V.); (M.M.); (V.A.)
| | - Manuela Mauro
- Dipartimento di Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche–Università di Palermo, Via Archirafi 18, 90100 Palermo, Italy; (M.V.); (M.M.); (V.A.)
| | - Alfonso Urso
- Istituto di Calcolo e Reti ad Alte Prestazioni–Consiglio Nazionale delle Ricerche, Via Ugo La Malfa 153, 90146 Palermo, Italy; (L.L.P.); (A.U.)
| | - Vincenzo Arizza
- Dipartimento di Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche–Università di Palermo, Via Archirafi 18, 90100 Palermo, Italy; (M.V.); (M.M.); (V.A.)
| | - Aiti Vizzini
- Dipartimento di Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche–Università di Palermo, Via Archirafi 18, 90100 Palermo, Italy; (M.V.); (M.M.); (V.A.)
| |
Collapse
|
4
|
Ahmad I, Pal S, Singh R, Ahmad K, Dey N, Srivastava A, Ahmad R, Suliman M, Alshahrani MY, Barkat MA, Siddiqui S. Antimicrobial peptide moricin induces ROS mediated caspase-dependent apoptosis in human triple-negative breast cancer via suppression of notch pathway. Cancer Cell Int 2023; 23:121. [PMID: 37344820 DOI: 10.1186/s12935-023-02958-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 05/26/2023] [Indexed: 06/23/2023] Open
Abstract
BACKGROUND Breast cancer is the world's most prevalent cancer among women. Microorganisms have been the richest source of antibiotics as well as anticancer drugs. Moricin peptides have shown antibacterial properties; however, the anticancer potential and mechanistic insights into moricin peptide-induced cancer cell death have not yet been explored. METHODS An investigation through in silico analysis, analytical methods (Reverse Phase-High Performance Liquid Chromatography (RP-HPLC), mass spectroscopy (MS), circular dichroism (CD), and in vitro studies, has been carried out to delineate the mechanism(s) of moricin-induced cancer cell death. An in-silico analysis was performed to predict the anticancer potential of moricin in cancer cells using Anti CP and ACP servers based on a support vector machine (SVM). Molecular docking was performed to predict the binding interaction between moricin and peptide-related cancer signaling pathway(s) through the HawkDOCK web server. Further, in vitro anticancer activity of moricin was performed against MDA-MB-231 cells. RESULTS In silico observation revealed that moricin is a potential anticancer peptide, and protein-protein docking showed a strong binding interaction between moricin and signaling proteins. CD showed a predominant helical structure of moricin, and the MS result determined the observed molecular weight of moricin is 4544 Da. An in vitro study showed that moricin exposure to MDA-MB-231 cells caused dose dependent inhibition of cell viability with a high generation of reactive oxygen species (ROS). Molecular study revealed that moricin exposure caused downregulation in the expression of Notch-1, NF-ƙB and Bcl2 proteins while upregulating p53, Bax, caspase 3, and caspase 9, which results in caspase-dependent cell death in MDA-MB-231 cells. CONCLUSIONS In conclusion, this study reveals the anticancer potential and underlying mechanism of moricin peptide-induced cell death in triple negative cancer cells, which could be used in the development of an anticancer drug.
Collapse
Affiliation(s)
- Imran Ahmad
- Department of Biochemistry, King George's Medical University, Lucknow, 226003, India.
| | - Saurabh Pal
- Department of Biotechnology, Era's Lucknow Medical College & Hospital, Era University, Lucknow, 226003, India
| | - Ranjana Singh
- Department of Biochemistry, King George's Medical University, Lucknow, 226003, India.
| | - Khursheed Ahmad
- Department of Biotechnology, Era's Lucknow Medical College & Hospital, Era University, Lucknow, 226003, India
| | - Nilanjan Dey
- Department of Chemistry, BITS- Pilani Hyderabad Campus, Hyderabad, 500078, Telangana, India
| | - Aditi Srivastava
- Department of Biochemistry, Era's Lucknow Medical College & Hospital, Era University, Lucknow, 226003, India
| | - Rumana Ahmad
- Department of Biochemistry, Era's Lucknow Medical College & Hospital, Era University, Lucknow, 226003, India
| | - Muath Suliman
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, King Khalid University, Abha, Saudi Arabia
| | - Mohammad Y Alshahrani
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, King Khalid University, Abha, Saudi Arabia
| | - Md Abul Barkat
- Department of Pharmaceutics, College of Pharmacy, University of Hafr Al-Batin, Al Jamiah, Hafr Al Batin, 39524, Saudi Arabia
| | - Sahabjada Siddiqui
- Department of Biotechnology, Era's Lucknow Medical College & Hospital, Era University, Lucknow, 226003, India.
| |
Collapse
|
5
|
Li Y, Ma D, Chen D, Chen Y. ACP-GBDT: An improved anticancer peptide identification method with gradient boosting decision tree. Front Genet 2023; 14:1165765. [PMID: 37065496 PMCID: PMC10090421 DOI: 10.3389/fgene.2023.1165765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 03/09/2023] [Indexed: 03/31/2023] Open
Abstract
Cancer is one of the most dangerous diseases in the world, killing millions of people every year. Drugs composed of anticancer peptides have been used to treat cancer with low side effects in recent years. Therefore, identifying anticancer peptides has become a focus of research. In this study, an improved anticancer peptide predictor named ACP-GBDT, based on gradient boosting decision tree (GBDT) and sequence information, is proposed. To encode the peptide sequences included in the anticancer peptide dataset, ACP-GBDT uses a merged-feature composed of AAIndex and SVMProt-188D. A GBDT is adopted to train the prediction model in ACP-GBDT. Independent testing and ten-fold cross-validation show that ACP-GBDT can effectively distinguish anticancer peptides from non-anticancer ones. The comparison results of the benchmark dataset show that ACP-GBDT is simpler and more effective than other existing anticancer peptide prediction methods.
Collapse
Affiliation(s)
- Yanjuan Li
- College of Electrical and Information Engineering, Quzhou University, Quzhou, China
| | - Di Ma
- College of Computer, Hangzhou Dianzi University, Hangzhou, China
| | - Dong Chen
- College of Electrical and Information Engineering, Quzhou University, Quzhou, China
- *Correspondence: Dong Chen, ; Yu Chen,
| | - Yu Chen
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
- *Correspondence: Dong Chen, ; Yu Chen,
| |
Collapse
|
6
|
Garai S, Thomas J, Dey P, Das D. LGBM-ACp: an ensemble model for anticancer peptide prediction and in silico screening with potential drug targets. Mol Divers 2023:10.1007/s11030-023-10602-0. [PMID: 36637711 DOI: 10.1007/s11030-023-10602-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 01/06/2023] [Indexed: 01/14/2023]
Abstract
Conventional cancer therapies are highly expensive and have serious complications. An alternative approach now emphasizes on the development of small, biologically active peptides without acute toxicity. Experimental screening to find curative anticancer peptides (ACP) often gives rise to multiple obstacles and is time dependent. Consequently, developing an effective computational technique to identify promising ACP candidates prior to preclinical research is in high demand. This study proposed a machine-learning framework that used the light gradient-boosting machine as a classifier and two compositional and two binary profile features as input. The ensemble model displayed an accuracy, MCC, and AUROC of 97.52%, 0.91, and 0.98, respectively, which outclassed most of the existing sequence-based computational tools. A distinct dataset of non-mutagenic, non-toxic, and non-inhibitory Cytochrome P-450 peptides was used to validate the hybrid model. The most relevant ACP in the alternative dataset was compared with two standard ACPs, beta defensin 2, and cecropin-A. Molecular docking of the predicted peptide revealed that it has a strong binding affinity with twenty-five anticancer drug targets, most notably phosphoenolpyruvate carboxykinase (- 7.2 kcal/mol). Additionally, molecular dynamics simulation and principal component analysis supported the stability of the peptide-receptor complex. Overall, the present findings will take a step forward in rational drug design through rapid identification and screening of therapeutic peptides.
Collapse
Affiliation(s)
- Swarnava Garai
- Department of Bioengineering, NIT Agartala, Tripura, 799046, India
| | - Juanit Thomas
- Department of Bioengineering, NIT Agartala, Tripura, 799046, India
| | - Palash Dey
- Civil Engineering Department, The ICFAI University, Tripura, 799210, India
| | - Deeplina Das
- Department of Bioengineering, NIT Agartala, Tripura, 799046, India.
| |
Collapse
|
7
|
In Silico Discovery of Anticancer Peptides from Sanghuang. Int J Mol Sci 2022; 23:ijms232213682. [PMID: 36430160 PMCID: PMC9693127 DOI: 10.3390/ijms232213682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 10/24/2022] [Accepted: 11/04/2022] [Indexed: 11/10/2022] Open
Abstract
Anticancer peptide (ACP) is a short peptide with less than 50 amino acids that has been discovered in a variety of foods. It has been demonstrated that traditional Chinese medicine or food can help treat cancer in some cases, which suggests that ACP may be one of the therapeutic ingredients. Studies on the anti-cancer properties of Sanghuangporus sanghuang have concentrated on polysaccharides, flavonoids, triterpenoids, etc. The function of peptides has not received much attention. The purpose of this study is to use computer mining techniques to search for potential anticancer peptides from 62 proteins of Sanghuang. We used mACPpred to perform sequence scans after theoretical trypsin hydrolysis and discovered nine fragments with an anticancer probability of over 0.60. The study used AlphaFold 2 to perform structural modeling of the first three ACPs discovered, which had blast results from the Cancer PPD database. Using reverse docking technology, we found the target proteins and interacting residues of two ACPs with an unknown mechanism. Reverse docking results predicted the binding modes of the ACPs and their target protein. In addition, we determined the active part of ACPs by quantum chemical calculation. Our study provides a framework for the future discovery of functional peptides from foods. The ACPs discovered have the potential to be used as drugs in oncology clinical treatment after further research.
Collapse
|