1
|
Elshewey AM, Shams MY, Tawfeek SM, Alharbi AH, Ibrahim A, Abdelhamid AA, Eid MM, Khodadadi N, Abualigah L, Khafaga DS, Tarek Z. Optimizing HCV Disease Prediction in Egypt: The hyOPTGB Framework. Diagnostics (Basel) 2023; 13:3439. [PMID: 37998575 PMCID: PMC10670002 DOI: 10.3390/diagnostics13223439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 11/04/2023] [Accepted: 11/08/2023] [Indexed: 11/25/2023] Open
Abstract
The paper focuses on the hepatitis C virus (HCV) infection in Egypt, which has one of the highest rates of HCV in the world. The high prevalence is linked to several factors, including the use of injection drugs, poor sterilization practices in medical facilities, and low public awareness. This paper introduces a hyOPTGB model, which employs an optimized gradient boosting (GB) classifier to predict HCV disease in Egypt. The model's accuracy is enhanced by optimizing hyperparameters with the OPTUNA framework. Min-Max normalization is used as a preprocessing step for scaling the dataset values and using the forward selection (FS) wrapped method to identify essential features. The dataset used in the study contains 1385 instances and 29 features and is available at the UCI machine learning repository. The authors compare the performance of five machine learning models, including decision tree (DT), support vector machine (SVM), dummy classifier (DC), ridge classifier (RC), and bagging classifier (BC), with the hyOPTGB model. The system's efficacy is assessed using various metrics, including accuracy, recall, precision, and F1-score. The hyOPTGB model outperformed the other machine learning models, achieving a 95.3% accuracy rate. The authors also compared the hyOPTGB model against other models proposed by authors who used the same dataset.
Collapse
Affiliation(s)
- Ahmed M. Elshewey
- Computer Science Department, Faculty of Computers and Information, Suez University, Suez 43533, Egypt
| | - Mahmoud Y. Shams
- Faculty of Artificial Intelligence, Kafrelsheikh University, Kafrelsheikh 33516, Egypt
| | - Sayed M. Tawfeek
- Department of Communications and Electronics, Delta Higher Institute of Engineering and Technology, Mansoura 35111, Egypt
| | - Amal H. Alharbi
- Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | - Abdelhameed Ibrahim
- Computer Engineering and Control Systems Department, Faculty of Engineering, Mansoura University, Mansoura 35516, Egypt
| | - Abdelaziz A. Abdelhamid
- Department of Computer Science, Faculty of Computer and Information Sciences, Ain Shams University, Cairo 11566, Egypt
- Department of Computer Science, College of Computing and Information Technology, Shaqra University, Shaqra 11961, Saudi Arabia
| | - Marwa M. Eid
- Department of Communications and Electronics, Delta Higher Institute of Engineering and Technology, Mansoura 35111, Egypt
- Faculty of Artificial Intelligence, Delta University for Science and Technology, Mansoura 35712, Egypt
| | - Nima Khodadadi
- Department of Civil and Architectural Engineering, University of Miami, Coral Gables, FL 33146, USA;
| | - Laith Abualigah
- Computer Science Department, Prince Hussein Bin Abdullah Faculty for Information Technology, Al al-Bayt University, Mafraq 25113, Jordan
- Department of Electrical and Computer Engineering, Lebanese American University, Byblos 13-5053, Lebanon
- Hourani Center for Applied Scientific Research, Al-Ahliyya Amman University, Amman 19328, Jordan
- MEU Research Unit, Middle East University, Amman 11831, Jordan
- Applied Science Research Center, Applied Science Private University, Amman 11931, Jordan
- School of Computer Sciences, Universiti Sains Malaysia, Gelugor 11800, Malaysia
- School of Engineering and Technology, Sunway University Malaysia, Petaling Jaya 27500, Malaysia
| | - Doaa Sami Khafaga
- Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | - Zahraa Tarek
- Computer Science Department, Faculty of Computers and Information, Mansoura University, Mansoura 35561, Egypt
| |
Collapse
|
2
|
Wang S, Gao S, Ye W, Li Y, Luan J, Lv X. The emerging importance role of m6A modification in liver disease. Biomed Pharmacother 2023; 162:114669. [PMID: 37037093 DOI: 10.1016/j.biopha.2023.114669] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 03/29/2023] [Accepted: 04/06/2023] [Indexed: 04/12/2023] Open
Abstract
N6-methyladenosine (m6A) modification, as one of the most common types of inner RNA modification in eukaryotes, plays a multifunctional role in normal and abnormal biological processes. This type of modification is modulated by m6A writer, eraser and reader, which in turn impact various processes of RNA metabolism, such as RNA processing, translation, nuclear export, localization and decay. The current academic view holds that m6A modification exerts a crucial role in the post-transcriptional modulation of gene expression, and is involved in multiple cellular functions, developmental and disease processes. However, the potential molecular mechanism and specific role of m6A modification in the development of liver disease have not been fully elucidated. In our review, we summarized the latest research progress on m6A modification in liver disease, and explored how these novel findings reshape our knowledge of m6A modulation of RNA metabolism. In addition, we also illustrated the effect of m6A on liver development and regeneration to prompt further exploration of the mechanism and role of m6A modification in liver physiology and pathology, providing new insights and references for the search of potential therapeutic targets for liver disease.
Collapse
Affiliation(s)
- Sheng Wang
- Department of Pharmacy, The First Affiliated Hospital of Wannan Medical College (Yijishan Hospital of Wannan Medical College), Wuhu, Anhui Province, China; The Key Laboratory of Anti-inflammatory and Immune medicines, Ministry of Education, Anhui Province Key Laboratory of Major Autoimmune Diseases, School of Pharmacy, Institute for Liver Disease of Anhui Medical University, Hefei, Anhui Province, China
| | - Songsen Gao
- Department of Orthopedics (Spinal Surgery), The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui Province, China
| | - Wufei Ye
- Department of Pharmacy, The First Affiliated Hospital of Wannan Medical College (Yijishan Hospital of Wannan Medical College), Wuhu, Anhui Province, China
| | - Yueran Li
- Department of Pharmacy, The First Affiliated Hospital of Wannan Medical College (Yijishan Hospital of Wannan Medical College), Wuhu, Anhui Province, China
| | - Jiajie Luan
- Department of Pharmacy, The First Affiliated Hospital of Wannan Medical College (Yijishan Hospital of Wannan Medical College), Wuhu, Anhui Province, China
| | - Xiongwen Lv
- The Key Laboratory of Anti-inflammatory and Immune medicines, Ministry of Education, Anhui Province Key Laboratory of Major Autoimmune Diseases, School of Pharmacy, Institute for Liver Disease of Anhui Medical University, Hefei, Anhui Province, China.
| |
Collapse
|
3
|
Hasan MAM, Maniruzzaman M, Shin J. Differentially expressed discriminative genes and significant meta-hub genes based key genes identification for hepatocellular carcinoma using statistical machine learning. Sci Rep 2023; 13:3771. [PMID: 36882493 PMCID: PMC9992474 DOI: 10.1038/s41598-023-30851-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 03/02/2023] [Indexed: 03/09/2023] Open
Abstract
Hepatocellular carcinoma (HCC) is the most common lethal malignancy of the liver worldwide. Thus, it is important to dig the key genes for uncovering the molecular mechanisms and to improve diagnostic and therapeutic options for HCC. This study aimed to encompass a set of statistical and machine learning computational approaches for identifying the key candidate genes for HCC. Three microarray datasets were used in this work, which were downloaded from the Gene Expression Omnibus Database. At first, normalization and differentially expressed genes (DEGs) identification were performed using limma for each dataset. Then, support vector machine (SVM) was implemented to determine the differentially expressed discriminative genes (DEDGs) from DEGs of each dataset and select overlapping DEDGs genes among identified three sets of DEDGs. Enrichment analysis was performed on common DEDGs using DAVID. A protein-protein interaction (PPI) network was constructed using STRING and the central hub genes were identified depending on the degree, maximum neighborhood component (MNC), maximal clique centrality (MCC), centralities of closeness, and betweenness criteria using CytoHubba. Simultaneously, significant modules were selected using MCODE scores and identified their associated genes from the PPI networks. Moreover, metadata were created by listing all hub genes from previous studies and identified significant meta-hub genes whose occurrence frequency was greater than 3 among previous studies. Finally, six key candidate genes (TOP2A, CDC20, ASPM, PRC1, NUSAP1, and UBE2C) were determined by intersecting shared genes among central hub genes, hub module genes, and significant meta-hub genes. Two independent test datasets (GSE76427 and TCGA-LIHC) were utilized to validate these key candidate genes using the area under the curve. Moreover, the prognostic potential of these six key candidate genes was also evaluated on the TCGA-LIHC cohort using survival analysis.
Collapse
Affiliation(s)
- Md Al Mehedi Hasan
- School of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu, Fukushima, 965-8580, Japan.,Department of Computer Science and Engineering, Rajshahi University of Engineering & Technology, Rajshahi, 6204, Bangladesh
| | - Md Maniruzzaman
- School of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu, Fukushima, 965-8580, Japan.,Statistics Discipline, Khulna University, Khulna, 9208, Bangladesh
| | - Jungpil Shin
- School of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu, Fukushima, 965-8580, Japan.
| |
Collapse
|
4
|
Wang S, Wu R, Lu J, Jiang Y, Huang T, Cai YD. Protein-protein interaction networks as miners of biological discovery. Proteomics 2022; 22:e2100190. [PMID: 35567424 DOI: 10.1002/pmic.202100190] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2021] [Revised: 03/28/2022] [Accepted: 04/29/2022] [Indexed: 11/12/2022]
Abstract
Protein-protein interactions (PPIs) form the basis of a myriad of biological pathways and mechanism, such as the formation of protein-complexes or the components of signaling cascades. Here, we reviewed experimental methods for identifying PPI pairs, including yeast two-hybrid, mass spectrometry, co-localization, and co-immunoprecipitation. Furthermore, a range of computational methods leveraging biochemical properties, evolution history, protein structures and more have enabled identification of additional PPIs. Given the wealth of known PPIs, we reviewed important network methods to construct and analyze networks of PPIs. These methods aid biological discovery through identifying hub genes and dynamic changes in the network, and have been thoroughly applied in various fields of biological research. Lastly, we discussed the challenges and future direction of research utilizing the power of PPI networks. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Steven Wang
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Runxin Wu
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jiaqi Lu
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN, USA
| | - Yijia Jiang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Tao Huang
- Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, Shanghai, China
| | - Yu-Dong Cai
- School of Life Sciences, Shanghai University, Shanghai, China
| |
Collapse
|
5
|
Cao X, Liang Y, Liu R, Zao X, Zhang J, Chen G, Liu R, Chen H, He Y, Zhang J, Ye Y. Uncovering the Pharmacological Mechanisms of Gexia-Zhuyu Formula (GXZY) in Treating Liver Cirrhosis by an Integrative Pharmacology Strategy. Front Pharmacol 2022; 13:793888. [PMID: 35330838 PMCID: PMC8940433 DOI: 10.3389/fphar.2022.793888] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 01/25/2022] [Indexed: 12/22/2022] Open
Abstract
Liver cirrhosis (LC) is a fibrotic lesion of liver tissue caused by the repeated progression of chronic hepatitis. The traditional Chinese medicine Gexia-Zhuyu formula (GXZY) has a therapeutic effect on LC. However, its pharmacological mechanisms on LC remain elucidated. Here, we used the network pharmacology approach to explore the action mechanisms of GXZY on LC. The compounds of GXZY were from the traditional Chinese medicine systems pharmacology (TCMSP) database, and their potential targets were from SwissTargetPrediction and STITCH databases. The disease targets of LC came from GeneCards, DisGeNET, NCBI gene, and OMIM databases. Then we constructed the protein-protein interaction (PPI) network to obtain the key target genes. And the gene ontology (GO), pathway enrichment, and expression analysis of the key genes were also performed. Subsequently, the potential action mechanisms of GXZY on LC predicted by the network pharmacology analyses were experimentally validated in LC rats and LX2 cells. A total of 150 components in GXZY were obtained, among which 111 were chosen as key compounds. The PPI network included 525 targets, and the key targets were obtained by network topological parameters analysis, whereas the predicted key genes of GXZY on LC were AR, JUN, MYC, CASP3, MMP9, GAPDH, and RELA. Furthermore, these key genes were related to pathways in cancer, hepatitis B, TNF signaling pathway, and MAPK signaling pathway. The in vitro and in vivo experiments validated that GXZY inhibited the process of LC mainly via the regulation of cells proliferation and migration through reducing the expression of MMP9. In conclusion, through the combination of network pharmacology and experimental verification, this study offered more insight molecular mechanisms of GXZY on LC.
Collapse
Affiliation(s)
- Xu Cao
- Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China.,Institute of Liver Diseases, Beijing University of Chinese Medicine, Beijing, China
| | - Yijun Liang
- Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Ruijia Liu
- Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Xiaobin Zao
- Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China.,Key Laboratory of Chinese Internal Medicine of Ministry of Education and Beijing, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Jiaying Zhang
- Ministry of Education Key Laboratory of Bioinformatics, Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing, China
| | - Guang Chen
- Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China.,Institute of Liver Diseases, Beijing University of Chinese Medicine, Beijing, China
| | - Ruijie Liu
- Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Hening Chen
- Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Yannan He
- Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China.,Key Laboratory of Chinese Internal Medicine of Ministry of Education and Beijing, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Jiaxin Zhang
- Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China.,Institute of Liver Diseases, Beijing University of Chinese Medicine, Beijing, China
| | - Yong'an Ye
- Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China.,Institute of Liver Diseases, Beijing University of Chinese Medicine, Beijing, China
| |
Collapse
|
6
|
Zhu Y, Ke KB, Xia ZK, Li HJ, Su R, Dong C, Zhou FM, Wang L, Chen R, Wu SG, Zhao H, Gu P, Leung KS, Wong MH, Lu G, Zhang JY, Jiang BH, Qiu JG, Shi XN, Lin MCM. Discovery of vanoxerine dihydrochloride as a CDK2/4/6 triple-inhibitor for the treatment of human hepatocellular carcinoma. Mol Med 2021; 27:15. [PMID: 33579185 PMCID: PMC7879659 DOI: 10.1186/s10020-021-00269-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Accepted: 01/08/2021] [Indexed: 02/06/2023] Open
Abstract
Background Cyclin-dependent kinases 2/4/6 (CDK2/4/6) play critical roles in cell cycle progression, and their deregulations are hallmarks of hepatocellular carcinoma (HCC). Methods We used the combination of computational and experimental approaches to discover a CDK2/4/6 triple-inhibitor from FDA approved small-molecule drugs for the treatment of HCC. Results We identified vanoxerine dihydrochloride as a new CDK2/4/6 inhibitor, and a strong cytotoxicdrugin human HCC QGY7703 and Huh7 cells (IC50: 3.79 μM for QGY7703and 4.04 μM for Huh7 cells). In QGY7703 and Huh7 cells, vanoxerine dihydrochloride treatment caused G1-arrest, induced apoptosis, and reduced the expressions of CDK2/4/6, cyclin D/E, retinoblastoma protein (Rb), as well as the phosphorylation of CDK2/4/6 and Rb. Drug combination study indicated that vanoxerine dihydrochloride and 5-Fu produced synergistic cytotoxicity in vitro in Huh7 cells. Finally, in vivo study in BALB/C nude mice subcutaneously xenografted with Huh7 cells, vanoxerine dihydrochloride (40 mg/kg, i.p.) injection for 21 days produced significant anti-tumor activity (p < 0.05), which was comparable to that achieved by 5-Fu (10 mg/kg, i.p.), with the combination treatment resulted in synergistic effect. Immunohistochemistry staining of the tumor tissues also revealed significantly reduced expressions of Rb and CDK2/4/6in vanoxerinedihydrochloride treatment group. Conclusions The present study isthe first report identifying a new CDK2/4/6 triple inhibitor vanoxerine dihydrochloride, and demonstrated that this drug represents a novel therapeutic strategy for HCC treatment.
Collapse
Affiliation(s)
- Ying Zhu
- Biomedical Engineering Research Center, Kunming Medical University, Kunming, 650500, Yunnan, China.,Department of Cadre Medical Branch, The 3rd Affiliated Hospital of Kunming Medical University, Kunming, 650118, Yunnan, China
| | - Kun-Bin Ke
- Department of Urology, The 1st Affiliated Hospital of Kunming Medical University, Kunming, 650000, China
| | - Zhong-Kun Xia
- Academy of Medical Science, Zhengzhou University, Zhengzhou, 450000, Henan, China
| | - Hong-Jian Li
- CUHK-SDU Joint Laboratory On Reproductive Genetics, School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong, China
| | - Rong Su
- Department of Geriatric Cardiology, The 1st Affiliated Hospital of Kunming Medical University, Kunming, 650000, China
| | - Chao Dong
- Department of the Second Medical Oncology, The 3rd Affiliated Hospital of Kunming Medical University, Yunnan Tumor Hospital, Kunming, 650000, China
| | - Feng-Mei Zhou
- Academy of Medical Science, Zhengzhou University, Zhengzhou, 450000, Henan, China
| | - Lin Wang
- Academy of Medical Science, Zhengzhou University, Zhengzhou, 450000, Henan, China
| | - Rong Chen
- Department of Physiology, Yunnan University of Chinese Medicine, Kunming, 650504, Yunnan, China
| | - Shi-Guo Wu
- Department of Teaching and Research of Formulas of Chinese Medicine, Yunnan University of Chinese Medicine, Kunming, 650000, Yunnan, China
| | - Hui Zhao
- Department of Urology, The 1st Affiliated Hospital of Kunming Medical University, Kunming, 650000, China
| | - Peng Gu
- Department of Urology, The 1st Affiliated Hospital of Kunming Medical University, Kunming, 650000, China
| | - Kwong-Sak Leung
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, 999077, China
| | - Man-Hon Wong
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, 999077, China
| | - Gang Lu
- CUHK-SDU Joint Laboratory On Reproductive Genetics, School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong, China
| | - Jian-Ying Zhang
- Academy of Medical Science, Zhengzhou University, Zhengzhou, 450000, Henan, China
| | - Bing-Hua Jiang
- Academy of Medical Science, Zhengzhou University, Zhengzhou, 450000, Henan, China
| | - Jian-Ge Qiu
- Academy of Medical Science, Zhengzhou University, Zhengzhou, 450000, Henan, China
| | - Xi-Nan Shi
- Department of Pathology, Yunnan University of Chinese Medicine, Kunming, 650504, Yunnan, China. .,Department ofMedicine, Southwest Guizhou Vocational and Technical College for Nationalities, Xingyi, 562400, Guizhou, China.
| | - Marie Chia-Mi Lin
- Academy of Medical Science, Zhengzhou University, Zhengzhou, 450000, Henan, China.
| |
Collapse
|
7
|
Su ZD, Huang Y, Zhang ZY, Zhao YW, Wang D, Chen W, Chou KC, Lin H. iLoc-lncRNA: predict the subcellular location of lncRNAs by incorporating octamer composition into general PseKNC. Bioinformatics 2019; 34:4196-4204. [PMID: 29931187 DOI: 10.1093/bioinformatics/bty508] [Citation(s) in RCA: 144] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Accepted: 06/19/2018] [Indexed: 12/20/2022] Open
Abstract
Motivation Long non-coding RNAs (lncRNAs) are a class of RNA molecules with more than 200 nucleotides. They have important functions in cell development and metabolism, such as genetic markers, genome rearrangements, chromatin modifications, cell cycle regulation, transcription and translation. Their functions are generally closely related to their localization in the cell. Therefore, knowledge about their subcellular locations can provide very useful clues or preliminary insight into their biological functions. Although biochemical experiments could determine the localization of lncRNAs in a cell, they are both time-consuming and expensive. Therefore, it is highly desirable to develop bioinformatics tools for fast and effective identification of their subcellular locations. Results We developed a sequence-based bioinformatics tool called 'iLoc-lncRNA' to predict the subcellular locations of LncRNAs by incorporating the 8-tuple nucleotide features into the general PseKNC (Pseudo K-tuple Nucleotide Composition) via the binomial distribution approach. Rigorous jackknife tests have shown that the overall accuracy achieved by the new predictor on a stringent benchmark dataset is 86.72%, which is over 20% higher than that by the existing state-of-the-art predictor evaluated on the same tests. Availability and implementation A user-friendly webserver has been established at http://lin-group.cn/server/iLoc-LncRNA, by which users can easily obtain their desired results. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Zhen-Dong Su
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Yan Huang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Zhao-Yue Zhang
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Ya-Wei Zhao
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Dong Wang
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.,College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Wei Chen
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.,Department of Physics, School of Sciences, and Center for Genomics and Computational Biology, North China University of Science and Technology, Tangshan, China.,Gordon Life Science Institute, Boston, MA, USA
| | - Kuo-Chen Chou
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.,Gordon Life Science Institute, Boston, MA, USA
| | - Hao Lin
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.,Gordon Life Science Institute, Boston, MA, USA
| |
Collapse
|
8
|
iACP: a sequence-based tool for identifying anticancer peptides. Oncotarget 2017; 7:16895-909. [PMID: 26942877 PMCID: PMC4941358 DOI: 10.18632/oncotarget.7815] [Citation(s) in RCA: 313] [Impact Index Per Article: 39.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2016] [Accepted: 02/11/2016] [Indexed: 02/07/2023] Open
Abstract
Cancer remains a major killer worldwide. Traditional methods of cancer treatment are expensive and have some deleterious side effects on normal cells. Fortunately, the discovery of anticancer peptides (ACPs) has paved a new way for cancer treatment. With the explosive growth of peptide sequences generated in the post genomic age, it is highly desired to develop computational methods for rapidly and effectively identifying ACPs, so as to speed up their application in treating cancer. Here we report a sequence-based predictor called iACP developed by the approach of optimizing the g-gap dipeptide components. It was demonstrated by rigorous cross-validations that the new predictor remarkably outperformed the existing predictors for the same purpose in both overall accuracy and stability. For the convenience of most experimental scientists, a publicly accessible web-server for iACP has been established at http://lin.uestc.edu.cn/server/iACP, by which users can easily obtain their desired results.
Collapse
|
9
|
Xiao X, Hui M, Liu Z. iAFP-Ense: An Ensemble Classifier for Identifying Antifreeze Protein by Incorporating Grey Model and PSSM into PseAAC. J Membr Biol 2016; 249:845-854. [DOI: 10.1007/s00232-016-9935-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2016] [Accepted: 10/24/2016] [Indexed: 12/12/2022]
|
10
|
Zhu L, Chen X, Kong X, Cai YD. Investigation of the roles of trace elements during hepatitis C virus infection using protein-protein interactions and a shortest path algorithm. Biochim Biophys Acta Gen Subj 2016; 1860:2756-68. [PMID: 27208424 DOI: 10.1016/j.bbagen.2016.05.018] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2016] [Revised: 05/05/2016] [Accepted: 05/13/2016] [Indexed: 12/12/2022]
Abstract
BACKGROUND Hepatitis is a type of infectious disease that induces inflammation of the liver without pinpointing a particular pathogen or pathogenesis. Type C hepatitis, as a type of hepatitis, has been reported to induce cirrhosis and hepatocellular carcinoma within a very short amount of time. It is a great threat to human health. Some studies have revealed that trace elements are associated with infection with and immune rejection against hepatitis C virus (HCV). However, the mechanism underlying this phenomenon is still unclear. METHODS In this study, we aimed to expand our knowledge of this phenomenon by designing a computational method to identify genes that may be related to both HCV and trace element metabolic processes. The searching procedure included three stages. First, a shortest path algorithm was applied to a large network, constructed by protein-protein interactions, to identify potential genes of interest. Second, a permutation test was executed to exclude false discoveries. Finally, some rules based on the betweenness and associations between candidate genes and HCV and trace elements were built to select core genes among the remaining genes. RESULTS 12 lists of genes, corresponding to 12 types of trace elements, were obtained. These genes are deemed to be associated with HCV infection and trace elements metabolism. CONCLUSIONS The analyses indicate that some genes may be related to both HCV and trace element metabolic processes, further confirming the associations between HCV and trace elements. The method was further tested on another set of HCV genes, the results indicate that this method is quite robustness. GENERAL SIGNIFICANCE The newly found genes may partially reveal unknown mechanisms between HCV infection and trace element metabolism. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang.
Collapse
Affiliation(s)
- LiuCun Zhu
- School of Life Sciences, Shanghai University, Shanghai 200444, People's Republic of China
| | - XiJia Chen
- School of Life Sciences, Shanghai University, Shanghai 200444, People's Republic of China
| | - Xiangyin Kong
- The Key Laboratory of Stem Cell Biology, Institute of Health Sciences, Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS), Shanghai 200025, People's Republic of China
| | - Yu-Dong Cai
- School of Life Sciences, Shanghai University, Shanghai 200444, People's Republic of China.
| |
Collapse
|
11
|
Prediction of Golgi-resident protein types using general form of Chou's pseudo-amino acid compositions: Approaches with minimal redundancy maximal relevance feature selection. J Theor Biol 2016; 402:38-44. [PMID: 27155042 DOI: 10.1016/j.jtbi.2016.04.032] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2016] [Revised: 04/19/2016] [Accepted: 04/26/2016] [Indexed: 11/20/2022]
Abstract
Recently, several efforts have been made in predicting Golgi-resident proteins. However, it is still a challenging task to identify the type of a Golgi-resident protein. Precise prediction of the type of a Golgi-resident protein plays a key role in understanding its molecular functions in various biological processes. In this paper, we proposed to use a mutual information based feature selection scheme with the general form Chou's pseudo-amino acid compositions to predict the Golgi-resident protein types. The positional specific physicochemical properties were applied in the Chou's pseudo-amino acid compositions. We achieved 91.24% prediction accuracy in a jackknife test with 49 selected features. It has the best performance among all the present predictors. This result indicates that our computational model can be useful in identifying Golgi-resident protein types.
Collapse
|
12
|
Huang T, Shu Y, Cai YD. Genetic differences among ethnic groups. BMC Genomics 2015; 16:1093. [PMID: 26690364 PMCID: PMC4687076 DOI: 10.1186/s12864-015-2328-0] [Citation(s) in RCA: 90] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2015] [Accepted: 12/15/2015] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Many differences between different ethnic groups have been observed, such as skin color, eye color, height, susceptibility to some diseases, and response to certain drugs. However, the genetic bases of such differences have been under-investigated. Since the HapMap project, large-scale genotype data from Caucasian, African and Asian population samples have been available. The project found that these populations were located in different areas of the PCA (Principal Component Analysis) plot. However, as an unsupervised method, PCA does not measure the differences in each single nucleotide polymorphism (SNP) among populations. RESULTS We applied an advanced mutual information-based feature selection method to detect associations between SNP status and ethnic groups using the latest HapMap Phase 3 release version 3, which included more sub-populations. A total of 299 SNPs were identified, and they can accurately predicted the ethnicity of all HapMap populations. The 10-fold cross validation accuracy of the SMO (sequential minimal optimization) model on training dataset was 0.901, and the accuracy on independent test dataset was 0.895. CONCLUSIONS In-depth functional analysis of these SNPs and their nearby genes revealed the genetic bases of skin and eye color differences among populations.
Collapse
Affiliation(s)
- Tao Huang
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, P. R. China.
| | - Yang Shu
- Sate Key Laboratory of Biotherapy, Sichuan University, Sichuan, 610041, P. R. China.
| | - Yu-Dong Cai
- College of Life Science, Shanghai University, Shanghai, 200444, P. R. China.
| |
Collapse
|
13
|
Cai Z, Zhang G, Zhang X, Liu Y, Fu X. Current insights into computer-aided immunotherapeutic design strategies. Int J Immunopathol Pharmacol 2015; 28:278-85. [PMID: 26091813 DOI: 10.1177/0394632015588765] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Drug designing costs as well as design of immunotherapeutic agents could be nearly halved through the involvement of computer-aided drug designing methods in discovery and research. The inter-disciplinary, time-, and money-consuming process of drug discovery is amended by the development of drug designing, the technique of creating or finding a molecule that can render stimulatory or inhibitory activity upon various biological organisms. Meanwhile, the advancements made within this scientific domain in the last couple of decades have significantly modified and affected the way new bioactive molecules have been produced by the pharmaceutical industry. In this regard, improvements made in hardware solutions and computational techniques along with their efficient integration with biological processes have revolutionized the in silico methods in speeding up the lead identification and optimization processes. In this review, we will discuss various methods of recent computer-aided drug designing techniques that forms the basis of modern day drug discovery projects.
Collapse
Affiliation(s)
- Zhi Cai
- College of Computer and Information Engineering, Heilongjiang University of Science and Technology, Harbin, PR China College of Computer Science and Technology, Harbin Engineering University, Harbin, PR China
| | - Guoyin Zhang
- College of Computer Science and Technology, Harbin Engineering University, Harbin, PR China
| | - Xuejin Zhang
- College of Foreign Language, Heilongjiang University of Science and Technology, Harbin, PR China
| | - Yan Liu
- College of Computer Science and Technology, Harbin Engineering University, Harbin, PR China
| | - Xiaojing Fu
- College of Computer Science and Technology, Harbin Engineering University, Harbin, PR China
| |
Collapse
|
14
|
Castronuovo CC, Cuestas ML, Oubiña JR, Mathet VL. Effect of several PEO-PPO amphiphiles onbax,bcl-2, andhTERTmRNAs: An insight into apoptosis and cell immortalization induced in hepatoma cells by these polymeric excipients. Biotechnol Appl Biochem 2015; 63:273-80. [DOI: 10.1002/bab.1352] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2014] [Accepted: 01/23/2015] [Indexed: 12/12/2022]
Affiliation(s)
- Cynthia Celeste Castronuovo
- Instituto de Investigaciones en Microbiología y Parasitología Médica; UBA-CONICET; Facultad de Medicina; Universidad de Buenos Aires; Ciudad Autónoma de Buenos Aires; Argentina
- CONICET; Ciudad Autónoma de Buenos Aires; Argentina
| | - María Luján Cuestas
- Instituto de Investigaciones en Microbiología y Parasitología Médica; UBA-CONICET; Facultad de Medicina; Universidad de Buenos Aires; Ciudad Autónoma de Buenos Aires; Argentina
- CONICET; Ciudad Autónoma de Buenos Aires; Argentina
| | - José Raúl Oubiña
- Instituto de Investigaciones en Microbiología y Parasitología Médica; UBA-CONICET; Facultad de Medicina; Universidad de Buenos Aires; Ciudad Autónoma de Buenos Aires; Argentina
- CONICET; Ciudad Autónoma de Buenos Aires; Argentina
| | - Verónica Lidia Mathet
- Instituto de Investigaciones en Microbiología y Parasitología Médica; UBA-CONICET; Facultad de Medicina; Universidad de Buenos Aires; Ciudad Autónoma de Buenos Aires; Argentina
- CONICET; Ciudad Autónoma de Buenos Aires; Argentina
| |
Collapse
|
15
|
Liu B, Chen J, Wang X. Protein remote homology detection by combining Chou’s distance-pair pseudo amino acid composition and principal component analysis. Mol Genet Genomics 2015; 290:1919-31. [DOI: 10.1007/s00438-015-1044-4] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2015] [Accepted: 04/06/2015] [Indexed: 02/07/2023]
|
16
|
Khan ZU, Hayat M, Khan MA. Discrimination of acidic and alkaline enzyme using Chou’s pseudo amino acid composition in conjunction with probabilistic neural network model. J Theor Biol 2015; 365:197-203. [DOI: 10.1016/j.jtbi.2014.10.014] [Citation(s) in RCA: 110] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Revised: 09/09/2014] [Accepted: 10/11/2014] [Indexed: 12/11/2022]
|
17
|
Cai Z, Xu D, Zhang Q, Zhang J, Ngai SM, Shao J. Classification of lung cancer using ensemble-based feature selection and machine learning methods. MOLECULAR BIOSYSTEMS 2014; 11:791-800. [PMID: 25512221 DOI: 10.1039/c4mb00659c] [Citation(s) in RCA: 71] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Lung cancer is one of the leading causes of death worldwide. There are three major types of lung cancers, non-small cell lung cancer (NSCLC), small cell lung cancer (SCLC) and carcinoid. NSCLC is further classified into lung adenocarcinoma (LADC), squamous cell lung cancer (SQCLC) as well as large cell lung cancer. Many previous studies demonstrated that DNA methylation has emerged as potential lung cancer-specific biomarkers. However, whether there exists a set of DNA methylation markers simultaneously distinguishing such three types of lung cancers remains elusive. In the present study, ROC (Receiving Operating Curve), RFs (Random Forests) and mRMR (Maximum Relevancy and Minimum Redundancy) were proposed to capture the unbiased, informative as well as compact molecular signatures followed by machine learning methods to classify LADC, SQCLC and SCLC. As a result, a panel of 16 DNA methylation markers exhibits an ideal classification power with an accuracy of 86.54%, 84.6% and a recall 84.37%, 85.5% in the leave-one-out cross-validation (LOOCV) and independent data set test experiments, respectively. Besides, comparison results indicate that ensemble-based feature selection methods outperform individual ones when combined with the incremental feature selection (IFS) strategy in terms of the informative and compact property of features. Taken together, results obtained suggest the effectiveness of the ensemble-based feature selection approach and the possible existence of a common panel of DNA methylation markers among such three types of lung cancer tissue, which would facilitate clinical diagnosis and treatment.
Collapse
Affiliation(s)
- Zhihua Cai
- Affiliated Cancer Hospital of Guangzhou Medical University, Guangzhou, Guangdong Province, China
| | | | | | | | | | | |
Collapse
|
18
|
Xu R, Zhou J, Liu B, He Y, Zou Q, Wang X, Chou KC. Identification of DNA-binding proteins by incorporating evolutionary information into pseudo amino acid composition via the top-n-gram approach. J Biomol Struct Dyn 2014; 33:1720-30. [PMID: 25252709 DOI: 10.1080/07391102.2014.968624] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
DNA-binding proteins are crucial for various cellular processes and hence have become an important target for both basic research and drug development. With the avalanche of protein sequences generated in the postgenomic age, it is highly desired to establish an automated method for rapidly and accurately identifying DNA-binding proteins based on their sequence information alone. Owing to the fact that all biological species have developed beginning from a very limited number of ancestral species, it is important to take into account the evolutionary information in developing such a high-throughput tool. In view of this, a new predictor was proposed by incorporating the evolutionary information into the general form of pseudo amino acid composition via the top-n-gram approach. It was observed by comparing the new predictor with the existing methods via both jackknife test and independent data-set test that the new predictor outperformed its counterparts. It is anticipated that the new predictor may become a useful vehicle for identifying DNA-binding proteins. It has not escaped our notice that the novel approach to extract evolutionary information into the formulation of statistical samples can be used to identify many other protein attributes as well.
Collapse
Affiliation(s)
- Ruifeng Xu
- a School of Computer Science and Technology , Harbin Institute of Technology Shenzhen Graduate School, HIT Campus Shenzhen University Town , Xili, Shenzhen 518055 , Guangdong , China
| | | | | | | | | | | | | |
Collapse
|
19
|
Zhang N, Huang T, Cai YD. Discriminating between deleterious and neutral non-frameshifting indels based on protein interaction networks and hybrid properties. Mol Genet Genomics 2014; 290:343-52. [PMID: 25248637 DOI: 10.1007/s00438-014-0922-5] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2014] [Accepted: 09/12/2014] [Indexed: 02/06/2023]
Abstract
More than ten thousand coding variants are contained in each human genome; however, our knowledge of the way genetic variants underlie phenotypic differences is far from complete. Small insertions and deletions (indels) are one of the most common types of human genetic variants, and indels play a significant role in human inherited disease. To date, we still lack a comprehensive understanding of how indels cause diseases. Therefore, identification and analysis of such deleterious variants is a key challenge and has been of great interest in the current research in genome biology. Increasing numbers of computational methods have been developed for discriminating between deleterious indels and neutral indels. However, most of the existing methods are based on traditional sequential or structural features, which cannot completely explain the association between indels and the resulting induced inherited disease. In this study, we establish a novel method to predict deleterious non-frameshifting indels based on features extracted from both protein interaction networks and traditional hybrid properties. Each indel was coded by 1,246 features. Using the maximum relevance minimum redundancy method and the incremental feature selection method, we obtained an optimal feature set containing 42 features, of which 21 features were derived from protein interaction networks. Based on the optimal feature set, an 88 % accuracy and a 0.76 MCC value were achieved by a Random Forest as evaluated by the Jackknife cross-validation test. This method outperformed existing methods of predicting deleterious indels, and can be applied in practice for deleterious non-frameshifting indel predictions in genome research. The analysis of the optimal features selected in the model revealed that network interactions play more important roles and could be informative for better illustrating an indel's function and disease associations than traditional sequential or structural features. These results could shed some light on the genetic basis of human genetic variations and human inherited diseases.
Collapse
Affiliation(s)
- Ning Zhang
- Department of Biomedical Engineering, Tianjin Key Lab of BME Measurement, Tianjin University, Tianjin, 300072, People's Republic of China
| | | | | |
Collapse
|
20
|
Chen L, Lu J, Huang T, Yin J, Wei L, Cai YD. Finding candidate drugs for hepatitis C based on chemical-chemical and chemical-protein interactions. PLoS One 2014; 9:e107767. [PMID: 25225900 PMCID: PMC4166673 DOI: 10.1371/journal.pone.0107767] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2014] [Accepted: 08/14/2014] [Indexed: 11/18/2022] Open
Abstract
Hepatitis C virus (HCV) is an infectious virus that can cause serious illnesses. Only a few drugs have been reported to effectively treat hepatitis C. To have greater diversity in drug choice and better treatment options, it is necessary to develop more drugs to treat the infection. However, it is time-consuming and expensive to discover candidate drugs using experimental methods, and computational methods may complement experimental approaches as a preliminary filtering process. This type of approach was proposed by using known chemical-chemical interactions to extract interactive compounds with three known drug compounds of HCV, and the probabilities of these drug compounds being able to treat hepatitis C were calculated using chemical-protein interactions between the interactive compounds and HCV target genes. Moreover, the randomization test and expectation-maximization (EM) algorithm were both employed to exclude false discoveries. Analysis of the selected compounds, including acyclovir and ganciclovir, indicated that some of these compounds had potential to treat the HCV. Hopefully, this proposed method could provide new insights into the discovery of candidate drugs for the treatment of HCV and other diseases.
Collapse
Affiliation(s)
- Lei Chen
- College of Information Engineering, Shanghai Maritime University, Shanghai, People's Republic of China
| | - Jing Lu
- Department of Medicinal Chemistry, School of Pharmacy, Yantai University, Shandong, Yantai, People's Republic of China
| | - Tao Huang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Jun Yin
- College of Information Engineering, Shanghai Maritime University, Shanghai, People's Republic of China
| | - Lai Wei
- College of Information Engineering, Shanghai Maritime University, Shanghai, People's Republic of China
| | - Yu-Dong Cai
- Institute of Systems Biology, Shanghai University, Shanghai, People's Republic of China
- * E-mail:
| |
Collapse
|
21
|
Large-scale protein-protein interactions detection by integrating big biosensing data with computational model. BIOMED RESEARCH INTERNATIONAL 2014; 2014:598129. [PMID: 25215285 PMCID: PMC4151593 DOI: 10.1155/2014/598129] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/23/2014] [Accepted: 07/24/2014] [Indexed: 01/12/2023]
Abstract
Protein-protein interactions are the basis of biological functions, and studying these interactions on a molecular level is of crucial importance for understanding the functionality of a living cell. During the past decade, biosensors have emerged as an important tool for the high-throughput identification of proteins and their interactions. However, the high-throughput experimental methods for identifying PPIs are both time-consuming and expensive. On the other hand, high-throughput PPI data are often associated with high false-positive and high false-negative rates. Targeting at these problems, we propose a method for PPI detection by integrating biosensor-based PPI data with a novel computational model. This method was developed based on the algorithm of extreme learning machine combined with a novel representation of protein sequence descriptor. When performed on the large-scale human protein interaction dataset, the proposed method achieved 84.8% prediction accuracy with 84.08% sensitivity at the specificity of 85.53%. We conducted more extensive experiments to compare the proposed method with the state-of-the-art techniques, support vector machine. The achieved results demonstrate that our approach is very promising for detecting new PPIs, and it can be a helpful supplement for biosensor-based PPI data detection.
Collapse
|
22
|
iCTX-type: a sequence-based predictor for identifying the types of conotoxins in targeting ion channels. BIOMED RESEARCH INTERNATIONAL 2014; 2014:286419. [PMID: 24991545 PMCID: PMC4058692 DOI: 10.1155/2014/286419] [Citation(s) in RCA: 137] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/13/2014] [Revised: 04/22/2014] [Accepted: 05/07/2014] [Indexed: 11/30/2022]
Abstract
Conotoxins are small disulfide-rich neurotoxic peptides, which can bind to ion channels with very high specificity and modulate their activities. Over the last few decades, conotoxins have been the drug candidates for treating chronic pain, epilepsy, spasticity, and cardiovascular diseases. According to their functions and targets, conotoxins are generally categorized into three types: potassium-channel type, sodium-channel type, and calcium-channel types. With the avalanche of peptide sequences generated in the postgenomic age, it is urgent and challenging to develop an automated method for rapidly and accurately identifying the types of conotoxins based on their sequence information alone. To address this challenge, a new predictor, called iCTX-Type, was developed by incorporating the dipeptide occurrence frequencies of a conotoxin sequence into a 400-D (dimensional) general pseudoamino acid composition, followed by the feature optimization procedure to reduce the sample representation from 400-D to 50-D vector. The overall success rate achieved by iCTX-Type via a rigorous cross-validation was over 91%, outperforming its counterpart (RBF network). Besides, iCTX-Type is so far the only predictor in this area with its web-server available, and hence is particularly useful for most experimental scientists to get their desired results without the need to follow the complicated mathematics involved.
Collapse
|
23
|
Zhang N, Jiang M, Huang T, Cai YD. Identification of Influenza A/H7N9 virus infection-related human genes based on shortest paths in a virus-human protein interaction network. BIOMED RESEARCH INTERNATIONAL 2014; 2014:239462. [PMID: 24955349 PMCID: PMC4052153 DOI: 10.1155/2014/239462] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/10/2014] [Revised: 04/18/2014] [Accepted: 04/21/2014] [Indexed: 12/15/2022]
Abstract
The recently emerging Influenza A/H7N9 virus is reported to be able to infect humans and cause mortality. However, viral and host factors associated with the infection are poorly understood. It is suggested by the "guilt by association" rule that interacting proteins share the same or similar functions and hence may be involved in the same pathway. In this study, we developed a computational method to identify Influenza A/H7N9 virus infection-related human genes based on this rule from the shortest paths in a virus-human protein interaction network. Finally, we screened out the most significant 20 human genes, which could be the potential infection related genes, providing guidelines for further experimental validation. Analysis of the 20 genes showed that they were enriched in protein binding, saccharide or polysaccharide metabolism related pathways and oxidative phosphorylation pathways. We also compared the results with those from human rhinovirus (HRV) and respiratory syncytial virus (RSV) by the same method. It was indicated that saccharide or polysaccharide metabolism related pathways might be especially associated with the H7N9 infection. These results could shed some light on the understanding of the virus infection mechanism, providing basis for future experimental biology studies and for the development of effective strategies for H7N9 clinical therapies.
Collapse
Affiliation(s)
- Ning Zhang
- Department of Biomedical Engineering, Tianjin University, Tianjin Key Lab of BME Measurement, Tianjin 300072, China
| | - Min Jiang
- State Key Laboratory of Medical Genomics, Institute of Health Sciences, Shanghai Jiaotong University School of Medicine and Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200025, China
| | - Tao Huang
- Department of Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York City, NY, USA
- Institute of Systems Biology, Shanghai University, 99 Shangda Road, Shanghai 200444, China
| | - Yu-Dong Cai
- Institute of Systems Biology, Shanghai University, 99 Shangda Road, Shanghai 200444, China
| |
Collapse
|
24
|
Fan YN, Xiao X, Min JL, Chou KC. iNR-Drug: predicting the interaction of drugs with nuclear receptors in cellular networking. Int J Mol Sci 2014; 15:4915-37. [PMID: 24651462 PMCID: PMC3975431 DOI: 10.3390/ijms15034915] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2014] [Revised: 02/12/2014] [Accepted: 02/16/2014] [Indexed: 12/20/2022] Open
Abstract
Nuclear receptors (NRs) are closely associated with various major diseases such as cancer, diabetes, inflammatory disease, and osteoporosis. Therefore, NRs have become a frequent target for drug development. During the process of developing drugs against these diseases by targeting NRs, we are often facing a problem: Given a NR and chemical compound, can we identify whether they are really in interaction with each other in a cell? To address this problem, a predictor called “iNR-Drug” was developed. In the predictor, the drug compound concerned was formulated by a 256-D (dimensional) vector derived from its molecular fingerprint, and the NR by a 500-D vector formed by incorporating its sequential evolution information and physicochemical features into the general form of pseudo amino acid composition, and the prediction engine was operated by the SVM (support vector machine) algorithm. Compared with the existing prediction methods in this area, iNR-Drug not only can yield a higher success rate, but is also featured by a user-friendly web-server established at http://www.jci-bioinfo.cn/iNR-Drug/, which is particularly useful for most experimental scientists to obtain their desired data in a timely manner. It is anticipated that the iNR-Drug server may become a useful high throughput tool for both basic research and drug development, and that the current approach may be easily extended to study the interactions of drug with other targets as well.
Collapse
Affiliation(s)
- Yue-Nong Fan
- Computer Department, Jing-De-Zhen Ceramic Institute, Jingdezhen 333046, Jiangxi, China.
| | - Xuan Xiao
- Computer Department, Jing-De-Zhen Ceramic Institute, Jingdezhen 333046, Jiangxi, China.
| | - Jian-Liang Min
- Computer Department, Jing-De-Zhen Ceramic Institute, Jingdezhen 333046, Jiangxi, China.
| | - Kuo-Chen Chou
- Center of Excellence in Genomic Medicine Research (CEGMR), King Abdulaziz University, Jeddah 21589, Saudi Arabia.
| |
Collapse
|
25
|
A QSPR-like model for multilocus genotype networks of Fasciola hepatica in Northwest Spain. J Theor Biol 2014; 343:16-24. [DOI: 10.1016/j.jtbi.2013.11.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2013] [Revised: 11/08/2013] [Accepted: 11/11/2013] [Indexed: 11/23/2022]
|
26
|
iRSpot-TNCPseAAC: identify recombination spots with trinucleotide composition and pseudo amino acid components. Int J Mol Sci 2014; 15:1746-66. [PMID: 24469313 PMCID: PMC3958819 DOI: 10.3390/ijms15021746] [Citation(s) in RCA: 211] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2014] [Revised: 01/14/2014] [Accepted: 01/16/2014] [Indexed: 01/22/2023] Open
Abstract
Meiosis and recombination are the two opposite aspects that coexist in a DNA system. As a driving force for evolution by generating natural genetic variations, meiotic recombination plays a very important role in the formation of eggs and sperm. Interestingly, the recombination does not occur randomly across a genome, but with higher probability in some genomic regions called “hotspots”, while with lower probability in so-called “coldspots”. With the ever-increasing amount of genome sequence data in the postgenomic era, computational methods for effectively identifying the hotspots and coldspots have become urgent as they can timely provide us with useful insights into the mechanism of meiotic recombination and the process of genome evolution as well. To meet the need, we developed a new predictor called “iRSpot-TNCPseAAC”, in which a DNA sample was formulated by combining its trinucleotide composition (TNC) and the pseudo amino acid components (PseAAC) of the protein translated from the DNA sample according to its genetic codes. The former was used to incorporate its local or short-rage sequence order information; while the latter, its global and long-range one. Compared with the best existing predictor in this area, iRSpot-TNCPseAAC achieved higher rates in accuracy, Mathew’s correlation coefficient, and sensitivity, indicating that the new predictor may become a useful tool for identifying the recombination hotspots and coldspots, or, at least, become a complementary tool to the existing methods. It has not escaped our notice that the aforementioned novel approach to incorporate the DNA sequence order information into a discrete model may also be used for many other genome analysis problems. The web-server for iRSpot-TNCPseAAC is available at http://www.jci-bioinfo.cn/iRSpot-TNCPseAAC. Furthermore, for the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the current web server to obtain their desired result without the need to follow the complicated mathematical equations.
Collapse
|
27
|
Feng PM, Chen W, Lin H, Chou KC. iHSP-PseRAAAC: Identifying the heat shock protein families using pseudo reduced amino acid alphabet composition. Anal Biochem 2013; 442:118-25. [DOI: 10.1016/j.ab.2013.05.024] [Citation(s) in RCA: 230] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2013] [Revised: 05/21/2013] [Accepted: 05/22/2013] [Indexed: 01/22/2023]
|
28
|
Fan GL, Li QZ. Discriminating bioluminescent proteins by incorporating average chemical shift and evolutionary information into the general form of Chou's pseudo amino acid composition. J Theor Biol 2013; 334:45-51. [DOI: 10.1016/j.jtbi.2013.06.003] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2013] [Revised: 05/30/2013] [Accepted: 06/03/2013] [Indexed: 01/22/2023]
|
29
|
Predicting protein subchloroplast locations with both single and multiple sites via three different modes of Chou's pseudo amino acid compositions. J Theor Biol 2013; 335:205-12. [DOI: 10.1016/j.jtbi.2013.06.034] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2013] [Revised: 05/26/2013] [Accepted: 06/29/2013] [Indexed: 12/19/2022]
|
30
|
Advanced systems biology methods in drug discovery and translational biomedicine. BIOMED RESEARCH INTERNATIONAL 2013; 2013:742835. [PMID: 24171171 PMCID: PMC3792523 DOI: 10.1155/2013/742835] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Subscribe] [Scholar Register] [Received: 06/17/2013] [Accepted: 08/26/2013] [Indexed: 02/08/2023]
Abstract
Systems biology is in an exponential development stage in recent years and has been widely utilized in biomedicine to better understand the molecular basis of human disease and the mechanism of drug action. Here, we discuss the fundamental concept of systems biology and its two computational methods that have been commonly used, that is, network analysis and dynamical modeling. The applications of systems biology in elucidating human disease are highlighted, consisting of human disease networks, treatment response prediction, investigation of disease mechanisms, and disease-associated gene prediction. In addition, important advances in drug discovery, to which systems biology makes significant contributions, are discussed, including drug-target networks, prediction of drug-target interactions, investigation of drug adverse effects, drug repositioning, and drug combination prediction. The systems biology methods and applications covered in this review provide a framework for addressing disease mechanism and approaching drug discovery, which will facilitate the translation of research findings into clinical benefits such as novel biomarkers and promising therapies.
Collapse
|
31
|
Xiao X, Min JL, Wang P, Chou KC. iCDI-PseFpt: identify the channel-drug interaction in cellular networking with PseAAC and molecular fingerprints. J Theor Biol 2013; 337:71-9. [PMID: 23988798 DOI: 10.1016/j.jtbi.2013.08.013] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2013] [Revised: 07/26/2013] [Accepted: 08/14/2013] [Indexed: 12/29/2022]
Abstract
Many crucial functions in life, such as heartbeat, sensory transduction and central nervous system response, are controlled by cell signalings via various ion channels. Therefore, ion channels have become an excellent drug target, and study of ion channel-drug interaction networks is an important topic for drug development. However, it is both time-consuming and costly to determine whether a drug and a protein ion channel are interacting with each other in a cellular network by means of experimental techniques. Although some computational methods were developed in this regard based on the knowledge of the 3D (three-dimensional) structure of protein, unfortunately their usage is quite limited because the 3D structures for most protein ion channels are still unknown. With the avalanche of protein sequences generated in the post-genomic age, it is highly desirable to develop the sequence-based computational method to address this problem. To take up the challenge, we developed a new predictor called iCDI-PseFpt, in which the protein ion-channel sample is formulated by the PseAAC (pseudo amino acid composition) generated with the gray model theory, the drug compound by the 2D molecular fingerprint, and the operation engine is the fuzzy K-nearest neighbor algorithm. The overall success rate achieved by iCDI-PseFpt via the jackknife cross-validation was 87.27%, which is remarkably higher than that by any of the existing predictors in this area. As a user-friendly web-server, iCDI-PseFpt is freely accessible to the public at the website http://www.jci-bioinfo.cn/iCDI-PseFpt/. Furthermore, for the convenience of most experimental scientists, a step-by-step guide is provided on how to use the web-server to get the desired results without the need to follow the complicated math equations presented in the paper just for its integrity. It has not escaped our notice that the current approach can also be used to study other drug-target interaction networks.
Collapse
Affiliation(s)
- Xuan Xiao
- Computer Department, Jing-De-Zhen Ceramic Institute, Jing-De-Zhen 333403, China; Information School, Zhe-Jiang Textile & Fashion College, Ning-Bo 315211, China; Gordon Life Science Institute, 53 South Cottage Road, Belmont, MA 02478, United States.
| | - Jian-Liang Min
- Computer Department, Jing-De-Zhen Ceramic Institute, Jing-De-Zhen 333403, China.
| | - Pu Wang
- Computer Department, Jing-De-Zhen Ceramic Institute, Jing-De-Zhen 333403, China.
| | - Kuo-Chen Chou
- Center of Excellence in Genomic Medicine Research (CEGMR), King Abdulaziz University, Jeddah, Saudi Arabia; Gordon Life Science Institute, 53 South Cottage Road, Belmont, MA 02478, United States.
| |
Collapse
|
32
|
Yang X, Wang T. Linear regression model of short k-word: a similarity distance suitable for biological sequences with various lengths. J Theor Biol 2013; 337:61-70. [PMID: 23933105 DOI: 10.1016/j.jtbi.2013.07.028] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2013] [Revised: 07/02/2013] [Accepted: 07/30/2013] [Indexed: 01/10/2023]
Abstract
Originating from sequences' length difference, both k-word based methods and graphical representation approaches have uncovered biological information in their distinct ways. However, it is less likely that the mechanisms of information storage vary with sequences' length. A similarity distance suitable for sequences with various lengths will be much near to the mechanisms of information storage. In this paper, new sub-sequences of k-word were extracted from biological sequences under a one-to-one mapping. The new sub-sequences were evaluated by a linear regression model. Moreover, a new distance was defined on the invariants from the linear regression model. With comparison to other alignment-free distances, the results of four experiments demonstrated that our similarity distance was more efficient.
Collapse
Affiliation(s)
- Xiwu Yang
- School of Mathematical Sciences, Dalian University of Technology, Dalian, Liaoning 116024, PR China; School of Mathematics, Liaoning Normal University, Dalian, Liaoning 116029, PR China.
| | - Tianming Wang
- School of Mathematical Sciences, Dalian University of Technology, Dalian, Liaoning 116024, PR China
| |
Collapse
|
33
|
Liu B, Wang X, Zou Q, Dong Q, Chen Q. Protein Remote Homology Detection by Combining Chou’s Pseudo Amino Acid Composition and Profile-Based Protein Representation. Mol Inform 2013; 32:775-82. [DOI: 10.1002/minf.201300084] [Citation(s) in RCA: 97] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2013] [Accepted: 06/11/2013] [Indexed: 11/12/2022]
|
34
|
Using radial basis function on the general form of Chou's pseudo amino acid composition and PSSM to predict subcellular locations of proteins with both single and multiple sites. Biosystems 2013; 113:50-7. [DOI: 10.1016/j.biosystems.2013.04.005] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2012] [Revised: 04/10/2013] [Accepted: 04/24/2013] [Indexed: 12/22/2022]
|
35
|
Feng PM, Ding H, Chen W, Lin H. Naïve Bayes classifier with feature selection to identify phage virion proteins. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2013; 2013:530696. [PMID: 23762187 PMCID: PMC3671239 DOI: 10.1155/2013/530696] [Citation(s) in RCA: 107] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 03/10/2013] [Revised: 04/16/2013] [Accepted: 04/28/2013] [Indexed: 12/31/2022]
Abstract
Knowledge about the protein composition of phage virions is a key step to understand the functions of phage virion proteins. However, the experimental method to identify virion proteins is time consuming and expensive. Thus, it is highly desirable to develop novel computational methods for phage virion protein identification. In this study, a Naïve Bayes based method was proposed to predict phage virion proteins using amino acid composition and dipeptide composition. In order to remove redundant information, a novel feature selection technique was employed to single out optimized features. In the jackknife test, the proposed method achieved an accuracy of 79.15% for phage virion and nonvirion proteins classification, which are superior to that of other state-of-the-art classifiers. These results indicate that the proposed method could be as an effective and promising high-throughput method in phage proteomics research.
Collapse
Affiliation(s)
- Peng-Mian Feng
- School of Public Health, Hebei United University, Tangshan 063000, China
| | - Hui Ding
- Key Laboratory for Neuroinformation of Ministry of Education, Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Wei Chen
- Department of Physics, School of Sciences, Center for Genomics and Computational Biology, Hebei United University, Tangshan 063000, China
| | - Hao Lin
- Key Laboratory for Neuroinformation of Ministry of Education, Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China
| |
Collapse
|
36
|
Signal propagation in protein interaction network during colorectal cancer progression. BIOMED RESEARCH INTERNATIONAL 2013; 2013:287019. [PMID: 23586028 PMCID: PMC3615629 DOI: 10.1155/2013/287019] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/16/2013] [Accepted: 02/18/2013] [Indexed: 11/18/2022]
Abstract
Colorectal cancer is generally categorized into the following four stages according to its development or serious degree: Dukes A, B, C, and D. Since different stage of colorectal cancer actually corresponds to different activated region of the network, the transition of different network states may reflect its pathological changes. In view of this, we compared the gene expressions among the colorectal cancer patients in the aforementioned four stages and obtained the early and late stage biomarkers, respectively. Subsequently, the two kinds of biomarkers were both mapped onto the protein interaction network. If an early biomarker and a late biomarker were close in the network and also if their expression levels were correlated in the Dukes B and C patients, then a signal propagation path from the early stage biomarker to the late one was identified. Many transition genes in the signal propagation paths were involved with the signal transduction, cell communication, and cellular process regulation. Some transition hubs were known as colorectal cancer genes. The findings reported here may provide useful insights for revealing the mechanism of colorectal cancer progression at the cellular systems biology level.
Collapse
|
37
|
Two classifiers based on serum peptide pattern for prediction of HBV-induced liver cirrhosis using MALDI-TOF MS. BIOMED RESEARCH INTERNATIONAL 2013; 2013:814876. [PMID: 23509784 PMCID: PMC3590609 DOI: 10.1155/2013/814876] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2012] [Accepted: 01/09/2013] [Indexed: 01/16/2023]
Abstract
Chronic infection with hepatitis B virus (HBV) is associated with the majority of cases of liver cirrhosis (LC) in China. Although liver biopsy is the reference method for evaluation of cirrhosis, it is an invasive procedure with inherent risk. The aim of this study is to discover novel noninvasive specific serum biomarkers for the diagnosis of HBV-induced LC. We performed bead fractionation/MALDI-TOF MS analysis on sera from patients with LC. Thirteen feature peaks which had optimal discriminatory performance were obtained by using support-vector-machine-(SVM-) based strategy. Based on the previous results, five supervised machine learning methods were employed to construct classifiers that discriminated proteomic spectra of patients with HBV-induced LC from those of controls. Here, we describe two novel methods for prediction of HBV-induced LC, termed LC-NB and LC-MLP, respectively. We obtained a sensitivity of 90.9%, a specificity of 94.9%, and overall accuracy of 93.8% on an independent test set. Comparisons with the existing methods showed that LC-NB and LC-MLP held better accuracy. Our study suggests that potential serum biomarkers can be determined for discriminating LC and non-LC cohorts by using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. These two classifiers could be used for clinical practice in HBV-induced LC assessment.
Collapse
|
38
|
Xiao X, Wang P, Lin WZ, Jia JH, Chou KC. iAMP-2L: a two-level multi-label classifier for identifying antimicrobial peptides and their functional types. Anal Biochem 2013; 436:168-77. [PMID: 23395824 DOI: 10.1016/j.ab.2013.01.019] [Citation(s) in RCA: 370] [Impact Index Per Article: 30.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2012] [Revised: 01/10/2013] [Accepted: 01/21/2013] [Indexed: 12/14/2022]
Abstract
Antimicrobial peptides (AMPs), also called host defense peptides, are an evolutionarily conserved component of the innate immune response and are found among all classes of life. According to their special functions, AMPs are generally classified into ten categories: Antibacterial Peptides, Anticancer/tumor Peptides, Antifungal Peptides, Anti-HIV Peptides, Antiviral Peptides, Antiparasital Peptides, Anti-protist Peptides, AMPs with Chemotactic Activity, Insecticidal Peptides, and Spermicidal Peptides. Given a query peptide, how can we identify whether it is an AMP or non-AMP? If it is, can we identify which functional type or types it belong to? Particularly, how can we deal with the multi-type problem since an AMP may belong to two or more functional types? To address these problems, which are obviously very important to both basic research and drug development, a multi-label classifier was developed based on the pseudo amino acid composition (PseAAC) and fuzzy K-nearest neighbor (FKNN) algorithm, where the components of PseAAC were featured by incorporating five physicochemical properties. The novel classifier is called iAMP-2L, where "2L" means that it is a 2-level predictor. The 1st-level is to answer the 1st question above, while the 2nd-level is to answer the 2nd and 3rd questions that are beyond the reach of any existing methods in this area. For the conveniences of users, a user-friendly web-server for iAMP-2L was established at http://www.jci-bioinfo.cn/iAMP-2L.
Collapse
Affiliation(s)
- Xuan Xiao
- Computer Department, Jing-De-Zhen Ceramic Institute, Jing-De-Zhen 333403, China.
| | | | | | | | | |
Collapse
|
39
|
Belle A, Kon MA, Najarian K. Biomedical informatics for computer-aided decision support systems: a survey. ScientificWorldJournal 2013; 2013:769639. [PMID: 23431259 PMCID: PMC3575619 DOI: 10.1155/2013/769639] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2012] [Accepted: 01/09/2013] [Indexed: 11/18/2022] Open
Abstract
The volumes of current patient data as well as their complexity make clinical decision making more challenging than ever for physicians and other care givers. This situation calls for the use of biomedical informatics methods to process data and form recommendations and/or predictions to assist such decision makers. The design, implementation, and use of biomedical informatics systems in the form of computer-aided decision support have become essential and widely used over the last two decades. This paper provides a brief review of such systems, their application protocols and methodologies, and the future challenges and directions they suggest.
Collapse
Affiliation(s)
- Ashwin Belle
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Mark A. Kon
- Department of Mathematics and Statistics, Boston University, Boston, MA 02215, USA
| | - Kayvan Najarian
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
Collapse
|
40
|
Chou KC. Some remarks on predicting multi-label attributes in molecular biosystems. MOLECULAR BIOSYSTEMS 2013; 9:1092-100. [DOI: 10.1039/c3mb25555g] [Citation(s) in RCA: 364] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
41
|
Huang T, Jiang M, Kong X, Cai YD. Dysfunctions associated with methylation, microRNA expression and gene expression in lung cancer. PLoS One 2012; 7:e43441. [PMID: 22912875 PMCID: PMC3422260 DOI: 10.1371/journal.pone.0043441] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Accepted: 07/23/2012] [Indexed: 12/02/2022] Open
Abstract
Integrating high-throughput data obtained from different molecular levels is essential for understanding the mechanisms of complex diseases such as cancer. In this study, we integrated the methylation, microRNA and mRNA data from lung cancer tissues and normal lung tissues using functional gene sets. For each Gene Ontology (GO) term, three sets were defined: the methylation set, the microRNA set and the mRNA set. The discriminating ability of each gene set was represented by the Matthews correlation coefficient (MCC), as evaluated by leave-one-out cross-validation (LOOCV). Next, the MCCs in the methylation sets, the microRNA sets and the mRNA sets were ranked. By comparing the MCC ranks of methylation, microRNA and mRNA for each GO term, we classified the GO sets into six groups and identified the dysfunctional methylation, microRNA and mRNA gene sets in lung cancer. Our results provide a systematic view of the functional alterations during tumorigenesis that may help to elucidate the mechanisms of lung cancer and lead to improved treatments for patients.
Collapse
Affiliation(s)
- Tao Huang
- Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, People's Republic of China
- Shanghai Center for Bioinformation Technology, Shanghai, People's Republic of China
| | - Min Jiang
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences and Shanghai Jiao Tong University School of Medicine, Shanghai, People's Republic of China
- State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiaotong University, Shanghai, People's Republic of China
| | - Xiangyin Kong
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences and Shanghai Jiao Tong University School of Medicine, Shanghai, People's Republic of China
- State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiaotong University, Shanghai, People's Republic of China
| | - Yu-Dong Cai
- Institute of Systems Biology, Shanghai University, Shanghai, People's Republic of China
| |
Collapse
|