1
|
Maalal O, Prat M, Peinador R, Lasseux D. Determination of the throat size distribution of a porous medium as an inverse optimization problem combining pore network modeling and genetic and hill climbing algorithms. Phys Rev E 2021; 103:023303. [PMID: 33735971 DOI: 10.1103/physreve.103.023303] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Accepted: 01/13/2021] [Indexed: 11/07/2022]
Abstract
The pore size distribution of a porous medium is often estimated from the retention curve or the invading fluid flow rate curve using simple relationships more or less explicitly based on the consideration that the porous medium is made of a bundle of cylindrical parallel tubes. This type of determination is tested using pore network simulations. Starting from two- or three-dimensional networks, the characteristics of which are known a priori, the estimation of the throat size distribution (TSD) is performed using the standard methods in the case of drainage. Results show a significant discrepancy with the input data. The disagreement is more pronounced when the fluid flow rate curve is employed together with the parallel tubes assumption. The physical origins of these shortcomings are identified. A method, based on pore network simulations combined with a genetic algorithm and the hill climbing algorithm, is then designed, which makes simultaneous use of the nonwetting fluid flow rate curve and the retention curve of the medium. Very significant improvement is achieved in the estimation of the TSD using this procedure.
Collapse
Affiliation(s)
- Otman Maalal
- Institut de Mécanique des Fluides de Toulouse, Université de Toulouse, Centre National de la Recherche Scientifique, 31400 Toulouse, France.,Institut de La Filtration et des Techniques Séparatives, Rue Marcel Pagnol, 47510 Foulayronnes, France
| | - Marc Prat
- Institut de Mécanique des Fluides de Toulouse, Université de Toulouse, Centre National de la Recherche Scientifique, 31400 Toulouse, France
| | - René Peinador
- Institut de La Filtration et des Techniques Séparatives, Rue Marcel Pagnol, 47510 Foulayronnes, France
| | - Didier Lasseux
- I2M, UMR 5295, Centre National de la Recherche Scientifique, Université Bordeaux, Esplanade des Arts et Métiers, 33405 Talence CEDEX, France
| |
Collapse
|
2
|
Van Rompuy LS, Savić ND, Rodriguez A, Parac-Vogt TN. Selective Hydrolysis of Transferrin Promoted by Zr-Substituted Polyoxometalates. Molecules 2020; 25:E3472. [PMID: 32751602 PMCID: PMC7435656 DOI: 10.3390/molecules25153472] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Revised: 07/27/2020] [Accepted: 07/28/2020] [Indexed: 12/20/2022] Open
Abstract
The hydrolysis of the iron-binding blood plasma glycoprotein transferrin (Tf) has been examined at pH = 7.4 in the presence of a series of Zr-substituted polyoxometalates (Zr-POMs) including Keggin (Et2NH2)10[Zr(PW11O39)2]∙7H2O (Zr-K 1:2), (Et2NH2)8[{α-PW11O39Zr-(μ-OH) (H2O)}2]∙7H2O (Zr-K 2:2), Wells-Dawson K15H[Zr(α2-P2W17O61)2]·25H2O (Zr-WD 1:2), Na14[Zr4(α-P2W16O59)2(μ3-O)2(μ-OH)2(H2O)4]·57H2O (Zr-WD 4:2) and Lindqvist (Me4N)2[ZrW5O18(H2O)3] (Zr-L 1:1), (nBu4N)6[(ZrW5O18(μ-OH))2]∙2H2O (Zr-L 2:2)) type POMs. Incubation of transferrin with Zr-POMs resulted in formation of 13 polypeptide fragments that were observed on sodium dodecyl sulfate poly(acrylamide) gel electrophoresis (SDS-PAGE), but the hydrolysis efficiency varied depending on the nature of Zr-POMs. Molecular interactions between Zr-POMs and transferrin were investigated by using a range of complementary techniques such as tryptophan fluorescence, circular dichroism (CD), 31P-NMR spectroscopy, in order to gain better understanding of different efficiency of investigated Zr-POMs. A tryptophan fluorescence quenching study revealed that the most reactive Zr-WD species show the strongest interaction toward transferrin. The CD results demonstrated that interaction of Zr-POMs and transferrin in buffer solution result in significant secondary structure changes. The speciation of Zr-POMs has been followed by 31P-NMR spectroscopy in the presence and absence of transferrin, providing insight into stability of the catalysts under reaction condition.
Collapse
Affiliation(s)
| | | | | | - Tatjana N. Parac-Vogt
- Department of Chemistry, KU Leuven, Celestijnenlaan 200F, 3001 Leuven, Belgium; (L.S.V.R.); (N.D.S.); (A.R.)
| |
Collapse
|
3
|
Tang J, Wang Y, Luo Y, Fu J, Zhang Y, Li Y, Xiao Z, Lou Y, Qiu Y, Zhu F. Computational advances of tumor marker selection and sample classification in cancer proteomics. Comput Struct Biotechnol J 2020; 18:2012-2025. [PMID: 32802273 PMCID: PMC7403885 DOI: 10.1016/j.csbj.2020.07.009] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 07/06/2020] [Accepted: 07/08/2020] [Indexed: 12/11/2022] Open
Abstract
Cancer proteomics has become a powerful technique for characterizing the protein markers driving transformation of malignancy, tracing proteome variation triggered by therapeutics, and discovering the novel targets and drugs for the treatment of oncologic diseases. To facilitate cancer diagnosis/prognosis and accelerate drug target discovery, a variety of methods for tumor marker identification and sample classification have been developed and successfully applied to cancer proteomic studies. This review article describes the most recent advances in those various approaches together with their current applications in cancer-related studies. Firstly, a number of popular feature selection methods are overviewed with objective evaluation on their advantages and disadvantages. Secondly, these methods are grouped into three major classes based on their underlying algorithms. Finally, a variety of sample separation algorithms are discussed. This review provides a comprehensive overview of the advances on tumor maker identification and patients/samples/tissues separations, which could be guidance to the researches in cancer proteomics.
Collapse
Key Words
- ANN, Artificial Neural Network
- ANOVA, Analysis of Variance
- CFS, Correlation-based Feature Selection
- Cancer proteomics
- Computational methods
- DAPC, Discriminant Analysis of Principal Component
- DT, Decision Trees
- EDA, Estimation of Distribution Algorithm
- FC, Fold Change
- GA, Genetic Algorithms
- GR, Gain Ratio
- HC, Hill Climbing
- HCA, Hierarchical Cluster Analysis
- IG, Information Gain
- LDA, Linear Discriminant Analysis
- LIMMA, Linear Models for Microarray Data
- MBF, Markov Blanket Filter
- MWW, Mann–Whitney–Wilcoxon test
- OPLS-DA, Orthogonal Partial Least Squares Discriminant Analysis
- PCA, Principal Component Analysis
- PLS-DA, Partial Least Square Discriminant Analysis
- RF, Random Forest
- RF-RFE, Random Forest with Recursive Feature Elimination
- SA, Simulated Annealing
- SAM, Significance Analysis of Microarrays
- SBE, Sequential Backward Elimination
- SFS, and Sequential Forward Selection
- SOM, Self-organizing Map
- SU, Symmetrical Uncertainty
- SVM, Support Vector Machine
- SVM-RFE, Support Vector Machine with Recursive Feature Elimination
- Sample classification
- Tumor marker selection
- sPLSDA, Sparse Partial Least Squares Discriminant Analysis
- t-SNE, Student t Distribution
- χ2, Chi-square
Collapse
Affiliation(s)
- Jing Tang
- Department of Bioinformatics, Chongqing Medical University, Chongqing 400016, China.,College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yunxia Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yongchao Luo
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Jianbo Fu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yang Zhang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.,School of Pharmaceutical Sciences and Innovative Drug Research Centre, Chongqing University, Chongqing 401331, China
| | - Yi Li
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Ziyu Xiao
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yan Lou
- Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, Hangzhou 310000, China
| | - Yunqing Qiu
- Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, Hangzhou 310000, China
| | - Feng Zhu
- Department of Bioinformatics, Chongqing Medical University, Chongqing 400016, China.,College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
4
|
Yang CH, Lin YS, Chuang LY, Lin YD. Effective hybrid approach for protein structure prediction in a two-dimensional Hydrophobic-Polar model. Comput Biol Med 2019; 113:103397. [PMID: 31494431 DOI: 10.1016/j.compbiomed.2019.103397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Revised: 08/19/2019] [Accepted: 08/19/2019] [Indexed: 10/26/2022]
Abstract
Hydrophobic-polar (HP) models are widely used to predict protein folding and hydrophobic interactions. Numerous optimization algorithms have been proposed to predict protein folding using the two-dimensional (2D) HP model. However, to obtain an optimal protein structure from the 2D HP model remains challenging. In this study, an algorithm integrating particle swarm optimization (PSO) and Tabu search (TS), named PSO-TS, was proposed to predict protein structures based on the 2D HP model. TS can help PSO to avoid getting trapped in a local optima and thus to remove the limitation of PSO in predicting protein folding by the 2D HP model. In this study, a total of 28 protein sequences were used to evaluate the accuracy of PSO-TS in protein folding prediction. The proposed PSO-TS method was compared with 15 other approaches for predicting short and long protein sequences. Experimental results demonstrated that PSO-TS provides a highly accurate, reproducible, and stabile prediction ability for the protein folding by the 2D HP model.
Collapse
Affiliation(s)
- Cheng-Hong Yang
- Department of Electronic Engineering, National Kaohsiung University of Science and Technology, No.1, Sec. 1, Syuecheng Rd., Dashu District, Kaohsiung City, 84001, Taiwan; Program in Biomedical Engineering, Kaohsiung Medical University, No.100, Tzyou 1st Rd., Sanmin Dist., Kaohsiung City, 80756, Taiwan.
| | - Yu-Shiun Lin
- Department of Electronic Engineering, National Kaohsiung University of Science and Technology, No.1, Sec. 1, Syuecheng Rd., Dashu District, Kaohsiung City, 84001, Taiwan.
| | - Li-Yeh Chuang
- Department of Chemical Engineering, I-Shou University, No.415, Jiangong Rd., Sanmin Dist., Kaohsiung City, 807, Taiwan; Institute of Biotechnology and Chemical Engineering, I-Shou University, No.415, Jiangong Rd., Sanmin Dist., Kaohsiung City, 807, Taiwan.
| | - Yu-Da Lin
- Department of Electronic Engineering, National Kaohsiung University of Science and Technology, No.1, Sec. 1, Syuecheng Rd., Dashu District, Kaohsiung City, 84001, Taiwan.
| |
Collapse
|
5
|
Yang CH, Wu KC, Lin YS, Chuang LY, Chang HW. Protein folding prediction in the HP model using ions motion optimization with a greedy algorithm. BioData Min 2018; 11:17. [PMID: 30116298 PMCID: PMC6083565 DOI: 10.1186/s13040-018-0176-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Accepted: 07/23/2018] [Indexed: 11/10/2022] Open
Abstract
Background The function of a protein is determined by its native protein structure. Among many protein prediction methods, the Hydrophobic-Polar (HP) model, an ab initio method, simplifies the protein folding prediction process in order to reduce the prediction complexity. Results In this study, the ions motion optimization (IMO) algorithm was combined with the greedy algorithm (namely IMOG) and implemented to the HP model for the protein folding prediction based on the 2D-triangular-lattice model. Prediction results showed that the integration method IMOG provided a better prediction efficiency in a HP model. Compared to others, our proposed method turned out as superior in its prediction ability and resilience for most of the test sequences. The efficiency of the proposed method was verified by the prediction results. The global search capability and the ability to escape from the local best solution of IMO combined with a local search (greedy algorithm) to the new algorithm IMOG greatly improve the search for the best solution with reliable protein folding prediction. Conclusion Overall, the HP model integrated with IMO and a greedy algorithm as IMOG provides an improved way of protein structure prediction of high stability, high efficiency, and outstanding performance.
Collapse
Affiliation(s)
- Cheng-Hong Yang
- 1Department of Electronic Engineering, National Kaohsiung University of Science and Technology, Kaohsiung, Taiwan.,2Graduate Institute of Clinical Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Kuo-Chuan Wu
- 1Department of Electronic Engineering, National Kaohsiung University of Science and Technology, Kaohsiung, Taiwan.,3Department of Computer Science and Information Engineering, National Kaohsiung University of Science and Technology, Kaohsiung, Taiwan
| | - Yu-Shiun Lin
- 1Department of Electronic Engineering, National Kaohsiung University of Science and Technology, Kaohsiung, Taiwan
| | - Li-Yeh Chuang
- 4Department of Chemical Engineering & Institute of Biotechnology and Chemical Engineering, I-Shou University, Kaohsiung, Taiwan
| | - Hsueh-Wei Chang
- 5Department of Biomedical Science and Environmental Biology, Kaohsiung Medical University, Kaohsiung, Taiwan.,6Institute of Medical Science and Technology, National Sun Yat-sen University, Kaohsiung, Taiwan.,7Department of Medical Research, Kaohsiung Medical University Hospital, Kaohsiung, Taiwan
| |
Collapse
|
6
|
Dubey SP, Balaji S, Kini NG, Sathish Kumar M. A Novel Framework for Ab Initio Coarse Protein Structure Prediction. Adv Bioinformatics 2018; 2018:7607384. [PMID: 30026759 PMCID: PMC6031167 DOI: 10.1155/2018/7607384] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Revised: 04/26/2018] [Accepted: 05/27/2018] [Indexed: 02/07/2023] Open
Abstract
Hydrophobic-Polar model is a simplified representation of Protein Structure Prediction (PSP) problem. However, even with the HP model, the PSP problem remains NP-complete. This work proposes a systematic and problem specific design for operators of the evolutionary program which hybrids with local search hill climbing, to efficiently explore the search space of PSP and thereby obtain an optimum conformation. The proposed algorithm achieves this by incorporating the following novel features: (i) new initialization method which generates only valid individuals with (rather than random) better fitness values; (ii) use of probability-based selection operators that limit the local convergence; (iii) use of secondary structure based mutation operator that makes the structure more closely to the laboratory determined structure; and (iv) incorporating all the above-mentioned features developed a complete two-tier framework. The developed framework builds the protein conformation on the square and triangular lattice. The test has been performed using benchmark sequences, and a comparative evaluation is done with various state-of-the-art algorithms. Moreover, in addition to hypothetical test sequences, we have tested protein sequences deposited in protein database repository. It has been observed that the proposed framework has shown superior performance regarding accuracy (fitness value) and speed (number of generations needed to attain the final conformation). The concepts used to enhance the performance are generic and can be used with any other population-based search algorithm such as genetic algorithm, ant colony optimization, and immune algorithm.
Collapse
Affiliation(s)
- Sandhya Parasnath Dubey
- Department of Computer Science & Eng., Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, Karnataka 576104, India
| | - S. Balaji
- Department of Biotechnology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, Karnataka 576104, India
| | - N. Gopalakrishna Kini
- Department of Computer Science & Eng., Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, Karnataka 576104, India
| | - M. Sathish Kumar
- Department of ECE, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, Karnataka 576104, India
| |
Collapse
|
7
|
Yang CH, Lin YS, Chuang LY, Chang HW. A Particle Swarm Optimization-Based Approach with Local Search for Predicting Protein Folding. J Comput Biol 2017; 24:981-994. [PMID: 28287821 DOI: 10.1089/cmb.2016.0104] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The hydrophobic-polar (HP) model is commonly used for predicting protein folding structures and hydrophobic interactions. This study developed a particle swarm optimization (PSO)-based algorithm combined with local search algorithms; specifically, the high exploration PSO (HEPSO) algorithm (which can execute global search processes) was combined with three local search algorithms (hill-climbing algorithm, greedy algorithm, and Tabu table), yielding the proposed HE-L-PSO algorithm. By using 20 known protein structures, we evaluated the performance of the HE-L-PSO algorithm in predicting protein folding in the HP model. The proposed HE-L-PSO algorithm exhibited favorable performance in predicting both short and long amino acid sequences with high reproducibility and stability, compared with seven reported algorithms. The HE-L-PSO algorithm yielded optimal solutions for all predicted protein folding structures. All HE-L-PSO-predicted protein folding structures possessed a hydrophobic core that is similar to normal protein folding.
Collapse
Affiliation(s)
- Cheng-Hong Yang
- 1 Department of Electronic Engineering, National Kaohsiung University of Applied Sciences , Kaohsiung, Taiwan
| | - Yu-Shiun Lin
- 1 Department of Electronic Engineering, National Kaohsiung University of Applied Sciences , Kaohsiung, Taiwan
| | - Li-Yeh Chuang
- 2 Department of Chemical Engineering, Institute of Biotechnology and Chemical Engineering, I-Shou University , Kaohsiung, Taiwan
| | - Hsueh-Wei Chang
- 3 Institute of Medical Science and Technology, National Sun Yat-sen University , Kaohsiung, Taiwan .,4 Department of Medical Research, Kaohsiung Medical University Hospital , Kaohsiung, Taiwan .,5 Department of Biomedical Science and Environmental Biology, Kaohsiung Medical University , Kaohsiung, Taiwan
| |
Collapse
|
8
|
A Multi-Objective Approach for Protein Structure Prediction Based on an Energy Model and Backbone Angle Preferences. Int J Mol Sci 2015; 16:15136-49. [PMID: 26151847 PMCID: PMC4519891 DOI: 10.3390/ijms160715136] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2015] [Revised: 06/25/2015] [Accepted: 06/25/2015] [Indexed: 11/17/2022] Open
Abstract
Protein structure prediction (PSP) is concerned with the prediction of protein tertiary structure from primary structure and is a challenging calculation problem. After decades of research effort, numerous solutions have been proposed for optimisation methods based on energy models. However, further investigation and improvement is still needed to increase the accuracy and similarity of structures. This study presents a novel backbone angle preference factor, which is one of the factors inducing protein folding. The proposed multiobjective optimisation approach simultaneously considers energy models and backbone angle preferences to solve the ab initio PSP. To prove the effectiveness of the multiobjective optimisation approach based on the energy models and backbone angle preferences, 75 amino acid sequences with lengths ranging from 22 to 88 amino acids were selected from the CB513 data set to be the benchmarks. The data sets were highly dissimilar, therefore indicating that they are meaningful. The experimental results showed that the root-mean-square deviation (RMSD) of the multiobjective optimization approach based on energy model and backbone angle preferences was superior to those of typical energy models, indicating that the proposed approach can facilitate the ab initio PSP.
Collapse
|
9
|
Shehu A. A Review of Evolutionary Algorithms for Computing Functional Conformations of Protein Molecules. METHODS IN PHARMACOLOGY AND TOXICOLOGY 2015. [DOI: 10.1007/7653_2015_47] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
|
10
|
Brown MS, Bennett T, Coker JA. Niche Genetic Algorithms are better than traditional Genetic Algorithms for de novo Protein Folding. F1000Res 2014. [DOI: 10.12688/f1000research.5412.1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Here we demonstrate that Niche Genetic Algorithms (NGA) are better at computing de novo protein folding than traditional Genetic Algorithms (GA). Previous research has shown that proteins can fold into their active forms in a limited number of ways; however, predicting how a set of amino acids will fold starting from the primary structure is still a mystery. GAs have a unique ability to solve these types of scientific problems because of their computational efficiency. Unfortunately, GAs are generally quite poor at solving problems with multiple optima. However, there is a special group of GAs called Niche Genetic Algorithms (NGA) that are quite good at solving problems with multiple optima. In this study, we use a specific NGA: the Dynamic-radius Species-conserving Genetic Algorithm (DSGA), and show that DSGA is very adept at predicting the folded state of proteins, and that DSGA is better than a traditional GA in deriving the correct folding pattern of a protein.
Collapse
|
11
|
Liu J, Song B, Yao Y, Xue Y, Liu W, Liu Z. Wang-Landau sampling in face-centered-cubic hydrophobic-hydrophilic lattice model proteins. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 90:042715. [PMID: 25375531 DOI: 10.1103/physreve.90.042715] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2014] [Indexed: 06/04/2023]
Abstract
Finding the global minimum-energy structure is one of the main problems of protein structure prediction. The face-centered-cubic (fcc) hydrophobic-hydrophilic (HP) lattice model can reach high approximation ratios of real protein structures, so the fcc lattice model is a good choice to predict the protein structures. The lacking of an effective global optimization method is the key obstacle in solving this problem. The Wang-Landau sampling method is especially useful for complex systems with a rough energy landscape and has been successfully applied to solving many optimization problems. We apply the improved Wang-Landau (IWL) sampling method, which incorporates the generation of an initial conformation based on the greedy strategy and the neighborhood strategy based on pull moves into the Wang-Landau sampling method to predict the protein structures on the fcc HP lattice model. Unlike conventional Monte Carlo simulations that generate a probability distribution at a given temperature, the Wang-Landau sampling method can estimate the density of states accurately via a random walk, which produces a flat histogram in energy space. We test 12 general benchmark instances on both two-dimensional and three-dimensional (3D) fcc HP lattice models. The lowest energies by the IWL sampling method are as good as or better than those of other methods in the literature for all instances. We then test five sets of larger-scale instances, denoted by the S, R, F90, F180, and CASP target instances on the 3D fcc HP lattice model. The numerical results show that our algorithm performs better than the other five methods in the literature on both the lowest energies and the average lowest energies in all runs. The IWL sampling method turns out to be a powerful tool to study the structure prediction of the fcc HP lattice model proteins.
Collapse
Affiliation(s)
- Jingfa Liu
- Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science & Technology, Nanjing, 210044, China and School of Computer & Software, Nanjing University of Information Science & Technology, Nanjing, 210044, China and Network Information Center, Nanjing University of Information Science & Technology, Nanjing 210044, China
| | - Beibei Song
- Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science & Technology, Nanjing, 210044, China and School of Computer & Software, Nanjing University of Information Science & Technology, Nanjing, 210044, China
| | - Yonglei Yao
- Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science & Technology, Nanjing, 210044, China and School of Computer & Software, Nanjing University of Information Science & Technology, Nanjing, 210044, China
| | - Yu Xue
- Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science & Technology, Nanjing, 210044, China and School of Computer & Software, Nanjing University of Information Science & Technology, Nanjing, 210044, China
| | - Wenjie Liu
- Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science & Technology, Nanjing, 210044, China and School of Computer & Software, Nanjing University of Information Science & Technology, Nanjing, 210044, China
| | - Zhaoxia Liu
- Network Information Center, Nanjing University of Information Science & Technology, Nanjing 210044, China
| |
Collapse
|
12
|
Tsay JJ, Su SC. An effective evolutionary algorithm for protein folding on 3D FCC HP model by lattice rotation and generalized move sets. Proteome Sci 2013; 11:S19. [PMID: 24565217 PMCID: PMC3908773 DOI: 10.1186/1477-5956-11-s1-s19] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Background Proteins are essential biological molecules which play vital roles in nearly all biological processes. It is the tertiary structure of a protein that determines its functions. Therefore the prediction of a protein's tertiary structure based on its primary amino acid sequence has long been the most important and challenging subject in biochemistry, molecular biology and biophysics. In the past, the HP lattice model was one of the ab initio methods that many researchers used to forecast the protein structure. Although these kinds of simplified methods could not achieve high resolution, they provided a macrocosm-optimized protein structure. The model has been employed to investigate general principles of protein folding, and plays an important role in the prediction of protein structures. Methods In this paper, we present an improved evolutionary algorithm for the protein folding problem. We study the problem on the 3D FCC lattice HP model which has been widely used in previous research. Our focus is to develop evolutionary algorithms (EA) which are robust, easy to implement and can handle various energy functions. We propose to combine three different local search methods, including lattice rotation for crossover, K-site move for mutation, and generalized pull move; these form our key components to improve previous EA-based approaches. Results We have carried out experiments over several data sets which were used in previous research. The results of the experiments show that our approach is able to find optimal conformations which were not found by previous EA-based approaches. Conclusions We have investigated the geometric properties of the 3D FCC lattice and developed several local search techniques to improve traditional EA-based approaches to the protein folding problem. It is known that EA-based approaches are robust and can handle arbitrary energy functions. Our results further show that by extensive development of local searches, EA can also be very effective for finding optimal conformations on the 3D FCC HP model. Furthermore, the local searches developed in this paper can be integrated with other approaches such as the Monte Carlo and Tabu searches to improve their performance.
Collapse
Affiliation(s)
- Jyh-Jong Tsay
- Department of Computer Science and Information Engineering, National Chung Cheng University, 168 University Road, Minhsiung Township, Chiayi County 62102, Taiwan
| | - Shih-Chieh Su
- Department of Computer Science and Information Engineering, National Chung Cheng University, 168 University Road, Minhsiung Township, Chiayi County 62102, Taiwan
| |
Collapse
|
13
|
Liu J, Song B, Liu Z, Huang W, Sun Y, Liu W. Energy-landscape paving for prediction of face-centered-cubic hydrophobic-hydrophilic lattice model proteins. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2013; 88:052704. [PMID: 24329293 DOI: 10.1103/physreve.88.052704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2013] [Indexed: 06/03/2023]
Abstract
Protein structure prediction (PSP) is a classical NP-hard problem in computational biology. The energy-landscape paving (ELP) method is a class of heuristic global optimization algorithm, and has been successfully applied to solving many optimization problems with complex energy landscapes in the continuous space. By putting forward a new update mechanism of the histogram function in ELP and incorporating the generation of initial conformation based on the greedy strategy and the neighborhood search strategy based on pull moves into ELP, an improved energy-landscape paving (ELP+) method is put forward. Twelve general benchmark instances are first tested on both two-dimensional and three-dimensional (3D) face-centered-cubic (fcc) hydrophobic-hydrophilic (HP) lattice models. The lowest energies by ELP+ are as good as or better than those of other methods in the literature for all instances. Then, five sets of larger-scale instances, denoted by S, R, F90, F180, and CASP target instances on the 3D FCC HP lattice model are tested. The proposed algorithm finds lower energies than those by the five other methods in literature. Not unexpectedly, this is particularly pronounced for the longer sequences considered. Computational results show that ELP+ is an effective method for PSP on the fcc HP lattice model.
Collapse
Affiliation(s)
- Jingfa Liu
- Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science and Technology, Nanjing 210044, China and Network Information Center, Nanjing University of Information Science and Technology, Nanjing 210044, China
| | - Beibei Song
- Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science and Technology, Nanjing 210044, China and School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing 210044, China
| | - Zhaoxia Liu
- Network Information Center, Nanjing University of Information Science and Technology, Nanjing 210044, China
| | - Weibo Huang
- Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science and Technology, Nanjing 210044, China and School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing 210044, China
| | - Yuanyuan Sun
- Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science and Technology, Nanjing 210044, China and School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing 210044, China
| | - Wenjie Liu
- Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science and Technology, Nanjing 210044, China and School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing 210044, China
| |
Collapse
|
14
|
Manning T, Sleator RD, Walsh P. Naturally selecting solutions: the use of genetic algorithms in bioinformatics. Bioengineered 2013; 4:266-78. [PMID: 23222169 PMCID: PMC3813526 DOI: 10.4161/bioe.23041] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2012] [Revised: 11/26/2012] [Accepted: 11/28/2012] [Indexed: 11/19/2022] Open
Abstract
For decades, computer scientists have looked to nature for biologically inspired solutions to computational problems; ranging from robotic control to scheduling optimization. Paradoxically, as we move deeper into the post-genomics era, the reverse is occurring, as biologists and bioinformaticians look to computational techniques, to solve a variety of biological problems. One of the most common biologically inspired techniques are genetic algorithms (GAs), which take the Darwinian concept of natural selection as the driving force behind systems for solving real world problems, including those in the bioinformatics domain. Herein, we provide an overview of genetic algorithms and survey some of the most recent applications of this approach to bioinformatics based problems.
Collapse
Affiliation(s)
- Timmy Manning
- Department of Computer Science; Cork Institute of Technology; Cork, Ireland
| | - Roy D Sleator
- Department of Biological Sciences; Cork Institute of Technology; Cork, Ireland
| | - Paul Walsh
- Department of Computer Science; Cork Institute of Technology; Cork, Ireland
| |
Collapse
|