1
|
Bouqellah NA, Farag PF. In Silico Evaluation, Phylogenetic Analysis, and Structural Modeling of the Class II Hydrophobin Family from Different Fungal Phytopathogens. Microorganisms 2023; 11:2632. [PMID: 38004644 PMCID: PMC10672791 DOI: 10.3390/microorganisms11112632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 10/20/2023] [Accepted: 10/24/2023] [Indexed: 11/26/2023] Open
Abstract
The class II hydrophobin group (HFBII) is an extracellular group of proteins that contain the HFBII domain and eight conserved cysteine residues. These proteins are exclusively secreted by fungi and have multiple functions with a probable role as effectors. In the present study, a total of 45 amino acid sequences of hydrophobin class II proteins from different phytopathogenic fungi were retrieved from the NCBI database. We used the integration of well-designed bioinformatic tools to characterize and predict their physicochemical parameters, novel motifs, 3D structures, multiple sequence alignment (MSA), evolution, and functions as effector proteins through molecular docking. The results revealed new features for these protein members. The ProtParam tool detected the hydrophobicity properties of all proteins except for one hydrophilic protein (KAI3335996.1). Out of 45 proteins, six of them were detected as GPI-anchored proteins by the PredGPI server. Different 3D structure templates with high pTM scores were designed by Multifold v1, AlphaFold2, and trRosetta. Most of the studied proteins were anticipated as apoplastic effectors and matched with the ghyd5 gene of Fusarium graminearum as virulence factors. A protein-protein interaction (PPI) analysis unraveled the molecular function of this group as GTP-binding proteins, while a molecular docking analysis detected a chitin-binding effector role. From the MSA analysis, it was observed that the HFBII sequences shared conserved 2 Pro (P) and 2 Gly (G) amino acids besides the known eight conserved cysteine residues. The evolutionary analysis and phylogenetic tree provided evidence of episodic diversifying selection at the branch level using the aBSREL tool. A detailed in silico analysis of this family and the present findings will provide a better understanding of the HFBII characters and evolutionary relationships, which could be very useful in future studies.
Collapse
Affiliation(s)
- Nahla A. Bouqellah
- Department of Biology, College of Science, Taibah University, P.O. Box 344, Al Madinah Al Munawwarah 42317-8599, Saudi Arabia
| | - Peter F. Farag
- Department of Microbiology, Faculty of Science, Ain Shams University, Cairo 11566, Egypt;
| |
Collapse
|
2
|
Pande A, Patiyal S, Lathwal A, Arora C, Kaur D, Dhall A, Mishra G, Kaur H, Sharma N, Jain S, Usmani SS, Agrawal P, Kumar R, Kumar V, Raghava GPS. Pfeature: A Tool for Computing Wide Range of Protein Features and Building Prediction Models. J Comput Biol 2023; 30:204-222. [PMID: 36251780 DOI: 10.1089/cmb.2022.0241] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
In the last three decades, a wide range of protein features have been discovered to annotate a protein. Numerous attempts have been made to integrate these features in a software package/platform so that the user may compute a wide range of features from a single source. To complement the existing methods, we developed a method, Pfeature, for computing a wide range of protein features. Pfeature allows to compute more than 200,000 features required for predicting the overall function of a protein, residue-level annotation of a protein, and function of chemically modified peptides. It has six major modules, namely, composition, binary profiles, evolutionary information, structural features, patterns, and model building. Composition module facilitates to compute most of the existing compositional features, plus novel features. The binary profile of amino acid sequences allows to compute the fraction of each type of residue as well as its position. The evolutionary information module allows to compute evolutionary information of a protein in the form of a position-specific scoring matrix profile generated using Position-Specific Iterative Basic Local Alignment Search Tool (PSI-BLAST); fit for annotation of a protein and its residues. A structural module was developed for computing of structural features/descriptors from a tertiary structure of a protein. These features are suitable to predict the therapeutic potential of a protein containing non-natural or chemically modified residues. The model-building module allows to implement various machine learning techniques for developing classification and regression models as well as feature selection. Pfeature also allows the generation of overlapping patterns and features from a protein. A user-friendly Pfeature is available as a web server python library and stand-alone package.
Collapse
Affiliation(s)
- Akshara Pande
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Sumeet Patiyal
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Anjali Lathwal
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Chakit Arora
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Dilraj Kaur
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Anjali Dhall
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Gaurav Mishra
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.,Department of Electrical Engineering, Shiv Nadar University, Greater Noida, India
| | - Harpreet Kaur
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.,Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Neelam Sharma
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Shipra Jain
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Salman Sadullah Usmani
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.,Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Piyush Agrawal
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.,Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Rajesh Kumar
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.,Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Vinod Kumar
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.,Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Gajendra P S Raghava
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| |
Collapse
|
3
|
Rademaker DT, Xue LC, ‘t Hoen PAC, Vriend G. Entropy and Variability: A Second Opinion by Deep Learning. Biomolecules 2022; 12:biom12121740. [PMID: 36551168 PMCID: PMC9775329 DOI: 10.3390/biom12121740] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 11/13/2022] [Accepted: 11/19/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Analysis of the distribution of amino acid types found at equivalent positions in multiple sequence alignments has found applications in human genetics, protein engineering, drug design, protein structure prediction, and many other fields. These analyses tend to revolve around measures of the distribution of the twenty amino acid types found at evolutionary equivalent positions: the columns in multiple sequence alignments. Commonly used measures are variability, average hydrophobicity, or Shannon entropy. One of these techniques, called entropy-variability analysis, as the name already suggests, reduces the distribution of observed residue types in one column to two numbers: the Shannon entropy and the variability as defined by the number of residue types observed. RESULTS We applied a deep learning, unsupervised feature extraction method to analyse the multiple sequence alignments of all human proteins. An auto-encoder neural architecture was trained on 27,835 multiple sequence alignments for human proteins to obtain the two features that best describe the seven million variability patterns. These two unsupervised learned features strongly resemble entropy and variability, indicating that these are the projections that retain most information when reducing the dimensionality of the information hidden in columns in multiple sequence alignments.
Collapse
Affiliation(s)
- Daniel T. Rademaker
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, 260 Nijmegen, The Netherlands
| | - Li C. Xue
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, 260 Nijmegen, The Netherlands
| | - Peter A. C. ‘t Hoen
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, 260 Nijmegen, The Netherlands
| | - Gert Vriend
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, 260 Nijmegen, The Netherlands
- Baco Institute for Protein Science (BIPS), Mindoro 5201, Philippines
- Correspondence:
| |
Collapse
|
4
|
Silva JM, Pratas D, Caetano T, Matos S. The complexity landscape of viral genomes. Gigascience 2022; 11:giac079. [PMID: 35950839 PMCID: PMC9366995 DOI: 10.1093/gigascience/giac079] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 05/25/2022] [Accepted: 07/26/2022] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Viruses are among the shortest yet highly abundant species that harbor minimal instructions to infect cells, adapt, multiply, and exist. However, with the current substantial availability of viral genome sequences, the scientific repertory lacks a complexity landscape that automatically enlights viral genomes' organization, relation, and fundamental characteristics. RESULTS This work provides a comprehensive landscape of the viral genome's complexity (or quantity of information), identifying the most redundant and complex groups regarding their genome sequence while providing their distribution and characteristics at a large and local scale. Moreover, we identify and quantify inverted repeats abundance in viral genomes. For this purpose, we measure the sequence complexity of each available viral genome using data compression, demonstrating that adequate data compressors can efficiently quantify the complexity of viral genome sequences, including subsequences better represented by algorithmic sources (e.g., inverted repeats). Using a state-of-the-art genomic compressor on an extensive viral genomes database, we show that double-stranded DNA viruses are, on average, the most redundant viruses while single-stranded DNA viruses are the least. Contrarily, double-stranded RNA viruses show a lower redundancy relative to single-stranded RNA. Furthermore, we extend the ability of data compressors to quantify local complexity (or information content) in viral genomes using complexity profiles, unprecedently providing a direct complexity analysis of human herpesviruses. We also conceive a features-based classification methodology that can accurately distinguish viral genomes at different taxonomic levels without direct comparisons between sequences. This methodology combines data compression with simple measures such as GC-content percentage and sequence length, followed by machine learning classifiers. CONCLUSIONS This article presents methodologies and findings that are highly relevant for understanding the patterns of similarity and singularity between viral groups, opening new frontiers for studying viral genomes' organization while depicting the complexity trends and classification components of these genomes at different taxonomic levels. The whole study is supported by an extensive website (https://asilab.github.io/canvas/) for comprehending the viral genome characterization using dynamic and interactive approaches.
Collapse
Affiliation(s)
- Jorge Miguel Silva
- Institute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Diogo Pratas
- Institute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
- Department of Electronics Telecommunications and Informatics, University of Aveiro, Campus Universitario de Santiago, 3810-193 Aveiro, Portugal
- Department of Virology, University of Helsinki, Haartmaninkatu 3, 00014 Helsinki, Finland
| | - Tânia Caetano
- Department of Biology, University of Aveiro, Campus Universitario de Santiago, 3810-193 Aveiro, Portugal
| | - Sérgio Matos
- Institute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
- Department of Electronics Telecommunications and Informatics, University of Aveiro, Campus Universitario de Santiago, 3810-193 Aveiro, Portugal
| |
Collapse
|
5
|
Abstract
Programmed cell death protein-1 (PD-1) inhibitory antibodies, often referred to as checkpoint inhibitors, have revolutionized the treatment of cancer. However, there is an unmet need for PD-1 agonists to treat autoimmune disorders. Herein, we describe the de novo design of a small, stable PD-1 binding protein and development of a synthetic PD-1 agonist. Programmed cell death protein-1 (PD-1) expressed on activated T cells inhibits T cell function and proliferation to prevent an excessive immune response, and disease can result if this delicate balance is shifted in either direction. Tumor cells often take advantage of this pathway by overexpressing the PD-1 ligand PD-L1 to evade destruction by the immune system. Alternatively, if there is a decrease in function of the PD-1 pathway, unchecked activation of the immune system and autoimmunity can result. Using a combination of computation and experiment, we designed a hyperstable 40-residue miniprotein, PD-MP1, that specifically binds murine and human PD-1 at the PD-L1 interface with a Kd of ∼100 nM. The apo crystal structure shows that the binder folds as designed with a backbone RMSD of 1.3 Å to the design model. Trimerization of PD-MP1 resulted in a PD-1 agonist that strongly inhibits murine T cell activation. This small, hyperstable PD-1 binding protein was computationally designed with an all-beta interface, and the trimeric agonist could contribute to treatments for autoimmune and inflammatory diseases.
Collapse
|
6
|
AC2: An Efficient Protein Sequence Compression Tool Using Artificial Neural Networks and Cache-Hash Models. ENTROPY 2021; 23:e23050530. [PMID: 33925812 PMCID: PMC8146440 DOI: 10.3390/e23050530] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Revised: 04/19/2021] [Accepted: 04/22/2021] [Indexed: 12/28/2022]
Abstract
Recently, the scientific community has witnessed a substantial increase in the generation of protein sequence data, triggering emergent challenges of increasing importance, namely efficient storage and improved data analysis. For both applications, data compression is a straightforward solution. However, in the literature, the number of specific protein sequence compressors is relatively low. Moreover, these specialized compressors marginally improve the compression ratio over the best general-purpose compressors. In this paper, we present AC2, a new lossless data compressor for protein (or amino acid) sequences. AC2 uses a neural network to mix experts with a stacked generalization approach and individual cache-hash memory models to the highest-context orders. Compared to the previous compressor (AC), we show gains of 2–9% and 6–7% in reference-free and reference-based modes, respectively. These gains come at the cost of three times slower computations. AC2 also improves memory usage against AC, with requirements about seven times lower, without being affected by the sequences’ input size. As an analysis application, we use AC2 to measure the similarity between each SARS-CoV-2 protein sequence with each viral protein sequence from the whole UniProt database. The results consistently show higher similarity to the pangolin coronavirus, followed by the bat and human coronaviruses, contributing with critical results to a current controversial subject. AC2 is available for free download under GPLv3 license.
Collapse
|
7
|
Goutham S, Kumari I, Pally D, Singh A, Ghosh S, Akhter Y, Bhat R. Mutually exclusive locales for N-linked glycans and disorder in human glycoproteins. Sci Rep 2020; 10:6040. [PMID: 32269229 PMCID: PMC7142085 DOI: 10.1038/s41598-020-61427-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Accepted: 01/30/2020] [Indexed: 11/08/2022] Open
Abstract
Several post-translational protein modifications lie predominantly within regions of disorder: the biased localization has been proposed to expand the binding versatility of disordered regions. However, investigating a representative dataset of 500 human N-glycoproteins, we observed the sites of N-linked glycosylations or N-glycosites, to be predominantly present in the regions of predicted order. When compared with disordered stretches, ordered regions were not found to be enriched for asparagines, serines and threonines, residues that constitute the sequon signature for conjugation of N-glycans. We then investigated the basis of mutual exclusivity between disorder and N-glycosites on the basis of amino acid distribution: when compared with control ordered residue stretches without any N-glycosites, residue neighborhoods surrounding N-glycosites showed a depletion of bulky, hydrophobic and disorder-promoting amino acids and an enrichment for flexible and accessible residues that are frequently found in coiled structures. When compared with control disordered residue stretches without any N-glycosites, N-glycosite neighborhoods were depleted of charged, polar, hydrophobic and flexible residues and enriched for aromatic, accessible and order-promoting residues with a tendency to be part of coiled and β structures. N-glycosite neighborhoods also showed greater phylogenetic conservation among amniotes, compared with control ordered regions, which in turn were more conserved than disordered control regions. Our results lead us to propose that unique primary structural compositions and differential propensities for evolvability allowed for the mutual spatial exclusion of N-glycosite neighborhoods and disordered stretches.
Collapse
Affiliation(s)
- Shyamili Goutham
- Department of Molecular Reproduction, Development and Genetics, Indian Institute of Sciences, Bangalore, 560012, India
| | - Indu Kumari
- School of Earth and Environmental Sciences, Central University of Himachal Pradesh, District-Kangra, Shahpur, Himachal Pradesh, 176206, India
| | - Dharma Pally
- Department of Molecular Reproduction, Development and Genetics, Indian Institute of Sciences, Bangalore, 560012, India
| | - Alvina Singh
- Department of Molecular Reproduction, Development and Genetics, Indian Institute of Sciences, Bangalore, 560012, India
| | - Sujasha Ghosh
- Department of Molecular Reproduction, Development and Genetics, Indian Institute of Sciences, Bangalore, 560012, India
| | - Yusuf Akhter
- Department of Biotechnology, Babasaheb Bhimrao Ambedkar University, Vidya Vihar, Raebareli Road, Lucknow, Uttar Pradesh, 226025, India
| | - Ramray Bhat
- Department of Molecular Reproduction, Development and Genetics, Indian Institute of Sciences, Bangalore, 560012, India.
| |
Collapse
|
8
|
Serafimova K, Mihaylov I, Vassilev D, Avdjieva I, Zielenkiewicz P, Kaczanowski S. Using Machine Learning in Accuracy Assessment of Knowledge-Based Energy and Frequency Base Likelihood in Protein Structures. LECTURE NOTES IN COMPUTER SCIENCE 2020. [PMCID: PMC7304015 DOI: 10.1007/978-3-030-50420-5_43] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Abstract
Many aspects of the study of protein folding and dynamics have been affected by the accumulation of data about native protein structures and recent advances in machine learning. Computational methods for predicting protein structures from their sequences are now heavily based on machine learning tools and on approaches that extract knowledge and rules from data using probabilistic models. Many of these methods use scoring functions to determine which structure best fits a native protein sequence. Using computational approaches, we obtained two scoring functions: knowledge-based energy and likelihood of base frequency, and we compared their accuracy in measuring the sequence structure fit. We compared the machine learning models’ accuracy of predictions for knowledge-based energy and likelihood values to validate our results, showing that likelihood is a more accurate scoring function than knowledge-based energy.
Collapse
|
9
|
Hosseini M, Pratas D, Pinho AJ. AC: A Compression Tool for Amino Acid Sequences. Interdiscip Sci 2019; 11:68-76. [PMID: 30721401 DOI: 10.1007/s12539-019-00322-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2018] [Revised: 01/23/2019] [Accepted: 01/28/2019] [Indexed: 10/27/2022]
|
10
|
Bywater RP. Why twenty amino acid residue types suffice(d) to support all living systems. PLoS One 2018; 13:e0204883. [PMID: 30321190 PMCID: PMC6188899 DOI: 10.1371/journal.pone.0204883] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2017] [Accepted: 09/17/2018] [Indexed: 11/21/2022] Open
Abstract
It is well known that proteins are built up from an alphabet of 20 different amino acid types. These suffice to enable the protein to fold into its operative form relevant to its required functional roles. For carrying out these allotted functions, there may in some cases be a need for post-translational modifications and it has been established that an additional three types of amino acid have at some point been recruited into this process. But it still remains the case that the 20 residue types referred to are the major building blocks in all terrestrial proteins, and probably "universally". Given this fact, it is surprising that no satisfactory answer has been given to the two questions: "why 20?" and "why just these 20?". Furthermore, a suggestion is made as to how these 20 map to the codon repertoire which in principle has the capacity to cater for 64 different residue types. Attempts are made in this paper to answer these questions by employing a combination of quantum chemical and chemoinformatic tools which are applied to the standard 20 amino acid types as well as 3 “non-standard” types found in nature, a set of fictitious but feasible analog structures designed to test the need for greater coverage of function space and the collection of candidate alternative structures found either on meteorites or in experiments designed to reconstruct pre-life scenarios.
Collapse
|
11
|
Jeremić M, Pantić I, Jakšić M. The influence of lithium sulphate on Shannon entropy in lymphocyte chromatin. MEDICINSKI PODMLADAK 2018. [DOI: 10.5937/mp69-13677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022] Open
|
12
|
Bywater RP. A tensegrity model for hydrogen bond networks in proteins. Heliyon 2017; 3:e00307. [PMID: 28607953 PMCID: PMC5454140 DOI: 10.1016/j.heliyon.2017.e00307] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2017] [Revised: 05/15/2017] [Accepted: 05/19/2017] [Indexed: 12/04/2022] Open
Abstract
Hydrogen-bonding networks in proteins considered as structural tensile elements are in balance separately from any other stabilising interactions that may be in operation. The hydrogen bond arrangement in the network is reminiscent of tensegrity structures in architecture and sculpture. Tensegrity has been discussed before in cells and tissues and in proteins. In contrast to previous work only hydrogen bonds are studied here. The other interactions within proteins are either much stronger − covalent bonds connecting the atoms in the molecular skeleton or weaker forces like the so-called hydrophobic interactions. It has been demonstrated that the latter operate independently from hydrogen bonds. Each category of interaction must, if the protein is to have a stable structure, balance out. The hypothesis here is that the entire hydrogen bond network is in balance without any compensating contributions from other types of interaction. For sidechain-sidechain, sidechain-backbone and backbone-backbone hydrogen bonds in proteins, tensegrity balance (“closure”) is required over the entire length of the polypeptide chain that defines individually folding units in globular proteins (“domains”) as well as within the repeating elements in fibrous proteins that consist of extended chain structures. There is no closure to be found in extended structures that do not have repeating elements. This suggests an explanation as to why globular domains, as well as the repeat units in fibrous proteins, have to have a defined number of residues. Apart from networks of sidechain-sidechain hydrogen bonds there are certain key points at which this closure is achieved in the sidechain-backbone hydrogen bonds and these are associated with demarcation points at the start or end of stretches of secondary structure. Together, these three categories of hydrogen bond achieve the closure that is necessary for the stability of globular protein domains as well as repeating elements in fibrous proteins.
Collapse
|
13
|
Pratas D, Pinho AJ. On the Approximation of the Kolmogorov Complexity for DNA Sequences. PATTERN RECOGNITION AND IMAGE ANALYSIS 2017. [DOI: 10.1007/978-3-319-58838-4_29] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
14
|
Pantic I, Petrovic D, Paunovic J, Vucevic D, Radosavljevic T, Pantic S. Age-related reduction of chromatin fractal dimension in toluidine blue - stained hepatocytes. Mech Ageing Dev 2016; 157:30-4. [PMID: 27412950 DOI: 10.1016/j.mad.2016.07.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2016] [Revised: 05/26/2016] [Accepted: 07/06/2016] [Indexed: 02/05/2023]
Abstract
In this study, we proposed a hypothesis that chromatin of mouse hepatocytes exhibits age-related reduction of fractal dimension. This hypothesis was based on previously published works demonstrating that complexity of biological systems such as tissues, decreases during the process of physiological aging. Liver tissue was obtained from 24 male mice divided into 3 age groups: 10-days-old (young, juvenile), 210-days-old (adult) and 390-days-old. The tissue was stained using a modification of toluidine blue (nucleic acid - specific) staining method. A total of 480 chromatin structures (20 for each animal) were analyzed. For each structure, the values of fractal dimension, lacunarity, textural angular second moment and inverse difference moment were calculated using ImageJ software and its plugins. The results indicated the age-related reduction in fractal dimension and increase in lacunarity (p<0.01). Fractal dimension is a potentially good indicator of age associated changes in chromatin structure. To our knowledge, this is the first study to show that fractal complexity of hepatocyte chromatin decreases during the process of physiological aging. Fractal analysis as a method could be useful in detection of small age-related changes in chromatin distribution not otherwise visible with naked eye on conventional tissue micrographs.
Collapse
Affiliation(s)
- Igor Pantic
- Laboratory for Cellular Physiology, Institute of Medical Physiology, Faculty of Medicine University of Belgrade, Visegradska 26/II, RS-11129, Belgrade, Serbia.
| | - Danica Petrovic
- Health Center Studenica, Kraljevo, Jug Bogdanova 112, RS-36000, Kraljevo, Serbia
| | - Jovana Paunovic
- Laboratory for Cellular Physiology, Institute of Medical Physiology, Faculty of Medicine University of Belgrade, Visegradska 26/II, RS-11129, Belgrade, Serbia
| | - Danijela Vucevic
- Institute of pathological physiology, School of Medicine, University of Belgrade, Dr Subotica 9, RS-11129, Belgrade, Serbia
| | - Tatjana Radosavljevic
- Institute of pathological physiology, School of Medicine, University of Belgrade, Dr Subotica 9, RS-11129, Belgrade, Serbia
| | - Senka Pantic
- Institute of Histology and Embryology, School of Medicine, University of Belgrade Visegradska 26/II, RS-11129, Belgrade, Serbia
| |
Collapse
|
15
|
Bywater RP. Comparison of Algorithms for Prediction of Protein Structural Features from Evolutionary Data. PLoS One 2016; 11:e0150769. [PMID: 26963911 PMCID: PMC4786192 DOI: 10.1371/journal.pone.0150769] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2015] [Accepted: 02/17/2016] [Indexed: 11/18/2022] Open
Abstract
Proteins have many functions and predicting these is still one of the major challenges in theoretical biophysics and bioinformatics. Foremost amongst these functions is the need to fold correctly thereby allowing the other genetically dictated tasks that the protein has to carry out to proceed efficiently. In this work, some earlier algorithms for predicting protein domain folds are revisited and they are compared with more recently developed methods. In dealing with intractable problems such as fold prediction, when different algorithms show convergence onto the same result there is every reason to take all algorithms into account such that a consensus result can be arrived at. In this work it is shown that the application of different algorithms in protein structure prediction leads to results that do not converge as such but rather they collude in a striking and useful way that has never been considered before.
Collapse
|