1
|
Nounou MN, Nounou HN, Meskin N, Datta A, Dougherty ER. Multiscale denoising of biological data: a comparative analysis. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:1539-1544. [PMID: 22566476 DOI: 10.1109/tcbb.2012.67] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Measured microarray genomic and metabolic data are a rich source of information about the biological systems they represent. For example, time-series biological data can be used to construct dynamic genetic regulatory network models, which can be used to design intervention strategies to cure or manage major diseases. Also, copy number data can be used to determine the locations and extent of aberrations in chromosome sequences. Unfortunately, measured biological data are usually contaminated with errors that mask the important features in the data. Therefore, these noisy measurements need to be filtered to enhance their usefulness in practice. Wavelet-based multiscale filtering has been shown to be a powerful denoising tool. In this work, different batch as well as online multiscale filtering techniques are used to denoise biological data contaminated with white or colored noise. The performances of these techniques are demonstrated and compared to those of some conventional low-pass filters using two case studies. The first case study uses simulated dynamic metabolic data, while the second case study uses real copy number data. Simulation results show that significant improvement can be achieved using multiscale filtering over conventional filtering techniques.
Collapse
Affiliation(s)
- M N Nounou
- Chemical Engineering Program, Texas A&M University at Qatar, Doha, Qatar.
| | | | | | | | | |
Collapse
|
2
|
BENSON NOAHC, DAGGETT VALERIE. Wavelet Analysis of Protein Motion. INTERNATIONAL JOURNAL OF WAVELETS, MULTIRESOLUTION AND INFORMATION PROCESSING 2012; 10:1250040. [PMID: 25484480 PMCID: PMC4255908 DOI: 10.1142/s0219691312500403] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
As high-throughput molecular dynamics simulations of proteins become more common and the databases housing the results become larger and more prevalent, more sophisticated methods to quickly and accurately mine large numbers of trajectories for relevant information will have to be developed. One such method, which is only recently gaining popularity in molecular biology, is the continuous wavelet transform, which is especially well-suited for time course data such as molecular dynamics simulations. We describe techniques for the calculation and analysis of wavelet transforms of molecular dynamics trajectories in detail and present examples of how these techniques can be useful in data mining. We demonstrate that wavelets are sensitive to structural rearrangements in proteins and that they can be used to quickly detect physically relevant events. Finally, as an example of the use of this approach, we show how wavelet data mining has led to a novel hypothesis related to the mechanism of the protein γδ resolvase.
Collapse
Affiliation(s)
| | - VALERIE DAGGETT
- Department of Bioengineering, Box 355013, University of Washington, Seattle, WA 98195-5013
| |
Collapse
|
3
|
Martin RL, Gardiner EJ, Senger S, Gillet VJ. Compression of Molecular Interaction Fields Using Wavelet Thumbnails: Application to Molecular Alignment. J Chem Inf Model 2012; 52:757-69. [DOI: 10.1021/ci200348h] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Richard L. Martin
- Information School, University of Sheffield, Regent Court, 211 Portobello
Street, Sheffield S1 4DP, United Kingdom
- Computational and Structural Chemistry, GlaxoSmithKline Research & Development, Gunnels Wood Road, Stevenage, Hertfordshire SG1 2NY, United Kingdom
| | - Eleanor J. Gardiner
- Information School, University of Sheffield, Regent Court, 211 Portobello
Street, Sheffield S1 4DP, United Kingdom
| | - Stefan Senger
- Computational and Structural Chemistry, GlaxoSmithKline Research & Development, Gunnels Wood Road, Stevenage, Hertfordshire SG1 2NY, United Kingdom
| | - Valerie J. Gillet
- Information School, University of Sheffield, Regent Court, 211 Portobello
Street, Sheffield S1 4DP, United Kingdom
| |
Collapse
|
4
|
Ye L, Chen H, Liu T, Wu Z, Li J, Zhou R. A WAVELET APPROACH FOR THE ANALYSIS OF FOLDING TRAJECTORY OF PROTEIN TRP-CAGE. J Bioinform Comput Biol 2011; 3:1351-70. [PMID: 16374911 DOI: 10.1142/s0219720005001594] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2005] [Revised: 08/10/2005] [Accepted: 08/15/2005] [Indexed: 11/18/2022]
Abstract
Understanding how protein folds into a functional native structure is arguably one of the most challenging problems remaining in computational biology. Currently, the protein folding mechanism is often characterized by calculating the free energy landscape in terms of various reaction coordinates such as the fraction of native contacts, the radius of gyration, the RMS deviation from the native and so on. In this paper, we present a wavelet approach towards understanding the global state changes during protein folding. The approach is based on the wavelet analysis on the trajectories of various reaction coordinates to identify the significant intermediate states or structural motifs in the folding process. We demonstrate through an example protein Trp-cage that this approach extracts crucial information about protein folding intermediate states as well as the time correlation among these states. Furthermore, the current approach reveals a meaningful structural pattern that had been overlooked in previous works, which provides a better understanding of the folding mechanism as well as the limitation of the current force fields.
Collapse
Affiliation(s)
- Lei Ye
- Department of Computer Science, Zhejiang University, Hangzhou, China
| | | | | | | | | | | |
Collapse
|
5
|
Martin RL, Gardiner E, Gillet VJ, Muñoz-Muriedas J, Senger S. Wavelet Approximation of GRID Fields: Application to Quantitative Structure-Activity Relationships. Mol Inform 2010; 29:603-20. [PMID: 27463455 DOI: 10.1002/minf.201000066] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2010] [Accepted: 08/14/2010] [Indexed: 11/05/2022]
Abstract
Molecular interaction fields such as those computed by the GRID program are widely used in applications such as virtual screening, molecular docking and 3D-QSAR modelling. They characterise molecules according to their favourable interaction sites and therefore enable predictions to be made on how molecules might interact. The fields are, however, comprised of a very large number of data points which presents difficulties for many applications. For example, there are likely to be high degrees of correlation between the variables which can lead to misleading results in 3D-QSAR. We describe the use of wavelet methods for approximating such data into a much smaller number of variables. We present a number of validation experiments, including use of the approximated GRIDs in 3D-QSAR, and demonstrate that wavelet approximation at high levels of data compression preserves the information content in GRID fields while significantly reducing computational requirements.
Collapse
Affiliation(s)
- Richard L Martin
- Information School, University of Sheffield, Regent Court, 211 Portobello Street, Sheffield S1 4DP, UK
| | - Eleanor Gardiner
- Information School, University of Sheffield, Regent Court, 211 Portobello Street, Sheffield S1 4DP, UK
| | - Valerie J Gillet
- Information School, University of Sheffield, Regent Court, 211 Portobello Street, Sheffield S1 4DP, UK.
| | - Jordi Muñoz-Muriedas
- Computational and Structural Chemistry, GlaxoSmithKline Research & Development, Gunnels Wood Road, Stevenage, Hertfordshire SG1 2NY, UK
| | - Stefan Senger
- Computational and Structural Chemistry, GlaxoSmithKline Research & Development, Gunnels Wood Road, Stevenage, Hertfordshire SG1 2NY, UK
| |
Collapse
|
6
|
|
7
|
Kolling DJ, Brunzelle JS, Lhee S, Crofts AR, Nair SK. Atomic resolution structures of rieske iron-sulfur protein: role of hydrogen bonds in tuning the redox potential of iron-sulfur clusters. Structure 2007; 15:29-38. [PMID: 17223530 PMCID: PMC1868424 DOI: 10.1016/j.str.2006.11.012] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2006] [Revised: 11/16/2006] [Accepted: 11/16/2006] [Indexed: 11/26/2022]
Abstract
The Rieske [2Fe-2S] iron-sulfur protein of cytochrome bc(1) functions as the initial electron acceptor in the rate-limiting step of the catalytic reaction. Prior studies have established roles for a number of conserved residues that hydrogen bond to ligands of the [2Fe-2S] cluster. We have constructed site-specific variants at two of these residues, measured their thermodynamic and functional properties, and determined atomic resolution X-ray crystal structures for the native protein at 1.2 A resolution and for five variants (Ser-154-->Ala, Ser-154-->Thr, Ser-154-->Cys, Tyr-156-->Phe, and Tyr-156-->Trp) to resolutions between 1.5 A and 1.1 A. These structures and complementary biophysical data provide a molecular framework for understanding the role hydrogen bonds to the cluster play in tuning thermodynamic properties, and hence the rate of this bioenergetic reaction. These studies provide a detailed structure-function dissection of the role of hydrogen bonds in tuning the redox potentials of [2Fe-2S] clusters.
Collapse
Affiliation(s)
- Derrick J. Kolling
- Center for Biophysics and Computational Biology, University of Illinois at Urbana-Champaign, 600 S. Mathews Ave, Urbana, IL 61801 USA
| | - Joseph S. Brunzelle
- Life Sciences Collaborative Access Team, Argonne National Labs, Argonne, IL, USA
| | - SangMoon Lhee
- Center for Biophysics and Computational Biology, University of Illinois at Urbana-Champaign, 600 S. Mathews Ave, Urbana, IL 61801 USA
| | - Antony R. Crofts
- Department of Biochemistry, University of Illinois at Urbana-Champaign, 600 S. Mathews Ave, Urbana, IL 61801 USA
- Center for Biophysics and Computational Biology, University of Illinois at Urbana-Champaign, 600 S. Mathews Ave, Urbana, IL 61801 USA
| | - Satish K. Nair
- Department of Biochemistry, University of Illinois at Urbana-Champaign, 600 S. Mathews Ave, Urbana, IL 61801 USA
- Center for Biophysics and Computational Biology, University of Illinois at Urbana-Champaign, 600 S. Mathews Ave, Urbana, IL 61801 USA
- *Correspondence concerning this manuscript should be sent to: Dr. Satish K. Nair, Department of Biochemistry, University of Illinois at Urbana-Champaign, 600 S. Mathews Ave, Urbana, IL 61801 USA, Telephone: (217) 333-2688; Facsimile: (217) 244-5858 Electronic Mail:
| |
Collapse
|
8
|
Enomoto T, Lindström MS, Jin A, Ke H, Zhang Y. Essential role of the B23/NPM core domain in regulating ARF binding and B23 stability. J Biol Chem 2006; 281:18463-72. [PMID: 16679321 DOI: 10.1074/jbc.m602788200] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
How cells coordinate inhibition of growth and division during genotoxic events is fundamental to our understanding of the origin of cancer. Despite increasing interest and extensive study, the mechanisms that link regulation of DNA synthesis and ribosomal biogenesis remain elusive. Recently, the tumor suppressor p14(ARF) (ARF) has been shown to interact functionally with the nucleolar protein B23/NPM (B23) and inhibit rRNA biogenesis. However, the molecular basis of the ARF-B23 interaction is hitherto unclear. Here we show that a highly conserved motif in the B23 oligomerization domain is essential for mediating ARF binding in vivo. Mutagenesis of conserved B23 core residues (L102A, G105A, G107A) prevented B23 from interacting with ARF. Modeling of the B23 core indicated that substitutions in the GSGP loop motif could trigger conformational changes in B23 thereby obstructing ARF binding. Interestingly, the GSGP loop mutants were unstable, defective for oligomerization, and delocalized from the nucleolus to the nucleoplasm. B23 core mutants displayed increased ubiquitination and proteasomal degradation. We conclude that the functional integrity of the B23 core motif is required for stability, efficient nucleolar localization as well as ARF binding.
Collapse
Affiliation(s)
- Takeharu Enomoto
- Department of Radiation Oncology, University of North Carolina at Chapel Hill, NC 27599-7512, USA
| | | | | | | | | |
Collapse
|
9
|
Wang J, Crippen GM. Statistical mechanics of protein folding with separable energy functions. Biopolymers 2004; 74:214-20. [PMID: 15150796 DOI: 10.1002/bip.20077] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
We have initiated an entirely new approach to statistical mechanical models of strongly interacting systems where the configurational parameters and the potential energy function are both constructed so that the canonical partition function can be evaluated analytically. For a simplified model of proteins consisting of a single, fairly short polypeptide chain without cross-links, we can adjust the energy parameters to favor the experimentally determined native state of seven proteins having diverse types of folds. Then 497 test proteins are predicted to have stable native folds, even though they are also structurally diverse, and 480 of them have no significant sequence similarity to any of the training proteins.
Collapse
Affiliation(s)
- Jianyong Wang
- College of Pharmacy, University of Michigan, Ann Arbor, MI 48109-1065, USA
| | | |
Collapse
|
10
|
Abstract
Chemokine receptors represent a prime target for the development of novel therapeutic strategies in a variety of disease processes, including inflammation, allergy and neoplasia. Here we use maximum likelihood methods and bootstrap methods to investigate both the phylogenetic relationships in a large set of human chemokine receptor sequences and the relationships between chemokine receptors and their nearest neighbors. We found that CCR and CXCR families are not homogeneous. We also provide evidences that angiotensin receptors are the closest neighbors. Other close neighbors include opioid, somatostatin and melanin-concentrating hormone receptors. The phylogenetic analysis suggests ancient paralogous relationships and establishes a link between immune, metabolic and neural systems modulation. We complement our findings with a structural analysis based on wavelet methods of the major branches of chemokine receptors phylogeny. We hypothesize that receptors very close in the tree can form heterodimers. Our analyses reveal different characteristics of amino acid hydrophobicity and volume propensity in the different subfamilies. We also found that the second extra-cytoplasmic loop has higher rates of evolution than the internal loops and transmembrane segments, suggesting that selection, shifting, reassignments and broadening of receptor binding specificities involve mainly this loop.
Collapse
Affiliation(s)
- Pietro Liò
- Department of Zoology, University of Cambridge, Cambridge, UK.
| | | |
Collapse
|
11
|
|
12
|
Li R, Manna AC, Dai S, Cheung AL, Zhang G. Crystal structure of the SarS protein from Staphylococcus aureus. J Bacteriol 2003; 185:4219-25. [PMID: 12837797 PMCID: PMC164878 DOI: 10.1128/jb.185.14.4219-4225.2003] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2003] [Accepted: 04/28/2003] [Indexed: 11/20/2022] Open
Abstract
The expression of virulence determinants in Staphylococcus aureus is controlled by global regulatory loci (e.g., sarA and agr). One of these determinants, protein A (spa), is activated by sarS, which encodes a 250-residue DNA-binding protein. Genetic analysis indicated that the agr locus likely mediates spa repression by suppressing the transcription of sarS. Contrary to SarA and SarR, which require homodimer formation for proper function, SarS is unusual within the SarA protein family in that it contains two homologous halves, with each half sharing sequence similarity to SarA and SarR. Here we report the 2.2 A resolution X-ray crystal structure of the SarS protein. SarS has folds similar to those of SarR and, quite plausibly, the native SarA structure. Two typical winged-helix DNA-binding domains are connected by a well-ordered loop. The interactions between the two domains are extensive and conserved. The putative DNA-binding surface is highly positively charged. In contrast, negatively charged patches are located opposite to the DNA-binding surface. Furthermore, sequence alignment and structural comparison revealed that MarR has folds similar to those of SarR and SarS. Members of the MarR protein family have previously been implicated in the negative regulation of an efflux pump involved in multiple antibiotic resistance in many gram-negative species. We propose that MarR also belongs to the winged-helix protein family and has a similar mode of DNA binding as SarR and SarS and possibly the entire SarA protein family member. Based on the structural differences of SarR, SarS, and MarR, we further classified these winged-helix proteins to three subfamilies, SarA, SarS, and MarR. Finally, a possible transcription regulation mechanism is proposed.
Collapse
Affiliation(s)
- Ronggui Li
- Integrated Department of Immunology, National Jewish Medical and Research Center, and Department of Pharmacology, Biomolecular Structure Program, School of Medicine, University of Colorado Health Science Center, Denver, Colorado 80206, USA
| | | | | | | | | |
Collapse
|
13
|
Kumar MS, Carson M, Hussain MM, Murthy HMK. Structures of apolipoprotein A-II and a lipid-surrogate complex provide insights into apolipoprotein-lipid interactions. Biochemistry 2002; 41:11681-91. [PMID: 12269810 DOI: 10.1021/bi026069w] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Apolipoproteins A-I and A-II form the major protein constituents of high-density lipid particles (HDL), the concentration of which is inversely correlated with the frequency of heart disease in humans. Although the physiological role of apolipoprotein A-II is unclear, evidence for its involvement in free fatty acid metabolism in mice has recently been obtained. Currently, the best characterized activity of apolipoprotein A-II is its potent antagonism of the anti-atherogenic and anti-inflammatory activities of apolipoprotein A-I, probably due to its competition with the latter for lipid acyl side chains in HDL. Many interactions of apolipoprotein A-I with enzymes and proteins involved in reverse cholesterol transport and HDL maturation are mediated by lipid-bound protein. The structural bases of interaction with lipids are expected to be common to exchangeable apolipoproteins and attributable to amphipathic alpha-helices present in each of them. Thus, characterization of apolipoprotein-lipid interactions in any apolipoprotein is likely to provide information that is applicable to the entire class. We report structures of human apolipoprotein A-II and its complex with beta-octyl glucoside, a widely used lipid surrogate. The former shows that disulfide-linked dimers of apolipoprotein A-II form amphipathic alpha-helices which aggregate into tetramers. Dramatic changes, observed in the presence of beta-octyl glucoside, might provide clues to the structural basis for its antagonism of apolipoprotein A-I. Additionally, excursions of individual molecules of apolipoprotein A-II from a common helical architecture in both structures indicate that lipid-bound apolipoproteins are likely to have an ensemble of related conformations. These structures provide the first experimental paradigm for description of apolipoprotein-lipid interactions at the atomic level.
Collapse
Affiliation(s)
- M Suresh Kumar
- Center for Biophysical Sciences and Engineering, University of Alabama at Birmingham, 1530 3rd Avenue South, Birmingham, AL 35294, USA
| | | | | | | |
Collapse
|
14
|
Goldman BB, Wipke WT. Quadratic shape descriptors. 1. Rapid superposition of dissimilar molecules using geometrically invariant surface descriptors. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2000; 40:644-58. [PMID: 10850770 DOI: 10.1021/ci980213w] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In this paper, we present a novel approach to shape-based molecular similarity searching. The method that we introduce is able to superimpose dissimilar molecules by using geometrically invariant molecular surface descriptors. The shape descriptors are calculated by least-squares fitting of a quadratic function to small sections of the molecular surface of a ligand. Invariant geometric properties of the approximated surface patch are then extracted from the fitted quadratic function. The extracted properties are used to quantify the shape and to obtain a canonical orientation for this section of surface. The superimposition algorithm uses these geometric invariants to recognize similar regions of surface shape existing on two molecules and to bring these regions (and consequently the molecules) into registration. Because these geometric descriptors are based upon local surface shape, the superimposing algorithm is insensitive to the connectivity and the relative sizes of the molecules being matched. The capabilities of our algorithm are demonstrated by superimposing dissimilar ligands known to inhibit the same enzyme system. In all cases examined the algorithm generates superpositions that are in agreement with crystallographic results. The algorithm is also applied to align the two different proteins on the basis of the shape of their active sites.
Collapse
Affiliation(s)
- B B Goldman
- Department of Chemistry and Biochemistry, University of California, Santa Cruz 95064, USA
| | | |
Collapse
|
15
|
|
16
|
Abstract
I-Ad molecules harboring single amino acid changes in the conserved 80-82 region of the beta-chain show altered trafficking in invariant chain (Ii)-negative cell lines. Since residues beta81 and beta82 form hydrogen bonds with the backbone of bound peptide, alterations in this region may result in distinct MHC class II conformers that are targeted aberrantly. We examined the assembly and peptide binding properties of the mutant I-Ad molecules generated by in vitro translation. Indeed, loss of a single hydrogen bond at beta81, or of two hydrogen bonds at beta82, is sufficient to render I-Ad incapable of stable interaction with CLIP and other antigenic peptides, despite normal assembly with intact invariant chain. These results suggest that stable interaction of MHC class II molecules with peptide requires the integrity of the H-bond network between residues in the MHC class II alpha-helices and bound peptide, and that conformational features revealed by stable peptide binding are critical for MHC class II intracellular transport.
Collapse
Affiliation(s)
- P W Bryant
- Department of Pathology, Harvard Medical School, Boston, MA 02115, USA.
| | | | | | | |
Collapse
|
17
|
|