1
|
Ferrero-Serrano Á, Sylvia MM, Forstmeier PC, Olson AJ, Ware D, Bevilacqua PC, Assmann SM. Experimental demonstration and pan-structurome prediction of climate-associated riboSNitches in Arabidopsis. Genome Biol 2022; 23:101. [PMID: 35440059 PMCID: PMC9017077 DOI: 10.1186/s13059-022-02656-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Accepted: 03/20/2022] [Indexed: 11/23/2022] Open
Abstract
Background Genome-wide association studies (GWAS) aim to correlate phenotypic changes with genotypic variation. Upon transcription, single nucleotide variants (SNVs) may alter mRNA structure, with potential impacts on transcript stability, macromolecular interactions, and translation. However, plant genomes have not been assessed for the presence of these structure-altering polymorphisms or “riboSNitches.” Results We experimentally demonstrate the presence of riboSNitches in transcripts of two Arabidopsis genes, ZINC RIBBON 3 (ZR3) and COTTON GOLGI-RELATED 3 (CGR3), which are associated with continentality and temperature variation in the natural environment. These riboSNitches are also associated with differences in the abundance of their respective transcripts, implying a role in regulating the gene's expression in adaptation to local climate conditions. We then computationally predict riboSNitches transcriptome-wide in mRNAs of 879 naturally inbred Arabidopsis accessions. We characterize correlations between SNPs/riboSNitches in these accessions and 434 climate descriptors of their local environments, suggesting a role of these variants in local adaptation. We integrate this information in CLIMtools V2.0 and provide a new web resource, T-CLIM, that reveals associations between transcript abundance variation and local environmental variation. Conclusion We functionally validate two plant riboSNitches and, for the first time, demonstrate riboSNitch conditionality dependent on temperature, coining the term “conditional riboSNitch.” We provide the first pan-genome-wide prediction of riboSNitches in plants. We expand our previous CLIMtools web resource with riboSNitch information and with 1868 additional Arabidopsis genomes and 269 additional climate conditions, which will greatly facilitate in silico studies of natural genetic variation, its phenotypic consequences, and its role in local adaptation. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02656-4.
Collapse
Affiliation(s)
- Ángel Ferrero-Serrano
- Department of Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA.
| | - Megan M Sylvia
- Department of Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA
| | - Peter C Forstmeier
- Department of Biochemistry, Microbiology, and Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA
| | - Andrew J Olson
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA.,USDA ARS NAA Robert W. Holley Center for Agriculture and Health, Ithaca, NY, 14853, USA
| | - Philip C Bevilacqua
- Department of Biochemistry, Microbiology, and Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA.,Department of Chemistry, Pennsylvania State University, University Park, State College, PA, 16802, USA.,Center for RNA Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA
| | - Sarah M Assmann
- Department of Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA. .,Center for RNA Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA.
| |
Collapse
|
2
|
Abstract
SUMMARY Genome sequencing projects annotate protein-coding gene models with multiple transcripts, aiming to represent all of the available transcript evidence. However, downstream analyses often operate on only one representative transcript per gene locus, sometimes known as the canonical transcript. To choose canonical transcripts, Transcript Ranking and Canonical Election (TRaCE) holds an 'election' in which a set of RNA-seq samples rank transcripts by annotation edit distance. These sample-specific votes are tallied along with other criteria such as protein length and InterPro domain coverage. The winner is selected as the canonical transcript, but the election proceeds through multiple rounds of voting to order all the transcripts by relevance. Based on the set of expression data provided, TRaCE can identify the most common isoforms from a broad expression atlas or prioritize alternative transcripts expressed in specific contexts. AVAILABILITY AND IMPLEMENTATION Transcript ranking code can be found on GitHub at {{https://github.com/warelab/TRaCE}}. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Doreen Ware
- Plant Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11768, USA,USDA ARS Robert W. Holley Center for Agriculture and Health Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
3
|
Autin L, Maritan M, Barbaro BA, Gardner A, Olson AJ, Sanner M, Goodsell DS. Mesoscope: A Web-based Tool for Mesoscale Data Integration and Curation. MolVa (2020) 2020; 2020:23-31. [PMID: 37928321 PMCID: PMC10624244 DOI: 10.2312/molva.20201098] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 11/07/2023]
Abstract
Interest is growing for 3D models of the biological mesoscale, the intermediate scale between the nanometer scale of molecular structure and micrometer scale of cellular biology. However, it is currently difficult to gather, curate and integrate all the data required to define such models. To address this challenge we developed Mesoscope (mesoscope.scripps.edu/beta), a web-based data integration and curation tool. Mesoscope allows users to begin with a listing of molecules (such as data from proteomics), and to use resources at UniProt and the PDB to identify, prepare and validate appropriate structures and representations for each molecule, ultimately producing a portable output file used by CellPACK and other modeling tools for generation of 3D models of the biological mesoscale. The availability of this tool has proven essential in several exploratory applications, given the high complexity of mesoscale models and the heterogeneity of the available data sources.
Collapse
Affiliation(s)
- L Autin
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - M Maritan
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - B A Barbaro
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - A Gardner
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - A J Olson
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - M Sanner
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - D S Goodsell
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
- RCSB Protein Data Bank and Center for Integrative Proteomics Research, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA
| |
Collapse
|
4
|
Law M, Childs KL, Campbell MS, Stein JC, Olson AJ, Holt C, Panchy N, Lei J, Jiao D, Andorf CM, Lawrence CJ, Ware D, Shiu SH, Sun Y, Jiang N, Yandell M. Automated update, revision, and quality control of the maize genome annotations using MAKER-P improves the B73 RefGen_v3 gene models and identifies new genes. Plant Physiol 2015; 167:25-39. [PMID: 25384563 PMCID: PMC4280997 DOI: 10.1104/pp.114.245027] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2014] [Accepted: 11/02/2014] [Indexed: 05/18/2023]
Abstract
The large size and relative complexity of many plant genomes make creation, quality control, and dissemination of high-quality gene structure annotations challenging. In response, we have developed MAKER-P, a fast and easy-to-use genome annotation engine for plants. Here, we report the use of MAKER-P to update and revise the maize (Zea mays) B73 RefGen_v3 annotation build (5b+) in less than 3 h using the iPlant Cyberinfrastructure. MAKER-P identified and annotated 4,466 additional, well-supported protein-coding genes not present in the 5b+ annotation build, added additional untranslated regions to 1,393 5b+ gene models, identified 2,647 5b+ gene models that lack any supporting evidence (despite the use of large and diverse evidence data sets), identified 104,215 pseudogene fragments, and created an additional 2,522 noncoding gene annotations. We also describe a method for de novo training of MAKER-P for the annotation of newly sequenced grass genomes. Collectively, these results lead to the 6a maize genome annotation and demonstrate the utility of MAKER-P for rapid annotation, management, and quality control of grasses and other difficult-to-annotate plant genomes.
Collapse
Affiliation(s)
- MeiYee Law
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Kevin L Childs
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Michael S Campbell
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Joshua C Stein
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Andrew J Olson
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Carson Holt
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Nicholas Panchy
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Jikai Lei
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Dian Jiao
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Carson M Andorf
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Carolyn J Lawrence
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Doreen Ware
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Shin-Han Shiu
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Yanni Sun
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Ning Jiang
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| | - Mark Yandell
- The Jackson Laboratory, Bar Harbor, Maine 04609 (M.L.);Eccles Institute of Human Genetics (M.L., M.S.C., M.Y.), Department of Biomedical Informatics (M.L.), and USTAR Center for Genetic Discovery (C.H., M.Y.), University of Utah, Salt Lake City, Utah 84112;Genetics Program (N.P., S.-H.S., N.J.), Department of Plant Biology (K.L.C., S.-H.S.), Department of Computer Science and Engineering (J.L., Y.S.), and Department of Horticulture (N.J.), Michigan State University, East Lansing, Michigan 48824;iPlant Collaborative, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 (J.C.S., A.J.O., D.W.);Ontario Institute for Cancer Research, Toronto, Ontario, Canada M5G 1L7 (C.H.);Texas Advanced Computing Center, University of Texas, Austin, Texas 78758 (D.J.);Department of Genetics, Development, and Cell Biology and Department of Agronomy (C.J.L.), and United States Department of Agriculture-Agricultural Research Service Corn Insects and Crop Genetics Research (C.M.A.), Iowa State University, Ames, Iowa 50011; andUnited States Department of Agriculture-Agricultural Research Service Northeast Area, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853 (D.W.)
| |
Collapse
|
5
|
Abstract
![]()
Embryogenesis is regulated by genetic programs that are dynamically executed in a stereotypic manner, and deciphering these molecular mechanisms requires the ability to control embryonic gene function with similar spatial and temporal precision. Chemical technologies can enable such genetic manipulations, as exemplified by the use of caged morpholino (cMO) oligonucleotides to inactivate genes in zebrafish and other optically transparent organisms with spatiotemporal control. Here we report optimized methods for the design and synthesis of hairpin cMOs incorporating a dimethoxynitrobenzyl (DMNB)-based bifunctional linker that permits cMO assembly in only three steps from commercially available reagents. Using this simplified procedure, we have systematically prepared cMOs with differing structural configurations and investigated how the in vitro thermodynamic properties of these reagents correlate with their in vivo activities. Through these studies, we have established general principles for cMO design and successfully applied them to several developmental genes. Our optimized synthetic and design methodologies have also enabled us to prepare a next-generation cMO that contains a bromohydroxyquinoline (BHQ)-based linker for two-photon uncaging. Collectively, these advances establish the generality of cMO technologies and will facilitate the application of these chemical probes in vivo for functional genomic studies.
Collapse
Affiliation(s)
- Xiaohu Ouyang
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, California 94305, USA
| | | | | | | | | | | | | | | |
Collapse
|
6
|
Abstract
The advent of large-scale sequencing has opened up new areas of research, such as the study of Piwi-interacting small RNAs (piRNAs). piRNAs are longer than miRNAs, close to 30 nucleotides in length, involved in various functions, such as the suppression of transposons in germline. Since a large number of them (many tens of thousands) are generated from a wide range of positions in the genome, large-scale sequencing is the only way to study them. The key to understanding their genesis and biological roles is efficient analysis, which is complicated by the large volumes of sequence data. Taking account of the underlying biology is also important. We describe here novel analyses techniques and tools applied to small RNAs from germ cells in D. melanogaster, that allowed us to infer mechanism and biological function.
Collapse
Affiliation(s)
- A J Olson
- Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | | | | | | | | |
Collapse
|
7
|
Roca X, Olson AJ, Rao AR, Enerly E, Kristensen VN, Børresen-Dale AL, Andresen BS, Krainer AR, Sachidanandam R. Features of 5'-splice-site efficiency derived from disease-causing mutations and comparative genomics. Genome Res 2007; 18:77-87. [PMID: 18032726 DOI: 10.1101/gr.6859308] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Many human diseases, including Fanconi anemia, hemophilia B, neurofibromatosis, and phenylketonuria, can be caused by 5'-splice-site (5'ss) mutations that are not predicted to disrupt splicing, according to position weight matrices. By using comparative genomics, we identify pairwise dependencies between 5'ss nucleotides as a conserved feature of the entire set of 5'ss. These dependencies are also conserved in human-mouse pairs of orthologous 5'ss. Many disease-associated 5'ss mutations disrupt these dependencies, as can some human SNPs that appear to alter splicing. The consistency of the evidence signifies the relevance of this approach and suggests that 5'ss SNPs play a role in complex diseases.
Collapse
Affiliation(s)
- Xavier Roca
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Faith JJ, Olson AJ, Gardner TS, Sachidanandam R. Lightweight genome viewer: portable software for browsing genomics data in its chromosomal context. BMC Bioinformatics 2007; 8:344. [PMID: 17877794 PMCID: PMC2238324 DOI: 10.1186/1471-2105-8-344] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2007] [Accepted: 09/18/2007] [Indexed: 11/10/2022] Open
Abstract
Background Lightweight genome viewer (lwgv) is a web-based tool for visualization of sequence annotations in their chromosomal context. It performs most of the functions of larger genome browsers, while relying on standard flat-file formats and bypassing the database needs of most visualization tools. Visualization as an aide to discovery requires display of novel data in conjunction with static annotations in their chromosomal context. With database-based systems, displaying dynamic results requires temporary tables that need to be tracked for removal. Results lwgv simplifies the visualization of user-generated results on a local computer. The dynamic results of these analyses are written to transient files, which can import static content from a more permanent file. lwgv is currently used in many different applications, from whole genome browsers to single-gene RNAi design visualization, demonstrating its applicability in a large variety of contexts and scales. Conclusion lwgv provides a lightweight alternative to large genome browsers for visualizing biological annotations and dynamic analyses in their chromosomal context. It is particularly suited for applications ranging from short sequences to medium-sized genomes when the creation and maintenance of a large software and database infrastructure is not necessary or desired.
Collapse
|
9
|
Abstract
BACKGROUND Independent identification of genes in different organisms and assays has led to a multitude of names for each gene. This balkanization makes it difficult to use gene names to locate genomic resources, homologs in other species and relevant publications. METHODS We solve the naming problem by collecting data from a variety of sources and building a name-translation database. We have also built a table of homologs across several model organisms: H. sapiens, M. musculus, R. norvegicus, D. melanogaster, C. elegans, S. cerevisiae, S. pombe and A. thaliana. This allows GeneSeer to draw phylogenetic trees and identify the closest homologs. This, in turn, allows the use of names from one species to identify homologous genes in another species. A website http://geneseer.cshl.org/ is connected to the database to allow user-friendly access to our tools and external genomic resources using familiar gene names. CONCLUSION GeneSeer allows access to gene information through common names and can map sequences to names. GeneSeer also allows identification of homologs and paralogs for a given gene. A variety of genomic data such as sequences, SNPs, splice variants, expression patterns and others can be accessed through the GeneSeer interface. It is freely available over the web http://geneseer.cshl.org/ and can be incorporated in other tools through an http-based software interface described on the website. It is currently used as the search engine in the RNAi codex resource, which is a portal for short hairpin RNA (shRNA) gene-silencing constructs.
Collapse
Affiliation(s)
- Andrew J Olson
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Tim Tully
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | | |
Collapse
|
10
|
Abstract
Recognition templates encapsulate the structural and energetic features for the specific recognition of a given ligand by a protein active site. These templates identify the major interactions used for specific recognition and may be used to find specific binding sites in proteins of unknown function. We present a grid-based method for deriving recognition templates for adenylate groups from a set of diverse nucleotide-binding proteins. The templates reveal the basis of specific binding of adenylate, including tight shape complementarity, specific hydrogen bonds, and underscoring the importance of a key steric contact for excluding guanylate from adenylate-specific sites. We demonstrate the utility of recognition templates in identifying specific adenylate-binding sites in a diverse set of dinucleotide-binding proteins.
Collapse
Affiliation(s)
- S Zhao
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | | | | | | |
Collapse
|
11
|
Bühler B, Lin YC, Morris G, Olson AJ, Wong CH, Richman DD, Elder JH, Torbett BE. Viral evolution in response to the broad-based retroviral protease inhibitor TL-3. J Virol 2001; 75:9502-8. [PMID: 11533212 PMCID: PMC114517 DOI: 10.1128/jvi.75.19.9502-9508.2001] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
TL-3 is a protease inhibitor developed using the feline immunodeficiency virus protease as a model. It has been shown to efficiently inhibit replication of human, simian, and feline immunodeficiency viruses and therefore has broad-based activity. We now demonstrate that TL-3 efficiently inhibits the replication of 6 of 12 isolates with confirmed resistance mutations to known protease inhibitors. To dissect the spectrum of molecular changes in protease and viral properties associated with resistance to TL-3, a panel of chronological in vitro escape variants was generated. We have virologically and biochemically characterized mutants with one (V82A), three (M46I/F53L/V82A), or six (L24I/M46I/F53L/L63P/V77I/V82A) changes in the protease and structurally modeled the protease mutant containing six changes. Virus containing six changes was found to be 17-fold more resistant to TL-3 in cell culture than was wild-type virus but maintained similar in vitro replication kinetics compared to the wild-type virus. Analyses of enzyme activity of protease variants with one, three, and six changes indicated that these enzymes, compared to wild-type protease, retained 40, 47, and 61% activity, respectively. These results suggest that deficient protease enzymatic activity is sufficient for function, and the observed protease restoration might imply a selective advantage, at least in vitro, for increased protease activity.
Collapse
Affiliation(s)
- B Bühler
- Department of Molecular and Experimental Medicine, The Scripps Research Institute, La Jolla, California 92037, USA
| | | | | | | | | | | | | | | |
Collapse
|
12
|
Zhao S, Goodsell DS, Olson AJ. Analysis of a data set of paired uncomplexed protein structures: new metrics for side-chain flexibility and model evaluation. Proteins 2001; 43:271-9. [PMID: 11288177 DOI: 10.1002/prot.1038] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
We compiled and analyzed a data set of paired protein structures containing proteins for which multiple high-quality uncomplexed atomic structures were available in the Protein Data Bank. Side-chain flexibility was quantified, yielding a set of residue- and environment-specific confidence levels describing the range of motion around chi1 and chi2 angles. As expected, buried residues were inflexible, adopting similar conformations in different crystal structure analyses. Ile, Thr, Asn, Asp, and the large aromatics also showed limited flexibility when exposed on the protein surface, whereas exposed Ser, Lys, Arg, Met, Gln, and Glu residues were very flexible. This information is different from and complementary to the information available from rotamer surveys. The confidence levels are useful for assessing the significance of observed side-chain motion and estimating the extent of side-chain motion in protein structure prediction. We compare the performance of a simple 40 degrees threshold with these quantitative confidence levels in a critical evaluation of side-chain prediction with the program SCWRL.
Collapse
Affiliation(s)
- S Zhao
- Department of Molecular Biology, Scripps Research Institute, La Jolla, California 92037, USA
| | | | | |
Collapse
|
13
|
Zhang YM, Rao MS, Heath RJ, Price AC, Olson AJ, Rock CO, White SW. Identification and analysis of the acyl carrier protein (ACP) docking site on beta-ketoacyl-ACP synthase III. J Biol Chem 2001; 276:8231-8. [PMID: 11078736 DOI: 10.1074/jbc.m008042200] [Citation(s) in RCA: 139] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The molecular details that govern the specific interactions between acyl carrier protein (ACP) and the enzymes of fatty acid biosynthesis are unknown. We investigated the mechanism of ACP-protein interactions using a computational analysis to dock the NMR structure of ACP with the crystal structure of beta-ketoacyl-ACP synthase III (FabH) and experimentally tested the model by the biochemical analysis of FabH mutants. The activities of the mutants were assessed using both an ACP-dependent and an ACP-independent assay. The ACP interaction surface was defined by mutations that compromised FabH activity in the ACP-dependent assay but had no effect in the ACP-independent assay. ACP docked to a positively charged/hydrophobic patch adjacent to the active site tunnel on FabH, which included a conserved arginine (Arg-249) that was required for ACP docking. Kinetic analysis and direct binding studies between FabH and ACP confirmed the identification of Arg-249 as critical for FabH-ACP interaction. Our experiments reveal the significance of the positively charged/hydrophobic patch located adjacent to the active site cavities of the fatty acid biosynthesis enzymes and the high degree of sequence conservation in helix II of ACP across species.
Collapse
Affiliation(s)
- Y M Zhang
- Department of Biochemistry, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, USA
| | | | | | | | | | | | | |
Collapse
|
14
|
Abstract
The majority of soluble and membrane-bound proteins in modern cells are symmetrical oligomeric complexes with two or more subunits. The evolutionary selection of symmetrical oligomeric complexes is driven by functional, genetic, and physicochemical needs. Large proteins are selected for specific morphological functions, such as formation of rings, containers, and filaments, and for cooperative functions, such as allosteric regulation and multivalent binding. Large proteins are also more stable against denaturation and have a reduced surface area exposed to solvent when compared with many individual, smaller proteins. Large proteins are constructed as oligomers for reasons of error control in synthesis, coding efficiency, and regulation of assembly. Symmetrical oligomers are favored because of stability and finite control of assembly. Several functions limit symmetry, such as interaction with DNA or membranes, and directional motion. Symmetry is broken or modified in many forms: quasisymmetry, in which identical subunits adopt similar but different conformations; pleomorphism, in which identical subunits form different complexes; pseudosymmetry, in which different molecules form approximately symmetrical complexes; and symmetry mismatch, in which oligomers of different symmetries interact along their respective symmetry axes. Asymmetry is also observed at several levels. Nearly all complexes show local asymmetry at the level of side chain conformation. Several complexes have reciprocating mechanisms in which the complex is asymmetric, but, over time, all subunits cycle through the same set of conformations. Global asymmetry is only rarely observed. Evolution of oligomeric complexes may favor the formation of dimers over complexes with higher cyclic symmetry, through a mechanism of prepositioned pairs of interacting residues. However, examples have been found for all of the crystallographic point groups, demonstrating that functional need can drive the evolution of any symmetry.
Collapse
Affiliation(s)
- D S Goodsell
- Department of Molecular Biology, Scripps Research Institute, La Jolla, California 92037, USA. ,
| | | |
Collapse
|
15
|
Abstract
Recent progress in the field of electron cryo-microscopy and image analysis has shown that there is an overwhelming need to interpret medium resolution (5 to 10 A) three-dimensional maps. Traditional methods of fitting amino acid residues into electron density using molecular modeling programs must be supplemented with further analysis. We have used a potential of mean force (PMF) method, derived from Boltzmann statistics in protein structure, to generate models for the packing of alpha-helices, using pairwise potentials between amino acid residues. The approach was tested using the three-dimensional map of a recombinant cardiac gap junction membrane channel provided by electron cryo-crystallography (Unger et al., 1997; 1999a, 1999b) which had a resolution of 7.5 A in the membrane plane and 21 A in the vertical direction. The dodecameric channel was formed by the end-to-end docking of two hexamers, each of which displayed 24 rods of density in the membrane interior, which was consistent with an alpha-helical conformation for the four transmembrane domains of each connexin subunit. Based on the three-dimensional map and the amino acid sequence for the 4 transmembrane domains determined by hydropathy analysis, we used the modeling utility SymServ (Macke et al., 1998) to build hexameric connexons with 24 transmembrane alpha-helices. Canonical alpha-helices were aligned to the axes of the rods of density and translated along the density so that the center of masses coincided. The PMF function was used to evaluate 162,000 conformations for each of the 24 possible alpha-helical packing models. Since the different packing models yielded different energy distributions, the pair potential function appears to be a promising tool for evaluating the packing of alpha-helices in membrane proteins. The analysis will be refined by energy calculations based on the expectations that the outer boundary of the channel will be formed by hydrophobic residues in contact with the lipids.
Collapse
Affiliation(s)
- R S Nunn
- Department of Cell Biology, The Scripps Research Institute, La Jolla, California 92037, USA
| | | | | | | |
Collapse
|
16
|
Olson AJ, Picones A, Korenbrot JI. Developmental switch in excitability, Ca(2+) and K(+) currents of retinal ganglion cells and their dendritic structure. J Neurophysiol 2000; 84:2063-77. [PMID: 11024098 DOI: 10.1152/jn.2000.84.4.2063] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
In the retina of teleost fish, continuous neuronal development occurs at the margin, in the peripheral growth zone (PGZ). We prepared tissue slices from the retina of rainbow trout that include the PGZ and that comprise a time line of retinal development, in which cells at progressive stages of differentiation are present side by side. We studied the changes in dendritic structure and voltage-dependent Ca(2+), Na(+), and K(+) currents that occur as ganglion cells mature. The youngest ganglion cells form a distinct bulge. Cells in the bulge have spare and short dendritic trees. Only half express Ca(2+) currents and then only high-voltage-activated currents with slow inactivation (HVAslow). Bulge cells are rarely electrically excitable. They express a mixture of rapidly inactivating and noninactivating K(+) currents (IKA and IKdr). The ganglion cells next organize into a transition zone, consisting of a layered structure two to three nuclei thick, before forming the single layered structure characteristic of the mature retina. In the transition zone, the dendritic arbor is elaborately branched and extends over multiple laminae in the inner plexiform layer, without apparent stratification. The arbor of the mature cells is stratified, and the span of the dendritic arbor is well over five times the cell body's diameter. The electrical properties of cells in the transition and mature zones differ significantly from those in the bulge cells. Correlated with the more elaborate dendritic structures are the expression of both rapidly inactivating HVA (HVAfast) and of low-voltage-activated (LVA) Ca(2+) currents and of a high density of Na(+) currents that renders the cells electrically excitable. The older ganglion cells also express a slowly activating K(+) current (IKsa).
Collapse
Affiliation(s)
- A J Olson
- Department of Physiology, School of Medicine, University of California at San Francisco, San Francisco, California 94143, USA
| | | | | |
Collapse
|
17
|
Lin YC, Beck Z, Lee T, Le VD, Morris GM, Olson AJ, Wong CH, Elder JH. Alteration of substrate and inhibitor specificity of feline immunodeficiency virus protease. J Virol 2000; 74:4710-20. [PMID: 10775609 PMCID: PMC111993 DOI: 10.1128/jvi.74.10.4710-4720.2000] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Feline immunodeficiency virus (FIV) protease is structurally very similar to human immunodeficiency virus (HIV) protease but exhibits distinct substrate and inhibitor specificities. We performed mutagenesis of subsite residues of FIV protease in order to define interactions that dictate this specificity. The I37V, N55M, M56I, V59I, and Q99V mutants yielded full activity. The I37V, N55M, V59I, and Q99V mutants showed a significant increase in activity against the HIV-1 reverse transcriptase/integrase and P2/nucleocapsid junction peptides compared with wild-type (wt) FIV protease. The I37V, V59I, and Q99V mutants also showed an increase in activity against two rapidly cleaved peptides selected by cleavage of a phage display library with HIV-1 protease. Mutations at Q54K, I98P, and L101I dramatically reduced activity. Mutants containing a I35D or I57G substitution showed no activity against either FIV or HIV substrates. FIV proteases all failed to cut HIV-1 matrix/capsid, P1/P6, P6/protease, and protease/reverse transcriptase junctions, indicating that none of the substitutions were sufficient to change the specificity completely. The I37V, N55M, M56I, V59I, and Q99V mutants, compared with wt FIV protease, all showed inhibitor specificity more similar to that of HIV-1 protease. The data also suggest that FIV protease prefers a hydrophobic P2/P2' residue like Val over Asn or Glu, which are utilized by HIV-1 protease, and that S2/S2' might play a critical role in distinguishing FIV and HIV-1 protease by specificity. The findings extend our observations regarding the interactions involved in substrate binding and aid in the development of broad-based inhibitors.
Collapse
Affiliation(s)
- Y C Lin
- Departments of Molecular Biology, The Scripps Research Institute, La Jolla, California 92037, USA
| | | | | | | | | | | | | | | |
Collapse
|
18
|
Soares T, Goodsell D, Ferreira R, Olson AJ, Briggs JM. Ionization state and molecular docking studies for the macrophage migration inhibitory factor: the role of lysine 32 in the catalytic mechanism. J Mol Recognit 2000; 13:146-56. [PMID: 10867710 DOI: 10.1002/1099-1352(200005/06)13:3<146::aid-jmr497>3.0.co;2-4] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The macrophage migration inhibitory factor (MIF) is a cytokine that is structurally similar to certain isomerases and for which multiple immune and catalytic roles have been proposed. Different catalytic activities have been reported for MIF, yet the exact mechanism by which MIF acts is not completely known. As a tautomerase, the enzyme uses a general acid-base mechanism of proton transfer in which the amino-terminal proline has been shown to function as the catalytic base. We report the results of molecular docking simulations of macrophage migration inhibitory factor with three substrates, D-dopachrome, L-dopachrome methyl ester and p-(hydroxyphenyl)pyruvate. Electrostatic pK(a) predictions were also performed for the free and complexed forms of the enzyme. The predicted binding mode of p-(hydroxyphenyl)pyruvate is in agreement with the recently published X-ray structure. A model for the binding mode of D-dopachrome and L-dopachrome methyl ester to MIF is proposed which offers insights into the catalytic mechanism of D-dopachrome tautomerase activity of MIF. The proposed catalytic mechanism is further supported by the pK(a) predictions, which suggest that residue Lys32 acts as the general acid for the enzymatic catalysis of D-dopachrome.
Collapse
Affiliation(s)
- T Soares
- Department of Biology and Biochemistry, University of Houston, TX 77204-5513, USA
| | | | | | | | | |
Collapse
|
19
|
Abstract
Three forms of feline immunodeficiency virus protease (FIV PR), the wild type (wt) and two single point mutants, V59I and Q99V, as well as human immunodeficiency virus type 1 protease (HIV-1 PR), were cocrystallized with the C2-symmetric inhibitor, TL-3. The mutants of FIV PR were designed to replace residues involved in enzyme-ligand interactions by the corresponding HIV-1 PR residues at the structurally equivalent position. TL-3 shows decreased (improved) inhibition constants with these FIV PR mutants relative to wt FIV PR. Despite similar modes of binding of the inhibitor to all PRs (from P3 to P3'), small differences are evident in the conformation of the Phe side chains of TL-3 at the P1 and P1' positions in the complexes with the mutated FIV PRs. The differences mimick the observed binding of TL-3 in HIV-1 PR and correlate with a significant improvement in the inhibition constants of TL-3 with the two mutant FIV PRs. Large differences between the HIV-1 and FIV PR complexes are evident in the binding modes of the carboxybenzyl groups of TL-3 at P4 and P4'. In HIV-1 PR:TL-3, these groups bind over the flap region, whereas in the FIV PR complexes, the rings are located along the major axis of the active site. A significant difference in the location of the flaps in this region of the HIV-1 and FIV PRs correlates with the observed conformational changes in the binding mode of the peptidomimetic inhibitor at the P4 and P4' positions. These findings provide a structural explanation of the observed Ki values for TL-3 with the different PRs and will further assist in the development of improved inhibitors.
Collapse
Affiliation(s)
- M Li
- Macromolecular Structure Laboratory, ABL-Basic Research Program, NCI-Frederick Cancer Research and Development Center, Maryland 21702, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Abstract
The retina in teleost fish continues to grow throughout much of the life of the animal, in part by the continuing differentiation of new tissue at the retinal margin, an area termed the peripheral growth zone (PGZ) (Lyall, Q J Micros Sci, 1957:98:101-110). We have developed a retinal slice preparation--including the PGZ--from juvenile rainbow trout (Onchorynchus mykiss), a species in which retinal growth is rapid and the PGZ is correspondingly pronounced. The PGZ slice preparation contains a time line of retinal development, with cells at different stages of maturation present side by side. We present evidence that the birth sequence of the various retinal cell types in the PGZ recapitulates the sequence during embryonic development. We also report data on the rate of growth of the PGZ in juvenile trout in vivo. Finally, we have used the PGZ slice preparation to make whole-cell voltage clamp recordings from individual retinal GCs at both early and late stages of maturation. We report that the amplitude of delayed rectifier and A-type potassium currents increases during GC maturation.
Collapse
Affiliation(s)
- A J Olson
- Department of Physiology, School of Medicine, University of California at San Francisco, 94143, USA
| | | | | | | |
Collapse
|
21
|
Hu L, Olson AJ, Weiner RI, Goldsmith PC. Connexin 26 expression and extensive gap junctional coupling in cultures of GT1-7 cells secreting gonadotropin-releasing hormone. Neuroendocrinology 1999; 70:221-7. [PMID: 10529616 DOI: 10.1159/000054480] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Gap junctions (GJs) are transmembrane channels that permit rapid intercellular transit of various small molecules including ions, second messengers and metabolites. GJs promote communication and coordinated activity between coupled neurons, and may help facilitate the synchronous release and pulsatile secretion of neurohormones. A previous study using GnRH-secreting GT1-7 cells reported that connexin 26 was the major GJ subunit present, and that about 20% of the cultured cells engaged in GJ coupling as assayed by fluorescence recovery after photobleaching of 5,6-carboxyfluorescein diacetate (MW 460 D). To reassess GJ connectivity with a more permeant probe, we grew GT1-7 cells to 70% confluency on Matrigel-coated glass coverslips and microinjected Neurobiotin(TM) (MW 322 D) into single cells. Dye was allowed to diffuse for 30 min before cultures were fixed, and subsequently immunostained for Neurobiotin with 3,3'-diaminobenzidine HCl and examined by light microscopy. Dye coupling between 2 or more GT1-7 cells was observed after 75% of all microinjections. Connectivity involved the somata and neurites of an average of 6.6 +/- 2.0 adjoining cells, but in one instance was seen in a group of 32 GT1-7 neighbors. Western blotting and immunofluorescence staining confirmed that connexin 26 was the predominant GJ subunit expressed by GT1-7 cultures. Our results using Neurobiotin suggest these GJ channels may be smaller than anticipated. In addition, functional GJ connectivity between subconfluent GT1-7 cells is more extensive than previously reported, occurring with higher frequency and coupling significantly greater numbers of cultured cells. Since cAMP, IP3, and Ca(2+) are able to pass through GJs and can elicit secretion of GnRH by GT1 cell cultures, GJs may play an important role in the coordination and synchronization of GnRH release.
Collapse
Affiliation(s)
- L Hu
- Department of Obstetrics and Gynecology, Reproductive Endocrinology Center, San Francisco, Calif., USA.
| | | | | | | |
Collapse
|
22
|
Abstract
The enzyme 4-oxalocrotonate tautomerase catalyzes the ketonization of dienols, which after further processing become intermediates in the Krebs cycle. The enzyme uses a general acid-base mechanism for proton transfer: the amino-terminal proline has been shown to function as the catalytic base and Arg39 has been implicated as the catalytic acid. We report the results of molecular docking simulations of 4-oxalocrotonate tautomerase with two substrates, 2-hydroxymuconate and 5-carboxymethyl-2-hydroxymuconate. pKa calculations are also performed for the free enzyme. The predicted binding mode of 2-hydroxymuconate is in agreement with experimental data. A model for the binding mode of 5-carboxymethyl-2-hydroxymuconate is proposed which explains the lower catalytic efficiency of the enzyme toward this substrate. The pKa predictions and docking simulations support residue Arg39 as the general acid for the enzyme catalysis.
Collapse
Affiliation(s)
- T A Soares
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | | | | | | | | |
Collapse
|
23
|
Coombs GS, Rao MS, Olson AJ, Dawson PE, Madison EL. Revisiting catalysis by chymotrypsin family serine proteases using peptide substrates and inhibitors with unnatural main chains. J Biol Chem 1999; 274:24074-9. [PMID: 10446178 DOI: 10.1074/jbc.274.34.24074] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Chymotrypsin family serine proteases play essential roles in key biological and pathological processes and are frequently targets of drug discovery efforts. This large enzyme family is also among the most advanced model systems for detailed studies of enzyme mechanism and structure/function relationships. Productive interactions between these enzymes and their substrates are widely believed to mimic the "canonical" interactions between serine proteases and "standard" inhibitors observed in numerous protease-inhibitor complexes. To test this central hypothesis we have synthesized and characterized a series of peptide analogs, based on model substrates and inhibitors of trypsin, that contain unnatural main chains. These results call into question a long accepted theory regarding the interaction of chymotrypsin family serine proteases with substrates and suggest that the canonical interactions observed between these enzymes and standard inhibitors may represent nonproductive rather than productive, substrate-like interactions.
Collapse
Affiliation(s)
- G S Coombs
- Department of Molecular Biology, Corvas International, San Diego, California 92121, USA
| | | | | | | | | |
Collapse
|
24
|
Sanner MF, Duncan BS, Carrillo CJ, Olson AJ. Integrating computation and visualization for biomolecular analysis: an example using python and AVS. Pac Symp Biocomput 1999:401-12. [PMID: 10380214 DOI: 10.1142/9789814447300_0039] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
One of the challenges in biocomputing is to enable the efficient use of a wide variety of fast-evolving computational methods to simulate, analyze, and understand the complex properties and interactions of molecular systems. Our laboratory investigates several areas including molecular visualization, protein-ligand docking, protein-protein docking, molecular surfaces, and the derivation of phenomenological potentials. In this paper we present an approach based on the Python programming language to achieve a high level of integration between these different computational methods and our primary visualization system AVS. This approach removes many limitations of AVS while increasing dramatically the inter-operability of our computational tools. Several examples are shown to illustrate how this approach enables a high level of integration and inter-operability between different tools, while retaining modularity and avoiding the creation of a large monolithic package that is difficult to extend and maintain.
Collapse
Affiliation(s)
- M F Sanner
- Scripps Research Institute, La Jolla, CA-92037, USA
| | | | | | | |
Collapse
|
25
|
Rosin CD, Belew RK, Walker WL, Morris GM, Olson AJ, Goodsell DS. Coevolution and subsite decomposition for the design of resistance-evading HIV-1 protease inhibitors. J Mol Biol 1999; 287:77-92. [PMID: 10074408 DOI: 10.1006/jmbi.1998.2579] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Drug resistance sharply limits the effectiveness of human immunodeficiency virus (HIV) protease inhibitors in acquired immunodeficiency syndrome therapy. In previous work, we presented methods for design of resistance-evading inhibitors using a computational coevolution technique. Here, we report subsite decomposition experiments that examine the relative importance and roles of each subsite in HIV protease, and the constraints on robust inhibitor design that are imposed by possible resistance mutations in each subsite. The results identify several structural features of robust resistance-evading inhibitors for use in drug design, and show their basis in the constraints imposed by the range of allowable mutation in the protease. In particular, the results identify the P3 and P3' sites as being particularly sensitive to protease mutation: inhibitors designed to fill the S3 and S3' sites of the wild-type protease will be susceptible to viral resistance, but inhibitors with side-chains smaller than a phenylalanine residue at P3 and P3', preferably medium-sized amino acids in the range from valine to leucine and isoleucine residues, will be more robust in the face of protease resistance mutation.
Collapse
Affiliation(s)
- C D Rosin
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA
| | | | | | | | | | | |
Collapse
|
26
|
Rosin CD, Belew RK, Morris GM, Olson AJ, Goodsell DS. Coevolutionary analysis of resistance-evading peptidomimetic inhibitors of HIV-1 protease. Proc Natl Acad Sci U S A 1999; 96:1369-74. [PMID: 9990030 PMCID: PMC15469 DOI: 10.1073/pnas.96.4.1369] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/1998] [Indexed: 11/18/2022] Open
Abstract
We have developed a coevolutionary method for the computational design of HIV-1 protease inhibitors selected for their ability to retain efficacy in the face of protease mutation. For HIV-1 protease, typical drug design techniques are shown to be ineffective for the design of resistance-evading inhibitors: An inhibitor that is a direct analogue of one of the natural substrates will be susceptible to resistance mutation, as will inhibitors designed to fill the active site of the wild-type or a mutant enzyme. Two design principles are demonstrated: (i) For enzymes with broad substrate specificity, such as HIV-1 protease, resistance-evading inhibitors are best designed against the immutable properties of the active site-the properties that must be conserved in any mutant protease to retain the ability to bind and cleave all of the native substrates. (ii) Robust resistance-evading inhibitors can be designed by optimizing activity simultaneously against a large set of mutant enzymes, incorporating as much of the mutational space as possible.
Collapse
Affiliation(s)
- C D Rosin
- Department of Computer Science and Engineering, University of California at San Diego, La Jolla, CA 92093, USA
| | | | | | | | | |
Collapse
|
27
|
Abstract
In order to understand the structural basis of Factor Xa (FXa) specificity, structural complexes of FXa with its synthetic inhibitors are determined using a computational docking approach. The AutoDock suite of programs is used to determine the binding modes of the synthetic inhibitors such as 3- and 4-amidinobenzylphenyl ether (ABP), amidinophenyl pyruvic acid (APPA), diamidinobenzofuranyl ethene (DABE), and DX-9065a 2-(5'-amidino-2'-benzofuranyl)-3-(7'amidino-2'-napthyl)-propionic acid (ABAP) to FXa. The synthetic inhibitors docked in the present study are different in size, nature of linkage, and properties. Two sets of simulations were carried out for synthetic inhibitors docking to FXa. In the first set of simulations, no explicit water molecules were included. In the second set of simulations two explicit solvent molecules were considered. In all the computationally predicted synthetic inhibitor complexes of FXa, the specificity pocket residue Asp-189 is involved in hydrogen bonding with the bound inhibitor. The active site water molecule WAT522 is involved in hydrogen bonding with all the bound inhibitors. The computed energies clearly discriminate the high affinity from low affinity binders.
Collapse
Affiliation(s)
- M S Rao
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, California 92037, USA
| | | |
Collapse
|
28
|
Abstract
The modeling of supramolecular structure presents two major challenges: (1) managing the large amount of sequence, structural and biochemical data, and (2) presenting the data to the user in a flexible and comprehensible manner that addresses these problems. We describe a visualization environment for the creation and analysis of supramolecular models. A set of modular symmetry tools, collectively called SymGen, has been created, providing a flexible platform for the creation of complex assemblies, with interactive control of all symmetry elements and their parameters. A second tool, SymSearch, allows a range of parameters defined within SymGen to be sampled and the resulting conformations to be evaluated. The environment avoids information overload, caused by the large number of atoms in supramolecular complexes, by using a multiresolution spherical harmonic representation that allows the user to display only essential features. Spherical harmonics also enables control of the triangulation level, allowing the user to reduce the complexity of the geometric description to retain interactive speed. The visual fidelity of the surface data is retained by using texture maps that are independent of the resolution of the underlying triangulation. We describe the design and implementation of this environment, and three illustrative examples of its utility.
Collapse
Affiliation(s)
- T J Macke
- Department of Molecular Biology, Scripps Research Institute, La Jolla, California 92037, USA
| | | | | | | |
Collapse
|
29
|
Abstract
BACKGROUND Most soluble proteins are active as low-number oligomers. Statistical surveys of oligomeric proteins have defined the roles of hydrophobicity and complementarity in the stability of protein interfaces, but tend to average structural features over a diverse set of protein-protein interfaces, blurring information on how individual interfaces are stabilized. RESULTS We report a visual survey of 136 homodimeric proteins from the Brookhaven Protein Data Bank, with images that highlight the major structural features of each protein-protein interaction surface. Nearly all of these proteins have interfaces formed between two globular subunits. Surprisingly, the pattern of hydrophilicity over the surface of these interfaces is quite variable. Approximately one-third of the interfaces show a recognizable hydrophobic core, with a single large, contiguous, hydrophobic patch surrounded by a ring of intersubunit polar interactions. The remaining two-thirds of the proteins show a varied mixture of small hydrophobic patches, polar interactions and water molecules scattered over the entire interfacial area. Ten proteins in the survey have intertwined interfaces formed by extensive interdigitation of the two subunit chains. These interfaces are very hydrophobic and are associated with proteins that require both stability and internal symmetry. CONCLUSIONS The archetypal protein interface, with a defined hydrophobic core, is present in only a minority of the surveyed homodimeric proteins. Most homodimeric proteins are stabilized by a combination of small hydrophobic patches, polar interactions and a considerable number of bridging water molecules. The presence or absence of a hydrophobic core within these interfaces does not correlate with specific protein functions.
Collapse
Affiliation(s)
- T A Larsen
- Department of Molecular Biology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | | | | |
Collapse
|
30
|
Abstract
This article will discuss the motivations, technologies, and future directions of computational automated docking in the context of the structure-based rational design of HIV-1 protease inhibitors. Docking simulations are widely used for screening of compound libraries to identify new drug leads, employing a simple model for rapid testing of thousands of compounds. Docking simulations are also useful for lead enhancement, using more detailed models to analyze the atomic interactions between inhibitors and target macromolecules. Major advances have been reported in the development of empirical force fields, which now allow assessment of relative binding strength and drug specificity, and extensions of automated docking techniques allow de novo drug design.
Collapse
Affiliation(s)
- A J Olson
- Department of Molecular Biology, Scripps Research Institute, La Jolla, CA 92037, USA
| | | |
Collapse
|
31
|
Abstract
The field of computer graphics has played an important role in the advancement of structural molecular biology and in the development of structure-based drug design. This article will provide a brief background on the development of this technology, and then focus on the current trends and future directions in molecular graphics and how they will impact the practice of molecular modeling and design. Specific areas that will be covered include: 1) the development of surface and volume based representations of molecular properties and interactions; 2) new approaches to modeling flexible and multi-component structures, and 3) the impact of object-oriented graphics-based programming and the rapidly growing use of network based computing.
Collapse
Affiliation(s)
- A J Olson
- Department of Molecular Biology, Scripps Research Institute La Jolla, CA 92037, USA
| | | |
Collapse
|
32
|
Abstract
An understanding of antiviral drug resistance is important in the design of effective drugs. Comprehensive features of the interaction between drug designs and resistance mutations are difficult to study experimentally because of the very large numbers of drugs and mutants involved. We describe a computational framework for studying antiviral drug resistance. Data on HIV-1 protease are used to derive an approximate model that predicts interaction of a wide range of mutant forms of the protease with a broad class of protease inhibitors. An algorithm based on competitive coevolution is used to find highly resistant mutant forms of the protease, and effective inhibitors against such mutants, in the context of the model. We use this method to characterize general features of inhibitors that are effective in overcoming resistance, and to study related issues of selection pathways, cross-resistance, and combination therapies.
Collapse
Affiliation(s)
- C D Rosin
- Scripps Research Institute, La Jolla, CA 92037, USA.
| | | | | | | | | |
Collapse
|
33
|
Sanner MF, Olson AJ. Real time surface reconstruction for moving molecular fragments. Pac Symp Biocomput 1997:385-96. [PMID: 9390308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Recently we introduced the Reduced Surface as an efficient tool to built molecular surfaces. We describe here how this geometric construct can be used to efficiently reconstruct the solvent excluded surface of a protein for which the coordinates of a subset of atoms are changing. We show that, the complexity of that operation is not dependent upon the size of the molecule and is in O[tlog(t)] where t is the maximum of the number of probes and atoms involved in the reconstruction of the surface. The algorithms described here have been implemented and tested on several proteins. The triangulation of the solvent excluded surface of proteins in which a side chain was changing conformation could be updated at rates ranging from 7 to 22 frames per second. We also applied this method to compute the surface area fluctuation of the FIV protease undergoing a constrained molecular dynamics simulation (16 mobile residues). Rate of 6 frames per second were obtained in this case.
Collapse
Affiliation(s)
- M F Sanner
- Scripps Research Institute, La Jolla, CA 92037, USA
| | | |
Collapse
|
34
|
Reva BA, Finkelstein AV, Sanner MF, Olson AJ. Accurate mean-force pairwise-residue potentials for discrimination of protein folds. Pac Symp Biocomput 1997:373-84. [PMID: 9390307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
We present two new sets of energy functions for protein structure recognition. The first set of potentials is based on the positions of alpha- and the second on positions of beta-carbon atoms of amino acid residues. The potentials are derived using a theory of Boltzmann-like statistics of protein structure by Finkelstein et al. The energy terms incorporate both long-range interactions between residues remote along a chain and short-range interactions between near neighbors. Distance-dependence is approximated by a piecewise constant function defined on intervals of equal size. The size of this interval is optimized. A database of 222 non-homologous proteins was used both for the derivation of the potentials, and for the "threading" test originally suggested by Hendlich et al. For threading, we used 102 non-homologous protein chains of 60 to 200 residues. The energy of each of the native structures was compared with the energy of 45 to 20 thousand alternative structures generated by threading. Of these 102 native structures 94 have the lowest energy with alpha-carbon-based potentials, and even more, 100 of these 102 structures, have the lowest energy with the beta-carbon-based potentials.
Collapse
Affiliation(s)
- B A Reva
- Department of Molecular Biology, Scripps Research Institute, Ca. 92037, USA
| | | | | | | |
Collapse
|
35
|
Abstract
A new Fourier series representation of supercoiled DNA is employed in Langevin dynamics simulations to study large-scale configurational motions of intermediate-length chains. The polymer is modeled as an ideal elastic rod subject to long-range van der Waals' interactions. The van der Waals' term prevents the self-contact of distant chain segments and also mimics attractive forces thought to stabilize the association of closely spaced charged rods. The finite Fourier series-derived polymer formulation is an alternative to the piecewise B-spline curves used in past work to describe the motion of smoothly deformed supercoiled DNA in terms of a limited number of independent variables. This study focuses on two large-scale configurational events: the interconversion between circular and figure-8 forms at a relatively low level of supercoiling, and the transformation between branched and interwound structures at a higher superhelical density.
Collapse
Affiliation(s)
- G Liu
- Department of Chemistry, Rutgers, the State University of New Jersey, New Brunswick 08903, USA
| | | | | | | |
Collapse
|
36
|
Reva BA, Finkelstein AV, Sanner M, Olson AJ, Skolnick J. Recognition of protein structure on coarse lattices with residue-residue energy functions. Protein Eng 1997; 10:1123-30. [PMID: 9488137 DOI: 10.1093/protein/10.10.1123] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
We suggest and test potentials for the modeling of protein structure on coarse lattices. The coarser the lattice, the more complete and faster is the exploration of the conformational space of a molecule. However, there are inevitable energy errors in lattice modeling caused by distortions in distances between interacting residues; the coarser the lattice, the larger are the energy errors. It is generally believed that an improvement in the accuracy of lattice modelling can be achieved only by reducing the lattice spacing. We reduce the errors on coarse lattices with lattice-adapted potentials. Two methods are used: in the first approach, 'lattice-derived' potentials are obtained directly from a database of lattice models of protein structure; in the second approach, we derive 'lattice-adjusted' potentials using our previously developed method of statistical adjustment of the 'off-lattice' energy functions for lattices. The derivation of off-lattice Calpha atom-based distance-dependent pairwise potentials has been reported previously. The accuracy of 'lattice-derived', 'lattice-adjusted' and 'off-lattice' potentials is estimated in threading tests. It is shown that 'lattice-derived' and 'lattice-adjusted' potentials give virtually the same accuracy and ensure reasonable protein fold recognition on the coarsest considered lattice (spacing 3.8 A), however, the 'off-lattice' potentials, which efficiently recognize off-lattice folds, do not work on this lattice, mainly because of the errors in short-range interactions between neighboring residues.
Collapse
Affiliation(s)
- B A Reva
- Department of Molecular Biology, The Scripps Research Institute, CA 92037, USA
| | | | | | | | | |
Collapse
|
37
|
Abstract
We present two new sets of energy functions for protein structure recognition, given the primary sequence of amino acids along the polypeptide chain. The first set of potentials is based on the positions of alpha- and the second on positions of beta- and alpha-carbon atoms of amino acid residues. The potentials are derived using a theory of Boltzmann-like statistics of protein structure. The energy terms incorporate both long-range interactions between residues remote along a chain and short-range interactions between near neighbors. Distance dependence is approximated by a piecewise constant function defined on intervals of equal size. The size of the interval is optimized to preserve as much detail as possible without introducing excessive error due to limited statistics. A database of 214 non-homologous proteins was used both for the derivation of the potentials, and for the 'threading' test originally suggested by Hendlich et al. (1990) J. Mol. Biol., 216, 167-180. Special care is taken to avoid systematic error in this test. For threading, we used 100 non-homologous protein chains of 60-205 residues. The energy of each of the native structures was compared with the energy of 43,000 to 19,000 alternative structures generated by threading. Of these 100 native structures, 92 have the lowest energy with alpha-carbon-based potentials and, even more, 98 of these 100 structures, have the lowest energy with the beta- and alpha-carbon based potentials.
Collapse
Affiliation(s)
- B A Reva
- Department of Molecular Biology, Scripps Research Institute, CA 92037, USA
| | | | | | | |
Collapse
|
38
|
Laco GS, Fitzgerald MC, Morris GM, Olson AJ, Kent SB, Elder JH. Molecular analysis of the feline immunodeficiency virus protease: generation of a novel form of the protease by autoproteolysis and construction of cleavage-resistant proteases. J Virol 1997; 71:5505-11. [PMID: 9188624 PMCID: PMC191792 DOI: 10.1128/jvi.71.7.5505-5511.1997] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The feline immunodeficiency virus (FIV) protease is essential for virion maturation and subsequent viral replication in that it cleaves the Gag and Gag/Pol polyproteins at eight sites to release the respective structural proteins and enzymes. During purification of a recombinant FIV protease (PR), we noted that it underwent autoproteolysis (autolysis) to give discrete cleavage products. These additional PR cleavage sites were defined using N-terminal amino acid sequence analysis and mass spectrometry. Protease breakdown products were also found in FIV virions and were of the same apparent molecular weights as the in vitro autolysis products. Four primary PR autolysis sites were blocked via substitution of either the P1 amino acid with a beta-branched amino acid or the P1' amino acid with lysine. Cleavage-resistant PRs which had Km and k(cat) values similar to those of FIV PR were constructed. An autolysis time course determined that blocking all four primary autolysis sites yielded a cleavage-resistant PR which was enzymatically stable. Concomitant with autolysis is the generation of an N-terminally truncated form of the PR (Thr6/PR) which has enhanced stability with respect to that of FIV PR. A structural basis for the Thr6/PR activity is presented, as are the possible roles of autolysis in the viral replication cycle.
Collapse
Affiliation(s)
- G S Laco
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, California 92037, USA
| | | | | | | | | | | |
Collapse
|
39
|
Abstract
We present an algorithm to build self-avoiding lattice models of chain molecules with low RMS deviation from their actual 3D structures. To find the optimal coordinates for the lattice chain model, we minimize a function that consists of three terms: (1) the sum of squared deviations of link coordinates on a lattice from their off-lattice values, (2) the sum of "short-range" terms, penalizing violation of chain connectivity, and (3) the sum of "long-range" repulsive terms, penalizing chain self-intersections. We treat this function as a chain molecule "energy" and minimize it using self-consistent field (SCF) theory to represent the pairwise link repulsions as 3D fields acting on the links. The statistical mechanics of chain molecules enables computation of the chain distribution in this field on the lattice. The field is refined by iteration to become self-consistent with the chain distribution, then dynamic programming is used to find the optimal lattice model as the "lowest-energy" chain pathway in this SCF. We have tested the method on one of the coarsest (and most difficult) lattices used for model building on proteins of all structural types and show that the method is adequate for building self-avoiding models of proteins with low RMS deviations from the actual structures.
Collapse
Affiliation(s)
- B A Reva
- Department of Molecular Biology, Scripps Research Institute, La Jolla, California 92037, USA
| | | | | | | |
Collapse
|
40
|
Morris GM, Goodsell DS, Huey R, Olson AJ. Distributed automated docking of flexible ligands to proteins: parallel applications of AutoDock 2.4. J Comput Aided Mol Des 1996; 10:293-304. [PMID: 8877701 DOI: 10.1007/bf00124499] [Citation(s) in RCA: 770] [Impact Index Per Article: 27.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
AutoDock 2.4 predicts the bound conformations of a small, flexible ligand to a nonflexible macromolecular target of known structure. The technique combines simulated annealing for conformation searching with a rapid grid-based method of energy evaluation based on the AMBER force field. AutoDock has been optimized in performance without sacrificing accuracy; it incorporates many enhancements and additions, including an intuitive interface. We have developed a set of tools for launching and analyzing many independent docking jobs in parallel on a heterogeneous network of UNIX-based workstations. This paper describes the current release, and the results of a suite of diverse test systems. We also present the results of a systematic investigation into the effects of varying simulated-annealing parameters on molecular docking. We show that even for ligands with a large number of degrees of freedom, root-mean-square deviations of less than 1 A from the crystallographic conformation are obtained for the lowest-energy dockings, although fewer dockings find the crystallographic conformation when there are more degrees of freedom.
Collapse
Affiliation(s)
- G M Morris
- Department of Molecular Biology, MB-5, Scripps Research Institute, La Jolla, CA 92037, USA
| | | | | | | |
Collapse
|
41
|
Abstract
Lattice models of proteins can approximate off-lattice structure to arbitrary precision with RMS (root mean squared) deviations roughly equal to half the lattice spacing (Rykunov et al., Proteins 22:100-109, 1995; Reva et al., J. Comp. Biol., 1996). However, even small distortions in the positions of chain links lead to significant errors in lattice-based energy calculations (Reva et al., J. Comp. Chem., 1996). These errors arise mainly from rigid interactions (such as steric repulsion) which change their energies considerably at a range which is much smaller than the usual accuracy of lattice modeling (> 1.0 A). To reduce this error, we suggest a procedure of adjusting energy functions to a given lattice. The general approach is illustrated with energy calculations based on pairwise potentials by Kolinski et al. (J. Chem. Phys. 98:1-14, 1993). At all the lattice spacings, from 0.5-3.8 A, the lattice-adjusted potentials improve the accuracy of lattice-based energy calculations and increase the correlations between off-lattice and lattice energies.
Collapse
Affiliation(s)
- B A Reva
- Department of Molecular Biology, Scripps Research Institute, La Jolla, California 92037, USA
| | | | | | | |
Collapse
|
42
|
Abstract
Because of their wide use in molecular modeling, methods to compute molecular surfaces have received a lot of interest in recent years. However, most of the proposed algorithms compute the analytical representation of only the solvent-accessible surface. There are a few programs that compute the analytical representation of the solvent-excluded surface, but they often have problems handling singular cases of self-intersecting surfaces and tend to fail on large molecules (more than 10,000 atoms). We describe here a program called MSMS, which is shown to be fast and reliable in computing molecular surfaces. It relies on the use of the reduced surface that is briefly defined here and from which the solvent-accessible and solvent-excluded surfaces are computed. The four algorithms composing MSMS are described and their complexity is analyzed. Special attention is given to the handling of self-intersecting parts of the solvent-excluded surface called singularities. The program has been compared with Connolly's program PQMS [M.L. Connolly (1993) Journal of Molecular Graphics, Vol. 11, pp. 139-141] on a set of 709 molecules taken from the Brookhaven Data Base. MSMS was able to compute topologically correct surfaces for each molecule in the set. Moreover, the actual time spent to compute surfaces is in agreement with the theoretical complexity of the program, which is shown to be O[n log(n)] for n atoms. On a Hewlett-Packard 9000/735 workstation, MSMS takes 0.73 s to produce a triangulated solvent-excluded surface for crambin (1 crn, 46 residues, 327 atoms, 4772 triangles), 4.6 s for thermolysin (3tln, 316 residues, 2437 atoms, 26462 triangles), and 104.53 s for glutamine synthetase (2gls, 5676 residues, 43632 atoms, 476665 triangles).
Collapse
Affiliation(s)
- M F Sanner
- The Scripps Research Institute, La Jolla, California 92037, USA
| | | | | |
Collapse
|
43
|
Abstract
AutoDock is a suite of C programs used to predict the bound conformations of a small, flexible ligand to a macromolecular target of known structure. The technique combines simulated annealing for conformation searching with a rapid grid-based method of energy evaluation. This paper reviews recent applications of the technique and describes the enhancements included in the current release.
Collapse
Affiliation(s)
- D S Goodsell
- Department of Molecular Biology, Scripps Research Institute, La Jolla, CA 92037, USA
| | | | | |
Collapse
|
44
|
Abstract
AutoDock is a suite of C programs used to predict the bound conformations of a small, flexible ligand to a macromolecular target of known structure. The technique combines simulated annealing for conformation searching with a rapid grid-based method of energy evaluation. This paper reviews recent applications of the technique and describes the enhancements included in the current release.
Collapse
Affiliation(s)
- D S Goodsell
- Department of Molecular Biology, Scripps Research Institute, La Jolla, CA 92037, USA
| | | | | |
Collapse
|
45
|
Abstract
Dataflow systems for scientific visualization are becoming increasingly sophisticated in their architecture and functionality. AVS, from Advanced Visual Systems Inc., is a powerful dataflow environment that has been applied to many computation and visualization tasks. An important, yet complex, application area is molecular modeling and biomolecular visualization. Problems in biomolecular visualization tax the capability of dataflow systems because of the diversity of operations that are required and because many operations do not fit neatly into the dataflow paradigm. Here we describe visualization strategies and auxiliary programs developed to enhance the applicability of AVS for molecular modelling. Our visualization strategy is to use general-purpose AVS modules and a small number of chemistry-specific modules. We have developed methods to control AVS using AVS-tool, a programmable interface to the AVS Command Line Interpreter (CLI), and have also developed NAB, a C-like language for writing AVS modules that has extensions for operating on proteins and nucleic acids. This strategy provides a flexible and extensible framework for a wide variety of molecular modeling tasks.
Collapse
Affiliation(s)
- B S Duncan
- Scripps Research Institute, La Jolla, California 92037, USA
| | | | | |
Collapse
|
46
|
Abstract
We present a method for the approximation and real-time visualization of large-scale motion of protein surfaces. A molecular surface is represented by an expansion of spherical harmonic functions, and the motion of protein atoms around their equilibrium positions is computed by normal mode analysis. The motion of the surface is approximated by projecting the normal mode vectors of the solvent-accessible atoms to the spherical harmonic representation of the molecular surface. These surface motion vectors are represented by a separate spherical harmonic expansion. Representing the surface geometry and the surface motion vectors by spherical harmonic expansions allows variable-resolution analysis and real-time display of the large-scale surface motion. This technique has been applied to interactive visualization, interactive surface manipulation, and animation.
Collapse
Affiliation(s)
- B S Duncan
- Scripps Research Institute, La Jolla, California 92037, USA
| | | |
Collapse
|
47
|
Abstract
Texture mapping is an increasingly popular technique in molecular modeling. It is particularly effective in representing high-resolution surface detail using a low-resolution polygonal model. We describe how texture mapping can be used with parametric molecular surfaces represented as expansions of spherical harmonic functions. We define analytically the texture image and its transformation to a parametric surface. Unlike most methods of texture mapping, this transformation defines a one-to-one correspondence between the surface and the texture; texture coordinates are derived from the location of the surface point and not from physical properties at the surface point. This has advantages for the interactive visualization of surface data. We control the interactive response time by lowering the resolution of the polygon mesh while retaining the high-resolution detail of the texture, or we can lower the resolution of the texture image with the same polygonal model. By using a well-defined convention for texture coordinates, we can use the same image for the original surface or its parametric representation, and we can rapidly switch between images that represent different surface properties without recomputing the texture coordinates. Parametric surfaces allow new flexibility for the visualization of molecular surface data.
Collapse
Affiliation(s)
- B S Duncan
- Scripps Research Institute, La Jolla, California 92037, USA
| | | |
Collapse
|
48
|
Abstract
An algorithm to construct lattice models of polymers with side chains is presented. A search for the global minimum of the error function for a given lattice-to-chain orientation is done by dynamic programming, making the search both fast and complete. Application of the algorithm is illustrated by constructing lattice models for 12 proteins of different sizes and structural types.
Collapse
Affiliation(s)
- B A Reva
- Department of Molecular Biology, Scripps Research Institute, La Jolla, CA 92037, USA
| | | | | | | |
Collapse
|
49
|
Abstract
Automated docking of substrates to proteins of known structure aids the process of crystallographic analysis in two ways. First, automated docking can be used to generate a small number of starting models for substrates using only protein coordinates from an early stage of refinement. Second, automated docking provides a method for exploring aspects of catalysis that are inaccessible to crystallography by postulating binding modes of catalytic intermediates. This paper describes the use of automated docking to explore the binding of substrates to aconitase. The technique starts with a substrate molecule in an arbitrary configuration and position and finds favorable docked configurations in a (static) protein active site based on a molecular mechanics type force field. Using protein coordinates from an early stage of refinement of an aconitase-isocitrate complex, we successfully predicted the binding configuration of isocitrate. Four configurations were found, the energetically most favorable of which fit the observed electron density well and was used as a starting model for further refinement. Two configurations were found in citrate docking experiments, the second of which approximates the mode of substrate binding in an aconitase-nitrocitrate complex. We were also able to propose two binding modes of the catalytic intermediate cis-aconitate. These correspond closely to the isocitrate and the citrate binding modes. The relation of these new results to the proposed reaction mechanism is discussed.
Collapse
Affiliation(s)
- D S Goodsell
- Molecular Biology Institute, University of California, Los Angeles 90024
| | | | | | | |
Collapse
|
50
|
Abstract
Why are proteins so big? Why do cells build oligomeric proteins? A visual survey of the protein structures available in the Protein Data Bank sheds new light on these questions.
Collapse
Affiliation(s)
- D S Goodsell
- Molecular Biology Institute, University of California, Los Angeles 90024
| | | |
Collapse
|