1
|
Buchko GW, Abendroth J, Robinson JI, Phan IQ, Myler PJ, Edwards TE. Structural diversity in the Mycobacteria DUF3349 superfamily. Protein Sci 2019; 29:670-685. [PMID: 31658388 DOI: 10.1002/pro.3758] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2019] [Revised: 10/17/2019] [Accepted: 10/21/2019] [Indexed: 11/11/2022]
Abstract
A protein superfamily with a "Domain of Unknown Function,", DUF3349 (PF11829), is present predominately in Mycobacterium and Rhodococcus bacterial species suggesting that these proteins may have a biological function unique to these bacteria. We previously reported the inaugural structure of a DUF3349 superfamily member, Mycobacterium tuberculosis Rv0543c. Here, we report the structures determined for three additional DUF3349 proteins: Mycobacterium smegmatis MSMEG_1063 and MSMEG_1066 and Mycobacterium abscessus MAB_3403c. Like Rv0543c, the NMR solution structure of MSMEG_1063 revealed a monomeric five α-helix bundle with a similar overall topology. Conversely, the crystal structure of MSMEG_1066 revealed a five α-helix protein with a strikingly different topology and a tetrameric quaternary structure that was confirmed by size exclusion chromatography. The NMR solution structure of a fourth member of the DUF3349 superfamily, MAB_3403c, with 18 residues missing at the N-terminus, revealed a monomeric α-helical protein with a folding topology similar to the three C-terminal helices in the protomer of the MSMEG_1066 tetramer. These structures, together with a GREMLIN-based bioinformatics analysis of the DUF3349 primary amino acid sequences, suggest two subfamilies within the DUF3349 family. The division of the DUF3349 into two distinct subfamilies would have been lost if structure solution had stopped with the first structure in the DUF3349 family, highlighting the insights generated by solving multiple structures within a protein superfamily. Future studies will determine if the structural diversity at the tertiary and quaternary levels in the DUF3349 protein superfamily have functional roles in Mycobacteria and Rhodococcus species with potential implications for structure-based drug discovery.
Collapse
Affiliation(s)
- Garry W Buchko
- Seattle Structural Genomics Center for Infectious Disease, Seattle, Washington.,Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington.,School of Molecular Biosciences, Washington State University, Pullman, Washington
| | - Jan Abendroth
- Seattle Structural Genomics Center for Infectious Disease, Seattle, Washington.,UCB, Bainbridge Island, Washington
| | - John I Robinson
- Seattle Structural Genomics Center for Infectious Disease, Seattle, Washington.,UCB, Bainbridge Island, Washington
| | - Isabelle Q Phan
- Seattle Structural Genomics Center for Infectious Disease, Seattle, Washington.,Center for Global Infectious Disease Research, Seattle Children's Hospital, Seattle, Washington
| | - Peter J Myler
- Seattle Structural Genomics Center for Infectious Disease, Seattle, Washington.,Center for Global Infectious Disease Research, Seattle Children's Hospital, Seattle, Washington.,Department of Medical Education and Biomedical Informatics, University of Washington, Seattle, Washington.,Department of Global Health, University of Washington, Seattle, Washington
| | - Thomas E Edwards
- Seattle Structural Genomics Center for Infectious Disease, Seattle, Washington.,UCB, Bainbridge Island, Washington
| |
Collapse
|
2
|
Matelska D, Steczkiewicz K, Ginalski K. Comprehensive classification of the PIN domain-like superfamily. Nucleic Acids Res 2017; 45:6995-7020. [PMID: 28575517 PMCID: PMC5499597 DOI: 10.1093/nar/gkx494] [Citation(s) in RCA: 60] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Accepted: 05/24/2017] [Indexed: 12/21/2022] Open
Abstract
PIN-like domains constitute a widespread superfamily of nucleases, diverse in terms of the reaction mechanism, substrate specificity, biological function and taxonomic distribution. Proteins with PIN-like domains are involved in central cellular processes, such as DNA replication and repair, mRNA degradation, transcription regulation and ncRNA maturation. In this work, we identify and classify the most complete set of PIN-like domains to provide the first comprehensive analysis of sequence–structure–function relationships within the whole PIN domain-like superfamily. Transitive sequence searches using highly sensitive methods for remote homology detection led to the identification of several new families, including representatives of Pfam (DUF1308, DUF4935) and CDD (COG2454), and 23 other families not classified in the public domain databases. Further sequence clustering revealed relationships between individual sequence clusters and showed heterogeneity within some families, suggesting a possible functional divergence. With five structural groups, 70 defined clusters, over 100,000 proteins, and broad biological functions, the PIN domain-like superfamily constitutes one of the largest and most diverse nuclease superfamilies. Detailed analyses of sequences and structures, domain architectures, and genomic contexts allowed us to predict biological function of several new families, including new toxin-antitoxin components, proteins involved in tRNA/rRNA maturation and transcription/translation regulation.
Collapse
Affiliation(s)
- Dorota Matelska
- University of Warsaw, CeNT, Laboratory of Bioinformatics and Systems Biology, Zwirki i Wigury 93, 02-089 Warsaw, Poland
| | - Kamil Steczkiewicz
- University of Warsaw, CeNT, Laboratory of Bioinformatics and Systems Biology, Zwirki i Wigury 93, 02-089 Warsaw, Poland
| | - Krzysztof Ginalski
- University of Warsaw, CeNT, Laboratory of Bioinformatics and Systems Biology, Zwirki i Wigury 93, 02-089 Warsaw, Poland
| |
Collapse
|
3
|
Solution NMR Studies of Mycobacterium tuberculosis Proteins for Antibiotic Target Discovery. Molecules 2017; 22:molecules22091447. [PMID: 28858250 PMCID: PMC6151718 DOI: 10.3390/molecules22091447] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Accepted: 08/27/2017] [Indexed: 11/17/2022] Open
Abstract
Tuberculosis is an infectious disease caused by Mycobacteriumtuberculosis, which triggers severe pulmonary diseases. Recently, multidrug/extensively drug-resistant tuberculosis strains have emerged and continue to threaten global health. Because of the development of drug-resistant tuberculosis, there is an urgent need for novel antibiotics to treat these drug-resistant bacteria. In light of the clinical importance of M. tuberculosis, 2067 structures of M. tuberculsosis proteins have been determined. Among them, 52 structures have been solved and studied using solution nuclear magnetic resonance (NMR). The functional details based on structural analysis of M. tuberculosis using NMR can provide essential biochemical data for the development of novel antibiotic drugs. In this review, we introduce diverse structural and biochemical studies on M. tuberculosis proteins determined using NMR spectroscopy.
Collapse
|
4
|
Buchko GW, Yee A, Semesi A, Myler PJ, Arrowsmith CH, Hui R. Solution-state NMR structure of the putative morphogene protein BolA (PFE0790c) from Plasmodium falciparum. Acta Crystallogr F Struct Biol Commun 2015; 71:514-21. [PMID: 25945703 PMCID: PMC4427159 DOI: 10.1107/s2053230x1402799x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2014] [Accepted: 12/23/2014] [Indexed: 12/22/2022] Open
Abstract
Protozoa of the genus Plasmodium are responsible for malaria, which is perhaps the most important parasitic disease to infect mankind. The emergence of Plasmodium strains resistant to current therapeutics and prophylactics makes the development of new treatment strategies urgent. Among the potential targets for new antimalarial drugs is the BolA-like protein PFE0790c from Plasmodium falciparum (Pf-BolA). While the function of BolA is unknown, it has been linked to cell morphology by regulating transcription in response to stress. Using an NMR-based method, an ensemble of 20 structures of Pf-BolA was determined and deposited in the PDB (PDB entry 2kdn). The overall topology of the Pf-BolA structure, α1-β1-β2-η1-α2/η2-β3-α3, with the β-strands forming a mixed β-sheet, is similar to the fold observed in other BolA structures. A helix-turn-helix motif similar to the class II KH fold associated with nucleic acid-binding proteins is present, but contains an FXGXXXL signature sequence that differs from the GXXG signature sequence present in class II KH folds, suggesting that the BolA family of proteins may use a novel protein-nucleic acid interface. A well conserved arginine residue, Arg50, hypothesized to play a role in governing the formation of the C-terminal α-helix in the BolA family of proteins, is too distant to form polar contacts with any side chains in this α-helix in Pf-BolA, suggesting that this conserved arginine may only serve a role in guiding the orientation of this C-terminal helix in some BolA proteins. A survey of BolA structures suggests that the C-terminal helix may not have a functional role and that the third helix (α2/η2) has a `kink' that appears to be conserved among the BolA protein structures. Circular dichroism spectroscopy shows that Pf-BolA is fairly robust, partially unfolding when heated to 353 K and refolding upon cooling to 298 K.
Collapse
Affiliation(s)
- Garry W. Buchko
- Seattle Structural Genomics Center for Infectious Disease, USA
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Adelinda Yee
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Anthony Semesi
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Peter J. Myler
- Seattle Structural Genomics Center for Infectious Disease, USA
- Seattle BioMed, Seattle, Washington, USA
- Department of Medical Education and Biomedical Informatics and Department of Global Health, University of Washington, Seattle, Washington, USA
| | - Cheryl H. Arrowsmith
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Structural Genomics Consortium, England
| | - Raymond Hui
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Structural Genomics Consortium, England
| |
Collapse
|
5
|
Stacy R, Begley DW, Phan I, Staker BL, Van Voorhis WC, Varani G, Buchko GW, Stewart LJ, Myler PJ. Structural genomics of infectious disease drug targets: the SSGCID. Acta Crystallogr Sect F Struct Biol Cryst Commun 2011; 67:979-84. [PMID: 21904037 PMCID: PMC3169389 DOI: 10.1107/s1744309111029204] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2011] [Accepted: 07/19/2011] [Indexed: 11/29/2022]
Abstract
The Seattle Structural Genomics Center for Infectious Disease (SSGCID) is a consortium of researchers at Seattle BioMed, Emerald BioStructures, the University of Washington and Pacific Northwest National Laboratory that was established to apply structural genomics approaches to drug targets from infectious disease organisms. The SSGCID is currently funded over a five-year period by the National Institute of Allergy and Infectious Diseases (NIAID) to determine the three-dimensional structures of 400 proteins from a variety of Category A, B and C pathogens. Target selection engages the infectious disease research and drug-therapy communities to identify drug targets, essential enzymes, virulence factors and vaccine candidates of biomedical relevance to combat infectious diseases. The protein-expression systems, purified proteins, ligand screens and three-dimensional structures produced by SSGCID constitute a valuable resource for drug-discovery research, all of which is made freely available to the greater scientific community. This issue of Acta Crystallographica Section F, entirely devoted to the work of the SSGCID, covers the details of the high-throughput pipeline and presents a series of structures from a broad array of pathogenic organisms. Here, a background is provided on the structural genomics of infectious disease, the essential components of the SSGCID pipeline are discussed and a survey of progress to date is presented.
Collapse
Affiliation(s)
- Robin Stacy
- Seattle Structural Genomics Center for Infectious Disease, USA
- Seattle Biomedical Research Institute, 307 Westlake Avenue North, Suite 500, Seattle, WA 98109-5219, USA
| | - Darren W. Begley
- Seattle Structural Genomics Center for Infectious Disease, USA
- Emerald BioStructures, 7869 NE Day Road West, Bainbridge Island, WA 98110, USA
| | - Isabelle Phan
- Seattle Structural Genomics Center for Infectious Disease, USA
- Seattle Biomedical Research Institute, 307 Westlake Avenue North, Suite 500, Seattle, WA 98109-5219, USA
| | - Bart L. Staker
- Seattle Structural Genomics Center for Infectious Disease, USA
- Emerald BioStructures, 7869 NE Day Road West, Bainbridge Island, WA 98110, USA
| | - Wesley C. Van Voorhis
- Seattle Structural Genomics Center for Infectious Disease, USA
- Department of Medicine, Division of Allergy and Infectious Diseases, University of Washington, Box 357185, Seattle, WA 98195, USA
| | - Gabriele Varani
- Seattle Structural Genomics Center for Infectious Disease, USA
- Departments of Chemistry and Biochemistry, University of Washington, Box 351700, Seattle, WA 98185, USA
| | - Garry W. Buchko
- Seattle Structural Genomics Center for Infectious Disease, USA
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - Lance J. Stewart
- Seattle Structural Genomics Center for Infectious Disease, USA
- Emerald BioStructures, 7869 NE Day Road West, Bainbridge Island, WA 98110, USA
| | - Peter J. Myler
- Seattle Structural Genomics Center for Infectious Disease, USA
- Seattle Biomedical Research Institute, 307 Westlake Avenue North, Suite 500, Seattle, WA 98109-5219, USA
- Departments of Global Health and Medical Education and Biomedical Informatics, University of Washington, Box 357238, Seattle, WA 98195, USA
| |
Collapse
|
6
|
Abendroth J, Gardberg AS, Robinson JI, Christensen JS, Staker BL, Myler PJ, Stewart LJ, Edwards TE. SAD phasing using iodide ions in a high-throughput structural genomics environment. ACTA ACUST UNITED AC 2011; 12:83-95. [PMID: 21359836 PMCID: PMC3123459 DOI: 10.1007/s10969-011-9101-7] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2010] [Accepted: 02/14/2011] [Indexed: 03/16/2023]
Abstract
The Seattle Structural Genomics Center for Infectious Disease (SSGCID) focuses on the structure elucidation of potential drug targets from class A, B, and C infectious disease organisms. Many SSGCID targets are selected because they have homologs in other organisms that are validated drug targets with known structures. Thus, many SSGCID targets are expected to be solved by molecular replacement (MR), and reflective of this, all proteins are expressed in native form. However, many community request targets do not have homologs with known structures and not all internally selected targets readily solve by MR, necessitating experimental phase determination. We have adopted the use of iodide ion soaks and single wavelength anomalous dispersion (SAD) experiments as our primary method for de novo phasing. This method uses existing native crystals and in house data collection, resulting in rapid, low cost structure determination. Iodide ions are non-toxic and soluble at molar concentrations, facilitating binding at numerous hydrophobic or positively charged sites. We have used this technique across a wide range of crystallization conditions with successful structure determination in 16 of 17 cases within the first year of use (94% success rate). Here we present a general overview of this method as well as several examples including SAD phasing of proteins with novel folds and the combined use of SAD and MR for targets with weak MR solutions. These cases highlight the straightforward and powerful method of iodide ion SAD phasing in a high-throughput structural genomics environment.
Collapse
|