1
|
Ye H, Li T, Rigden DJ, Wei Z. m6ACali: machine learning-powered calibration for accurate m6A detection in MeRIP-Seq. Nucleic Acids Res 2024:gkae280. [PMID: 38634812 DOI: 10.1093/nar/gkae280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 03/18/2024] [Accepted: 04/04/2024] [Indexed: 04/19/2024] Open
Abstract
We present m6ACali, a novel machine-learning framework aimed at enhancing the accuracy of N6-methyladenosine (m6A) epitranscriptome profiling by reducing the impact of non-specific antibody enrichment in MeRIP-Seq. The calibration model serves as a genomic feature-based classifier that refines the identification of m6A sites, distinguishing those genuinely present from those that can be detected in in-vitro transcribed (IVT) control experiments. We find that m6ACali effectively identifies non-specific binding peaks reported by exomePeak2 and MACS2 in novel MeRIP-Seq datasets without the need for paired IVT controls. The model interpretation revealed that off-target antibody binding sites commonly occur at short exons and short mRNAs, originating from high read coverage regions that share the motif sequence with true m6A sites. We also reveal that the ML strategy can efficiently adjust differentially methylated peaks and other antibody-dependent, base-resolution m6A detection techniques. As a result, m6ACali offers a promising method for the universal enhancement of m6A profiles generated by MeRIP-Seq experiments, elevating the benchmark for omics-level m6A data integration.
Collapse
Affiliation(s)
- Haokai Ye
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Tenglong Li
- Wisdom Lake Academy of Pharmacy, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
- Department of Biostatistics, School of Public Health, Boston University, Boston, MA, USA
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Zhen Wei
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Life Course and Medical Sciences, University of Liverpool, L7 8TX Liverpool, UK
| |
Collapse
|
2
|
Saleh H, Liloglou T, Rigden DJ, Parsons JL, Grundy GJ. KH-like Domains in PARP9/DTX3L and PARP14 Coordinate Protein-Protein Interactions to Promote Cancer Cell Survival. J Mol Biol 2024; 436:168434. [PMID: 38182103 DOI: 10.1016/j.jmb.2023.168434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 12/12/2023] [Accepted: 12/30/2023] [Indexed: 01/07/2024]
Abstract
Certain members of the ADP-ribosyltransferase superfamily (ARTD or PARP enzymes) catalyse ADP-ribosylation in response to cellular stress, DNA damage and viral infection and are upregulated in various tumours. PARP9, its binding partner DTX3L and PARP14 protein levels are significantly correlated in head and neck squamous cell carcinoma (HNSCC) and other tumour types though a mechanism where PARP9/DTX3L regulates PARP14 post-transcriptionally. Depleting PARP9, DTX3L or PARP14 expression in HNSCC or HeLa cell lines decreases cell survival through a reduction of proliferation and an increase in apoptosis. A partial rescue of survival was achieved by expressing a PARP14 truncation containing a predicted eukaryotic type I KH domain. KH-like domains were also found in PARP9 and in DTX3L and contributed to protein-protein interactions between PARP9-DTX3L and PARP14-DTX3L. Homodimerization of DTX3L was also coordinated by a KH-like domain and was disrupted by site-specific mutation. Although, cell survival promoted by PARP14 did not require ADP-ribosyltransferase activity, interaction of DTX3L in vitro suppressed PARP14 auto-ADP-ribosylation and promoted trans-ADP-ribosylation of PARP9 and DTX3L. In summary, we characterised PARP9-DTX3L-PARP14 interactions important to pro-survival signalling in HNSCC cells, albeit in PARP14 catalytically independent fashion.
Collapse
Affiliation(s)
- Hadil Saleh
- University of Liverpool, Department of Molecular and Clinical Cancer Medicine, 6 West Derby St, Liverpool L7 8TX, UK
| | - Triantafillos Liloglou
- Edge Hill University, Faculty of Health, Social Care & Medicine, St Helens Road, Ormskirk, Lancashire L39 4QP, UK
| | - Daniel J Rigden
- University of Liverpool, Department of Biochemistry, Cell and Systems Biology, Liverpool L69 7ZB, UK
| | - Jason L Parsons
- University of Birmingham, Institute of Cancer and Genomic Sciences, IBR West, Birmingham B15 2TT, UK
| | - Gabrielle J Grundy
- University of Liverpool, Department of Molecular and Clinical Cancer Medicine, 6 West Derby St, Liverpool L7 8TX, UK.
| |
Collapse
|
3
|
Glover HL, Mendes M, Gomes-Neto J, Rusilowicz-Jones EV, Rigden DJ, Dittmar G, Urbé S, Clague MJ. Microtubule association of TRIM3 revealed by differential extraction proteomics. J Cell Sci 2024; 137:jcs261522. [PMID: 38149663 PMCID: PMC10917062 DOI: 10.1242/jcs.261522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Accepted: 12/15/2023] [Indexed: 12/28/2023] Open
Abstract
The microtubule network is formed from polymerised tubulin subunits and associating proteins, which govern microtubule dynamics and a diverse array of functions. To identify novel microtubule-binding proteins, we have developed an unbiased biochemical assay, which relies on the selective extraction of cytosolic proteins from U2OS cells, while leaving behind the microtubule network. Candidate proteins are linked to microtubules by their sensitivities to the depolymerising drug nocodazole or the microtubule-stabilising drug taxol, which is quantitated by mass spectrometry. Our approach is benchmarked by co-segregation of tubulin and previously established microtubule-binding proteins. We then identify several novel candidate microtubule-binding proteins, from which we have selected the ubiquitin E3 ligase tripartite motif-containing protein 3 (TRIM3) for further characterisation. We map TRIM3 microtubule binding to its C-terminal NHL-repeat region. We show that TRIM3 is required for the accumulation of acetylated tubulin, following treatment with taxol. Furthermore, loss of TRIM3 partially recapitulates the reduction in nocodazole-resistant microtubules characteristic of α-tubulin acetyltransferase 1 (ATAT1) depletion. These results can be explained by a decrease in ATAT1 following depletion of TRIM3 that is independent of transcription.
Collapse
Affiliation(s)
- Hannah L. Glover
- Department of Biochemistry, Cell and Systems Biology, ISMIB, University of Liverpool, Liverpool L69 3BX, UK
| | - Marta Mendes
- Proteomics of Cellular Signalling, Department of Infection and Immunity,Luxembourg Institute of Health, L-1445 Strassen, Luxembourg
| | - Joana Gomes-Neto
- Department of Biochemistry, Cell and Systems Biology, ISMIB, University of Liverpool, Liverpool L69 3BX, UK
| | - Emma V. Rusilowicz-Jones
- Department of Biochemistry, Cell and Systems Biology, ISMIB, University of Liverpool, Liverpool L69 3BX, UK
| | - Daniel J. Rigden
- Department of Biochemistry, Cell and Systems Biology, ISMIB, University of Liverpool, Liverpool L69 3BX, UK
| | - Gunnar Dittmar
- Proteomics of Cellular Signalling, Department of Infection and Immunity,Luxembourg Institute of Health, L-1445 Strassen, Luxembourg
- Department of Life Sciences and Medicine, University of Luxembourg, 2 Avenue de l'Université, Campus Belval, L-4365 Esch-sur-Alzette, Luxembourg
| | - Sylvie Urbé
- Department of Biochemistry, Cell and Systems Biology, ISMIB, University of Liverpool, Liverpool L69 3BX, UK
| | - Michael J. Clague
- Department of Biochemistry, Cell and Systems Biology, ISMIB, University of Liverpool, Liverpool L69 3BX, UK
| |
Collapse
|
4
|
Rigden DJ, Fernández XM. The 2024 Nucleic Acids Research database issue and the online molecular biology database collection. Nucleic Acids Res 2024; 52:D1-D9. [PMID: 38035367 PMCID: PMC10767945 DOI: 10.1093/nar/gkad1173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 11/23/2023] [Indexed: 12/02/2023] Open
Abstract
The 2024 Nucleic Acids Research database issue contains 180 papers from across biology and neighbouring disciplines. There are 90 papers reporting on new databases and 83 updates from resources previously published in the Issue. Updates from databases most recently published elsewhere account for a further seven. Nucleic acid databases include the new NAKB for structural information and updates from Genbank, ENA, GEO, Tarbase and JASPAR. The Issue's Breakthrough Article concerns NMPFamsDB for novel prokaryotic protein families and the AlphaFold Protein Structure Database has an important update. Metabolism is covered by updates from Reactome, Wikipathways and Metabolights. Microbes are covered by RefSeq, UNITE, SPIRE and P10K; viruses by ViralZone and PhageScope. Medically-oriented databases include the familiar COSMIC, Drugbank and TTD. Genomics-related resources include Ensembl, UCSC Genome Browser and Monarch. New arrivals cover plant imaging (OPIA and PlantPAD) and crop plants (SoyMD, TCOD and CropGS-Hub). The entire Database Issue is freely available online on the Nucleic Acids Research website (https://academic.oup.com/nar). Over the last year the NAR online Molecular Biology Database Collection has been updated, reviewing 1060 entries, adding 97 new resources and eliminating 388 discontinued URLs bringing the current total to 1959 databases. It is available at http://www.oxfordjournals.org/nar/database/c/.
Collapse
Affiliation(s)
- Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK
| | | |
Collapse
|
5
|
Liang Z, Ye H, Ma J, Wei Z, Wang Y, Zhang Y, Huang D, Song B, Meng J, Rigden DJ, Chen K. m6A-Atlas v2.0: updated resources for unraveling the N6-methyladenosine (m6A) epitranscriptome among multiple species. Nucleic Acids Res 2024; 52:D194-D202. [PMID: 37587690 PMCID: PMC10768109 DOI: 10.1093/nar/gkad691] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 08/02/2023] [Accepted: 08/10/2023] [Indexed: 08/18/2023] Open
Abstract
N 6-Methyladenosine (m6A) is one of the most abundant internal chemical modifications on eukaryote mRNA and is involved in numerous essential molecular functions and biological processes. To facilitate the study of this important post-transcriptional modification, we present here m6A-Atlas v2.0, an updated version of m6A-Atlas. It was expanded to include a total of 797 091 reliable m6A sites from 13 high-resolution technologies and two single-cell m6A profiles. Additionally, three methods (exomePeaks2, MACS2 and TRESS) were used to identify >16 million m6A enrichment peaks from 2712 MeRIP-seq experiments covering 651 conditions in 42 species. Quality control results of MeRIP-seq samples were also provided to help users to select reliable peaks. We also estimated the condition-specific quantitative m6A profiles (i.e. differential methylation) under 172 experimental conditions for 19 species. Further, to provide insights into potential functional circuitry, the m6A epitranscriptomics were annotated with various genomic features, interactions with RNA-binding proteins and microRNA, potentially linked splicing events and single nucleotide polymorphisms. The collected m6A sites and their functional annotations can be freely queried and downloaded via a user-friendly graphical interface at: http://rnamd.org/m6a.
Collapse
Affiliation(s)
- Zhanmin Liang
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, Fujian 350004, China
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Haokai Ye
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, Liverpool, UK
| | - Jiongming Ma
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, Fujian 350004, China
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, Liverpool, UK
| | - Zhen Wei
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool L69 7ZB, UK
| | - Yue Wang
- Department of Mathematical Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Department of Computer Science, University of Liverpool, Liverpool L69 7ZB, UK
| | - Yuxin Zhang
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, Fujian 350004, China
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, Liverpool, UK
| | - Daiyun Huang
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Department of Computer Science, University of Liverpool, Liverpool L69 7ZB, UK
| | - Bowen Song
- Department of Public Health, School of Medicine & Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Jia Meng
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- AI University Research Centre, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, Liverpool, UK
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, Liverpool, UK
| | - Kunqi Chen
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, Fujian 350004, China
| |
Collapse
|
6
|
Wang X, Zhang Y, Chen K, Liang Z, Ma J, Xia R, de Magalhães JP, Rigden DJ, Meng J, Song B. m7GHub V2.0: an updated database for decoding the N7-methylguanosine (m7G) epitranscriptome. Nucleic Acids Res 2024; 52:D203-D212. [PMID: 37811871 PMCID: PMC10767970 DOI: 10.1093/nar/gkad789] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 08/18/2023] [Accepted: 09/18/2023] [Indexed: 10/10/2023] Open
Abstract
With recent progress in mapping N7-methylguanosine (m7G) RNA methylation sites, tens of thousands of experimentally validated m7G sites have been discovered in various species, shedding light on the significant role of m7G modification in regulating numerous biological processes including disease pathogenesis. An integrated resource that enables the sharing, annotation and customized analysis of m7G data will greatly facilitate m7G studies under various physiological contexts. We previously developed the m7GHub database to host mRNA m7G sites identified in the human transcriptome. Here, we present m7GHub v.2.0, an updated resource for a comprehensive collection of m7G modifications in various types of RNA across multiple species: an m7GDB database containing 430 898 putative m7G sites identified in 23 species, collected from both widely applied next-generation sequencing (NGS) and the emerging Oxford Nanopore direct RNA sequencing (ONT) techniques; an m7GDiseaseDB hosting 156 206 m7G-associated variants (involving addition or removal of an m7G site), including 3238 disease-relevant m7G-SNPs that may function through epitranscriptome disturbance; and two enhanced analysis modules to perform interactive analyses on the collections of m7G sites (m7GFinder) and functional variants (m7GSNPer). We expect that m7Ghub v.2.0 should serve as a valuable centralized resource for studying m7G modification. It is freely accessible at: www.rnamd.org/m7GHub2.
Collapse
Affiliation(s)
- Xuan Wang
- Department of Public Health, School of Medicine & Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, 215123, China
| | - Yuxin Zhang
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX, Liverpool, UK
| | - Kunqi Chen
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350004, China
| | - Zhanmin Liang
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, 215123, China
| | - Jiongming Ma
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX, Liverpool, UK
| | - Rong Xia
- Department of Financial and Actuarial Mathematics, Xi’an Jiaotong-Liverpool University, Suzhou, 215123, China
| | | | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX, Liverpool, UK
| | - Jia Meng
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX, Liverpool, UK
- AI University Research Centre, Xi’an Jiaotong-Liverpool University, Suzhou 215123, China
| | - Bowen Song
- Department of Public Health, School of Medicine & Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
| |
Collapse
|
7
|
Simpkin AJ, Mesdaghi S, Sánchez Rodríguez F, Elliott L, Murphy DL, Kryshtafovych A, Keegan RM, Rigden DJ. Tertiary structure assessment at CASP15. Proteins 2023; 91:1616-1635. [PMID: 37746927 PMCID: PMC10792517 DOI: 10.1002/prot.26593] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 08/25/2023] [Accepted: 09/07/2023] [Indexed: 09/26/2023]
Abstract
The results of tertiary structure assessment at CASP15 are reported. For the first time, recognizing the outstanding performance of AlphaFold 2 (AF2) at CASP14, all single-chain predictions were assessed together, irrespective of whether a template was available. At CASP15, there was no single stand-out group, with most of the best-scoring groups-led by PEZYFoldings, UM-TBM, and Yang Server-employing AF2 in one way or another. Many top groups paid special attention to generating deep Multiple Sequence Alignments (MSAs) and testing variant MSAs, thereby allowing them to successfully address some of the hardest targets. Such difficult targets, as well as lacking templates, were typically proteins with few homologues. Local divergence between prediction and target correlated with localization at crystal lattice or chain interfaces, and with regions exhibiting high B-factor factors in crystal structure targets, and should not necessarily be considered as representing error in the prediction. However, analysis of exposed and buried side chain accuracy showed room for improvement even in the latter. Nevertheless, a majority of groups produced high-quality predictions for most targets, which are valuable for experimental structure determination, functional analysis, and many other tasks across biology. These include those applying methods similar to those used to generate major resources such as the AlphaFold Protein Structure Database and the ESM Metagenomic atlas: the confidence estimates of the former were also notably accurate.
Collapse
Affiliation(s)
- Adam J. Simpkin
- Department of Biochemistry, Cell and Systems BiologyInstitute of Structural, Molecular and Integrative Biology, University of LiverpoolLiverpoolUK
| | - Shahram Mesdaghi
- Department of Biochemistry, Cell and Systems BiologyInstitute of Structural, Molecular and Integrative Biology, University of LiverpoolLiverpoolUK
- Computational Biology Facility, MerseyBio, University of LiverpoolLiverpoolUK
| | - Filomeno Sánchez Rodríguez
- Department of Biochemistry, Cell and Systems BiologyInstitute of Structural, Molecular and Integrative Biology, University of LiverpoolLiverpoolUK
- Life Science, Diamond Light Source, Harwell Science and Innovation CampusOxfordshireUK
- Department of Chemistry, York Structural Biology LaboratoryUniversity of YorkYorkUK
| | - Luc Elliott
- Department of Biochemistry, Cell and Systems BiologyInstitute of Structural, Molecular and Integrative Biology, University of LiverpoolLiverpoolUK
| | - David L. Murphy
- Department of Biochemistry, Cell and Systems BiologyInstitute of Structural, Molecular and Integrative Biology, University of LiverpoolLiverpoolUK
| | | | - Ronan M. Keegan
- UKRI‐STFC, Rutherford Appleton Laboratory, Research Complex at HarwellDidcotUK
| | - Daniel J. Rigden
- Department of Biochemistry, Cell and Systems BiologyInstitute of Structural, Molecular and Integrative Biology, University of LiverpoolLiverpoolUK
| |
Collapse
|
8
|
Das R, Kretsch RC, Simpkin AJ, Mulvaney T, Pham P, Rangan R, Bu F, Keegan RM, Topf M, Rigden DJ, Miao Z, Westhof E. Assessment of three-dimensional RNA structure prediction in CASP15. Proteins 2023; 91:1747-1770. [PMID: 37876231 PMCID: PMC10841292 DOI: 10.1002/prot.26602] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 08/21/2023] [Accepted: 09/07/2023] [Indexed: 10/26/2023]
Abstract
The prediction of RNA three-dimensional structures remains an unsolved problem. Here, we report assessments of RNA structure predictions in CASP15, the first CASP exercise that involved RNA structure modeling. Forty-two predictor groups submitted models for at least one of twelve RNA-containing targets. These models were evaluated by the RNA-Puzzles organizers and, separately, by a CASP-recruited team using metrics (GDT, lDDT) and approaches (Z-score rankings) initially developed for assessment of proteins and generalized here for RNA assessment. The two assessments independently ranked the same predictor groups as first (AIchemy_RNA2), second (Chen), and third (RNAPolis and GeneSilico, tied); predictions from deep learning approaches were significantly worse than these top ranked groups, which did not use deep learning. Further analyses based on direct comparison of predicted models to cryogenic electron microscopy (cryo-EM) maps and x-ray diffraction data support these rankings. With the exception of two RNA-protein complexes, models submitted by CASP15 groups correctly predicted the global fold of the RNA targets. Comparisons of CASP15 submissions to designed RNA nanostructures as well as molecular replacement trials highlight the potential utility of current RNA modeling approaches for RNA nanotechnology and structural biology, respectively. Nevertheless, challenges remain in modeling fine details such as noncanonical pairs, in ranking among submitted models, and in prediction of multiple structures resolved by cryo-EM or crystallography.
Collapse
Affiliation(s)
- Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, CA USA
- Biophysics Program, Stanford University School of Medicine, CA USA
- Howard Hughes Medical Institute, Stanford University, CA USA
| | | | - Adam J. Simpkin
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, UK
| | - Thomas Mulvaney
- Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV), Hamburg, Germany
- University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| | - Phillip Pham
- Department of Biochemistry, Stanford University School of Medicine, CA USA
| | - Ramya Rangan
- Biophysics Program, Stanford University School of Medicine, CA USA
| | - Fan Bu
- Guangzhou Laboratory, Guangzhou International Bio Island, Guangzhou 510005, China
- Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei 230036, Anhui, China
| | - Ronan M. Keegan
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, UK
- Life Science, Diamond Light Source, Harwell Science, UK
| | - Maya Topf
- Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV), Hamburg, Germany
- University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| | - Daniel J. Rigden
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, UK
| | - Zhichao Miao
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou National Laboratory, Guangzhou Medical University
- Shanghai Key Laboratory of Anesthesiology and Brain Functional Modulation, Clinical Research Center for Anesthesiology and Perioperative Medicine, Translational Research Institute of Brain and Brain-Like Intelligence, Shanghai Fourth People's Hospital, School of Medicine, Tongji University, Shanghai 200434, China
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Université de Strasbourg, F-67084, Strasbourg, France
| |
Collapse
|
9
|
Mulvaney T, Kretsch RC, Elliott L, Beton JG, Kryshtafovych A, Rigden DJ, Das R, Topf M. CASP15 cryo-EM protein and RNA targets: Refinement and analysis using experimental maps. Proteins 2023; 91:1935-1951. [PMID: 37994556 PMCID: PMC10697286 DOI: 10.1002/prot.26644] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 10/25/2023] [Accepted: 11/10/2023] [Indexed: 11/24/2023]
Abstract
CASP assessments primarily rely on comparing predicted coordinates with experimental reference structures. However, experimental structures by their nature are only models themselves-their construction involves a certain degree of subjectivity in interpreting density maps and translating them to atomic coordinates. Here, we directly utilized density maps to evaluate the predictions by employing a method for ranking the quality of protein chain predictions based on their fit into the experimental density. The fit-based ranking was found to correlate well with the CASP assessment scores. Overall, the evaluation against the density map indicated that the models are of high accuracy, and occasionally even better than the reference structure in some regions of the model. Local assessment of predicted side chains in a 1.52 Å resolution map showed that side-chains are sometimes poorly positioned. Additionally, the top 118 predictions associated with 9 protein target reference structures were selected for automated refinement, in addition to the top 40 predictions for 11 RNA targets. For both proteins and RNA, the refinement of CASP15 predictions resulted in structures that are close to the reference target structure. This refinement was successful despite large conformational changes often being required, showing that predictions from CASP-assessed methods could serve as a good starting point for building atomic models in cryo-EM maps for both proteins and RNA. Loop modeling continued to pose a challenge for predictors, and together with the lack of consensus amongst models in these regions suggests that modeling, in combination with model-fit to the density, holds the potential for identifying more flexible regions within the structure.
Collapse
Affiliation(s)
- Thomas Mulvaney
- Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV), Hamburg, Germany
- University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| | - Rachael C Kretsch
- Biophysics Program, Stanford University School of Medicine, California, USA
| | - Luc Elliott
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, Liverpool, UK
| | - Joseph G Beton
- Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV), Hamburg, Germany
| | | | - Daniel J Rigden
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, Liverpool, UK
| | - Rhiju Das
- Biophysics Program, Stanford University School of Medicine, California, USA
- Department of Biochemistry, Stanford University School of Medicine, California, USA
- Howard Hughes Medical Institute, Stanford University, California, USA
| | - Maya Topf
- Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV), Hamburg, Germany
- University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| |
Collapse
|
10
|
Kryshtafovych A, Montelione GT, Rigden DJ, Mesdaghi S, Karaca E, Moult J. Breaking the conformational ensemble barrier: Ensemble structure modeling challenges in CASP15. Proteins 2023; 91:1903-1911. [PMID: 37872703 PMCID: PMC10840738 DOI: 10.1002/prot.26584] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 08/14/2023] [Indexed: 10/25/2023]
Abstract
For the first time, the 2022 CASP (Critical Assessment of Structure Prediction) community experiment included a section on computing multiple conformations for protein and RNA structures. There was full or partial success in reproducing the ensembles for four of the nine targets, an encouraging result. For protein structures, enhanced sampling with variations of the AlphaFold2 deep learning method was by far the most effective approach. One substantial conformational change caused by a single mutation across a complex interface was accurately reproduced. In two other assembly modeling cases, methods succeeded in sampling conformations near to the experimental ones even though environmental factors were not included in the calculations. An experimentally derived flexibility ensemble allowed a single accurate RNA structure model to be identified. Difficulties included how to handle sparse or low-resolution experimental data and the current lack of effective methods for modeling RNA/protein complexes. However, these and other obstacles appear addressable.
Collapse
Affiliation(s)
| | - Gaetano T Montelione
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, New York, USA
| | - Daniel J Rigden
- Institute of Systems, Molecular, and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Shahram Mesdaghi
- Institute of Systems, Molecular, and Integrative Biology, University of Liverpool, Liverpool, UK
- Computational Biology Facility, MerseyBio, University of Liverpool, Liverpool, UK
| | - Ezgi Karaca
- Izmir Biomedicine and Genome Center, Izmir, Turkey
- Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir, Turkey
| | - John Moult
- Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland, USA
| |
Collapse
|
11
|
Kryshtafovych A, Rigden DJ. To split or not to split: CASP15 targets and their processing into tertiary structure evaluation units. Proteins 2023; 91:1558-1570. [PMID: 37254889 PMCID: PMC10687315 DOI: 10.1002/prot.26533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 05/02/2023] [Accepted: 05/18/2023] [Indexed: 06/01/2023]
Abstract
Processing of CASP15 targets into evaluation units (EUs) and assigning them to evolutionary-based prediction classes is presented in this study. The targets were first split into structural domains based on compactness and similarity to other proteins. Models were then evaluated against these domains and their combinations. The domains were joined into larger EUs if predictors' performance on the combined units was similar to that on individual domains. Alternatively, if most predictors performed better on the individual domains, then they were retained as EUs. As a result, 112 evaluation units were created from 77 tertiary structure prediction targets. The EUs were assigned to four prediction classes roughly corresponding to target difficulty categories in previous CASPs: TBM (template-based modeling, easy or hard), FM (free modeling), and the TBM/FM overlap category. More than a third of CASP15 EUs were attributed to the historically most challenging FM class, where homology or structural analogy to proteins of known fold cannot be detected.
Collapse
Affiliation(s)
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| |
Collapse
|
12
|
Das R, Kretsch RC, Simpkin AJ, Mulvaney T, Pham P, Rangan R, Bu F, Keegan RM, Topf M, Rigden DJ, Miao Z, Westhof E. Assessment of three-dimensional RNA structure prediction in CASP15. bioRxiv 2023:2023.04.25.538330. [PMID: 37162955 PMCID: PMC10168427 DOI: 10.1101/2023.04.25.538330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
The prediction of RNA three-dimensional structures remains an unsolved problem. Here, we report assessments of RNA structure predictions in CASP15, the first CASP exercise that involved RNA structure modeling. Forty two predictor groups submitted models for at least one of twelve RNA-containing targets. These models were evaluated by the RNA-Puzzles organizers and, separately, by a CASP-recruited team using metrics (GDT, lDDT) and approaches (Z-score rankings) initially developed for assessment of proteins and generalized here for RNA assessment. The two assessments independently ranked the same predictor groups as first (AIchemy_RNA2), second (Chen), and third (RNAPolis and GeneSilico, tied); predictions from deep learning approaches were significantly worse than these top ranked groups, which did not use deep learning. Further analyses based on direct comparison of predicted models to cryogenic electron microscopy (cryo-EM) maps and X-ray diffraction data support these rankings. With the exception of two RNA-protein complexes, models submitted by CASP15 groups correctly predicted the global fold of the RNA targets. Comparisons of CASP15 submissions to designed RNA nanostructures as well as molecular replacement trials highlight the potential utility of current RNA modeling approaches for RNA nanotechnology and structural biology, respectively. Nevertheless, challenges remain in modeling fine details such as non-canonical pairs, in ranking among submitted models, and in prediction of multiple structures resolved by cryo-EM or crystallography.
Collapse
Affiliation(s)
- Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, CA USA
- Biophysics Program, Stanford University School of Medicine, CA USA
- Howard Hughes Medical Institute, Stanford University, CA USA
| | | | - Adam J. Simpkin
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, UK
| | - Thomas Mulvaney
- Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV)
- University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| | - Phillip Pham
- Department of Biochemistry, Stanford University School of Medicine, CA USA
| | - Ramya Rangan
- Biophysics Program, Stanford University School of Medicine, CA USA
| | - Fan Bu
- Guangzhou Laboratory, Guangzhou International Bio Island, Guangzhou 510005, China
- Division of Life Sciences and Medicine,University of Science and Technology of China, Hefei 230036, Anhui, China
| | - Ronan M. Keegan
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, UK
- Life Science, Diamond Light Source, Harwell Science, UK
| | - Maya Topf
- Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV)
- University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| | - Daniel J. Rigden
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, UK
| | - Zhichao Miao
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou National Laboratory, Guangzhou Medical University
- Shanghai Key Laboratory of Anesthesiology and Brain Functional Modulation, Clinical Research Center for Anesthesiology and Perioperative Medicine, Translational Research Institute of Brain and Brain-Like Intelligence, Shanghai Fourth People’s Hospital, School of Medicine, Tongji University, Shanghai 200434, China
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Université de Strasbourg, F-67084, Strasbourg, France
| |
Collapse
|
13
|
Simpkin AJ, Caballero I, McNicholas S, Stevenson K, Jiménez E, Sánchez Rodríguez F, Fando M, Uski V, Ballard C, Chojnowski G, Lebedev A, Krissinel E, Usón I, Rigden DJ, Keegan RM. Predicted models and CCP4. Acta Crystallogr D Struct Biol 2023; 79:806-819. [PMID: 37594303 PMCID: PMC10478639 DOI: 10.1107/s2059798323006289] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 07/19/2023] [Indexed: 08/19/2023] Open
Abstract
In late 2020, the results of CASP14, the 14th event in a series of competitions to assess the latest developments in computational protein structure-prediction methodology, revealed the giant leap forward that had been made by Google's Deepmind in tackling the prediction problem. The level of accuracy in their predictions was the first instance of a competitor achieving a global distance test score of better than 90 across all categories of difficulty. This achievement represents both a challenge and an opportunity for the field of experimental structural biology. For structure determination by macromolecular X-ray crystallography, access to highly accurate structure predictions is of great benefit, particularly when it comes to solving the phase problem. Here, details of new utilities and enhanced applications in the CCP4 suite, designed to allow users to exploit predicted models in determining macromolecular structures from X-ray diffraction data, are presented. The focus is mainly on applications that can be used to solve the phase problem through molecular replacement.
Collapse
Affiliation(s)
- Adam J. Simpkin
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Iracema Caballero
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona, Spain
| | - Stuart McNicholas
- York Structural Biology Laboratory, Department of Chemistry, The University of York, York YO10 5DD, United Kingdom
| | - Kyle Stevenson
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Elisabet Jiménez
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona, Spain
| | - Filomeno Sánchez Rodríguez
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
- York Structural Biology Laboratory, Department of Chemistry, The University of York, York YO10 5DD, United Kingdom
| | - Maria Fando
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Ville Uski
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Charles Ballard
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Grzegorz Chojnowski
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - Andrey Lebedev
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Eugene Krissinel
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Isabel Usón
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona, Spain
- ICREA, Institució Catalana de Recerca i Estudis Avançats, Passeig Lluís Companys 23, 08003 Barcelona, Spain
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Ronan M. Keegan
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| |
Collapse
|
14
|
Mesdaghi S, Price RM, Madine J, Rigden DJ. Deep Learning-based structure modelling illuminates structure and function in uncharted regions of β-solenoid fold space. J Struct Biol 2023; 215:108010. [PMID: 37544372 DOI: 10.1016/j.jsb.2023.108010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 07/19/2023] [Accepted: 08/03/2023] [Indexed: 08/08/2023]
Abstract
Repeat proteins are common in all domains of life and exhibit a wide range of functions. One class of repeat protein contains solenoid folds where the repeating unit consists of β-strands separated by tight turns. β-solenoids have distinguishing structural features such as handedness, twist, oligomerisation state, coil shape and size which give rise to their diversity. Characterised β-solenoid repeat proteins are known to form regions in bacterial and viral virulence factors, antifreeze proteins and functional amyloids. For many of these proteins, the experimental structure has not been solved, as they are difficult to crystallise or model. Here we use various deep learning-based structure-modelling methods to discover novel predicted β-solenoids, perform structural database searches to mine further structural neighbours and relate their predicted structure to possible functions. We find both eukaryotic and prokaryotic adhesins, confirming a known functional linkage between adhesin function and the β-solenoid fold. We further identify exceptionally long, flat β-solenoid folds as possible structures of mucin tandem repeat regions and unprecedentedly small β-solenoid structures. Additionally, we characterise a novel β-solenoid coil shape, the FapC Greek key β-solenoid as well as plausible complexes between it and other proteins involved in Pseudomonas functional amyloid fibres.
Collapse
Affiliation(s)
- Shahram Mesdaghi
- The University of Liverpool, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7ZB, United Kingdom; Computational Biology Facility, MerseyBio, University of Liverpool, Crown Street, Liverpool L69 7ZB, United Kingdom
| | - Rebecca M Price
- The University of Liverpool, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7ZB, United Kingdom
| | - Jillian Madine
- The University of Liverpool, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7ZB, United Kingdom.
| | - Daniel J Rigden
- The University of Liverpool, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7ZB, United Kingdom.
| |
Collapse
|
15
|
Ward LC, Goulding E, Rigden DJ, Allan FE, Pellis A, Hatton H, Guebitz GM, Salcedo‐Sora JE, Carnell AJ. Engineering a Carboxyl Methyltransferase for the Formation of a Furan-Based Bioplastic Precursor. ChemSusChem 2023; 16:e202300516. [PMID: 37067062 PMCID: PMC10946451 DOI: 10.1002/cssc.202300516] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 04/15/2023] [Accepted: 04/16/2023] [Indexed: 06/19/2023]
Abstract
FtpM from Aspergillus fumigatus was the first carboxyl methyltransferase reported to catalyse the dimethylation of dicarboxylic acids. Here the creation of mutant R166M that can catalyse the quantitative conversion of bio-derived 2,5-furandicarboxylic acid (FDCA) to its dimethyl ester (FDME), a bioplastics precursor, was reported. Wild type FtpM gave low conversion due to its reduced catalytic efficiency for the second methylation step. An AlphaFold 2 model revealed a highly electropositive active site, due to the presence of 4 arginine residues, postulated to favour the binding of the dicarboxylic acid over the intermediate monoester. The R166M mutation improved both binding and turnover of the monoester to permit near quantitative conversion to the target dimethyl ester product. The mutant also had improved activity for other diacids and a range of monoacids. R166M was incorporated into 2 multienzyme cascades for the synthesis of the bioplastics precursor FDME from bioderived 5-hydroxymethylfurfural (HMF) as well as from poly(ethylene furanoate) (PEF) plastic, demonstrating the potential to recycle waste plastic.
Collapse
Affiliation(s)
- Lucy C. Ward
- Department of ChemistryUniversity of LiverpoolCrown StreetLiverpoolL69 7ZDUnited Kingdom
| | - Ellie Goulding
- Department of ChemistryUniversity of LiverpoolCrown StreetLiverpoolL69 7ZDUnited Kingdom
| | - Daniel J. Rigden
- Institute of SystemsMolecular and Integrative BiologyUniversity of LiverpoolCrown StreetLiverpoolL69 7ZBUnited Kingdom
| | - Faye E. Allan
- Department of ChemistryUniversity of LiverpoolCrown StreetLiverpoolL69 7ZDUnited Kingdom
| | - Alessandro Pellis
- Department of Chemistry and Industrial ChemistryUniversity of Genovavia Dodecaneso 3116146GenovaItaly
| | - Harry Hatton
- Department of ChemistryUniversity of LiverpoolCrown StreetLiverpoolL69 7ZDUnited Kingdom
| | - Georg M. Guebitz
- Institute of Environmental Biotechnology, Department of Agrobiotechnology, IFA-TullnUniversity of Natural Resources and Life Sciences ViennaKonrad Lorenz Strasse 203430TullnAustria
- Austrian Centre of Industrial BiotechnologyKonrad Lorenz Strasse 203430TullnAustria
| | | | - Andrew J. Carnell
- Department of ChemistryUniversity of LiverpoolCrown StreetLiverpoolL69 7ZDUnited Kingdom
| |
Collapse
|
16
|
Mulvaney T, Kretsch RC, Elliott L, Beton J, Kryshtafovych A, Rigden DJ, Das R, Topf M. CASP15 cryoEM protein and RNA targets: refinement and analysis using experimental maps. bioRxiv 2023:2023.08.07.552287. [PMID: 37609268 PMCID: PMC10441278 DOI: 10.1101/2023.08.07.552287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
CASP assessments primarily rely on comparing predicted coordinates with experimental reference structures. However, errors in the reference structures can potentially reduce the accuracy of the assessment. This issue is particularly prominent in cryoEM-determined structures, and therefore, in the assessment of CASP15 cryoEM targets, we directly utilized density maps to evaluate the predictions. A method for ranking the quality of protein chain predictions based on rigid fitting to experimental density was found to correlate well with the CASP assessment scores. Overall, the evaluation against the density map indicated that the models are of high accuracy although local assessment of predicted side chains in a 1.52 Å resolution map showed that side-chains are sometimes poorly positioned. The top 136 predictions associated with 9 protein target reference structures were selected for refinement, in addition to the top 40 predictions for 11 RNA targets. To this end, we have developed an automated hierarchical refinement pipeline in cryoEM maps. For both proteins and RNA, the refinement of CASP15 predictions resulted in structures that are close to the reference target structure, including some regions with better fit to the density. This refinement was successful despite large conformational changes and secondary structure element movements often being required, suggesting that predictions from CASP-assessed methods could serve as a good starting point for building atomic models in cryoEM maps for both proteins and RNA. Loop modeling continued to pose a challenge for predictors with even short loops failing to be accurately modeled or refined at times. The lack of consensus amongst models suggests that modeling holds the potential for identifying more flexible regions within the structure.
Collapse
|
17
|
Song B, Huang D, Zhang Y, Wei Z, Su J, Pedro de Magalhães J, Rigden DJ, Meng J, Chen K. m6A-TSHub: Unveiling the Context-specific m 6A Methylation and m 6A-affecting Mutations in 23 Human Tissues. Genomics Proteomics Bioinformatics 2023; 21:678-694. [PMID: 36096444 PMCID: PMC10787194 DOI: 10.1016/j.gpb.2022.09.001] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Revised: 08/19/2022] [Accepted: 09/02/2022] [Indexed: 06/15/2023]
Abstract
As the most pervasive epigenetic marker present on mRNAs and long non-coding RNAs (lncRNAs), N6-methyladenosine (m6A) RNA methylation has been shown to participate in essential biological processes. Recent studies have revealed the distinct patterns of m6A methylome across human tissues, and a major challenge remains in elucidating the tissue-specific presence and circuitry of m6A methylation. We present here a comprehensive online platform, m6A-TSHub, for unveiling the context-specific m6A methylation and genetic mutations that potentially regulate m6A epigenetic mark. m6A-TSHub consists of four core components, including (1) m6A-TSDB, a comprehensive database of 184,554 functionally annotated m6A sites derived from 23 human tissues and 499,369 m6A sites from 25 tumor conditions, respectively; (2) m6A-TSFinder, a web server for high-accuracy prediction of m6A methylation sites within a specific tissue from RNA sequences, which was constructed using multi-instance deep neural networks with gated attention; (3) m6A-TSVar, a web server for assessing the impact of genetic variants on tissue-specific m6A RNA modifications; and (4) m6A-CAVar, a database of 587,983 The Cancer Genome Atlas (TCGA) cancer mutations (derived from 27 cancer types) that were predicted to affect m6A modifications in the primary tissue of cancers. The database should make a useful resource for studying the m6A methylome and the genetic factors of epitranscriptome disturbance in a specific tissue (or cancer type). m6A-TSHub is accessible at www.xjtlu.edu.cn/biologicalsciences/m6ats.
Collapse
Affiliation(s)
- Bowen Song
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350004, China; Department of Mathematical Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Daiyun Huang
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Department of Computer Science, University of Liverpool, Liverpool L69 7ZB, United Kingdom.
| | - Yuxin Zhang
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
| | - Zhen Wei
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Institute of Ageing & Chronic Disease, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Jionglong Su
- School of AI and Advanced Computing, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
| | - João Pedro de Magalhães
- Institute of Ageing & Chronic Disease, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Jia Meng
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom; Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; AI University Research Centre, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
| | - Kunqi Chen
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350004, China.
| |
Collapse
|
18
|
Agirre J, Atanasova M, Bagdonas H, Ballard CB, Baslé A, Beilsten-Edmands J, Borges RJ, Brown DG, Burgos-Mármol JJ, Berrisford JM, Bond PS, Caballero I, Catapano L, Chojnowski G, Cook AG, Cowtan KD, Croll TI, Debreczeni JÉ, Devenish NE, Dodson EJ, Drevon TR, Emsley P, Evans G, Evans PR, Fando M, Foadi J, Fuentes-Montero L, Garman EF, Gerstel M, Gildea RJ, Hatti K, Hekkelman ML, Heuser P, Hoh SW, Hough MA, Jenkins HT, Jiménez E, Joosten RP, Keegan RM, Keep N, Krissinel EB, Kolenko P, Kovalevskiy O, Lamzin VS, Lawson DM, Lebedev AA, Leslie AGW, Lohkamp B, Long F, Malý M, McCoy AJ, McNicholas SJ, Medina A, Millán C, Murray JW, Murshudov GN, Nicholls RA, Noble MEM, Oeffner R, Pannu NS, Parkhurst JM, Pearce N, Pereira J, Perrakis A, Powell HR, Read RJ, Rigden DJ, Rochira W, Sammito M, Sánchez Rodríguez F, Sheldrick GM, Shelley KL, Simkovic F, Simpkin AJ, Skubak P, Sobolev E, Steiner RA, Stevenson K, Tews I, Thomas JMH, Thorn A, Valls JT, Uski V, Usón I, Vagin A, Velankar S, Vollmar M, Walden H, Waterman D, Wilson KS, Winn MD, Winter G, Wojdyr M, Yamashita K. The CCP4 suite: integrative software for macromolecular crystallography. Acta Crystallogr D Struct Biol 2023; 79:449-461. [PMID: 37259835 PMCID: PMC10233625 DOI: 10.1107/s2059798323003595] [Citation(s) in RCA: 84] [Impact Index Per Article: 84.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 04/19/2023] [Indexed: 06/02/2023] Open
Abstract
The Collaborative Computational Project No. 4 (CCP4) is a UK-led international collective with a mission to develop, test, distribute and promote software for macromolecular crystallography. The CCP4 suite is a multiplatform collection of programs brought together by familiar execution routines, a set of common libraries and graphical interfaces. The CCP4 suite has experienced several considerable changes since its last reference article, involving new infrastructure, original programs and graphical interfaces. This article, which is intended as a general literature citation for the use of the CCP4 software suite in structure determination, will guide the reader through such transformations, offering a general overview of the new features and outlining future developments. As such, it aims to highlight the individual programs that comprise the suite and to provide the latest references to them for perusal by crystallographers around the world.
Collapse
Affiliation(s)
- Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Mihaela Atanasova
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Haroldas Bagdonas
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Charles B. Ballard
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Arnaud Baslé
- Biosciences Institute, Newcastle University, Newcastle upon Tyne NE2 4HH, United Kingdom
| | - James Beilsten-Edmands
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Rafael J. Borges
- The Center of Medicinal Chemistry (CQMED), Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Av. Dr. André Tosello 550, 13083-886 Campinas, Brazil
| | - David G. Brown
- Laboratoires Servier SAS Institut de Recherches, Croissy-sur-Seine, France
| | - J. Javier Burgos-Mármol
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - John M. Berrisford
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Paul S. Bond
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Iracema Caballero
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
| | - Lucrezia Catapano
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
- Randall Centre for Cell and Molecular Biophysics, Faculty of Life Sciences and Medicine, King’s College London, London SE1 9RT, United Kingdom
| | - Grzegorz Chojnowski
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - Atlanta G. Cook
- The Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, The King’s Buildings, Edinburgh EH9 3BF, United Kingdom
| | - Kevin D. Cowtan
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Tristan I. Croll
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
- Altos Labs, Portway Building, Granta Park, Great Abington, Cambridge CB21 6GP, United Kingdom
| | - Judit É. Debreczeni
- Discovery Sciences, R&D BioPharmaceuticals, AstraZeneca, Darwin Building, Cambridge Science Park, Milton Road, Cambridge CB4 0WG, United Kingdom
| | - Nicholas E. Devenish
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Eleanor J. Dodson
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Tarik R. Drevon
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Paul Emsley
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Gwyndaf Evans
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
- Rosalind Franklin Institute, Harwell Science and Innovation Campus, Didcot OX11 0QS, United Kingdom
| | - Phil R. Evans
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Maria Fando
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - James Foadi
- Department of Mathematical Sciences, University of Bath, Bath, United Kingdom
| | - Luis Fuentes-Montero
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Elspeth F. Garman
- Department of Biochemistry, University of Oxford, Dorothy Crowfoot Hodgkin Building, Oxford OX1 3QU, United Kingdom
| | - Markus Gerstel
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Richard J. Gildea
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Kaushik Hatti
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Maarten L. Hekkelman
- Oncode Institute and Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Philipp Heuser
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Soon Wen Hoh
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Michael A. Hough
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester CO4 3SQ, United Kingdom
| | - Huw T. Jenkins
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Elisabet Jiménez
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
| | - Robbie P. Joosten
- Oncode Institute and Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Ronan M. Keegan
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Nicholas Keep
- Department of Biological Sciences, Institute of Structural and Molecular Biology, Birkbeck College, London WC1E 7HX, United Kingdom
| | - Eugene B. Krissinel
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Petr Kolenko
- Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Břehová 7, 115 19 Prague 1, Czech Republic
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 55, 252 50 Vestec, Czech Republic
| | - Oleg Kovalevskiy
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Victor S. Lamzin
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - David M. Lawson
- Department of Biochemistry and Metabolism, John Innes Centre, Norwich NR4 7UH, United Kingdom
| | - Andrey A. Lebedev
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Andrew G. W. Leslie
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Bernhard Lohkamp
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, SE-171 77 Stockholm, Sweden
| | - Fei Long
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Martin Malý
- Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Břehová 7, 115 19 Prague 1, Czech Republic
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 55, 252 50 Vestec, Czech Republic
- Biological Sciences, Institute for Life Sciences, University of Southampton, Southampton SO17 1BJ, United Kingdom
| | - Airlie J. McCoy
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Stuart J. McNicholas
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Ana Medina
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
| | - Claudia Millán
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - James W. Murray
- Department of Life Sciences, Imperial College London, South Kensington Campus, London SW7 2AZ, United Kingdom
| | - Garib N. Murshudov
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Robert A. Nicholls
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Martin E. M. Noble
- Translational and Clinical Research Institute, Newcastle University, Paul O’Gorman Building, Medical School, Framlington Place, Newcastle upon Tyne NE2 4HH, United Kingdom
| | - Robert Oeffner
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Navraj S. Pannu
- Department of Infectious Diseases, Leiden University Medical Center, PO Box 9600, 2300 RC Leiden, The Netherlands
| | - James M. Parkhurst
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
- Rosalind Franklin Institute, Harwell Science and Innovation Campus, Didcot OX11 0QS, United Kingdom
| | - Nicholas Pearce
- Department of Physics, Chemistry and Biology (IFM), Linköping University, SE-581 83 Linköping, Sweden
| | - Joana Pereira
- Biozentrum and SIB Swiss Institute of Bioinformatics, University of Basel, 4056 Basel, Switzerland
| | - Anastassis Perrakis
- Oncode Institute and Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Harold R. Powell
- Department of Life Sciences, Imperial College London, South Kensington Campus, London SW7 2AZ, United Kingdom
| | - Randy J. Read
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - William Rochira
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Massimo Sammito
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
- Discovery Centre, Biologics Engineering, AstraZeneca, Biomedical Campus, 1 Francis Crick Avenue, Trumpington, Cambridge CB2 0AA, United Kingdom
| | - Filomeno Sánchez Rodríguez
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - George M. Sheldrick
- Department of Structural Chemistry, Georg-August-Universität Göttingen, Tammannstrasse 4, 37077 Göttingen, Germany
| | - Kathryn L. Shelley
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
| | - Felix Simkovic
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Adam J. Simpkin
- Laboratoires Servier SAS Institut de Recherches, Croissy-sur-Seine, France
| | - Pavol Skubak
- Department of Infectious Diseases, Leiden University Medical Center, PO Box 9600, 2300 RC Leiden, The Netherlands
| | - Egor Sobolev
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Roberto A. Steiner
- Randall Centre for Cell and Molecular Biophysics, Faculty of Life Sciences and Medicine, King’s College London, London SE1 9RT, United Kingdom
- Department of Biomedical Sciences, University of Padova, Italy
| | - Kyle Stevenson
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Ivo Tews
- Biological Sciences, Institute for Life Sciences, University of Southampton, Southampton SO17 1BJ, United Kingdom
| | - Jens M. H. Thomas
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Andrea Thorn
- Institute for Nanostructure and Solid State Physics, Universität Hamburg, 22761 Hamburg, Germany
| | - Josep Triviño Valls
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
| | - Ville Uski
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Isabel Usón
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
- ICREA, Institució Catalana de Recerca i Estudis Avançats, Passeig Lluís Companys 23, 08003 Barcelona, Spain
| | - Alexei Vagin
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Melanie Vollmar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Helen Walden
- School of Molecular Biosciences, College of Medical Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| | - David Waterman
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Keith S. Wilson
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Martyn D. Winn
- Scientific Computing Department, Science and Technology Facilities Council, Didcot OX11 0FA, United Kingdom
| | - Graeme Winter
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Marcin Wojdyr
- Global Phasing Limited (United Kingdom), Sheraton House, Castle Park, Cambridge CB3 0AX, United Kingdom
| | - Keitaro Yamashita
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
19
|
Dusi DMA, Alves ER, Cabral GB, Mello LV, Rigden DJ, Silveira ÉD, Alves-Ferreira M, Guimarães LA, Gomes ACMM, Rodrigues JCM, Carneiro VTC. An exonuclease V homologue is expressed predominantly during early megasporogenesis in apomictic Brachiaria brizantha. Planta 2023; 258:5. [PMID: 37219749 DOI: 10.1007/s00425-023-04162-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 05/15/2023] [Indexed: 05/24/2023]
Abstract
MAIN CONCLUSION An exonuclease V homologue from apomictic Brachiaria brizantha is expressed and localized in nucellar cells at key moments when these cells differentiate to give rise to unreduced gametophytes. Brachiaria is a genus of forage grasses with economical and agricultural importance to Brazil. Brachiaria reproduces by aposporic apomixis, in which unreduced embryo sacs, derived from nucellar cells, other than the megaspore mother cell (MMC), are formed. The unreduced embryo sacs produce an embryo without fertilization resulting in clones of the mother plant. Comparative gene expression analysis in ovaries of sexual and apomictic Brachiaria spp. revealed a sequence from B. brizantha that showed a distinct pattern of expression in ovaries of sexual and apomictic plants. In this work, we describe a gene named BbrizExoV with strong identity to exonuclease V (Exo V) genes from other grasses. Sequence analysis in signal prediction tools showed that BbrizExoV might have dual localization, depending on the translation point. A longer form to the nucleus and a shorter form which would be directed to the chloroplast. This is also the case for monocot sequences analyzed from other species. The long form of BbrizExoV protein localizes to the nucleus of onion epidermal cells. Analysis of ExoV proteins from dicot species, with exception of Arabidopsis thaliana ExoVL protein, showed only one localization. Using a template-based AlphaFold 2 modelling approach the structure of BbrizExoV in complex with metal and ssDNA was predicted based on the holo structure of the human counterpart. Features predicted to define ssDNA binding but a lack of sequence specificity are shared between the human enzyme and BbrizExoV. Expression analyses indicated the precise site and timing of transcript accumulation during ovule development, which coincides with the differentiation of nucelar cells to form the typical aposporic four-celled unreduced gametophyte. A putative function for this protein is proposed based on its homology and expression pattern.
Collapse
Affiliation(s)
- Diva M A Dusi
- Brazilian Agricultural Research Corporation (Embrapa), Embrapa Genetic Resources and Biotechnology, Cx. Postal 02372, Brasilia, DF, 70770-917, Brazil
| | - Elizângela R Alves
- Brazilian Agricultural Research Corporation (Embrapa), Embrapa Genetic Resources and Biotechnology, Cx. Postal 02372, Brasilia, DF, 70770-917, Brazil
- Department of Celular Biology, University of Brasilia, Brasília, DF, 70910-900, Brazil
| | - Gláucia B Cabral
- Brazilian Agricultural Research Corporation (Embrapa), Embrapa Genetic Resources and Biotechnology, Cx. Postal 02372, Brasilia, DF, 70770-917, Brazil
| | - Luciane V Mello
- School of Life Sciences, University of Liverpool, Crown St, Liverpool, L69 7ZB, UK
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Crown St, Liverpool, L69 7ZB, UK
| | - Érica D Silveira
- Brazilian Agricultural Research Corporation (Embrapa), Embrapa Genetic Resources and Biotechnology, Cx. Postal 02372, Brasilia, DF, 70770-917, Brazil
- Department of Genetics, Universidade Federal do Rio de Janeiro, Av. Prof. Rodolpho Paulo Rocco, s/n Prédio do CCS Instituto de Biologia, Rio de Janeiro, RJ, Brazil
| | - Márcio Alves-Ferreira
- Department of Genetics, Universidade Federal do Rio de Janeiro, Av. Prof. Rodolpho Paulo Rocco, s/n Prédio do CCS Instituto de Biologia, Rio de Janeiro, RJ, Brazil
| | - Larissa A Guimarães
- Brazilian Agricultural Research Corporation (Embrapa), Embrapa Genetic Resources and Biotechnology, Cx. Postal 02372, Brasilia, DF, 70770-917, Brazil
- Department of Celular Biology, University of Brasilia, Brasília, DF, 70910-900, Brazil
| | - Ana Cristina M M Gomes
- Brazilian Agricultural Research Corporation (Embrapa), Embrapa Genetic Resources and Biotechnology, Cx. Postal 02372, Brasilia, DF, 70770-917, Brazil
| | - Júlio C M Rodrigues
- Brazilian Agricultural Research Corporation (Embrapa), Embrapa Genetic Resources and Biotechnology, Cx. Postal 02372, Brasilia, DF, 70770-917, Brazil.
| | - Vera T C Carneiro
- Brazilian Agricultural Research Corporation (Embrapa), Embrapa Genetic Resources and Biotechnology, Cx. Postal 02372, Brasilia, DF, 70770-917, Brazil.
| |
Collapse
|
20
|
Abstract
The 2023 Nucleic Acids Research Database Issue contains 178 papers ranging across biology and related fields. There are 90 papers reporting on new databases and 82 updates from resources previously published in the Issue. Six more papers are updates from databases most recently published elsewhere. Major nucleic acid databases reporting updates include Genbank, ENA, ChIPBase, JASPAR, mirDIP and the Issue's first Breakthrough Article, NACDDB for Circular Dichroism data. Updates from BMRB and RCSB cover experimental protein structural data while AlphaFold 2 computational structure predictions feature widely. STRING and REBASE are stand-out updates in the signalling and enzymes section. Immunology-related databases include CEDAR, the second Breakthrough Article, for cancer epitopes and receptors alongside returning IPD-IMGT/HLA and the new PGG.MHC. Genomics-related resources include Ensembl, GWAS Central and UCSC Genome Browser. Major returning databases for drugs and their targets include Open Targets, DrugCentral, CTD and Pubchem. The EMPIAR image archive appears in the Issue for the first time. The entire database Issue is freely available online on the Nucleic Acids Research website (https://academic.oup.com/nar). The NAR online Molecular Biology Database Collection has been updated, revisiting 463 entries, adding 92 new resources and eliminating 96 discontinued URLs so bringing the current total to 1764 databases. It is available at http://www.oxfordjournals.org/nar/database/c/.
Collapse
|
21
|
Sánchez Rodríguez F, Chojnowski G, Keegan RM, Rigden DJ. Using deep-learning predictions of inter-residue distances for model validation. Acta Crystallogr D Struct Biol 2022; 78:1412-1427. [PMID: 36458613 PMCID: PMC9716559 DOI: 10.1107/s2059798322010415] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 10/28/2022] [Indexed: 11/27/2022] Open
Abstract
Determination of protein structures typically entails building a model that satisfies the collected experimental observations and its deposition in the Protein Data Bank. Experimental limitations can lead to unavoidable uncertainties during the process of model building, which result in the introduction of errors into the deposited model. Many metrics are available for model validation, but most are limited to consideration of the physico-chemical aspects of the model or its match to the experimental data. The latest advances in the field of deep learning have enabled the increasingly accurate prediction of inter-residue distances, an advance which has played a pivotal role in the recent improvements observed in the field of protein ab initio modelling. Here, new validation methods are presented based on the use of these precise inter-residue distance predictions, which are compared with the distances observed in the protein model. Sequence-register errors are particularly clearly detected and the register shifts required for their correction can be reliably determined. The method is available in the ConKit package (https://www.conkit.org).
Collapse
Affiliation(s)
- Filomeno Sánchez Rodríguez
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom,Life Science, Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Grzegorz Chojnowski
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - Ronan M. Keegan
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom,Correspondence e-mail:
| |
Collapse
|
22
|
Huang D, Chen K, Song B, Wei Z, Su J, Coenen F, de Magalhães JP, Rigden DJ, Meng J. Geographic encoding of transcripts enabled high-accuracy and isoform-aware deep learning of RNA methylation. Nucleic Acids Res 2022; 50:10290-10310. [PMID: 36155798 PMCID: PMC9561283 DOI: 10.1093/nar/gkac830] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 08/26/2022] [Accepted: 09/15/2022] [Indexed: 12/25/2022] Open
Abstract
As the most pervasive epigenetic mark present on mRNA and lncRNA, N6-methyladenosine (m6A) RNA methylation regulates all stages of RNA life in various biological processes and disease mechanisms. Computational methods for deciphering RNA modification have achieved great success in recent years; nevertheless, their potential remains underexploited. One reason for this is that existing models usually consider only the sequence of transcripts, ignoring the various regions (or geography) of transcripts such as 3′UTR and intron, where the epigenetic mark forms and functions. Here, we developed three simple yet powerful encoding schemes for transcripts to capture the submolecular geographic information of RNA, which is largely independent from sequences. We show that m6A prediction models based on geographic information alone can achieve comparable performances to classic sequence-based methods. Importantly, geographic information substantially enhances the accuracy of sequence-based models, enables isoform- and tissue-specific prediction of m6A sites, and improves m6A signal detection from direct RNA sequencing data. The geographic encoding schemes we developed have exhibited strong interpretability, and are applicable to not only m6A but also N1-methyladenosine (m1A), and can serve as a general and effective complement to the widely used sequence encoding schemes in deep learning applications concerning RNA transcripts.
Collapse
Affiliation(s)
- Daiyun Huang
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, PR China.,Department of Computer Sciences, University of Liverpool, Liverpool L69 7ZB, UK
| | - Kunqi Chen
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350004, PR China
| | - Bowen Song
- Department of Mathematical Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, PR China.,Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Zhen Wei
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, PR China.,Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool L69 7ZB, UK
| | - Jionglong Su
- Department of Mathematical Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, PR China.,School of AI and Advanced Computing, Xi'an Jiaotong-Liverpool University, Suzhou 215123, PR China
| | - Frans Coenen
- Department of Computer Sciences, University of Liverpool, Liverpool L69 7ZB, UK
| | - João Pedro de Magalhães
- Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool L69 7ZB, UK
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Jia Meng
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, PR China.,Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK.,AI University Research Centre, Xi'an Jiaotong-Liverpool University, Suzhou 215123, PR China
| |
Collapse
|
23
|
Song B, Wang X, Liang Z, Ma J, Huang D, Wang Y, de Magalhães JP, Rigden DJ, Meng J, Liu G, Chen K, Wei Z. RMDisease V2.0: an updated database of genetic variants that affect RNA modifications with disease and trait implication. Nucleic Acids Res 2022; 51:D1388-D1396. [PMID: 36062570 PMCID: PMC9825452 DOI: 10.1093/nar/gkac750] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 08/02/2022] [Accepted: 08/24/2022] [Indexed: 01/30/2023] Open
Abstract
Recent advances in epitranscriptomics have unveiled functional associations between RNA modifications (RMs) and multiple human diseases, but distinguishing the functional or disease-related single nucleotide variants (SNVs) from the majority of 'silent' variants remains a major challenge. We previously developed the RMDisease database for unveiling the association between genetic variants and RMs concerning human disease pathogenesis. In this work, we present RMDisease v2.0, an updated database with expanded coverage. Using deep learning models and from 873 819 experimentally validated RM sites, we identified a total of 1 366 252 RM-associated variants that may affect (add or remove an RM site) 16 different types of RNA modifications (m6A, m5C, m1A, m5U, Ψ, m6Am, m7G, A-to-I, ac4C, Am, Cm, Um, Gm, hm5C, D and f5C) in 20 organisms (human, mouse, rat, zebrafish, maize, fruit fly, yeast, fission yeast, Arabidopsis, rice, chicken, goat, sheep, pig, cow, rhesus monkey, tomato, chimpanzee, green monkey and SARS-CoV-2). Among them, 14 749 disease- and 2441 trait-associated genetic variants may function via the perturbation of epitranscriptomic markers. RMDisease v2.0 should serve as a useful resource for studying the genetic drivers of phenotypes that lie within the epitranscriptome layer circuitry, and is freely accessible at: www.rnamd.org/rmdisease2.
Collapse
Affiliation(s)
| | | | | | | | - Daiyun Huang
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, 215123, China,Department of Computer Science, University of Liverpool, Liverpool L7 8TX, UK
| | - Yue Wang
- Department of Mathematical Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, 215123, China,Department of Computer Science, University of Liverpool, Liverpool L7 8TX, UK
| | - João Pedro de Magalhães
- Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool L7 8TX, UK
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L7 8TX, UK
| | - Jia Meng
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, 215123, China,Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L7 8TX, UK,AI University Research Centre, Xi’an Jiaotong-Liverpool University, Suzhou, 215123, China
| | - Gang Liu
- Correspondence may also be addressed to Gang Liu.
| | - Kunqi Chen
- Correspondence may also be addressed to Kunqi Chen.
| | - Zhen Wei
- To whom correspondence should be addressed.
| |
Collapse
|
24
|
Krissinel E, Lebedev AA, Uski V, Ballard CB, Keegan RM, Kovalevskiy O, Nicholls RA, Pannu NS, Skubák P, Berrisford J, Fando M, Lohkamp B, Wojdyr M, Simpkin AJ, Thomas JMH, Oliver C, Vonrhein C, Chojnowski G, Basle A, Purkiss A, Isupov MN, McNicholas S, Lowe E, Triviño J, Cowtan K, Agirre J, Rigden DJ, Uson I, Lamzin V, Tews I, Bricogne G, Leslie AGW, Brown DG. CCP4 Cloud for structure determination and project management in macromolecular crystallography. Acta Crystallogr D Struct Biol 2022; 78:1079-1089. [PMID: 36048148 PMCID: PMC9435598 DOI: 10.1107/s2059798322007987] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 08/08/2022] [Indexed: 11/10/2022] Open
Abstract
Nowadays, progress in the determination of three-dimensional macromolecular structures from diffraction images is achieved partly at the cost of increasing data volumes. This is due to the deployment of modern high-speed, high-resolution detectors, the increased complexity and variety of crystallographic software, the use of extensive databases and high-performance computing. This limits what can be accomplished with personal, offline, computing equipment in terms of both productivity and maintainability. There is also an issue of long-term data maintenance and availability of structure-solution projects as the links between experimental observations and the final results deposited in the PDB. In this article, CCP4 Cloud, a new front-end of the CCP4 software suite, is presented which mitigates these effects by providing an online, cloud-based environment for crystallographic computation. CCP4 Cloud was developed for the efficient delivery of computing power, database services and seamless integration with web resources. It provides a rich graphical user interface that allows project sharing and long-term storage for structure-solution projects, and can be linked to data-producing facilities. The system is distributed with the CCP4 software suite version 7.1 and higher, and an online publicly available instance of CCP4 Cloud is provided by CCP4.
Collapse
|
25
|
Simpkin AJ, Thomas JMH, Keegan RM, Rigden DJ. MrParse: finding homologues in the PDB and the EBI AlphaFold database for molecular replacement and more. Acta Crystallogr D Struct Biol 2022; 78:553-559. [PMID: 35503204 PMCID: PMC9063843 DOI: 10.1107/s2059798322003576] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 03/29/2022] [Indexed: 11/10/2022]
Abstract
Crystallographers have an array of search-model options for structure solution by molecular replacement (MR). The well established options of homologous experimental structures and regular secondary-structure elements or motifs are increasingly supplemented by computational modelling. Such modelling may be carried out locally or may use pre-calculated predictions retrieved from databases such as the EBI AlphaFold database. MrParse is a new pipeline to help to streamline the decision process in MR by consolidating bioinformatic predictions in one place. When reflection data are provided, MrParse can rank any experimental homologues found using eLLG, which indicates the likelihood that a given search model will work in MR. Inbuilt displays of predicted secondary structure, coiled-coil and transmembrane regions further inform the choice of MR protocol. MrParse can also identify and rank homologues in the EBI AlphaFold database, a function that will also interest other structural biologists and bioinformaticians.
Collapse
|
26
|
Ward LC, McCue HV, Rigden DJ, Kershaw NM, Ashbrook C, Hatton H, Goulding E, Johnson JR, Carnell AJ. Carboxyl Methyltransferase Catalysed Formation of Mono- and Dimethyl Esters under Aqueous Conditions: Application in Cascade Biocatalysis. Angew Chem Int Ed Engl 2022; 61:e202117324. [PMID: 35138660 PMCID: PMC9307002 DOI: 10.1002/anie.202117324] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Indexed: 11/10/2022]
Abstract
Carboxyl methyltransferase (CMT) enzymes catalyse the biomethylation of carboxylic acids under aqueous conditions and have potential for use in synthetic enzyme cascades. Herein we report that the enzyme FtpM from Aspergillus fumigatus can methylate a broad range of aromatic mono- and dicarboxylic acids in good to excellent conversions. The enzyme shows high regioselectivity on its natural substrate fumaryl-l-tyrosine, trans, trans-muconic acid and a number of the dicarboxylic acids tested. Dicarboxylic acids are generally better substrates than monocarboxylic acids, although some substituents are able to compensate for the absence of a second acid group. For dicarboxylic acids, the second methylation shows strong pH dependency with an optimum at pH 5.5-6. Potential for application in industrial biotechnology was demonstrated in a cascade for the production of a bioplastics precursor (FDME) from bioderived 5-hydroxymethylfurfural (HMF).
Collapse
Affiliation(s)
- Lucy C Ward
- Department of Chemistry, University of Liverpool, Crown Street, Liverpool, L69 7ZD, UK
| | - Hannah V McCue
- GeneMill, Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
| | - Neil M Kershaw
- Department of Chemistry, University of Liverpool, Crown Street, Liverpool, L69 7ZD, UK
| | - Chloe Ashbrook
- Department of Chemistry, University of Liverpool, Crown Street, Liverpool, L69 7ZD, UK
| | - Harry Hatton
- Department of Chemistry, University of Liverpool, Crown Street, Liverpool, L69 7ZD, UK
| | - Ellie Goulding
- Department of Chemistry, University of Liverpool, Crown Street, Liverpool, L69 7ZD, UK
| | - James R Johnson
- GeneMill, Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool, L69 7ZB, UK
| | - Andrew J Carnell
- Department of Chemistry, University of Liverpool, Crown Street, Liverpool, L69 7ZD, UK
| |
Collapse
|
27
|
Ward LC, McCue HV, Rigden DJ, Kershaw NM, Ashbrook C, Hatton H, Goulding E, Johnson JR, Carnell AJ. Carboxyl Methyltransferase Catalysed Formation of Mono‐ and Dimethyl Esters under Aqueous Conditions: Application in Cascade Biocatalysis. Angew Chem Int Ed Engl 2022. [DOI: 10.1002/ange.202117324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Lucy C. Ward
- Department of Chemistry University of Liverpool Crown Street Liverpool L69 7ZD UK
| | - Hannah V. McCue
- GeneMill, Institute of Integrative Biology University of Liverpool Crown Street Liverpool L69 7ZB UK
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology University of Liverpool Crown Street Liverpool L69 7ZB UK
| | - Neil M. Kershaw
- Department of Chemistry University of Liverpool Crown Street Liverpool L69 7ZD UK
| | - Chloe Ashbrook
- Department of Chemistry University of Liverpool Crown Street Liverpool L69 7ZD UK
| | - Harry Hatton
- Department of Chemistry University of Liverpool Crown Street Liverpool L69 7ZD UK
| | - Ellie Goulding
- Department of Chemistry University of Liverpool Crown Street Liverpool L69 7ZD UK
| | - James R. Johnson
- GeneMill, Institute of Integrative Biology University of Liverpool Crown Street Liverpool L69 7ZB UK
| | - Andrew J. Carnell
- Department of Chemistry University of Liverpool Crown Street Liverpool L69 7ZD UK
| |
Collapse
|
28
|
Bengtsson RJ, Simpkin AJ, Pulford CV, Low R, Rasko DA, Rigden DJ, Hall N, Barry EM, Tennant SM, Baker KS. Pathogenomic analyses of Shigella isolates inform factors limiting shigellosis prevention and control across LMICs. Nat Microbiol 2022; 7:251-261. [PMID: 35102306 PMCID: PMC8813619 DOI: 10.1038/s41564-021-01054-z] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Accepted: 12/17/2021] [Indexed: 12/17/2022]
Abstract
AbstractShigella spp. are the leading bacterial cause of severe childhood diarrhoea in low- and middle-income countries (LMICs), are increasingly antimicrobial resistant and have no widely available licenced vaccine. We performed genomic analyses of 1,246 systematically collected shigellae sampled from seven countries in sub-Saharan Africa and South Asia as part of the Global Enteric Multicenter Study (GEMS) between 2007 and 2011, to inform control and identify factors that could limit the effectiveness of current approaches. Through contemporaneous comparison among major subgroups, we found that S. sonnei contributes ≥6-fold more disease than other Shigella species relative to its genomic diversity, and highlight existing diversity and adaptative capacity among S. flexneri that may generate vaccine escape variants in <6 months. Furthermore, we show convergent evolution of resistance against ciprofloxacin, the current WHO-recommended antimicrobial for the treatment of shigellosis, among Shigella isolates. This demonstrates the urgent need to integrate existing genomic diversity into vaccine and treatment plans for Shigella, providing a framework for the focused application of comparative genomics to guide vaccine development, and the optimization of control and prevention strategies for other pathogens relevant to public health policy considerations.
Collapse
|
29
|
Abstract
The 2022 Nucleic Acids Research Database Issue contains 185 papers, including 87 papers reporting on new databases and 85 updates from resources previously published in the Issue. Thirteen additional manuscripts provide updates on databases most recently published elsewhere. Seven new databases focus specifically on COVID-19 and SARS-CoV-2, including SCoV2-MD, the first of the Issue's Breakthrough Articles. Major nucleic acid databases reporting updates include MODOMICS, JASPAR and miRTarBase. The AlphaFold Protein Structure Database, described in the second Breakthrough Article, is the stand-out in the protein section, where the Human Proteoform Atlas and GproteinDb are other notable new arrivals. Updates from DisProt, FuzDB and ELM comprehensively cover disordered proteins. Under the metabolism and signalling section Reactome, ConsensusPathDB, HMDB and CAZy are major returning resources. In microbial and viral genomes taxonomy and systematics are well covered by LPSN, TYGS and GTDB. Genomics resources include Ensembl, Ensembl Genomes and UCSC Genome Browser. Major returning pharmacology resource names include the IUPHAR/BPS guide and the Therapeutic Target Database. New plant databases include PlantGSAD for gene lists and qPTMplants for post-translational modifications. The entire Database Issue is freely available online on the Nucleic Acids Research website (https://academic.oup.com/nar). Our latest update to the NAR online Molecular Biology Database Collection brings the total number of entries to 1645. Following last year's major cleanup, we have updated 317 entries, listing 89 new resources and trimming 80 discontinued URLs. The current release is available at http://www.oxfordjournals.org/nar/database/c/.
Collapse
Affiliation(s)
- Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK
| | | |
Collapse
|
30
|
Ma J, Song B, Wei Z, Huang D, Zhang Y, Su J, de Magalhães JP, Rigden DJ, Meng J, Chen K. m5C-Atlas: a comprehensive database for decoding and annotating the 5-methylcytosine (m5C) epitranscriptome. Nucleic Acids Res 2022; 50:D196-D203. [PMID: 34986603 PMCID: PMC8728298 DOI: 10.1093/nar/gkab1075] [Citation(s) in RCA: 49] [Impact Index Per Article: 24.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 10/11/2021] [Accepted: 10/22/2021] [Indexed: 01/19/2023] Open
Abstract
5-Methylcytosine (m5C) is one of the most prevalent covalent modifications on RNA. It is known to regulate a broad variety of RNA functions, including nuclear export, RNA stability and translation. Here, we present m5C-Atlas, a database for comprehensive collection and annotation of RNA 5-methylcytosine. The database contains 166 540 m5C sites in 13 species identified from 5 base-resolution epitranscriptome profiling technologies. Moreover, condition-specific methylation levels are quantified from 351 RNA bisulfite sequencing samples gathered from 22 different studies via an integrative pipeline. The database also presents several novel features, such as the evolutionary conservation of a m5C locus, its association with SNPs, and any relevance to RNA secondary structure. All m5C-atlas data are accessible through a user-friendly interface, in which the m5C epitranscriptomes can be freely explored, shared, and annotated with putative post-transcriptional mechanisms (e.g. RBP intermolecular interaction with RNA, microRNA interaction and splicing sites). Together, these resources offer unprecedented opportunities for exploring m5C epitranscriptomes. The m5C-Atlas database is freely accessible at https://www.xjtlu.edu.cn/biologicalsciences/m5c-atlas.
Collapse
Affiliation(s)
- Jiongming Ma
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350004, China.,Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Bowen Song
- Department of Mathematical Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China.,Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L69 7ZB, Liverpool, UK
| | - Zhen Wei
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China.,Institute of Ageing & Chronic Disease, University of Liverpool, L69 7ZB, Liverpool, UK
| | - Daiyun Huang
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China.,Department of Computer Science, University of Liverpool, L69 7ZB, Liverpool, UK
| | - Yuxin Zhang
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Jionglong Su
- School of AI and Advanced Computing, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | | | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L69 7ZB, Liverpool, UK
| | - Jia Meng
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China.,AI University Research Centre, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China.,Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L69 7ZB, Liverpool, UK
| | - Kunqi Chen
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350004, China
| |
Collapse
|
31
|
Chojnowski G, Simpkin AJ, Leonardo DA, Seifert-Davila W, Vivas-Ruiz DE, Keegan RM, Rigden DJ. findMySequence: a neural-network-based approach for identification of unknown proteins in X-ray crystallography and cryo-EM. IUCrJ 2022; 9:86-97. [PMID: 35059213 PMCID: PMC8733886 DOI: 10.1107/s2052252521011088] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 10/22/2021] [Indexed: 05/15/2023]
Abstract
Although experimental protein-structure determination usually targets known proteins, chains of unknown sequence are often encountered. They can be purified from natural sources, appear as an unexpected fragment of a well characterized protein or appear as a contaminant. Regardless of the source of the problem, the unknown protein always requires characterization. Here, an automated pipeline is presented for the identification of protein sequences from cryo-EM reconstructions and crystallographic data. The method's application to characterize the crystal structure of an unknown protein purified from a snake venom is presented. It is also shown that the approach can be successfully applied to the identification of protein sequences and validation of sequence assignments in cryo-EM protein structures.
Collapse
Affiliation(s)
- Grzegorz Chojnowski
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
- Correspondence e-mail:
| | - Adam J. Simpkin
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Diego A. Leonardo
- São Carlos Institute of Physics, University of São Paulo, Avenida João Dagnone 1100, São Carlos, SP 13563-120, Brazil
| | | | - Dan E. Vivas-Ruiz
- Laboratorio de Biología Molecular, Facultad de Ciencias Biológicas, Universidad Nacional Mayor de San Marcos, Avenida Venezuela Cdra 34 S/N, Ciudad Universitaria, Lima, Peru
| | - Ronan M. Keegan
- Rutherford Appleton Laboratory, Research Complex at Harwell, UKRI-STFC, Didcot OX11 0FA, United Kingdom
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| |
Collapse
|
32
|
Guo J, Keegan RM, Rigden DJ, Erskine PT, Wood SP, Li S, Cooper JB. The X-ray structure of juvenile hormone diol kinase from the silkworm Bombyx mori. Acta Crystallogr F Struct Biol Commun 2021; 77:465-472. [PMID: 34866602 PMCID: PMC8647211 DOI: 10.1107/s2053230x21012012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Accepted: 11/10/2021] [Indexed: 11/11/2022] Open
Abstract
Insect juvenile hormones (JHs) are a family of sesquiterpenoid molecules that are secreted into the haemolymph. JHs have multiple roles in insect development, metamorphosis and sexual maturation. A number of pesticides work by chemically mimicking JHs, thus preventing insects from developing and reproducing normally. The haemolymph levels of JH are governed by the rates of its biosynthesis and degradation. One enzyme involved in JH catabolism is JH diol kinase (JHDK), which uses ATP (or GTP) to phosphorylate JH diol to JH diol phosphate, which can be excreted. The X-ray structure of JHDK from the silkworm Bombyx mori has been determined at a resolution of 2.0 Å with an R factor of 19.0% and an Rfree of 24.8%. The structure possesses three EF-hand motifs which are occupied by calcium ions. This is in contrast to the recently reported structure of the JHDK-like-2 protein from B. mori (PDB entry 6kth), which possessed only one calcium ion. Since JHDK is known to be inhibited by calcium ions, it is likely that our structure represents the calcium-inhibited form of the enzyme. The electrostatic surface of the protein suggests a binding site for the triphosphate of ATP close to the N-terminal end of the molecule in a cavity between the N- and C-terminal domains. Superposition with a number of calcium-activated photoproteins suggests that there may be parallels between the binding of JH diol to JHDK and the binding of luciferin to aequorin.
Collapse
Affiliation(s)
- Jingxu Guo
- Division of Medicine, UCL, Gower Street, London WC1E 6BT, United Kingdom
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge CB2 0QQ, United Kingdom
| | - Ronan M. Keegan
- Scientific Computing Department, Science and Technologies Facilities Council, UK Research and Innovation, Didcot, United Kingdom
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Biosciences Building, Crown Street, Liverpool L69 7BE, United Kingdom
| | - Peter T. Erskine
- Division of Medicine, UCL, Gower Street, London WC1E 6BT, United Kingdom
- Department of Biological Sciences, Birkbeck, University of London, Malet Street, Bloomsbury, London WC1E 7HX, United Kingdom
| | - Steve P. Wood
- Division of Medicine, UCL, Gower Street, London WC1E 6BT, United Kingdom
- Institute of Biomedical and Biomolecular Science, School of Biological Sciences, University of Portsmouth, King Henry Building, Portsmouth PO1 2DY, United Kingdom
| | - Sheng Li
- Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, 320 Yue Yang Road, Shanghai 200031, People’s Republic of China
| | - Jonathan B. Cooper
- Division of Medicine, UCL, Gower Street, London WC1E 6BT, United Kingdom
- Department of Biological Sciences, Birkbeck, University of London, Malet Street, Bloomsbury, London WC1E 7HX, United Kingdom
| |
Collapse
|
33
|
Moroz OV, Blagova E, Lebedev AA, Sánchez Rodríguez F, Rigden DJ, Tams JW, Wilting R, Vester JK, Longhin E, Hansen GH, Krogh KBRM, Pache RA, Davies GJ, Wilson KS. Multitasking in the gut: the X-ray structure of the multidomain BbgIII from Bifidobacterium bifidum offers possible explanations for its alternative functions. Acta Crystallogr D Struct Biol 2021; 77:1564-1578. [PMID: 34866612 DOI: 10.1107/s2059798321010949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Accepted: 10/20/2021] [Indexed: 11/10/2022] Open
Abstract
β-Galactosidases catalyse the hydrolysis of lactose into galactose and glucose; as an alternative reaction, some β-galactosidases also catalyse the formation of galactooligosaccharides by transglycosylation. Both reactions have industrial importance: lactose hydrolysis is used to produce lactose-free milk, while galactooligosaccharides have been shown to act as prebiotics. For some multi-domain β-galactosidases, the hydrolysis/transglycosylation ratio can be modified by the truncation of carbohydrate-binding modules. Here, an analysis of BbgIII, a multidomain β-galactosidase from Bifidobacterium bifidum, is presented. The X-ray structure has been determined of an intact protein corresponding to a gene construct of eight domains. The use of evolutionary covariance-based predictions made sequence docking in low-resolution areas of the model spectacularly easy, confirming the relevance of this rapidly developing deep-learning-based technique for model building. The structure revealed two alternative orientations of the CBM32 carbohydrate-binding module relative to the GH2 catalytic domain in the six crystallographically independent chains. In one orientation the CBM32 domain covers the entrance to the active site of the enzyme, while in the other orientation the active site is open, suggesting a possible mechanism for switching between the two activities of the enzyme, namely lactose hydrolysis and transgalactosylation. The location of the carbohydrate-binding site of the CBM32 domain on the opposite site of the module to where it comes into contact with the catalytic GH2 domain is consistent with its involvement in adherence to host cells. The role of the CBM32 domain in switching between hydrolysis and transglycosylation modes offers protein-engineering opportunities for selective β-galactosidase modification for industrial purposes in the future.
Collapse
Affiliation(s)
- Olga V Moroz
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Elena Blagova
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Andrey A Lebedev
- CCP4, STFC Rutherford Appleton Laboratory, Harwell Oxford, Didcot OX11 0QX, United Kingdom
| | - Filomeno Sánchez Rodríguez
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | | | | | | | - Elena Longhin
- Novozymes A/S, Biologiens Vej 2, 2800 Kgs. Lyngby, Denmark
| | | | | | - Roland A Pache
- Novozymes A/S, Biologiens Vej 2, 2800 Kgs. Lyngby, Denmark
| | - Gideon J Davies
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Keith S Wilson
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| |
Collapse
|
34
|
Simpkin AJ, Rodríguez FS, Mesdaghi S, Kryshtafovych A, Rigden DJ. Evaluation of model refinement in CASP14. Proteins 2021; 89:1852-1869. [PMID: 34288138 PMCID: PMC8616799 DOI: 10.1002/prot.26185] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 06/19/2021] [Accepted: 07/11/2021] [Indexed: 12/15/2022]
Abstract
We report here an assessment of the model refinement category of the 14th round of Critical Assessment of Structure Prediction (CASP14). As before, predictors submitted up to five ranked refinements, along with associated residue-level error estimates, for targets that had a wide range of starting quality. The ability of groups to accurately rank their submissions and to predict coordinate error varied widely. Overall, only four groups out-performed a "naïve predictor" corresponding to the resubmission of the starting model. Among the top groups, there are interesting differences of approach and in the spread of improvements seen: some methods are more conservative, others more adventurous. Some targets were "double-barreled" for which predictors were offered a high-quality AlphaFold 2 (AF2)-derived prediction alongside another of lower quality. The AF2-derived models were largely unimprovable, many of their apparent errors being found to reside at domain and, especially, crystal lattice contacts. Refinement is shown to have a mixed impact overall on structure-based function annotation methods to predict nucleic acid binding, spot catalytic sites, and dock protein structures.
Collapse
Affiliation(s)
- Adam J. Simpkin
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| | - Filomeno Sánchez Rodríguez
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
- Life Science, Diamond Light Source, Harwell Science and Innovation Campus, Didcot, Oxfordshire OX11 0DE, England
| | - Shahram Mesdaghi
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| | | | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| |
Collapse
|
35
|
Owen SV, Wenner N, Dulberger CL, Rodwell EV, Bowers-Barnard A, Quinones-Olvera N, Rigden DJ, Rubin EJ, Garner EC, Baym M, Hinton JCD. Prophages encode phage-defense systems with cognate self-immunity. Cell Host Microbe 2021; 29:1620-1633.e8. [PMID: 34597593 PMCID: PMC8585504 DOI: 10.1016/j.chom.2021.09.002] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 02/23/2021] [Accepted: 09/03/2021] [Indexed: 12/18/2022]
Abstract
Temperate phages are pervasive in bacterial genomes, existing as vertically inherited islands termed prophages. Prophages are vulnerable to predation of their host bacterium by exogenous phages. Here, we identify BstA, a family of prophage-encoded phage-defense proteins in diverse Gram-negative bacteria. BstA localizes to sites of exogenous phage DNA replication and mediates abortive infection, suppressing the competing phage epidemic. During lytic replication, the BstA-encoding prophage is not itself inhibited by BstA due to self-immunity conferred by the anti-BstA (aba) element, a short stretch of DNA within the bstA locus. Inhibition of phage replication by distinct BstA proteins from Salmonella, Klebsiella, and Escherichia prophages is generally interchangeable, but each possesses a cognate aba element. The specificity of the aba element ensures that immunity is exclusive to the replicating prophage, preventing exploitation by variant BstA-encoding phages. The BstA protein allows prophages to defend host cells against exogenous phage attack without sacrificing the ability to replicate lytically. BstA is an abortive infection protein found in prophages of Gram-negative bacteria aba, a short DNA sequence within the bstA locus, acts as a self-immunity element aba gives BstA-encoding prophages immunity to BstA-driven abortive infection Variant BstA proteins have distinct and cognate aba elements
Collapse
Affiliation(s)
- Siân V Owen
- Department of Biomedical Informatics and Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA.
| | - Nicolas Wenner
- Institute of Infection, Veterinary & Ecological Sciences, University of Liverpool, Liverpool, UK; Biozentrum, University of Basel, Basel, Switzerland
| | - Charles L Dulberger
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Molecular and Cellular Biology, Harvard University, Boston, MA, USA
| | - Ella V Rodwell
- Institute of Infection, Veterinary & Ecological Sciences, University of Liverpool, Liverpool, UK
| | - Arthur Bowers-Barnard
- Institute of Infection, Veterinary & Ecological Sciences, University of Liverpool, Liverpool, UK
| | - Natalia Quinones-Olvera
- Department of Biomedical Informatics and Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Eric J Rubin
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Ethan C Garner
- Department of Molecular and Cellular Biology, Harvard University, Boston, MA, USA
| | - Michael Baym
- Department of Biomedical Informatics and Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA.
| | - Jay C D Hinton
- Institute of Infection, Veterinary & Ecological Sciences, University of Liverpool, Liverpool, UK.
| |
Collapse
|
36
|
Simpkin AJ, Winn MD, Rigden DJ, Keegan RM. Redeployment of automated MrBUMP search-model identification for map fitting in cryo-EM. Acta Crystallogr D Struct Biol 2021; 77:1378-1385. [PMID: 34726166 PMCID: PMC8561737 DOI: 10.1107/s2059798321009165] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 09/03/2021] [Indexed: 11/22/2022] Open
Abstract
In crystallography, the phase problem can often be addressed by the careful preparation of molecular-replacement search models. This has led to the development of pipelines such as MrBUMP that can automatically identify homologous proteins from an input sequence and edit them to focus on the areas that are most conserved. Many of these approaches can be applied directly to cryo-EM to help discover, prepare and correctly place models (here called cryo-EM search models) into electrostatic potential maps. This can significantly reduce the amount of manual model building that is required for structure determination. Here, MrBUMP is repurposed to fit automatically obtained PDB-derived chains and domains into cryo-EM maps. MrBUMP was successfully able to identify and place cryo-EM search models across a range of resolutions. Methods such as map segmentation are also explored as potential routes to improved performance. Map segmentation was also found to improve the effectiveness of the pipeline for higher resolution (<8 Å) data sets.
Collapse
Affiliation(s)
- Adam J. Simpkin
- Institute of Structural, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Martyn D. Winn
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Daniel J. Rigden
- Institute of Structural, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Ronan M. Keegan
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| |
Collapse
|
37
|
Burgos-Mármol JJ, Keegan RM, Krissinel E, Rigden DJ. PISACov: expanding jsPISA with evolutionary covariance data to better determine protein quaternary state from a crystal structure. Acta Crystallogr A Found Adv 2021. [DOI: 10.1107/s0108767321090036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
|
38
|
Sanchez Rodriguez F, Keegan RM, Vollmar M, Evans G, Rigden DJ. Applications of residue contact predictions in structural biology. Acta Crystallogr A Found Adv 2021. [DOI: 10.1107/s010876732109629x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
39
|
Simpkin AJ, Thomas JMH, Keegan RM, Rigden DJ. Exploiting new generation ab initio and homology models from databases for MR. Acta Crystallogr A Found Adv 2021. [DOI: 10.1107/s0108767321090085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
40
|
Millán C, Keegan RM, Pereira J, Sammito MD, Simpkin AJ, McCoy AJ, Lupas AN, Hartmann MD, Rigden DJ, Read RJ. Assessing the utility of CASP14 models for molecular replacement. Proteins 2021; 89:1752-1769. [PMID: 34387010 PMCID: PMC8881082 DOI: 10.1002/prot.26214] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Revised: 07/20/2021] [Accepted: 07/27/2021] [Indexed: 11/21/2022]
Abstract
The assessment of CASP models for utility in molecular replacement is a measure of their use in a valuable real‐world application. In CASP7, the metric for molecular replacement assessment involved full likelihood‐based molecular replacement searches; however, this restricted the assessable targets to crystal structures with only one copy of the target in the asymmetric unit, and to those where the search found the correct pose. In CASP10, full molecular replacement searches were replaced by likelihood‐based rigid‐body refinement of models superimposed on the target using the LGA algorithm, with the metric being the refined log‐likelihood‐gain (LLG) score. This enabled multi‐copy targets and very poor models to be evaluated, but a significant further issue remained: the requirement of diffraction data for assessment. We introduce here the relative‐expected‐LLG (reLLG), which is independent of diffraction data. This reLLG is also independent of any crystal form, and can be calculated regardless of the source of the target, be it X‐ray, NMR or cryo‐EM. We calibrate the reLLG against the LLG for targets in CASP14, showing that it is a robust measure of both model and group ranking. Like the LLG, the reLLG shows that accurate coordinate error estimates add substantial value to predicted models. We find that refinement by CASP groups can often convert an inadequate initial model into a successful MR search model. Consistent with findings from others, we show that the AlphaFold2 models are sufficiently good, and reliably so, to surpass other current model generation strategies for attempting molecular replacement phasing.
Collapse
Affiliation(s)
- Claudia Millán
- Department of Haematology, University of Cambridge, Cambridge Institute for Medical Research, Cambridge, United Kingdom
| | - Ronan M Keegan
- Scientific Computing Dept., Science and Technologies Facilities Council, UK Research and Innovation, Didcot, Oxfordshire, United Kingdom
| | - Joana Pereira
- Max Planck Institute for Developmental Biology, Max-Planck-Ring 5, Tübingen, Germany
| | - Massimo D Sammito
- Department of Haematology, University of Cambridge, Cambridge Institute for Medical Research, Cambridge, United Kingdom
| | - Adam J Simpkin
- Institute of Systems, Molecular and Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7BE, United Kingdom
| | - Airlie J McCoy
- Department of Haematology, University of Cambridge, Cambridge Institute for Medical Research, Cambridge, United Kingdom
| | - Andrei N Lupas
- Max Planck Institute for Developmental Biology, Max-Planck-Ring 5, Tübingen, Germany
| | - Marcus D Hartmann
- Max Planck Institute for Developmental Biology, Max-Planck-Ring 5, Tübingen, Germany
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7BE, United Kingdom
| | - Randy J Read
- Department of Haematology, University of Cambridge, Cambridge Institute for Medical Research, Cambridge, United Kingdom
| |
Collapse
|
41
|
Pereira J, Simpkin AJ, Hartmann MD, Rigden DJ, Keegan RM, Lupas AN. High-accuracy protein structure prediction in CASP14. Proteins 2021; 89:1687-1699. [PMID: 34218458 DOI: 10.1002/prot.26171] [Citation(s) in RCA: 161] [Impact Index Per Article: 53.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 06/16/2021] [Accepted: 06/23/2021] [Indexed: 12/25/2022]
Abstract
The application of state-of-the-art deep-learning approaches to the protein modeling problem has expanded the "high-accuracy" category in CASP14 to encompass all targets. Building on the metrics used for high-accuracy assessment in previous CASPs, we evaluated the performance of all groups that submitted models for at least 10 targets across all difficulty classes, and judged the usefulness of those produced by AlphaFold2 (AF2) as molecular replacement search models with AMPLE. Driven by the qualitative diversity of the targets submitted to CASP, we also introduce DipDiff as a new measure for the improvement in backbone geometry provided by a model versus available templates. Although a large leap in high-accuracy is seen due to AF2, the second-best method in CASP14 out-performed the best in CASP13, illustrating the role of community-based benchmarking in the development and evolution of the protein structure prediction field.
Collapse
Affiliation(s)
- Joana Pereira
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Adam J Simpkin
- Department of Biochemistry and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Marcus D Hartmann
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Daniel J Rigden
- Department of Biochemistry and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Ronan M Keegan
- Department of Scientific Computing, Science and Technologies Facilities Council, UK Research and Innovation, Didcot, Oxfordshire, UK
| | - Andrei N Lupas
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| |
Collapse
|
42
|
Song Z, Huang D, Song B, Chen K, Song Y, Liu G, Su J, Magalhães JPD, Rigden DJ, Meng J. Attention-based multi-label neural networks for integrated prediction and interpretation of twelve widely occurring RNA modifications. Nat Commun 2021; 12:4011. [PMID: 34188054 PMCID: PMC8242015 DOI: 10.1038/s41467-021-24313-3] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2020] [Accepted: 06/07/2021] [Indexed: 02/08/2023] Open
Abstract
Recent studies suggest that epi-transcriptome regulation via post-transcriptional RNA modifications is vital for all RNA types. Precise identification of RNA modification sites is essential for understanding the functions and regulatory mechanisms of RNAs. Here, we present MultiRM, a method for the integrated prediction and interpretation of post-transcriptional RNA modifications from RNA sequences. Built upon an attention-based multi-label deep learning framework, MultiRM not only simultaneously predicts the putative sites of twelve widely occurring transcriptome modifications (m6A, m1A, m5C, m5U, m6Am, m7G, Ψ, I, Am, Cm, Gm, and Um), but also returns the key sequence contents that contribute most to the positive predictions. Importantly, our model revealed a strong association among different types of RNA modifications from the perspective of their associated sequence contexts. Our work provides a solution for detecting multiple RNA modifications, enabling an integrated analysis of these RNA modifications, and gaining a better understanding of sequence-based RNA modification mechanisms. RNA modifications appear to play a role in determining RNA structure and function. Here, the authors develop a deep learning model that predicts the location of 12 RNA modifications using primary sequence, and show that several modifications are associated, which suggests dependencies between them.
Collapse
Affiliation(s)
- Zitao Song
- Department of Mathematical Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, PR China
| | - Daiyun Huang
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, PR China. .,Department of Computer Sciences, University of Liverpool, Liverpool, United Kingdom.
| | - Bowen Song
- Department of Mathematical Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, PR China.,Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom
| | - Kunqi Chen
- Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, PR China
| | - Yiyou Song
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, PR China
| | - Gang Liu
- Department of Mathematical Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, PR China
| | - Jionglong Su
- School of AI and Advanced Computing, XJTLU Entrepreneur College (Taicang), Xi'an Jiaotong-Liverpool University, Suzhou, PR China
| | | | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom
| | - Jia Meng
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, PR China. .,Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom. .,AI University Research Centre, Xi'an Jiaotong-Liverpool University, Suzhou, PR China.
| |
Collapse
|
43
|
Midha S, Rigden DJ, Siozios S, Hurst GDD, Jackson AP. Bodo saltans (Kinetoplastida) is dependent on a novel Paracaedibacter-like endosymbiont that possesses multiple putative toxin-antitoxin systems. ISME J 2021; 15:1680-1694. [PMID: 33452479 PMCID: PMC8163844 DOI: 10.1038/s41396-020-00879-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 12/03/2020] [Accepted: 12/09/2020] [Indexed: 12/30/2022]
Abstract
Bacterial endosymbiosis has been instrumental in eukaryotic evolution, and includes both mutualistic, dependent and parasitic associations. Here we characterize an intracellular bacterium inhabiting the flagellated protist Bodo saltans (Kinetoplastida). We present a complete bacterial genome comprising a 1.39 Mb circular chromosome with 40.6% GC content. Fluorescent in situ hybridisation confirms that the endosymbiont is located adjacent to the nuclear membrane, and a detailed model of its intracellular niche is generated using serial block-face scanning electron microscopy. Phylogenomic analysis shows that the endosymbiont belongs to the Holosporales, most closely related to other α-proteobacterial endosymbionts of ciliates and amoebae. Comparative genomics indicates that it has a limited metabolism and is nutritionally host-dependent. However, the endosymbiont genome does encode diverse symbiont-specific secretory proteins, including a type VI secretion system and three separate toxin-antitoxin systems. We show that these systems are actively transcribed and hypothesize they represent a mechanism by which B. saltans becomes addicted to its endosymbiont. Consistent with this idea, attempts to cure Bodo of endosymbionts led to rapid and uniform cell death. This study adds kinetoplastid flagellates to ciliates and amoebae as hosts of Paracaedibacter-like bacteria, suggesting that these antagonistic endosymbioses became established very early in Eukaryotic evolution.
Collapse
Affiliation(s)
- Samriti Midha
- Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Ic2 Liverpool Science Park, 146 Brownlow Hill, Liverpool, L3 5RF, UK.
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Crown St., Liverpool, L69 7ZB, UK
| | - Stefanos Siozios
- Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Ic2 Liverpool Science Park, 146 Brownlow Hill, Liverpool, L3 5RF, UK
| | - Gregory D D Hurst
- Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Ic2 Liverpool Science Park, 146 Brownlow Hill, Liverpool, L3 5RF, UK
| | - Andrew P Jackson
- Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Ic2 Liverpool Science Park, 146 Brownlow Hill, Liverpool, L3 5RF, UK
| |
Collapse
|
44
|
Song B, Chen K, Tang Y, Wei Z, Su J, de Magalhães JP, Rigden DJ, Meng J. ConsRM: collection and large-scale prediction of the evolutionarily conserved RNA methylation sites, with implications for the functional epitranscriptome. Brief Bioinform 2021; 22:6276017. [PMID: 33993206 DOI: 10.1093/bib/bbab088] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Revised: 02/04/2021] [Accepted: 02/24/2021] [Indexed: 12/15/2022] Open
Abstract
Motivation N6-methyladenosine (m6A) is the most prevalent RNA modification on mRNAs and lncRNAs. Evidence increasingly demonstrates its crucial importance in essential molecular mechanisms and various diseases. With recent advances in sequencing techniques, tens of thousands of m6A sites are identified in a typical high-throughput experiment, posing a key challenge to distinguish the functional m6A sites from the remaining 'passenger' (or 'silent') sites. Results: We performed a comparative conservation analysis of the human and mouse m6A epitranscriptomes at single site resolution. A novel scoring framework, ConsRM, was devised to quantitatively measure the degree of conservation of individual m6A sites. ConsRM integrates multiple information sources and a positive-unlabeled learning framework, which integrated genomic and sequence features to trace subtle hints of epitranscriptome layer conservation. With a series validation experiments in mouse, fly and zebrafish, we showed that ConsRM outperformed well-adopted conservation scores (phastCons and phyloP) in distinguishing the conserved and unconserved m6A sites. Additionally, the m6A sites with a higher ConsRM score are more likely to be functionally important. An online database was developed containing the conservation metrics of 177 998 distinct human m6A sites to support conservation analysis and functional prioritization of individual m6A sites. And it is freely accessible at: https://www.xjtlu.edu.cn/biologicalsciences/con.
Collapse
Affiliation(s)
- Bowen Song
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX, Liverpool, United Kingdom
| | - Kunqi Chen
- Key Laboratory of Ministry of Education of Gastrointestinal Cancer, School of Basic Medical Science, Fujian Medical University, Fuzhou, China
| | - Yujiao Tang
- Xi'an Jiaotong-Liverpool University, L7 8TX, Liverpool, United Kingdom
| | - Zhen Wei
- Department of Biological Science, Xi'an Jiaotong-Liverpool University, L7 8TX, Liverpool, United Kingdom
| | - Jionglong Su
- Department of Mathematical Sciences, Xi'an Jiaotong-Liverpool University, L7 8TX, Liverpool, United Kingdom
| | - João Pedro de Magalhães
- Institute of Ageing & Chronic Disease, University of Liverpool, L7 8TX, Liverpool, United Kingdom
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX, Liverpool, United Kingdom
| | - Jia Meng
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, L7 8TX, Liverpool, United Kingdom
| |
Collapse
|
45
|
Edogbanya J, Tejada-Martinez D, Jones NJ, Jaiswal A, Bell S, Cordeiro R, van Dam S, Rigden DJ, de Magalhães JP. Evolution, structure and emerging roles of C1ORF112 in DNA replication, DNA damage responses, and cancer. Cell Mol Life Sci 2021; 78:4365-4376. [PMID: 33625522 PMCID: PMC8164572 DOI: 10.1007/s00018-021-03789-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Revised: 01/28/2021] [Accepted: 02/09/2021] [Indexed: 02/08/2023]
Abstract
The C1ORF112 gene initially drew attention when it was found to be strongly co-expressed with several genes previously associated with cancer and implicated in DNA repair and cell cycle regulation, such as RAD51 and the BRCA genes. The molecular functions of C1ORF112 remain poorly understood, yet several studies have uncovered clues as to its potential functions. Here, we review the current knowledge on C1ORF112 biology, its evolutionary history, possible functions, and its potential relevance to cancer. C1ORF112 is conserved throughout eukaryotes, from plants to humans, and is very highly conserved in primates. Protein models suggest that C1ORF112 is an alpha-helical protein. Interestingly, homozygous knockout mice are not viable, suggesting an essential role for C1ORF112 in mammalian development. Gene expression data show that, among human tissues, C1ORF112 is highly expressed in the testes and overexpressed in various cancers when compared to healthy tissues. C1ORF112 has also been shown to have altered levels of expression in some tumours with mutant TP53. Recent screens associate C1ORF112 with DNA replication and reveal possible links to DNA damage repair pathways, including the Fanconi anaemia pathway and homologous recombination. These insights provide important avenues for future research in our efforts to understand the functions and potential disease relevance of C1ORF112.
Collapse
Affiliation(s)
- Jacob Edogbanya
- Integrative Genomics of Ageing Group, Institute of Ageing and Chronic Disease, University of Liverpool, Liverpool, L7 8TX, UK
| | - Daniela Tejada-Martinez
- Integrative Genomics of Ageing Group, Institute of Ageing and Chronic Disease, University of Liverpool, Liverpool, L7 8TX, UK
- Programa de Doctorado en Ciencias mención Ecología Y Evolución, Facultad de Ciencias, Instituto de Ciencias Ambientales Y Evolutivas, Universidad Austral de Chile, Valdivia, 5090000, Chile
- Department of Biochemistry and Molecular Biology, Thomas Jefferson University, Philadelphia, PA, 19107, USA
| | - Nigel J Jones
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Amit Jaiswal
- Institute of Aging Research, School of Medicine, Hangzhou Normal University, Hangzhou, China
- Faculty of Biological Sciences, Friedrich Schiller University, Jena, Germany
| | - Sarah Bell
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Rui Cordeiro
- Integrative Genomics of Ageing Group, Institute of Ageing and Chronic Disease, University of Liverpool, Liverpool, L7 8TX, UK
| | - Sipko van Dam
- Department of Endocrinology, University of Groningen, University Medical Center Groningen, Hanzeplein 1, 9713 GZ, Groningen, The Netherlands
- Ancora Health, Herestraat 106, 9711 LM, Groningen, The Netherlands
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - João Pedro de Magalhães
- Integrative Genomics of Ageing Group, Institute of Ageing and Chronic Disease, University of Liverpool, Liverpool, L7 8TX, UK.
| |
Collapse
|
46
|
Mesdaghi S, Murphy DL, Sánchez Rodríguez F, Burgos-Mármol JJ, Rigden DJ. In silico prediction of structure and function for a large family of transmembrane proteins that includes human Tmem41b. F1000Res 2021; 9:1395. [PMID: 33520197 PMCID: PMC7818093 DOI: 10.12688/f1000research.27676.2] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/11/2021] [Indexed: 01/07/2023] Open
Abstract
Background: Recent strides in computational structural biology have opened up an opportunity to understand previously uncharacterised proteins. The under-representation of transmembrane proteins in the Protein Data Bank highlights the need to apply new and advanced bioinformatics methods to shed light on their structure and function. This study focuses on a family of transmembrane proteins containing the Pfam domain PF09335 ('SNARE_ASSOC'/ 'VTT '/'Tvp38'/'DedA'). One prominent member, Tmem41b, has been shown to be involved in early stages of autophagosome formation and is vital in mouse embryonic development as well as being identified as a viral host factor of SARS-CoV-2. Methods: We used evolutionary covariance-derived information to construct and validate ab initio models, make domain boundary predictions and infer local structural features. Results: The results from the structural bioinformatics analysis of Tmem41b and its homologues showed that they contain a tandem repeat that is clearly visible in evolutionary covariance data but much less so by sequence analysis. Furthermore, cross-referencing of other prediction data with covariance analysis showed that the internal repeat features two-fold rotational symmetry. Ab initio modelling of Tmem41b and homologues reinforces these structural predictions. Local structural features predicted to be present in Tmem41b were also present in Cl -/H + antiporters. Conclusions: The results of this study strongly point to Tmem41b and its homologues being transporters for an as-yet uncharacterised substrate and possibly using H + antiporter activity as its mechanism for transport.
Collapse
Affiliation(s)
- Shahram Mesdaghi
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - David L. Murphy
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Filomeno Sánchez Rodríguez
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - J. Javier Burgos-Mármol
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK,
| |
Collapse
|
47
|
Mesdaghi S, Murphy DL, Sánchez Rodríguez F, Burgos-Mármol JJ, Rigden DJ. In silico prediction of structure and function for a large family of transmembrane proteins that includes human Tmem41b. F1000Res 2021; 9:1395. [PMID: 33520197 PMCID: PMC7818093 DOI: 10.12688/f1000research.27676.1] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 11/23/2020] [Indexed: 01/07/2023] Open
Abstract
Background: Recent strides in computational structural biology have opened up an opportunity to understand previously uncharacterised proteins. The under-representation of transmembrane proteins in the Protein Data Bank highlights the need to apply new and advanced bioinformatics methods to shed light on their structure and function. This study focuses on a family of transmembrane proteins containing the Pfam domain PF09335 ('SNARE_ASSOC'/ 'VTT '/'Tvp38'). One prominent member, Tmem41b, has been shown to be involved in early stages of autophagosome formation and is vital in mouse embryonic development as well as being identified as a viral host factor of SARS-CoV-2. Methods: We used evolutionary covariance-derived information to construct and validate ab initio models, make domain boundary predictions and infer local structural features. Results: The results from the structural bioinformatics analysis of Tmem41b and its homologues showed that they contain a tandem repeat that is clearly visible in evolutionary covariance data but much less so by sequence analysis. Furthermore, cross-referencing of other prediction data with covariance analysis showed that the internal repeat features two-fold rotational symmetry. Ab initio modelling of Tmem41b and homologues reinforces these structural predictions. Local structural features predicted to be present in Tmem41b were also present in Cl -/H + antiporters. Conclusions: The results of this study strongly point to Tmem41b and its homologues being transporters for an as-yet uncharacterised substrate and possibly using H + antiporter activity as its mechanism for transport.
Collapse
Affiliation(s)
- Shahram Mesdaghi
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - David L. Murphy
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Filomeno Sánchez Rodríguez
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - J. Javier Burgos-Mármol
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK,
| |
Collapse
|
48
|
Rodríguez FS, Mesdaghi S, Simpkin AJ, Burgos-Mármol JJ, Murphy DL, Uski V, Keegan RM, Rigden DJ. ConPlot: Web-based application for the visualisation of protein contact maps integrated with other data. Bioinformatics 2021; 37:2763-2765. [PMID: 34499718 PMCID: PMC8428603 DOI: 10.1093/bioinformatics/btab049] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2020] [Revised: 12/18/2020] [Accepted: 01/21/2021] [Indexed: 12/15/2022] Open
Abstract
Summary Covariance-based predictions of residue contacts and inter-residue distances are an increasingly popular data type in protein bioinformatics. Here we present ConPlot, a web-based application for convenient display and analysis of contact maps and distograms. Integration of predicted contact data with other predictions is often required to facilitate inference of structural features. ConPlot can therefore use the empty space near the contact map diagonal to display multiple coloured tracks representing other sequence-based predictions. Popular file formats are natively read and bespoke data can also be flexibly displayed. This novel visualization will enable easier interpretation of predicted contact maps. Availability and implementation available online at www.conplot.org, along with documentation and examples. Alternatively, ConPlot can be installed and used locally using the docker image from the project’s Docker Hub repository. ConPlot is licensed under the BSD 3-Clause. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Filomeno Sánchez Rodríguez
- Institute of Structural, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England.,Life Science, Diamond Light Source, Harwell Science and Innovation Campus, Oxfordshire OX11 0DE, Didcot, England
| | - Shahram Mesdaghi
- Institute of Structural, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| | - Adam J Simpkin
- Institute of Structural, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| | - J Javier Burgos-Mármol
- Institute of Structural, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| | - David L Murphy
- Institute of Structural, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| | - Ville Uski
- UKRI-STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, England
| | - Ronan M Keegan
- UKRI-STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, England
| | - Daniel J Rigden
- Institute of Structural, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| |
Collapse
|
49
|
Chen K, Song B, Tang Y, Wei Z, Xu Q, Su J, de Magalhães JP, Rigden DJ, Meng J. RMDisease: a database of genetic variants that affect RNA modifications, with implications for epitranscriptome pathogenesis. Nucleic Acids Res 2021; 49:D1396-D1404. [PMID: 33010174 PMCID: PMC7778951 DOI: 10.1093/nar/gkaa790] [Citation(s) in RCA: 59] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Revised: 09/08/2020] [Accepted: 09/11/2020] [Indexed: 12/11/2022] Open
Abstract
Deciphering the biological impacts of millions of single nucleotide variants remains a major challenge. Recent studies suggest that RNA modifications play versatile roles in essential biological mechanisms, and are closely related to the progression of various diseases including multiple cancers. To comprehensively unveil the association between disease-associated variants and their epitranscriptome disturbance, we built RMDisease, a database of genetic variants that can affect RNA modifications. By integrating the prediction results of 18 different RNA modification prediction tools and also 303,426 experimentally-validated RNA modification sites, RMDisease identified a total of 202,307 human SNPs that may affect (add or remove) sites of eight types of RNA modifications (m6A, m5C, m1A, m5U, Ψ, m6Am, m7G and Nm). These include 4,289 disease-associated variants that may imply disease pathogenesis functioning at the epitranscriptome layer. These SNPs were further annotated with essential information such as post-transcriptional regulations (sites for miRNA binding, interaction with RNA-binding proteins and alternative splicing) revealing putative regulatory circuits. A convenient graphical user interface was constructed to support the query, exploration and download of the relevant information. RMDisease should make a useful resource for studying the epitranscriptome impact of genetic variants via multiple RNA modifications with emphasis on their potential disease relevance. RMDisease is freely accessible at: www.xjtlu.edu.cn/biologicalsciences/rmd.
Collapse
Affiliation(s)
- Kunqi Chen
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China.,Institute of Ageing & Chronic Disease, University of Liverpool, L7 8TX Liverpool, UK
| | - Bowen Song
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK.,Department of Mathematical Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Yujiao Tang
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China.,Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Zhen Wei
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China.,Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Qingru Xu
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Jionglong Su
- Department of Mathematical Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | | | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Jia Meng
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China.,Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK.,AI University Research Centre, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| |
Collapse
|
50
|
Tang Y, Chen K, Song B, Ma J, Wu X, Xu Q, Wei Z, Su J, Liu G, Rong R, Lu Z, de Magalhães J, Rigden DJ, Meng J. m6A-Atlas: a comprehensive knowledgebase for unraveling the N6-methyladenosine (m6A) epitranscriptome. Nucleic Acids Res 2021; 49:D134-D143. [PMID: 32821938 PMCID: PMC7779050 DOI: 10.1093/nar/gkaa692] [Citation(s) in RCA: 168] [Impact Index Per Article: 56.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2020] [Revised: 08/05/2020] [Accepted: 08/09/2020] [Indexed: 12/25/2022] Open
Abstract
N 6-Methyladenosine (m6A) is the most prevalent RNA modification on mRNAs and lncRNAs. It plays a pivotal role during various biological processes and disease pathogenesis. We present here a comprehensive knowledgebase, m6A-Atlas, for unraveling the m6A epitranscriptome. Compared to existing databases, m6A-Atlas features a high-confidence collection of 442 162 reliable m6A sites identified from seven base-resolution technologies and the quantitative (rather than binary) epitranscriptome profiles estimated from 1363 high-throughput sequencing samples. It also offers novel features, such as; the conservation of m6A sites among seven vertebrate species (including human, mouse and chimp), the m6A epitranscriptomes of 10 virus species (including HIV, KSHV and DENV), the putative biological functions of individual m6A sites predicted from epitranscriptome data, and the potential pathogenesis of m6A sites inferred from disease-associated genetic mutations that can directly destroy m6A directing sequence motifs. A user-friendly graphical user interface was constructed to support the query, visualization and sharing of the m6A epitranscriptomes annotated with sites specifying their interaction with post-transcriptional machinery (RBP-binding, microRNA interaction and splicing sites) and interactively display the landscape of multiple RNA modifications. These resources provide fresh opportunities for unraveling the m6A epitranscriptomes. m6A-Atlas is freely accessible at: www.xjtlu.edu.cn/biologicalsciences/atlas.
Collapse
Affiliation(s)
- Yujiao Tang
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Kunqi Chen
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Ageing & Chronic Disease, University of Liverpool, L7 8TX Liverpool, UK
| | - Bowen Song
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
- Department of Mathematical Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Jiongming Ma
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Xiangyu Wu
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Ageing & Chronic Disease, University of Liverpool, L7 8TX Liverpool, UK
| | - Qingru Xu
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Zhen Wei
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Jionglong Su
- Department of Mathematical Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Gang Liu
- Department of Mathematical Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Rong Rong
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Zhiliang Lu
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | | | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Jia Meng
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
- AI University Research Centre, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| |
Collapse
|