1
|
Schmok JC, Jain M, Street LA, Tankka AT, Schafer D, Her HL, Elmsaouri S, Gosztyla ML, Boyle EA, Jagannatha P, Luo EC, Kwon EJ, Jovanovic M, Yeo GW. Author Correction: Large-scale evaluation of the ability of RNA-binding proteins to activate exon inclusion. Nat Biotechnol 2024:10.1038/s41587-024-02178-3. [PMID: 38418649 DOI: 10.1038/s41587-024-02178-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2024]
Affiliation(s)
- Jonathan C Schmok
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
- Department of Bioengineering, University of California San Diego, La Jolla, CA, USA
| | - Manya Jain
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Lena A Street
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Alex T Tankka
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Danielle Schafer
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Hsuan-Lin Her
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Sara Elmsaouri
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Maya L Gosztyla
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Evan A Boyle
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Pratibha Jagannatha
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - En-Ching Luo
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Ester J Kwon
- Department of Bioengineering, University of California San Diego, La Jolla, CA, USA
| | - Marko Jovanovic
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Gene W Yeo
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA.
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA.
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
2
|
Lewis BM, Cho CY, Her HL, Mizrahi O, Hunter T, Yeo GW. LARP4 is an RNA-binding protein that binds nuclear-encoded mitochondrial mRNAs to promote mitochondrial function. RNA 2024; 30:223-239. [PMID: 38164626 PMCID: PMC10870378 DOI: 10.1261/rna.079799.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 11/25/2023] [Indexed: 01/03/2024]
Abstract
Mitochondria-associated RNA-binding proteins (RBPs) have emerged as key contributors to mitochondrial biogenesis and homeostasis. With few examples known, we set out to identify RBPs that regulate nuclear-encoded mitochondrial mRNAs (NEMmRNAs). Our systematic analysis of RNA targets of 150 RBPs identified RBPs with a preference for binding NEMmRNAs, including LARP4, a La RBP family member. We show that LARP4's targets are particularly enriched in mRNAs that encode respiratory chain complex proteins (RCCPs) and mitochondrial ribosome proteins (MRPs) across multiple human cell lines. Through quantitative proteomics, we demonstrate that depletion of LARP4 leads to a significant reduction in RCCP and MRP protein levels. Furthermore, we show that LARP4 depletion reduces mitochondrial function, and that LARP4 re-expression rescues this phenotype. Our findings shed light on a novel function for LARP4 as an RBP that binds to and positively regulates NEMmRNAs to promote mitochondrial respiratory function.
Collapse
Affiliation(s)
- Benjamin M Lewis
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, California 92037, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, California 92037, USA
- Stem Cell Program, University of California San Diego, La Jolla, California 92037, USA
- Molecular and Cell Biology Laboratory, Salk Institute for Biological Studies, La Jolla, California 92037, USA
| | - Chae Yun Cho
- Molecular and Cell Biology Laboratory, Salk Institute for Biological Studies, La Jolla, California 92037, USA
| | - Hsuan-Lin Her
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, California 92037, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, California 92037, USA
- Stem Cell Program, University of California San Diego, La Jolla, California 92037, USA
- Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, California 92037, USA
| | - Orel Mizrahi
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, California 92037, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, California 92037, USA
- Stem Cell Program, University of California San Diego, La Jolla, California 92037, USA
| | - Tony Hunter
- Molecular and Cell Biology Laboratory, Salk Institute for Biological Studies, La Jolla, California 92037, USA
| | - Gene W Yeo
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, California 92037, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, California 92037, USA
- Stem Cell Program, University of California San Diego, La Jolla, California 92037, USA
| |
Collapse
|
3
|
Schmok JC, Jain M, Street LA, Tankka AT, Schafer D, Her HL, Elmsaouri S, Gosztyla ML, Boyle EA, Jagannatha P, Luo EC, Kwon EJ, Jovanovic M, Yeo GW. Large-scale evaluation of the ability of RNA-binding proteins to activate exon inclusion. Nat Biotechnol 2024:10.1038/s41587-023-02014-0. [PMID: 38168984 DOI: 10.1038/s41587-023-02014-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2023] [Accepted: 09/29/2023] [Indexed: 01/05/2024]
Abstract
RNA-binding proteins (RBPs) modulate alternative splicing outcomes to determine isoform expression and cellular survival. To identify RBPs that directly drive alternative exon inclusion, we developed tethered function luciferase-based splicing reporters that provide rapid, scalable and robust readouts of exon inclusion changes and used these to evaluate 718 human RBPs. We performed enhanced cross-linking immunoprecipitation, RNA sequencing and affinity purification-mass spectrometry to investigate a subset of candidates with no prior association with splicing. Integrative analysis of these assays indicates surprising roles for TRNAU1AP, SCAF8 and RTCA in the modulation of hundreds of endogenous splicing events. We also leveraged our tethering assays and top candidates to identify potent and compact exon inclusion activation domains for splicing modulation applications. Using these identified domains, we engineered programmable fusion proteins that outperform current artificial splicing factors at manipulating inclusion of reporter and endogenous exons. This tethering approach characterizes the ability of RBPs to induce exon inclusion and yields new molecular parts for programmable splicing control.
Collapse
Affiliation(s)
- Jonathan C Schmok
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
- Department of Bioengineering, University of California San Diego, La Jolla, CA, USA
| | - Manya Jain
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Lena A Street
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Alex T Tankka
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Danielle Schafer
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Hsuan-Lin Her
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Sara Elmsaouri
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Maya L Gosztyla
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Evan A Boyle
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Pratibha Jagannatha
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - En-Ching Luo
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Ester J Kwon
- Department of Bioengineering, University of California San Diego, La Jolla, CA, USA
| | - Marko Jovanovic
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Gene W Yeo
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA.
- Sanford Stem Cell Institute Innovation Center and Stem Cell Program, University of California San Diego, La Jolla, CA, USA.
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
4
|
Kathman SG, Koo SJ, Lindsey GL, Her HL, Blue SM, Li H, Jaensch S, Remsberg JR, Ahn K, Yeo GW, Ghosh B, Cravatt BF. Remodeling oncogenic transcriptomes by small molecules targeting NONO. Nat Chem Biol 2023; 19:825-836. [PMID: 36864190 PMCID: PMC10337234 DOI: 10.1038/s41589-023-01270-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 01/20/2023] [Indexed: 03/04/2023]
Abstract
Much of the human proteome is involved in mRNA homeostasis, but most RNA-binding proteins lack chemical probes. Here we identify electrophilic small molecules that rapidly and stereoselectively decrease the expression of transcripts encoding the androgen receptor and its splice variants in prostate cancer cells. We show by chemical proteomics that the compounds engage C145 of the RNA-binding protein NONO. Broader profiling revealed that covalent NONO ligands suppress an array of cancer-relevant genes and impair cancer cell proliferation. Surprisingly, these effects were not observed in cells genetically disrupted for NONO, which were instead resistant to NONO ligands. Reintroduction of wild-type NONO, but not a C145S mutant, restored ligand sensitivity in NONO-disrupted cells. The ligands promoted NONO accumulation in nuclear foci and stabilized NONO-RNA interactions, supporting a trapping mechanism that may prevent compensatory action of paralog proteins PSPC1 and SFPQ. These findings show that NONO can be co-opted by covalent small molecules to suppress protumorigenic transcriptional networks.
Collapse
Affiliation(s)
- Stefan G Kathman
- Department of Chemistry, The Scripps Research Institute, La Jolla, CA, USA.
| | - Seong Joo Koo
- Molecular and Cellular Pharmacology, Discovery Technologies and Molecular Pharmacology, Janssen Research and Development, Beerse, Belgium
| | - Garrett L Lindsey
- Department of Chemistry, The Scripps Research Institute, La Jolla, CA, USA
| | - Hsuan-Lin Her
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, USA
| | - Steven M Blue
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
| | - Haoxin Li
- Department of Chemistry, The Scripps Research Institute, La Jolla, CA, USA
| | - Steffen Jaensch
- High Dimensional and Computational Biology, Discovery Technologies and Molecular Pharmacology, Janssen Research and Development, Beerse, Belgium
| | - Jarrett R Remsberg
- Department of Chemistry, The Scripps Research Institute, La Jolla, CA, USA
| | - Kay Ahn
- Molecular and Cellular Pharmacology, Discovery Technologies and Molecular Pharmacology, Janssen Research and Development, Spring House, PA, USA.
| | - Gene W Yeo
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA.
| | - Brahma Ghosh
- Discovery Chemistry, Janssen Research and Development, Spring House, PA, USA.
| | - Benjamin F Cravatt
- Department of Chemistry, The Scripps Research Institute, La Jolla, CA, USA.
| |
Collapse
|
5
|
Boyle EA, Her HL, Mueller JR, Naritomi JT, Nguyen GG, Yeo GW. Skipper analysis of eCLIP datasets enables sensitive detection of constrained translation factor binding sites. Cell Genom 2023; 3:100317. [PMID: 37388912 PMCID: PMC10300551 DOI: 10.1016/j.xgen.2023.100317] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 02/17/2023] [Accepted: 04/06/2023] [Indexed: 07/01/2023]
Abstract
Technology for crosslinking and immunoprecipitation (CLIP) followed by sequencing (CLIP-seq) has identified the transcriptomic targets of hundreds of RNA-binding proteins in cells. To increase the power of existing and future CLIP-seq datasets, we introduce Skipper, an end-to-end workflow that converts unprocessed reads into annotated binding sites using an improved statistical framework. Compared with existing methods, Skipper on average calls 210%-320% more transcriptomic binding sites and sometimes >1,000% more sites, providing deeper insight into post-transcriptional gene regulation. Skipper also calls binding to annotated repetitive elements and identifies bound elements for 99% of enhanced CLIP experiments. We perform nine translation factor enhanced CLIPs and apply Skipper to learn determinants of translation factor occupancy, including transcript region, sequence, and subcellular localization. Furthermore, we observe depletion of genetic variation in occupied sites and nominate transcripts subject to selective constraint because of translation factor occupancy. Skipper offers fast, easy, customizable, and state-of-the-art analysis of CLIP-seq data.
Collapse
Affiliation(s)
- Evan A. Boyle
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California San Diego, La Jolla, CA 92093, USA
| | - Hsuan-Lin Her
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California San Diego, La Jolla, CA 92093, USA
| | - Jasmine R. Mueller
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California San Diego, La Jolla, CA 92093, USA
| | - Jack T. Naritomi
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California San Diego, La Jolla, CA 92093, USA
| | - Grady G. Nguyen
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California San Diego, La Jolla, CA 92093, USA
| | - Gene W. Yeo
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
6
|
Lazear MR, Remsberg JR, Jaeger MG, Rothamel K, Her HL, DeMeester KE, Njomen E, Hogg SJ, Rahman J, Whitby LR, Won SJ, Schafroth MA, Ogasawara D, Yokoyama M, Lindsey GL, Li H, Germain J, Barbas S, Vaughan J, Hanigan TW, Vartabedian VF, Reinhardt CJ, Dix MM, Koo SJ, Heo I, Teijaro JR, Simon GM, Ghosh B, Abdel-Wahab O, Ahn K, Saghatelian A, Melillo B, Schreiber SL, Yeo GW, Cravatt BF. Proteomic discovery of chemical probes that perturb protein complexes in human cells. Mol Cell 2023; 83:1725-1742.e12. [PMID: 37084731 PMCID: PMC10198961 DOI: 10.1016/j.molcel.2023.03.026] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Revised: 01/09/2023] [Accepted: 03/28/2023] [Indexed: 04/23/2023]
Abstract
Most human proteins lack chemical probes, and several large-scale and generalizable small-molecule binding assays have been introduced to address this problem. How compounds discovered in such "binding-first" assays affect protein function, nonetheless, often remains unclear. Here, we describe a "function-first" proteomic strategy that uses size exclusion chromatography (SEC) to assess the global impact of electrophilic compounds on protein complexes in human cells. Integrating the SEC data with cysteine-directed activity-based protein profiling identifies changes in protein-protein interactions that are caused by site-specific liganding events, including the stereoselective engagement of cysteines in PSME1 and SF3B1 that disrupt the PA28 proteasome regulatory complex and stabilize a dynamic state of the spliceosome, respectively. Our findings thus show how multidimensional proteomic analysis of focused libraries of electrophilic compounds can expedite the discovery of chemical probes with site-specific functional effects on protein complexes in human cells.
Collapse
Affiliation(s)
- Michael R Lazear
- Department of Chemistry, Scripps Research, La Jolla, CA 92037, USA
| | | | - Martin G Jaeger
- Department of Chemistry, Scripps Research, La Jolla, CA 92037, USA
| | - Katherine Rothamel
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Hsuan-Lin Her
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
| | | | - Evert Njomen
- Department of Chemistry, Scripps Research, La Jolla, CA 92037, USA
| | - Simon J Hogg
- Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY 10021, USA
| | - Jahan Rahman
- Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY 10021, USA
| | - Landon R Whitby
- Vividion Therapeutics, 5820 Nancy Ridge Drive, San Diego, CA 92121, USA
| | - Sang Joon Won
- Department of Chemistry, Scripps Research, La Jolla, CA 92037, USA
| | | | | | - Minoru Yokoyama
- Department of Chemistry, Scripps Research, La Jolla, CA 92037, USA
| | | | - Haoxin Li
- Department of Chemistry, Scripps Research, La Jolla, CA 92037, USA
| | - Jason Germain
- Department of Chemistry, Scripps Research, La Jolla, CA 92037, USA
| | - Sabrina Barbas
- Department of Chemistry, Scripps Research, La Jolla, CA 92037, USA
| | - Joan Vaughan
- Clayton Foundation Laboratories for Peptide Biology, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Thomas W Hanigan
- Department of Chemistry, Scripps Research, La Jolla, CA 92037, USA
| | - Vincent F Vartabedian
- Department of Immunology and Microbiology, Scripps Research, La Jolla, CA 92037, USA
| | | | - Melissa M Dix
- Department of Chemistry, Scripps Research, La Jolla, CA 92037, USA
| | - Seong Joo Koo
- Molecular and Cellular Pharmacology, Discovery Technologies and Molecular Pharmacology, Janssen Research and Development, Turnhoutseweg 30, 2340 Beerse, Belgium
| | - Inha Heo
- Molecular and Cellular Pharmacology, Discovery Technologies and Molecular Pharmacology, Janssen Research and Development, Turnhoutseweg 30, 2340 Beerse, Belgium
| | - John R Teijaro
- Department of Immunology and Microbiology, Scripps Research, La Jolla, CA 92037, USA
| | - Gabriel M Simon
- Vividion Therapeutics, 5820 Nancy Ridge Drive, San Diego, CA 92121, USA
| | - Brahma Ghosh
- Discovery Chemistry, Janssen Research & Development, Spring House, PA 19477, USA
| | - Omar Abdel-Wahab
- Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY 10021, USA
| | - Kay Ahn
- Molecular and Cellular Pharmacology, Discovery Technologies and Molecular Pharmacology, Janssen Research and Development, Spring House, PA 19477, USA
| | - Alan Saghatelian
- Clayton Foundation Laboratories for Peptide Biology, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Bruno Melillo
- Department of Chemistry, Scripps Research, La Jolla, CA 92037, USA; Chemical Biology and Therapeutics Science Program, Broad Institute, Cambridge, MA 02142, USA
| | - Stuart L Schreiber
- Chemical Biology and Therapeutics Science Program, Broad Institute, Cambridge, MA 02142, USA; Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138, USA
| | - Gene W Yeo
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
| | | |
Collapse
|
7
|
Lorenz DA, Her HL, Shen KA, Rothamel K, Hutt KR, Nojadera AC, Bruns SC, Manakov SA, Yee BA, Chapman KB, Yeo GW. Multiplexed transcriptome discovery of RNA-binding protein binding sites by antibody-barcode eCLIP. Nat Methods 2023; 20:65-69. [PMID: 36550273 PMCID: PMC9834051 DOI: 10.1038/s41592-022-01708-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Accepted: 10/28/2022] [Indexed: 12/24/2022]
Abstract
Ultraviolet crosslinking and immunoprecipitation (CLIP) methodologies enable the identification of RNA binding sites of RNA-binding proteins (RBPs). Despite improvements in the library preparation of RNA fragments, the enhanced CLIP (eCLIP) protocol requires 4 days of hands-on time and lacks the ability to process several RBPs in parallel. We present a new method termed antibody-barcode eCLIP that utilizes DNA-barcoded antibodies and proximity ligation of the DNA oligonucleotides to RBP-protected RNA fragments to interrogate several RBPs simultaneously. We observe performance comparable with that of eCLIP with the advantage of dramatically increased scaling while maintaining the same material requirement of a single eCLIP experiment.
Collapse
Affiliation(s)
| | - Hsuan-Lin Her
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
- Stem Cell Program, University of California San Diego, La Jolla, CA, USA
| | | | - Katie Rothamel
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
- Stem Cell Program, University of California San Diego, La Jolla, CA, USA
| | | | | | | | | | - Brian A Yee
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
- Stem Cell Program, University of California San Diego, La Jolla, CA, USA
| | | | - Gene W Yeo
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA.
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA.
- Stem Cell Program, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
8
|
Her HL, Boyle E, Yeo GW. Metadensity: a background-aware python pipeline for summarizing CLIP signals on various transcriptomic sites. Bioinform Adv 2022; 2:vbac083. [PMID: 36388152 PMCID: PMC9653213 DOI: 10.1093/bioadv/vbac083] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 09/15/2022] [Accepted: 11/07/2022] [Indexed: 09/19/2023]
Abstract
Motivation Cross-linking and immunoprecipitation (CLIP) is a technology to map the binding sites of RNA-binding proteins (RBPs). The region where an RBP binds within RNA is often indicative of its molecular function in RNA processing. As an example, the binding sites of splicing factors are found within or proximal to alternatively spliced exons. To better reveal the function of RBPs, we developed a tool to visualize the distribution of CLIP signals around various transcript features. Results Here, we present Metadensity (https://github.com/YeoLab/Metadensity), a software that allows users to generate metagene plots. Metadensity allows users to input features such as branchpoints and preserves the near-nucleotide resolution of CLIP technologies by not scaling the features by length. Metadensity normalizes immunoprecipitated libraries with background controls, such as size-matched inputs, then windowing in various user-defined features. Finally, the signals are averaged across a provided set of transcripts. Availability and implementation Metadensity is available at https://github.com/YeoLab/Metadensity, with example notebooks at https://metadensity.readthedocs.io/en/latest/tutorial.html. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Hsuan-Lin Her
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Evan Boyle
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Gene W Yeo
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA
- Stem Cell Program, University of California San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
9
|
Xiang JS, Mueller JR, Luo EC, Yee BA, Schafer D, Schmok JC, Tan FE, Rothamel K, McVicar RN, Kwong EM, Croker BA, Jones KL, Her HL, Chen CY, Vu AQ, Jin W, Park SS, Le P, Brannan KW, Kofman ER, Li Y, Tankka AT, Dong KD, Song Y, Clark AE, Carlin AF, Van Nostrand EL, Leibel SL, Yeo GW. Discovery and functional interrogation of SARS-CoV-2 protein-RNA interactions. Res Sq 2022:rs.3.rs-1394331. [PMID: 35313591 PMCID: PMC8936114 DOI: 10.21203/rs.3.rs-1394331/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
The COVID-19 pandemic is caused by severe acute respiratory syndrome-coronavirus-2 (SARS-CoV-2). The betacoronvirus has a positive sense RNA genome which encodes for several RNA binding proteins. Here, we use enhanced crosslinking and immunoprecipitation to investigate SARS-CoV-2 protein interactions with viral and host RNAs in authentic virus-infected cells. SARS-CoV-2 proteins, NSP8, NSP12, and nucleocapsid display distinct preferences to specific regions in the RNA viral genome, providing evidence for their shared and separate roles in replication, transcription, and viral packaging. SARS-CoV-2 proteins expressed in human lung epithelial cells bind to 4773 unique host coding RNAs. Nine SARS-CoV-2 proteins upregulate target gene expression, including NSP12 and ORF9c, whose RNA substrates are associated with pathways in protein N-linked glycosylation ER processing and mitochondrial processes. Furthermore, siRNA knockdown of host genes targeted by viral proteins in human lung organoid cells identify potential antiviral host targets across different SARS-CoV-2 variants. Conversely, NSP9 inhibits host gene expression by blocking mRNA export and dampens cytokine productions, including interleukin-1α/β. Our viral protein-RNA interactome provides a catalog of potential therapeutic targets and offers insight into the etiology of COVID-19 as a safeguard against future pandemics.
Collapse
Affiliation(s)
- Joy S. Xiang
- Institute of Molecular and Cellular Biology, A*STAR, Singapore
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Jasmine R. Mueller
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - En-Ching Luo
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Brian A. Yee
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Danielle Schafer
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Jonathan C. Schmok
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Frederick E. Tan
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Katherine Rothamel
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Rachael N. McVicar
- Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037, USA
| | - Elizabeth M. Kwong
- Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037, USA
| | - Ben A. Croker
- Department of Pediatrics, University of California San Diego School of Medicine, La Jolla, CA 92037, USA
| | - Krysten L. Jones
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Hsuan-Lin Her
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Chun-Yuan Chen
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Anthony Q. Vu
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Wenhao Jin
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Samuel S. Park
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Phuong Le
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Kristopher W. Brannan
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Eric R. Kofman
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Yanhua Li
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Alexandra T. Tankka
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Kevin D. Dong
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Yan Song
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Alex E. Clark
- Department of Medicine, University of California, San Diego, La Jolla, CA 92037, USA
| | - Aaron F. Carlin
- Department of Medicine, University of California, San Diego, La Jolla, CA 92037, USA
| | - Eric L. Van Nostrand
- Verna & Marrs McLean Department of Biochemistry & Molecular Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Sandra L. Leibel
- Department of Pediatrics, University of California San Diego School of Medicine, La Jolla, CA 92037, USA
| | - Gene W. Yeo
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| |
Collapse
|
10
|
Xiang JS, Mueller JR, Luo EC, Yee BA, Schafer D, Schmok JC, Tan FE, Rothamel K, McVicar RN, Kwong EM, Jones KL, Her HL, Chen CY, Vu AQ, Jin W, Park SS, Le P, Brannan KW, Kofman ER, Li Y, Tankka AT, Dong KD, Song Y, Carlin AF, Van Nostrand EL, Leibel SL, Yeo GW. Discovery and functional interrogation of SARS-CoV-2 protein-RNA interactions. bioRxiv 2022:2022.02.21.481223. [PMID: 35233578 PMCID: PMC8887137 DOI: 10.1101/2022.02.21.481223] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The COVID-19 pandemic is caused by severe acute respiratory syndrome-coronavirus-2 (SARS-CoV-2). The betacoronvirus has a positive sense RNA genome which encodes for several RNA binding proteins. Here, we use enhanced crosslinking and immunoprecipitation to investigate SARS-CoV-2 protein interactions with viral and host RNAs in authentic virus-infected cells. SARS-CoV-2 proteins, NSP8, NSP12, and nucleocapsid display distinct preferences to specific regions in the RNA viral genome, providing evidence for their shared and separate roles in replication, transcription, and viral packaging. SARS-CoV-2 proteins expressed in human lung epithelial cells bind to 4773 unique host coding RNAs. Nine SARS-CoV-2 proteins upregulate target gene expression, including NSP12 and ORF9c, whose RNA substrates are associated with pathways in protein N-linked glycosylation ER processing and mitochondrial processes. Furthermore, siRNA knockdown of host genes targeted by viral proteins in human lung organoid cells identify potential antiviral host targets across different SARS-CoV-2 variants. Conversely, NSP9 inhibits host gene expression by blocking mRNA export and dampens cytokine productions, including interleukin-1α/β. Our viral protein-RNA interactome provides a catalog of potential therapeutic targets and offers insight into the etiology of COVID-19 as a safeguard against future pandemics.
Collapse
Affiliation(s)
- Joy S. Xiang
- Institute of Molecular and Cellular Biology, A*STAR, Singapore
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Jasmine R. Mueller
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - En-Ching Luo
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Brian A. Yee
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Danielle Schafer
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Jonathan C. Schmok
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Frederick E. Tan
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Katherine Rothamel
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Rachael N. McVicar
- Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037, USA
| | - Elizabeth M. Kwong
- Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037, USA
| | - Krysten L. Jones
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Hsuan-Lin Her
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Chun-Yuan Chen
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Anthony Q. Vu
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Wenhao Jin
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Samuel S. Park
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Phuong Le
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Kristopher W. Brannan
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Eric R. Kofman
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Yanhua Li
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Alexandra T. Tankka
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Kevin D. Dong
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Yan Song
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Aaron F. Carlin
- Department of Medicine, University of California, San Diego, La Jolla, CA 92037, USA
| | - Eric L. Van Nostrand
- Verna & Marrs McLean Department of Biochemistry & Molecular Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Sandra L. Leibel
- Department of Pediatrics, University of California San Diego School of Medicine, La Jolla, CA 92037, USA
| | - Gene W. Yeo
- Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, UCSD Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA
| |
Collapse
|
11
|
Her HL, Lin PT, Wu YW. PangenomeNet: a pan-genome-based network reveals functional modules on antimicrobial resistome for Escherichia coli strains. BMC Bioinformatics 2021; 22:548. [PMID: 34758735 PMCID: PMC8579557 DOI: 10.1186/s12859-021-04459-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 10/19/2021] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND Discerning genes crucial to antimicrobial resistance (AMR) mechanisms is becoming more and more important to accurately and swiftly identify AMR pathogenic strains. Pangenome-wide association studies (e.g. Scoary) identified numerous putative AMR genes. However, only a tiny proportion of the putative resistance genes are annotated by AMR databases or Gene Ontology. In addition, many putative resistance genes are of unknown function (termed hypothetical proteins). An annotation tool is crucially needed in order to reveal the functional organization of the resistome and expand our knowledge of the AMR gene repertoire. RESULTS We developed an approach (PangenomeNet) for building co-functional networks from pan-genomes to infer functions for hypothetical genes. Using Escherichia coli as an example, we demonstrated that it is possible to build co-functional network from its pan-genome using co-inheritance, domain-sharing, and protein-protein-interaction information. The investigation of the network revealed that it fits the characteristics of biological networks and can be used for functional inferences. The subgraph consisting of putative meropenem resistance genes consists of clusters of stress response genes and resistance gene acquisition pathways. Resistome subgraphs also demonstrate drug-specific AMR genes such as beta-lactamase, as well as functional roles shared among multiple classes of drugs, mostly in the stress-related pathways. CONCLUSIONS By demonstrating the idea of pan-genome-based co-functional network on the E. coli species, we showed that the network can infer functional roles of the genes, including those without functional annotations, and provides holistic views on the putative antimicrobial resistomes. We hope that the pan-genome network idea can help formulate hypothesis for targeted experimental works.
Collapse
Affiliation(s)
- Hsuan-Lin Her
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, 92093, USA
| | - Po-Ting Lin
- Department of Mechanical Engineering, National Taiwan University of Science and Technology, No.43, Keelung Rd., Sec.4, Da'an Dist., Taipei City, 10609, Taiwan.
- Center for Cyber-Physical System Innovation, National Taiwan University of Science and Technology, Taipei, 10609, Taiwan.
| | - Yu-Wei Wu
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, 250, Wuxing St., Sinyi District, Taipei, 11031, Taiwan.
- Clinical Big Data Research Center, Taipei Medical University Hospital, Taipei, 11031, Taiwan.
| |
Collapse
|
12
|
Abstract
Motivation Antimicrobial resistance (AMR) is becoming a huge problem in both developed and developing countries, and identifying strains resistant or susceptible to certain antibiotics is essential in fighting against antibiotic-resistant pathogens. Whole-genome sequences have been collected for different microbial strains in order to identify crucial characteristics that allow certain strains to become resistant to antibiotics; however, a global inspection of the gene content responsible for AMR activities remains to be done. Results We propose a pan-genome-based approach to characterize antibiotic-resistant microbial strains and test this approach on the bacterial model organism Escherichia coli. By identifying core and accessory gene clusters and predicting AMR genes for the E. coli pan-genome, we not only showed that certain classes of genes are unevenly distributed between the core and accessory parts of the pan-genome but also demonstrated that only a portion of the identified AMR genes belong to the accessory genome. Application of machine learning algorithms to predict whether specific strains were resistant to antibiotic drugs yielded the best prediction accuracy for the set of AMR genes within the accessory part of the pan-genome, suggesting that these gene clusters were most crucial to AMR activities in E. coli. Selecting subsets of AMR genes for different antibiotic drugs based on a genetic algorithm (GA) achieved better prediction performances than the gene sets established in the literature, hinting that the gene sets selected by the GA may warrant further analysis in investigating more details about how E. coli fight against antibiotics. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Hsuan-Lin Her
- School of Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
| | - Yu-Wei Wu
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
| |
Collapse
|