1
|
Elsing D, Luy B, Kozlowska M. Enantiomer Differentiation by Interaction-Specific Prediction of Residual Dipolar Couplings in Spherical-like Molecules. J Chem Theory Comput 2024. [PMID: 39099221 DOI: 10.1021/acs.jctc.4c00441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/06/2024]
Abstract
Residual Dipolar Couplings (RDCs) are averaged dipolar couplings between nuclear spins of atoms in a molecule that can be measured by nuclear magnetic resonance (NMR) spectroscopy upon partial alignment by a chiral alignment medium. The estimation of differences in alignment of enantiomers may, in principle, enable the determination of absolute configuration. Here, we use molecular dynamics (MD) simulations to mimic the alignment of chiral molecules (i.e., isopinocampheol, quinuclidin-3-ol, borneol, and camphor) to the chiral poly-γ-benzyl-L-glutamate (PBLG) polymer to predict RDCs in silico and compare calculated and experimentally measured residual dipolar couplings for the four enantiomeric pairs. The aim is to validate the computational scheme for the prediction of RDCs in chiral molecules and understand the interaction leading to the alignment in more detail. We determine the indispensable importance of hydrogen bonds between a chiral molecule and the alignment medium on the overall quality of the simulated alignment and interaction poses toward high agreement with experiments. A good correlation with experimental data is found for camphor and isopinocampheol, while the correlation for quinuclidin-3-ol and borneol is lower. We attribute this observation to the high difficulty of the RDC prediction for rather almost spherical molecules. The study reveals that the prediction of alignment with small enantiomeric differences is possible with an MD-based approach; however, extended simulation times (e.g., 50-100 μs) are required to sufficiently reduce the statistical uncertainty. This may be further used for the determination of the relative, as well as absolute, configuration of chiral molecules.
Collapse
Affiliation(s)
- David Elsing
- Institute of Nanotechnology, Karlsruhe Institute of Technology (KIT), Kaiserstr. 12, 76131 Karlsruhe, Germany
| | - Burkhard Luy
- Institute for Biological Interfaces 4, Karlsruhe Institute of Technology (KIT), Kaiserstr. 12, 76131 Karlsruhe, Germany
- Institute of Organic Chemistry, Karlsruhe Institute of Technology (KIT), Kaiserstr. 12, 76131 Karlsruhe, Germany
| | - Mariana Kozlowska
- Institute of Nanotechnology, Karlsruhe Institute of Technology (KIT), Kaiserstr. 12, 76131 Karlsruhe, Germany
| |
Collapse
|
2
|
Kim J, Varki R, Oliva M, Boucher C. Re 2 Pair: Increasing the Scalability of RePair by Decreasing Memory Usage. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.11.603142. [PMID: 39071359 PMCID: PMC11275962 DOI: 10.1101/2024.07.11.603142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
The RePair compression algorithm produces a context-free grammar by iteratively substituting the most frequently occurring pair of consecutive symbols with a new symbol until all consecutive pairs of symbols appear only once in the compressed text. It is widely used in the settings of bioinformatics, machine learning, and information retrieval where random access to the original input text is needed. For example, in pangenomics, RePair is used for random access to a population of genomes. BigRePair improves the scalability of the original RePair algorithm by using Prefix-Free Parsing (PFP) to preprocess the text prior to building the RePair grammar. Despite the efficiency of PFP on repetitive text, there is a scalability issue with the size of the parse which causes a memory bottleneck in BigRePair. In this paper, we design and implement recursive RePair (denoted as Re 2 Pair), which builds the RePair grammar using recursive PFP. Our novel algorithm faces the challenge of constructing the RePair grammar without direct access to the parse of text, relying solely on the dictionary of the text and the parse and dictionary of the parse of the text. We compare Re 2 Pair to BigRePair using SARS-CoV-2 haplotypes and haplotypes from the 1000 Genomes Project. We show that our method Re 2 Pair achieves over a 40% peak memory reduction and a speed up ranging between 12% to 79% compared to BigRePair when compressing the largest input texts in all experiments. Re 2 Pair is made publicly available under the GNU public license here: https://github.com/jkim210/Recursive-RePair. 2012 ACM Subject Classification Theory of computation → Formal languages and automata theory.
Collapse
|
3
|
Rosa F, Silva C, Rodrigues R, Esteves-Vieira M, Barbosa I, Rosa S, Dias D, Pina-Martins F. Island hitchhikers: pathogen agents of Madeira and Azores ticks. Parasitol Res 2024; 123:261. [PMID: 38967653 PMCID: PMC11226517 DOI: 10.1007/s00436-024-08278-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 06/24/2024] [Indexed: 07/06/2024]
Abstract
Ticks are blood-sucking arthropods that can transmit pathogens to their host. As insular ecosystems can enhance tick-host interactions, this study aimed to understand tick diversity, pathogen presence, and their respective associations in the Azores and Madeira archipelagos. Unfed or partially engorged ticks (n = 120) were collected from 58 cats and dogs in the Azores (n = 41 specimens) and Madeira (n = 79 specimens) from November 2018 to March 2019. Vector identification was based on morphology and molecular criteria. For pathogen sequencing, 18S gene fragment for Babesia/Hepatozoon and gltA for Rickettsia were performed. Sequence data was explored using BLAST and BLAST and phylogenetic inference tools. In the Azores, Ixodes hexagonus, I. ventalloi, and Rhipicephalus sanguineus (n = 6; 14.6%, n = 6; 14.6%, and n = 29; 70.7% respectively) were found and in Madeira I. ricinus and R. sanguineus (n = 78, 98.7%; and n = 1, 1.3%; respectively) were identified. Tick COI markers confirmed species highlighting confirmation of R. sanguineus s.s. and genotype A of I. ventalloi. In the Azores Islands, the detected Rickettsia massiliae was linked to R. sanguineus (dogs and cats) and I. hexagonus (dogs), and in Madeira Island, R. monacensis (dogs) and Hepatozoon silvestris (cats) were found associated with I. ricinus. Further, I. ventalloi presence in the Azores expands west its known range, and Hepatozoon silvestris in Madeira may suggest that I. ricinus could have a role as a potential vector. Finally, as R. massiliae and R. monacensis presence underlines public health risks, surveillance by health authorities is crucial as pathogen-tick interactions may drive disease spread, therefore monitoring remains pivotal for disease prevention.
Collapse
Affiliation(s)
- Fernanda Rosa
- Instituto Superior de Agronomia, Universidade de Lisboa, Tapada da Ajuda, 1349-017, Lisbon, Portugal
- Centro de Estudos Do Ambiente E Do Mar (CESAM), LA, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016, Lisbon, Portugal
| | - Carla Silva
- Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016, Lisbon, Portugal
| | - Ricardo Rodrigues
- Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016, Lisbon, Portugal
| | | | - Inês Barbosa
- MSD Animal Health Portugal, Quinta da Fonte, Ed Vasco da Gama,19, Paço de Arcos, Portugal
| | - Sara Rosa
- Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016, Lisbon, Portugal
| | - Deodália Dias
- Centro de Estudos Do Ambiente E Do Mar (CESAM), LA, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016, Lisbon, Portugal
- Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016, Lisbon, Portugal
| | - Francisco Pina-Martins
- Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016, Lisbon, Portugal.
- Computational Biology and Population Genomics Group, cE3c - Center for Ecology, Evolution and Environmental Changes & CHANGE - Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016, Lisbon, Portugal.
- Departamento de Engenharia Química E Biológica, Escola Superior de Tecnologia Do Barreiro, Instituto Politécnico de Setúbal, Rua Américo da Silva Marinho, 2839-001, Lavradio, Portugal.
| |
Collapse
|
4
|
Thomas A, Battenfeld T, Kraiselburd I, Anastasiou O, Dittmer U, Dörr AK, Dörr A, Elsner C, Gosch J, Le-Trilling VTK, Magin S, Scholtysik R, Yilmaz P, Trilling M, Schöler L, Köster J, Meyer F. UnCoVar: a reproducible and scalable workflow for transparent and robust virus variant calling and lineage assignment using SARS-CoV-2 as an example. BMC Genomics 2024; 25:647. [PMID: 38943066 PMCID: PMC11214259 DOI: 10.1186/s12864-024-10539-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Accepted: 06/18/2024] [Indexed: 07/01/2024] Open
Abstract
BACKGROUND At a global scale, the SARS-CoV-2 virus did not remain in its initial genotype for a long period of time, with the first global reports of variants of concern (VOCs) in late 2020. Subsequently, genome sequencing has become an indispensable tool for characterizing the ongoing pandemic, particularly for typing SARS-CoV-2 samples obtained from patients or environmental surveillance. For such SARS-CoV-2 typing, various in vitro and in silico workflows exist, yet to date, no systematic cross-platform validation has been reported. RESULTS In this work, we present the first comprehensive cross-platform evaluation and validation of in silico SARS-CoV-2 typing workflows. The evaluation relies on a dataset of 54 patient-derived samples sequenced with several different in vitro approaches on all relevant state-of-the-art sequencing platforms. Moreover, we present UnCoVar, a robust, production-grade reproducible SARS-CoV-2 typing workflow that outperforms all other tested approaches in terms of precision and recall. CONCLUSIONS In many ways, the SARS-CoV-2 pandemic has accelerated the development of techniques and analytical approaches. We believe that this can serve as a blueprint for dealing with future pandemics. Accordingly, UnCoVar is easily generalizable towards other viral pathogens and future pandemics. The fully automated workflow assembles virus genomes from patient samples, identifies existing lineages, and provides high-resolution insights into individual mutations. UnCoVar includes extensive quality control and automatically generates interactive visual reports. UnCoVar is implemented as a Snakemake workflow. The open-source code is available under a BSD 2-clause license at github.com/IKIM-Essen/uncovar.
Collapse
Affiliation(s)
- Alexander Thomas
- Data Science Research Group, Institute for Artificial Intelligence in Medicine (IKIM), University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Thomas Battenfeld
- Data Science Research Group, Institute for Artificial Intelligence in Medicine (IKIM), University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Ivana Kraiselburd
- Data Science Research Group, Institute for Artificial Intelligence in Medicine (IKIM), University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Olympia Anastasiou
- Institute for Virology, University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Ulf Dittmer
- Institute for Virology, University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Ann-Kathrin Dörr
- Data Science Research Group, Institute for Artificial Intelligence in Medicine (IKIM), University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Adrian Dörr
- Data Science Research Group, Institute for Artificial Intelligence in Medicine (IKIM), University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Carina Elsner
- Institute for Virology, University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Jule Gosch
- Data Science Research Group, Institute for Artificial Intelligence in Medicine (IKIM), University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Vu Thuy Khanh Le-Trilling
- Institute for Virology, University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
- Institute for the Research on HIV & AIDS-associated Diseases, University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Simon Magin
- Data Science Research Group, Institute for Artificial Intelligence in Medicine (IKIM), University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - René Scholtysik
- Institute for the Research on HIV & AIDS-associated Diseases, University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Pelin Yilmaz
- Data Science Research Group, Institute for Artificial Intelligence in Medicine (IKIM), University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Mirko Trilling
- Institute for Virology, University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
- Institute for the Research on HIV & AIDS-associated Diseases, University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Lara Schöler
- Institute for Virology, University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
- Institute for the Research on HIV & AIDS-associated Diseases, University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
- Institute of Cell Biology (Cancer Research), University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
| | - Johannes Köster
- Bioinformatics and Computational Oncology, Institute for Artificial Intelligence in Medicine (IKIM), University Hospital of Essen, University of Duisburg-Essen, Essen, Germany
- Division of Molecular and Cellular Oncology, Department of Medical Oncology, Harvard Medical School, Boston, MA, USA
| | - Folker Meyer
- Data Science Research Group, Institute for Artificial Intelligence in Medicine (IKIM), University Hospital of Essen, University of Duisburg-Essen, Essen, Germany.
| |
Collapse
|
5
|
Lucaci AG, Brew WE, Lamanna J, Selberg A, Carnevale V, Moore AR, Kosakovsky Pond SL. The evolution of mammalian Rem2: unraveling the impact of purifying selection and coevolution on protein function, and implications for human disorders. FRONTIERS IN BIOINFORMATICS 2024; 4:1381540. [PMID: 38978817 PMCID: PMC11228553 DOI: 10.3389/fbinf.2024.1381540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Accepted: 05/28/2024] [Indexed: 07/10/2024] Open
Abstract
Rad And Gem-Like GTP-Binding Protein 2 (Rem2), a member of the RGK family of Ras-like GTPases, is implicated in Huntington's disease and Long QT Syndrome and is highly expressed in the brain and endocrine cells. We examine the evolutionary history of Rem2 identified in various mammalian species, focusing on the role of purifying selection and coevolution in shaping its sequence and protein structural constraints. Our analysis of Rem2 sequences across 175 mammalian species found evidence for strong purifying selection in 70% of non-invariant codon sites which is characteristic of essential proteins that play critical roles in biological processes and is consistent with Rem2's role in the regulation of neuronal development and function. We inferred epistatic effects in 50 pairs of codon sites in Rem2, some of which are predicted to have deleterious effects on human health. Additionally, we reconstructed the ancestral evolutionary history of mammalian Rem2 using protein structure prediction of extinct and extant sequences which revealed the dynamics of how substitutions that change the gene sequence of Rem2 can impact protein structure in variable regions while maintaining core functional mechanisms. By understanding the selective pressures, protein- and gene - interactions that have shaped the sequence and structure of the Rem2 protein, we gain a stronger understanding of its biological and functional constraints.
Collapse
Affiliation(s)
- Alexander G Lucaci
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, United States
- Weill Cornell Medicine, The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, United States
| | - William E Brew
- Department of Biology, Temple University, Philadelphia, PA, United States
| | - Jason Lamanna
- Department of Biology, Temple University, Philadelphia, PA, United States
- Institute for Computational Molecular Science, Temple University, Philadelphia, PA, United States
| | - Avery Selberg
- Department of Biology, Temple University, Philadelphia, PA, United States
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, United States
| | - Vincenzo Carnevale
- Department of Biology, Temple University, Philadelphia, PA, United States
- Institute for Computational Molecular Science, Temple University, Philadelphia, PA, United States
| | - Anna R Moore
- Department of Biology, Temple University, Philadelphia, PA, United States
| | - Sergei L Kosakovsky Pond
- Department of Biology, Temple University, Philadelphia, PA, United States
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, United States
| |
Collapse
|
6
|
Gupta A, Mirarab S, Turakhia Y. Accurate, scalable, and fully automated inference of species trees from raw genome assemblies using ROADIES. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.27.596098. [PMID: 38854139 PMCID: PMC11160643 DOI: 10.1101/2024.05.27.596098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
Inference of species trees plays a crucial role in advancing our understanding of evolutionary relationships and has immense significance for diverse biological and medical applications. Extensive genome sequencing efforts are currently in progress across a broad spectrum of life forms, holding the potential to unravel the intricate branching patterns within the tree of life. However, estimating species trees starting from raw genome sequences is quite challenging, and the current cutting-edge methodologies require a series of error-prone steps that are neither entirely automated nor standardized. In this paper, we present ROADIES, a novel pipeline for species tree inference from raw genome assemblies that is fully automated, easy to use, scalable, free from reference bias, and provides flexibility to adjust the tradeoff between accuracy and runtime. The ROADIES pipeline eliminates the need to align whole genomes, choose a single reference species, or pre-select loci such as functional genes found using cumbersome annotation steps. Moreover, it leverages recent advances in phylogenetic inference to allow multi-copy genes, eliminating the need to detect orthology. Using the genomic datasets released from large-scale sequencing consortia across three diverse life forms (placental mammals, pomace flies, and birds), we show that ROADIES infers species trees that are comparable in quality with the state-of-the-art approaches but in a fraction of the time. By incorporating optimal approaches and automating all steps from assembled genomes to species and gene trees, ROADIES is poised to improve the accuracy, scalability, and reproducibility of phylogenomic analyses.
Collapse
Affiliation(s)
- Anshu Gupta
- Department of Computer Science and Engineering, University of California, San Diego; San Diego, CA 92093, USA
| | - Siavash Mirarab
- Department of Electrical and Computer Engineering, University of California, San Diego; San Diego, CA 92093, USA
| | - Yatish Turakhia
- Department of Electrical and Computer Engineering, University of California, San Diego; San Diego, CA 92093, USA
| |
Collapse
|
7
|
Rao J, Xin R, Macdonald C, Howard MK, Estevam GO, Yee SW, Wang M, Fraser JS, Coyote-Maestas W, Pimentel H. Rosace: a robust deep mutational scanning analysis framework employing position and mean-variance shrinkage. Genome Biol 2024; 25:138. [PMID: 38789982 PMCID: PMC11127319 DOI: 10.1186/s13059-024-03279-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 05/14/2024] [Indexed: 05/26/2024] Open
Abstract
Deep mutational scanning (DMS) measures the effects of thousands of genetic variants in a protein simultaneously. The small sample size renders classical statistical methods ineffective. For example, p-values cannot be correctly calibrated when treating variants independently. We propose Rosace, a Bayesian framework for analyzing growth-based DMS data. Rosace leverages amino acid position information to increase power and control the false discovery rate by sharing information across parameters via shrinkage. We also developed Rosette for simulating the distributional properties of DMS. We show that Rosace is robust to the violation of model assumptions and is more powerful than existing tools.
Collapse
Affiliation(s)
- Jingyou Rao
- Department of Computer Science, UCLA, Los Angeles, CA, USA
| | - Ruiqi Xin
- Computational and Systems Biology Interdepartmental Program, UCLA, Los Angeles, CA, USA
| | - Christian Macdonald
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, USA
| | - Matthew K Howard
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, USA
- Tetrad Graduate Program, UCSF, San Francisco, CA, USA
- Department of Pharmaceutical Chemistry, UCSF, San Francisco, CA, USA
| | - Gabriella O Estevam
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, USA
- Tetrad Graduate Program, UCSF, San Francisco, CA, USA
| | - Sook Wah Yee
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, USA
| | - Mingsen Wang
- Department of Mathematics, Baruch College, CUNY, New York, NY, USA
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, USA
- Quantitative Biosciences Institute, UCSF, San Francisco, CA, USA
| | - Willow Coyote-Maestas
- Department of Bioengineering and Therapeutic Sciences, UCSF, San Francisco, CA, USA.
- Quantitative Biosciences Institute, UCSF, San Francisco, CA, USA.
| | - Harold Pimentel
- Department of Computer Science, UCLA, Los Angeles, CA, USA.
- Department of Computational Medicine, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA.
- Department of Human Genetics, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA.
| |
Collapse
|
8
|
Spohr P, Scharf S, Rommerskirchen A, Henrich B, Jäger P, Klau GW, Haas R, Dilthey A, Pfeffer K. Insights into gut microbiomes in stem cell transplantation by comprehensive shotgun long-read sequencing. Sci Rep 2024; 14:4068. [PMID: 38374282 PMCID: PMC10876974 DOI: 10.1038/s41598-024-53506-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Accepted: 02/01/2024] [Indexed: 02/21/2024] Open
Abstract
The gut microbiome is a diverse ecosystem, dominated by bacteria; however, fungi, phages/viruses, archaea, and protozoa are also important members of the gut microbiota. Exploration of taxonomic compositions beyond bacteria as well as an understanding of the interaction between the bacteriome with the other members is limited using 16S rDNA sequencing. Here, we developed a pipeline enabling the simultaneous interrogation of the gut microbiome (bacteriome, mycobiome, archaeome, eukaryome, DNA virome) and of antibiotic resistance genes based on optimized long-read shotgun metagenomics protocols and custom bioinformatics. Using our pipeline we investigated the longitudinal composition of the gut microbiome in an exploratory clinical study in patients undergoing allogeneic hematopoietic stem cell transplantation (alloHSCT; n = 31). Pre-transplantation microbiomes exhibited a 3-cluster structure, characterized by Bacteroides spp. /Phocaeicola spp., mixed composition and Enterococcus abundances. We revealed substantial inter-individual and temporal variabilities of microbial domain compositions, human DNA, and antibiotic resistance genes during the course of alloHSCT. Interestingly, viruses and fungi accounted for substantial proportions of microbiome content in individual samples. In the course of HSCT, bacterial strains were stable or newly acquired. Our results demonstrate the disruptive potential of alloHSCTon the gut microbiome and pave the way for future comprehensive microbiome studies based on long-read metagenomics.
Collapse
Affiliation(s)
- Philipp Spohr
- Chair Algorithmic Bioinformatics, Faculty of Mathematics and Natural Sciences, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Center for Digital Medicine, Düsseldorf, Germany
| | - Sebastian Scharf
- Institute of Medical Microbiology and Hospital Hygiene, Heinrich Heine University Düsseldorf, University Hospital Düsseldorf, Düsseldorf, Germany
| | - Anna Rommerskirchen
- Institute of Medical Microbiology and Hospital Hygiene, Heinrich Heine University Düsseldorf, University Hospital Düsseldorf, Düsseldorf, Germany
| | - Birgit Henrich
- Institute of Medical Microbiology and Hospital Hygiene, Heinrich Heine University Düsseldorf, University Hospital Düsseldorf, Düsseldorf, Germany
| | - Paul Jäger
- Department of Hematology, Immunology, and Clinical Immunology, Heinrich Heine University Düsseldorf, University Hospital Düsseldorf, Düsseldorf, Germany
| | - Gunnar W Klau
- Chair Algorithmic Bioinformatics, Faculty of Mathematics and Natural Sciences, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.
- Center for Digital Medicine, Düsseldorf, Germany.
| | - Rainer Haas
- Department of Hematology, Immunology, and Clinical Immunology, Heinrich Heine University Düsseldorf, University Hospital Düsseldorf, Düsseldorf, Germany.
| | - Alexander Dilthey
- Institute of Medical Microbiology and Hospital Hygiene, Heinrich Heine University Düsseldorf, University Hospital Düsseldorf, Düsseldorf, Germany.
- Center for Digital Medicine, Düsseldorf, Germany.
| | - Klaus Pfeffer
- Institute of Medical Microbiology and Hospital Hygiene, Heinrich Heine University Düsseldorf, University Hospital Düsseldorf, Düsseldorf, Germany.
| |
Collapse
|
9
|
Koromina M, Ravi A, Panagiotaropoulou G, Schilder BM, Humphrey J, Braun A, Bidgeli T, Chatzinakos C, Coombes B, Kim J, Liu X, Terao C, O.’Connell KS, Adams M, Adolfsson R, Alda M, Alfredsson L, Andlauer TFM, Andreassen OA, Antoniou A, Baune BT, Bengesser S, Biernacka J, Boehnke M, Bosch R, Cairns M, Carr VJ, Casas M, Catts S, Cichon S, Corvin A, Craddock N, Dafnas K, Dalkner N, Dannlowski U, Degenhardt F, Di Florio A, Dikeos D, Fellendorf FT, Ferentinos P, Forstner AJ, Forty L, Frye M, Fullerton JM, Gawlik M, Gizer IR, Gordon-Smith K, Green MJ, Grigoroiu-Serbanescu M, Guzman-Parra J, Hahn T, Henskens F, Hillert J, Jablensky AV, Jones L, Jones I, Jonsson L, Kelsoe JR, Kircher T, Kirov G, Kittel-Schneider S, Kogevinas M, Landén M, Leboyer M, Lenger M, Lissowska J, Lochner C, Loughland C, MacIntyre D, Martin NG, Maratou E, Mathews CA, Mayoral F, McElroy SL, McGregor NW, McIntosh A, McQuillin A, Michie P, Milanova V, Mitchell PB, Moutsatsou P, Mowry B, Müller-Myhsok B, Myers R, Nenadić I, Nöthen MM, O’Donovan C, O’Donovan M, Ophoff RA, Owen MJ, Pantelis C, Pato C, Pato MT, Patrinos GP, Pawlak JM, Perlis RH, Porichi E, Posthuma D, Ramos-Quiroga JA, Reif A, Reininghaus EZ, Ribasés M, Rietschel M, Schall U, Schulze TG, Scott L, Scott RJ, Serretti A, Weickert CS, Smoller JW, Artigas MS, Stein DJ, Streit F, Toma C, Tooney P, Vieta E, Vincent JB, Waldman ID, Weickert T, Witt SH, Hong KS, Ikeda M, Iwata N, Świątkowska B, Won HH, Edenberg HJ, Ripke S, Raj T, Coleman JRI, Mullins N. Fine-mapping genomic loci refines bipolar disorder risk genes. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.02.12.24302716. [PMID: 38405768 PMCID: PMC10889003 DOI: 10.1101/2024.02.12.24302716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]
Abstract
Bipolar disorder (BD) is a heritable mental illness with complex etiology. While the largest published genome-wide association study identified 64 BD risk loci, the causal SNPs and genes within these loci remain unknown. We applied a suite of statistical and functional fine-mapping methods to these loci, and prioritized 22 likely causal SNPs for BD. We mapped these SNPs to genes, and investigated their likely functional consequences by integrating variant annotations, brain cell-type epigenomic annotations, brain quantitative trait loci, and results from rare variant exome sequencing in BD. Convergent lines of evidence supported the roles of SCN2A, TRANK1, DCLK3, INSYN2B, SYNE1, THSD7A, CACNA1B, TUBBP5, PLCB3, PRDX5, KCNK4, AP001453.3, TRPT1, FKBP2, DNAJC4, RASGRP1, FURIN, FES, YWHAE, DPH1, GSDMB, MED24, THRA, EEF1A2, and KCNQ2 in BD. These represent promising candidates for functional experiments to understand biological mechanisms and therapeutic potential. Additionally, we demonstrated that fine-mapping effect sizes can improve performance and transferability of BD polygenic risk scores across ancestrally diverse populations, and present a high-throughput fine-mapping pipeline (https://github.com/mkoromina/SAFFARI).
Collapse
Affiliation(s)
- Maria Koromina
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ashvin Ravi
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | - Brian M. Schilder
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Jack Humphrey
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Alice Braun
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin, Berlin, Germany
| | | | | | - Brandon Coombes
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Jaeyoung Kim
- Samsung Advanced Institute for Health Sciences and Technology (SAIHST), Sungkyunkwan University, Samsung Medical Center, Seoul, Republic of Korea
- Samsung Genome Institute, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
| | - Xiaoxi Liu
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Clinical Research Center, Shizuoka General Hospital, Shizuoka, Japan
| | - Chikashi Terao
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Clinical Research Center, Shizuoka General Hospital, Shizuoka, Japan
- The Department of Applied Genetics, The School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
| | - Kevin S. O.’Connell
- Division of Mental Health and Addiction, Oslo University Hospital, Oslo, Norway
- NORMENT, University of Oslo, Oslo, Norway
| | - Mark Adams
- Division of Psychiatry, Centre for Clinical Brain Sciences, The University of Edinburgh, Edinburgh, UK
| | - Rolf Adolfsson
- Department of Clinical Sciences, Psychiatry, Umeå, University Medical Faculty, Umeå, Sweden
| | - Martin Alda
- Department 20 of Psychiatry, Dalhousie University, Halifax, NS, Canada
- National Institute of Mental Health, Klecany, Czech Republic
| | - Lars Alfredsson
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Till F. M. Andlauer
- Department of Neurology, Klinikum rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Ole A. Andreassen
- Division of Mental Health and Addiction, Oslo University Hospital, Oslo, Norway
- NORMENT, University of Oslo, Oslo, Norway
| | - Anastasia Antoniou
- National Kapodistrian University of Athens, 2nd Department of Psychiatry, Attikon General Hospital, Athens, Greece
| | - Bernhard T. Baune
- Department of Psychiatry, University of Münster, Münster, Germany
- Department of Psychiatry, Melbourne Medical School, The University of Melbourne, Melbourne, VIC, Australia
- The Florey Institute of Neuroscience and Mental Health, The University of Melbourne, Parkville, VIC, Australia
| | - Susanne Bengesser
- Medical University of Graz, Division of Psychiatry and Psychotherapeutic Medicine, Graz, Austria
| | - Joanna Biernacka
- Department of Quantitative Health Sciences Research, Mayo Clinic, Rochester, MN, USA
- Department of Psychiatry and Psychology, Mayo Clinic, Rochester, MN, USA
| | - Michael Boehnke
- Center for Statistical Genetics and Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Rosa Bosch
- Instituto de Salud Carlos III, Biomedical Network Research Centre on Mental Health (CIBERSAM), Madrid, Spain
- Programa SJD MIND Escoles, Hospital Sant Joan de Déu, Institut de Recerca Sant Joan de Déu, Esplugues de Llobregat, Spain
| | | | - Vaughan J. Carr
- School of Clinical Medicine, Discipline of Psychiatry and Mental Health, University of New South Wales, Sydney, NSW, Australia
| | - Miquel Casas
- Instituto de Salud Carlos III, Biomedical Network Research Centre on Mental Health (CIBERSAM), Madrid, Spain
- Programa SJD MIND Escoles, Hospital Sant Joan de Déu, Institut de Recerca Sant Joan de Déu, Esplugues de Llobregat, Spain
| | | | - Sven Cichon
- Department of Biomedicine, University of Basel, Basel, Switzerland
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
- Institute of Medical Genetics and Pathology, University Hospital Basel, Basel, Switzerland
- Institute of Neuroscience and Medicine (INM-1), Research Centre Jülich, Jülich, Germany
| | - Aiden Corvin
- Neuropsychiatric Genetics Research Group, Dept of Psychiatry and Trinity Translational Medicine Institute, Trinity College Dublin, Dublin, Ireland
| | - Nicholas Craddock
- Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff, UK
| | - Konstantinos Dafnas
- National Kapodistrian University of Athens, 2nd Department of Psychiatry, Attikon General Hospital, Athens, Greece
| | - Nina Dalkner
- Medical University of Graz, Division of Psychiatry and Psychotherapeutic Medicine, Graz, Austria
| | - Udo Dannlowski
- Institute for Translatiol Psychiatry, University of Münster, Münster, Germany
| | - Franziska Degenhardt
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, University Hospital Essen, University of Duisburg-Essen, Duisburg, Germany
| | - Arianna Di Florio
- Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff, UK
- Department of Psychiatry, University of North Caroli at Chapel Hill, Chapel Hill, NC, USA
| | - Dimitris Dikeos
- National Kapodistrian University of Athens, 2nd Department of Psychiatry, Attikon General Hospital, Athens, Greece
| | | | - Panagiotis Ferentinos
- National Kapodistrian University of Athens, 2nd Department of Psychiatry, Attikon General Hospital, Athens, Greece
- Social, Genetic and Developmental Psychiatry Centre, King’s College London, London, UK
| | - Andreas J. Forstner
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
- Institute of Neuroscience and Medicine (INM-1), Research Centre Jülich, Jülich, Germany
- Centre for Human Genetics, University of Marburg, Marburg, Germany
| | - Liz Forty
- Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff, UK
| | - Mark Frye
- Department of Psychiatry and Psychology, Mayo Clinic, Rochester, MN, USA
| | - Janice M. Fullerton
- Neuroscience Research Australia, Sydney, NSW, Australia
- School of Biomedical Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Micha Gawlik
- Department of Psychiatry, Psychosomatics and Psychotherapy, Center of Mental Health, University Hospital Würzburg, Würzburg, Germany
| | - Ian R. Gizer
- Department of Psychological Sciences, University of Missouri, Columbia, MO, USA
| | | | - Melissa J. Green
- Neuroscience Research Australia, Sydney, NSW, Australia
- Discipline of Psychiatry and Mental Health, School of Clinical Medicine, Faculty of Medicine and Health, University of New South Wales, Sydney, NSW, Australia
| | - Maria Grigoroiu-Serbanescu
- Biometric Psychiatric Genetics Research Unit, Alexandru Obregia Clinical Psychiatric Hospital, Bucharest, Romania
| | - José Guzman-Parra
- Mental Health Department, University Regional Hospital, Biomedicine Institute (IBIMA), Málaga, Spain
| | - Tim Hahn
- Institute for Translatiol Psychiatry, University of Münster, Münster, Germany
| | | | - Jan Hillert
- Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | | | - Lisa Jones
- Psychological Medicine, University of Worcester, Worcester, UK
| | - Ian Jones
- Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff, UK
| | - Lina Jonsson
- Institute of Neuroscience and Physiology, University of Gothenburg, Gothenburg, Sweden
| | - John R. Kelsoe
- Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
| | - Tilo Kircher
- Department of Psychiatry and Psychotherapy, University of Marburg, Germany
| | - George Kirov
- Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff, UK
| | - Sarah Kittel-Schneider
- Department of Psychiatry, Psychosomatics and Psychotherapy, Center of Mental Health, University Hospital Würzburg, Würzburg, Germany
- Department of Psychiatry, Psychosomatic Medicine and Psychotherapy, University Hospital Frankfurt, Frankfurt am Main, Germany
- Department of Psychiatry and Neurobehavioural Science, University College Cork, Cork, Ireland
| | | | - Mikael Landén
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Marion Leboyer
- Université Paris Est Créteil, INSERM, IMRB, Translational Neuropsychiatry, Créteil, France
- Department of Psychiatry and Addiction Medicine, Assistance Publique - Hôpitaux de Paris, Paris, France
| | - Melanie Lenger
- Medical University of Graz, Division of Psychiatry and Psychotherapeutic Medicine, Graz, Austria
| | - Jolanta Lissowska
- Cancer Epidemiology and Prevention, M. Sklodowska-Curie National Research Institute of Oncology, Warsaw, Poland
| | - Christine Lochner
- SA MRC Unit on Risk and Resilience in Mental Disorders, Dept of Psychiatry, Stellenbosch University, Stellenbosch, South Africa
| | | | - Donald MacIntyre
- Division of Psychiatry, Centre for Clinical Brain Sciences, The University of Edinburgh, Edinburgh, UK
| | - Nicholas G. Martin
- Genetics and Computational Biology, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
- School of Psychology, The University of Queensland, Brisbane, QLD, Australia
| | - Eirini Maratou
- National and Kapodistrian University of Athens, Medical School, Clinical Biochemistry Laboratory, Attikon General Hospital, Athens, Greece
| | - Carol A. Mathews
- Department of Psychiatry and Genetics Institute, University of Florida, Gainesville, FL, USA
| | - Fermin Mayoral
- Mental Health Department, University Regional Hospital, Biomedicine Institute (IBIMA), Málaga, Spain
| | | | - Nathaniel W. McGregor
- Systems Genetics Working Group, Department of Genetics, Stellenbosch University, Stellenbosch, South Africa
| | - Andrew McIntosh
- Division of Psychiatry, Centre for Clinical Brain Sciences, The University of Edinburgh, Edinburgh, UK
| | | | | | - Vihra Milanova
- Psychiatric Clinic, Alexander University Hospital, Bulgaria
| | - Philip B. Mitchell
- Discipline of Psychiatry and Mental Health, School of Clinical Medicine, Faculty of Medicine and Health, University of New South Wales, Sydney, NSW, Australia
| | - Paraskevi Moutsatsou
- National Kapodistrian University of Athens, Medical School, Clinical Biochemistry Laboratory, Attikon General Hospital, Athens, Greece
| | - Bryan Mowry
- University of Queensland, Brisbane, QLD, Australia
| | - Bertram Müller-Myhsok
- Department of Translational Research in Psychiatry, Max Planck Institute of Psychiatry, Munich, Germany
- Munich Cluster for Systems Neurology (SyNergy), Munich, Germany
| | - Richard Myers
- Hudsolpha Institute for Biotechnology, Huntsville, AL, USA
| | - Igor Nenadić
- Department of Psychiatry and Psychotherapy, University of Marburg, Marburg, Germany
- Center for Mind, Brain and Behavior (CMBB), University of Marburg and Justus Liebig University Giessen, Giessen, Germany
| | - Markus M. Nöthen
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
| | - Claire O’Donovan
- Department 20 of Psychiatry, Dalhousie University, Halifax, NS, Canada
| | - Michael O’Donovan
- Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff, UK
| | - Roel A. Ophoff
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, Los Angeles, CA, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
- Department of Psychiatry, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
- Department of Psychiatry and Biobehavioral Science, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Michael J Owen
- Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff, UK
| | | | - Carlos Pato
- Institute for Genomic Health, SUNY Downstate Medical Center College of Medicine, Brooklyn, NY, USA
| | - Michele T. Pato
- Institute for Genomic Health, SUNY Downstate Medical Center College of Medicine, Brooklyn, NY, USA
| | - George P. Patrinos
- University of Patras, School of Health Sciences, Department of Pharmacy, Laboratory of Pharmacogenomics and Individualized Therapy, Patras, Greece
- United Arab Emirates University, College of Medicine and Health Sciences, Department of Genetics and Genomics, Al-Ain, United Arab Emirates
- United Arab Emirates University, Zayed Center for Health Sciences, Al-Ain, United Arab Emirates
| | - Joanna M. Pawlak
- Department of Psychiatry, Departmet of Psychiatric Genetics, Poznan University of Medical Sciences, Poznan, Poland
| | - Roy H. Perlis
- Psychiatry, Harvard Medical School, Boston, MA, USA
- Division of Clinical Research, Massachusetts General Hospital, Boston, MA, USA
| | - Evgenia Porichi
- National and Kapodistrian University of Athens, 2nd Department of Psychiatry, Attikon General Hospital, Athens, Greece
| | - Danielle Posthuma
- Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research, Amsterdam Neuroscience, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
- Department of Clinical Genetics, Amsterdam Neuroscience, Vrije Universiteit Medical Center, Amsterdam, The Netherlands
| | - Josep Antoni Ramos-Quiroga
- Instituto de Salud Carlos III, Biomedical Network Research Centre on Mental Health (CIBERSAM), Madrid, Spain
- Department of Psychiatry, Hospital Universitari Vall d´Hebron, Barcelo, Spain
- Department of Psychiatry and Forensic Medicine, Universitat Autònoma de Barcelo, Barcelo, Spain
- Psychiatric Genetics Unit, Group of Psychiatry Mental Health and Addictions, Vall d´Hebron Research Institut (VHIR), Universitat Autònoma de Barcelo, Barcelo, Spain
| | - Andreas Reif
- Department of Psychiatry, Psychosomatic Medicine and Psychotherapy, University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Eva Z. Reininghaus
- Medical University of Graz, Division of Psychiatry and Psychotherapeutic Medicine, Graz, Austria
| | - Marta Ribasés
- Instituto de Salud Carlos III, Biomedical Network Research Centre on Mental Health (CIBERSAM), Madrid, Spain
- Department of Psychiatry, Hospital Universitari Vall d´Hebron, Barcelona, Spain
- Psychiatric Genetics Unit, Group of Psychiatry Mental Health and Addictions, Vall d´Hebron Research Institut (VHIR), Universitat Autònoma de Barcelona, Barcelona, Spain
- Department of Genetics, Microbiology, and Statistics, Faculty of Biology, Universitat de Barcelona, Barcelona, Spain. Department of Genetics, Microbiology and Statistics, Faculty of Biology, Universitat de Barcelona, Barcelona, Catalonia, Spain
| | - Marcella Rietschel
- Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
| | | | - Thomas G. Schulze
- Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
- Institute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, Germany
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Psychiatry and Psychotherapy, University Medical Center Göttingen, Göttingen, Germany
- Department of Psychiatry and Behavioral Sciences, SUNY Upstate Medical University, Syracuse, NY, USA
| | - Laura Scott
- Center for Statistical Genetics and Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | | | - Alessandro Serretti
- Department of Biomedical and NeuroMotor Sciences, University of Bologna, Bologna, Italy
| | - Cynthia Shannon Weickert
- Discipline of Psychiatry and Mental Health, School of Clinical Medicine, Faculty of Medicine and Health, University of New South Wales, Sydney, NSW, Australia
- Department of Neuroscience, SUNY Upstate Medical University, Syracuse, NY, USA
| | - Jordan W. Smoller
- Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA, USA
- Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
- Psychiatric and Neurodevelopmental Genetics Unit (PNGU), Massachusetts General Hospital, Boston, MA, USA
| | - Maria Soler Artigas
- Instituto de Salud Carlos III, Biomedical Network Research Centre on Mental Health (CIBERSAM), Madrid, Spain
- Department of Psychiatry, Hospital Universitari Vall d´Hebron, Barcelo, Spain
- Psychiatric Genetics Unit, Group of Psychiatry Mental Health and Addictions, Vall d´Hebron Research Institut (VHIR), Universitat Autònoma de Barcelo, Barcelo, Spain
- Department of Genetics, Microbiology, and Statistics, Faculty of Biology, Universitat de Barcelo, Barcelo, Spain
| | - Dan J. Stein
- SAMRC Unit on Risk and Resilience in Mental Disorders, Dept of Psychiatry and Neuroscience Institute, University of Cape Town, Cape Town, South Africa
| | - Fabian Streit
- Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
| | - Claudio Toma
- Neuroscience Research Australia, Sydney, NSW, Australia
- Discipline of Psychiatry and Mental Health, School of Clinical Medicine, Faculty of Medicine and Health, University of New South Wales, Sydney, NSW, Australia
- Centro de Biología Molecular Severo Ochoa, Universidad Autónoma de Madrid and CSIC, Madrid, Spain
| | - Paul Tooney
- University of Newcastle, Newcastle, NSW, Australia
| | - Eduard Vieta
- Clinical Institute of Neuroscience, Hospital Clinic, University of Barcelona, IDIBAPS, CIBERSAM, Barcelona, Spain
| | - John B. Vincent
- Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON, Canada
| | | | - Thomas Weickert
- Discipline of Psychiatry and Mental Health, School of Clinical Medicine, Faculty of Medicine and Health, University of New South Wales, Sydney, NSW, Australia
- Department of Neuroscience, SUNY Upstate Medical University, Syracuse, NY, USA
| | - Stephanie H. Witt
- Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
| | - Kyung Sue Hong
- Department of Psychiatry, Sungkyunkwan University School of Medicine, Samsung Medical Center, Seoul, South Korea
| | - Masashi Ikeda
- Department of Psychiatry, Fujita Health University School of Medicine, Toyoake, Japan
| | - Nakao Iwata
- Department of Psychiatry, Fujita Health University School of Medicine, Toyoake, Japan
| | - Beata Świątkowska
- Department of Environmental Epidemiology, Nofer Institute of Occupational Medicine, Lodz, Poland
| | - Hong-Hee Won
- Samsung Advanced Institute for Health Sciences and Technology (SAIHST), Sungkyunkwan University, Samsung Medical Center, Seoul, Republic of Korea
| | - Howard J. Edenberg
- Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, USA
- Department of Medical and Molecular Genetics, Indiana University, Indianapolis, IN, USA
| | - Stephan Ripke
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin, Berlin, Germany
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Towfique Raj
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Jonathan R. I. Coleman
- Social, Genetic and Developmental Psychiatry Centre, King’s College London, London, UK
- NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Foundation Trust, London, UK
| | - Niamh Mullins
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| |
Collapse
|
10
|
Smeds L, Huson LSA, Ellegren H. Structural genomic variation in the inbred Scandinavian wolf population contributes to the realized genetic load but is positively affected by immigration. Evol Appl 2024; 17:e13652. [PMID: 38333557 PMCID: PMC10848878 DOI: 10.1111/eva.13652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Revised: 01/08/2024] [Accepted: 01/16/2024] [Indexed: 02/10/2024] Open
Abstract
When populations decrease in size and may become isolated, genomic erosion by loss of diversity from genetic drift and accumulation of deleterious mutations is likely an inevitable consequence. In such cases, immigration (genetic rescue) is necessary to restore levels of genetic diversity and counteract inbreeding depression. Recent work in conservation genomics has studied these processes focusing on the genetic diversity of single nucleotide polymorphisms. In contrast, our knowledge about structural genomic variation (insertions, deletions, duplications and inversions) in endangered species is limited. We analysed whole-genome, short-read sequences from 212 wolves from the inbred Scandinavian population and from neighbouring populations in Finland and Russia, and detected >35,000 structural variants (SVs) after stringent quality and genotype frequency filtering; >26,000 high-confidence variants remained after manual curation. The majority of variants were shorter than 1 kb, with a distinct peak in the length distribution of deletions at 190 bp, corresponding to insertion events of SINE/tRNA-Lys elements. The site frequency spectrum of SVs in protein-coding regions was significantly shifted towards rare alleles compared to putatively neutral variants, consistent with purifying selection. The realized genetic load of SVs in protein-coding regions increased with inbreeding levels in the Scandinavian population, but immigration provided a genetic rescue effect by lowering the load and reintroducing ancestral alleles at loci fixed for derived SVs. Our study shows that structural variation comprises a common type of in part deleterious mutations in endangered species and that establishing gene flow is necessary to mitigate the negative consequences of loss of diversity.
Collapse
Affiliation(s)
- Linnéa Smeds
- Department of Ecology and Genetics, Evolutionary BiologyUppsala UniversityUppsalaSweden
| | - Lars S. A. Huson
- Department of Ecology and Genetics, Evolutionary BiologyUppsala UniversityUppsalaSweden
| | - Hans Ellegren
- Department of Ecology and Genetics, Evolutionary BiologyUppsala UniversityUppsalaSweden
| |
Collapse
|
11
|
Shao Z, Buchanan LB, Zuanazzi D, Khan YN, Khan AR, Prodger JL. Comparison between a deep-learning and a pixel-based approach for the automated quantification of HIV target cells in foreskin tissue. Sci Rep 2024; 14:1985. [PMID: 38263439 PMCID: PMC10806185 DOI: 10.1038/s41598-024-52613-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Accepted: 01/21/2024] [Indexed: 01/25/2024] Open
Abstract
The availability of target cells expressing the HIV receptors CD4 and CCR5 in genital tissue is a critical determinant of HIV susceptibility during sexual transmission. Quantification of immune cells in genital tissue is therefore an important outcome for studies on HIV susceptibility and prevention. Immunofluorescence microscopy allows for precise visualization of immune cells in mucosal tissues; however, this technique is limited in clinical studies by the lack of an accurate, unbiased, high-throughput image analysis method. Current pixel-based thresholding methods for cell counting struggle in tissue regions with high cell density and autofluorescence, both of which are common features in genital tissue. We describe a deep-learning approach using the publicly available StarDist method to count cells in immunofluorescence microscopy images of foreskin stained for nuclei, CD3, CD4, and CCR5. The accuracy of the model was comparable to manual counting (gold standard) and surpassed the capability of a previously described pixel-based cell counting method. We show that the performance of our deep-learning model is robust in tissue regions with high cell density and high autofluorescence. Moreover, we show that this deep-learning analysis method is both easy to implement and to adapt for the identification of other cell types in genital mucosal tissue.
Collapse
Affiliation(s)
- Zhongtian Shao
- Department of Microbiology and Immunology, The University of Western Ontario, 1151 Richmond St, London, ON, N6A 3K7, Canada
| | - Lane B Buchanan
- Department of Microbiology and Immunology, The University of Western Ontario, 1151 Richmond St, London, ON, N6A 3K7, Canada
| | - David Zuanazzi
- Department of Microbiology and Immunology, The University of Western Ontario, 1151 Richmond St, London, ON, N6A 3K7, Canada
| | - Yazan N Khan
- Department of Microbiology and Immunology, The University of Western Ontario, 1151 Richmond St, London, ON, N6A 3K7, Canada
| | - Ali R Khan
- Department of Medical Biophysics, The University of Western Ontario, 1151 Richmond St, London, ON, N6A 3K7, Canada
| | - Jessica L Prodger
- Department of Microbiology and Immunology, The University of Western Ontario, 1151 Richmond St, London, ON, N6A 3K7, Canada.
- Department of Epidemiology and Biostatistics, The University of Western Ontario, 1151 Richmond St, London, ON, N6A 3K7, Canada.
| |
Collapse
|
12
|
Clouard C, Nettelblad C. Genotyping of SNPs in bread wheat at reduced cost from pooled experiments and imputation. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:26. [PMID: 38243086 PMCID: PMC10799138 DOI: 10.1007/s00122-023-04533-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 12/19/2023] [Indexed: 01/21/2024]
Abstract
KEY MESSAGE Pooling and imputation are computational methods that can be combined for achieving cost-effective and accurate high-density genotyping of both common and rare variants, as demonstrated in a MAGIC wheat population. The plant breeding industry has shown growing interest in using the genotype data of relevant markers for performing selection of new competitive varieties. The selection usually benefits from large amounts of marker data, and it is therefore crucial to dispose of data collection methods that are both cost-effective and reliable. Computational methods such as genotype imputation have been proposed earlier in several plant science studies for addressing the cost challenge. Genotype imputation methods have though been used more frequently and investigated more extensively in human genetics research. The various algorithms that exist have shown lower accuracy at inferring the genotype of genetic variants occurring at low frequency, while these rare variants can have great significance and impact in the genetic studies that underlie selection. In contrast, pooling is a technique that can efficiently identify low-frequency items in a population, and it has been successfully used for detecting the samples that carry rare variants in a population. In this study, we propose to combine pooling and imputation and demonstrate this by simulating a hypothetical microarray for genotyping a population of recombinant inbred lines in a cost-effective and accurate manner, even for rare variants. We show that with an adequate imputation model, it is feasible to accurately predict the individual genotypes at lower cost than sample-wise genotyping and time-effectively. Moreover, we provide code resources for reproducing the results presented in this study in the form of a containerized workflow.
Collapse
Affiliation(s)
- Camille Clouard
- Division of Scientific Computing, Department of Information Technology, Uppsala University, Lägerhyddsvägen 1, 75237, Uppsala, Sweden.
| | - Carl Nettelblad
- Division of Scientific Computing, Department of Information Technology, Uppsala University, Lägerhyddsvägen 1, 75237, Uppsala, Sweden
- SciLifeLab, Science for Life Laboratory, Husargatan 3, 75237, Uppsala, Sweden
| |
Collapse
|
13
|
Barquin M, Kouzel IU, Ehrmann B, Basler M, Gruber AJ. scTEA-db: a comprehensive database of novel terminal exon isoforms identified from human single cell transcriptomes. Nucleic Acids Res 2024; 52:D1018-D1023. [PMID: 37850641 PMCID: PMC10767918 DOI: 10.1093/nar/gkad878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 09/12/2023] [Accepted: 09/29/2023] [Indexed: 10/19/2023] Open
Abstract
The usage of alternative terminal exons results in messenger RNA (mRNA) isoforms that differ in their 3' untranslated regions (3' UTRs) and often also in their protein-coding sequences. Alternative 3' UTRs contain different sets of cis-regulatory elements known to regulate mRNA stability, translation and localization, all of which are vital to cell identity and function. In previous work, we revealed that ∼25 percent of the experimentally observed RNA 3' ends are located within regions currently annotated as intronic, indicating that many 3' end isoforms remain to be uncovered. Also, the inclusion of not yet annotated terminal exons is more tissue specific compared to the already annotated ones. Here, we present the single cell-based Terminal Exon Annotation database (scTEA-db, www.scTEA-db.org) that provides the community with 12 063 so far not yet annotated terminal exons and associated transcript isoforms identified by analysing 53 069 publicly available single cell transcriptomes. Our scTEA-db web portal offers an array of features to find and explore novel terminal exons belonging to 5538 human genes, 110 of which are known cancer drivers. In summary, scTEA-db provides the foundation for studying the biological role of large numbers of so far not annotated terminal exon isoforms in cell identity and function.
Collapse
Affiliation(s)
- Miguel Barquin
- Department of Biology, University of Konstanz, 78464 Konstanz, Germany
| | - Ian U Kouzel
- Department of Biology, University of Konstanz, 78464 Konstanz, Germany
| | - Beat Ehrmann
- Department of Biology, University of Konstanz, 78464 Konstanz, Germany
| | - Michael Basler
- Department of Biology, University of Konstanz, 78464 Konstanz, Germany
- Biotechnology Institute Thurgau (BITg) at the University of Konstanz, 8280, Kreuzlingen, Switzerland
| | - Andreas J Gruber
- Department of Biology, University of Konstanz, 78464 Konstanz, Germany
| |
Collapse
|
14
|
Putzeys L, Intizar D, Lavigne R, Boon M. Obtaining Detailed Phage Transcriptomes Using ONT-Cappable-Seq. Methods Mol Biol 2024; 2793:207-235. [PMID: 38526733 DOI: 10.1007/978-1-0716-3798-2_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/27/2024]
Abstract
Detailed transcription maps of bacteriophages are not usually explored, limiting our understanding of molecular phage biology and restricting their exploitation and engineering. The ONT-cappable-seq method described here brings phage transcriptomics to the accessible nanopore sequencing platform and provides an affordable and more detailed overview of transcriptional features compared to traditional RNA-seq experiments. With ONT-cappable-seq, primary transcripts are specifically capped, enriched, and prepared for long-read sequencing on the nanopore sequencing platform. This enables end-to-end sequencing of unprocessed transcripts covering both phage and host genome, thus providing insight on their operons. The subsequent analysis pipeline makes it possible to rapidly identify the most important transcriptional features such as transcription start and stop sites. The obtained data can thus provide a comprehensive overview of the transcription by your phage of interest.
Collapse
Affiliation(s)
- Leena Putzeys
- Laboratory of Gene Technology, Department of Biosystems, KU Leuven, Leuven, Belgium
| | - Danish Intizar
- Laboratory of Gene Technology, Department of Biosystems, KU Leuven, Leuven, Belgium
| | - Rob Lavigne
- Laboratory of Gene Technology, Department of Biosystems, KU Leuven, Leuven, Belgium.
| | - Maarten Boon
- Laboratory of Gene Technology, Department of Biosystems, KU Leuven, Leuven, Belgium
| |
Collapse
|
15
|
Corut AK, Wallace JG. kGWASflow: a modular, flexible, and reproducible Snakemake workflow for k-mers-based GWAS. G3 (BETHESDA, MD.) 2023; 14:jkad246. [PMID: 37976215 PMCID: PMC10755180 DOI: 10.1093/g3journal/jkad246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 10/15/2023] [Indexed: 11/19/2023]
Abstract
Genome-wide association studies (GWAS) have been widely used to identify genetic variation associated with complex traits. Despite its success and popularity, the traditional GWAS approach comes with a variety of limitations. For this reason, newer methods for GWAS have been developed, including the use of pan-genomes instead of a reference genome and the utilization of markers beyond single-nucleotide polymorphisms, such as structural variations and k-mers. The k-mers-based GWAS approach has especially gained attention from researchers in recent years. However, these new methodologies can be complicated and challenging to implement. Here, we present kGWASflow, a modular, user-friendly, and scalable workflow to perform GWAS using k-mers. We adopted an existing kmersGWAS method into an easier and more accessible workflow using management tools like Snakemake and Conda and eliminated the challenges caused by missing dependencies and version conflicts. kGWASflow increases the reproducibility of the kmersGWAS method by automating each step with Snakemake and using containerization tools like Docker. The workflow encompasses supplemental components such as quality control, read-trimming procedures, and generating summary statistics. kGWASflow also offers post-GWAS analysis options to identify the genomic location and context of trait-associated k-mers. kGWASflow can be applied to any organism and requires minimal programming skills. kGWASflow is freely available on GitHub (https://github.com/akcorut/kGWASflow) and Bioconda (https://anaconda.org/bioconda/kgwasflow).
Collapse
Affiliation(s)
- Adnan Kivanc Corut
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Jason G Wallace
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
- Institute of Plant Breeding, Genetics, and Genomics, University of Georgia, Athens, GA 30602, USA
- Department of Crop and Soil Sciences, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
16
|
Singh NP, Wu EY, Fan J, Love MI, Patro R. Tree-based differential testing using inferential uncertainty for RNA-Seq. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.25.573288. [PMID: 38234739 PMCID: PMC10793400 DOI: 10.1101/2023.12.25.573288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/19/2024]
Abstract
Identifying differentially expressed transcripts poses a crucial yet challenging problem in transcriptomics. Substantial uncertainty is associated with the abundance estimates of certain transcripts which, if ignored, can lead to the exaggeration of false positives and, if included, may lead to reduced power. For a given set of RNA-Seq samples, TreeTerminus arranges transcripts in a hierarchical tree structure that encodes different layers of resolution for interpretation of the abundance of transcriptional groups, with uncertainty generally decreasing as one ascends the tree from the leaves. We introduce trenDi, which utilizes the tree structure from TreeTerminus for differential testing. The candidate nodes are determined in a data-driven manner to maximize the signal that can be extracted from the data while controlling for the uncertainty associated with estimating the transcript abundances. The identified candidate nodes can include transcripts and inner nodes, with no two nodes having an ancestor/descendant relationship. We evaluated our method on both simulated and experimental datasets, comparing its performance with other tree-based differential methods as well as with uncertainty-aware differential transcript/gene expression methods. Our method detects inner nodes that show a strong signal for differential expression, which would have been overlooked when analyzing the transcripts alone.
Collapse
Affiliation(s)
- Noor Pratap Singh
- Department of Computer Science, University of Maryland, College Park
| | - Euphy Y. Wu
- Department of Biostatistics, University of North Carolina-Chapel Hill
| | - Jason Fan
- Department of Computer Science, University of Maryland, College Park
| | - Michael I. Love
- Department of Biostatistics, University of North Carolina-Chapel Hill
- Department of Genetics, University of North Carolina-Chapel Hill
| | - Rob Patro
- Department of Computer Science, University of Maryland, College Park
| |
Collapse
|
17
|
Sasani TA, Quinlan AR, Harris K. Epistasis between mutator alleles contributes to germline mutation spectra variability in laboratory mice. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.25.537217. [PMID: 37162999 PMCID: PMC10168256 DOI: 10.1101/2023.04.25.537217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Maintaining germline genome integrity is essential and enormously complex. Although many proteins are involved in DNA replication, proofreading, and repair [1], mutator alleles have largely eluded detection in mammals. DNA replication and repair proteins often recognize sequence motifs or excise lesions at specific nucleotides. Thus, we might expect that the spectrum of de novo mutations - the frequencies of C>T, A>G, etc. - will differ between genomes that harbor either a mutator or wild-type allele. Previously, we used quantitative trait locus mapping to discover candidate mutator alleles in the DNA repair gene Mutyh that increased the C>A germline mutation rate in a family of inbred mice known as the BXDs [2,3]. In this study we developed a new method to detect alleles associated with mutation spectrum variation and applied it to mutation data from the BXDs. We discovered an additional C>A mutator locus on chromosome 6 that overlaps Ogg1, a DNA glycosylase involved in the same base-excision repair network as Mutyh [4]. Its effect depended on the presence of a mutator allele near Mutyh, and BXDs with mutator alleles at both loci had greater numbers of C>A mutations than those with mutator alleles at either locus alone. Our new methods for analyzing mutation spectra reveal evidence of epistasis between germline mutator alleles and may be applicable to mutation data from humans and other model organisms.
Collapse
Affiliation(s)
| | - Aaron R. Quinlan
- Department of Human Genetics, University of Utah; Department of Biomedical Informatics, University of Utah · Funded by NIH/NHGRI R01HG012252
| | - Kelley Harris
- Department of Genome Sciences, University of Washington · Funded by NIH/NIGMS R35GM133428; Burroughs Wellcome Career Award at the Scientific Interface; Searle Scholarship; Pew Scholarship; Sloan Fellowship; Allen Discovery Center for Cell Lineage Tracing
| |
Collapse
|
18
|
Sieg JP, Jolley EA, Huot MJ, Babitzke P, Bevilacqua P. In vivo-like nearest neighbor parameters improve prediction of fractional RNA base-pairing in cells. Nucleic Acids Res 2023; 51:11298-11317. [PMID: 37855684 PMCID: PMC10639048 DOI: 10.1093/nar/gkad807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 09/11/2023] [Accepted: 09/27/2023] [Indexed: 10/20/2023] Open
Abstract
We conducted a thermodynamic analysis of RNA stability in Eco80 artificial cytoplasm, which mimics in vivo conditions, and compared it to transcriptome-wide probing of mRNA. Eco80 contains 80% of Escherichia coli metabolites, with biological concentrations of metal ions, including 2 mM free Mg2+ and 29 mM metabolite-chelated Mg2+. Fluorescence-detected binding isotherms (FDBI) were used to conduct a thermodynamic analysis of 24 RNA helices and found that these helices, which have an average stability of -12.3 kcal/mol, are less stable by ΔΔGo37 ∼1 kcal/mol. The FDBI data was used to determine a set of Watson-Crick free energy nearest neighbor parameters (NNPs), which revealed that Eco80 reduces the stability of three NNPs. This information was used to adjust the NN model using the RNAstructure package. The in vivo-like adjustments have minimal effects on the prediction of RNA secondary structures determined in vitro and in silico, but markedly improve prediction of fractional RNA base pairing in E. coli, as benchmarked with our in vivo DMS and EDC RNA chemical probing data. In summary, our thermodynamic and chemical probing analyses of RNA helices indicate that RNA secondary structures are less stable in cells than in artificially stable in vitro buffer conditions.
Collapse
Affiliation(s)
- Jacob P Sieg
- Department of Chemistry, Pennsylvania State University, University Park, PA 16802, USA
- Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Elizabeth A Jolley
- Department of Chemistry, Pennsylvania State University, University Park, PA 16802, USA
- Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Melanie J Huot
- Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Paul Babitzke
- Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Philip C Bevilacqua
- Department of Chemistry, Pennsylvania State University, University Park, PA 16802, USA
- Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|
19
|
Pochon Z, Bergfeldt N, Kırdök E, Vicente M, Naidoo T, van der Valk T, Altınışık NE, Krzewińska M, Dalén L, Götherström A, Mirabello C, Unneberg P, Oskolkov N. aMeta: an accurate and memory-efficient ancient metagenomic profiling workflow. Genome Biol 2023; 24:242. [PMID: 37872569 PMCID: PMC10591440 DOI: 10.1186/s13059-023-03083-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 10/06/2023] [Indexed: 10/25/2023] Open
Abstract
Analysis of microbial data from archaeological samples is a growing field with great potential for understanding ancient environments, lifestyles, and diseases. However, high error rates have been a challenge in ancient metagenomics, and the availability of computational frameworks that meet the demands of the field is limited. Here, we propose aMeta, an accurate metagenomic profiling workflow for ancient DNA designed to minimize the amount of false discoveries and computer memory requirements. Using simulated data, we benchmark aMeta against a current state-of-the-art workflow and demonstrate its superiority in microbial detection and authentication, as well as substantially lower usage of computer memory.
Collapse
Affiliation(s)
- Zoé Pochon
- Centre for Palaeogenetics, Stockholm, Sweden
- Department of Archaeology and Classical Studies, Stockholm University, Stockholm, Sweden
| | - Nora Bergfeldt
- Centre for Palaeogenetics, Stockholm, Sweden
- Department of Zoology, Stockholm University, Stockholm, Sweden
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
| | - Emrah Kırdök
- Department of Biotechnology, Faculty of Science, Mersin University, Mersin, Turkey
| | - Mário Vicente
- Centre for Palaeogenetics, Stockholm, Sweden
- Department of Archaeology and Classical Studies, Stockholm University, Stockholm, Sweden
| | - Thijessen Naidoo
- Centre for Palaeogenetics, Stockholm, Sweden
- Department of Archaeology and Classical Studies, Stockholm University, Stockholm, Sweden
- Ancient DNA Unit, Science for Life Laboratory, Stockholm, Sweden
- Ancient DNA Unit, Science for Life Laboratory, Uppsala, Sweden
| | - Tom van der Valk
- Centre for Palaeogenetics, Stockholm, Sweden
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
| | - N Ezgi Altınışık
- Human-G Laboratory, Department of Anthropology, Hacettepe University, 06800, Beytepe, Ankara, Turkey
| | - Maja Krzewińska
- Centre for Palaeogenetics, Stockholm, Sweden
- Department of Archaeology and Classical Studies, Stockholm University, Stockholm, Sweden
| | - Love Dalén
- Centre for Palaeogenetics, Stockholm, Sweden
- Department of Zoology, Stockholm University, Stockholm, Sweden
| | - Anders Götherström
- Centre for Palaeogenetics, Stockholm, Sweden
- Department of Archaeology and Classical Studies, Stockholm University, Stockholm, Sweden
| | - Claudio Mirabello
- Department of Physics, Chemistry and Biology, Science for Life Laboratory, National Bioinformatics Infrastructure Sweden, Linköping University, Linköping, Sweden
| | - Per Unneberg
- Department of Cell and Molecular Biology, Science for Life Laboratory, National Bioinformatics Infrastructure Sweden, Uppsala University, Uppsala, Sweden
| | - Nikolay Oskolkov
- Department of Biology, Science for Life Laboratory, National Bioinformatics Infrastructure Sweden, Lund University, Lund, Sweden.
| |
Collapse
|
20
|
Skiadopoulou D, Vašíček J, Kuznetsova K, Bouyssié D, Käll L, Vaudel M. Retention Time and Fragmentation Predictors Increase Confidence in Identification of Common Variant Peptides. J Proteome Res 2023; 22:3190-3199. [PMID: 37656829 PMCID: PMC10563157 DOI: 10.1021/acs.jproteome.3c00243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Indexed: 09/03/2023]
Abstract
Precision medicine focuses on adapting care to the individual profile of patients, for example, accounting for their unique genetic makeup. Being able to account for the effect of genetic variation on the proteome holds great promise toward this goal. However, identifying the protein products of genetic variation using mass spectrometry has proven very challenging. Here we show that the identification of variant peptides can be improved by the integration of retention time and fragmentation predictors into a unified proteogenomic pipeline. By combining these intrinsic peptide characteristics using the search-engine post-processor Percolator, we demonstrate improved discrimination power between correct and incorrect peptide-spectrum matches. Our results demonstrate that the drop in performance that is induced when expanding a protein sequence database can be compensated, hence enabling efficient identification of genetic variation products in proteomics data. We anticipate that this enhancement of proteogenomic pipelines can provide a more refined picture of the unique proteome of patients and thereby contribute to improving patient care.
Collapse
Affiliation(s)
- Dafni Skiadopoulou
- Mohn
Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, NO-5020 Bergen, Norway
- Computational
Biology Unit, Department of Informatics, University of Bergen, NO-5020 Bergen, Norway
| | - Jakub Vašíček
- Mohn
Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, NO-5020 Bergen, Norway
- Computational
Biology Unit, Department of Informatics, University of Bergen, NO-5020 Bergen, Norway
| | - Ksenia Kuznetsova
- Mohn
Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, NO-5020 Bergen, Norway
- Computational
Biology Unit, Department of Informatics, University of Bergen, NO-5020 Bergen, Norway
| | - David Bouyssié
- Institut
de Pharmacologie et de Biologie Structurale (IPBS), Université
de Toulouse, CNRS, Université Toulouse III—Paul Sabatier
(UT3), 31000 Toulouse, France
| | - Lukas Käll
- Science
for Life Laboratory, School of Engineering Sciences in Chemistry,
Biotechnology and Health, KTH Royal Institute
of Technology, SE-100 44 Stockholm, Sweden
| | - Marc Vaudel
- Mohn
Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, NO-5020 Bergen, Norway
- Computational
Biology Unit, Department of Informatics, University of Bergen, NO-5020 Bergen, Norway
- Department
of Genetics and Bioinformatics, Health Data and Digitalization, Norwegian Institute of Public Health, N-0213 Oslo, Norway
| |
Collapse
|
21
|
Hsieh PH, Lopes-Ramos CM, Zucknick M, Sandve GK, Glass K, Kuijjer ML. Adjustment of spurious correlations in co-expression measurements from RNA-Sequencing data. Bioinformatics 2023; 39:btad610. [PMID: 37802917 PMCID: PMC10598588 DOI: 10.1093/bioinformatics/btad610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 08/05/2023] [Accepted: 10/05/2023] [Indexed: 10/08/2023] Open
Abstract
MOTIVATION Gene co-expression measurements are widely used in computational biology to identify coordinated expression patterns across a group of samples. Coordinated expression of genes may indicate that they are controlled by the same transcriptional regulatory program, or involved in common biological processes. Gene co-expression is generally estimated from RNA-Sequencing data, which are commonly normalized to remove technical variability. Here, we demonstrate that certain normalization methods, in particular quantile-based methods, can introduce false-positive associations between genes. These false-positive associations can consequently hamper downstream co-expression network analysis. Quantile-based normalization can, however, be extremely powerful. In particular, when preprocessing large-scale heterogeneous data, quantile-based normalization methods such as smooth quantile normalization can be applied to remove technical variability while maintaining global differences in expression for samples with different biological attributes. RESULTS We developed SNAIL (Smooth-quantile Normalization Adaptation for the Inference of co-expression Links), a normalization method based on smooth quantile normalization specifically designed for modeling of co-expression measurements. We show that SNAIL avoids formation of false-positive associations in co-expression as well as in downstream network analyses. Using SNAIL, one can avoid arbitrary gene filtering and retain associations to genes that only express in small subgroups of samples. This highlights the method's potential future impact on network modeling and other association-based approaches in large-scale heterogeneous data. AVAILABILITY AND IMPLEMENTATION The implementation of the SNAIL algorithm and code to reproduce the analyses described in this work can be found in the GitHub repository https://github.com/kuijjerlab/PySNAIL.
Collapse
Affiliation(s)
- Ping-Han Hsieh
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo 0318, Norway
- Department of Informatics, University of Oslo, Oslo 0316, Norway
| | - Camila Miranda Lopes-Ramos
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, United States
- Department of Medicine, Harvard Medical School, Boston, MA 02115, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA 02115, United States
| | - Manuela Zucknick
- Oslo Centre for Biostatistics and Epidemiology, Institute of Basic Medical Sciences, University of Oslo, Oslo 0317, Norway
| | | | - Kimberly Glass
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, United States
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA 02115, United States
| | - Marieke Lydia Kuijjer
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo 0318, Norway
- Department of Pathology, Leiden University Medical Center, Leiden 2300RC, The Netherlands
- Leiden Center of Computational Oncology, Leiden University Medical Center,Leiden 2300RC, The Netherlands
| |
Collapse
|
22
|
Chi H, Hoikkala V, Grüschow S, Graham S, Shirran S, White MF. Antiviral type III CRISPR signalling via conjugation of ATP and SAM. Nature 2023; 622:826-833. [PMID: 37853119 PMCID: PMC10600005 DOI: 10.1038/s41586-023-06620-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 09/06/2023] [Indexed: 10/20/2023]
Abstract
CRISPR systems are widespread in the prokaryotic world, providing adaptive immunity against mobile genetic elements1,2. Type III CRISPR systems, with the signature gene cas10, use CRISPR RNA to detect non-self RNA, activating the enzymatic Cas10 subunit to defend the cell against mobile genetic elements either directly, via the integral histidine-aspartate (HD) nuclease domain3-5 or indirectly, via synthesis of cyclic oligoadenylate second messengers to activate diverse ancillary effectors6-9. A subset of type III CRISPR systems encode an uncharacterized CorA-family membrane protein and an associated NrN family phosphodiesterase that are predicted to function in antiviral defence. Here we demonstrate that the CorA-associated type III-B (Cmr) CRISPR system from Bacteroides fragilis provides immunity against mobile genetic elements when expressed in Escherichia coli. However, B. fragilis Cmr does not synthesize cyclic oligoadenylate species on activation, instead generating S-adenosyl methionine (SAM)-AMP (SAM is also known as AdoMet) by conjugating ATP to SAM via a phosphodiester bond. Once synthesized, SAM-AMP binds to the CorA effector, presumably leading to cell dormancy or death by disruption of the membrane integrity. SAM-AMP is degraded by CRISPR-associated phosphodiesterases or a SAM-AMP lyase, potentially providing an 'off switch' analogous to cyclic oligoadenylate-specific ring nucleases10. SAM-AMP thus represents a new class of second messenger for antiviral signalling, which may function in different roles in diverse cellular contexts.
Collapse
Affiliation(s)
- Haotian Chi
- Biomedical Sciences Research Complex, School of Biology, University of St Andrews, St Andrews, UK
| | - Ville Hoikkala
- Biomedical Sciences Research Complex, School of Biology, University of St Andrews, St Andrews, UK
- University of Jyväskylä, Department of Biological and Environmental Science and Nanoscience Center, Jyväskylä, Finland
| | - Sabine Grüschow
- Biomedical Sciences Research Complex, School of Biology, University of St Andrews, St Andrews, UK
| | - Shirley Graham
- Biomedical Sciences Research Complex, School of Biology, University of St Andrews, St Andrews, UK
| | - Sally Shirran
- Biomedical Sciences Research Complex, School of Biology, University of St Andrews, St Andrews, UK
| | - Malcolm F White
- Biomedical Sciences Research Complex, School of Biology, University of St Andrews, St Andrews, UK.
| |
Collapse
|
23
|
Durand R, Jalbert-Ross J, Fijarczyk A, Dubé AK, Landry CR. Cross-feeding affects the target of resistance evolution to an antifungal drug. PLoS Genet 2023; 19:e1011002. [PMID: 37856537 PMCID: PMC10617708 DOI: 10.1371/journal.pgen.1011002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 10/31/2023] [Accepted: 10/02/2023] [Indexed: 10/21/2023] Open
Abstract
Pathogenic fungi are a cause of growing concern. Developing an efficient and safe antifungal is challenging because of the similar biological properties of fungal and host cells. Consequently, there is an urgent need to better understand the mechanisms underlying antifungal resistance to prolong the efficacy of current molecules. A major step in this direction would be to be able to predict or even prevent the acquisition of resistance. We leverage the power of experimental evolution to quantify the diversity of paths to resistance to the antifungal 5-fluorocytosine (5-FC), commercially known as flucytosine. We generated hundreds of independent 5-FC resistant mutants derived from two genetic backgrounds from wild isolates of Saccharomyces cerevisiae. Through automated pin-spotting, whole-genome and amplicon sequencing, we identified the most likely causes of resistance for most strains. Approximately a third of all resistant mutants evolved resistance through a pleiotropic drug response, a potentially novel mechanism in response to 5-FC, marked by cross-resistance to fluconazole. These cross-resistant mutants are characterized by a loss of respiration and a strong tradeoff in drug-free media. For the majority of the remaining two thirds, resistance was acquired through loss-of-function mutations in FUR1, which encodes an important enzyme in the metabolism of 5-FC. We describe conditions in which mutations affecting this particular step of the metabolic pathway are favored over known resistance mutations affecting a step upstream, such as the well-known target cytosine deaminase encoded by FCY1. This observation suggests that ecological interactions may dictate the identity of resistance hotspots.
Collapse
Affiliation(s)
- Romain Durand
- Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Canada
- PROTEO, Le regroupement québécois de recherche sur la fonction, l’ingénierie et les applications des protéines, Université Laval, Québec, Canada
- Centre de Recherche sur les Données Massives (CRDM), Université Laval, Québec, Canada
- Département de Biologie, Faculté des Sciences et de Génie, Université Laval, Québec, Canada
| | - Jordan Jalbert-Ross
- Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Canada
- PROTEO, Le regroupement québécois de recherche sur la fonction, l’ingénierie et les applications des protéines, Université Laval, Québec, Canada
- Centre de Recherche sur les Données Massives (CRDM), Université Laval, Québec, Canada
| | - Anna Fijarczyk
- Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Canada
- PROTEO, Le regroupement québécois de recherche sur la fonction, l’ingénierie et les applications des protéines, Université Laval, Québec, Canada
- Centre de Recherche sur les Données Massives (CRDM), Université Laval, Québec, Canada
- Département de Biologie, Faculté des Sciences et de Génie, Université Laval, Québec, Canada
| | - Alexandre K. Dubé
- Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Canada
- PROTEO, Le regroupement québécois de recherche sur la fonction, l’ingénierie et les applications des protéines, Université Laval, Québec, Canada
- Centre de Recherche sur les Données Massives (CRDM), Université Laval, Québec, Canada
- Département de Biologie, Faculté des Sciences et de Génie, Université Laval, Québec, Canada
| | - Christian R. Landry
- Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Canada
- PROTEO, Le regroupement québécois de recherche sur la fonction, l’ingénierie et les applications des protéines, Université Laval, Québec, Canada
- Centre de Recherche sur les Données Massives (CRDM), Université Laval, Québec, Canada
- Département de Biologie, Faculté des Sciences et de Génie, Université Laval, Québec, Canada
| |
Collapse
|
24
|
Ziemann M, Poulain P, Bora A. The five pillars of computational reproducibility: bioinformatics and beyond. Brief Bioinform 2023; 24:bbad375. [PMID: 37870287 PMCID: PMC10591307 DOI: 10.1093/bib/bbad375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 09/26/2023] [Accepted: 09/30/2023] [Indexed: 10/24/2023] Open
Abstract
Computational reproducibility is a simple premise in theory, but is difficult to achieve in practice. Building upon past efforts and proposals to maximize reproducibility and rigor in bioinformatics, we present a framework called the five pillars of reproducible computational research. These include (1) literate programming, (2) code version control and sharing, (3) compute environment control, (4) persistent data sharing and (5) documentation. These practices will ensure that computational research work can be reproduced quickly and easily, long into the future. This guide is designed for bioinformatics data analysts and bioinformaticians in training, but should be relevant to other domains of study.
Collapse
Affiliation(s)
- Mark Ziemann
- Deakin University, School of Life and Environmental Sciences, Geelong, Australia
- Burnet Institute, Melbourne, Australia
| | - Pierre Poulain
- Université Paris Cité, CNRS, Institut Jacques Monod, Paris, France
| | - Anusuiya Bora
- Deakin University, School of Life and Environmental Sciences, Geelong, Australia
| |
Collapse
|
25
|
de Visser C, Johansson LF, Kulkarni P, Mei H, Neerincx P, Joeri van der Velde K, Horvatovich P, van Gool AJ, Swertz MA, Hoen PAC‘, Niehues A. Ten quick tips for building FAIR workflows. PLoS Comput Biol 2023; 19:e1011369. [PMID: 37768885 PMCID: PMC10538699 DOI: 10.1371/journal.pcbi.1011369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/30/2023] Open
Abstract
Research data is accumulating rapidly and with it the challenge of fully reproducible science. As a consequence, implementation of high-quality management of scientific data has become a global priority. The FAIR (Findable, Accesible, Interoperable and Reusable) principles provide practical guidelines for maximizing the value of research data; however, processing data using workflows-systematic executions of a series of computational tools-is equally important for good data management. The FAIR principles have recently been adapted to Research Software (FAIR4RS Principles) to promote the reproducibility and reusability of any type of research software. Here, we propose a set of 10 quick tips, drafted by experienced workflow developers that will help researchers to apply FAIR4RS principles to workflows. The tips have been arranged according to the FAIR acronym, clarifying the purpose of each tip with respect to the FAIR4RS principles. Altogether, these tips can be seen as practical guidelines for workflow developers who aim to contribute to more reproducible and sustainable computational science, aiming to positively impact the open science and FAIR community.
Collapse
Affiliation(s)
- Casper de Visser
- Medical BioSciences Department, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Lennart F. Johansson
- Genomics Coordination Center and Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Purva Kulkarni
- Medical BioSciences Department, Radboud University Medical Center, Nijmegen, the Netherlands
- Translational Metabolic Laboratory, Department of Laboratory Medicine, Radboud University Medical Center, Nijmegen, the Netherlands
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Hailiang Mei
- Sequencing Analysis Support Core, Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands
| | - Pieter Neerincx
- Genomics Coordination Center and Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - K. Joeri van der Velde
- Genomics Coordination Center and Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Péter Horvatovich
- Department of Analytical Biochemistry, Groningen Research Institute of Pharmacy, University of Groningen, Groningen, the Netherlands
| | - Alain J. van Gool
- Translational Metabolic Laboratory, Department of Laboratory Medicine, Radboud University Medical Center, Nijmegen, the Netherlands
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Morris A. Swertz
- Genomics Coordination Center and Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Peter A. C. ‘t Hoen
- Medical BioSciences Department, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Anna Niehues
- Medical BioSciences Department, Radboud University Medical Center, Nijmegen, the Netherlands
- Translational Metabolic Laboratory, Department of Laboratory Medicine, Radboud University Medical Center, Nijmegen, the Netherlands
| |
Collapse
|
26
|
Prokoph N, Matthews JD, Trigg RM, Montes‐Mojarro IA, Burke GAA, Fend F, Merkel O, Kenner L, Geoerger B, Johnston R, Murray MJ, Riguad C, Brugières L, Turner SD. Patient-derived xenograft models of ALK+ ALCL reveal preclinical promise for therapy with brigatinib. Br J Haematol 2023; 202:985-994. [PMID: 37357529 PMCID: PMC10952693 DOI: 10.1111/bjh.18953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 06/09/2023] [Accepted: 06/15/2023] [Indexed: 06/27/2023]
Abstract
Anaplastic large-cell lymphoma (ALCL) is a T-cell malignancy predominantly driven by the oncogenic anaplastic lymphoma kinase (ALK), accounting for approximately 15% of all paediatric non-Hodgkin lymphoma. Patients with central nervous system (CNS) relapse are particularly difficult to treat with a 3-year overall survival of 49% and a median survival of 23.5 months. The second-generation ALK inhibitor brigatinib shows superior penetration of the blood-brain barrier unlike the first-generation drug crizotinib and has shown promising results in ALK+ non-small-cell lung cancer. However, the benefits of brigatinib in treating aggressive paediatric ALK+ ALCL are largely unknown. We established a patient-derived xenograft (PDX) resource from ALK+ ALCL patients at or before CNS relapse serving as models to facilitate the development of future therapies. We show in vivo that brigatinib is effective in inducing the remission of PDX models of crizotinib-resistant (ALK C1156Y, TP53 loss) ALCL and furthermore that it is superior to crizotinib as a second-line approach to the treatment of a standard chemotherapy relapsed/refractory ALCL PDX pointing to brigatinib as a future therapeutic option.
Collapse
Affiliation(s)
- Nina Prokoph
- Division of Cellular and Molecular Pathology, Department of PathologyUniversity of Cambridge, Addenbrooke's HospitalCambridgeUK
| | - Jamie D. Matthews
- Division of Cellular and Molecular Pathology, Department of PathologyUniversity of Cambridge, Addenbrooke's HospitalCambridgeUK
| | - Ricky M. Trigg
- Division of Cellular and Molecular Pathology, Department of PathologyUniversity of Cambridge, Addenbrooke's HospitalCambridgeUK
| | - Ivonne A. Montes‐Mojarro
- Institute of Pathology and Neuropathology and Comprehensive Cancer Center TübingenUniversity Hospital Tübingen, Eberhard‐Karls‐UniversityTübingenGermany
| | - G. A. Amos Burke
- Department of Paediatric Haematology and OncologyCambridge University Hospitals NHS Foundation TrustCambridgeUK
| | - Falko Fend
- Institute of Pathology and Neuropathology and Comprehensive Cancer Center TübingenUniversity Hospital Tübingen, Eberhard‐Karls‐UniversityTübingenGermany
| | - Olaf Merkel
- Department of Experimental Pathology and Laboratory Animal Pathology, Institute of Clinical PathologyMedical University of ViennaViennaAustria
| | - Lukas Kenner
- Department of Experimental Pathology and Laboratory Animal Pathology, Institute of Clinical PathologyMedical University of ViennaViennaAustria
- Unit of Laboratory Animal PathologyUniversity of Veterinary Medicine ViennaViennaAustria
- Christian Doppler Laboratory for Applied MetabolomicsMedical University of ViennaViennaAustria
- Center for Biomarker Research in Medicine (CBmed) Vienna, Core‐Lab2Medical University of ViennaViennaAustria
| | - Birgit Geoerger
- Department of Pediatric and Adolescent OncologyGustave Roussy Cancer CenterVillejuifFrance
- INSERM U1015, Gustave Roussy Cancer CenterUniversité Paris‐SaclayVillejuifFrance
| | - Robert Johnston
- Department of Paediatric Oncology/HaematologyRoyal Belfast Hospital for Sick ChildrenBelfastUK
| | - Matthew J. Murray
- Division of Cellular and Molecular Pathology, Department of PathologyUniversity of Cambridge, Addenbrooke's HospitalCambridgeUK
- Department of Paediatric Haematology and OncologyCambridge University Hospitals NHS Foundation TrustCambridgeUK
| | - Charlotte Riguad
- Department of Pediatric and Adolescent OncologyGustave Roussy Cancer CenterVillejuifFrance
| | - Laurence Brugières
- Department of Pediatric and Adolescent OncologyGustave Roussy Cancer CenterVillejuifFrance
| | - Suzanne D. Turner
- Division of Cellular and Molecular Pathology, Department of PathologyUniversity of Cambridge, Addenbrooke's HospitalCambridgeUK
- Institute of Medical Genetics and Genomics, Faculty of MedicineMasaryk UniversityBrnoCzech Republic
| |
Collapse
|
27
|
Blum LN, Colman DR, Eloe-Fadrosh EA, Kellom M, Boyd ES, Zhaxybayeva O, Leavitt WD. Distribution and abundance of tetraether lipid cyclization genes in terrestrial hot springs reflect pH. Environ Microbiol 2023; 25:1644-1658. [PMID: 37032561 DOI: 10.1111/1462-2920.16375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Accepted: 03/15/2023] [Indexed: 04/11/2023]
Abstract
Many Archaea produce membrane-spanning lipids that enable life in extreme environments. These isoprenoid glycerol dibiphytanyl glycerol tetraethers (GDGTs) may contain up to eight cyclopentyl and one cyclohexyl ring, where higher degrees of cyclization are associated with more acidic, hotter or energy-limited conditions. Recently, the genes encoding GDGT ring synthases, grsAB, were identified in two Sulfolobaceae; however, the distribution and abundance of grs homologs across environments inhabited by these and related organisms remain a mystery. To address this, we examined the distribution of grs homologs in relation to environmental temperature and pH, from thermal springs across Earth, where sequences derive from metagenomes, metatranscriptomes, single-cell and cultivar genomes. The abundance of grs homologs shows a strong negative correlation to pH, but a weak positive correlation to temperature. Archaeal genomes and metagenome-assembled genomes (MAGs) that carry two or more grs copies are more abundant in low pH springs. We also find grs in 12 archaeal classes, with the most representatives in Thermoproteia, followed by MAGs of the uncultured Korarchaeia, Bathyarchaeia and Hadarchaeia, while several Nitrososphaeria encodes >3 copies. Our findings highlight the key role of grs-catalysed lipid cyclization in archaeal diversification across hot and acidic environments.
Collapse
Affiliation(s)
- Laura N Blum
- Department of Earth Sciences, Dartmouth College, Hanover, New Hampshire, USA
- Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Daniel R Colman
- Department of Microbiology and Cell Biology, Montana State University, Bozeman, Montana, USA
| | | | - Matthew Kellom
- Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Eric S Boyd
- Department of Microbiology and Cell Biology, Montana State University, Bozeman, Montana, USA
| | - Olga Zhaxybayeva
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire, USA
| | - William D Leavitt
- Department of Earth Sciences, Dartmouth College, Hanover, New Hampshire, USA
- Department of Chemistry, Dartmouth College, Hanover, New Hampshire, USA
| |
Collapse
|
28
|
Sønderstrup M, Batiuk MY, Mantas P, Tapias-Espinosa C, Oliveras I, Cañete T, Sampedro-Viana D, Brudek T, Rydbirk R, Khodosevich K, Fernandez-Teruel A, Elfving B, Aznar S. A maturational shift in the frontal cortex synaptic transcriptional landscape underlies schizophrenia-relevant behavioural traits: A congenital rat model. Eur Neuropsychopharmacol 2023; 74:32-46. [PMID: 37263043 DOI: 10.1016/j.euroneuro.2023.05.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 04/05/2023] [Accepted: 05/10/2023] [Indexed: 06/03/2023]
Abstract
Disruption of brain development early in life may underlie the neurobiology behind schizophrenia. We have reported more immature synaptic spines in the frontal cortex (FC) of adult Roman High-Avoidance (RHA-I) rats, a behavioural model displaying schizophrenia-like traits. Here, we performed a whole transcriptome analysis in the FC of 4 months old male RHA-I (n=8) and its counterpart, the Roman Low-Avoidance (RLA-I) (n=8). We identified 203 significant genes with overrepresentation of genes involved in synaptic function. Next, we performed a gene set enrichment analysis (GSEA) for genes co-expressed during neurodevelopment. Gene networks were obtained by weighted gene co-expression network analysis (WGCNA) of a transcriptomic dataset containing human FC during lifespan (n=269). Out of thirty-one functional gene networks, six were significantly enriched in the RHA-I. These were differentially regulated during infancy and enriched in biological ontologies related to myelination, synaptic function, and immune response. We validated differential gene expression in a new cohort of adolescent (<=2 months old) and young-adult (>=3 months old) RHA-I and RLA-I rats. The results confirmed overexpression of Gsn, Nt5cd1, Ppp1r1b, and Slc9a3r1 in young-adult RHA-I, while Cables1, a regulator of Cdk5 phosphorylation in actin regulation and involved in synaptic plasticity and maturation, was significantly downregulated in adolescent RHA-I. This age-related expression change was also observed for presynaptic components Snap25 and Snap29. Our results show a different maturational expression profile of synaptic components in the RHA-I strain, supporting a shift in FC maturation underlying schizophrenia-like behavioural traits and adding construct validity to this strain as a neurodevelopmental model.
Collapse
Affiliation(s)
- Marie Sønderstrup
- Centre for Neuroscience and Stereology, Copenhagen University Hospital Bispebjerg-Frederiksberg, Denmark; Translational Neuropsychiatry Unit, Department of Clinical Medicine, Aarhus University, Denmark
| | - Mykhailo Y Batiuk
- Biotech Research and Innovation Centre (BRIC), Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Panagiotis Mantas
- Department of Health Technology, Technical University of Denmark (DTU), Denmark
| | - Carles Tapias-Espinosa
- Department of Psychiatry and Forensic Medicine, School of Medicine, Universidad Autónoma de Barcelona, Spain
| | - Ignasi Oliveras
- Centre for Neuroscience and Stereology, Copenhagen University Hospital Bispebjerg-Frederiksberg, Denmark; Department of Psychiatry and Forensic Medicine, School of Medicine, Universidad Autónoma de Barcelona, Spain
| | - Toni Cañete
- Department of Psychiatry and Forensic Medicine, School of Medicine, Universidad Autónoma de Barcelona, Spain
| | - Daniel Sampedro-Viana
- Department of Psychiatry and Forensic Medicine, School of Medicine, Universidad Autónoma de Barcelona, Spain
| | - Tomasz Brudek
- Centre for Neuroscience and Stereology, Copenhagen University Hospital Bispebjerg-Frederiksberg, Denmark; Center for Translational Research, Copenhagen University Hospital Bispebjerg-Frederiksberg, Denmark
| | - Rasmus Rydbirk
- Biotech Research and Innovation Centre (BRIC), Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Konstantin Khodosevich
- Biotech Research and Innovation Centre (BRIC), Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Alberto Fernandez-Teruel
- Department of Psychiatry and Forensic Medicine, School of Medicine, Universidad Autónoma de Barcelona, Spain.
| | - Betina Elfving
- Translational Neuropsychiatry Unit, Department of Clinical Medicine, Aarhus University, Denmark
| | - Susana Aznar
- Centre for Neuroscience and Stereology, Copenhagen University Hospital Bispebjerg-Frederiksberg, Denmark; Center for Translational Research, Copenhagen University Hospital Bispebjerg-Frederiksberg, Denmark.
| |
Collapse
|
29
|
Das S, Dinpazhoh L, Tanemura KA, Merz KM. Rapid and Automated Ab Initio Metabolite Collisional Cross Section Prediction from SMILES Input. J Chem Inf Model 2023; 63:4995-5000. [PMID: 37548575 DOI: 10.1021/acs.jcim.3c00890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
We implemented an ab initio CCS prediction workflow which incrementally refines generated structures using molecular mechanics, a deep learning potential, conformational clustering, and quantum mechanics (QM). Automating intermediate steps for a high performance computing (HPC) environment allows users to input the SMILES structure of small organic molecules and obtain a Boltzmann averaged collisional cross section (CCS) value as output. The CCS of a molecular species is a metric measured by ion mobility spectrometry (IMS) which can improve annotation of untargeted metabolomics experiments. We report only a minor drop in accuracy when we expedite the CCS calculation by replacing the QM geometry refinement step with a single-point energy calculation. Even though the workflow involves stochastic steps (i.e., conformation generation and clustering), the final CCS value was highly reproducible for multiple iterations on L-carnosine. Finally, we illustrate that the gas phase ensembles modeled for the workflow are intermediate files which can be used for the prediction of other properties such as aqueous phase nuclear magnetic resonance chemical shift prediction. The software is available at the following link: https://github.com/DasSusanta/snakemake_ccs.
Collapse
Affiliation(s)
- Susanta Das
- Department of Chemistry, Michigan State University, 578 S. Shaw Lane, East Lansing, Michigan 48824, United States
| | - Laleh Dinpazhoh
- Department of Chemistry, Michigan State University, 578 S. Shaw Lane, East Lansing, Michigan 48824, United States
| | - Kiyoto Aramis Tanemura
- Department of Chemistry, Michigan State University, 578 S. Shaw Lane, East Lansing, Michigan 48824, United States
| | - Kenneth M Merz
- Department of Chemistry, Michigan State University, 578 S. Shaw Lane, East Lansing, Michigan 48824, United States
| |
Collapse
|
30
|
Dimayacyac JR, Wu S, Jiang D, Pennell M. Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.09.527893. [PMID: 37645857 PMCID: PMC10461906 DOI: 10.1101/2023.02.09.527893] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Phylogenetic comparative methods are increasingly used to test hypotheses about the evolutionary processes that drive divergence in gene expression among species. However, it is unknown whether the distributional assumptions of phylogenetic models designed for quantitative phenotypic traits are realistic for expression data and importantly, the reliability of conclusions of phylogenetic comparative studies of gene expression may depend on whether the data is well-described by the chosen model. To evaluate this, we first fit several phylogenetic models of trait evolution to 8 previously published comparative expression datasets, comprising a total of 54,774 genes with 145,927 unique gene-tissue combinations. Using a previously developed approach, we then assessed how well the best model of the set described the data in an absolute (not just relative) sense. First, we find that Ornstein-Uhlenbeck models, in which expression values are constrained around an optimum, were the preferred model for 66% of gene-tissue combinations. Second, we find that for 61% of gene-tissue combinations, the best fit model of the set was found to perform well; the rest were found to be performing poorly by at least one of the test statistics we examined. Third, we find that when simple models do not perform well, this appears to be typically a consequence of failing to fully account for heterogeneity in the rate of the evolution. We advocate that assessment of model performance should become a routine component of phylogenetic comparative expression studies; doing so can improve the reliability of inferences and inspire the development of novel models.
Collapse
Affiliation(s)
- Jose Rafael Dimayacyac
- Department of Zoology, University of British Columbia, Canada
- Michael Smith Laboratories, University of British Columbia, Canada
| | - Shanyun Wu
- Department of Zoology, University of British Columbia, Canada
- Department of Genetics, Washington University School of Medicine, USA
| | - Daohan Jiang
- Department of Quantitative and Computational Biology, University of Southern California, USA
| | - Matt Pennell
- Department of Zoology, University of British Columbia, Canada
- Department of Quantitative and Computational Biology, University of Southern California, USA
- Department of Biological Sciences, University of Southern California, USA
| |
Collapse
|
31
|
Kwok N, Aretz Z, Takao S, Ser Z, Cifani P, Kentsis A. Integrative Proteogenomics Using ProteomeGenerator2. J Proteome Res 2023; 22:2750-2764. [PMID: 37418425 PMCID: PMC10783198 DOI: 10.1021/acs.jproteome.3c00005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/09/2023]
Abstract
Recent advances in nucleic acid sequencing now permit rapid and genome-scale analysis of genetic variation and transcription, enabling population-scale studies of human biology, disease, and diverse organisms. Likewise, advances in mass spectrometry proteomics now permit highly sensitive and accurate studies of protein expression at the whole proteome-scale. However, most proteomic studies rely on consensus databases to match spectra to peptide and protein sequences, and thus remain limited to the analysis of canonical protein sequences. Here, we develop ProteomeGenerator2 (PG2), based on the scalable and modular ProteomeGenerator framework. PG2 integrates genome and transcriptome sequencing to incorporate protein variants containing amino acid substitutions, insertions, and deletions, as well as noncanonical reading frames, exons, and other variants caused by genomic and transcriptomic variation. We benchmarked PG2 using synthetic data and genomic, transcriptomic, and proteomic analysis of human leukemia cells. PG2 can be integrated with current and emerging sequencing technologies, assemblers, variant callers, and mass spectral analysis algorithms, and is available open-source from https://github.com/kentsisresearchgroup/ProteomeGenerator2.
Collapse
Affiliation(s)
- Nathaniel Kwok
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY
- Doctor of Medicine Program, Weill Cornell Medicine, New York, NY
- Department of Graduate Medical Education, HCA TriStar-Centennial Medical Center, Nashville, TN
| | - Zita Aretz
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY
- Physiology Biophysics and Systems Biology Program, Weill Cornell Graduate School, Cornell University, New York, NY
| | - Sumiko Takao
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY
- Tow Center for Developmental Oncology, Department of Pediatrics, Memorial Sloan Kettering Cancer Center New York, NY
| | - Zheng Ser
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Paolo Cifani
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Alex Kentsis
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY
- Tow Center for Developmental Oncology, Department of Pediatrics, Memorial Sloan Kettering Cancer Center New York, NY
- Departments of Pediatrics, Pharmacology, and Physiology & Biophysics, Weill Cornell Medical College, Cornell University, New York, NY
| |
Collapse
|
32
|
Paril J, Zare T, Fournier-Level A. Compare_Genomes: A Comparative Genomics Workflow to Streamline the Analysis of Evolutionary Divergence Across Eukaryotic Genomes. Curr Protoc 2023; 3:e876. [PMID: 37638775 DOI: 10.1002/cpz1.876] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/29/2023]
Abstract
The dawn of cost-effective genome assembly is enabling deep comparative genomics to address fundamental evolutionary questions by comparing the genomes of multiple species. However, comparative genomics analyses frequently deploy multiple, often purpose-built frameworks, limiting their transferability and replicability. Here, we present compare_genomes, a transferable and extensible comparative genomics workflow package we developed that streamlines the identification of orthologous families within and across eukaryotic genomes and tests for the presence of several mechanisms of evolution (gene family expansion or contraction and substitution rates within protein-coding sequences). The workflow is available for Linux, written as a Nextflow workflow that calls established genomics and phylogenetics tools to streamline the analysis and visualization of eukaryotic genome divergence. This workflow is freely available at https://github.com/jeffersonfparil/compare_genomes, distributed under the GNU General Public License version 3 (GPLv3). © 2023 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol: Comparative genomics with Nextflow and Conda.
Collapse
Affiliation(s)
- Jefferson Paril
- School of BioSciences, University of Melbourne, Parkville, Victoria, Australia
| | - Tannaz Zare
- School of BioSciences, University of Melbourne, Parkville, Victoria, Australia
| | | |
Collapse
|
33
|
Teo QW, Wang Y, Lv H, Tan TJ, Lei R, Mao KJ, Wu NC. Stringent and complex sequence constraints of an IGHV1-69 broadly neutralizing antibody to influenza HA stem. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.06.547908. [PMID: 37461670 PMCID: PMC10350038 DOI: 10.1101/2023.07.06.547908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/25/2023]
Abstract
IGHV1-69 is frequently utilized by broadly neutralizing influenza antibodies to the hemagglutinin (HA) stem. These IGHV1-69 HA stem antibodies have diverse complementarity-determining region (CDR) H3 sequences. Besides, their light chains have minimal to no contact with the epitope. Consequently, sequence determinants that confer IGHV1-69 antibodies with HA stem specificity remain largely elusive. Using high-throughput experiments, this study revealed the importance of light chain sequence for the IGHV1-69 HA stem antibody CR9114, which is the broadest influenza antibody known to date. Moreover, we demonstrated that the CDR H3 sequences from many other IGHV1-69 antibodies, including those to HA stem, were incompatible with CR9114. Along with mutagenesis and structural analysis, our results indicate that light chain and CDR H3 sequences coordinately determine the HA stem specificity of IGHV1-69 antibodies. Overall, this work provides molecular insights into broadly neutralizing antibody responses to influenza virus, which have important implications for universal influenza vaccine development.
Collapse
Affiliation(s)
- Qi Wen Teo
- Department of Biochemistry, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
- Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Yiquan Wang
- Department of Biochemistry, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Huibin Lv
- Department of Biochemistry, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
- Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Timothy J.C. Tan
- Center for Biophysics and Quantitative Biology, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Ruipeng Lei
- Department of Biochemistry, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Kevin J. Mao
- Department of Biochemistry, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Nicholas C. Wu
- Department of Biochemistry, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
- Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
- Center for Biophysics and Quantitative Biology, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
- Carle Illinois College of Medicine, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
34
|
Kunzmann P, Müller TD, Greil M, Krumbach JH, Anter JM, Bauer D, Islam F, Hamacher K. Biotite: new tools for a versatile Python bioinformatics library. BMC Bioinformatics 2023; 24:236. [PMID: 37277726 PMCID: PMC10243083 DOI: 10.1186/s12859-023-05345-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Accepted: 05/18/2023] [Indexed: 06/07/2023] Open
Abstract
BACKGROUND Biotite is a program library for sequence and structural bioinformatics written for the Python programming language. It implements widely used computational methods into a consistent and accessible package. This allows for easy combination of various data analysis, modeling and simulation methods. RESULTS This article presents major functionalities introduced into Biotite since its original publication. The fields of application are shown using concrete examples. We show that the computational performance of Biotite for bioinformatics tasks is comparable to individual, special purpose software systems specifically developed for the respective single task. CONCLUSIONS The results show that Biotite can be used as program library to either answer specific bioinformatics questions and simultaneously allow the user to write entire, self-contained software applications with sufficient performance for general application.
Collapse
Affiliation(s)
- Patrick Kunzmann
- Computational Biology and Simulation, Technical University of Darmstadt, Schnittspahnstraße 2, 64287, Darmstadt, Germany.
| | - Tom David Müller
- Department of Computer Science, Eberhard Karls University of Tübingen, Sand 14, 72076, Tübingen, Germany
| | | | - Jan Hendrik Krumbach
- Computational Biology and Simulation, Technical University of Darmstadt, Schnittspahnstraße 2, 64287, Darmstadt, Germany
| | - Jacob Marcel Anter
- Computational Biology and Simulation, Technical University of Darmstadt, Schnittspahnstraße 2, 64287, Darmstadt, Germany
| | - Daniel Bauer
- Computational Biology and Simulation, Technical University of Darmstadt, Schnittspahnstraße 2, 64287, Darmstadt, Germany
| | - Faisal Islam
- Computational Biology and Simulation, Technical University of Darmstadt, Schnittspahnstraße 2, 64287, Darmstadt, Germany
| | - Kay Hamacher
- Computational Biology and Simulation, Technical University of Darmstadt, Schnittspahnstraße 2, 64287, Darmstadt, Germany
| |
Collapse
|
35
|
O'Connell KA, Yosufzai ZB, Campbell RA, Lobb CJ, Engelken HT, Gorrell LM, Carlson TB, Catana JJ, Mikdadi D, Bonazzi VR, Klenk JA. Accelerating genomic workflows using NVIDIA Parabricks. BMC Bioinformatics 2023; 24:221. [PMID: 37259021 PMCID: PMC10230726 DOI: 10.1186/s12859-023-05292-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 04/15/2023] [Indexed: 06/02/2023] Open
Abstract
BACKGROUND As genome sequencing becomes better integrated into scientific research, government policy, and personalized medicine, the primary challenge for researchers is shifting from generating raw data to analyzing these vast datasets. Although much work has been done to reduce compute times using various configurations of traditional CPU computing infrastructures, Graphics Processing Units (GPUs) offer opportunities to accelerate genomic workflows by orders of magnitude. Here we benchmark one GPU-accelerated software suite called NVIDIA Parabricks on Amazon Web Services (AWS), Google Cloud Platform (GCP), and an NVIDIA DGX cluster. We benchmarked six variant calling pipelines, including two germline callers (HaplotypeCaller and DeepVariant) and four somatic callers (Mutect2, Muse, LoFreq, SomaticSniper). RESULTS We achieved up to 65 × acceleration with germline variant callers, bringing HaplotypeCaller runtimes down from 36 h to 33 min on AWS, 35 min on GCP, and 24 min on the NVIDIA DGX. Somatic callers exhibited more variation between the number of GPUs and computing platforms. On cloud platforms, GPU-accelerated germline callers resulted in cost savings compared with CPU runs, whereas some somatic callers were more expensive than CPU runs because their GPU acceleration was not sufficient to overcome the increased GPU cost. CONCLUSIONS Germline variant callers scaled well with the number of GPUs across platforms, whereas somatic variant callers exhibited more variation in the number of GPUs with the fastest runtimes, suggesting that, at least with the version of Parabricks used here, these workflows are less GPU optimized and require benchmarking on the platform of choice before being deployed at production scales. Our study demonstrates that GPUs can be used to greatly accelerate genomic workflows, thus bringing closer to grasp urgent societal advances in the areas of biosurveillance and personalized medicine.
Collapse
Affiliation(s)
- Kyle A O'Connell
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | | | - Ross A Campbell
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | - Collin J Lobb
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | - Haley T Engelken
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | - Laura M Gorrell
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | - Thad B Carlson
- Cloud Managed Services, Deloitte Consulting LLP, Detroit, MI, 48226, USA
| | - Josh J Catana
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | - Dina Mikdadi
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA
| | - Vivien R Bonazzi
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA.
| | - Juergen A Klenk
- Health Data and AI, Deloitte Consulting LLP, VA, 22009, Arlington, USA.
| |
Collapse
|
36
|
Hubbard A, Hemming-Schroeder E, Machani MG, Afrane Y, Yan G, Lo E, Janies D. Implementing landscape genetics in molecular epidemiology to determine drivers of vector-borne disease: A malaria case study. Mol Ecol 2023; 32:1848-1859. [PMID: 36645165 PMCID: PMC10694861 DOI: 10.1111/mec.16846] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 12/02/2022] [Accepted: 01/05/2023] [Indexed: 01/17/2023]
Abstract
This study employs landscape genetics to investigate the environmental drivers of a deadly vector-borne disease, malaria caused by Plasmodium falciparum, in a more spatially comprehensive manner than any previous work. With 1804 samples from 44 sites collected in western Kenya in 2012 and 2013, we performed resistance surface analysis to show that Lake Victoria acts as a barrier to transmission between areas north and south of the Winam Gulf. In addition, Mantel correlograms clearly showed significant correlations between genetic and geographic distance over short distances (less than 70 km). In both cases, we used an identity-by-state measure of relatedness tailored to find highly related individual parasites in order to focus on recent gene flow that is more relevant to disease transmission. To supplement these results, we performed conventional population genetics analyses, including Bayesian clustering methods and spatial ordination techniques. These analyses revealed some differentiation on the basis of geography and elevation and a cluster of genetic similarity in the lowlands north of the Winam Gulf of Lake Victoria. Taken as a whole, these results indicate low overall genetic differentiation in the Lake Victoria region, but with some separation of parasite populations north and south of the Winam Gulf that is explained by the presence of the lake as a geographic barrier to gene flow. We recommend similar landscape genetics analyses in future molecular epidemiology studies of vector-borne diseases to extend and contextualize the results of traditional population genetics.
Collapse
Affiliation(s)
- Alfred Hubbard
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, North Carolina, Charlotte, USA
- Center for Computational Intelligence to Predict Health and Environmental Risks (CIPHER), University of North Carolina at Charlotte, Charlotte, North Carolina, USA
| | - Elizabeth Hemming-Schroeder
- Department of Microbiology, Center for Vector-borne Infectious Diseases (CVID), Colorado State University, Fort Collins, Colorado, USA
| | | | - Yaw Afrane
- Department of Medical Microbiology, University of Ghana Medical School, Accra, Ghana
| | - Guiyun Yan
- Program in Public Health, University of California, Irvine, California, USA
| | - Eugenia Lo
- Center for Computational Intelligence to Predict Health and Environmental Risks (CIPHER), University of North Carolina at Charlotte, Charlotte, North Carolina, USA
- Department of Biological Sciences, University of North Carolina at Charlotte, Charlotte, North Carolina, USA
- School of Data Science, University of North Carolina at Charlotte, Charlotte, North Carolina, USA
| | - Daniel Janies
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, North Carolina, Charlotte, USA
- Center for Computational Intelligence to Predict Health and Environmental Risks (CIPHER), University of North Carolina at Charlotte, Charlotte, North Carolina, USA
| |
Collapse
|
37
|
Swift M, Horns F, Quake SR. Lineage tracing reveals fate bias and transcriptional memory in human B cells. Life Sci Alliance 2023; 6:e202201792. [PMID: 36639222 PMCID: PMC9840405 DOI: 10.26508/lsa.202201792] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 12/15/2022] [Accepted: 12/16/2022] [Indexed: 01/15/2023] Open
Abstract
We combined single-cell transcriptomics and lineage tracing to understand fate choice in human B cells. Using the antibody sequences of B cells, we tracked clones during in vitro differentiation. Clonal analysis revealed a subset of IgM+ B cells which were more proliferative than other B-cell types. Whereas the population of B cells adopted diverse states during differentiation, clones had a restricted set of fates available to them; there were two times more single-fate clones than expected given population-level cell-type diversity. This implicated a molecular memory of initial cell states that was propagated through differentiation. We then identified the genes which had strongest coherence within clones. These genes significantly overlapped known B-cell fate determination programs, suggesting the genes which determine cell identity are most robustly controlled on a clonal level. Persistent clonal identities were also observed in human plasma cells from bone marrow, indicating that these transcriptional programs maintain long-term cell identities in vivo. Thus, we show how cell-intrinsic fate bias influences differentiation outcomes in vitro and in vivo.
Collapse
Affiliation(s)
- Michael Swift
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA, USA
| | - Felix Horns
- Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA, USA
| | - Stephen R Quake
- Department of Bioengineering, Stanford University, Stanford, CA, USA
- Department of Applied Physics, Stanford University, Stanford, CA, USA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
| |
Collapse
|
38
|
Metagenomic Analysis of the Abundance and Composition of Antibiotic Resistance Genes in Hospital Wastewater in Benin, Burkina Faso, and Finland. mSphere 2023; 8:e0053822. [PMID: 36728456 PMCID: PMC9942590 DOI: 10.1128/msphere.00538-22] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Antibiotic resistance is a global threat to human health, with the most severe effect in low- and middle-income countries. We explored the presence of antibiotic resistance genes (ARGs) in the hospital wastewater (HWW) of nine hospitals in Benin and Burkina Faso, two low-income countries in West Africa, with shotgun metagenomic sequencing. For comparison, we also studied six hospitals in Finland. The highest sum of the relative abundance of ARGs in the 68 HWW samples was detected in Benin and the lowest in Finland. HWW resistomes and mobilomes in Benin and Burkina Faso resembled each other more than those in Finland. Many carbapenemase genes were detected at various abundances, especially in HWW from Burkina Faso and Finland. The blaGES genes, the most widespread carbapenemase gene in the Beninese HWW, were also found in water intended for hand washing and in a puddle at a hospital yard in Benin. mcr genes were detected in the HWW of all three countries, with mcr-5 being the most common mcr gene. These and other mcr genes were observed in very high relative abundances, even in treated wastewater in Burkina Faso and a street gutter in Benin. The results highlight the importance of wastewater treatment, with particular attention to HWW. IMPORTANCE The global emergence and increased spread of antibiotic resistance threaten the effectiveness of antibiotics and, thus, the health of the entire population. Therefore, understanding the resistomes in different geographical locations is crucial in the global fight against the antibiotic resistance crisis. However, this information is scarce in many low- and middle-income countries (LMICs), such as those in West Africa. In this study, we describe the resistomes of hospital wastewater in Benin and Burkina Faso and, as a comparison, Finland. Our results help to understand the hitherto unrevealed resistance in Beninese and Burkinabe hospitals. Furthermore, the results emphasize the importance of wastewater management infrastructure design to minimize exposure events between humans, HWW, and the environment, preventing the circulation of resistant bacteria and ARGs between humans (hospitals and community) and the environment.
Collapse
|
39
|
Schmidt S, Toivonen S, Medvedev P, Tomescu AI. The omnitig framework can improve genome assembly contiguity in practice. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.30.526175. [PMID: 36778435 PMCID: PMC9915519 DOI: 10.1101/2023.01.30.526175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Despite the long history of genome assembly research, there remains a large gap between the theoretical and practical work. There is practical software with little theoretical underpinning of accuracy on one hand and theoretical algorithms which have not been adopted in practice on the other. In this paper we attempt to bridge the gap between theory and practice by showing how the theoretical safe-and-complete framework can be integrated into existing assemblers in order to improve contiguity. The optimal algorithm in this framework, called the omnitig algorithm, has not been used in practice due to its complexity and its lack of robustness to real data. Instead, we pursue a simplified notion of omnitigs, giving an efficient algorithm to compute them and demonstrating their safety under certain conditions. We modify two assemblers (wtdbg2 and Flye) by replacing their unitig algorithm with the simple omnitig algorithm. We test our modifications using real HiFi data from the Drosophilia melanogaster and the Caenorhabditis elegans genome. Our modified algorithms lead to a substantial improvement in alignment-based contiguity, with negligible computational costs and either no or a small increase in the number of misassemblies.
Collapse
Affiliation(s)
| | | | - Paul Medvedev
- Department of Computer Science and Engineering, The Pennsylvania State University
- Huck Institutes of the Life Sciences, The Pennsylvania State University
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University
| | | |
Collapse
|
40
|
Seddighi S, Qi YA, Brown AL, Wilkins OG, Bereda C, Belair C, Zhang Y, Prudencio M, Keuss MJ, Khandeshi A, Pickles S, Hill SE, Hawrot J, Ramos DM, Yuan H, Roberts J, Kelmer Sacramento E, Shah SI, Nalls MA, Colon-Mercado J, Reyes JF, Ryan VH, Nelson MP, Cook C, Li Z, Screven L, Kwan JY, Shantaraman A, Ping L, Koike Y, Oskarsson B, Staff N, Duong DM, Ahmed A, Secrier M, Ule J, Jacobson S, Rohrer J, Malaspina A, Glass JD, Ori A, Seyfried NT, Maragkakis M, Petrucelli L, Fratta P, Ward ME. Mis-spliced transcripts generate de novo proteins in TDP-43-related ALS/FTD. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.23.525149. [PMID: 36747793 PMCID: PMC9900763 DOI: 10.1101/2023.01.23.525149] [Citation(s) in RCA: 22] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Functional loss of TDP-43, an RNA-binding protein genetically and pathologically linked to ALS and FTD, leads to inclusion of cryptic exons in hundreds of transcripts during disease. Cryptic exons can promote degradation of affected transcripts, deleteriously altering cellular function through loss-of-function mechanisms. However, the possibility of de novo protein synthesis from cryptic exon transcripts has not been explored. Here, we show that mRNA transcripts harboring cryptic exons generate de novo proteins both in TDP-43 deficient cellular models and in disease. Using coordinated transcriptomic and proteomic studies of TDP-43 depleted iPSC-derived neurons, we identified numerous peptides that mapped to cryptic exons. Cryptic exons identified in iPSC models were highly predictive of cryptic exons expressed in brains of patients with TDP-43 proteinopathy, including cryptic transcripts that generated de novo proteins. We discovered that inclusion of cryptic peptide sequences in proteins altered their interactions with other proteins, thereby likely altering their function. Finally, we showed that these de novo peptides were present in CSF from patients with ALS. The demonstration of cryptic exon translation suggests new mechanisms for ALS pathophysiology downstream of TDP-43 dysfunction and may provide a strategy for novel biomarker development.
Collapse
Affiliation(s)
- Sahba Seddighi
- National Institute of Neurological Disorders and Stroke, NIH, Bethesda, MD, USA
- Medical Scientist Training Program, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Yue A Qi
- Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA
| | - Anna-Leigh Brown
- UCL Queen Square Motor Neuron Disease Centre, Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, UCL, London, UK
| | - Oscar G Wilkins
- UCL Queen Square Motor Neuron Disease Centre, Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, UCL, London, UK
- The Francis Crick Institute, London, UK
| | - Colleen Bereda
- Medical Scientist Training Program, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA
| | - Cedric Belair
- Laboratory of Genetics and Genomics, National Institute on Aging, Intramural Research Program, National Institutes of Health, Baltimore, MD, USA
| | - Yongjie Zhang
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Mercedes Prudencio
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Matthew J Keuss
- UCL Queen Square Motor Neuron Disease Centre, Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, UCL, London, UK
| | - Aditya Khandeshi
- Laboratory of Genetics and Genomics, National Institute on Aging, Intramural Research Program, National Institutes of Health, Baltimore, MD, USA
| | - Sarah Pickles
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Sarah E Hill
- National Institute of Neurological Disorders and Stroke, NIH, Bethesda, MD, USA
| | - James Hawrot
- National Institute of Neurological Disorders and Stroke, NIH, Bethesda, MD, USA
| | - Daniel M Ramos
- Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA
| | - Hebao Yuan
- National Institute of Neurological Disorders and Stroke, NIH, Bethesda, MD, USA
| | - Jessica Roberts
- Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA
| | | | - Syed I Shah
- Data Tecnica International, Washington, DC, USA
| | - Mike A Nalls
- Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA
- Data Tecnica International, Washington, DC, USA
| | - Jenn Colon-Mercado
- National Institute of Neurological Disorders and Stroke, NIH, Bethesda, MD, USA
| | - Joel F Reyes
- National Institute of Neurological Disorders and Stroke, NIH, Bethesda, MD, USA
| | - Veronica H Ryan
- National Institute of Neurological Disorders and Stroke, NIH, Bethesda, MD, USA
| | - Matthew P Nelson
- Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA
| | - Casey Cook
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Ziyi Li
- Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA
- Data Tecnica International, Washington, DC, USA
| | - Laurel Screven
- Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA
| | - Justin Y Kwan
- National Institute of Neurological Disorders and Stroke, NIH, Bethesda, MD, USA
| | | | - Lingyan Ping
- Department of Biochemistry, Emory University School of Medicine, Atlanta, GA, USA
| | - Yuka Koike
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Björn Oskarsson
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Nathan Staff
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Duc M Duong
- Department of Biochemistry, Emory University School of Medicine, Atlanta, GA, USA
| | - Aisha Ahmed
- UCL Queen Square Motor Neuron Disease Centre, Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, UCL, London, UK
| | - Maria Secrier
- Department of Genetics, Evolution and Environment, UCL Genetics Institute, UCL, London, UK
| | - Jerneg Ule
- UCL Queen Square Motor Neuron Disease Centre, Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, UCL, London, UK
| | - Steven Jacobson
- National Institute of Neurological Disorders and Stroke, NIH, Bethesda, MD, USA
| | - Jonathan Rohrer
- UCL Queen Square Motor Neuron Disease Centre, Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, UCL, London, UK
| | - Andrea Malaspina
- UCL Queen Square Motor Neuron Disease Centre, Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, UCL, London, UK
| | - Jonathan D Glass
- Department of Neurology, Center for Neurodegenerative Diseases, Emory University, Atlanta, GA, USA
| | - Alessandro Ori
- Leibniz Institute on Aging - Fritz Lipmann Institute (FLI), Jena, Germany
| | - Nicholas T Seyfried
- Department of Biochemistry, Emory University School of Medicine, Atlanta, GA, USA
- Department of Neurology, Emory University School of Medicine, Atlanta, GA, USA
| | - Manolis Maragkakis
- Laboratory of Genetics and Genomics, National Institute on Aging, Intramural Research Program, National Institutes of Health, Baltimore, MD, USA
| | - Leonard Petrucelli
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Pietro Fratta
- UCL Queen Square Motor Neuron Disease Centre, Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, UCL, London, UK
| | - Michael E Ward
- National Institute of Neurological Disorders and Stroke, NIH, Bethesda, MD, USA
- Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
41
|
Kwok N, Aretz Z, Takao S, Ser Z, Cifani P, Kentsis A. Integrative proteogenomics using ProteomeGenerator2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.04.522774. [PMID: 36711693 PMCID: PMC9882001 DOI: 10.1101/2023.01.04.522774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Recent advances in nucleic acid sequencing now permit rapid and genome-scale analysis of genetic variation and transcription, enabling population-scale studies of human biology, disease, and diverse organisms. Likewise, advances in mass spectrometry proteomics now permit highly sensitive and accurate studies of protein expression at the whole proteome-scale. However, most proteomic studies rely on consensus databases to match spectra to peptide and proteins sequences, and thus remain limited to the analysis of canonical protein sequences. Here, we develop ProteomeGenerator2 (PG2), based on the scalable and modular ProteomeGenerator framework. PG2 integrates genome and transcriptome sequencing to incorporate protein variants containing amino acid substitutions, insertions, and deletions, as well as non-canonical reading frames, exons, and other variants caused by genomic and transcriptomic variation. We benchmarked PG2 using synthetic data and genomic, transcriptomic, and proteomic analysis of human leukemia cells. PG2 can be integrated with current and emerging sequencing technologies, assemblers, variant callers, and mass spectral analysis algorithms, and is available open-source from https://github.com/kentsisresearchgroup/ProteomeGenerator2 .
Collapse
|
42
|
Calboli FCF, Iso-Touru T, Bitz O, Fischer D, Nousiainen A, Koskinen H, Tapio M, Tapio I, Kause A. Genomic selection for survival under naturally occurring Saprolegnia oomycete infection in farmed European whitefish Coregonus lavaretus. J Anim Sci 2023; 101:skad333. [PMID: 37777972 PMCID: PMC10583997 DOI: 10.1093/jas/skad333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 09/29/2023] [Indexed: 10/03/2023] Open
Abstract
Saprolegnia oomycete infection causes serious economic losses and reduces fish health in aquaculture. Genomic selection based on thousands of DNA markers is a powerful tool to improve fish traits in selective breeding programs. Our goal was to develop a single nucleotide polymorphism (SNP) marker panel and to test its use in genomic selection for improved survival against Saprolegnia infection in European whitefish Coregonus lavaretus, the second most important farmed fish species in Finland. We used a double digest restriction site associated DNA (ddRAD) genotyping by sequencing method to produce a SNP panel, and we tested it analyzing data from a cohort of 1,335 fish, which were measured at different times for mortality to Saprolegnia oomycete infection and weight traits. We calculated the genetic relationship matrix (GRM) from the genome-wide genetic data, integrating it in multivariate mixed models used for the estimation of variance components and genomic breeding values (GEBVs), and to carry out Genome-Wide Association Studies for the presence of quantitative trait loci (QTL) affecting the phenotypes in analysis. We identified one major QTL on chromosome 6 affecting mortality to Saprolegnia infection, explaining 7.7% to 51.3% of genetic variance, and a QTL for weight on chromosome 4, explaining 1.8% to 5.4% of genetic variance. Heritability for mortality was 0.20 to 0.43 on the liability scale, and heritability for weight was 0.44 to 0.53. The QTL for mortality showed an additive allelic effect. We tested whether integrating the QTL for mortality as a fixed factor, together with a new GRM calculated excluding the QTL from the genetic data, would improve the accuracy estimation of GEBVs. This test was done through a cross-validation approach, which indicated that the inclusion of the QTL increased the mean accuracy of the GEBVs by 0.28 points, from 0.33 to 0.61, relative to the use of full GRM only. The area under the curve of the receiver-operator curve for mortality increased from 0.58 to 0.67 when the QTL was included in the model. The inclusion of the QTL as a fixed effect in the model increased the correlation between the GEBVs of early mortality with the late mortality, compared to a model that did not include the QTL. These results validate the usability of the produced SNP panel for genomic selection in European whitefish and highlight the opportunity for modeling QTLs in genomic evaluation of mortality due to Saprolegnia infection.
Collapse
Affiliation(s)
| | - Terhi Iso-Touru
- Natural Resources Institute Finland (LUKE), FI-31600 Jokioinen, Finland
| | - Oliver Bitz
- Natural Resources Institute Finland (LUKE), FI-31600 Jokioinen, Finland
| | - Daniel Fischer
- Natural Resources Institute Finland (LUKE), FI-31600 Jokioinen, Finland
| | - Antti Nousiainen
- Natural Resources Institute Finland (LUKE), FI-70210 Kuopio, Finland
| | - Heikki Koskinen
- Natural Resources Institute Finland (LUKE), FI-70210 Kuopio, Finland
| | - Miika Tapio
- Natural Resources Institute Finland (LUKE), FI-31600 Jokioinen, Finland
| | - Ilma Tapio
- Natural Resources Institute Finland (LUKE), FI-31600 Jokioinen, Finland
| | - Antti Kause
- Natural Resources Institute Finland (LUKE), FI-31600 Jokioinen, Finland
| |
Collapse
|
43
|
Umu SU, Rapp Vander-Elst K, Karlsen VT, Chouliara M, Bækkevold ES, Jahnsen FL, Domanska D. Cellsnake: a user-friendly tool for single-cell RNA sequencing analysis. Gigascience 2022; 12:giad091. [PMID: 37889009 PMCID: PMC10603768 DOI: 10.1093/gigascience/giad091] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 08/25/2023] [Accepted: 10/05/2023] [Indexed: 10/28/2023] Open
Abstract
BACKGROUND Single-cell RNA sequencing (scRNA-seq) provides high-resolution transcriptome data to understand the heterogeneity of cell populations at the single-cell level. The analysis of scRNA-seq data requires the utilization of numerous computational tools. However, nonexpert users usually experience installation issues, a lack of critical functionality or batch analysis modes, and the steep learning curves of existing pipelines. RESULTS We have developed cellsnake, a comprehensive, reproducible, and accessible single-cell data analysis workflow, to overcome these problems. Cellsnake offers advanced features for standard users and facilitates downstream analyses in both R and Python environments. It is also designed for easy integration into existing workflows, allowing for rapid analyses of multiple samples. CONCLUSION As an open-source tool, cellsnake is accessible through Bioconda, PyPi, Docker, and GitHub, making it a cost-effective and user-friendly option for researchers. By using cellsnake, researchers can streamline the analysis of scRNA-seq data and gain insights into the complex biology of single cells.
Collapse
Affiliation(s)
- Sinan U Umu
- Department of Pathology, Institute of Clinical Medicine, University of Oslo, Oslo 0372, Norway
| | | | - Victoria T Karlsen
- Department of Pathology, Oslo University Hospital-Rikshospitalet, Oslo 0372, Norway
| | - Manto Chouliara
- Department of Pathology, Oslo University Hospital-Rikshospitalet, Oslo 0372, Norway
| | - Espen Sønderaal Bækkevold
- Department of Pathology, Oslo University Hospital-Rikshospitalet, Oslo 0372, Norway
- Institute of Oral Biology, University of Oslo, Oslo 0372, Norway
| | - Frode Lars Jahnsen
- Department of Pathology, Institute of Clinical Medicine, University of Oslo, Oslo 0372, Norway
- Department of Pathology, Oslo University Hospital-Rikshospitalet, Oslo 0372, Norway
| | - Diana Domanska
- Department of Pathology, Oslo University Hospital-Rikshospitalet, Oslo 0372, Norway
- Department of Microbiology, University of Oslo, Rikshospitalet, Oslo 0372, Norway
| |
Collapse
|
44
|
Angst P, Ameline C, Haag CR, Ben-Ami F, Ebert D, Fields PD. Genetic Drift Shapes the Evolution of a Highly Dynamic Metapopulation. Mol Biol Evol 2022; 39:6874788. [PMID: 36472514 PMCID: PMC9778854 DOI: 10.1093/molbev/msac264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 11/28/2022] [Accepted: 12/01/2022] [Indexed: 12/12/2022] Open
Abstract
The dynamics of extinction and (re)colonization in habitat patches are characterizing features of dynamic metapopulations, causing them to evolve differently than large, stable populations. The propagule model, which assumes genetic bottlenecks during colonization, posits that newly founded subpopulations have low genetic diversity and are genetically highly differentiated from each other. Immigration may then increase diversity and decrease differentiation between subpopulations. Thus, older and/or less isolated subpopulations are expected to have higher genetic diversity and less genetic differentiation. We tested this theory using whole-genome pool-sequencing to characterize nucleotide diversity and differentiation in 60 subpopulations of a natural metapopulation of the cyclical parthenogen Daphnia magna. For comparison, we characterized diversity in a single, large, and stable D. magna population. We found reduced (synonymous) genomic diversity, a proxy for effective population size, weak purifying selection, and low rates of adaptive evolution in the metapopulation compared with the large, stable population. These differences suggest that genetic bottlenecks during colonization reduce effective population sizes, which leads to strong genetic drift and reduced selection efficacy in the metapopulation. Consistent with the propagule model, we found lower diversity and increased differentiation in younger and also in more isolated subpopulations. Our study sheds light on the genomic consequences of extinction-(re)colonization dynamics to an unprecedented degree, giving strong support for the propagule model. We demonstrate that the metapopulation evolves differently from a large, stable population and that evolution is largely driven by genetic drift.
Collapse
Affiliation(s)
| | - Camille Ameline
- Department of Environmental Sciences, Zoology, University of Basel, Basel 4051, Switzerland,Evolutionary Biology, Instituto Gulbenkian de Ciência, Oeiras 2780-156, Portugal
| | - Christoph R Haag
- CEFE, Université de Montpellier, CNRS, EPHE, IRD, Montpellier 34293, France,Tvärminne Zoological Station, University of Helsinki, Hanko 10900, Finland
| | - Frida Ben-Ami
- Tvärminne Zoological Station, University of Helsinki, Hanko 10900, Finland,George S. Wise Faculty of Life Sciences, School of Zoology, Tel Aviv University, Tel Aviv 69978, Israel
| | - Dieter Ebert
- Department of Environmental Sciences, Zoology, University of Basel, Basel 4051, Switzerland,Tvärminne Zoological Station, University of Helsinki, Hanko 10900, Finland
| | - Peter D Fields
- Department of Environmental Sciences, Zoology, University of Basel, Basel 4051, Switzerland,Tvärminne Zoological Station, University of Helsinki, Hanko 10900, Finland
| |
Collapse
|
45
|
Pavlin A, Lovše A, Bajc G, Otoničar J, Kujović A, Lengar Ž, Gutierrez-Aguirre I, Kostanjšek R, Konc J, Fornelos N, Butala M. A small bacteriophage protein determines the hierarchy over co-residential jumbo phage in Bacillus thuringiensis serovar israelensis. Commun Biol 2022; 5:1286. [PMID: 36434275 PMCID: PMC9700832 DOI: 10.1038/s42003-022-04238-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Accepted: 11/08/2022] [Indexed: 11/26/2022] Open
Abstract
Bacillus thuringiensis serovar israelensis is the most widely used biopesticide against insects, including vectors of animal and human diseases. Among several extrachromosomal elements, this endospore-forming entomopathogen harbors two bacteriophages: a linear DNA replicon named GIL01 that does not integrate into the chromosome during lysogeny and a circular-jumbo prophage known as pBtic235. Here, we show that GIL01 hinders the induction of cohabiting prophage pBtic235. The GIL01-encoded small protein, gp7, which interacts with the host LexA repressor, is a global transcription regulator and represses the induction of pBtic235 after DNA damage to presumably allow GIL01 to multiply first. In a complex with host LexA in stressed cells, gp7 down-regulates the expression of more than 250 host and pBtic235 genes, many of which are involved in the cellular functions of genome maintenance, cell-wall transport, and membrane and protein stability. We show that gp7 homologs that are found exclusively in bacteriophages act in a similar fashion to enhance LexA's binding to DNA, while likely also affecting host gene expression. Our results provide evidence that GIL01 influences both its host and its co-resident bacteriophage.
Collapse
Affiliation(s)
- Anja Pavlin
- grid.8954.00000 0001 0721 6013Department of Biology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Anže Lovše
- grid.8954.00000 0001 0721 6013Department of Biology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia ,Genialis, Inc., Boston, MA USA
| | - Gregor Bajc
- grid.8954.00000 0001 0721 6013Department of Biology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Jan Otoničar
- grid.8954.00000 0001 0721 6013Department of Biology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Amela Kujović
- grid.8954.00000 0001 0721 6013Department of Biology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Živa Lengar
- grid.419523.80000 0004 0637 0790Department of Biotechnology and Systems Biology, National Institute of Biology, Ljubljana, Slovenia
| | - Ion Gutierrez-Aguirre
- grid.419523.80000 0004 0637 0790Department of Biotechnology and Systems Biology, National Institute of Biology, Ljubljana, Slovenia
| | - Rok Kostanjšek
- grid.8954.00000 0001 0721 6013Department of Biology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Janez Konc
- grid.454324.00000 0001 0661 0844Theory Department, National Institute of Chemistry, Ljubljana, Slovenia
| | - Nadine Fornelos
- grid.66859.340000 0004 0546 1623Broad Institute of MIT and Harvard, Cambridge, MA USA
| | - Matej Butala
- grid.8954.00000 0001 0721 6013Department of Biology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| |
Collapse
|
46
|
Chen W, Achakkagari SR, Strömvik M. Plastaumatic: Automating plastome assembly and annotation. FRONTIERS IN PLANT SCIENCE 2022; 13:1011948. [PMID: 36407635 PMCID: PMC9669643 DOI: 10.3389/fpls.2022.1011948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 10/14/2022] [Indexed: 06/16/2023]
Abstract
Plastome sequence data is most often extracted from plant whole genome sequencing data and need to be assembled and annotated separately from the nuclear genome sequence. In projects comprising multiple genomes, it is labour intense to individually process the plastomes as it requires many steps and software. This study developed Plastaumatic - an automated pipeline for both assembly and annotation of plastomes, with the scope of the researcher being able to load whole genome sequence data with minimal manual input, and therefore a faster runtime. The main structure of the current automated pipeline includes trimming of adaptor and low-quality sequences using fastp, de novo plastome assembly using NOVOPlasty, standardization and quality checking of the assembled genomes through a custom script utilizing BLAST+ and SAMtools, annotation of the assembled genomes using AnnoPlast, and finally generating the required files for NCBI GenBank submissions. The pipeline is demonstrated with 12 potato accessions and three soybean accessions.
Collapse
|
47
|
Dadonaite B, Crawford KHD, Radford CE, Farrell AG, Yu TC, Hannon WW, Zhou P, Andrabi R, Burton DR, Liu L, Ho DD, Neher RA, Bloom JD. A pseudovirus system enables deep mutational scanning of the full SARS-CoV-2 spike. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2022:2022.10.13.512056. [PMID: 36263061 PMCID: PMC9580381 DOI: 10.1101/2022.10.13.512056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
A major challenge in understanding SARS-CoV-2 evolution is interpreting the antigenic and functional effects of emerging mutations in the viral spike protein. Here we describe a new deep mutational scanning platform based on non-replicative pseudotyped lentiviruses that directly quantifies how large numbers of spike mutations impact antibody neutralization and pseudovirus infection. We demonstrate this new platform by making libraries of the Omicron BA.1 and Delta spikes. These libraries each contain ~7000 distinct amino-acid mutations in the context of up to ~135,000 unique mutation combinations. We use these libraries to map escape mutations from neutralizing antibodies targeting the receptor binding domain, N-terminal domain, and S2 subunit of spike. Overall, this work establishes a high-throughput and safe approach to measure how ~10 5 combinations of mutations affect antibody neutralization and spike-mediated infection. Notably, the platform described here can be extended to the entry proteins of many other viruses.
Collapse
Affiliation(s)
- Bernadeta Dadonaite
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Center, Seattle, Washington, 98109, USA
| | - Katharine H D Crawford
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Center, Seattle, Washington, 98109, USA
- Department of Genome Sciences & Medical Scientist Training Program, University of Washington, Seattle, Washington, 98109, USA
| | - Caelan E Radford
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Center, Seattle, Washington, 98109, USA
- Molecular and Cellular Biology Graduate Program, University of Washington, and Basic Sciences Division, Fred Hutch Cancer Center, Seattle, Washington, 98109, USA
| | - Ariana G Farrell
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Center, Seattle, Washington, 98109, USA
| | - Timothy C Yu
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Center, Seattle, Washington, 98109, USA
- Molecular and Cellular Biology Graduate Program, University of Washington, and Basic Sciences Division, Fred Hutch Cancer Center, Seattle, Washington, 98109, USA
| | - William W Hannon
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Center, Seattle, Washington, 98109, USA
- Molecular and Cellular Biology Graduate Program, University of Washington, and Basic Sciences Division, Fred Hutch Cancer Center, Seattle, Washington, 98109, USA
| | - Panpan Zhou
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
- IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA
- Consortium for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Raiees Andrabi
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
- IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA
- Consortium for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Dennis R Burton
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
- IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA
- Consortium for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA
- Ragon Institute of MGH, MIT & Harvard, Cambridge, MA 02139, USA
| | - Lihong Liu
- Aaron Diamond AIDS Research Center, Columbia University Vagelos College of Physicians and Surgeons, New York, NY, USA
| | - David D. Ho
- Aaron Diamond AIDS Research Center, Columbia University Vagelos College of Physicians and Surgeons, New York, NY, USA
- Department of Microbiology and Immunology, Columbia University Vagelos College of Physicians and Surgeons, New York, NY 10032, USA
- Division of Infectious Diseases, Department of Medicine, Columbia University Vagelos College of Physicians and Surgeons, New York, NY 10032, USA
| | - Richard A. Neher
- Biozentrum, University of Basel, Basel, Switzerland, Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Jesse D Bloom
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Center, Seattle, Washington, 98109, USA
- Howard Hughes Medical Institute, Seattle, WA, 98195, USA
| |
Collapse
|
48
|
Emergence of Tn 1999.7, a New Transposon in blaOXA-48-Harboring Plasmids Associated with Increased Plasmid Stability. Antimicrob Agents Chemother 2022; 66:e0078722. [PMID: 36200773 DOI: 10.1128/aac.00787-22] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
OXA-48 is the most common carbapenemase in Enterobacterales in Germany and many other European countries. Depending on the genomic location of blaOXA-48, OXA-48-producing isolates vary in phenotype and intra- and interspecies transferability of blaOXA-48. In most bacterial isolates, blaOXA-48 is located on one of seven variants of Tn1999 (Tn1999.1 to Tn1999.6 and invTn1999.2). Here, a novel Tn1999 variant, Tn1999.7, is described, which was identified in 11 clinical isolates from 2016 to 2020. Tn1999.7 differs from Tn1999.1 by the insertion of the 8,349-bp Tn3 family transposon Tn7442 between the lysR gene and blaOXA-48 open reading frame. Tn7442 carries genes coding for a restriction endonuclease and a DNA methyltransferase as cargo, forming a type III restriction modification system. Tn1999.7 was carried on an ~71-kb IncL plasmid in 9/11 isolates. In one isolate, Tn1999.7 was situated on an ~76-kb plasmid, harboring an additional insertion sequence in the plasmid backbone. In one isolate, the plasmid size is only ~63 kb due to a deletion adjacent to Tn7442 that extends into the plasmid backbone. Mean conjugation rates of the Tn1999.7-harboring plasmids in J53 ranged from 4.47 × 10-5 to 2.03 × 10-2, similar to conjugation rates of other pOXA-48-type IncL plasmids. The stability of plasmids with Tn1999.7 was significantly higher than that of a Tn1999.2-harboring plasmid in vitro. This increase in stability could be related to the insertion of a restriction-modification system, which can promote postsegregational killing. The increased plasmid stability associated with Tn1999.7 could contribute to the further spread of OXA-48.
Collapse
|
49
|
Schaefer M, Nabih A, Spies D, Hermes V, Bodak M, Wischnewski H, Stalder P, Ngondo RP, Liechti LA, Sajic T, Aebersold R, Gatfield D, Ciaudo C. Global and precise identification of functional
miRNA
targets in
mESCs
by integrative analysis. EMBO Rep 2022; 23:e54762. [PMID: 35899551 PMCID: PMC9442311 DOI: 10.15252/embr.202254762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 06/27/2022] [Accepted: 06/30/2022] [Indexed: 12/03/2022] Open
Abstract
MicroRNA (miRNA) loaded Argonaute (AGO) complexes regulate gene expression via direct base pairing with their mRNA targets. Previous works suggest that up to 60% of mammalian transcripts might be subject to miRNA‐mediated regulation, but it remains largely unknown which fraction of these interactions are functional in a specific cellular context. Here, we integrate transcriptome data from a set of miRNA‐depleted mouse embryonic stem cell (mESC) lines with published miRNA interaction predictions and AGO‐binding profiles. Using this integrative approach, combined with molecular validation data, we present evidence that < 10% of expressed genes are functionally and directly regulated by miRNAs in mESCs. In addition, analyses of the stem cell‐specific miR‐290‐295 cluster target genes identify TFAP4 as an important transcription factor for early development. The extensive datasets developed in this study will support the development of improved predictive models for miRNA‐mRNA functional interactions.
Collapse
Affiliation(s)
- Moritz Schaefer
- Swiss Federal Institute of Technology Zurich IMHS, Chair of RNAi and Genome Integrity Zurich Switzerland
- Life Science Zurich Graduate School University of Zürich Zurich Switzerland
| | - Amena Nabih
- Swiss Federal Institute of Technology Zurich IMHS, Chair of RNAi and Genome Integrity Zurich Switzerland
- Life Science Zurich Graduate School University of Zürich Zurich Switzerland
| | - Daniel Spies
- Swiss Federal Institute of Technology Zurich IMHS, Chair of RNAi and Genome Integrity Zurich Switzerland
- Life Science Zurich Graduate School University of Zürich Zurich Switzerland
| | - Victoria Hermes
- Swiss Federal Institute of Technology Zurich IMHS, Chair of RNAi and Genome Integrity Zurich Switzerland
| | - Maxime Bodak
- Swiss Federal Institute of Technology Zurich IMHS, Chair of RNAi and Genome Integrity Zurich Switzerland
- Life Science Zurich Graduate School University of Zürich Zurich Switzerland
| | - Harry Wischnewski
- Swiss Federal Institute of Technology Zurich IMHS, Chair of RNAi and Genome Integrity Zurich Switzerland
| | - Patrick Stalder
- Swiss Federal Institute of Technology Zurich IMHS, Chair of RNAi and Genome Integrity Zurich Switzerland
- Life Science Zurich Graduate School University of Zürich Zurich Switzerland
| | - Richard Patryk Ngondo
- Swiss Federal Institute of Technology Zurich IMHS, Chair of RNAi and Genome Integrity Zurich Switzerland
| | - Luz Angelica Liechti
- Center for Integrative Genomics (CIG) University of Lausanne Lausanne Switzerland
| | - Tatjana Sajic
- Swiss Federal Institute of Technology Zurich, IMSB Zürich Switzerland
| | - Ruedi Aebersold
- Swiss Federal Institute of Technology Zurich, IMSB Zürich Switzerland
| | - David Gatfield
- Center for Integrative Genomics (CIG) University of Lausanne Lausanne Switzerland
| | - Constance Ciaudo
- Swiss Federal Institute of Technology Zurich IMHS, Chair of RNAi and Genome Integrity Zurich Switzerland
| |
Collapse
|
50
|
Filazzola A, Lortie CJ. A call for clean code to effectively communicate science. Methods Ecol Evol 2022. [DOI: 10.1111/2041-210x.13961] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Affiliation(s)
- Alessandro Filazzola
- Apex Resource Management Solutions Ottawa ON Canada
- Centre for Urban Environments University of Toronto Mississauga Mississauga ON Canada
| | - CJ Lortie
- Department of Biology York University Toronto ON Canada
- The National Center for Ecological Analysis and Synthesis UCSB Santa Barbara CA USA
| |
Collapse
|