101
|
Hoffmann TJ, Graff RE, Madduri RK, Rodriguez AA, Cario CL, Feng K, Jiang Y, Wang A, Klein RJ, Pierce BL, Eggener S, Tong L, Blot W, Long J, Goss LB, Darst BF, Rebbeck T, Lachance J, Andrews C, Adebiyi AO, Adusei B, Aisuodionoe-Shadrach OI, Fernandez PW, Jalloh M, Janivara R, Chen WC, Mensah JE, Agalliu I, Berndt SI, Shelley JP, Schaffer K, Machiela MJ, Freedman ND, Huang WY, Li SA, Goodman PJ, Till C, Thompson I, Lilja H, Ranatunga DK, Presti J, Van Den Eeden SK, Chanock SJ, Mosley JD, Conti DV, Haiman CA, Justice AC, Kachuri L, Witte JS. Genome-wide association study of prostate-specific antigen levels in 392,522 men identifies new loci and improves cross-ancestry prediction. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2023.10.27.23297676. [PMID: 37961155 PMCID: PMC10635224 DOI: 10.1101/2023.10.27.23297676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
We conducted a multi-ancestry genome-wide association study of prostate-specific antigen (PSA) levels in 296,754 men (211,342 European ancestry; 58,236 African ancestry; 23,546 Hispanic/Latino; 3,630 Asian ancestry; 96.5% of participants were from the Million Veteran Program). We identified 318 independent genome-wide significant (p≤5e-8) variants, 184 of which were novel. Most demonstrated evidence of replication in an independent cohort (n=95,768). Meta-analyzing discovery and replication (n=392,522) identified 447 variants, of which a further 111 were novel. Out-of-sample variance in PSA explained by our genome-wide polygenic risk scores ranged from 11.6%-16.6% in European ancestry, 5.5%-9.5% in African ancestry, 13.5%-18.2% in Hispanic/Latino, and 8.6%-15.3% in Asian ancestry, and decreased with increasing age. Mid-life genetically-adjusted PSA levels were more strongly associated with overall and aggressive prostate cancer than unadjusted PSA. Our study highlights how including proportionally more participants from underrepresented populations improves genetic prediction of PSA levels, offering potential to personalize prostate cancer screening.
Collapse
|
102
|
Chen X, Yin X, Shi X, Yan W, Yang Y, Liu L, Zhang T. Melon: metagenomic long-read-based taxonomic identification and quantification using marker genes. Genome Biol 2024; 25:226. [PMID: 39160564 PMCID: PMC11331721 DOI: 10.1186/s13059-024-03363-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 07/30/2024] [Indexed: 08/21/2024] Open
Abstract
Long-read sequencing holds great potential for characterizing complex microbial communities, yet taxonomic profiling tools designed specifically for long reads remain lacking. We introduce Melon, a novel marker-based taxonomic profiler that capitalizes on the unique attributes of long reads. Melon employs a two-stage classification scheme to reduce computational time and is equipped with an expectation-maximization-based post-correction module to handle ambiguous reads. Melon achieves superior performance compared to existing tools in both mock and simulated samples. Using wastewater metagenomic samples, we demonstrate the applicability of Melon by showing it provides reliable estimates of overall genome copies, and species-level taxonomic profiles.
Collapse
Affiliation(s)
- Xi Chen
- Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| | - Xiaole Yin
- Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| | - Xianghui Shi
- Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| | - Weifu Yan
- Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| | - Yu Yang
- Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| | - Lei Liu
- Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| | - Tong Zhang
- Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China.
| |
Collapse
|
103
|
Giolai M, Verweij W, Martin S, Pearson N, Nicholson P, Leggett RM, Clark MD. Measuring air metagenomic diversity in an agricultural ecosystem. Curr Biol 2024; 34:3778-3791.e4. [PMID: 39096906 DOI: 10.1016/j.cub.2024.07.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 04/26/2024] [Accepted: 07/04/2024] [Indexed: 08/05/2024]
Abstract
All species shed DNA during life or in death, providing an opportunity to monitor biodiversity via environmental DNA (eDNA). In recent years, combining eDNA, high-throughput sequencing technologies, bioinformatics, and increasingly complete sequence databases has promised a non-invasive and non-destructive environmental monitoring tool. Modern agricultural systems are often large monocultures and so are highly vulnerable to disease outbreaks. Pest and pathogen monitoring in agricultural ecosystems is key for efficient and early disease prevention, lower pesticide use, and better food security. Although the air is rich in biodiversity, it has the lowest DNA concentration of all environmental media and yet is the route for windborne spread of many damaging crop pathogens. Our work suggests that ecosystems can be monitored efficiently using airborne nucleic acid information. Here, we show that the airborne DNA of microbes can be recovered, shotgun sequenced, and taxonomically classified, including down to the species level. We show that by monitoring a field growing key crops we can identify the presence of agriculturally significant pathogens and quantify their changing abundance over a period of 1.5 months, often correlating with weather variables. We add to the evidence that aerial eDNA can be used as a source for biomonitoring in terrestrial ecosystems, specifically highlighting agriculturally relevant species and how pathogen levels correlate with weather conditions. Our ability to detect dynamically changing levels of species and strains highlights the value of airborne eDNA in agriculture, monitoring biodiversity changes, and tracking taxa of interest.
Collapse
Affiliation(s)
- Michael Giolai
- Natural History Museum, London SW7 5BD, UK; Research Centre for Ecological Change, Organismal and Evolutionary Biology Research Program, Faculty of Biological and Environmental Sciences, University of Helsinki, Helsinki 00014, Finland
| | - Walter Verweij
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK; Enza Zaden, Enkhuizen 1602 DB, the Netherlands
| | - Samuel Martin
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Neil Pearson
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Paul Nicholson
- Crop Genetics Department, John Innes Centre, Norwich Research Park, Norwich NR4 7UH, UK
| | | | | |
Collapse
|
104
|
Krutko M, Poling HM, Bryan AE, Sharma M, Singh A, Reza HA, Wikenheiser-Brokamp KA, Takebe T, Helmrath MA, Harris GM, Esfandiari L. Enhanced Piezoelectric Performance of PVDF-TrFE Nanofibers through Annealing for Tissue Engineering Applications. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.16.608345. [PMID: 39229142 PMCID: PMC11370437 DOI: 10.1101/2024.08.16.608345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
This study investigates bioelectric stimulation's role in tissue regeneration by enhancing the piezoelectric properties of tissue-engineered grafts using annealed poly(vinylidene fluoride-trifluoroethylene) (PVDF-TrFE) scaffolds. Annealing at temperatures of 80°C, 100°C, 120°C, and 140°C was assessed for its impact on material properties and physiological utility. Analytical techniques such as Differential Scanning Calorimetry (DSC), Fourier-Transform Infrared Spectroscopy (FTIR), and X-ray Diffraction (XRD) revealed increased crystallinity with higher annealing temperatures, peaking in β-phase content and crystallinity at 140°C. Scanning Electron Microscopy (SEM) showed that 140°C annealed scaffolds had enhanced lamellar structures, increased porosity, and maximum piezoelectric response. Mechanical tests indicated that 140°C annealing improved elastic modulus, tensile strength, and substrate stiffness, aligning these properties with physiological soft tissues. In vitro assessments in Schwann cells demonstrated favorable responses, with increased cell proliferation, contraction, and extracellular matrix attachment. Additionally, genes linked to extracellular matrix production, vascularization, and calcium signaling were upregulated. The foreign body response in C57BL/6 mice, evaluated through Hematoxylin and Eosin (H&E) and Picrosirius Red staining, showed no differences between scaffold groups, supporting the potential for future functional evaluation of the annealed group in tissue repair.
Collapse
|
105
|
Moreira D, Blaz J, Kim E, Eme L. A gene-rich mitochondrion with a unique ancestral protein transport system. Curr Biol 2024; 34:3812-3819.e3. [PMID: 39084221 DOI: 10.1016/j.cub.2024.07.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2023] [Revised: 05/03/2024] [Accepted: 07/02/2024] [Indexed: 08/02/2024]
Abstract
Mitochondria originated from an ancient endosymbiosis involving an alphaproteobacterium.1,2,3 Over time, these organelles reduced their gene content massively, with most genes being transferred to the host nucleus before the last eukaryotic common ancestor (LECA).4 This process has yielded varying gene compositions in modern mitogenomes, including the complete loss of this organellar genome in some extreme cases.5,6,7,8,9,10,11,12,13,14 At the other end of the spectrum, jakobids harbor the most gene-rich mitogenomes, encoding 60-66 proteins.8 Here, we introduce the mitogenome of Mantamonas sphyraenae, a protist from the deep-branching CRuMs supergroup.15,16 Remarkably, it boasts the most gene-rich mitogenome outside of jakobids, by housing 91 genes, including 62 protein-coding ones. These include rare homologs of the four subunits of the bacterial-type cytochrome c maturation system I (CcmA, CcmB, CcmC, and CcmF) alongside a unique ribosomal protein S6. During the early evolution of mitochondria, gene transfer from the proto-mitochondrial endosymbiont to the nucleus became possible thanks to systems facilitating the transport of proteins synthesized in the host cytoplasm back to the mitochondrion. In addition to the universally found eukaryotic protein import systems, jakobid mitogenomes were reported to uniquely encode the SecY transmembrane protein of the Sec general secretory pathway, whose evolutionary origin was however unclear. The Mantamonas mitogenome not only encodes SecY but also SecA, SecE, and SecG, making it the sole eukaryote known to house a complete mitochondrial Sec translocation system. Furthermore, our phylogenetic and comparative genomic analyses provide compelling evidence for the alphaproteobacterial origin of this system, establishing its presence in LECA.
Collapse
Affiliation(s)
- David Moreira
- Unité d'Ecologie Systématique et Evolution, CNRS, Université Paris-Saclay, AgroParisTech, 91190 Gif-sur-Yvette, France.
| | - Jazmin Blaz
- Unité d'Ecologie Systématique et Evolution, CNRS, Université Paris-Saclay, AgroParisTech, 91190 Gif-sur-Yvette, France
| | - Eunsoo Kim
- Division of EcoScience, Ewha Womans University, Seoul, South Korea; Division of Invertebrate Zoology, American Museum of Natural History, New York, NY, USA
| | - Laura Eme
- Unité d'Ecologie Systématique et Evolution, CNRS, Université Paris-Saclay, AgroParisTech, 91190 Gif-sur-Yvette, France.
| |
Collapse
|
106
|
Susat J, Haller-Caskie M, Bonczarowska JH, da Silva NA, Schierhold K, Rind MM, Schmölcke U, Kirleis W, Sondermann H, Rinne C, Müller J, Nebel A, Krause-Kyora B. Neolithic Yersinia pestis infections in humans and a dog. Commun Biol 2024; 7:1013. [PMID: 39155318 PMCID: PMC11330967 DOI: 10.1038/s42003-024-06676-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Accepted: 08/02/2024] [Indexed: 08/20/2024] Open
Abstract
Yersinia pestis has been infecting humans since the Late Neolithic (LN). Whether those early infections were isolated zoonoses or initiators of a pandemic remains unclear. We report Y. pestis infections in two individuals (of 133) from the LN necropolis at Warburg (Germany, 5300-4900 cal BP). Our analyses show that the two genomes belong to distinct strains and reflect independent infection events. All LN genomes known today (n = 4) are basal in the phylogeny and represent separate lineages that probably originated in different animal hosts. In the LN, an opening of the landscape resulted in the introduction of new rodent species, which may have acted as Y. pestis reservoirs. Coincidentally, the number of dogs increased, possibly leading to Y. pestis infections in canines. Indeed, we detect Y. pestis in an LN dog. Collectively, our data suggest that Y. pestis frequently entered human settlements at the time without causing significant outbreaks.
Collapse
Affiliation(s)
- Julian Susat
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany
| | | | - Joanna H Bonczarowska
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA
| | | | | | | | - Ulrich Schmölcke
- Centre for Baltic and Scandinavian Archaeology (ZBSA), Schloss Gottorf, Schleswig, Germany
| | - Wiebke Kirleis
- Institute of Pre- and Protohistoric Archaeology, Kiel University, Kiel, Germany
| | - Holger Sondermann
- Centre for Structural Systems Biology (CSSB), Deutsches Elektronen-Synchrotron DESY, Hamburg, Germany
- Kiel University, Kiel, Germany
| | - Christoph Rinne
- Institute of Pre- and Protohistoric Archaeology, Kiel University, Kiel, Germany
| | - Johannes Müller
- Institute of Pre- and Protohistoric Archaeology, Kiel University, Kiel, Germany
| | - Almut Nebel
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany
| | - Ben Krause-Kyora
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany.
| |
Collapse
|
107
|
Ji HJ, Pertea M. Enhancing transcriptome expression quantification through accurate assignment of long RNA sequencing reads with TranSigner. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.13.589356. [PMID: 39185147 PMCID: PMC11343119 DOI: 10.1101/2024.04.13.589356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]
Abstract
Recently developed long-read RNA sequencing technologies promise to provide a more accurate and comprehensive view of transcriptomes compared to short-read sequencers, primarily due to their capability to achieve full-length sequencing of transcripts. However, realizing this potential requires computational tools tailored to process long reads, which exhibit a higher error rate than short reads. Existing methods for assembling and quantifying long-read data often disagree on expressed transcripts and their abundance levels, leading researchers to lack confidence in the transcriptomes produced using this data. One approach to address the uncertainties in transcriptome assembly and quantification is by assigning the long reads to transcripts, enabling a more detailed characterization of transcript support at the read level. Here, we introduce TranSigner, a versatile tool that assigns long reads to any input transcriptome. TranSigner consists of three consecutive modules performing: read alignment to the given transcripts, computation of read-to-transcript compatibility based on alignment scores and positions, and execution of an expectation-maximization algorithm to probabilistically assign reads to transcripts and estimate transcript abundances. Using simulated data and experimental datasets from three well-studied organisms - Homo sapiens, Arabidopsis thaliana, and Mus musculus - we demonstrate that TranSigner achieves accurate read assignments, obtaining higher accuracy in transcript abundance estimation compared to existing tools.
Collapse
Affiliation(s)
- Hyun Joo Ji
- Center for Computational Biology, Johns Hopkins University; Baltimore, MD
- Department of Computer Science, Johns Hopkins University; Baltimore, MD
| | - Mihaela Pertea
- Center for Computational Biology, Johns Hopkins University; Baltimore, MD
- Department of Computer Science, Johns Hopkins University; Baltimore, MD
- Department of Biomedical Engineering, Johns Hopkins University; Baltimore, MD
| |
Collapse
|
108
|
Roučová K, Vopálenský V, Mašek T, Del Llano E, Provazník J, Landry JJM, Azevedo N, Ehler E, Beneš V, Pospíšek M. Loss of ADAR1 protein induces changes in small RNA landscape in hepatocytes. RNA (NEW YORK, N.Y.) 2024; 30:1164-1183. [PMID: 38844344 PMCID: PMC11331409 DOI: 10.1261/rna.080097.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Accepted: 05/17/2024] [Indexed: 08/18/2024]
Abstract
In recent years, numerous evidence has been accumulated about the extent of A-to-I editing in human RNAs and the key role ADAR1 plays in the cellular editing machinery. It has been shown that A-to-I editing occurrence and frequency are tissue-specific and essential for some tissue development, such as the liver. To study the effect of ADAR1 function in hepatocytes, we have created Huh7.5 ADAR1 KO cell lines. Upon IFN treatment, the Huh7.5 ADAR1 KO cells show rapid arrest of growth and translation, from which they do not recover. We analyzed translatome changes by using a method based on sequencing of separate polysome profile RNA fractions. We found significant changes in the transcriptome and translatome of the Huh7.5 ADAR1 KO cells. The most prominent changes include negatively affected transcription by RNA polymerase III and the deregulation of snoRNA and Y RNA levels. Furthermore, we observed that ADAR1 KO polysomes are enriched in mRNAs coding for proteins pivotal in a wide range of biological processes such as RNA localization and RNA processing, whereas the unbound fraction is enriched mainly in mRNAs coding for ribosomal proteins and translational factors. This indicates that ADAR1 plays a more relevant role in small RNA metabolism and ribosome biogenesis.
Collapse
Affiliation(s)
- Kristina Roučová
- Laboratory of RNA Biochemistry, Department of Genetics and Microbiology, Faculty of Science, Charles University, 128 00 Prague, Czech Republic
| | - Václav Vopálenský
- Laboratory of RNA Biochemistry, Department of Genetics and Microbiology, Faculty of Science, Charles University, 128 00 Prague, Czech Republic
| | - Tomáš Mašek
- Laboratory of RNA Biochemistry, Department of Genetics and Microbiology, Faculty of Science, Charles University, 128 00 Prague, Czech Republic
| | - Edgar Del Llano
- Laboratory of RNA Biochemistry, Department of Genetics and Microbiology, Faculty of Science, Charles University, 128 00 Prague, Czech Republic
- Laboratory of Biochemistry and Molecular Biology of Germ Cells, Institute of Animal Physiology and Genetics, CAS, 277 21 Liběchov, Czech Republic
| | | | | | | | - Edvard Ehler
- Department of Biology and Environmental Studies, Faculty of Education, Charles University, 116 39 Prague, Czech Republic
| | | | - Martin Pospíšek
- Laboratory of RNA Biochemistry, Department of Genetics and Microbiology, Faculty of Science, Charles University, 128 00 Prague, Czech Republic
| |
Collapse
|
109
|
Liu L, Manley JL. Modulation of diverse biological processes by CPSF, the master regulator of mRNA 3' ends. RNA (NEW YORK, N.Y.) 2024; 30:1122-1140. [PMID: 38986572 PMCID: PMC11331416 DOI: 10.1261/rna.080108.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2024] [Accepted: 06/27/2024] [Indexed: 07/12/2024]
Abstract
The cleavage and polyadenylation specificity factor (CPSF) complex plays a central role in the formation of mRNA 3' ends, being responsible for the recognition of the poly(A) signal sequence, the endonucleolytic cleavage step, and recruitment of poly(A) polymerase. CPSF has been extensively studied for over three decades, and its functions and those of its individual subunits are becoming increasingly well-defined, with much current research focusing on the impact of these proteins on the normal functioning or disease/stress states of cells. In this review, we provide an overview of the general functions of CPSF and its subunits, followed by a discussion of how they exert their functions in a surprisingly diverse variety of biological processes and cellular conditions. These include transcription termination, small RNA processing, and R-loop prevention/resolution, as well as more generally cancer, differentiation/development, and infection/immunity.
Collapse
Affiliation(s)
- Lizhi Liu
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| | - James L Manley
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| |
Collapse
|
110
|
Kondratov KA, Artamonov AA, Nikitin YV, Velmiskina AA, Mikhailovskii VY, Mosenko SV, Polkovnikova IA, Asinovskaya AY, Apalko SV, Sushentseva NN, Ivanov AM, Scherbak SG. Revealing differential expression patterns of piRNA in FACS blood cells of SARS-CoV-2 infected patients. BMC Med Genomics 2024; 17:212. [PMID: 39143590 PMCID: PMC11325581 DOI: 10.1186/s12920-024-01982-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2024] [Accepted: 08/05/2024] [Indexed: 08/16/2024] Open
Abstract
Non-coding RNA expression has shown to have cell type-specificity. The regulatory characteristics of these molecules are impacted by changes in their expression levels. We performed next-generation sequencing and examined small RNA-seq data obtained from 6 different types of blood cells separated by fluorescence-activated cell sorting of severe COVID-19 patients and healthy control donors. In addition to examining the behavior of piRNA in the blood cells of severe SARS-CoV-2 infected patients, our aim was to present a distinct piRNA differential expression portrait for each separate cell type. We observed that depending on the type of cell, different sorted control cells (erythrocytes, monocytes, lymphocytes, eosinophils, basophils, and neutrophils) have altering piRNA expression patterns. After analyzing the expression of piRNAs in each set of sorted cells from patients with severe COVID-19, we observed 3 significantly elevated piRNAs - piR-33,123, piR-34,765, piR-43,768 and 9 downregulated piRNAs in erythrocytes. In lymphocytes, all 19 piRNAs were upregulated. Monocytes were presented with a larger amount of statistically significant piRNA, 5 upregulated (piR-49039 piR-31623, piR-37213, piR-44721, piR-44720) and 35 downregulated. It has been previously shown that piR-31,623 has been associated with respiratory syncytial virus infection, and taking in account the major role of piRNA in transposon silencing, we presume that the differential expression patterns which we observed could be a signal of indirect antiviral activity or a specific antiviral cell state. Additionally, in lymphocytes, all 19 piRNAs were upregulated.
Collapse
Affiliation(s)
- Kirill A Kondratov
- City Hospital, No. 40 St, Petersburg, 197706, Russia.
- S. M. Kirov Military Medical Academy, St. Petersburg, 194044, Russia.
- Saint-Petersburg State University, St. Petersburg, 199034, Russia.
| | | | - Yuri V Nikitin
- S. M. Kirov Military Medical Academy, St. Petersburg, 194044, Russia
| | - Anastasiya A Velmiskina
- City Hospital, No. 40 St, Petersburg, 197706, Russia
- Saint-Petersburg State University, St. Petersburg, 199034, Russia
| | | | - Sergey V Mosenko
- City Hospital, No. 40 St, Petersburg, 197706, Russia
- Saint-Petersburg State University, St. Petersburg, 199034, Russia
| | - Irina A Polkovnikova
- City Hospital, No. 40 St, Petersburg, 197706, Russia
- Saint-Petersburg State University, St. Petersburg, 199034, Russia
| | - Anna Yu Asinovskaya
- City Hospital, No. 40 St, Petersburg, 197706, Russia
- Saint-Petersburg State University, St. Petersburg, 199034, Russia
| | - Svetlana V Apalko
- City Hospital, No. 40 St, Petersburg, 197706, Russia
- Saint-Petersburg State University, St. Petersburg, 199034, Russia
| | | | - Andrey M Ivanov
- S. M. Kirov Military Medical Academy, St. Petersburg, 194044, Russia
| | - Sergey G Scherbak
- City Hospital, No. 40 St, Petersburg, 197706, Russia
- Saint-Petersburg State University, St. Petersburg, 199034, Russia
| |
Collapse
|
111
|
Martinez K, Agirre J, Akune Y, Aoki-Kinoshita KF, Arighi C, Axelsen KB, Bolton E, Bordeleau E, Edwards NJ, Fadda E, Feizi T, Hayes C, Ives CM, Joshi HJ, Krishna Prasad K, Kossida S, Lisacek F, Liu Y, Lütteke T, Ma J, Malik A, Martin M, Mehta AY, Neelamegham S, Panneerselvam K, Ranzinger R, Ricard-Blum S, Sanou G, Shanker V, Thomas PD, Tiemeyer M, Urban J, Vita R, Vora J, Yamamoto Y, Mazumder R. Functional implications of glycans and their curation: insights from the workshop held at the 16th Annual International Biocuration Conference in Padua, Italy. Database (Oxford) 2024; 2024:baae073. [PMID: 39137905 PMCID: PMC11321244 DOI: 10.1093/database/baae073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 06/24/2024] [Accepted: 07/10/2024] [Indexed: 08/15/2024]
Abstract
Dynamic changes in protein glycosylation impact human health and disease progression. However, current resources that capture disease and phenotype information focus primarily on the macromolecules within the central dogma of molecular biology (DNA, RNA, proteins). To gain a better understanding of organisms, there is a need to capture the functional impact of glycans and glycosylation on biological processes. A workshop titled "Functional impact of glycans and their curation" was held in conjunction with the 16th Annual International Biocuration Conference to discuss ongoing worldwide activities related to glycan function curation. This workshop brought together subject matter experts, tool developers, and biocurators from over 20 projects and bioinformatics resources. Participants discussed four key topics for each of their resources: (i) how they curate glycan function-related data from publications and other sources, (ii) what type of data they would like to acquire, (iii) what data they currently have, and (iv) what standards they use. Their answers contributed input that provided a comprehensive overview of state-of-the-art glycan function curation and annotations. This report summarizes the outcome of discussions, including potential solutions and areas where curators, data wranglers, and text mining experts can collaborate to address current gaps in glycan and glycosylation annotations, leveraging each other's work to improve their respective resources and encourage impactful data sharing among resources. Database URL: https://wiki.glygen.org/Glycan_Function_Workshop_2023.
Collapse
Affiliation(s)
- Karina Martinez
- Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, 2300 I St. NW, Washington, DC 20052, United States
| | - Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, Wentworth Way, York YO10 5DD, United Kingdom
| | - Yukie Akune
- The Glycosciences Laboratory, Imperial College London, Hammersmith Campus, Du Cane Road, London W12 0NN, United Kingdom
| | - Kiyoko F Aoki-Kinoshita
- Glycan and Life Systems Integration Center (GaLSIC), Soka University, 1-236 Tangi-machi, Hachioji, Tokyo 192-8577, Japan
| | - Cecilia Arighi
- Department of Computer and Information Sciences, University of Delaware, 18 Amstel Ave, Newark, DE 19716, United States
| | - Kristian B Axelsen
- Swiss-Prot Group, Swiss Institute of Bioinformatics (SIB), CMU, 1 rue Michel Servet, Geneva 4 1211, Switzerland
| | - Evan Bolton
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, United States
| | - Emily Bordeleau
- Michael Smith Laboratories, The University of British Columbia, 2185 East Mall, Vancouver, British Columbia V6T 1Z4, Canada
| | - Nathan J Edwards
- Department of Biochemistry and Molecular & Cellular Biology, Georgetown University, 2115 Wisconsin Ave NW, Washington, DC 20007, United States
| | - Elisa Fadda
- Department of Chemistry and Hamilton Institute, Maynooth University, Kilcock Road, Maynooth, Co. Kildare W23 AH3Y, Ireland
| | - Ten Feizi
- The Glycosciences Laboratory, Imperial College London, Hammersmith Campus, Du Cane Road, London W12 0NN, United Kingdom
| | - Catherine Hayes
- Proteome Informatics Group, Swiss Institute of Bioinformatics (SIB), route de Drize 7, Geneva CH-1227, Switzerland
| | - Callum M Ives
- Department of Chemistry and Hamilton Institute, Maynooth University, Kilcock Road, Maynooth, Co. Kildare W23 AH3Y, Ireland
| | - Hiren J Joshi
- Copenhagen Center for Glycomics, Department of Cellular and Molecular Medicine, Faculty of Health Sciences, University of Copenhagen, Blegdamsvej 3, Copenhagen DK-2200, Denmark
| | - Khakurel Krishna Prasad
- ELI Beamlines Facility, The Extreme Light Infrastructure ERIC, Za Radnicí 835, Dolní Břežany 25241, Czech Republic
| | - Sofia Kossida
- IMGT, The International ImMunoGeneTics Information System, National Center for Scientific Research (CNRS), Institute of Human Genetics (IGH), University of Montpellier (UM), 141 rue de la Cardonille, Montpellier 34 090, France
| | - Frederique Lisacek
- Proteome Informatics Group, Swiss Institute of Bioinformatics (SIB), route de Drize 7, Geneva CH-1227, Switzerland
| | - Yan Liu
- The Glycosciences Laboratory, Imperial College London, Hammersmith Campus, Du Cane Road, London W12 0NN, United Kingdom
| | - Thomas Lütteke
- Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Gießen, Frankfurter Str. 100, Gießen 35392, Germany
| | - Junfeng Ma
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, 3900 Reservior Road NW, Washington, DC 20007, United States
| | - Adnan Malik
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Maria Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Akul Y Mehta
- Department of Surgery, Beth Israel Deaconess Medical Center, National Center for Functional Glycomics, Harvard Medical School, 330 Brookline Avenue, Boston, MA 02215, United States
| | - Sriram Neelamegham
- Departments of Chemical & Biological Engineering, Biomedical Engineering and Medicine, University at Buffalo, State University of New York, 906 Furnas Hall, Buffalo, NY 14260, United States
| | - Kalpana Panneerselvam
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - René Ranzinger
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Rd, Athens, GA 30602, United States
| | - Sylvie Ricard-Blum
- Institute of Molecular and Supramolecular Chemistry and Biochemistry (ICBMS), UMR 5246, University Lyon 1, CNRS, 43 Boulevard du 11 novembre 1918, Villeurbanne cedex F-69622, France
| | - Gaoussou Sanou
- IMGT, The International ImMunoGeneTics Information System, National Center for Scientific Research (CNRS), Institute of Human Genetics (IGH), University of Montpellier (UM), 141 rue de la Cardonille, Montpellier 34 090, France
| | - Vijay Shanker
- Department of Computer and Information Sciences, University of Delaware, 18 Amstel Ave, Newark, DE 19716, United States
| | - Paul D Thomas
- Department of Population and Public Health Sciences, University of Southern California, 2001 N Soto Street, Los Angeles, CA 90032, United States
| | - Michael Tiemeyer
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Rd, Athens, GA 30602, United States
| | - James Urban
- Department of Chemistry and Molecular Biology, University of Gothenburg, Medicinaregatan 7 B, Gothenburg 41390, Sweden
| | - Randi Vita
- Immune Epitope Database and Analysis Project, La Jolla Institute for Allergy & Immunology, 9420 Athena Circle, La Jolla, CA 92037, United States
| | - Jeet Vora
- Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, 2300 I St. NW, Washington, DC 20052, United States
| | - Yasunori Yamamoto
- Database Center for Life Science, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, 178-4-4 Wakashiba, Kashiwa, Chiba 277-0871, Japan
| | - Raja Mazumder
- Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, 2300 I St. NW, Washington, DC 20052, United States
| |
Collapse
|
112
|
Akinyemi MO, Oyedele OA, Kleyn MS, Onarinde BA, Adeleke RA, Ezekiel CN. Draft genome sequences of Weissella cibaria GM93m3, a promising probiotic strain from raw goat milk. Microbiol Resour Announc 2024; 13:e0027024. [PMID: 38958438 PMCID: PMC11320988 DOI: 10.1128/mra.00270-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 04/28/2024] [Indexed: 07/04/2024] Open
Abstract
The draft genome of a previously documented potential probiotic Weissella cibaria strain GM93m3 from raw goat milk in Nigeria is reported. The total genome size was 2,447,229 with 46 contigs and G+C content of 44.86%.
Collapse
Affiliation(s)
- Muiz O. Akinyemi
- National Centre for Food Manufacturing, University of Lincoln, Lincoln, United Kingdom
- Unit for Environmental Sciences and Management, North-West University (Potchefstroom Campus), Potchefstroom, South Africa
| | | | - Mariska S. Kleyn
- Unit for Environmental Sciences and Management, North-West University (Potchefstroom Campus), Potchefstroom, South Africa
| | - Bukola A. Onarinde
- National Centre for Food Manufacturing, University of Lincoln, Lincoln, United Kingdom
| | - Rasheed A. Adeleke
- Unit for Environmental Sciences and Management, North-West University (Potchefstroom Campus), Potchefstroom, South Africa
| | - Chibundu N. Ezekiel
- Department of Microbiology, Babcock University, Ilishan Remo, Ogun State, Nigeria
- Department of Agrobiotechnology (IFA-Tulln), Institute of Bioanalytics and Agro-Metabolomics, BOKU University, Vienna, Austria
| |
Collapse
|
113
|
Cameron ES, Sanchez S, Goldman N, Blaxter ML, Finn RD. Diversity and specificity of molecular functions in cyanobacterial symbionts. Sci Rep 2024; 14:18658. [PMID: 39134591 PMCID: PMC11319675 DOI: 10.1038/s41598-024-69215-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2024] [Accepted: 08/01/2024] [Indexed: 08/15/2024] Open
Abstract
Cyanobacteria are globally occurring photosynthetic bacteria notable for their contribution to primary production and production of toxins which have detrimental ecosystem impacts. Furthermore, cyanobacteria can form mutualistic symbiotic relationships with a diverse set of eukaryotes, including land plants, aquatic plankton and fungi. Nevertheless, not all cyanobacteria are found in symbiotic associations suggesting symbiotic cyanobacteria have evolved specializations that facilitate host-interactions. Photosynthetic capabilities, nitrogen fixation, and the production of complex biochemicals are key functions provided by host-associated cyanobacterial symbionts. To explore if additional specializations are associated with such lifestyles in cyanobacteria, we have conducted comparative phylogenomics of molecular functions and of biosynthetic gene clusters (BGCs) in 984 cyanobacterial genomes. Cyanobacteria with host-associated and symbiotic lifestyles were concentrated in the family Nostocaceae, where eight monophyletic clades correspond to specific host taxa. In agreement with previous studies, symbionts are likely to provide fixed nitrogen to their eukaryotic partners, through multiple different nitrogen fixation pathways. Additionally, our analyses identified chitin metabolising pathways in cyanobacteria associated with specific host groups, while obligate symbionts had fewer BGCs. The conservation of molecular functions and BGCs between closely related symbiotic and free-living cyanobacteria suggests the potential for additional cyanobacteria to form symbiotic relationships than is currently known.
Collapse
Affiliation(s)
- Ellen S Cameron
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
- Tree of Life, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Santiago Sanchez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
| | - Nick Goldman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
| | - Mark L Blaxter
- Tree of Life, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Robert D Finn
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK.
| |
Collapse
|
114
|
Malusare A, Kothandaraman H, Tamboli D, Lanman NA, Aggarwal V. Understanding the natural language of DNA using encoder-decoder foundation models with byte-level precision. BIOINFORMATICS ADVANCES 2024; 4:vbae117. [PMID: 39176288 PMCID: PMC11341122 DOI: 10.1093/bioadv/vbae117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 07/06/2024] [Accepted: 08/10/2024] [Indexed: 08/24/2024]
Abstract
Summary This article presents the Ensemble Nucleotide Byte-level Encoder-Decoder (ENBED) foundation model, analyzing DNA sequences at byte-level precision with an encoder-decoder Transformer architecture. ENBED uses a subquadratic implementation of attention to develop an efficient model capable of sequence-to-sequence transformations, generalizing previous genomic models with encoder-only or decoder-only architectures. We use Masked Language Modeling to pretrain the foundation model using reference genome sequences and apply it in the following downstream tasks: (i) identification of enhancers, promotors, and splice sites, (ii) recognition of sequences containing base call mismatches and insertion/deletion errors, an advantage over tokenization schemes involving multiple base pairs, which lose the ability to analyze with byte-level precision, (iii) identification of biological function annotations of genomic sequences, and (iv) generating mutations of the Influenza virus using the encoder-decoder architecture and validating them against real-world observations. In each of these tasks, we demonstrate significant improvement as compared to the existing state-of-the-art results. Availability and implementation The source code used to develop and fine-tune the foundation model has been released on Github (https://github.itap.purdue.edu/Clan-labs/ENBED).
Collapse
Affiliation(s)
- Aditya Malusare
- School of Industrial Engineering, Purdue University, West Lafayette, IN 47907, United States
- Institute for Cancer Research, Purdue University, West Lafayette, IN 47907, United States
| | - Harish Kothandaraman
- Institute for Cancer Research, Purdue University, West Lafayette, IN 47907, United States
| | - Dipesh Tamboli
- Elmore Family School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN 47907, United States
| | - Nadia A Lanman
- Institute for Cancer Research, Purdue University, West Lafayette, IN 47907, United States
- Department of Comparative Pathobiology, Purdue University, West Lafayette, IN 47907, United States
| | - Vaneet Aggarwal
- School of Industrial Engineering, Purdue University, West Lafayette, IN 47907, United States
- Institute for Cancer Research, Purdue University, West Lafayette, IN 47907, United States
- Elmore Family School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN 47907, United States
| |
Collapse
|
115
|
Van Uffelen A, Posadas A, Roosens NHC, Marchal K, De Keersmaecker SCJ, Vanneste K. Benchmarking bacterial taxonomic classification using nanopore metagenomics data of several mock communities. Sci Data 2024; 11:864. [PMID: 39127718 PMCID: PMC11316826 DOI: 10.1038/s41597-024-03672-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 07/22/2024] [Indexed: 08/12/2024] Open
Abstract
Taxonomic classification is crucial in identifying organisms within diverse microbial communities when using metagenomics shotgun sequencing. While second-generation Illumina sequencing still dominates, third-generation nanopore sequencing promises improved classification through longer reads. However, extensive benchmarking studies on nanopore data are lacking. We systematically evaluated performance of bacterial taxonomic classification for metagenomics nanopore sequencing data for several commonly used classifiers, using standardized reference sequence databases, on the largest collection of publicly available data for defined mock communities thus far (nine samples), representing different research domains and application scopes. Our results categorize classifiers into three categories: low precision/high recall; medium precision/medium recall, and high precision/medium recall. Most fall into the first group, although precision can be improved without excessively penalizing recall with suitable abundance filtering. No definitive 'best' classifier emerges, and classifier selection depends on application scope and practical requirements. Although few classifiers designed for long reads exist, they generally exhibit better performance. Our comprehensive benchmarking provides concrete recommendations, supported by publicly available code for reassessment and fine-tuning by other scientists.
Collapse
Affiliation(s)
- Alexander Van Uffelen
- Transversal activities in Applied Genomics, Sciensano, Brussels, Belgium
- Department of Information Technology, Internet Technology and Data Science Lab (IDLab), Interuniversity Microelectronics Centre (IMEC), Ghent University, Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
| | - Andrés Posadas
- Transversal activities in Applied Genomics, Sciensano, Brussels, Belgium
- Department of Information Technology, Internet Technology and Data Science Lab (IDLab), Interuniversity Microelectronics Centre (IMEC), Ghent University, Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
| | - Nancy H C Roosens
- Transversal activities in Applied Genomics, Sciensano, Brussels, Belgium
| | - Kathleen Marchal
- Department of Information Technology, Internet Technology and Data Science Lab (IDLab), Interuniversity Microelectronics Centre (IMEC), Ghent University, Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Department of Genetics, University of Pretoria, Pretoria, South Africa
| | | | - Kevin Vanneste
- Transversal activities in Applied Genomics, Sciensano, Brussels, Belgium.
| |
Collapse
|
116
|
Li X, Meng B, Zhang Z, Wei L, Chang W, Wang Y, Zhang K, Li T, Lu K. qPrimerDB 2.0: an updated comprehensive gene-specific qPCR primer database for 1172 organisms. Nucleic Acids Res 2024:gkae684. [PMID: 39119895 DOI: 10.1093/nar/gkae684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Revised: 07/04/2024] [Accepted: 07/25/2024] [Indexed: 08/10/2024] Open
Abstract
High-quality primer design is essential for the success of all polymerase chain reaction (PCR)-based experiments. We previously developed a thermodynamics-based gene-specific quantitative PCR (qPCR) primer database for 147 organisms, which has been used extensively in gene expression studies. However, the number of organisms and the imperfection of function in the database limits its potential applications. Here, we improved the functionality of qPrimerDB to create a more comprehensive primer resource. Specifically, we (i) developed an improved primer design tool, qPrimer, building upon the previous qPrimerDB pipeline, to enhance the efficiency and simplicity of genome-scale qPCR primer design; (ii) pre-computed qPCR primer resources from 1 308 genomes of 1172 organisms and (iii) introduced a complete system for identifying, designing, checking, marking, and submitting qPCR primers. qPrimerDB 2.0 is freely available at https://qprimerdb.biodb.org. The qPrimer source code is available at https://github.com/swu1019lab/qPrimer.
Collapse
Affiliation(s)
- Xiaodong Li
- Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, College of Agronomy and Biotechnology, Southwest University, Beibei, Chongqing 400715, China
| | - Boyu Meng
- Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, College of Agronomy and Biotechnology, Southwest University, Beibei, Chongqing 400715, China
| | - Zhi Zhang
- State Key Laboratory of Resource Insects, Southwest University, Chongqing 400715, China
| | - Lijuan Wei
- Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, College of Agronomy and Biotechnology, Southwest University, Beibei, Chongqing 400715, China
| | - Wei Chang
- Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, College of Agronomy and Biotechnology, Southwest University, Beibei, Chongqing 400715, China
| | - Yuhong Wang
- Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, College of Agronomy and Biotechnology, Southwest University, Beibei, Chongqing 400715, China
| | - Kai Zhang
- Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, College of Agronomy and Biotechnology, Southwest University, Beibei, Chongqing 400715, China
| | - Tian Li
- State Key Laboratory of Resource Insects, Southwest University, Chongqing 400715, China
| | - Kun Lu
- Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, College of Agronomy and Biotechnology, Southwest University, Beibei, Chongqing 400715, China
- Engineering Research Center of South Upland Agriculture, Ministry of Education, Chongqing 400715, China
- Academy of Agricultural Sciences, Southwest University, Beibei, Chongqing 400715, China
| |
Collapse
|
117
|
Cipriano A, Colantoni A, Calicchio A, Fiorentino J, Gomes D, Moqri M, Parker A, Rasouli S, Caldwell M, Briganti F, Roncarolo MG, Baldini A, Weinacht KG, Tartaglia GG, Sebastiano V. Transcriptional and epigenetic characterization of a new in vitro platform to model the formation of human pharyngeal endoderm. Genome Biol 2024; 25:211. [PMID: 39118163 PMCID: PMC11312149 DOI: 10.1186/s13059-024-03354-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Accepted: 07/26/2024] [Indexed: 08/10/2024] Open
Abstract
BACKGROUND The Pharyngeal Endoderm (PE) is an extremely relevant developmental tissue, serving as the progenitor for the esophagus, parathyroids, thyroids, lungs, and thymus. While several studies have highlighted the importance of PE cells, a detailed transcriptional and epigenetic characterization of this important developmental stage is still missing, especially in humans, due to technical and ethical constraints pertaining to its early formation. RESULTS Here we fill this knowledge gap by developing an in vitro protocol for the derivation of PE-like cells from human Embryonic Stem Cells (hESCs) and by providing an integrated multi-omics characterization. Our PE-like cells robustly express PE markers and are transcriptionally homogenous and similar to in vivo mouse PE cells. In addition, we define their epigenetic landscape and dynamic changes in response to Retinoic Acid by combining ATAC-Seq and ChIP-Seq of histone modifications. The integration of multiple high-throughput datasets leads to the identification of new putative regulatory regions and to the inference of a Retinoic Acid-centered transcription factor network orchestrating the development of PE-like cells. CONCLUSIONS By combining hESCs differentiation with computational genomics, our work reveals the epigenetic dynamics that occur during human PE differentiation, providing a solid resource and foundation for research focused on the development of PE derivatives and the modeling of their developmental defects in genetic syndromes.
Collapse
Affiliation(s)
- Andrea Cipriano
- Department of Obstetrics & Gynecology, Stanford University, Stanford, CA, 94305, USA
- Institute for Stem Cell Biology and Regenerative Medicine (ISCBRM), Stanford School of Medicine, Stanford, CA, 94305, USA
| | - Alessio Colantoni
- Department of Biology and Biotechnology Charles Darwin, Sapienza University of Rome, 00185, Rome, Italy
- Center for Life Nano- & Neuro-Science, Fondazione Istituto Italiano Di Tecnologia (IIT), 00161, Rome, Italy
| | - Alessandro Calicchio
- Department of Obstetrics & Gynecology, Stanford University, Stanford, CA, 94305, USA
- Institute for Stem Cell Biology and Regenerative Medicine (ISCBRM), Stanford School of Medicine, Stanford, CA, 94305, USA
| | - Jonathan Fiorentino
- Center for Life Nano- & Neuro-Science, Fondazione Istituto Italiano Di Tecnologia (IIT), 00161, Rome, Italy
| | - Danielle Gomes
- Department of Obstetrics & Gynecology, Stanford University, Stanford, CA, 94305, USA
- Institute for Stem Cell Biology and Regenerative Medicine (ISCBRM), Stanford School of Medicine, Stanford, CA, 94305, USA
| | - Mahdi Moqri
- Biomedical Informatics Program, Department of Biomedical Data Science, Stanford University, Stanford, CA, 94305, USA
| | - Alexander Parker
- Department of Obstetrics & Gynecology, Stanford University, Stanford, CA, 94305, USA
- Institute for Stem Cell Biology and Regenerative Medicine (ISCBRM), Stanford School of Medicine, Stanford, CA, 94305, USA
| | - Sajede Rasouli
- Department of Obstetrics & Gynecology, Stanford University, Stanford, CA, 94305, USA
- Institute for Stem Cell Biology and Regenerative Medicine (ISCBRM), Stanford School of Medicine, Stanford, CA, 94305, USA
| | - Matthew Caldwell
- Department of Obstetrics & Gynecology, Stanford University, Stanford, CA, 94305, USA
- Institute for Stem Cell Biology and Regenerative Medicine (ISCBRM), Stanford School of Medicine, Stanford, CA, 94305, USA
| | - Francesca Briganti
- Department of Genetics, School of Medicine, Stanford University, Stanford, CA, 94305, USA
- Cardiovascular Institute and Department of Medicine, Stanford University, Stanford, CA, 94305, USA
| | - Maria Grazia Roncarolo
- Institute for Stem Cell Biology and Regenerative Medicine (ISCBRM), Stanford School of Medicine, Stanford, CA, 94305, USA
- Division of Hematology, Oncology, Stem Cell Transplantation, and Regenerative Medicine, Department of Pediatrics, Stanford School of Medicine, Stanford, CA, 94305, USA
- Center for Definitive and Curative Medicine (CDCM), Stanford School of Medicine, Stanford, CA, USA
| | - Antonio Baldini
- Department of Molecular Medicine and Medical Biotech., University Federico II, 80131, Naples, Italy
| | - Katja G Weinacht
- Division of Hematology, Oncology, Stem Cell Transplantation, and Regenerative Medicine, Department of Pediatrics, Stanford School of Medicine, Stanford, CA, 94305, USA
| | - Gian Gaetano Tartaglia
- Center for Life Nano- & Neuro-Science, Fondazione Istituto Italiano Di Tecnologia (IIT), 00161, Rome, Italy.
- Center for Human Technology, Fondazione Istituto Italiano Di Tecnologia (IIT), 16152, Genoa, Italy.
| | - Vittorio Sebastiano
- Department of Obstetrics & Gynecology, Stanford University, Stanford, CA, 94305, USA.
- Institute for Stem Cell Biology and Regenerative Medicine (ISCBRM), Stanford School of Medicine, Stanford, CA, 94305, USA.
| |
Collapse
|
118
|
Middlebrook EA, Katani R, Fair JM. OrthoPhyl-streamlining large-scale, orthology-based phylogenomic studies of bacteria at broad evolutionary scales. G3 (BETHESDA, MD.) 2024; 14:jkae119. [PMID: 38839049 PMCID: PMC11304591 DOI: 10.1093/g3journal/jkae119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Revised: 05/15/2024] [Accepted: 05/29/2024] [Indexed: 06/07/2024]
Abstract
There are a staggering number of publicly available bacterial genome sequences (at writing, 2.0 million assemblies in NCBI's GenBank alone), and the deposition rate continues to increase. This wealth of data begs for phylogenetic analyses to place these sequences within an evolutionary context. A phylogenetic placement not only aids in taxonomic classification but informs the evolution of novel phenotypes, targets of selection, and horizontal gene transfer. Building trees from multi-gene codon alignments is a laborious task that requires bioinformatic expertise, rigorous curation of orthologs, and heavy computation. Compounding the problem is the lack of tools that can streamline these processes for building trees from large-scale genomic data. Here we present OrthoPhyl, which takes bacterial genome assemblies and reconstructs trees from whole genome codon alignments. The analysis pipeline can analyze an arbitrarily large number of input genomes (>1200 tested here) by identifying a diversity-spanning subset of assemblies and using these genomes to build gene models to infer orthologs in the full dataset. To illustrate the versatility of OrthoPhyl, we show three use cases: E. coli/Shigella, Brucella/Ochrobactrum and the order Rickettsiales. We compare trees generated with OrthoPhyl to trees generated with kSNP3 and GToTree along with published trees using alternative methods. We show that OrthoPhyl trees are consistent with other methods while incorporating more data, allowing for greater numbers of input genomes, and more flexibility of analysis.
Collapse
Affiliation(s)
- Earl A Middlebrook
- Genomics and Bioanalytics Group, Los Alamos National Laboratory, Mailstop M888, Los Alamos, NM 87545, USA
| | - Robab Katani
- 401 Huck Life Sciences Building, Huck Institutes of Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
| | - Jeanne M Fair
- Genomics and Bioanalytics Group, Los Alamos National Laboratory, Mailstop M888, Los Alamos, NM 87545, USA
| |
Collapse
|
119
|
Allan MF, Aruda J, Plung JS, Grote SL, des Taillades YJM, de Lajarte AA, Bathe M, Rouskin S. Discovery and Quantification of Long-Range RNA Base Pairs in Coronavirus Genomes with SEARCH-MaP and SEISMIC-RNA. RESEARCH SQUARE 2024:rs.3.rs-4814547. [PMID: 39149495 PMCID: PMC11326378 DOI: 10.21203/rs.3.rs-4814547/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
RNA molecules perform a diversity of essential functions for which their linear sequences must fold into higher-order structures. Techniques including crystallography and cryogenic electron microscopy have revealed 3D structures of ribosomal, transfer, and other well-structured RNAs; while chemical probing with sequencing facilitates secondary structure modeling of any RNAs of interest, even within cells. Ongoing efforts continue increasing the accuracy, resolution, and ability to distinguish coexisting alternative structures. However, no method can discover and quantify alternative structures with base pairs spanning arbitrarily long distances - an obstacle for studying viral, messenger, and long noncoding RNAs, which may form long-range base pairs. Here, we introduce the method of Structure Ensemble Ablation by Reverse Complement Hybridization with Mutational Profiling (SEARCH-MaP) and software for Structure Ensemble Inference by Sequencing, Mutation Identification, and Clustering of RNA (SEISMIC-RNA). We use SEARCH-MaP and SEISMIC-RNA to discover that the frameshift stimulating element of SARS coronavirus 2 base-pairs with another element 1 kilobase downstream in nearly half of RNA molecules, and that this structure competes with a pseudoknot that stimulates ribosomal frameshifting. Moreover, we identify long-range base pairs involving the frameshift stimulating element in other coronaviruses including SARS coronavirus 1 and transmissible gastroenteritis virus, and model the full genomic secondary structure of the latter. These findings suggest that long-range base pairs are common in coronaviruses and may regulate ribosomal frameshifting, which is essential for viral RNA synthesis. We anticipate that SEARCH-MaP will enable solving many RNA structure ensembles that have eluded characterization, thereby enhancing our general understanding of RNA structures and their functions. SEISMIC-RNA, software for analyzing mutational profiling data at any scale, could power future studies on RNA structure and is available on GitHub and the Python Package Index.
Collapse
Affiliation(s)
- Matthew F. Allan
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, USA 02115
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA 02139
- Computational and Systems Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA 02139
| | - Justin Aruda
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, USA 02115
- Harvard Program in Biological and Biomedical Sciences, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA 02115
| | - Jesse S. Plung
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, USA 02115
- Harvard Program in Virology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA 02115
| | - Scott L. Grote
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, USA 02115
| | | | - Albéric A. de Lajarte
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, USA 02115
| | - Mark Bathe
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA 02139
| | - Silvi Rouskin
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, USA 02115
| |
Collapse
|
120
|
McEvoy SL, Meyer RS, Hasenstab-Lehman KE, Guilliams CM. The reference genome of an endangered Asteraceae, Deinandra increscens subsp. villosa, endemic to the Central Coast of California. G3 (BETHESDA, MD.) 2024; 14:jkae117. [PMID: 38845594 PMCID: PMC11304951 DOI: 10.1093/g3journal/jkae117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Accepted: 05/26/2024] [Indexed: 08/09/2024]
Abstract
We present a reference genome for the federally endangered Gaviota tarplant, Deinandra increscens subsp. villosa (Madiinae, Asteraceae), an annual herb endemic to the Central California coast. Generating PacBio HiFi, Oxford Nanopore Technologies, and Dovetail Omni-C data, we assembled a haploid consensus genome of 1.67 Gb as 28.7 K scaffolds with a scaffold N50 of 74.9 Mb. We annotated repeat content in 74.8% of the genome. Long terminal repeats (LTRs) covered 44.0% of the genome with Copia families predominant at 22.9% followed by Gypsy at 14.2%. Both Gypsy and Copia elements were common in ancestral peaks of LTRs, and the most abundant element was a Gypsy element containing nested Copia/Angela sequence similarity, reflecting a complex evolutionary history of repeat activity. Gene annotation produced 33,257 genes and 68,942 transcripts, of which 99% were functionally annotated. BUSCO scores for the annotated proteins were 96.0% complete of which 77.6% was single copy and 18.4% duplicates. Whole genome duplication synonymous mutation rates of Gaviota tarplant and sunflower (Helianthus annuus) shared peaks that correspond to the last Asteraceae polyploidization event and subsequent divergence from a common ancestor at ∼27 MYA. Regions of high-density tandem genes were identified, pointing to potentially important loci of environmental adaptation in this species.
Collapse
Affiliation(s)
- Susan L McEvoy
- Department of Conservation and Research, Santa Barbara Botanic Garden, Santa Barbara, CA 93105, USA
| | - Rachel S Meyer
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | | | - C Matt Guilliams
- Department of Conservation and Research, Santa Barbara Botanic Garden, Santa Barbara, CA 93105, USA
| |
Collapse
|
121
|
Forsyth JK, Zhu J, Chavannes AS, Trevorrow ZH, Hyat M, Sievertsen SA, Ferreira-Ianone S, Conomos MP, Nuechterlein KH, Asarnow RF, Green MF, Karlsgodt KH, Perkins DO, Cannon TD, Addington JM, Cadenhead KS, Cornblatt BA, Keshavan MS, Mathalon DH, Stone WS, Tsuang MT, Walker EF, Woods SW, Narr KL, McEwen SC, Schleifer CH, Yee CM, Diehl CK, Guha A, Miller GA, Alexander-Bloch AF, Seidlitz J, Bethlehem RAI, Ophoff RA, Bearden CE. Fetal Gene Regulatory Gene Deletions are Associated with Poor Cognition and Altered Cortical Morphology in Schizophrenia and Community-Based Samples. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.08.02.24311302. [PMID: 39211869 PMCID: PMC11361264 DOI: 10.1101/2024.08.02.24311302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/04/2024]
Abstract
Schizophrenia spectrum disorders (SSDs) are characterized by substantial clinical and genetic heterogeneity. Multiple recurrent copy number variants (CNVs) increase risk for SSDs; however, how known risk CNVs and broader genome-wide CNVs influence clinical variability is unclear. The current study examined associations between borderline intellectual functioning or childhood-onset psychosis, known risk CNVs, and burden of deletions affecting genes in 18 previously validated neurodevelopmental gene-sets in 618 SSD individuals. CNV associations were assessed for replication in 235 SSD relatives and 583 controls, and 9,930 youth from the Adolescent Brain Cognitive Development (ABCD) Study. Known SSD- and neurodevelopmental disorder (NDD)-risk CNVs were associated with borderline intellectual functioning in SSD cases (odds ratios (OR) = 7.09 and 4.57, respectively); NDD-risk deletions were nominally associated with childhood-onset psychosis (OR = 4.34). Furthermore, deletion of genes involved in regulating gene expression during fetal brain development was associated with borderline intellectual functioning across SSD cases and non-cases (OR = 2.58), with partial replication in the ABCD cohort. Exploratory analyses of cortical morphology showed associations between fetal gene regulatory gene deletions and altered gray matter volume and cortical thickness across cohorts. Results highlight contributions of known risk CNVs to phenotypic variability in SSD and the utility of a neurodevelopmental framework for identifying mechanisms that influence phenotypic variability in SSDs, as well as the broader population, with implications for personalized medicine approaches to care.
Collapse
|
122
|
Shi H, Hou G, Jiang S, Su X. PM-profiler: a high-resolution and fast tool for taxonomy annotation of amplicon-based microbiome. Microbiol Spectr 2024; 12:e0069524. [PMID: 38912828 PMCID: PMC11302061 DOI: 10.1128/spectrum.00695-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Accepted: 05/12/2024] [Indexed: 06/25/2024] Open
Abstract
Amplicon sequencing stands as a cornerstone in microbiome profiling, yet concerns persist regarding its resolution and accuracy. The enhancement of reference databases and annotations marks a new era for 16S rRNA-based profiling. Capitalizing on this potential, we introduce PM-profiler, a novel tool for profiling amplicon short reads. PM-profiler is implemented by C++-based advanced algorithms, such as pre-allocated hash for reference construction, hybrid and dynamic short-read matching, big-data-guided dual-mode hierarchical taxonomy annotation strategy, and full-procedure parallel computing. This tool delivers species-level resolution and ultrafast speed for large-scale microbiomes, surpassing alignment-based approaches and the Naïve-Bayesian model. Furthermore, recognizing the global uneven distribution of microbes, we delineate optimal annotation strategies for each sampling habitat based on microbial patterns over 270,000 microbiomes. Integrated with the established workflow of Parallel-Meta Suite and the latest curated reference databases, this endeavor offers a swift and dependable solution for high-precision microbiome surveys.IMPORTANCEOur study introduces PM-profiler, a new tool that deciphers the complexity of microbial communities. With advanced algorithms, flexible annotation strategies, and well-organized big-data, PM-profiler provides a faster and more accurate way to study on microbiomes, paving the way for discoveries that could improve our understanding of microbiomes and their impact on the world.
Collapse
Affiliation(s)
- Haobo Shi
- College of Computer Science and Technology, Qingdao University, Qingdao, Shandong, China
| | - Guosen Hou
- College of Computer Science and Technology, Qingdao University, Qingdao, Shandong, China
| | - Sikai Jiang
- College of Computer Science and Technology, Qingdao University, Qingdao, Shandong, China
| | - Xiaoquan Su
- College of Computer Science and Technology, Qingdao University, Qingdao, Shandong, China
| |
Collapse
|
123
|
Hallgren MB, Clausen PTLC, Aarestrup FM. NanoMGT: Marker gene typing of low complexity mono-species metagenomic samples using noisy long reads. Biol Methods Protoc 2024; 9:bpae057. [PMID: 39262440 PMCID: PMC11387619 DOI: 10.1093/biomethods/bpae057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 07/23/2024] [Accepted: 08/01/2024] [Indexed: 09/13/2024] Open
Abstract
Rapid advancements in sequencing technologies have led to significant progress in microbial genomics, yet challenges persist in accurately identifying microbial strain diversity in metagenomic samples, especially when working with noisy long-read data from platforms like Oxford Nanopore Technologies (ONT). In this article, we introduce NanoMGT, a tool designed to enhance marker gene typing in low-complexity mono-species samples, leveraging the unique properties of long reads. NanoMGT excels in its ability to accurately identify mutations amidst high error rates, ensuring the reliable detection of multiple strain-specific marker genes. Our tool implements a novel scoring system that rewards mutations co-occurring across different reads and penalizes densely grouped, likely erroneous variants, thereby achieving a good balance between sensitivity and precision. A comparative evaluation of NanoMGT, using a simulated multi-strain sample of seven bacterial species, demonstrated superior performance relative to existing tools and the advantages of using a threshold-based filtering approach to calling minority variants in ONT's sequencing data. NanoMGT's potential as a post-binning tool in metagenomic pipelines is particularly notable, enabling researchers to more accurately determine specific alleles and understand strain diversity in microbial communities. Our findings have significant implications for clinical diagnostics, environmental microbiology, and the broader field of genomics. The findings offer a reliable and efficient approach to marker gene typing in complex metagenomic samples.
Collapse
Affiliation(s)
- Malte B Hallgren
- National Food Institute, Technical University of Denmark, Kemitorvet 204, 2800, Kgs. Lyngby, Denmark
| | - Philip T L C Clausen
- National Food Institute, Technical University of Denmark, Kemitorvet 204, 2800, Kgs. Lyngby, Denmark
| | - Frank M Aarestrup
- National Food Institute, Technical University of Denmark, Kemitorvet 204, 2800, Kgs. Lyngby, Denmark
| |
Collapse
|
124
|
Delgado S, Armijo Á, Bravo V, Orellana O, Salazar JC, Katz A. Impact of the chemical modification of tRNAs anticodon loop on the variability and evolution of codon usage in proteobacteria. Front Microbiol 2024; 15:1412318. [PMID: 39161601 PMCID: PMC11332805 DOI: 10.3389/fmicb.2024.1412318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Accepted: 05/27/2024] [Indexed: 08/21/2024] Open
Abstract
Despite the highly conserved nature of the genetic code, the frequency of usage of each codon can vary significantly. The evolution of codon usage is shaped by two main evolutionary forces: mutational bias and selection pressures. These pressures can be driven by environmental factors, but also by the need for efficient translation, which depends heavily on the concentration of transfer RNAs (tRNAs) within the cell. The data presented here supports the proposal that tRNA modifications play a key role in shaping the overall preference of codon usage in proteobacteria. Interestingly, some codons, such as CGA and AGG (encoding arginine), exhibit a surprisingly low level of variation in their frequency of usage, even across genomes with differing GC content. These findings suggest that the evolution of GC content in proteobacterial genomes might be primarily driven by changes in the usage of a specific subset of codons, whose usage is itself influenced by tRNA modifications.
Collapse
Affiliation(s)
| | - Álvaro Armijo
- Facultad de Ciencias Químicas y Farmacéuticas, Universidad de Chile, Santiago, Chile
| | - Verónica Bravo
- Programa Centro de Investigacion Biomédica y Aplicada, Escuela de Medicina, Facultad de Ciencias Médicas, Universidad de Santiago de Chile, Santiago, Chile
| | - Omar Orellana
- Programa de Biología Celular y Molecular, Instituto de Ciencias Biomédicas, Facultad de Medicina, Universidad de Chile, Santiago, Chile
| | - Juan Carlos Salazar
- Programa de Microbiología y Micología, Instituto de Ciencias Biomédicas, Facultad de Medicina, Universidad de Chile, Santiago, Chile
| | - Assaf Katz
- Programa de Biología Celular y Molecular, Instituto de Ciencias Biomédicas, Facultad de Medicina, Universidad de Chile, Santiago, Chile
| |
Collapse
|
125
|
Gable SM, Bushroe NA, Mendez JM, Wilson A, Pinto BJ, Gamble T, Tollis M. Differential Conservation and Loss of Chicken Repeat 1 (CR1) Retrotransposons in Squamates Reveal Lineage-Specific Genome Dynamics Across Reptiles. Genome Biol Evol 2024; 16:evae157. [PMID: 39031594 PMCID: PMC11303007 DOI: 10.1093/gbe/evae157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 07/09/2024] [Accepted: 07/11/2024] [Indexed: 07/22/2024] Open
Abstract
Transposable elements (TEs) are repetitive DNA sequences which create mutations and generate genetic diversity across the tree of life. In amniote vertebrates, TEs have been mainly studied in mammals and birds, whose genomes generally display low TE diversity. Squamates (Order Squamata; including ∼11,000 extant species of lizards and snakes) show as much variation in TE abundance and activity as they do in species and phenotypes. Despite this high TE activity, squamate genomes are remarkably uniform in size. We hypothesize that novel, lineage-specific genome dynamics have evolved over the course of squamate evolution. To understand the interplay between TEs and host genomes, we analyzed the evolutionary history of the chicken repeat 1 (CR1) retrotransposon, a TE family found in most tetrapod genomes which is the dominant TE in most reptiles. We compared 113 squamate genomes to the genomes of turtles, crocodilians, and birds and used ancestral state reconstruction to identify shifts in the rate of CR1 copy number evolution across reptiles. We analyzed the repeat landscapes of CR1 in squamate genomes and determined that shifts in the rate of CR1 copy number evolution are associated with lineage-specific variation in CR1 activity. We then used phylogenetic reconstruction of CR1 subfamilies across amniotes to reveal both recent and ancient CR1 subclades across the squamate tree of life. The patterns of CR1 evolution in squamates contrast other amniotes, suggesting key differences in how TEs interact with different host genomes and at different points across evolutionary history.
Collapse
Affiliation(s)
- Simone M Gable
- School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA
| | - Nicholas A Bushroe
- School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA
| | - Jasmine M Mendez
- School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA
| | - Adam Wilson
- School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA
| | - Brendan J Pinto
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ, USA
- Department of Zoology, Milwaukee Public Museum, Milwaukee, WI, USA
| | - Tony Gamble
- Department of Zoology, Milwaukee Public Museum, Milwaukee, WI, USA
- Department of Biological Sciences, Marquette University, Milwaukee, WI, USA
- Bell Museum of Natural History, University of Minnesota, St. Paul, MN, USA
| | - Marc Tollis
- School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA
| |
Collapse
|
126
|
Li A, Zhou H, Xiong S, Li J, Mallik S, Fei R, Liu Y, Zhou H, Wang X, Hei X, Wang L. PLEKv2: predicting lncRNAs and mRNAs based on intrinsic sequence features and the coding-net model. BMC Genomics 2024; 25:756. [PMID: 39095710 PMCID: PMC11295476 DOI: 10.1186/s12864-024-10662-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Accepted: 07/25/2024] [Indexed: 08/04/2024] Open
Abstract
BACKGROUND Long non-coding RNAs (lncRNAs) are RNA transcripts of more than 200 nucleotides that do not encode canonical proteins. Their biological structure is similar to messenger RNAs (mRNAs). To distinguish between lncRNA and mRNA transcripts quickly and accurately, we upgraded the PLEK alignment-free tool to its next version, PLEKv2, and constructed models tailored for both animals and plants. RESULTS PLEKv2 can achieve 98.7% prediction accuracy for human datasets. Compared with classical tools and deep learning-based models, this is 8.1%, 3.7%, 16.6%, 1.4%, 4.9%, and 48.9% higher than CPC2, CNCI, Wen et al.'s CNN, LncADeep, PLEK, and NcResNet, respectively. The accuracy of PLEKv2 was > 90% for cross-species prediction. PLEKv2 is more effective and robust than CPC2, CNCI, LncADeep, PLEK, and NcResNet for primate datasets (including chimpanzees, macaques, and gorillas). Moreover, PLEKv2 is not only suitable for non-human primates that are closely related to humans, but can also predict the coding ability of RNA sequences in plants such as Arabidopsis. CONCLUSIONS The experimental results illustrate that the model constructed by PLEKv2 can distinguish lncRNAs and mRNAs better than PLEK. The PLEKv2 software is freely available at https://sourceforge.net/projects/plek2/ .
Collapse
Affiliation(s)
- Aimin Li
- Shaanxi Key Laboratory for Network Computing and Security Technology, School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, Shaanxi, 710048, China.
| | - Haotian Zhou
- Shaanxi Key Laboratory for Network Computing and Security Technology, School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, Shaanxi, 710048, China
| | - Siqi Xiong
- Department of Information Engineering, College of Technology, Hubei Engineering University, Xiaogan, Hubei, 432000, China.
| | - Junhuai Li
- Shaanxi Key Laboratory for Network Computing and Security Technology, School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, Shaanxi, 710048, China
| | - Saurav Mallik
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA
| | - Rong Fei
- Shaanxi Key Laboratory for Network Computing and Security Technology, School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, Shaanxi, 710048, China
| | - Yajun Liu
- Shaanxi Key Laboratory for Network Computing and Security Technology, School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, Shaanxi, 710048, China
| | - Hongfang Zhou
- Shaanxi Key Laboratory for Network Computing and Security Technology, School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, Shaanxi, 710048, China
| | - Xiaofan Wang
- Shaanxi Key Laboratory for Network Computing and Security Technology, School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, Shaanxi, 710048, China
| | - Xinhong Hei
- Shaanxi Key Laboratory for Network Computing and Security Technology, School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, Shaanxi, 710048, China
| | - Lei Wang
- Shaanxi Key Laboratory for Network Computing and Security Technology, School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, Shaanxi, 710048, China
| |
Collapse
|
127
|
Kozłowska-Masłoń J, Ciomborowska-Basheer J, Kubiak MR, Makałowska I. Evolution of retrocopies in the context of HUSH silencing. Biol Direct 2024; 19:60. [PMID: 39095906 PMCID: PMC11295320 DOI: 10.1186/s13062-024-00507-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Accepted: 07/29/2024] [Indexed: 08/04/2024] Open
Abstract
Retrotransposition is one of the main factors responsible for gene duplication and thus genome evolution. However, the sequences that undergo this process are not only an excellent source of biological diversity, but in certain cases also pose a threat to the integrity of the DNA. One of the mechanisms that protects against the incorporation of mobile elements is the HUSH complex, which is responsible for silencing long, intronless, transcriptionally active transposed sequences that are rich in adenine on the sense strand. In this study, broad sets of human and porcine retrocopies were analysed with respect to the above factors, taking into account evolution of these molecules. Analysis of expression pattern, genomic structure, transcript length, and nucleotide substitution frequency showed the strong relationship between the expression level and exon length as well as the protective nature of introns. The results of the studies also showed that there is no direct correlation between the expression level and adenine content. However, protein-coding retrocopies, which have a lower adenine content, have a significantly higher expression level than the adenine-rich non-coding but expressed retrocopies. Therefore, although the mechanism of HUSH silencing may be an important part of the regulation of retrocopy expression, it is one component of a more complex molecular network that remains to be elucidated.
Collapse
Affiliation(s)
- Joanna Kozłowska-Masłoń
- Institute of Human Biology and Evolution, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznańskiego 6, Poznań, Poland
- Laboratory of Cancer Genetics, Greater Poland Cancer Centre, Garbary 15, Poznań, Poland
| | - Joanna Ciomborowska-Basheer
- Institute of Human Biology and Evolution, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznańskiego 6, Poznań, Poland
- Laboratory of Nature Education and Conservation, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznańskiego 6, Poznań, Poland
| | - Magdalena Regina Kubiak
- Institute of Human Biology and Evolution, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznańskiego 6, Poznań, Poland
| | - Izabela Makałowska
- Institute of Human Biology and Evolution, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznańskiego 6, Poznań, Poland.
| |
Collapse
|
128
|
Gorodetskiy V, Sidorova Y, Biderman B, Kupryshina N, Ryzhikova N, Sudarikov A. Gamma-delta T-cell large granular lymphocytic leukemia in the setting of rheumatologic diseases. Front Cell Dev Biol 2024; 12:1434676. [PMID: 39161592 PMCID: PMC11331004 DOI: 10.3389/fcell.2024.1434676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Accepted: 07/09/2024] [Indexed: 08/21/2024] Open
Abstract
Background T-cell leukemia originating from large granular lymphocytes (T-LGL leukemia) is a rare lymphoid neoplasia characterized by clonal proliferation of large granular T lymphocytes expressing αβ or γδ T-cell receptor (TCR) on the cell membrane. γδT-LGL leukemia, accounting for approximately 17% of all T-LGL leukemia cases, is associated with autoimmune diseases. However, the features of γδT-LGL leukemia in patients with rheumatologic diseases are still insufficiently characterized. Methods In this retrospective study, 15 patients with rheumatologic disease-associated γδT-LGL leukemia were included. The patients were obtained from a single center from 2008 to 2023. Data related to clinical characteristics and rheumatologic diagnoses were collected. Immunophenotype evaluations as well as T-lymphocyte clonality (based on TCR-γ, TCR-β, and TCR-δ gene rearrangements), and signal transducer and activator of transcription (STAT) three and STAT5B mutation analyses (by next-generation sequencing) were performed on blood, bone marrow, and spleen samples. Results All but one patient had rheumatoid arthritis (RA). In 36% of patients, manifestations of γδT-LGL leukemia were present before or concurrently with clinical manifestations of RA. Splenomegaly was observed in 60% of patients and neutropenia (<1.5 × 109/L) was detected in 93% of cases. CD4-/CD8- and CD4-/CD8+ subtypes were detected in seven cases each. Mutations in STAT3 were detected in 80% of patients; however, STAT5B mutations were not detected. Evaluations of T-cell clonality and variant allele frequencies at STAT3 in the blood, bone marrow, and spleen tissue revealed an unusual variant of CD4-/CD8- γδT-LGL leukemia with predominant involvement of the spleen, involvement of the bone marrow to a less extent, and no tumor cells in peripheral blood. Conclusion The mechanism by which γδT-LGL leukemia may induce the development of RA in some patients requires further investigation. Cases of RA-associated γδT-LGL leukemia with neutropenia and splenomegaly but no detectable tumor-associated lymphocytes in peripheral blood (the so-called splenic variant of T-LGL leukemia) are difficult to diagnose and may be misdiagnosed as Felty syndrome or hepatosplenic T-cell lymphoma.
Collapse
Affiliation(s)
| | - Yulia Sidorova
- Laboratory of Molecular Hematology, National Medical Research Center for Hematology, Moscow, Russia
| | - Bella Biderman
- Laboratory of Molecular Hematology, National Medical Research Center for Hematology, Moscow, Russia
| | - Natalia Kupryshina
- Hematopoiesis Immunology Laboratory, Russian Cancer Research Center N.N. Blokhin, Moscow, Russia
| | - Natalya Ryzhikova
- Laboratory of Molecular Hematology, National Medical Research Center for Hematology, Moscow, Russia
| | - Andrey Sudarikov
- Laboratory of Molecular Hematology, National Medical Research Center for Hematology, Moscow, Russia
| |
Collapse
|
129
|
Sahrhage M, Paul NB, Beißbarth T, Haubrock M. The importance of DNA sequence for nucleosome positioning in transcriptional regulation. Life Sci Alliance 2024; 7:e202302380. [PMID: 38830772 PMCID: PMC11147951 DOI: 10.26508/lsa.202302380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 05/15/2024] [Accepted: 05/16/2024] [Indexed: 06/05/2024] Open
Abstract
Nucleosome positioning is a key factor for transcriptional regulation. Nucleosomes regulate the dynamic accessibility of chromatin and interact with the transcription machinery at every stage. Influences to steer nucleosome positioning are diverse, and the according importance of the DNA sequence in contrast to active chromatin remodeling has been the subject of long discussion. In this study, we evaluate the functional role of DNA sequence for all major elements along the process of transcription. We developed a random forest classifier based on local DNA structure that assesses the sequence-intrinsic support for nucleosome positioning. On this basis, we created a simple data resource that we applied genome-wide to the human genome. In our comprehensive analysis, we found a special role of DNA in mediating the competition of nucleosomes with cis-regulatory elements, in enabling steady transcription, for positioning of stable nucleosomes in exons, and for repelling nucleosomes during transcription termination. In contrast, we relate these findings to concurrent processes that generate strongly positioned nucleosomes in vivo that are not mediated by sequence, such as energy-dependent remodeling of chromatin.
Collapse
Affiliation(s)
- Malte Sahrhage
- Department of Medical Bioinformatics, University Medical Center, Göttingen, Germany
| | - Niels Benjamin Paul
- Department of Medical Bioinformatics, University Medical Center, Göttingen, Germany
- Department of Cardiology and Pneumology, University Medical Center, Göttingen, Germany
| | - Tim Beißbarth
- Department of Medical Bioinformatics, University Medical Center, Göttingen, Germany
| | - Martin Haubrock
- Department of Medical Bioinformatics, University Medical Center, Göttingen, Germany
| |
Collapse
|
130
|
Nammo T, Funahashi N, Udagawa H, Kozawa J, Nakano K, Shimizu Y, Okamura T, Kawaguchi M, Uebanso T, Nishimura W, Hiramoto M, Shimomura I, Yasuda K. Single-housing-induced islet epigenomic changes are related to polymorphisms in diabetic KK mice. Life Sci Alliance 2024; 7:e202302099. [PMID: 38876803 PMCID: PMC11178941 DOI: 10.26508/lsa.202302099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 05/17/2024] [Accepted: 05/20/2024] [Indexed: 06/16/2024] Open
Abstract
A lack of social relationships is increasingly recognized as a type 2 diabetes (T2D) risk. To investigate the underlying mechanism, we used male KK mice, an inbred strain with spontaneous diabetes. Given the association between living alone and T2D risk in humans, we divided the non-diabetic mice into singly housed (KK-SH) and group-housed control mice. Around the onset of diabetes in KK-SH mice, we compared H3K27ac ChIP-Seq with RNA-Seq using pancreatic islets derived from each experimental group, revealing a positive correlation between single-housing-induced changes in H3K27ac and gene expression levels. In particular, single-housing-induced H3K27ac decreases revealed a significant association with islet cell functions and GWAS loci for T2D and related diseases, with significant enrichment of binding motifs for transcription factors representative of human diabetes. Although these H3K27ac regions were preferentially localized to a polymorphic genomic background, SNVs and indels did not cause sequence disruption of enriched transcription factor motifs in most of these elements. These results suggest alternative roles of genetic variants in environment-dependent epigenomic changes and provide insights into the complex mode of disease inheritance.
Collapse
Affiliation(s)
- Takao Nammo
- https://ror.org/00r9w3j27 Department of Metabolic Disorder, Diabetes Research Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
- Department of Metabolic Medicine, Osaka University Graduate School of Medicine, Osaka, Japan
- Department of Diabetes Care Medicine, Osaka University Graduate School of Medicine, Osaka, Japan
| | - Nobuaki Funahashi
- https://ror.org/00r9w3j27 Department of Metabolic Disorder, Diabetes Research Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
- Department of Life Science and Technology, Tokyo Institute of Technology, Yokohama, Japan
| | - Haruhide Udagawa
- https://ror.org/00r9w3j27 Department of Metabolic Disorder, Diabetes Research Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
- Department of Registered Dietitians, Faculty of Health and Nutrition, Bunkyo University, Chigasaki, Japan
| | - Junji Kozawa
- Department of Metabolic Medicine, Osaka University Graduate School of Medicine, Osaka, Japan
- Department of Diabetes Care Medicine, Osaka University Graduate School of Medicine, Osaka, Japan
| | - Kenta Nakano
- https://ror.org/00r9w3j27 Department of Laboratory Animal Medicine, Research Institute, National Center for Global Health and Medicine (NCGM), Tokyo, Japan
| | - Yukiko Shimizu
- https://ror.org/00r9w3j27 Department of Laboratory Animal Medicine, Research Institute, National Center for Global Health and Medicine (NCGM), Tokyo, Japan
| | - Tadashi Okamura
- https://ror.org/00r9w3j27 Department of Laboratory Animal Medicine, Research Institute, National Center for Global Health and Medicine (NCGM), Tokyo, Japan
| | - Miho Kawaguchi
- https://ror.org/00r9w3j27 Department of Metabolic Disorder, Diabetes Research Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
| | - Takashi Uebanso
- https://ror.org/00r9w3j27 Department of Metabolic Disorder, Diabetes Research Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
- Department of Preventive Environment and Nutrition, Institute of Biomedical Sciences, Tokushima University Graduate School, Tokushima, Japan
| | - Wataru Nishimura
- https://ror.org/00r9w3j27 Department of Metabolic Disorder, Diabetes Research Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
- Department of Molecular Biology, International University of Health and Welfare School of Medicine, Chiba, Japan
- Division of Anatomy, Bio-Imaging and Neuro-cell Science, Jichi Medical University, Tochigi, Japan
| | - Masaki Hiramoto
- https://ror.org/00r9w3j27 Department of Metabolic Disorder, Diabetes Research Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
- Department of Biochemistry, Tokyo Medical University, Tokyo, Japan
| | - Iichiro Shimomura
- Department of Metabolic Medicine, Osaka University Graduate School of Medicine, Osaka, Japan
| | - Kazuki Yasuda
- https://ror.org/00r9w3j27 Department of Metabolic Disorder, Diabetes Research Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
- Department of Diabetes, Endocrinology and Metabolism, Kyorin University School of Medicine, Tokyo, Japan
| |
Collapse
|
131
|
Vargas-Pinilla P, S Oliveira Fam B, Medina Tavares G, Lima T, Landau L, Paré P, de Cássia Aleixo Tostes R, Pissinatti A, Falótico T, Costa-Neto C, Maestri R, Bortolini MC. From molecular variations to behavioral adaptations: Unveiling adaptive epistasis in primate oxytocin system. AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY 2024; 184:e24947. [PMID: 38783700 DOI: 10.1002/ajpa.24947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 04/12/2024] [Accepted: 04/24/2024] [Indexed: 05/25/2024]
Abstract
OBJECTIVE Our primary objective was to investigate the variability of oxytocin (OT) and the GAMEN binding motif within the LNPEP oxytocinase in primates. MATERIALS AND METHODS We sequenced the LNPEP segment encompassing the GAMEN motif in 34 Platyrrhini species, with 21 of them also sequenced for the OT gene. Our dataset was supplemented with primate sequences of LNPEP, OT, and the oxytocin receptor (OTR) sourced from public databases. Evolutionary analysis and coevolution predictions were made followed by the macroevolution analysis of relevant amino acids associated with phenotypic traits, such as mating systems, parental care, and litter size. To account for phylogenetic structure, we utilized two distinct statistical tests. Additionally, we calculated binding energies focusing on the interaction between Callithtrix jacchus VAMEN and Pro8OT. RESULTS We identified two novel motifs (AAMEN and VAMEN), challenging the current knowledge of motif conservation in placental mammals. Coevolution analysis demonstrated a correlation between GAMEN, AAMEN, and VAMEN and their corresponding OTs and OTRs. Callithrix jacchus exhibited a higher binding energy between VAMEN and Pro8OT than orthologous molecules found in humans (GAMEN and Leu8OT). DISCUSSION The coevolution of AAMEN and VAMEN with their corresponding OTs and OTRs suggests a functional relationship that could have contributed to specific reproductive and adaptive behaviors, including paternal care, social monogamy, and twin births, prominent traits in Cebidae species, such as marmosets and tamarins. Our findings underscore the coevolution of taxon-specific amino acids among the three studied molecules, shedding light on the oxytocinergic system as an adaptive epistatic repertoire in primates.
Collapse
Affiliation(s)
- Pedro Vargas-Pinilla
- Laboratory of Human and Molecular Evolution, Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
- Departamento de Farmacologia, Faculdade de Medicina, Universidade de São Paulo, Ribeirão Preto, Brazil
| | - Bibiana S Oliveira Fam
- Laboratory of Human and Molecular Evolution, Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
- Laboratório de Medicina Genômica, Centro de Pesquisa Experimental (CPE), Hospital de Clínicas de Porto Alegre, Porto Alegre, Brazil
| | - Gustavo Medina Tavares
- Laboratory of Human and Molecular Evolution, Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| | - Thaynara Lima
- Laboratory of Human and Molecular Evolution, Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| | - Luane Landau
- Laboratory of Human and Molecular Evolution, Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
- Department of Biological Sciences, University at Buffalo, Buffalo, New York, USA
| | - Pâmela Paré
- Laboratory of Human and Molecular Evolution, Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| | | | | | - Tiago Falótico
- Escola de Artes, Ciências e Humanidades, Universidade de São Paulo, São Paulo, Brazil
| | - Cláudio Costa-Neto
- Departamento de Bioquímica e Imunologia, Faculdade de Medicina, Universidade de São Paulo, Ribeirão Preto, Brazil
| | - Renan Maestri
- Laboratório de Ecomorfologia e Macroevolução, Departamento de Ecologia, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| | - Maria Cátira Bortolini
- Laboratory of Human and Molecular Evolution, Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| |
Collapse
|
132
|
Fernandes M, Mario de Andrade E, Reis da Silva SG, Romagnoli VDS, Ortega JM, Antônio de Oliveira Mendes T. Geneapp: A web application for visualizing alternative splicing for biomedicine. Comput Biol Med 2024; 178:108789. [PMID: 38936077 DOI: 10.1016/j.compbiomed.2024.108789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 06/17/2024] [Accepted: 06/18/2024] [Indexed: 06/29/2024]
Abstract
Alternative Splicing (AS) is an essential mechanism for eukaryotes. However, the consequences of deleting a single exon can be dramatic for the organism and can lead to cancer in humans. Additionally, alternative 5' and 3' splice sites, which define the boundaries of exons, also play key roles to human disorders. Therefore, Investigating AS events is crucial for understanding the molecular basis of human diseases and developing therapeutic strategies. Workflow for AS event analysis can be sampling followed by data analysis with bioinformatics to identify the different AS events in the control and case samples, data visualization for curation, and selection of relevant targets for experimental validation. The raw output of the analysis software does not favor the inspection of events by bioinformaticians requiring custom scripts for data visualization. In this work, we propose the Geneapp application with three modules: GeneappScript, GeneappServer, and GeneappExplorer. GeneappScript is a wrapper that assists in identifying AS in samples compared in two different approaches, while GeneappServer integrates data from AS analysis already performed by the user. In GeneappExplorer, the user visualizes the previous dataset by exploring AS events in genes with functional annotation. This targeted screens that Geneapp allows to perform helps in the identification of targets for experimental validation to confirm the hypotheses under study. The Geneapp is freely available for non-commercial use at https://geneapp.net to advance research on AS for bioinformatics.
Collapse
Affiliation(s)
- Miquéias Fernandes
- Postgraduation Program in Bioinformatics, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil; Department of Biochemistry and Molecular Biology, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil; Institute of Applied Biotechnology to Agriculture (BIOAGRO), Universidade Federal de Viçosa, Minas Gerais, Brazil.
| | - Edson Mario de Andrade
- Postgraduation Program in Bioinformatics, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil; Department of Biochemistry and Molecular Biology, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil; Institute of Applied Biotechnology to Agriculture (BIOAGRO), Universidade Federal de Viçosa, Minas Gerais, Brazil
| | - Saymon Gazolla Reis da Silva
- Department of Biochemistry and Molecular Biology, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil; Institute of Applied Biotechnology to Agriculture (BIOAGRO), Universidade Federal de Viçosa, Minas Gerais, Brazil
| | - Vinícius Dos Santos Romagnoli
- Department of Biochemistry and Molecular Biology, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil; Institute of Applied Biotechnology to Agriculture (BIOAGRO), Universidade Federal de Viçosa, Minas Gerais, Brazil
| | - José Miguel Ortega
- Postgraduation Program in Bioinformatics, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Tiago Antônio de Oliveira Mendes
- Postgraduation Program in Bioinformatics, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil; Department of Biochemistry and Molecular Biology, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil; Institute of Applied Biotechnology to Agriculture (BIOAGRO), Universidade Federal de Viçosa, Minas Gerais, Brazil.
| |
Collapse
|
133
|
Jirapongwattana N, Bunting SF, Ronning DR, Ghosal G, Karpf AR. RHNO1: at the crossroads of DNA replication stress, DNA repair, and cancer. Oncogene 2024; 43:2613-2620. [PMID: 39107463 DOI: 10.1038/s41388-024-03117-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 07/24/2024] [Accepted: 07/26/2024] [Indexed: 08/28/2024]
Abstract
The DNA replication stress (DRS) response is a crucial homeostatic mechanism for maintaining genome integrity in the face of intrinsic and extrinsic barriers to DNA replication. Importantly, DRS is often significantly increased in tumor cells, making tumors dependent on the cellular DRS response for growth and survival. Rad9-Hus1-Rad1 Interacting Nuclear Orphan 1 (RHNO1), a protein involved in the DRS response, has recently emerged as a potential therapeutic target in cancer. RHNO1 interacts with the 9-1-1 checkpoint clamp and TopBP1 to activate the ATR/Chk1 signaling pathway, the crucial mediator of the DRS response. Moreover, RHNO1 was also recently identified as a key facilitator of theta-mediated end joining (TMEJ), a DNA repair mechanism implicated in cancer progression and chemoresistance. In this literature review, we provide an overview of our current understanding of RHNO1, including its structure, function in the DRS response, and role in DNA repair, and discuss its potential as a cancer therapeutic target. Therapeutic targeting of RHNO1 holds promise for tumors with elevated DRS as well as tumors with DNA repair deficiencies, including homologous recombination DNA repair deficient (HRD) tumors. Further investigation into RHNO1 function in cancer, and development of approaches to target RHNO1, are expected to yield novel strategies for cancer treatment.
Collapse
Affiliation(s)
- Niphat Jirapongwattana
- Eppley Institute for Research in Cancer, University of Nebraska Medical Center, Omaha, NE, 68198-6805, USA
- Fred & Pamela Buffett Cancer Center, University of Nebraska Medical Center, Omaha, NE, 68198-6805, USA
| | - Samuel F Bunting
- Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854-8021, USA
| | - Donald R Ronning
- Department of Pharmaceutical Sciences, University of Nebraska Medical Center, Omaha, NE, 68198-6805, USA
| | - Gargi Ghosal
- Fred & Pamela Buffett Cancer Center, University of Nebraska Medical Center, Omaha, NE, 68198-6805, USA
- Department of Genetics, Cell Biology, and Anatomy, University of Nebraska Medical Center, Omaha, NE, 68198-6805, USA
| | - Adam R Karpf
- Eppley Institute for Research in Cancer, University of Nebraska Medical Center, Omaha, NE, 68198-6805, USA.
- Fred & Pamela Buffett Cancer Center, University of Nebraska Medical Center, Omaha, NE, 68198-6805, USA.
| |
Collapse
|
134
|
Sperfeld M, Narváez-Barragán DA, Malitsky S, Frydman V, Yuda L, Rocha J, Segev E. Algal methylated compounds shorten the lag phase of Phaeobacter inhibens bacteria. Nat Microbiol 2024; 9:2006-2021. [PMID: 38969820 PMCID: PMC11306105 DOI: 10.1038/s41564-024-01742-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Accepted: 05/22/2024] [Indexed: 07/07/2024]
Abstract
The lag phase is key in resuming bacterial growth, but it remains underexplored particularly in environmental bacteria. Here we use transcriptomics and 13C-labelled metabolomics to show that the lag phase of the model marine bacterium Phaeobacter inhibens is shortened by methylated compounds produced by the microalgal partner, Emiliania huxleyi. Methylated compounds are abundantly produced and released by microalgae, and we show that their methyl groups can be collected by bacteria and assimilated through the methionine cycle. Our findings underscore the significance of methyl groups as a limiting factor during the lag phase and highlight the adjustability of this growth phase. In addition, we show that methylated compounds, typical of photosynthetic organisms, prompt diverse reductions in lag times in bacteria associated with algae and plants, potentially favouring early growth in some bacteria. These findings suggest ways to accelerate bacterial growth and underscore the significance of studying bacteria within an environmental context.
Collapse
Affiliation(s)
- Martin Sperfeld
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel
- Institute of Microbiology, ETH, Zurich, Switzerland
| | | | - Sergey Malitsky
- Department of Life Sciences Core Facilities, Weizmann Institute of Science, Rehovot, Israel
| | - Veronica Frydman
- Department of Chemical Research Support, Weizmann Institute of Science, Rehovot, Israel
| | - Lilach Yuda
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Jorge Rocha
- Agricultura en Zonas Áridas, Centro de Investigaciones Biológicas del Noroeste, La Paz, Mexico
| | - Einat Segev
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
135
|
Wright MW, Thaxton CL, Nelson T, DiStefano MT, Savatt JM, Brush MH, Cheung G, Mandell ME, Wulf B, Ward TJ, Goehringer S, O'Neill T, Weller P, Preston CG, Keseler IM, Goldstein JL, Strande NT, McGlaughon J, Azzariti DR, Cordova I, Dziadzio H, Babb L, Riehle K, Milosavljevic A, Martin CL, Rehm HL, Plon SE, Berg JS, Riggs ER, Klein TE. Generating Clinical-Grade Gene-Disease Validity Classifications Through the ClinGen Data Platforms. Annu Rev Biomed Data Sci 2024; 7:31-50. [PMID: 38663031 DOI: 10.1146/annurev-biodatasci-102423-112456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/25/2024]
Abstract
Clinical genetic laboratories must have access to clinically validated biomedical data for precision medicine. A lack of accessibility, normalized structure, and consistency in evaluation complicates interpretation of disease causality, resulting in confusion in assessing the clinical validity of genes and genetic variants for diagnosis. A key goal of the Clinical Genome Resource (ClinGen) is to fill the knowledge gap concerning the strength of evidence supporting the role of a gene in a monogenic disease, which is achieved through a process known as Gene-Disease Validity curation. Here we review the work of ClinGen in developing a curation infrastructure that supports the standardization, harmonization, and dissemination of Gene-Disease Validity data through the creation of frameworks and the utilization of common data standards. This infrastructure is based on several applications, including the ClinGen GeneTracker, Gene Curation Interface, Data Exchange, GeneGraph, and website.
Collapse
Affiliation(s)
- Matt W Wright
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California, USA; ,
| | - Courtney L Thaxton
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA;
| | | | - Marina T DiStefano
- Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | | | - Matthew H Brush
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
| | - Gloria Cheung
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California, USA; ,
| | - Mark E Mandell
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California, USA; ,
| | - Bryan Wulf
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California, USA; ,
| | - T J Ward
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA;
| | | | - Terry O'Neill
- Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | | | - Christine G Preston
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California, USA; ,
| | - Ingrid M Keseler
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California, USA; ,
| | - Jennifer L Goldstein
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA;
| | | | - Jennifer McGlaughon
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA;
| | - Danielle R Azzariti
- Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | | | - Hannah Dziadzio
- Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Lawrence Babb
- Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Kevin Riehle
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | | | | | - Heidi L Rehm
- Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Sharon E Plon
- Department of Pediatrics, Division of Hematology-Oncology, Baylor College of Medicine, Houston, Texas, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Jonathan S Berg
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA;
| | | | - Teri E Klein
- Departments of Medicine (Biomedical Informatics Research) and Genetics, Stanford University School of Medicine, Stanford, California, USA
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California, USA; ,
| |
Collapse
|
136
|
Wyler E, Lauber C, Manukyan A, Deter A, Quedenau C, Teixeira Alves LG, Wylezich C, Borodina T, Seitz S, Altmüller J, Landthaler M. Pathogen dynamics and discovery of novel viruses and enzymes by deep nucleic acid sequencing of wastewater. ENVIRONMENT INTERNATIONAL 2024; 190:108875. [PMID: 39002331 DOI: 10.1016/j.envint.2024.108875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 07/03/2024] [Accepted: 07/03/2024] [Indexed: 07/15/2024]
Abstract
Wastewater contains an extensive reservoir of genetic information, yet largely unexplored. Here, we analyzed by high-throughput sequencing total nucleic acids extracted from wastewater samples collected during a 17 month-period in Berlin, Germany. By integrating global wastewater datasets and applying a novel computational approach to accurately identify viral strains within sewage RNA-sequencing data, we demonstrated the emergence and global dissemination of a specific astrovirus strain. Astrovirus abundance and sequence variation mirrored temporal and spatial patterns of infection, potentially serving as footprints of specific timeframes and geographical locations. Additionally, we revealed more than 100,000 sequence contigs likely originating from novel viral species, exhibiting distinct profiles in total RNA and DNA datasets and including undescribed bunyaviruses and parvoviruses. Finally, we identified thousands of new CRISPR-associated protein sequences, including Transposase B (TnpB), a class of compact, RNA-guided DNA editing enzymes. Collectively, our findings underscore the potential of high-throughput sequencing of total nucleic acids derived from wastewater for a broad range of applications.
Collapse
Affiliation(s)
- Emanuel Wyler
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
| | - Chris Lauber
- Institute for Experimental Virology, TWINCORE Centre for Experimental and Clinical Infection Research, A Joint Venture between the Hannover Medical School (MHH) and the Helmholtz Centre for Infection Research (HZI), Hannover, Germany; Cluster of Excellence RESIST (EXC 2155), Hannover Medical School, Hannover, Germany
| | - Artür Manukyan
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
| | - Aylina Deter
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
| | - Claudia Quedenau
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
| | - Luiz Gustavo Teixeira Alves
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
| | - Claudia Wylezich
- Institute of Diagnostic Virology, Friedrich-Loeffler-Institut, Federal Research Institute for Animal Health, Greifswald-Insel Riems, Germany
| | - Tatiana Borodina
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
| | - Stefan Seitz
- Division of Virus-Associated Carcinogenesis (F170), German Cancer Research Center (DKFZ), Heidelberg, Germany; Department of Infectious Diseases, Molecular Virology, University of Heidelberg, Heidelberg, Germany
| | - Janine Altmüller
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany; Berlin Institute of Health at Charité, Berlin, Germany
| | - Markus Landthaler
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany; Institut für Biologie, Humboldt-Universität zu Berlin, Berlin, Germany.
| |
Collapse
|
137
|
Matsuda Y, Makino T. Comparative genomics reveals convergent signals associated with the high metabolism and longevity in birds and bats. Proc Biol Sci 2024; 291:20241068. [PMID: 39191281 DOI: 10.1098/rspb.2024.1068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Revised: 06/27/2024] [Accepted: 07/24/2024] [Indexed: 08/29/2024] Open
Abstract
Birds and bats have long lifespans relative to their body size compared with non-flying animals. However, the genomic basis associated with longer lifespan of flying species despite their higher metabolism was unclear. In this study, we hypothesized that genes involved in the regulation of metabolism and lifespan changed with the acquisition of flight and searched for genes that show specific evolutionary patterns in flying species. As a result, we identified several genes that show different evolutionary rates in bird and bat lineages. Genes in pathways involved in lifespan regulation were conserved in birds, while they evolved at an accelerated rate in bats. We also searched for genes in which convergent amino acid substitutions occurred in birds and bats and found such substitutions in genes involved in cancer, reactive oxygen species control and immunity. Our study revealed genomic changes associated with the acquisition of flight in birds and bats and suggested that multiple genes involved in the regulation of lifespan and metabolism support both high metabolism and longevity in flying species.
Collapse
Affiliation(s)
- Yuki Matsuda
- United Graduate School of Agricultural Science, Tokyo University of Agriculture and Technology, Saiwai-cho , Fuchu-shi, Tokyo 183-8509, Japan
- Graduate School of Life Sciences, Tohoku University, Aoba-ku , Sendai 980-8578, Japan
| | - Takashi Makino
- Graduate School of Life Sciences, Tohoku University, Aoba-ku , Sendai 980-8578, Japan
| |
Collapse
|
138
|
Nagelberg AL, Sihota TS, Chuang YC, Shi R, Chow JLM, English J, MacAulay C, Lam S, Lam WL, Lockwood WW. Integrative genomics identifies SHPRH as a tumor suppressor gene in lung adenocarcinoma that regulates DNA damage response. Br J Cancer 2024; 131:534-550. [PMID: 38890444 PMCID: PMC11300780 DOI: 10.1038/s41416-024-02755-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 06/03/2024] [Accepted: 06/06/2024] [Indexed: 06/20/2024] Open
Abstract
BACKGROUND Identification of driver mutations and development of targeted therapies has considerably improved outcomes for lung cancer patients. However, significant limitations remain with the lack of identified drivers in a large subset of patients. Here, we aimed to assess the genomic landscape of lung adenocarcinomas (LUADs) from individuals without a history of tobacco use to reveal new genetic drivers of lung cancer. METHODS Integrative genomic analyses combining whole-exome sequencing, copy number, and mutational information for 83 LUAD tumors was performed and validated using external datasets to identify genetic variants with a predicted functional consequence and assess association with clinical outcomes. LUAD cell lines with alteration of identified candidates were used to functionally characterize tumor suppressive potential using a conditional expression system both in vitro and in vivo. RESULTS We identified 21 genes with evidence of positive selection, including 12 novel candidates that have yet to be characterized in LUAD. In particular, SNF2 Histone Linker PHD RING Helicase (SHPRH) was identified due to its frequency of biallelic disruption and location within the familial susceptibility locus on chromosome arm 6q. We found that low SHPRH mRNA expression is associated with poor survival outcomes in LUAD patients. Furthermore, we showed that re-expression of SHPRH in LUAD cell lines with inactivating alterations for SHPRH reduces their in vitro colony formation and tumor burden in vivo. Finally, we explored the biological pathways associated SHPRH inactivation and found an association with the tolerance of LUAD cells to DNA damage. CONCLUSIONS These data suggest that SHPRH is a tumor suppressor gene in LUAD, whereby its expression is associated with more favorable patient outcomes, reduced tumor and mutational burden, and may serve as a predictor of response to DNA damage. Thus, further exploration into the role of SHPRH in LUAD development may make it a valuable biomarker for predicting LUAD risk and prognosis.
Collapse
Affiliation(s)
- Amy L Nagelberg
- Department of Integrative Oncology, BC Cancer Research Institute, Vancouver, BC, Canada
- Department of Pathology & Laboratory Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Tianna S Sihota
- Department of Integrative Oncology, BC Cancer Research Institute, Vancouver, BC, Canada
- Department of Pathology & Laboratory Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Yu-Chi Chuang
- Department of Integrative Oncology, BC Cancer Research Institute, Vancouver, BC, Canada
- Interdisciplinary Oncology Program, University of British Columbia, Vancouver, BC, Canada
| | - Rocky Shi
- Department of Integrative Oncology, BC Cancer Research Institute, Vancouver, BC, Canada
- Interdisciplinary Oncology Program, University of British Columbia, Vancouver, BC, Canada
| | - Justine L M Chow
- Department of Integrative Oncology, BC Cancer Research Institute, Vancouver, BC, Canada
| | - John English
- Department of Pathology & Laboratory Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Calum MacAulay
- Department of Integrative Oncology, BC Cancer Research Institute, Vancouver, BC, Canada
- Department of Pathology & Laboratory Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Stephen Lam
- Department of Integrative Oncology, BC Cancer Research Institute, Vancouver, BC, Canada
| | - Wan L Lam
- Department of Integrative Oncology, BC Cancer Research Institute, Vancouver, BC, Canada
- Department of Pathology & Laboratory Medicine, University of British Columbia, Vancouver, BC, Canada
- Interdisciplinary Oncology Program, University of British Columbia, Vancouver, BC, Canada
| | - William W Lockwood
- Department of Integrative Oncology, BC Cancer Research Institute, Vancouver, BC, Canada.
- Department of Pathology & Laboratory Medicine, University of British Columbia, Vancouver, BC, Canada.
- Interdisciplinary Oncology Program, University of British Columbia, Vancouver, BC, Canada.
| |
Collapse
|
139
|
Kudriavskii VV, Goncharov AO, Eremeev AV, Ruchko ES, Veselovsky VA, Klimina KM, Bogomazova AN, Lagarkova MA, Moshkovskii SA, Kliuchnikova AA. RNA Editing by ADAR Adenosine Deaminases in the Cell Models of CAG Repeat Expansion Diseases: Significant Effect of Differentiation from Stem Cells into Brain Organoids in the Absence of Substantial Influence of CAG Repeats on the Level of Editing. BIOCHEMISTRY. BIOKHIMIIA 2024; 89:1474-1489. [PMID: 39245456 DOI: 10.1134/s0006297924080078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 06/24/2024] [Accepted: 07/11/2024] [Indexed: 09/10/2024]
Abstract
Expansion of CAG repeats in certain genes is a known cause of several neurodegenerative diseases, but exact mechanism behind this is not yet fully understood. It is believed that the double-stranded RNA regions formed by CAG repeats could be harmful to the cell. This study aimed to test the hypothesis that these RNA regions might potentially interfere with ADAR RNA editing enzymes, leading to the reduced A-to-I editing of RNA and activation of the interferon response. We studied induced pluripotent stem cells (iPSCs) derived from the patients with Huntington's disease or ataxia type 17, as well as midbrain organoids developed from these cells. A targeted panel for next-generation sequencing was used to assess editing in the specific RNA regions. Differentiation of iPSCs into brain organoids led to increase in the ADAR2 gene expression and decrease in the expression of protein inhibitors of RNA editing. As a result, there was increase in the editing of specific ADAR2 substrates, which allowed identification of differential substrates of ADAR isoforms. However, comparison of the pathology and control groups did not show differences in the editing levels among the iPSCs. Additionally, brain organoids with 42-46 CAG repeats did not exhibit global changes. On the other hand, brain organoids with the highest number of CAG repeats in the huntingtin gene (76) showed significant decrease in the level of RNA editing of specific transcripts, potentially involving ADAR1. Notably, editing of the long non-coding RNA PWAR5 was nearly absent in this sample. It could be stated in conclusion that in most cultures with repeat expansion, the hypothesized effect on RNA editing was not confirmed.
Collapse
Affiliation(s)
- Viacheslav V Kudriavskii
- Pirogov Russian National Research Medical University, Moscow, 117997, Russia
- Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, 119435, Russia
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow, 121205, Russia.
| | - Anton O Goncharov
- Pirogov Russian National Research Medical University, Moscow, 117997, Russia
- Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, 119435, Russia
| | - Artem V Eremeev
- Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, 119435, Russia
| | - Evgenii S Ruchko
- Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, 119435, Russia
| | - Vladimir A Veselovsky
- Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, 119435, Russia
| | - Ksenia M Klimina
- Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, 119435, Russia
| | - Alexandra N Bogomazova
- Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, 119435, Russia
| | - Maria A Lagarkova
- Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, 119435, Russia
| | - Sergei A Moshkovskii
- Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, 119435, Russia
- Institute of Biomedical Chemistry, Moscow, 119121, Russia
- Max Planck Institute for Interdisciplinary Research, Göttingen, 37077, Germany.
| | - Anna A Kliuchnikova
- Pirogov Russian National Research Medical University, Moscow, 117997, Russia
- Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, 119435, Russia
- Institute of Biomedical Chemistry, Moscow, 119121, Russia
| |
Collapse
|
140
|
Othonicar MF, Garcia GS, Oliveira MT. The alternative enzymes-bearing tunicates lack multiple widely distributed genes coding for peripheral OXPHOS subunits. BIOCHIMICA ET BIOPHYSICA ACTA. BIOENERGETICS 2024; 1865:149046. [PMID: 38642871 DOI: 10.1016/j.bbabio.2024.149046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Revised: 04/01/2024] [Accepted: 04/17/2024] [Indexed: 04/22/2024]
Abstract
The respiratory chain alternative enzymes (AEs) NDX and AOX from the tunicate Ciona intestinalis (Ascidiacea) have been xenotopically expressed and characterized in human cells in culture and in the model organisms Drosophila melanogaster and mouse, with the purpose of developing bypass therapies to combat mitochondrial diseases in human patients with defective complexes I and III/IV, respectively. The fact that the genes coding for NDX and AOX have been lost from genomes of evolutionarily successful animal groups, such as vertebrates and insects, led us to investigate if the composition of the respiratory chain of Ciona and other tunicates differs significantly from that of humans and Drosophila, to accommodate the natural presence of AEs. We have failed to identify in tunicate genomes fifteen orthologous genes that code for subunits of the respiratory chain complexes; all of these putatively missing subunits are peripheral to complexes I, III and IV in mammals, and many are important for complex-complex interaction in supercomplexes (SCs), such as NDUFA11, UQCR11 and COX7A. Modeling of all respiratory chain subunit polypeptides of Ciona indicates significant structural divergence that is consistent with the lack of these fifteen clear orthologous subunits. We also provide evidence using Ciona AOX expressed in Drosophila that this AE cannot access the coenzyme Q pool reduced by complex I, but it is readily available to oxidize coenzyme Q molecules reduced by glycerophosphate oxidase, a mitochondrial inner membrane-bound dehydrogenase that is not involved in SCs. Altogether, our results suggest that Ciona AEs might have evolved in a mitochondrial inner membrane environment much different from that of mammals and insects, possibly without SCs; this correlates with the preferential functional interaction between these AEs and non-SC dehydrogenases in heterologous mammalian and insect systems. We discuss the implications of these findings for the applicability of Ciona AEs in human bypass therapies and for our understanding of the evolution of animal respiratory chain.
Collapse
Affiliation(s)
- Murilo F Othonicar
- Departamento de Biotecnologia, Faculdade de Ciências Agrárias e Veterinárias de Jaboticabal, Universidade Estadual Paulista "Júlio de Mesquita Filho", Jaboticabal, SP, Brazil
| | - Geovana S Garcia
- Departamento de Biotecnologia, Faculdade de Ciências Agrárias e Veterinárias de Jaboticabal, Universidade Estadual Paulista "Júlio de Mesquita Filho", Jaboticabal, SP, Brazil
| | - Marcos T Oliveira
- Departamento de Biotecnologia, Faculdade de Ciências Agrárias e Veterinárias de Jaboticabal, Universidade Estadual Paulista "Júlio de Mesquita Filho", Jaboticabal, SP, Brazil.
| |
Collapse
|
141
|
Veglia AJ, Rivera-Vicéns RE, Grupstra CGB, Howe-Kerr LI, Correa AMS. vAMPirus: A versatile amplicon processing and analysis program for studying viruses. Mol Ecol Resour 2024; 24:e13978. [PMID: 38775206 DOI: 10.1111/1755-0998.13978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 05/08/2024] [Indexed: 07/31/2024]
Abstract
Amplicon sequencing is an effective and increasingly applied method for studying viral communities in the environment. Here, we present vAMPirus, a user-friendly, comprehensive, and versatile DNA and RNA virus amplicon sequence analysis program, designed to support investigators in exploring virus amplicon sequencing data and running informed, reproducible analyses. vAMPirus intakes raw virus amplicon libraries and, by default, performs nucleotide- and amino acid-based analyses to produce results such as sequence abundance information, taxonomic classifications, phylogenies and community diversity metrics. The vAMPirus analytical framework leverages 16 different opensource tools and provides optional approaches that can increase the ratio of biological signal-to-noise and thereby reveal patterns that would have otherwise been masked. Here, we validate the vAMPirus analytical framework and illustrate its implementation as a general virus amplicon sequencing workflow by recapitulating findings from two previously published double-stranded DNA virus datasets. As a case study, we also apply the program to explore the diversity and distribution of a coral reef-associated RNA virus. vAMPirus is streamlined within Nextflow, offering straightforward scalability, standardization and communication of virus lineage-specific analyses. The vAMPirus framework is designed to be adaptable; community-driven analytical standards will continue to be incorporated as the field advances. vAMPirus supports researchers in revealing patterns of virus diversity and population dynamics in nature, while promoting study reproducibility and comparability.
Collapse
Affiliation(s)
- Alex J Veglia
- BioSciences Department, Rice University, Houston, Texas, USA
- Department of Biology, University of Puerto Rico Mayagüez, Mayagüez, Puerto Rico, USA
- EcoAzul, La Parguera, Puerto Rico, USA
| | - Ramón E Rivera-Vicéns
- EcoAzul, La Parguera, Puerto Rico, USA
- Department of Earth and Environmental Sciences, Paleontology & Geobiology, Ludwig-Maximilians-Universität München, Munich, Germany
- Department of Sciences and Technology, Interamerican University of Puerto Rico at Barranquitas, Barranquitas, Puerto Rico, USA
| | - Carsten G B Grupstra
- BioSciences Department, Rice University, Houston, Texas, USA
- Department of Biology, Boston University, Boston, Massachusetts, USA
| | - Lauren I Howe-Kerr
- BioSciences Department, Rice University, Houston, Texas, USA
- Minderoo Foundation, Perth, Western Australia, Australia
| | - Adrienne M S Correa
- BioSciences Department, Rice University, Houston, Texas, USA
- Department of Environmental Science, Policy, and Management, University of California, California, USA
| |
Collapse
|
142
|
Tyagi S, Katara P. Metatranscriptomics: A Tool for Clinical Metagenomics. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2024; 28:394-407. [PMID: 39029911 DOI: 10.1089/omi.2024.0130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/21/2024]
Abstract
In the field of bioinformatics, amplicon sequencing of 16S rRNA genes has long been used to investigate community membership and taxonomic abundance in microbiome studies. As we can observe, shotgun metagenomics has become the dominant method in this field. This is largely owing to advancements in sequencing technology, which now allow for random sequencing of the entire genetic content of a microbiome. Furthermore, this method allows profiling both genes and the microbiome's membership. Although these methods have provided extensive insights into various microbiomes, they solely assess the existence of organisms or genes, without determining their active role within the microbiome. Microbiome scholarship now includes metatranscriptomics to decipher how a community of microorganisms responds to changing environmental conditions over a period of time. Metagenomic studies identify the microbes that make up a community but metatranscriptomics explores the diversity of active genes within that community, understanding their expression profile and observing how these genes respond to changes in environmental conditions. This expert review article offers a critical examination of the computational metatranscriptomics tools for studying the transcriptomes of microbial communities. First, we unpack the reasons behind the need for community transcriptomics. Second, we explore the prospects and challenges of metatranscriptomic workflows, starting with isolation and sequencing of the RNA community, then moving on to bioinformatics approaches for quantifying RNA features, and statistical techniques for detecting differential expression in a community. Finally, we discuss strengths and shortcomings in relation to other microbiome analysis approaches, pipelines, use cases and limitations, and contextualize metatranscriptomics as a tool for clinical metagenomics.
Collapse
Affiliation(s)
- Shivani Tyagi
- Computational Omics Lab, Centre of Bioinformatics, IIDS, University of Allahabad, Prayagraj, India
| | - Pramod Katara
- Computational Omics Lab, Centre of Bioinformatics, IIDS, University of Allahabad, Prayagraj, India
| |
Collapse
|
143
|
Hu F, Li X, Liu K, Li Y, Xie Y, Wei C, Liu S, Song J, Wang P, Shi L, Li C, Li J, Xu L, Xue J, Zheng X, Bai M, Fang X, Jin X, Cao L, Hao P, He J, Wang J, Zhang C, Li Z. Rheumatoid arthritis patients harbour aberrant enteric bacteriophages with autoimmunity-provoking potential: a paired sibling study. Ann Rheum Dis 2024:ard-2024-225564. [PMID: 39084885 DOI: 10.1136/ard-2024-225564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Accepted: 07/15/2024] [Indexed: 08/02/2024]
Abstract
OBJECTIVES Viruses have been considered as important participants in the development of rheumatoid arthritis (RA). However, the profile of enteric virome and its role in RA remains elusive. This study aimed to investigate the atlas and involvement of virome in RA pathogenesis. METHODS Faecal samples from 30 pairs of RA and healthy siblings that minimise genetic interferences were collected for metagenomic sequencing. The α and β diversity of the virome and the virome-bacteriome interaction were analysed. The differential bacteriophages were identified, and their correlations with clinical and immunological features of RA were analysed. The potential involvement of these differential bacteriophages in RA pathogenesis was further investigated by auxiliary metabolic gene annotation and molecular mimicry study. The responses of CD4+ T cells and B cells to the mimotopes derived from the differential bacteriophages were systemically studied. RESULTS The composition of the enteric bacteriophageome was distorted in RA. The differentially presented bacteriophages correlated with the immunological features of RA, including anti-CCP autoantibody and HLA-DR shared epitope. Intriguingly, the glycerolipid and purine metabolic genes were highly active in the bacteriophages from RA. Moreover, peptides of RA-enriched phages, in particular Prevotella phage and Oscillibacter phage could provoke the autoimmune responses in CD4+ T cells and plasma cells via molecular mimicry of the disease-associated autoantigen epitopes, especially those of Bip. CONCLUSIONS This study provides new insights into enteric bacteriophageome in RA development. In particular, the aberrant bacteriophages demonstrated autoimmunity-provoking potential that would promote the occurrence of the disease.
Collapse
Affiliation(s)
- Fanlei Hu
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
- State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University, Beijing, China
- Department of Integration of Chinese and Western Medicine, School of Basic Medical Sciences, Peking University, Beijing, China
| | - Xin Li
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Kai Liu
- Key Laboratory of Molecular Virology and Immunology, Shanghai Institute of Immunity and Infection, Chinese Academy of Sciences, University of Chinese Academy of Sciences, shanghai, China
- Department of Clinical Laboratory, Qingpu Branch of Zhongshan Hospital, Fudan University, Shanghai, China
| | - Yanpeng Li
- Shanghai Public Health Clinical Center, Fudan University, Shanghai, China
| | - Yang Xie
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
| | - Chaonan Wei
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
| | - Shuyan Liu
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
| | - Jing Song
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Ping Wang
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
| | - Lianjie Shi
- Department of Rheumatology and Immunology, Peking University Shougang Hospital, Beijing, China
| | - Chun Li
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
| | - Jing Li
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
| | - Liling Xu
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
| | - Jimeng Xue
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
| | - Xi Zheng
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Mingxin Bai
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
| | - Xiangyu Fang
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
| | - Xu Jin
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Lulu Cao
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
| | - Pei Hao
- Key Laboratory of Molecular Virology and Immunology, Shanghai Institute of Immunity and Infection, Chinese Academy of Sciences, University of Chinese Academy of Sciences, shanghai, China
| | - Jing He
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
| | - Jun Wang
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Science, Beijing, China
| | - Chiyu Zhang
- Key Laboratory of Molecular Virology and Immunology, Shanghai Institute of Immunity and Infection, Chinese Academy of Sciences, University of Chinese Academy of Sciences, shanghai, China
- Shanghai Public Health Clinical Center, Fudan University, Shanghai, China
| | - Zhanguo Li
- Department of Rheumatology and Immunology, Peking University People's Hospital & Beijing Key Laboratory for Rheumatism Mechanism and Immune Diagnosis (BZ0135), Beijing, China
- State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| |
Collapse
|
144
|
Ormond C, Ryan NM, Byerley W, Heron EA, Corvin A. Investigating copy number variants in schizophrenia pedigrees using a new consensus pipeline called PECAN. Sci Rep 2024; 14:17518. [PMID: 39080331 PMCID: PMC11289470 DOI: 10.1038/s41598-024-66021-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Accepted: 06/26/2024] [Indexed: 08/02/2024] Open
Abstract
Copy number variants (CNVs) have been implicated in many human diseases, including psychiatric disorders. Whole genome sequencing offers advantages in CNV calling compared to previous array-based methods. Here we present a robust and transparent CNV calling pipeline, PECAN (PEdigree Copy number vAriaNt calling), for short-read, whole genome sequencing data, comprised of a novel combination of four calling methods and structural variant genotyping. This method is scalable and can incorporate pedigree information to retain lower-confidence CNVs that would otherwise be discarded. We have robustly benchmarked PECAN using gold-standard CNV calls for two well-established evaluation samples, NA12878 and HG002, showing that PECAN performs with high precision and recall on both datasets, outperforming another pedigree-based CNV calling pipeline. As part of this work, we provide a list of high-confidence gold standard CNVs for the NA12878 reference sample, curated from multiple studies. We applied PECAN to a collection of pedigrees multiply affected with schizophrenia and identified a rare deletion that perfectly co-segregates with schizophrenia in one of the pedigrees. The CNV overlaps the gene PITRM1, which has been implicated in a complex phenotype including ataxia, developmental delay, and schizophrenia-like episodes in affected adults.
Collapse
Affiliation(s)
- Cathal Ormond
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity Centre for Health Sciences, Trinity College Dublin, James' Street, Dublin 8, Ireland
| | - Niamh M Ryan
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity Centre for Health Sciences, Trinity College Dublin, James' Street, Dublin 8, Ireland
| | - William Byerley
- Department of Psychiatry and Behavioral Sciences, University of California, San Francisco, CA, USA
| | - Elizabeth A Heron
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity Centre for Health Sciences, Trinity College Dublin, James' Street, Dublin 8, Ireland
| | - Aiden Corvin
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity Centre for Health Sciences, Trinity College Dublin, James' Street, Dublin 8, Ireland.
| |
Collapse
|
145
|
Ormond C, Ryan NM, Hedman AM, Cannon TD, Sullivan PF, Gill M, Hultman C, Heron EA, Johansson V, Corvin A. Whole genome sequencing study of identical twins discordant for psychosis. Transl Psychiatry 2024; 14:313. [PMID: 39080272 PMCID: PMC11289105 DOI: 10.1038/s41398-024-02982-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 06/14/2024] [Accepted: 06/18/2024] [Indexed: 08/02/2024] Open
Abstract
Monozygotic (MZ) twins are often thought to have identical genomes, but recent work has shown that early post-zygotic events can result in a spectrum of DNA variants that are different between MZ twins. Such variants may explain phenotypic discordance and contribute to disease etiology. Here we performed whole genome sequencing in 17 pairs of MZ twins discordant for a psychotic disorder (schizophrenia, schizoaffective disorder or bipolar disorder). We examined various classes of rare variants that are discordant within a twin pair. We identified four genes harboring rare, predicted deleterious missense variants that were private to an affected individual in the cohort. Variants in FOXN1 and FLOT2 would have been categorized as damaging from recent schizophrenia and bipolar exome sequencing studies. Additionally, we identified four rare genic copy number variants (CNVs) private to an affected sample, two of which overlapped genes that have shown evidence for association with schizophrenia or bipolar disorder. One such CNV was a 3q29 duplication previously implicated in autism and developmental delay. We have performed the largest MZ twin study for discordant psychotic phenotypes to date. These findings warrant further investigation using other analytical approaches.
Collapse
Affiliation(s)
- Cathal Ormond
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity College Dublin, Dublin, Ireland
| | - Niamh M Ryan
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity College Dublin, Dublin, Ireland
| | - Anna M Hedman
- Department of Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Tyrone D Cannon
- Departments of Psychology and Psychiatry, Yale University School of Medicine, New Haven, CT, USA
| | - Patrick F Sullivan
- Department of Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Departments of Genetics and Psychiatry, University of North Carolina, Chapel Hill, NC, USA
| | - Michael Gill
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity College Dublin, Dublin, Ireland
| | - Christina Hultman
- Department of Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Elizabeth A Heron
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity College Dublin, Dublin, Ireland
| | - Viktoria Johansson
- Department of Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Department of Clinical Sciences, Psychiatry Unit, Umeå University, Umeå, Sweden
| | - Aiden Corvin
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity College Dublin, Dublin, Ireland.
| |
Collapse
|
146
|
Manter DK, Reardon CL, Ashworth AJ, Ibekwe AM, Lehman RM, Maul JE, Miller DN, Creed T, Ewing PM, Park S, Ducey TF, Tyler HL, Veum KS, Weyers SL, Knaebel DB. Unveiling errors in soil microbial community sequencing: a case for reference soils and improved diagnostics for nanopore sequencing. Commun Biol 2024; 7:913. [PMID: 39069530 DOI: 10.1038/s42003-024-06594-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 07/17/2024] [Indexed: 07/30/2024] Open
Abstract
The sequencing platform and workflow strongly influence microbial community analyses through potential errors at each step. Effective diagnostics and experimental controls are needed to validate data and improve reproducibility. This cross-laboratory study evaluates sources of variability and error at three main steps of a standardized amplicon sequencing workflow (DNA extraction, polymerase chain reaction [PCR], and sequencing) using Oxford Nanopore MinION to analyze agricultural soils and a simple mock community. Variability in sequence results occurs at each step in the workflow with PCR errors and differences in library size greatly influencing diversity estimates. Common bioinformatic diagnostics and the mock community are ineffective at detecting PCR abnormalities. This work outlines several diagnostic checks and techniques to account for sequencing depth and ensure accuracy and reproducibility in soil community analyses. These diagnostics and the inclusion of a reference soil can help ensure data validity and facilitate the comparison of multiple sequencing runs within and between laboratories.
Collapse
Affiliation(s)
- Daniel K Manter
- Soil Management and Sugar Beet Research, United States Department of Agriculture Agricultural Research Service (USDA-ARS), Fort Collins, CO, USA.
| | | | - Amanda J Ashworth
- Poultry Production and Product Safety Research Unit, USDA-ARS, Fayetteville, AR, USA
| | | | - R Michael Lehman
- North Central Agricultural Research Laboratory, USDA-ARS, Brookings, SD, USA
| | - Jude E Maul
- Sustainable Agricultural Systems Laboratory, USDA-ARS, Beltsville, MD, USA
| | - Daniel N Miller
- Agroecosystem Management Research Unit, USDA-ARS, Lincoln, NE, USA
| | - Timothy Creed
- Soil Management and Sugar Beet Research, United States Department of Agriculture Agricultural Research Service (USDA-ARS), Fort Collins, CO, USA
| | | | - Stanley Park
- Water Efficiency and Salinity Research Unit, USDA-ARS, Riverside, CA, USA
| | - Thomas F Ducey
- Coastal Plains Soil, Water and Plant Research Center, USDA-ARS, Florence, SC, USA
| | - Heather L Tyler
- Crop Production Systems Research Unit, USDA-ARS, Stoneville, MS, USA
| | - Kristen S Veum
- Cropping Systems and Water Quality Research Unit, USDA-ARS, Columbia, MO, USA
| | | | | |
Collapse
|
147
|
Allan MF, Aruda J, Plung JS, Grote SL, Martin des Taillades YJ, de Lajarte AA, Bathe M, Rouskin S. Discovery and Quantification of Long-Range RNA Base Pairs in Coronavirus Genomes with SEARCH-MaP and SEISMIC-RNA. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.29.591762. [PMID: 38746332 PMCID: PMC11092567 DOI: 10.1101/2024.04.29.591762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
RNA molecules perform a diversity of essential functions for which their linear sequences must fold into higher-order structures. Techniques including crystallography and cryogenic electron microscopy have revealed 3D structures of ribosomal, transfer, and other well-structured RNAs; while chemical probing with sequencing facilitates secondary structure modeling of any RNAs of interest, even within cells. Ongoing efforts continue increasing the accuracy, resolution, and ability to distinguish coexisting alternative structures. However, no method can discover and quantify alternative structures with base pairs spanning arbitrarily long distances - an obstacle for studying viral, messenger, and long noncoding RNAs, which may form long-range base pairs. Here, we introduce the method of Structure Ensemble Ablation by Reverse Complement Hybridization with Mutational Profiling (SEARCH-MaP) and software for Structure Ensemble Inference by Sequencing, Mutation Identification, and Clustering of RNA (SEISMIC-RNA). We use SEARCH-MaP and SEISMIC-RNA to discover that the frameshift stimulating element of SARS coronavirus 2 base-pairs with another element 1 kilobase downstream in nearly half of RNA molecules, and that this structure competes with a pseudoknot that stimulates ribosomal frameshifting. Moreover, we identify long-range base pairs involving the frameshift stimulating element in other coronaviruses including SARS coronavirus 1 and transmissible gastroenteritis virus, and model the full genomic secondary structure of the latter. These findings suggest that long-range base pairs are common in coronaviruses and may regulate ribosomal frameshifting, which is essential for viral RNA synthesis. We anticipate that SEARCH-MaP will enable solving many RNA structure ensembles that have eluded characterization, thereby enhancing our general understanding of RNA structures and their functions. SEISMIC-RNA, software for analyzing mutational profiling data at any scale, could power future studies on RNA structure and is available on GitHub and the Python Package Index.
Collapse
|
148
|
Renner SS, Scherz MD, Schoch CL, Gottschling M, Vences M. Improving the gold standard in NCBI GenBank and related databases: DNA sequences from type specimens and type strains. Syst Biol 2024; 73:486-494. [PMID: 37956405 DOI: 10.1093/sysbio/syad068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 08/21/2023] [Accepted: 11/11/2023] [Indexed: 11/15/2023] Open
Abstract
Scientific names permit humans and search engines to access knowledge about the biodiversity that surrounds us, and names linked to DNA sequences are playing an ever-greater role in search-and-match identification procedures. Here, we analyze how users and curators of the National Center for Biotechnology Information (NCBI) are flagging and curating sequences derived from nomenclatural type material, which is the only way to improve the quality of DNA-based identification in the long run. For prokaryotes, 18,281 genome assemblies from type strains have been curated by NCBI staff and improve the quality of prokaryote naming. For Fungi, type-derived sequences representing over 21,000 species are now essential for fungus naming and identification. For the remaining eukaryotes, however, the numbers of sequences identifiable as type-derived are minuscule, representing only 739 species of arthropods, 1542 vertebrates, and 125 embryophytes. An increase in the production and curation of such sequences will come from (i) sequencing of types or topotypic specimens in museum collections, (ii) the March 2023 rule changes at the International Nucleotide Sequence Database Collaboration requiring more metadata for specimens, and (iii) efforts by data submitters to facilitate curation, including informing NCBI curators about a specimen's type status. We illustrate different type-data submission journeys and provide best-practice examples from a range of organisms. Expanding the number of type-derived sequences in DNA databases, especially of eukaryotes, is crucial for capturing, documenting, and protecting biodiversity.
Collapse
Affiliation(s)
- Susanne S Renner
- Department of Biology, Washington University, Saint Louis, MO 63130, USA
| | - Mark D Scherz
- Natural History Museum of Denmark, University of Copenhagen, Universitetsparken 15, Copenhagen 2100, Denmark
| | - Conrad L Schoch
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Marc Gottschling
- Faculty of Biology, GeoBio-Center, Ludwig-Maximilians-University, Munich 80333, Germany
| | - Miguel Vences
- Division of Evolutionary Biology, Zoological Institute, University of Technology, Mendelssohnstr. 4, 38106 Braunschweig, Germany
| |
Collapse
|
149
|
Giannakis K, Richards L, Johnston IG. Ecological Predictors of Organelle Genome Evolution: Phylogenetic Correlations with Taxonomically Broad, Sparse, Unsystematized Data. Syst Biol 2024; 73:419-433. [PMID: 38459872 PMCID: PMC11282362 DOI: 10.1093/sysbio/syae009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 02/26/2024] [Accepted: 03/06/2024] [Indexed: 03/11/2024] Open
Abstract
Comparative analysis of variables across phylogenetically linked observations can reveal mechanisms and insights in evolutionary biology. As the taxonomic breadth of the sample of interest increases, challenges of data sparsity, poor phylogenetic resolution, and complicated evolutionary dynamics emerge. Here, we investigate a cross-eukaryotic question where all these problems exist: which organismal ecology features are correlated with gene retention in mitochondrial and chloroplast DNA (organelle DNA or oDNA). Through a wide palette of synthetic control studies, we first characterize the specificity and sensitivity of a collection of parametric and non-parametric phylogenetic comparative approaches to identify relationships in the face of such sparse and awkward datasets. This analysis is not directly focused on oDNA, and so provides generalizable insights into comparative approaches with challenging data. We then combine and curate ecological data coupled to oDNA genome information across eukaryotes, including a new semi-automated approach for gathering data on organismal traits from less systematized open-access resources including encyclopedia articles on species and taxa. The curation process also involved resolving several issues with existing datasets, including enforcing the clade-specificity of several ecological features and fixing incorrect annotations. Combining this unique dataset with our benchmarked comparative approaches, we confirm support for several known links between organismal ecology and organelle gene retention, identify several previously unidentified relationships constituting possible ecological contributors to oDNA genome evolution, and provide support for a recently hypothesized link between environmental demand and oDNA retention. We, with caution, discuss the implications of these findings for organelle evolution and of this pipeline for broad comparative analyses in other fields.
Collapse
Affiliation(s)
| | - Luke Richards
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, UK
| | - Iain G Johnston
- Department of Mathematics, University of Bergen, Bergen 5007, Norway
- Computational Biology Unit, University of Bergen, Bergen 5006, Norway
| |
Collapse
|
150
|
Catalán A, Gygax D, Rodríguez-Montes L, Hinzke T, Hoff KJ, Duchen P. Two novel genomes of fireflies with different degrees of sexual dimorphism reveal insights into sex-biased gene expression and dosage compensation. Commun Biol 2024; 7:906. [PMID: 39068254 PMCID: PMC11283472 DOI: 10.1038/s42003-024-06550-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Accepted: 06/27/2024] [Indexed: 07/30/2024] Open
Abstract
Sexual dimorphism arises because of divergent fitness optima between the sexes. Phenotypic divergence between sexes can range from mild to extreme. Fireflies, bioluminescent beetles, present various degrees of sexual dimorphism, with species showing very mild sexual dimorphism to species presenting female-specific neoteny, posing a unique framework to investigate the evolution of sexually dimorphic traits across species. In this work, we present novel assembled genomes of two firefly species, Lamprohiza splendidula and Luciola italica, species with different degrees of sexual dimorphism. We uncover high synteny conservation of the X-chromosome across ~ 180 Mya and find full X-chromosome dosage compensation in our two fireflies, hinting at common mechanism upregulating the single male X-chromosome. Different degrees of sex-biased expressed genes were found across two body parts showing different proportions of expression conservation between species. Interestingly, we do not find X-chromosome enrichment of sex-biased genes, but retrieve autosomal enrichment of sex-biased genes. We further uncover higher nucleotide diversity in the intronic regions of sex-biased genes, hinting at a maintenance of heterozygosity through sexual selection. We identify different levels of sex-biased gene expression divergence including a set of genes showing conserved sex-biased gene expression between species. Divergent and conserved sex-biased genes are good candidates to test their role in the maintenance of sexually dimorphic traits.
Collapse
Affiliation(s)
- Ana Catalán
- Ludwig-Maximilians-Universität Munich, Division of Evolutionary Biology, Großhaderner Straße 2, Planegg-Martinsried, Bavaria, 82152, Germany.
| | - Daniel Gygax
- Ludwig-Maximilians-Universität Munich, Division of Evolutionary Biology, Großhaderner Straße 2, Planegg-Martinsried, Bavaria, 82152, Germany
- Helmholtz Center Munich, Helmholtz Pioneer Campus, Ingolstädter Landstraße 1, Munich, Oberschleißheim, 85764, Germany
| | - Leticia Rodríguez-Montes
- Center for Molecular Biology of Heidelberg University (ZMBH), DKFZ-ZMBH Alliance, D-69120, Heidelberg, Germany
| | - Tjorven Hinzke
- Institute of Microbiology, Department of Microbial Physiology and Molecular Biology, University of Greifswald, Greifswald, Germany
- Department of Pathogen Evolution, Helmholtz Institute for One Health, Greifswald, Germany
| | - Katharina J Hoff
- University of Greifswald, Institute for Mathematics and Computer Science, Walther-Rathenau-Str. 47, 17489, Greifswald, Germany
| | - Pablo Duchen
- Institute of Organismic and Molecular Evolution, Johannes Gutenberg University of Mainz, Hanns-Dieter-Hüsch-Weg 15, 55128, Mainz, Germany
| |
Collapse
|