1
|
Arendt-Tranholm A, Mwirigi JM, Price TJ. RNA isoform expression landscape of the human dorsal root ganglion generated from long-read sequencing. Pain 2024; 165:2468-2481. [PMID: 38809314 PMCID: PMC11511651 DOI: 10.1097/j.pain.0000000000003255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 02/14/2024] [Indexed: 05/30/2024]
Abstract
ABSTRACT Splicing is a posttranscriptional RNA processing mechanism that enhances genomic complexity by creating multiple isoforms from the same gene. We aimed to characterize the isoforms expressed in the human peripheral nervous system, with the goal of creating a resource to identify novel isoforms of functionally relevant genes associated with somatosensation and nociception. We used long-read sequencing to document isoform expression in the human dorsal root ganglia from 3 organ donors and validated in silico by confirming expression in short-read sequencing from 3 independent organ donors. Nineteen thousand five hundred forty-seven isoforms of protein-coding genes were detected and validated. We identified 763 isoforms with at least one previously undescribed splice junction. Previously unannotated isoforms of multiple pain-associated genes, including ASIC3 , MRGPRX1 , and HNRNPK , were identified. In the novel isoforms of ASIC3 , a region comprising approximately 35% of the 5'UTR was excised. By contrast, a novel splice junction was used in isoforms of MRGPRX1 to include an additional exon upstream of the start codon, consequently adding a region to the 5'UTR. Novel isoforms of HNRNPK were identified, which used previously unannotated splice sites to both excise exon 14 and include a sequence in the 3' end of exon 13. This novel insertion is predicted to introduce a tyrosine phosphorylation site potentially phosphorylated by SRC. We also independently confirm a recently reported DRG-specific splicing event in WNK1 that gives insight into how painless peripheral neuropathy occurs when this gene is mutated. Our findings give a clear overview of mRNA isoform diversity in the human dorsal root ganglia obtained using long-read sequencing.
Collapse
Affiliation(s)
- Asta Arendt-Tranholm
- School of Behavioral and Brain Sciences, Department of Neuroscience and Center for Advanced Pain Studies, The University of Texas at Dallas, 800 W Campbell Rd, Richardson, Texas, 75080
| | - Juliet M. Mwirigi
- School of Behavioral and Brain Sciences, Department of Neuroscience and Center for Advanced Pain Studies, The University of Texas at Dallas, 800 W Campbell Rd, Richardson, Texas, 75080
| | - Theodore J. Price
- School of Behavioral and Brain Sciences, Department of Neuroscience and Center for Advanced Pain Studies, The University of Texas at Dallas, 800 W Campbell Rd, Richardson, Texas, 75080
| |
Collapse
|
2
|
Buttanri A, Kasapoğlu AG, Öner BM, Aygören AS, Muslu S, İlhan E, Yildirim E, Aydin M. Predicting the role of β-GAL genes in bean under abiotic stress and genome-wide characterization of β-GAL gene family members. PROTOPLASMA 2024:10.1007/s00709-024-01998-z. [PMID: 39441340 DOI: 10.1007/s00709-024-01998-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 10/04/2024] [Indexed: 10/25/2024]
Abstract
Β-Gals are a subgroup of the glycoside hydrolase (GH) family of enzymes, which possess the Glyco_hydro_35 (GH35) domain. Although studies have been conducted on the β-Gal gene family in numerous plant species, no such research has been conducted on beans. The purpose of this study was to determine the gene expression levels of β-Gal genes in the leaf tissue of P. vulgaris under salt and drought stress using quantitative real-time polymerase chain reaction (qRT-PCR) and to perform a comprehensive analysis of β-Gal gene family members using bioinformatics tools. In the bean genome, 25 Pvul-βGAL proteins with amino acid numbers ranging from 291 to 1119, molecular weights from 32.94 to 126.56 kDa, and isoelectric points from 5.46 to 9.08 were identified. Both segmental and tandem duplication have occurred in β-Gal genes in the bean genome, and Pvul-BGAL genes have been subject to negative selection in the evolutionary process. For a deeper comprehension of the evolutionary proximity of Pvul-BGAL genes, a phylogenetic tree and synteny map were drawn together with Arabidopsis thaliana and Glycine max β-Gal genes. The expression profiles of β-Gal genes in different tissues of the bean were determined in silico. In addition, the expression profiles of β-Gal genes in the leaves of bean plants subjected to drought and salt stress were analyzed, and the role of β-Gal genes in salt and drought stress was estimated. In this study, the role of β-Gal gene family in abiotic stress response and the characterization of β-Gal genes in beans were determined for the first time and will provide a basis for future functional genomics studies.
Collapse
Affiliation(s)
- Azize Buttanri
- Department of Molecular Biology and Genetics, Faculty of Science, Erzurum Technical University, Erzurum, Turkey
| | - Ayşe Gül Kasapoğlu
- Department of Molecular Biology and Genetics, Faculty of Science, Erzurum Technical University, Erzurum, Turkey.
| | - Burak Muhammed Öner
- Department of Molecular Biology and Genetics, Faculty of Science, Erzurum Technical University, Erzurum, Turkey
| | - Ahmed Sidar Aygören
- Department of Molecular Biology and Genetics, Faculty of Science, Erzurum Technical University, Erzurum, Turkey
| | - Selman Muslu
- Department of Molecular Biology and Genetics, Faculty of Science, Erzurum Technical University, Erzurum, Turkey
| | - Emre İlhan
- Department of Molecular Biology and Genetics, Faculty of Science, Erzurum Technical University, Erzurum, Turkey
- Ata-Teknokent, GeneXCell Biotechnology, Ataturk University, 25240, Erzurum, Turkey
| | - Ertan Yildirim
- Department of Agricultural Biotechnology, Faculty of Agriculture, Ataturk University, Erzurum, Turkey
- Department of Garden Plants, Faculty of Agriculture, Ataturk University, Erzurum, Turkey
| | - Murat Aydin
- Department of Agricultural Biotechnology, Faculty of Agriculture, Ataturk University, Erzurum, Turkey
| |
Collapse
|
3
|
Shoer S, Reicher L, Zhao C, Pollard KS, Pilpel Y, Segal E. Pangenomes of human gut microbiota uncover links between genetic diversity and stress response. Cell Host Microbe 2024; 32:1744-1757.e2. [PMID: 39353429 DOI: 10.1016/j.chom.2024.08.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 07/11/2024] [Accepted: 08/23/2024] [Indexed: 10/04/2024]
Abstract
The genetic diversity of the gut microbiota has a central role in host health. Here, we created pangenomes for 728 human gut prokaryotic species, quadrupling the genes of strain-specific genomes. Each of these species has a core set of a thousand genes, differing even between closely related species, and an accessory set of genes unique to the different strains. Functional analysis shows high strain variability associates with sporulation, whereas low variability is linked with antibiotic resistance. We further map the antibiotic resistome across the human gut population and find 237 cases of extreme resistance even to last-resort antibiotics, with a predominance among Enterobacteriaceae. Lastly, the presence of specific genes in the microbiota relates to host age and sex. Our study underscores the genetic complexity of the human gut microbiota, emphasizing its significant implications for host health. The pangenomes and antibiotic resistance map constitute a valuable resource for further research.
Collapse
Affiliation(s)
- Saar Shoer
- Department of Computer Science and Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel; Department of Molecular Cell Biology, The Weizmann Institute of Science, Rehovot, Israel
| | - Lee Reicher
- Department of Computer Science and Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel; Department of Molecular Cell Biology, The Weizmann Institute of Science, Rehovot, Israel; Lis Maternity and Women's Hospital, Sourasky Medical Center, Tel Aviv, Israel
| | - Chunyu Zhao
- Gladstone Institute for Data Science and Biotechnology, San Francisco, CA, USA; Chan Zuckerberg Biohub San Francisco, San Francisco, CA, USA
| | - Katherine S Pollard
- Gladstone Institute for Data Science and Biotechnology, San Francisco, CA, USA; Chan Zuckerberg Biohub San Francisco, San Francisco, CA, USA; Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
| | - Yitzhak Pilpel
- Department of Molecular Genetics, The Weizmann Institute of Science, Rehovot, Israel
| | - Eran Segal
- Department of Computer Science and Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel; Department of Molecular Cell Biology, The Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
4
|
Khandia R, Garg R, Pandey MK, Khan AA, Dhanda SK, Malik A, Gurjar P. Determination of codon pattern and evolutionary forces acting on genes linked to inflammatory bowel disease. Int J Biol Macromol 2024; 278:134480. [PMID: 39116987 DOI: 10.1016/j.ijbiomac.2024.134480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 06/25/2024] [Accepted: 07/31/2024] [Indexed: 08/10/2024]
Abstract
Inflammatory bowel disease (IBD) is an inflammatory disorder of the gastrointestinal tract. The present study attempted to understand the codon usage preferences in genes associated with IBD progression. Compositional analysis, codon usage bias (CUB), Relative synonymous codon usage (RSCU), RNA structure, and expression analysis were performed to obtain a comprehensive picture of codon usage in IBD genes. Compositional analysis of 62 IBD-associated genes revealed that G and T are the most and least abundant nucleotides, respectively. ApG, CpA, and TpG dinucleotides were overrepresented or randomly used, while ApC, CpG, GpT, and TpA dinucleotides were either underrepresented or randomly used in genes related to IBD. The codons influencing the codon usage the most in IBD genes were CGC and AGG. A comparison of codon usage between IBD, and pancreatitis (non-IBD inflammatory disease) indicated that only codon CTG codon usage was significantly different between IBD and pancreatitis. At the same time, there were codons ATA, ACA, CGT, CAA, GTA, CCT, ATT, GCT, CGG, TTG, and CAG for whom codon usage was significantly different for IBD and housekeeping gene sets. The results suggest similar codon usage in at least two inflammatory disorders, IBD and pancreatitis. The analysis helps understand the codon biology, factors affecting gene expression of IBD-associated genes, and the evolution of these genes. The study helps reveal the molecular patterns associated with IBD.
Collapse
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal 462026, MP, India.
| | - Rajkumar Garg
- Department of Biosciences, Barkatullah University, Bhopal 462026, MP, India
| | - Megha Katare Pandey
- Translational Medicine Center, All India Institute of Medical Sciences, Bhopal 462020, MP, India.
| | - Azmat Ali Khan
- Pharmaceutical Biotechnology Laboratory, Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia.
| | - Sandeep Kumar Dhanda
- Department of Oncology, St Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - Abdul Malik
- Department of Pharmaceutics, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia.
| | - Pankaj Gurjar
- Centre for Global Health Research, Saveetha Medical College and Hospital, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, Tamil Nadu, India; Department of Science and Engineering, Novel Global Community Educational Foundation, Hebersham, Australia.
| |
Collapse
|
5
|
Liang J, Gondane A, Itkonen HM. CDK12-inactivation-induced MYC signaling causes dependency on the splicing kinase SRPK1. Mol Oncol 2024; 18:2510-2523. [PMID: 38775167 PMCID: PMC11459032 DOI: 10.1002/1878-0261.13666] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 03/08/2024] [Accepted: 05/08/2024] [Indexed: 10/09/2024] Open
Abstract
Inactivation of cyclin-dependent kinase 12 (CDK12) characterizes an aggressive sub-group of castration-resistant prostate cancer (CRPC). Hyper-activation of MYC transcription factor is sufficient to confer the CRPC phenotype. Here, we show that loss of CDK12 promotes MYC activity, which renders the cells dependent on the otherwise non-essential splicing regulatory kinase SRSF protein kinase 1 (SRPK1). High MYC expression is associated with increased levels of SRPK1 in patient samples, and overexpression of MYC sensitizes prostate cancer cells to SRPK1 inhibition using pharmacological and genetic strategies. We show that Endovion (SCO-101), a compound currently in clinical trials against pancreatic cancer, phenocopies the effects of the well-characterized SRPK1 inhibitor SRPIN340 on nascent transcription. This is the first study to show that Endovion is an SRPK1 inhibitor. Inhibition of SRPK1 with either of the compounds promotes transcription elongation, and transcriptionally activates the unfolded protein response. In brief, here we discover that CDK12 inactivation promotes MYC signaling in an SRPK1-dependent manner, and show that the clinical grade compound Endovion selectively targets the cells with CDK12 inactivation.
Collapse
Affiliation(s)
- Jing Liang
- Department of Biochemistry and Developmental Biology, Faculty of MedicineUniversity of HelsinkiHelsinkiFinland
| | - Aishwarya Gondane
- Department of Biochemistry and Developmental Biology, Faculty of MedicineUniversity of HelsinkiHelsinkiFinland
| | - Harri M. Itkonen
- Department of Biochemistry and Developmental Biology, Faculty of MedicineUniversity of HelsinkiHelsinkiFinland
| |
Collapse
|
6
|
Gallardo-Dodd CJ, Kutter C. The regulatory landscape of interacting RNA and protein pools in cellular homeostasis and cancer. Hum Genomics 2024; 18:109. [PMID: 39334294 PMCID: PMC11437681 DOI: 10.1186/s40246-024-00678-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Accepted: 09/22/2024] [Indexed: 09/30/2024] Open
Abstract
Biological systems encompass intricate networks governed by RNA-protein interactions that play pivotal roles in cellular functions. RNA and proteins constituting 1.1% and 18% of the mammalian cell weight, respectively, orchestrate vital processes from genome organization to translation. To date, disentangling the functional fraction of the human genome has presented a major challenge, particularly for noncoding regions, yet recent discoveries have started to unveil a host of regulatory functions for noncoding RNAs (ncRNAs). While ncRNAs exist at different sizes, structures, degrees of evolutionary conservation and abundances within the cell, they partake in diverse roles either alone or in combination. However, certain ncRNA subtypes, including those that have been described or remain to be discovered, are poorly characterized given their heterogeneous nature. RNA activity is in most cases coordinated through interactions with RNA-binding proteins (RBPs). Extensive efforts are being made to accurately reconstruct RNA-RBP regulatory networks, which have provided unprecedented insight into cellular physiology and human disease. In this review, we provide a comprehensive view of RNAs and RBPs, focusing on how their interactions generate functional signals in living cells, particularly in the context of post-transcriptional regulatory processes and cancer.
Collapse
Affiliation(s)
- Carlos J Gallardo-Dodd
- Department of Microbiology, Tumor, and Cell Biology, Science for Life Laboratory, Karolinska Institute, Solna, Sweden
| | - Claudia Kutter
- Department of Microbiology, Tumor, and Cell Biology, Science for Life Laboratory, Karolinska Institute, Solna, Sweden.
| |
Collapse
|
7
|
Pasquini L, Pereira FL, Seddighi S, Zeng Y, Wei Y, Illán-Gala I, Vatsavayai SC, Friedberg A, Lee AJ, Brown JA, Spina S, Grinberg LT, Sirkis DW, Bonham LW, Yokoyama JS, Boxer AL, Kramer JH, Rosen HJ, Humphrey J, Gitler AD, Miller BL, Pollard KS, Ward ME, Seeley WW. Frontotemporal lobar degeneration targets brain regions linked to expression of recently evolved genes. Brain 2024; 147:3032-3047. [PMID: 38940350 PMCID: PMC11370792 DOI: 10.1093/brain/awae205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 05/08/2024] [Accepted: 05/20/2024] [Indexed: 06/29/2024] Open
Abstract
In frontotemporal lobar degeneration (FTLD), pathological protein aggregation in specific brain regions is associated with declines in human-specialized social-emotional and language functions. In most patients, disease protein aggregates contain either TDP-43 (FTLD-TDP) or tau (FTLD-tau). Here, we explored whether FTLD-associated regional degeneration patterns relate to regional gene expression of human accelerated regions (HARs), conserved sequences that have undergone positive selection during recent human evolution. To this end, we used structural neuroimaging from patients with FTLD and human brain regional transcriptomic data from controls to identify genes expressed in FTLD-targeted brain regions. We then integrated primate comparative genomic data to test our hypothesis that FTLD targets brain regions linked to expression levels of recently evolved genes. In addition, we asked whether genes whose expression correlates with FTLD atrophy are enriched for genes that undergo cryptic splicing when TDP-43 function is impaired. We found that FTLD-TDP and FTLD-tau subtypes target brain regions with overlapping and distinct gene expression correlates, highlighting many genes linked to neuromodulatory functions. FTLD atrophy-correlated genes were strongly enriched for HARs. Atrophy-correlated genes in FTLD-TDP showed greater overlap with TDP-43 cryptic splicing genes and genes with more numerous TDP-43 binding sites compared with atrophy-correlated genes in FTLD-tau. Cryptic splicing genes were enriched for HAR genes, and vice versa, but this effect was due to the confounding influence of gene length. Analyses performed at the individual-patient level revealed that the expression of HAR genes and cryptically spliced genes within putative regions of disease onset differed across FTLD-TDP subtypes. Overall, our findings suggest that FTLD targets brain regions that have undergone recent evolutionary specialization and provide intriguing potential leads regarding the transcriptomic basis for selective vulnerability in distinct FTLD molecular-anatomical subtypes.
Collapse
Affiliation(s)
- Lorenzo Pasquini
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
- Department of Neurology, Neuroscape, University of California, San Francisco, CA 94158, USA
| | - Felipe L Pereira
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
| | - Sahba Seddighi
- National Institute of Neurological Disorders and Stroke, Neurogenetics Branch, Bethesda, MD 20892, USA
| | - Yi Zeng
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Yongbin Wei
- School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
| | - Ignacio Illán-Gala
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
- Global Brain Health Institute, University of California, San Francisco, San Francisco, CA, 94158USA
- Trinity College Dublin, Dublin D02 X9W9, Ireland
- Department of Neurology, Hospital de la Santa Creu i Sant Pau, Biomedical Research Institute, Universitat Autònoma de Barcelona, Barcelona, Catalunya, 08041, Spain
| | - Sarat C Vatsavayai
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
| | - Adit Friedberg
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
- Global Brain Health Institute, University of California, San Francisco, San Francisco, CA, 94158USA
- Trinity College Dublin, Dublin D02 X9W9, Ireland
| | - Alex J Lee
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
| | - Jesse A Brown
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
| | - Salvatore Spina
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
| | - Lea T Grinberg
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
- Department of Pathology, University of California, San Francisco, CA 94158, USA
| | - Daniel W Sirkis
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
| | - Luke W Bonham
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
- Department of Radiology, University of California, San Francisco, CA 94158, USA
| | - Jennifer S Yokoyama
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
- Department of Radiology, University of California, San Francisco, CA 94158, USA
| | - Adam L Boxer
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
| | - Joel H Kramer
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
| | - Howard J Rosen
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
| | - Jack Humphrey
- Nash Family Department of Neuroscience and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Aaron D Gitler
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Bruce L Miller
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
| | - Katherine S Pollard
- Gladstone Institute of Data Science and Biotechnology, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA 94158, USA
- Department of Epidemiology and Biostatistics and Bakar Institute for Computational Health Sciences, University of California San Francisco, San Francisco, CA 94158, USA
- Chan Zuckerberg Biohub, San Francisco, CA 94158, USA
| | - Michael E Ward
- National Institute of Neurological Disorders and Stroke, Neurogenetics Branch, Bethesda, MD 20892, USA
| | - William W Seeley
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA 94158, USA
- Department of Pathology, University of California, San Francisco, CA 94158, USA
| |
Collapse
|
8
|
Kuldell JC, Kaplan CD. RNA Polymerase II Activity Control of Gene Expression and Involvement in Disease. J Mol Biol 2024:168770. [PMID: 39214283 DOI: 10.1016/j.jmb.2024.168770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2024] [Revised: 08/26/2024] [Accepted: 08/26/2024] [Indexed: 09/04/2024]
Abstract
Gene expression is dependent on RNA Polymerase II (Pol II) activity in eukaryotes. In addition to determining the rate of RNA synthesis for all protein coding genes, Pol II serves as a platform for the recruitment of factors and regulation of co-transcriptional events, from RNA processing to chromatin modification and remodeling. The transcriptome can be shaped by changes in Pol II kinetics affecting RNA synthesis itself or because of alterations to co-transcriptional events that are responsive to or coupled with transcription. Genetic, biochemical, and structural approaches to Pol II in model organisms have revealed critical insights into how Pol II works and the types of factors that regulate it. The complexity of Pol II regulation generally increases with organismal complexity. In this review, we describe fundamental aspects of how Pol II activity can shape gene expression, discuss recent advances in how Pol II elongation is regulated on genes, and how altered Pol II function is linked to human disease and aging.
Collapse
Affiliation(s)
- James C Kuldell
- Department of Biological Sciences, 202A LSA, Fifth and Ruskin Avenues, University of Pittsburgh, Pittsburgh PA 15260, United States
| | - Craig D Kaplan
- Department of Biological Sciences, 202A LSA, Fifth and Ruskin Avenues, University of Pittsburgh, Pittsburgh PA 15260, United States.
| |
Collapse
|
9
|
Danmaigoro A, Muhammad MA, Abubakar K, Magiri RB, Bakare AG, Iji PA. Morphological and physiological features in small ruminants: an adaptation strategy for survival under changing climatic conditions. INTERNATIONAL JOURNAL OF BIOMETEOROLOGY 2024; 68:1497-1505. [PMID: 38700715 DOI: 10.1007/s00484-024-02694-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 03/04/2024] [Accepted: 04/27/2024] [Indexed: 07/28/2024]
Abstract
Climate change due to natural human activity is a significant global phenomenon affecting the sustainability of most countries' livestock industries. Climate change factors such as ambient temperature, relative humidity, direct and indirect sun radiation, and wind have significant consequences on feed, water, pasture availability, and the re-emergence of diseases in livestock. All these variables have a considerable impact on livestock production and welfare. However, animals' ability to respond and adapt to changes in climate differs within species and breeds. Comparatively, small ruminants are more adaptive to the adverse effects of climate change than large ruminants in terms of reproduction performance, survival, production yield, and resistance to re-emerging diseases. This is mainly due to their morphological features against harsh climate effects. Tropical breeds are more adaptive to the adverse effects of climate change than small temperate ruminants. However, the difference in morphological characteristics towards adaptation to the impact of climate change will guide the development of suitable policies on the selection of breeding stock suitable for different regions in the world. The choice of breeds based on morphological features and traits is an essential strategy in mitigating and minimizing the effects of climate change on small ruminants' production and welfare. This review highlights the adaptive morphological features within and among breeds of small ruminants toward adaptation to climate change.
Collapse
Affiliation(s)
- Abubakar Danmaigoro
- Department of Veterinary Sciences, College of Agriculture, Fisheries and Forestry, Fiji National University, Koronivia Campus, P. O. Box 1544, Nausori, Suva, Fiji.
- Department of Veterinary Preclinical Sciences, Faculty of Veterinary Medicine, Universiti Kelantan Malaysia, Pengkalan Chepa 16100 Kota Bharu, Kelantan, Malaysia.
| | - Mahmud Abdullahi Muhammad
- Department of Animal Health and Production Technology, Niger State College of Agriculture, Mokwa, Niger State, Nigeria
| | - Kabeer Abubakar
- Institut de Neurosciences des Systemes, Aix-Marselille Uneversite, 27 Bd Jean Moulin, Marseille, 13005, France
| | - Royford Bundi Magiri
- Department of Veterinary Sciences, College of Agriculture, Fisheries and Forestry, Fiji National University, Koronivia Campus, P. O. Box 1544, Nausori, Suva, Fiji
| | - Archibold Garikayi Bakare
- Department of Veterinary Sciences, College of Agriculture, Fisheries and Forestry, Fiji National University, Koronivia Campus, P. O. Box 1544, Nausori, Suva, Fiji
| | - Paul Ade Iji
- Department of Veterinary Sciences, College of Agriculture, Fisheries and Forestry, Fiji National University, Koronivia Campus, P. O. Box 1544, Nausori, Suva, Fiji
| |
Collapse
|
10
|
Kováčová M, Hlaváč V, Koževnikovová R, Rauš K, Gatěk J, Souček P. Artificial Intelligence-Driven Prediction Revealed CFTR Associated with Therapy Outcome of Breast Cancer: A Feasibility Study. Oncology 2024:1-12. [PMID: 39025053 DOI: 10.1159/000540395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Accepted: 07/09/2024] [Indexed: 07/20/2024]
Abstract
INTRODUCTION In silico tools capable of predicting the functional consequences of genomic differences between individuals, many of which are AI-driven, have been the most effective over the past two decades for non-synonymous single nucleotide variants (nsSNVs). When appropriately selected for the purpose of the study, a high predictive performance can be expected. In this feasibility study, we investigate the distribution of nsSNVs with an allele frequency below 5%. To classify the putative functional consequence, a tier-based filtration led by AI-driven predictors and scoring system was implemented to the overall decision-making process, resulting in a list of prioritised genes. METHODS The study has been conducted on breast cancer patients of homogeneous ethnicity. Germline rare variants have been sequenced in genes that influence pharmacokinetic parameters of anticancer drugs or molecular signalling pathways in cancer. After AI-driven functional pathogenicity classification and data mining in pharmacogenomic (PGx) databases, variants were collapsed to the gene level and ranked according to their putative deleterious role. RESULTS In breast cancer patients, seven of the twelve genes prioritised based on the predictions were found to be associated with response to oncotherapy, histological grade, and tumour subtype. Most importantly, we showed that the group of patients with at least one rare nsSNVs in cystic fibrosis transmembrane conductance regulator (CFTR) had significantly reduced disease-free (log rank, p = 0.002) and overall survival (log rank, p = 0.006). CONCLUSION AI-driven in silico analysis with PGx data mining provided an effective approach navigating for functional consequences across germline genetic background, which can be easily integrated into the overall decision-making process for future studies. The study revealed a statistically significant association with numerous clinicopathological parameters, including treatment response. Our study indicates that CFTR may be involved in the processes influencing the effectiveness of oncotherapy or in the malignant progression of the disease itself.
Collapse
Affiliation(s)
- Mária Kováčová
- Third Faculty of Medicine, Charles University, Prague, Czechia
| | - Viktor Hlaváč
- Laboratory of Pharmacogenomics, Biomedical Center, Faculty of Medicine in Pilsen, Charles University, Pilsen, Czechia
- Toxicogenomics Unit, National Institute of Public Health, Prague, Czechia
| | | | - Karel Rauš
- Institute for the Care for Mother and Child, Prague, Czechia
| | - Jiří Gatěk
- Department of Surgery, EUC Hospital and University of Tomas Bata in Zlin, Zlin, Czechia
| | - Pavel Souček
- Laboratory of Pharmacogenomics, Biomedical Center, Faculty of Medicine in Pilsen, Charles University, Pilsen, Czechia
- Toxicogenomics Unit, National Institute of Public Health, Prague, Czechia
| |
Collapse
|
11
|
Giel AS, Bigge J, Schumacher J, Maj C, Dasmeh P. Analysis of Evolutionary Conservation, Expression Level, and Genetic Association at a Genome-wide Scale Reveals Heterogeneity Across Polygenic Phenotypes. Mol Biol Evol 2024; 41:msae115. [PMID: 38865495 PMCID: PMC11247350 DOI: 10.1093/molbev/msae115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Revised: 03/22/2024] [Accepted: 05/03/2024] [Indexed: 06/14/2024] Open
Abstract
Understanding the expression level and evolutionary rate of associated genes with human polygenic diseases provides crucial insights into their disease-contributing roles. In this work, we leveraged genome-wide association studies (GWASs) to investigate the relationship between the genetic association and both the evolutionary rate (dN/dS) and expression level of human genes associated with the two polygenic diseases of schizophrenia and coronary artery disease. Our findings highlight a distinct variation in these relationships between the two diseases. Genes associated with both diseases exhibit a significantly greater variance in evolutionary rate compared to those implicated in monogenic diseases. Expanding our analyses to 4,756 complex traits in the GWAS atlas database, we unraveled distinct trait categories with a unique interplay among the evolutionary rate, expression level, and genetic association of human genes. In most polygenic traits, highly expressed genes were more associated with the polygenic phenotypes compared to lowly expressed genes. About 69% of polygenic traits displayed a negative correlation between genetic association and evolutionary rate, while approximately 30% of these traits showed a positive correlation between genetic association and evolutionary rate. Our results demonstrate the presence of a spectrum among complex traits, shaped by natural selection. Notably, at opposite ends of this spectrum, we find metabolic traits being more likely influenced by purifying selection, and immunological traits that are more likely shaped by positive selection. We further established the polygenic evolution portal (evopolygen.de) as a resource for investigating relationships and generating hypotheses in the field of human polygenic trait evolution.
Collapse
Affiliation(s)
- Ann-Sophie Giel
- Centre for Human Genetics, Marburg University, Marburg, Germany
| | - Jessica Bigge
- Centre for Human Genetics, Marburg University, Marburg, Germany
| | | | - Carlo Maj
- Centre for Human Genetics, Marburg University, Marburg, Germany
| | - Pouria Dasmeh
- Centre for Human Genetics, Marburg University, Marburg, Germany
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA
- Institute for Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
| |
Collapse
|
12
|
Ropert B, Gallrein C, Schumacher B. DNA repair deficiencies and neurodegeneration. DNA Repair (Amst) 2024; 138:103679. [PMID: 38640601 DOI: 10.1016/j.dnarep.2024.103679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 04/03/2024] [Accepted: 04/11/2024] [Indexed: 04/21/2024]
Abstract
Neurodegenerative diseases are the second most prevalent cause of death in industrialized countries. Alzheimer's Disease is the most widespread and also most acknowledged form of dementia today. Together with Parkinson's Disease they account for over 90 % cases of neurodegenerative disorders caused by proteopathies. Far less known are the neurodegenerative pathologies in DNA repair deficiency syndromes. Such diseases like Cockayne - or Werner Syndrome are described as progeroid syndromes - diseases that cause the premature ageing of the affected persons, and there are clear implications of such diseases in neurologic dysfunction and degeneration. In this review, we aim to draw the attention on commonalities between proteopathy-associated neurodegeneration and neurodegeneration caused by DNA repair defects and discuss how mitochondria are implicated in the development of both disorder classes. Furthermore, we highlight how nematodes are a valuable and indispensable model organism to study conserved neurodegenerative processes in a fast-forward manner.
Collapse
Affiliation(s)
- Baptiste Ropert
- Institute for Genome Stability in Aging and Disease, Medical Faculty, University and University Hospital of Cologne, Joseph-Stelzmann-Str. 26, Cologne 50931, Germany; Cologne Excellence Cluster for Cellular Stress Responses in Aging-Associated Diseases (CECAD), Center for Molecular Medicine Cologne (CMMC), University of Cologne, Joseph-Stelzmann-Str. 26, Cologne 50931, Germany
| | - Christian Gallrein
- Institute for Genome Stability in Aging and Disease, Medical Faculty, University and University Hospital of Cologne, Joseph-Stelzmann-Str. 26, Cologne 50931, Germany; Cologne Excellence Cluster for Cellular Stress Responses in Aging-Associated Diseases (CECAD), Center for Molecular Medicine Cologne (CMMC), University of Cologne, Joseph-Stelzmann-Str. 26, Cologne 50931, Germany; Leibniz Institute on Aging - Fritz Lipmann Institute (FLI), Beutenbergstraße 11, Jena 07745, Germany
| | - Björn Schumacher
- Institute for Genome Stability in Aging and Disease, Medical Faculty, University and University Hospital of Cologne, Joseph-Stelzmann-Str. 26, Cologne 50931, Germany; Cologne Excellence Cluster for Cellular Stress Responses in Aging-Associated Diseases (CECAD), Center for Molecular Medicine Cologne (CMMC), University of Cologne, Joseph-Stelzmann-Str. 26, Cologne 50931, Germany.
| |
Collapse
|
13
|
MacDonald N, Raven N, Diep W, Evans S, Pannipitiya S, Bramwell G, Vanbeek C, Thomas F, Russell T, Dujon AM, Telonis-Scott M, Ujvari B. The molecular evolution of cancer associated genes in mammals. Sci Rep 2024; 14:11650. [PMID: 38773187 PMCID: PMC11109183 DOI: 10.1038/s41598-024-62425-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 05/16/2024] [Indexed: 05/23/2024] Open
Abstract
Cancer is a disease that many multicellular organisms have faced for millions of years, and species have evolved various tumour suppression mechanisms to control oncogenesis. Although cancer occurs across the tree of life, cancer related mortality risks vary across mammalian orders, with Carnivorans particularly affected. Evolutionary theory predicts different selection pressures on genes associated with cancer progression and suppression, including oncogenes, tumour suppressor genes and immune genes. Therefore, we investigated the evolutionary history of cancer associated gene sequences across 384 mammalian taxa, to detect signatures of selection across categories of oncogenes (GRB2, FGL2 and CDC42), tumour suppressors (LITAF, Casp8 and BRCA2) and immune genes (IL2, CD274 and B2M). This approach allowed us to conduct a fine scale analysis of gene wide and site-specific signatures of selection across mammalian lineages under the lens of cancer susceptibility. Phylogenetic analyses revealed that for most species the evolution of cancer associated genes follows the species' evolution. The gene wide selection analyses revealed oncogenes being the most conserved, tumour suppressor and immune genes having similar amounts of episodic diversifying selection. Despite BRCA2's status as a key caretaker gene, episodic diversifying selection was detected across mammals. The site-specific selection analyses revealed that the two apoptosis associated domains of the Casp8 gene of bats (Chiroptera) are under opposing forces of selection (positive and negative respectively), highlighting the importance of site-specific selection analyses to understand the evolution of highly complex gene families. Our results highlighted the need to critically assess different types of selection pressure on cancer associated genes when investigating evolutionary adaptations to cancer across the tree of life. This study provides an extensive assessment of cancer associated genes in mammals with highly representative, and substantially large sample size for a comparative genomic analysis in the field and identifies various avenues for future research into the mechanisms of cancer resistance and susceptibility in mammals.
Collapse
Affiliation(s)
- Nick MacDonald
- School of Life and Environmental Sciences, Deakin University, Geelong, Waurn Ponds, Geelong, VIC, 3216, Australia
| | - Nynke Raven
- School of Life and Environmental Sciences, Deakin University, Geelong, Waurn Ponds, Geelong, VIC, 3216, Australia
| | - Wendy Diep
- School of Life and Environmental Sciences, Deakin University, Geelong, Waurn Ponds, Geelong, VIC, 3216, Australia
| | - Samantha Evans
- School of Life and Environmental Sciences, Deakin University, Geelong, Waurn Ponds, Geelong, VIC, 3216, Australia
| | - Senuri Pannipitiya
- School of Life and Environmental Sciences, Deakin University, Geelong, Waurn Ponds, Geelong, VIC, 3216, Australia
| | - Georgina Bramwell
- School of Life and Environmental Sciences, Deakin University, Geelong, Waurn Ponds, Geelong, VIC, 3216, Australia
| | - Caitlin Vanbeek
- School of Life and Environmental Sciences, Deakin University, Geelong, Waurn Ponds, Geelong, VIC, 3216, Australia
| | - Frédéric Thomas
- CREEC, UMR IRD 224-CNRS 5290, Université de Montpellier, Montpellier, France
- MIVEGEC, IRD, CNRS, Université Montpellier, Montpellier, France
| | - Tracey Russell
- Faculty of Science, School of Life and Environmental Sciences, Sydney, NSW, Australia
| | - Antoine M Dujon
- School of Life and Environmental Sciences, Deakin University, Geelong, Waurn Ponds, Geelong, VIC, 3216, Australia
| | - Marina Telonis-Scott
- School of Life and Environmental Sciences, Deakin University, Burwood, Burwood, VIC, 3125, Australia
| | - Beata Ujvari
- School of Life and Environmental Sciences, Deakin University, Geelong, Waurn Ponds, Geelong, VIC, 3216, Australia.
| |
Collapse
|
14
|
Khaniya A, Rad SMAH, Halpin J, Tawinwung S, McLellan A, Suppipat K, Hirankarn N. Development of a compact bidirectional promoter-driven dual chimeric antigen receptor (CAR) construct targeting CD19 and CD20 in the Sleeping Beauty (SB) transposon system. J Immunother Cancer 2024; 12:e008555. [PMID: 38677881 PMCID: PMC11057265 DOI: 10.1136/jitc-2023-008555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/11/2024] [Indexed: 04/29/2024] Open
Abstract
BACKGROUND A bidirectional promoter-driven chimeric antigen receptor (CAR) cassette provides the simultaneous expression of two CARs, which significantly enhances dual antigen-targeted CAR T-cell therapy. METHODS We developed a second-generation CAR directing CD19 and CD20 antigens, incorporating them in a head-to-head orientation from a bidirectional promoter using a single Sleeping Beauty transposon system. The efficacy of bidirectional promoter-driven dual CD19 and CD20 CAR T cells was determined in vitro against cell lines expressing either, or both, CD19 and CD20 antigens. In vivo antitumor activity was tested in Raji lymphoma-bearing immunodeficient NOD-scid IL2Rgammanull (NSG) mice. RESULTS Of all tested promoters, the bidirectional EF-1α promoter optimally expressed transcripts from both sense (CD19-CAR) and antisense (GFP.CD20-CAR) directions. Superior cytotoxicity, cytokine production and antigen-specific activation were observed in vitro in the bidirectional EF-1α promoter-driven CD19/CD20 CAR T cells. In contrast, a unidirectional construct driven by the EF-1α promoter, but using self-cleaving peptide-linked CD19 and CD20 CARs, showed inferior expression and in vitro function. Treatment of mice bearing advanced Raji lymphomas with bidirectional EF-1α promoter-driven CD19/CD20 CAR T cells effectively controlled tumor growth and extended the survival of mice compared with group treated with single antigen targeted CAR T cells. CONCLUSION The use of bidirectional promoters in a single vector offers advantages of size and robust CAR expression with the potential to expand use in other forms of gene therapies like CAR T cells.
Collapse
MESH Headings
- Antigens, CD19/immunology
- Antigens, CD19/genetics
- Humans
- Animals
- Antigens, CD20/genetics
- Antigens, CD20/metabolism
- Antigens, CD20/immunology
- Mice
- Receptors, Chimeric Antigen/genetics
- Receptors, Chimeric Antigen/metabolism
- Receptors, Chimeric Antigen/immunology
- DNA Transposable Elements
- Promoter Regions, Genetic
- Immunotherapy, Adoptive/methods
- Mice, Inbred NOD
- Cell Line, Tumor
- Mice, SCID
- Xenograft Model Antitumor Assays
Collapse
Affiliation(s)
- Asmita Khaniya
- Medical Microbiology, Chulalongkorn University, Bangkok, Thailand
- Cellular Immunotherapy Research Unit, Chulalongkorn University, Bangkok, Thailand
| | | | - Josh Halpin
- Department of Microbiology and Immunology, University of Otago, Dunedin, New Zealand
| | - Supannikar Tawinwung
- Cellular Immunotherapy Research Unit, Chulalongkorn University, Bangkok, Thailand
- Pharmacology and Physiology, Chulalongkorn University Faculty of Pharmaceutical Sciences, Bangkok, Thailand
| | - Alexander McLellan
- Department of Microbiology and Immunology, University of Otago, Dunedin, New Zealand
| | - Koramit Suppipat
- Cellular Immunotherapy Research Unit, Chulalongkorn University, Bangkok, Thailand
- Department of Research Affairs, Chulalongkorn University, Bangkok, Thailand
| | - Nattiya Hirankarn
- Center of Excellence in Immunology and Immune-mediated Diseases, Chulalongkorn University, Bangkok, Thailand
- Department of Microbiology, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
| |
Collapse
|
15
|
McCoy MJ, Fire AZ. Parallel gene size and isoform expansion of ancient neuronal genes. Curr Biol 2024; 34:1635-1645.e3. [PMID: 38460513 PMCID: PMC11043017 DOI: 10.1016/j.cub.2024.02.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 12/16/2023] [Accepted: 02/11/2024] [Indexed: 03/11/2024]
Abstract
How nervous systems evolved is a central question in biology. A diversity of synaptic proteins is thought to play a central role in the formation of specific synapses leading to nervous system complexity. The largest animal genes, often spanning hundreds of thousands of base pairs, are known to be enriched for expression in neurons at synapses and are frequently mutated or misregulated in neurological disorders and diseases. Although many of these genes have been studied independently in the context of nervous system evolution and disease, general principles underlying their parallel evolution remain unknown. To investigate this, we directly compared orthologous gene sizes across eukaryotes. By comparing relative gene sizes within organisms, we identified a distinct class of large genes with origins predating the diversification of animals and, in many cases, the emergence of neurons as dedicated cell types. We traced this class of ancient large genes through evolution and found orthologs of the large synaptic genes potentially driving the immense complexity of metazoan nervous systems, including in humans and cephalopods. Moreover, we found that while these genes are evolving under strong purifying selection, as demonstrated by low dN/dS ratios, they have simultaneously grown larger and gained the most isoforms in animals. This work provides a new lens through which to view this distinctive class of large and multi-isoform genes and demonstrates how intrinsic genomic properties, such as gene length, can provide flexibility in molecular evolution and allow groups of genes and their host organisms to evolve toward complexity.
Collapse
Affiliation(s)
- Matthew J McCoy
- Department of Pathology, Stanford University School of Medicine, 300 Pasteur Drive, Stanford, CA 94305, USA.
| | - Andrew Z Fire
- Department of Pathology, Stanford University School of Medicine, 300 Pasteur Drive, Stanford, CA 94305, USA; Department of Genetics, Stanford University School of Medicine, 300 Pasteur Drive, Stanford, CA 94305, USA.
| |
Collapse
|
16
|
Ahn SH, Park Y, Kim JH. Contradiction in Star-Allele Nomenclature of Pharmacogenes between Common Haplotypes and Rare Variants. Genes (Basel) 2024; 15:521. [PMID: 38674455 PMCID: PMC11050392 DOI: 10.3390/genes15040521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 04/18/2024] [Accepted: 04/19/2024] [Indexed: 04/28/2024] Open
Abstract
The nomenclature of star alleles has been widely used in pharmacogenomics to enhance treatment outcomes, predict drug response variability, and reduce adverse reactions. However, the discovery of numerous rare functional variants through genome sequencing introduces complexities into the star-allele system. This study aimed to assess the nature and impact of the rapid discovery of numerous rare functional variants in the traditional haplotype-based star-allele system. We developed a new method to construct haplogroups, representing a common ancestry structure, by iteratively excluding rare and functional variants of the 25 representative pharmacogenes using the 2504 genomes from the 1000 Genomes Project. In total, 192 haplogroups and 288 star alleles were identified, with an average of 7.68 ± 4.2 cross-ethnic haplogroups per gene. Most of the haplogroups (70.8%, 136/192) were highly aligned with their corresponding classical star alleles (VI = 1.86 ± 0.78), exhibiting higher genetic diversity than the star alleles. Approximately 41.3% (N = 119) of the star alleles in the 2504 genomes did not belong to any of the haplogroups, and most of them (91.3%, 105/116) were determined by a single variant according to the allele-definition table provided by CPIC. These functional single variants had low allele frequency (MAF < 1%), high evolutionary conservation, and variant deleteriousness, which suggests significant negative selection. It is suggested that the traditional haplotype-based naming system for pharmacogenetic star alleles now needs to be adjusted by balancing both traditional haplotyping and newly emerging variant-sequencing approaches to reduce naming complexity.
Collapse
Affiliation(s)
- Se Hwan Ahn
- Department of Biomedical Sciences, Seoul National University Biomedical Informatics (SNUBI), Seoul National University College of Medicine, Seoul 03080, Republic of Korea;
| | - Yoomi Park
- Seoul National University Biomedical Informatics (SNUBI), Seoul National University College of Medicine, Seoul 03080, Republic of Korea;
- Medical Research Center, Seoul National University College of Medicine, Seoul 03080, Republic of Korea
| | - Ju Han Kim
- Department of Biomedical Sciences, Seoul National University Biomedical Informatics (SNUBI), Seoul National University College of Medicine, Seoul 03080, Republic of Korea;
- Seoul National University Biomedical Informatics (SNUBI), Seoul National University College of Medicine, Seoul 03080, Republic of Korea;
| |
Collapse
|
17
|
Safadi A, Lovell SC, Doig AJ. Essentiality, protein-protein interactions and evolutionary properties are key predictors for identifying cancer-associated genes using machine learning. Sci Rep 2024; 14:9199. [PMID: 38649399 PMCID: PMC11035574 DOI: 10.1038/s41598-023-44118-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 10/04/2023] [Indexed: 04/25/2024] Open
Abstract
The distinctive nature of cancer as a disease prompts an exploration of the special characteristics the genes implicated in cancer exhibit. The identification of cancer-associated genes and their characteristics is crucial to further our understanding of this disease and enhanced likelihood of therapeutic drug targets success. However, the rate at which cancer genes are being identified experimentally is slow. Applying predictive analysis techniques, through the building of accurate machine learning models, is potentially a useful approach in enhancing the identification rate of these genes and their characteristics. Here, we investigated gene essentiality scores and found that they tend to be higher for cancer-associated genes compared to other protein-coding human genes. We built a dataset of extended gene properties linked to essentiality and used it to train a machine-learning model; this model reached 89% accuracy and > 0.85 for the Area Under Curve (AUC). The model showed that essentiality, evolutionary-related properties, and properties arising from protein-protein interaction networks are particularly effective in predicting cancer-associated genes. We were able to use the model to identify potential candidate genes that have not been previously linked to cancer. Prioritising genes that score highly by our methods could aid scientists in their cancer genes research.
Collapse
Affiliation(s)
- Amro Safadi
- Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, M13 9PT, UK
| | - Simon C Lovell
- Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, M13 9PT, UK
| | - Andrew J Doig
- Division of Neuroscience, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, M13 9BL, UK.
| |
Collapse
|
18
|
Moroz LL. Evolutionary neurogenomics: Lengthy resolutions for complex brains. Curr Biol 2024; 34:R315-R318. [PMID: 38653198 PMCID: PMC11173357 DOI: 10.1016/j.cub.2024.03.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/25/2024]
Abstract
Genomic blueprints underlying unique neuronal organization are enigmatic. A new study reveals the recruitment of ancient, larger genes for synaptic machinery, providing evolutionary constraints and flexibility, with increasing gene sizes being found in animal lineages that led to cephalopods and vertebrates.
Collapse
Affiliation(s)
- Leonid L Moroz
- Department of Neuroscience, College of Medicine and the Whitney Laboratory for Marine Biosciences, University of Florida, FL 32080, USA.
| |
Collapse
|
19
|
Bei C, Zhu J, Culviner PH, Gan M, Rubin EJ, Fortune SM, Gao Q, Liu Q. Genetically encoded transcriptional plasticity underlies stress adaptation in Mycobacterium tuberculosis. Nat Commun 2024; 15:3088. [PMID: 38600064 PMCID: PMC11006872 DOI: 10.1038/s41467-024-47410-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Accepted: 03/28/2024] [Indexed: 04/12/2024] Open
Abstract
Transcriptional regulation is a critical adaptive mechanism that allows bacteria to respond to changing environments, yet the concept of transcriptional plasticity (TP) - the variability of gene expression in response to environmental changes - remains largely unexplored. In this study, we investigate the genome-wide TP profiles of Mycobacterium tuberculosis (Mtb) genes by analyzing 894 RNA sequencing samples derived from 73 different environmental conditions. Our data reveal that Mtb genes exhibit significant TP variation that correlates with gene function and gene essentiality. We also find that critical genetic features, such as gene length, GC content, and operon size independently impose constraints on TP, beyond trans-regulation. By extending our analysis to include two other Mycobacterium species -- M. smegmatis and M. abscessus -- we demonstrate a striking conservation of the TP landscape. This study provides a comprehensive understanding of the TP exhibited by mycobacteria genes, shedding light on this significant, yet understudied, genetic feature encoded in bacterial genomes.
Collapse
Affiliation(s)
- Cheng Bei
- Key Laboratory of Medical Molecular Virology (MOE/NHC/CAMS), School of Basic Medical Science, Shanghai Medical College, Shanghai Institute of Infectious Disease and Biosecurity, Fudan University, Shanghai, China
| | - Junhao Zhu
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, MA, USA
- CAS Key Laboratory of Pathogen Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Peter H Culviner
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Mingyu Gan
- Center for Molecular Medicine, Children's Hospital of Fudan University, National Children's Medical Center, 201102, Shanghai, China
| | - Eric J Rubin
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Sarah M Fortune
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Qian Gao
- Key Laboratory of Medical Molecular Virology (MOE/NHC/CAMS), School of Basic Medical Science, Shanghai Medical College, Shanghai Institute of Infectious Disease and Biosecurity, Fudan University, Shanghai, China.
- National Clinical Research Center for Infectious Diseases, Shenzhen Third People's Hospital, Shenzhen, Guangdong Province, China.
| | - Qingyun Liu
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, MA, USA.
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA.
| |
Collapse
|
20
|
Soheili-Nezhad S, Ibáñez-Solé O, Izeta A, Hoeijmakers JHJ, Stoeger T. Time is ticking faster for long genes in aging. Trends Genet 2024; 40:299-312. [PMID: 38519330 PMCID: PMC11003850 DOI: 10.1016/j.tig.2024.01.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 01/30/2024] [Accepted: 01/30/2024] [Indexed: 03/24/2024]
Abstract
Recent studies of aging organisms have identified a systematic phenomenon, characterized by a negative correlation between gene length and their expression in various cell types, species, and diseases. We term this phenomenon gene-length-dependent transcription decline (GLTD) and suggest that it may represent a bottleneck in the transcription machinery and thereby significantly contribute to aging as an etiological factor. We review potential links between GLTD and key aging processes such as DNA damage and explore their potential in identifying disease modification targets. Notably, in Alzheimer's disease, GLTD spotlights extremely long synaptic genes at chromosomal fragile sites (CFSs) and their vulnerability to postmitotic DNA damage. We suggest that GLTD is an integral element of biological aging.
Collapse
Affiliation(s)
- Sourena Soheili-Nezhad
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands; Department of Cognitive Neuroscience, Donders Institute for Brain, Cognition, and Behaviour, Radboud University Medical Centre, Nijmegen, The Netherlands
| | - Olga Ibáñez-Solé
- Stem Cells & Aging Group, Biogipuzkoa Health Research Institute, Donostia-San Sebastián, Spain; Institute for Genome Stability in Aging and Disease, Medical Faculty, University and University Hospital of Cologne, Joseph-Stelzmann-Strasse 26, 50931 Cologne, Germany; Cologne Excellence Cluster for Cellular Stress Responses in Aging-Associated Diseases (CECAD), Center for Molecular Medicine Cologne (CMMC), University of Cologne, Joseph-Stelzmann-Strasse 26, 50931 Cologne, Germany
| | - Ander Izeta
- Stem Cells & Aging Group, Biogipuzkoa Health Research Institute, Donostia-San Sebastián, Spain; Tecnun-University of Navarra, 20018 Donostia-San Sebastian, Spain.
| | - Jan H J Hoeijmakers
- Department of Molecular Genetics, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, The Netherlands; University of Cologne, Faculty of Medicine, Cluster of Excellence for Aging Research, Institute for Genome Stability in Ageing and Disease, Cologne, Germany; Princess Maxima Center for Pediatric Oncology, Oncode Institute, Utrecht, The Netherlands.
| | - Thomas Stoeger
- Feinberg School of Medicine, Division of Pulmonary and Critical Care Medicine, Northwestern University, Chicago, IL, USA; Potocsnak Longevity Institute, Northwestern University, Chicago, IL, USA; Simpson Querrey Lung Institute for Translational Science, Chicago, IL, USA.
| |
Collapse
|
21
|
Richardson R, Tejedor Navarro H, Amaral LAN, Stoeger T. Meta-Research: Understudied genes are lost in a leaky pipeline between genome-wide assays and reporting of results. eLife 2024; 12:RP93429. [PMID: 38546716 PMCID: PMC10977968 DOI: 10.7554/elife.93429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/01/2024] Open
Abstract
Present-day publications on human genes primarily feature genes that already appeared in many publications prior to completion of the Human Genome Project in 2003. These patterns persist despite the subsequent adoption of high-throughput technologies, which routinely identify novel genes associated with biological processes and disease. Although several hypotheses for bias in the selection of genes as research targets have been proposed, their explanatory powers have not yet been compared. Our analysis suggests that understudied genes are systematically abandoned in favor of better-studied genes between the completion of -omics experiments and the reporting of results. Understudied genes remain abandoned by studies that cite these -omics experiments. Conversely, we find that publications on understudied genes may even accrue a greater number of citations. Among 45 biological and experimental factors previously proposed to affect which genes are being studied, we find that 33 are significantly associated with the choice of hit genes presented in titles and abstracts of -omics studies. To promote the investigation of understudied genes, we condense our insights into a tool, find my understudied genes (FMUG), that allows scientists to engage with potential bias during the selection of hits. We demonstrate the utility of FMUG through the identification of genes that remain understudied in vertebrate aging. FMUG is developed in Flutter and is available for download at fmug.amaral.northwestern.edu as a MacOS/Windows app.
Collapse
Affiliation(s)
- Reese Richardson
- Interdisciplinary Biological Sciences, Northwestern UniversityEvanstonUnited States
- Department of Chemical and Biological Engineering, Northwestern UniversityEvanstonUnited States
| | - Heliodoro Tejedor Navarro
- Department of Chemical and Biological Engineering, Northwestern UniversityEvanstonUnited States
- Northwestern Institute on Complex Systems, Northwestern UniversityEvanstonUnited States
| | - Luis A Nunes Amaral
- Department of Chemical and Biological Engineering, Northwestern UniversityEvanstonUnited States
- Northwestern Institute on Complex Systems, Northwestern UniversityEvanstonUnited States
- Department of Molecular Biosciences, Northwestern UniversityEvanstonUnited States
- Department of Physics and Astronomy, Northwestern UniversityEvanstonUnited States
| | - Thomas Stoeger
- Department of Chemical and Biological Engineering, Northwestern UniversityEvanstonUnited States
- The Potocsnak Longevity Institute, Northwestern UniversityChicagoUnited States
- Simpson Querrey Lung Institute for Translational Science, Northwestern UniversityChicagoUnited States
| |
Collapse
|
22
|
Lopes-Marques M, Mort M, Carneiro J, Azevedo A, Amaro AP, Cooper DN, Azevedo L. Meta-analysis of 46,000 germline de novo mutations linked to human inherited disease. Hum Genomics 2024; 18:20. [PMID: 38395944 PMCID: PMC10885371 DOI: 10.1186/s40246-024-00587-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Accepted: 02/15/2024] [Indexed: 02/25/2024] Open
Abstract
BACKGROUND De novo mutations (DNMs) are variants that occur anew in the offspring of noncarrier parents. They are not inherited from either parent but rather result from endogenous mutational processes involving errors of DNA repair/replication. These spontaneous errors play a significant role in the causation of genetic disorders, and their importance in the context of molecular diagnostic medicine has become steadily more apparent as more DNMs have been reported in the literature. In this study, we examined 46,489 disease-associated DNMs annotated by the Human Gene Mutation Database (HGMD) to ascertain their distribution across gene and disease categories. RESULTS Most disease-associated DNMs reported to date are found to be associated with developmental and psychiatric disorders, a reflection of the focus of sequencing efforts over the last decade. Of the 13,277 human genes in which DNMs have so far been found, the top-10 genes with the highest proportions of DNM relative to gene size were H3-3 A, DDX3X, CSNK2B, PURA, ZC4H2, STXBP1, SCN1A, SATB2, H3-3B and TUBA1A. The distribution of CADD and REVEL scores for both disease-associated DNMs and those mutations not reported to be de novo revealed a trend towards higher deleteriousness for DNMs, consistent with the likely lower selection pressure impacting them. This contrasts with the non-DNMs, which are presumed to have been subject to continuous negative selection over multiple generations. CONCLUSION This meta-analysis provides important information on the occurrence and distribution of disease-associated DNMs in association with heritable disease and should make a significant contribution to our understanding of this major type of mutation.
Collapse
Affiliation(s)
- Mónica Lopes-Marques
- CIIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal
| | - Matthew Mort
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, UK
| | - João Carneiro
- CIIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal
| | - António Azevedo
- CHUdSA-Centro Hospitalar Universitário de Santo António, Porto, Portugal
- UMIB-Unit for Multidisciplinary Research in Biomedicine, ICBAS - School of Medicine and Biomedical Sciences, University of Porto, Porto, Portugal
- ITR - Laboratory for Integrative and Translational Research in Population Health, Porto, Portugal
| | - Andreia P Amaro
- UMIB-Unit for Multidisciplinary Research in Biomedicine, ICBAS - School of Medicine and Biomedical Sciences, University of Porto, Porto, Portugal
- ITR - Laboratory for Integrative and Translational Research in Population Health, Porto, Portugal
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, UK
| | - Luísa Azevedo
- UMIB-Unit for Multidisciplinary Research in Biomedicine, ICBAS - School of Medicine and Biomedical Sciences, University of Porto, Porto, Portugal.
- ITR - Laboratory for Integrative and Translational Research in Population Health, Porto, Portugal.
| |
Collapse
|
23
|
Rasheed A, Al-Huqail AA, Ali B, Alghanem SMS, Shah AA, Azeem F, Rizwan M, Al-Qthanin RN, Soudy FA. Molecular characterization of genes involved in tolerance of cadmium in Triticum aestivum (L.) under Cd stress. JOURNAL OF HAZARDOUS MATERIALS 2024; 464:132955. [PMID: 37976857 DOI: 10.1016/j.jhazmat.2023.132955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 10/21/2023] [Accepted: 11/06/2023] [Indexed: 11/19/2023]
Abstract
The NRAMPs (natural resistance-associated macrophage proteins) are major transporters for the absorption and transport of metals like Pb, Zn, Mn, Fe, and Cd in plants. While NRAMP gene family members have been extensively studied as metal transporters in model and other plants, little information has been reported on their role in Triticum aestivum, particularly in response to Cd stress. Current study reported 13 NRAMP candidates in the genome of T. aestivum. Phylogenetic analysis divided these into three clades. Motif and gene structure study showed that members in the same clades shared the same location and pattern, which further supported the phylogenetic analysis. The analysis of cis-acting elements in promoter sequences of NRAMP genes in wheat identified stress-responsive transcription factor binding sites. Multiple sequence alignment identified the conservation of important residues. Based on RNA-seq and qRT-PCR analysis, Cd stress-responsive variations of TaNRAMP gene expression were reported. This study provides comprehensive data to understand the TaNRAMP gene family, its features, and its expression, which will be a helpful framework for functional research.
Collapse
Affiliation(s)
- Asima Rasheed
- Department of Bioinformatics and Biotechnology, Government College University Faisalabad, Faisalabad 38000, Pakistan
| | - Arwa Abdulkreem Al-Huqail
- Department of Biology, College of Science, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | - Baber Ali
- Department of Plant Sciences, Quaid-i-Azam University, Islamabad 45320, Pakistan
| | | | - Asad Ali Shah
- Department of Bioinformatics and Biotechnology, Government College University Faisalabad, Faisalabad 38000, Pakistan
| | - Farrukh Azeem
- Department of Bioinformatics and Biotechnology, Government College University Faisalabad, Faisalabad 38000, Pakistan.
| | - Muhammad Rizwan
- Department of Environmental Sciences, Government College University Faisalabad, Faisalabad 38000, Pakistan.
| | - Rahmah N Al-Qthanin
- Department of Biology, College of Science, King Khalid University, Abha, 61413, Saudi Arabia; Prince Sultan Bin Abdelaziz for Environmental Research and Natural Resources Sustainability Center, King Khalid University, Abha 61421, Saudi Arabia
| | - Fathia A Soudy
- Genetics and Genetic Engineering Department, Faculty of Agriculture, Benha University, Moshtohor 13736, Egypt
| |
Collapse
|
24
|
Xiao Y, Chen YM, Zou Z, Ye C, Dou X, Wu J, Liu C, Liu S, Yan H, Wang P, Zeng TB, Liu Q, Fei J, Tang W, He C. Profiling of RNA-binding protein binding sites by in situ reverse transcription-based sequencing. Nat Methods 2024; 21:247-258. [PMID: 38200227 PMCID: PMC10864177 DOI: 10.1038/s41592-023-02146-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Accepted: 12/07/2023] [Indexed: 01/12/2024]
Abstract
RNA-binding proteins (RBPs) regulate diverse cellular processes by dynamically interacting with RNA targets. However, effective methods to capture both stable and transient interactions between RBPs and their RNA targets are still lacking, especially when the interaction is dynamic or samples are limited. Here we present an assay of reverse transcription-based RBP binding site sequencing (ARTR-seq), which relies on in situ reverse transcription of RBP-bound RNAs guided by antibodies to identify RBP binding sites. ARTR-seq avoids ultraviolet crosslinking and immunoprecipitation, allowing for efficient and specific identification of RBP binding sites from as few as 20 cells or a tissue section. Taking advantage of rapid formaldehyde fixation, ARTR-seq enables capturing the dynamic RNA binding by RBPs over a short period of time, as demonstrated by the profiling of dynamic RNA binding of G3BP1 during stress granule assembly on a timescale as short as 10 minutes.
Collapse
Affiliation(s)
- Yu Xiao
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Yan-Ming Chen
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Zhongyu Zou
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Chang Ye
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Xiaoyang Dou
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Jinjun Wu
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
| | - Chang Liu
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Shun Liu
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Hao Yan
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
| | - Pingluan Wang
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Tie-Bo Zeng
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Qinzhe Liu
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, Chicago, IL, USA
| | - Jingyi Fei
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
| | - Weixin Tang
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
| | - Chuan He
- Department of Chemistry, The University of Chicago, Chicago, IL, USA.
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA.
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA.
- Howard Hughes Medical Institute, Chicago, IL, USA.
| |
Collapse
|
25
|
Szatmári EZ, Csordás A, Kerepesi C. Unique Patterns in Amino Acid Sequences of Aging-Related Proteins. Adv Biol (Weinh) 2024; 8:e2300436. [PMID: 37880927 DOI: 10.1002/adbi.202300436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 10/14/2023] [Indexed: 10/27/2023]
Abstract
Aging has strong genetic components and the list of genes that may regulate the aging process is collected in the GenAge database. There may be characteristic patterns in the amino acid sequences of aging-related proteins that distinguish them from other proteins and this information will lead to a better understanding of the aging process. To test this hypothesis, human protein sequences are extracted from the UniProt database and the relative frequency of every amino acid residue in aging-related proteins and the remaining proteins is calculated. The main observation is that the mean relative frequency of aspartic acid (D) is consistently higher, while the mean relative frequencies of tryptophan (W) and leucine (L) are consistently lower in aging-related proteins compared to the non-aging-related proteins for the human and four examined model organisms. It is also observed that the mean relative frequency of aspartic acid is higher, while the mean relative frequency of tryptophan is lower in pro-longevity proteins compared to anti-longevity proteins in model organisms. Finally, it is found that aging-related proteins tend to be longer than non-aging-related proteins. It is hoped that this analysis initiates further computational and experimental research to explore the underlying mechanisms of these findings.
Collapse
Affiliation(s)
- Eszter Zita Szatmári
- Institute for Computer Science and Control (SZTAKI), Hungarian Research Network (HUN-REN), Budapest, 1111, Hungary
- Department of Applied Analysis and Computational Mathematics, Eötvös Loránd University (ELTE), Pázmány Péter sétány 1/C, Budapest, 1117, Hungary
| | | | - Csaba Kerepesi
- Institute for Computer Science and Control (SZTAKI), Hungarian Research Network (HUN-REN), Budapest, 1111, Hungary
| |
Collapse
|
26
|
Liu Y, Xu W, Yang P, Liu X. Revealing Molecular Patterns of Alzheimer's Disease Risk Gene Expression Signatures in COVID-19 Brains. J Alzheimers Dis 2024; 101:31-48. [PMID: 39058446 DOI: 10.3233/jad-240609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/28/2024]
Abstract
Background Various virus infections are known to predispose to Alzheimer's disease (AD), and a linkage between COVID-19 and AD has been established. COVID-19 infection modulates the gene expression of the genes implicated in progression of AD. Objective Determination of molecular patterns and codon usage and context analysis for the genes that are modulated during COVID-19 infection and are implicated in AD was the target of the study. Methods Our study employed a comprehensive array of research methods, including relative synonymous codon usage, Codon adaptation index analysis, Neutrality and parity analysis, Rare codon analyses, and codon context analysis. This meticulous approach was crucial in determining the molecular patterns present in genes up or downregulated during COVID-19 infection. Results G/C ending codons were preferred in upregulated genes while not in downregulated genes, and in both gene sets, longer genes have high expressivity. Similarly, T over A nucleotide was preferred, and selection was the major evolutionary force in shaping codon usage in both gene sets. Apart from stops codons, codons CGU - Arg, AUA - Ile, UUA - Leu, UCG - Ser, GUA - Val, and CGA - Arg in upregulated genes, while CUA - Leu, UCG - Ser, and UUA - Leu in downregulated genes were present below the 0.5%. Glutamine-initiated codon pairs have high residual values in upregulated genes. Identical codon pairs GAG-GAG and GUG-GUG were preferred in both gene sets. Conclusions The shared and unique molecular features in the up- and downregulated gene sets provide insights into the complex interplay between COVID-19 infection and AD. Further studies are required to elucidate the relationship of these molecular patterns with AD pathology.
Collapse
Affiliation(s)
- Yan Liu
- Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences, Third Hospital of Shanxi Medical University, Tongji Shanxi Hospital, Taiyuan, China
| | - Weiyue Xu
- Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences, Third Hospital of Shanxi Medical University, Tongji Shanxi Hospital, Taiyuan, China
| | - Pan Yang
- TEDA institute of Biological Science and Biotechnology, Nankai University, TEDA, Tianjin, China
| | - Xingshun Liu
- Department of Biochemistry and Molecular Biology, Shanxi Medical University, Taiyuan, China
| |
Collapse
|
27
|
Nakatsu K, Jijiwa M, Khadka V, Nasu M, Deng Y. sRNAfrag: a pipeline and suite of tools to analyze fragmentation in small RNA sequencing data. Brief Bioinform 2023; 25:bbad515. [PMID: 38243693 PMCID: PMC10796253 DOI: 10.1093/bib/bbad515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Revised: 11/25/2023] [Accepted: 12/13/2023] [Indexed: 01/21/2024] Open
Abstract
Fragments derived from small RNAs such as small nucleolar RNAs are biologically relevant but remain poorly understood. To address this gap, we developed sRNAfrag, a modular and interoperable tool designed to standardize the quantification and analysis of small RNA fragmentation across various biotypes. The tool outputs a set of tables forming a relational database, allowing for an in-depth exploration of biologically complex events such as multi-mapping and RNA fragment stability across different cell types. In a benchmark test, sRNAfrag was able to identify established loci of mature microRNAs solely based on sequencing data. Furthermore, the 5' seed sequence could be rediscovered by utilizing a visualization approach primarily applied in multi-sequence-alignments. Utilizing the relational database outputs, we detected 1411 snoRNA fragment conservation events between two out of four eukaryotic species, providing an opportunity to explore motifs through evolutionary time and conserved fragmentation patterns. Additionally, the tool's interoperability with other bioinformatics tools like ViennaRNA amplifies its utility for customized analyses. We also introduce a novel loci-level variance-score which provides insights into the noise around peaks and demonstrates biological relevance by distinctly separating breast cancer and neuroblastoma cell lines after dimension reduction when applied to small nucleolar RNAs. Overall, sRNAfrag serves as a versatile foundation for advancing our understanding of small RNA fragments and offers a functional foundation to further small RNA research. Availability: https://github.com/kenminsoo/sRNAfrag.
Collapse
Affiliation(s)
- Ken Nakatsu
- Emory College of Arts and Sciences, Emory University, 201 Dowman Dr, 30322, Georgia, United States of America
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, 651 Ilalo St, 96813, Hawaii, United States of America
| | - Mayumi Jijiwa
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, 651 Ilalo St, 96813, Hawaii, United States of America
| | - Vedbar Khadka
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, 651 Ilalo St, 96813, Hawaii, United States of America
| | - Masaki Nasu
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, 651 Ilalo St, 96813, Hawaii, United States of America
| | - Youping Deng
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, 651 Ilalo St, 96813, Hawaii, United States of America
| |
Collapse
|
28
|
Han SW, Jewell S, Thomas-Tikhonenko A, Barash Y. Contrasting and Combining Transcriptome Complexity Captured by Short and Long RNA Sequencing Reads. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.21.568046. [PMID: 38045232 PMCID: PMC10690182 DOI: 10.1101/2023.11.21.568046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
Mapping transcriptomic variations using either short or long reads RNA sequencing is a staple of genomic research. Long reads are able to capture entire isoforms and overcome repetitive regions, while short reads still provides improved coverage and error rates. Yet how to quantitatively compare the technologies, can we combine those, and what may be the benefit of such a combined view remain open questions. We tackle these questions by first creating a pipeline to assess matched long and short reads data using a variety of transcriptome statistics. We find that across datasets, algorithms and technologies, matched short reads data detects roughly 50% more splice junctions, with 10-30% of the splice junctions included at 20% or more are missed by long reads. In contrast, long reads detect many more intron retention events, pointing to the benefit of combining the technologies. We introduce MAJIQ-L, an extension of the MAJIQ software to enable a unified view of transcriptome variations from both technologies and demonstrate its benefits. Our software can be used to assess any future long reads technology or algorithm, and combine it with short reads data for improved transcriptome analysis.
Collapse
Affiliation(s)
- Seong Woo Han
- Department of Computer and Information Sciences, School of Engineering, University of Pennsylvania
| | - San Jewell
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania
| | - Andrei Thomas-Tikhonenko
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania
- Division of Cancer Pathobiology, Children's Hospital of Philadelphia
| | - Yoseph Barash
- Department of Computer and Information Sciences, School of Engineering, University of Pennsylvania
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania
| |
Collapse
|
29
|
Arendt-Tranholm A, Mwirigi JM, Price TJ. RNA isoform expression landscape of the human dorsal root ganglion (DRG) generated from long read sequencing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.28.564535. [PMID: 37961262 PMCID: PMC10634934 DOI: 10.1101/2023.10.28.564535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Splicing is a post-transcriptional RNA processing mechanism that enhances genomic complexity by creating multiple isoforms from the same gene. Diversity in splicing in the mammalian nervous system is associated with neuronal development, synaptic function and plasticity, and is also associated with diseases of the nervous system ranging from neurodegeneration to chronic pain. We aimed to characterize the isoforms expressed in the human peripheral nervous system, with the goal of creating a resource to identify novel isoforms of functionally relevant genes associated with somatosensation and nociception. We used long read sequencing (LRS) to document isoform expression in the human dorsal root ganglia (hDRG) from 3 organ donors. Isoforms were validated in silico by confirming expression in hDRG short read sequencing (SRS) data from 3 independent organ donors. 19,547 isoforms of protein-coding genes were detected using LRS and validated with SRS and strict expression cutoffs. We identified 763 isoforms with at least one previously undescribed splice-junction. Previously unannotated isoforms of multiple pain-associated genes, including ASIC3, MRGPRX1 and HNRNPK were identified. In the novel isoforms of ASIC3, a region comprising ~35% of the 5'UTR was excised. In contrast, a novel splice-junction was utilized in isoforms of MRGPRX1 to include an additional exon upstream of the start-codon, consequently adding a region to the 5'UTR. Novel isoforms of HNRNPK were identified which utilized previously unannotated splice-sites to both excise exon 14 and include a sequence in the 5' end of exon 13. The insertion and deletion in the coding region was predicted to excise a serine-phosphorylation site favored by cdc2, and replace it with a tyrosine-phosphorylation site potentially phosphorylated by SRC. We also independently confirm a recently reported DRG-specific splicing event in WNK1 that gives insight into how painless peripheral neuropathy occurs when this gene is mutated. Our findings give a clear overview of mRNA isoform diversity in the hDRG obtained using LRS. Using this work as a foundation, an important next step will be to use LRS on hDRG tissues recovered from people with a history of chronic pain. This should enable identification of new drug targets and a better understanding of chronic pain that may involve aberrant splicing events.
Collapse
Affiliation(s)
- Asta Arendt-Tranholm
- School of Behavioral and Brain Sciences, Department of Neuroscience and Center for Advanced Pain Studies, The University of Texas at Dallas, 800 W Campbell Rd, Richardson, Texas, 75080
| | - Juliet M. Mwirigi
- School of Behavioral and Brain Sciences, Department of Neuroscience and Center for Advanced Pain Studies, The University of Texas at Dallas, 800 W Campbell Rd, Richardson, Texas, 75080
| | - Theodore J. Price
- School of Behavioral and Brain Sciences, Department of Neuroscience and Center for Advanced Pain Studies, The University of Texas at Dallas, 800 W Campbell Rd, Richardson, Texas, 75080
| |
Collapse
|
30
|
Pasquini L, Pereira FL, Seddighi S, Zeng Y, Wei Y, Illán-Gala I, Vatsavayai SC, Friedberg A, Lee AJ, Brown JA, Spina S, Grinberg LT, Sirkis DW, Bonham LW, Yokoyama JS, Boxer AL, Kramer JH, Rosen HJ, Humphrey J, Gitler AD, Miller BL, Pollard KS, Ward ME, Seeley WW. FTLD targets brain regions expressing recently evolved genes. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.10.27.23297687. [PMID: 37961381 PMCID: PMC10635220 DOI: 10.1101/2023.10.27.23297687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
In frontotemporal lobar degeneration (FTLD), pathological protein aggregation is associated with a decline in human-specialized social-emotional and language functions. Most disease protein aggregates contain either TDP-43 (FTLD-TDP) or tau (FTLD-tau). Here, we explored whether FTLD targets brain regions that express genes containing human accelerated regions (HARs), conserved sequences that have undergone positive selection during recent human evolution. To this end, we used structural neuroimaging from patients with FTLD and normative human regional transcriptomic data to identify genes expressed in FTLD-targeted brain regions. We then integrated primate comparative genomic data to test our hypothesis that FTLD targets brain regions expressing recently evolved genes. In addition, we asked whether genes expressed in FTLD-targeted brain regions are enriched for genes that undergo cryptic splicing when TDP-43 function is impaired. We found that FTLD-TDP and FTLD-tau subtypes target brain regions that express overlapping and distinct genes, including many linked to neuromodulatory functions. Genes whose normative brain regional expression pattern correlated with FTLD cortical atrophy were strongly associated with HARs. Atrophy-correlated genes in FTLD-TDP showed greater overlap with TDP-43 cryptic splicing genes compared with atrophy-correlated genes in FTLD-tau. Cryptic splicing genes were enriched for HAR genes, and vice versa, but this effect was due to the confounding influence of gene length. Analyses performed at the individual-patient level revealed that the expression of HAR genes and cryptically spliced genes within putative regions of disease onset differed across FTLD-TDP subtypes. Overall, our findings suggest that FTLD targets brain regions that have undergone recent evolutionary specialization and provide intriguing potential leads regarding the transcriptomic basis for selective vulnerability in distinct FTLD molecular-anatomical subtypes.
Collapse
Affiliation(s)
- Lorenzo Pasquini
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
- Department of Neurology, Neuroscape, University of California, San Francisco, CA, USA
| | - Felipe L Pereira
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
| | - Sahba Seddighi
- National Institute of Neurological Disorders and Stroke, Bethesda, MD, USA
| | - Yi Zeng
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | - Yongbin Wei
- School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, China
| | - Ignacio Illán-Gala
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
- Global Brain Health Institute, University of California, San Francisco, San Francisco, CA, USA and Trinity College Dublin, Dublin, Ireland
- Department of Neurology, Hospital de la Santa Creu i Sant Pau, Biomedical Research Institute, Universitat Autònoma de Barcelona, Barcelona, Catalunya, Spain
| | - Sarat C Vatsavayai
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
| | - Adit Friedberg
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
- Global Brain Health Institute, University of California, San Francisco, San Francisco, CA, USA and Trinity College Dublin, Dublin, Ireland
| | - Alex J Lee
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
| | - Jesse A Brown
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
| | - Salvatore Spina
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
| | - Lea T Grinberg
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
- Department of Pathology, University of California, San Francisco, CA, USA
| | - Daniel W Sirkis
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
| | - Luke W Bonham
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
- Department of Radiology, University of California, San Francisco, CA, USA
| | - Jennifer S Yokoyama
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
- Department of Radiology, University of California, San Francisco, CA, USA
| | - Adam L Boxer
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
| | - Joel H Kramer
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
| | - Howard J Rosen
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
| | - Jack Humphrey
- Nash Family Department of Neuroscience and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Aaron D Gitler
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | - Bruce L Miller
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
| | - Katherine S Pollard
- Gladstone Institute of Data Science and Biotechnology, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
- Department of Epidemiology & Biostatistics and Bakar Institute for Computational Health Sciences, University of California San Francisco, San Francisco, CA, USA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
| | - Michael E Ward
- National Institute of Neurological Disorders and Stroke, Bethesda, MD, USA
| | - William W Seeley
- Department of Neurology, Memory and Aging Center, University of California, San Francisco, CA, USA
- Department of Pathology, University of California, San Francisco, CA, USA
| |
Collapse
|
31
|
Balasooriya GI, Wee TL, Spector DL. A sub-set of guanine- and cytosine-rich genes are actively transcribed at the nuclear Lamin B1 region. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.28.564411. [PMID: 37961255 PMCID: PMC10634887 DOI: 10.1101/2023.10.28.564411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Chromatin organization in the mammalian cell nucleus plays a vital role in the regulation of gene expression. The lamina-associated domain at the inner nuclear membrane has been proposed to harbor heterochromatin, while the nuclear interior has been shown to contain most of the euchromatin. Here, we show that a sub-set of actively transcribing genes, marked by RNA Pol II pSer2, are associated with Lamin B1 at the inner nuclear envelop in mESCs and the number of genes proportionally increases upon in vitro differentiation of mESC to olfactory precursor cells. These nuclear periphery-associated actively transcribing genes primarily represent housekeeping genes, and their gene bodies are significantly enriched with guanine and cytosine compared to genes actively transcribed at the nuclear interior. We found the promoters of these genes to also be significantly enriched with guanine and to be predominantly regulated by zinc finger protein transcription factors. We provide evidence supporting the emerging notion that the Lamin B1 region is not solely transcriptionally silent.
Collapse
|
32
|
Liu Z, Huang YF. Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.29.555384. [PMID: 37693607 PMCID: PMC10491176 DOI: 10.1101/2023.08.29.555384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Copy number losses (deletions) are a major contributor to the etiology of severe genetic disorders. Although haploinsufficient genes play a critical role in deletion pathogenicity, current methods for deletion pathogenicity prediction fail to integrate multiple lines of evidence for haploinsufficiency at the gene level, limiting their power to pinpoint deleterious deletions associated with genetic disorders. Here we introduce DosaCNV, a deep multiple-instance learning framework that, for the first time, models deletion pathogenicity jointly with gene haploinsufficiency. By integrating over 30 gene-level features potentially predictive of haploinsufficiency, DosaCNV shows unmatched performance in prioritizing pathogenic deletions associated with a broad spectrum of genetic disorders. Furthermore, DosaCNV outperforms existing methods in predicting gene haploinsufficiency even though it is not trained on known haploinsufficient genes. Finally, DosaCNV leverages a state-of-the-art technique to quantify the contributions of individual gene-level features to haploinsufficiency, allowing for human-understandable explanations of model predictions. Altogether, DosaCNV is a powerful computational tool for both fundamental and translational research.
Collapse
Affiliation(s)
- Zhihan Liu
- Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
- Molecular, Cellular, and Integrative Biosciences Program, Pennsylvania State University, University Park, PA 16802, USA
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
| | - Yi-Fei Huang
- Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|
33
|
Cowell IG, Casement JW, Austin CA. To Break or Not to Break: The Role of TOP2B in Transcription. Int J Mol Sci 2023; 24:14806. [PMID: 37834253 PMCID: PMC10573011 DOI: 10.3390/ijms241914806] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 09/28/2023] [Accepted: 09/29/2023] [Indexed: 10/15/2023] Open
Abstract
Transcription and its regulation pose challenges related to DNA torsion and supercoiling of the DNA template. RNA polymerase tracking the helical groove of the DNA introduces positive helical torsion and supercoiling upstream and negative torsion and supercoiling behind its direction of travel. This can inhibit transcriptional elongation and other processes essential to transcription. In addition, chromatin remodeling associated with gene activation can generate or be hindered by excess DNA torsional stress in gene regulatory regions. These topological challenges are solved by DNA topoisomerases via a strand-passage reaction which involves transiently breaking and re-joining of one (type I topoisomerases) or both (type II topoisomerases) strands of the phosphodiester backbone. This review will focus on one of the two mammalian type II DNA topoisomerase enzymes, DNA topoisomerase II beta (TOP2B), that have been implicated in correct execution of developmental transcriptional programs and in signal-induced transcription, including transcriptional activation by nuclear hormone ligands. Surprisingly, several lines of evidence indicate that TOP2B-mediated protein-free DNA double-strand breaks are involved in signal-induced transcription. We discuss the possible significance and origins of these DSBs along with a network of protein interaction data supporting a variety of roles for TOP2B in transcriptional regulation.
Collapse
Affiliation(s)
- Ian G. Cowell
- Biosciences Institute, The Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne NE2 4HH, UK
| | - John W. Casement
- Bioinformatics Support Unit, The Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne NE2 4HH, UK
| | - Caroline A. Austin
- Biosciences Institute, The Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne NE2 4HH, UK
| |
Collapse
|
34
|
Bei C, Zhu J, Culviner PH, Rubin EJ, Fortune SM, Gao Q, Liu Q. Genetically encoded transcriptional plasticity underlies stress adaptation in Mycobacterium tuberculosis. RESEARCH SQUARE 2023:rs.3.rs-3303807. [PMID: 37790329 PMCID: PMC10543248 DOI: 10.21203/rs.3.rs-3303807/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]
Abstract
Transcriptional regulation is a critical adaptive mechanism that allows bacteria to respond to changing environments, yet the concept of transcriptional plasticity (TP) remains largely unexplored. In this study, we investigate the genome-wide TP profiles of Mycobacterium tuberculosis (Mtb) genes by analyzing 894 RNA sequencing samples derived from 73 different environmental conditions. Our data reveal that Mtb genes exhibit significant TP variation that correlates with gene function and gene essentiality. We also found that critical genetic features, such as gene length, GC content, and operon size independently impose constraints on TP, beyond trans-regulation. By extending our analysis to include two other Mycobacterium species -- M. smegmatis and M. abscessus -- we demonstrate a striking conservation of the TP landscape. This study provides a comprehensive understanding of the TP exhibited by mycobacteria genes, shedding light on this significant, yet understudied, genetic feature encoded in bacterial genomes.
Collapse
Affiliation(s)
- Cheng Bei
- Key Laboratory of Medical Molecular Virology (MOE/NHC/CAMS), School of Basic Medical Science, Shanghai Medical College, Shanghai Institute of Infectious Disease and Biosecurity, Fudan University, Shanghai, China
| | - Junhao Zhu
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Peter H Culviner
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Eric J. Rubin
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Sarah M Fortune
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Qian Gao
- Key Laboratory of Medical Molecular Virology (MOE/NHC/CAMS), School of Basic Medical Science, Shanghai Medical College, Shanghai Institute of Infectious Disease and Biosecurity, Fudan University, Shanghai, China
- National Clinical Research Center for Infectious Diseases, Shenzhen Third People’s Hospital, Shenzhen, Guangdong Province, China
| | - Qingyun Liu
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| |
Collapse
|
35
|
Jain A, Begum T, Ahmad S. Analysis and Prediction of Pathogen Nucleic Acid Specificity for Toll-like Receptors in Vertebrates. J Mol Biol 2023; 435:168208. [PMID: 37479078 DOI: 10.1016/j.jmb.2023.168208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 06/20/2023] [Accepted: 07/13/2023] [Indexed: 07/23/2023]
Abstract
Identification of key sequence, expression and function related features of nucleic acid-sensing host proteins is of fundamental importance to understand the dynamics of pathogen-specific host responses. To meet this objective, we considered toll-like receptors (TLRs), a representative class of membrane-bound sensor proteins, from 17 vertebrate species covering mammals, birds, reptiles, amphibians, and fishes in this comparative study. We identified the molecular signatures of host TLRs that are responsible for sensing pathogen nucleic acids or other pathogen-associated molecular patterns (PAMPs), and potentially play important roles in host defence mechanism. Interestingly, our findings reveal that such host-specific features are directly related to the strand (single or double) specificity of nucleic acid from pathogens. However, during host-pathogen interactions, such features were unable to explain the pathogenic PAMP (i.e., DNA, RNA or other) selectivity, suggesting a more complex mechanism. Using these features, we developed a number of machine learning models, of which Random Forest achieved a high performance (94.57% accuracy) to predict strand specificity of TLRs from protein-derived features. We applied the trained model to propose strand specificity of some previously uncharacterized distinct fish-specific novel TLRs (TLR18, TLR23, TLR24, TLR25, TLR27).
Collapse
Affiliation(s)
- Anuja Jain
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi 110067, India. https://twitter.com/@Anuja334
| | - Tina Begum
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi 110067, India.
| | - Shandar Ahmad
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi 110067, India.
| |
Collapse
|
36
|
Nakatsu K, Jijiwa M, Khadka V, Nasu M, Huo M, Deng Y. sRNAfrag: A pipeline and suite of tools to analyze fragmentation in small RNA sequencing data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.19.553943. [PMID: 37662282 PMCID: PMC10473647 DOI: 10.1101/2023.08.19.553943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
Fragments derived from small RNAs such as small nucleolar RNAs hold biological relevance. However, they remain poorly understood, calling for more comprehensive methods for analysis. We developed sRNAfrag, a standardized workflow and set of scripts to quantify and analyze sRNA fragmentation of any biotype. In a benchmark, it is able to detect loci of mature microRNAs fragmented from precursors and, utilizing multi-mapping events, the conserved 5' seed sequence of miRNAs which we believe may extraoplate to other small RNA fragments. The tool detected 1411 snoRNA fragment conservation events between 2/4 eukaryotic species, providing the opportunity to explore motifs and fragmentation patterns not only within species, but between. Availability: https://github.com/kenminsoo/sRNAfrag.
Collapse
Affiliation(s)
- Ken Nakatsu
- Emory College of Arts and Sciences, Emory University, 201 Dowman Dr, Atlanta, 30322, Georgia, United States of America
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, 651 Ilalo St, Honolulu, 96813, Hawaii, United States of America
| | - Mayumi Jijiwa
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, 651 Ilalo St, Honolulu, 96813, Hawaii, United States of America
| | - Vedbar Khadka
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, 651 Ilalo St, Honolulu, 96813, Hawaii, United States of America
| | - Masaki Nasu
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, 651 Ilalo St, Honolulu, 96813, Hawaii, United States of America
| | - Matthew Huo
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, 651 Ilalo St, Honolulu, 96813, Hawaii, United States of America
- Krieger School of Arts and Sciences, Johns Hopkins University, 3400 N Charles St, Baltimore, 21218, Maryland, United States of America
| | - Youping Deng
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, 651 Ilalo St, Honolulu, 96813, Hawaii, United States of America
| |
Collapse
|
37
|
Bei C, Zhu J, Culviner PH, Rubin EJ, Fortune SM, Gao Q, Liu Q. Genetically encoded transcriptional plasticity underlies stress adaptation in Mycobacterium tuberculosis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.20.553992. [PMID: 37645742 PMCID: PMC10462119 DOI: 10.1101/2023.08.20.553992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Transcriptional regulation is a critical adaptive mechanism that allows bacteria to respond to changing environments, yet the concept of transcriptional plasticity (TP) remains largely unexplored. In this study, we investigate the genome-wide TP profiles of Mycobacterium tuberculosis (Mtb) genes by analyzing 894 RNA sequencing samples derived from 73 different environmental conditions. Our data reveal that Mtb genes exhibit significant TP variation that correlates with gene function and gene essentiality. We also found that critical genetic features, such as gene length, GC content, and operon size independently impose constraints on TP, beyond trans-regulation. By extending our analysis to include two other Mycobacterium species -- M. smegmatis and M. abscessus -- we demonstrate a striking conservation of the TP landscape. This study provides a comprehensive understanding of the TP exhibited by mycobacteria genes, shedding light on this significant, yet understudied, genetic feature encoded in bacterial genomes.
Collapse
Affiliation(s)
- Cheng Bei
- Key Laboratory of Medical Molecular Virology (MOE/NHC/CAMS), School of Basic Medical Science, Shanghai Medical College, Shanghai Institute of Infectious Disease and Biosecurity, Fudan University, Shanghai, China
| | - Junhao Zhu
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Peter H Culviner
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Eric J. Rubin
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Sarah M Fortune
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Qian Gao
- Key Laboratory of Medical Molecular Virology (MOE/NHC/CAMS), School of Basic Medical Science, Shanghai Medical College, Shanghai Institute of Infectious Disease and Biosecurity, Fudan University, Shanghai, China
- National Clinical Research Center for Infectious Diseases, Shenzhen Third People’s Hospital, Shenzhen, Guangdong Province, China
| | - Qingyun Liu
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| |
Collapse
|
38
|
Farhat S, Modica MV, Puillandre N. Whole Genome Duplication and Gene Evolution in the Hyperdiverse Venomous Gastropods. Mol Biol Evol 2023; 40:msad171. [PMID: 37494290 PMCID: PMC10401626 DOI: 10.1093/molbev/msad171] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 06/20/2023] [Accepted: 07/17/2023] [Indexed: 07/28/2023] Open
Abstract
The diversity of venomous organisms and the toxins they produce have been increasingly investigated, but taxonomic bias remains important. Neogastropods, a group of marine predators representing almost 22% of the known gastropod diversity, evolved a wide range of feeding strategies, including the production of toxins to subdue their preys. However, whether the diversity of these compounds is at the origin of the hyperdiversification of the group and how genome evolution may correlate with both the compounds and species diversities remain understudied. Among the available gastropods genomes, only eight, with uneven quality assemblies, belong to neogastropods. Here, we generated chromosome-level assemblies of two species belonging to the Tonnoidea and Muricoidea superfamilies (Monoplex corrugatus and Stramonita haemastoma). The two obtained high-quality genomes had 3 and 2.2 Gb, respectively, and 92-89% of the total assembly conformed 35 pseudochromosomes in each species. Through the analysis of syntenic blocks, Hox gene cluster duplication, and synonymous substitutions distribution pattern, we inferred the occurrence of a whole genome duplication event in both genomes. As these species are known to release venom, toxins were annotated in both genomes, but few of them were found in homologous chromosomes. A comparison of the expression of ohnolog genes (using transcriptomes from osphradium and salivary glands in S. haemastoma), where both copies were differentially expressed, showed that most of them had similar expression profiles. The high quality of these genomes makes them valuable reference in their respective taxa, facilitating the identification of genome-level processes at the origin of their evolutionary success.
Collapse
Affiliation(s)
- Sarah Farhat
- Institut Systématique Evolution Biodiversité (ISYEB), Muséum national d'Histoire naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, Paris, France
| | - Maria Vittoria Modica
- Department of Biology and Evolution of Marine Organisms (BEOM), Stazione Zoologica Anton Dohrn, Roma, Italy
| | - Nicolas Puillandre
- Institut Systématique Evolution Biodiversité (ISYEB), Muséum national d'Histoire naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, Paris, France
| |
Collapse
|
39
|
Jeon H, Lim KS, Nguyen Y, Nettleton D. Adjusting for gene-specific covariates to improve RNA-seq analysis. Bioinformatics 2023; 39:btad498. [PMID: 37589589 PMCID: PMC10460482 DOI: 10.1093/bioinformatics/btad498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Revised: 06/29/2023] [Accepted: 08/16/2023] [Indexed: 08/18/2023] Open
Abstract
SUMMARY This article suggests a novel positive false discovery rate (pFDR) controlling method for testing gene-specific hypotheses using a gene-specific covariate variable, such as gene length. We suppose the null probability depends on the covariate variable. In this context, we propose a rejection rule that accounts for heterogeneity among tests by using two distinct types of null probabilities. We establish a pFDR estimator for a given rejection rule by following Storey's q-value framework. A condition on a type 1 error posterior probability is provided that equivalently characterizes our rejection rule. We also present a suitable procedure for selecting a tuning parameter through cross-validation that maximizes the expected number of hypotheses declared significant. A simulation study demonstrates that our method is comparable to or better than existing methods across realistic scenarios. In data analysis, we find support for our method's premise that the null probability varies with a gene-specific covariate variable. AVAILABILITY AND IMPLEMENTATION The source code repository is publicly available at https://github.com/hsjeon1217/conditional_method.
Collapse
Affiliation(s)
- Hyeongseon Jeon
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, United States
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, United States
| | - Kyu-Sang Lim
- Department of Animal Resources Science, Kongju National University, Yesan-gun, Chungnam 32439, Republic of Korea
| | - Yet Nguyen
- Department of Mathematics and Statistics, Old Dominion University, Norfolk, VA 23529, United States
| | - Dan Nettleton
- Department of Statistics, Iowa State University, Ames, IA 50011, Unites States
| |
Collapse
|
40
|
Carbonell-Sala S, Lagarde J, Nishiyori H, Palumbo E, Arnan C, Takahashi H, Carninci P, Uszczynska-Ratajczak B, Guigó R. CapTrap-Seq: A platform-agnostic and quantitative approach for high-fidelity full-length RNA transcript sequencing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.16.543444. [PMID: 37398314 PMCID: PMC10312720 DOI: 10.1101/2023.06.16.543444] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Long-read RNA sequencing is essential to produce accurate and exhaustive annotation of eukaryotic genomes. Despite advancements in throughput and accuracy, achieving reliable end-to-end identification of RNA transcripts remains a challenge for long-read sequencing methods. To address this limitation, we developed CapTrap-seq, a cDNA library preparation method, which combines the Cap-trapping strategy with oligo(dT) priming to detect 5'capped, full-length transcripts, together with the data processing pipeline LyRic. We benchmarked CapTrap-seq and other popular RNA-seq library preparation protocols in a number of human tissues using both ONT and PacBio sequencing. To assess the accuracy of the transcript models produced, we introduced a capping strategy for synthetic RNA spike-in sequences that mimics the natural 5'cap formation in RNA spike-in molecules. We found that the vast majority (up to 90%) of transcript models that LyRic derives from CapTrap-seq reads are full-length. This makes it possible to produce highly accurate annotations with minimal human intervention.
Collapse
|
41
|
Chai S, Wakefield L, Norgard M, Li B, Enicks D, Marks DL, Grompe M. Strong ubiquitous micro-promoters for recombinant adeno-associated viral vectors. Mol Ther Methods Clin Dev 2023; 29:504-512. [PMID: 37287749 PMCID: PMC10241652 DOI: 10.1016/j.omtm.2023.05.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Accepted: 05/12/2023] [Indexed: 06/09/2023]
Abstract
Significant progress has been made in developing recombinant adeno-associated virus (rAAV) for clinical gene therapy. While rAAV is a versatile gene delivery platform, its packaging limit of 4.7 kb limits the diseases it can target. Here, we report two unusually small promoters that enable the expression of larger transgenes than standard promoters. These micro-promoters are only 84 (MP-84) and 135 bp (MP-135) in size but have activity in most cells and tissues comparable to the CAG promoter, the strongest ubiquitous promoter to date. MP-84- and MP-135-based rAAV constructs displayed robust activity in cultured cells from the three different germ-layer lineages. In addition, reporter gene expression was documented in human primary hepatocytes and pancreatic islets and in multiple mouse tissues in vivo, including brain and skeletal muscle. MP-84 and MP-135 will enable the therapeutic expression of transgenes currently too large for rAAV vectors.
Collapse
Affiliation(s)
- Sunghee Chai
- Papé Family Pediatric Research Institute, Oregon Stem Cell Center, Portland, OR, USA
| | - Leslie Wakefield
- Papé Family Pediatric Research Institute, Oregon Stem Cell Center, Portland, OR, USA
| | - Mason Norgard
- Department of Pediatrics, Oregon Health & Science University, Portland, OR, USA
| | - Bin Li
- Papé Family Pediatric Research Institute, Oregon Stem Cell Center, Portland, OR, USA
| | - David Enicks
- Papé Family Pediatric Research Institute, Oregon Stem Cell Center, Portland, OR, USA
| | - Daniel L. Marks
- Department of Pediatrics, Oregon Health & Science University, Portland, OR, USA
| | - Markus Grompe
- Papé Family Pediatric Research Institute, Oregon Stem Cell Center, Portland, OR, USA
| |
Collapse
|
42
|
Nevers Y, Glover NM, Dessimoz C, Lecompte O. Protein length distribution is remarkably uniform across the tree of life. Genome Biol 2023; 24:135. [PMID: 37291671 PMCID: PMC10251718 DOI: 10.1186/s13059-023-02973-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Accepted: 05/16/2023] [Indexed: 06/10/2023] Open
Abstract
BACKGROUND In every living species, the function of a protein depends on its organization of structural domains, and the length of a protein is a direct reflection of this. Because every species evolved under different evolutionary pressures, the protein length distribution, much like other genomic features, is expected to vary across species but has so far been scarcely studied. RESULTS Here we evaluate this diversity by comparing protein length distribution across 2326 species (1688 bacteria, 153 archaea, and 485 eukaryotes). We find that proteins tend to be on average slightly longer in eukaryotes than in bacteria or archaea, but that the variation of length distribution across species is low, especially compared to the variation of other genomic features (genome size, number of proteins, gene length, GC content, isoelectric points of proteins). Moreover, most cases of atypical protein length distribution appear to be due to artifactual gene annotation, suggesting the actual variation of protein length distribution across species is even smaller. CONCLUSIONS These results open the way for developing a genome annotation quality metric based on protein length distribution to complement conventional quality measures. Overall, our findings show that protein length distribution between living species is more uniform than previously thought. Furthermore, we also provide evidence for a universal selection on protein length, yet its mechanism and fitness effect remain intriguing open questions.
Collapse
Affiliation(s)
- Yannis Nevers
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
- Swiss Institute for Bioinformatics, University of Lausanne, Lausanne, Switzerland.
| | - Natasha M Glover
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute for Bioinformatics, University of Lausanne, Lausanne, Switzerland
| | - Christophe Dessimoz
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute for Bioinformatics, University of Lausanne, Lausanne, Switzerland
- Department of Computer Science, University College London, London, UK
- Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, London, UK
| | - Odile Lecompte
- Department of Computer Science, Centre de Recherche en Biomédecine de Strasbourg, ICube, UMR 7357, University of Strasbourg, CNRS, Strasbourg, France
| |
Collapse
|
43
|
Truong DD, Lamhamedi-Cherradi SE, Porter RW, Krishnan S, Swaminathan J, Gibson A, Lazar AJ, Livingston JA, Gopalakrishnan V, Gordon N, Daw NC, Navin NE, Gorlick R, Ludwig JA. Dissociation protocols used for sarcoma tissues bias the transcriptome observed in single-cell and single-nucleus RNA sequencing. BMC Cancer 2023; 23:488. [PMID: 37254069 PMCID: PMC10230784 DOI: 10.1186/s12885-023-10977-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 05/17/2023] [Indexed: 06/01/2023] Open
Abstract
BACKGROUND Single-cell RNA-seq has emerged as an innovative technology used to study complex tissues and characterize cell types, states, and lineages at a single-cell level. Classification of bulk tumors by their individual cellular constituents has also created new opportunities to generate single-cell atlases for many organs, cancers, and developmental models. Despite the tremendous promise of this technology, recent evidence studying epithelial tissues and diverse carcinomas suggests the methods used for tissue processing, cell disaggregation, and preservation can significantly bias gene expression and alter the observed cell types. To determine whether sarcomas - tumors of mesenchymal origin - are subject to the same technical artifacts, we profiled patient-derived tumor explants (PDXs) propagated from three aggressive subtypes: osteosarcoma (OS), Ewing sarcoma (ES), desmoplastic small round cell tumor (DSRCT). Given the rarity of these sarcoma subtypes, we explored whether single-nuclei RNA-seq from more widely available archival frozen specimens could accurately be identified by gene expression signatures linked to tissue phenotype or pathognomonic fusion proteins. RESULTS We systematically assessed dissociation methods across different sarcoma subtypes. We compared gene expression from single-cell and single-nucleus RNA-sequencing of 125,831 whole-cells and nuclei from ES, DSRCT, and OS PDXs. We detected warm dissociation artifacts in single-cell samples and gene length bias in single-nucleus samples. Classic sarcoma gene signatures were observed regardless of the dissociation method. In addition, we showed that dissociation method biases could be computationally corrected. CONCLUSIONS We highlighted transcriptional biases, including warm dissociation and gene-length biases, introduced by the dissociation method for various sarcoma subtypes. This work is the first to characterize how the dissociation methods used for sc/snRNA-seq may affect the interpretation of the molecular features in sarcoma PDXs.
Collapse
Affiliation(s)
- Danh D Truong
- Sarcoma Medical Oncology Department, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | | | - Robert W Porter
- Sarcoma Medical Oncology Department, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Sandhya Krishnan
- Sarcoma Medical Oncology Department, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | | | - Amber Gibson
- Division of Pediatrics, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Alexander J Lazar
- Division of Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - J Andrew Livingston
- Sarcoma Medical Oncology Department, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Vidya Gopalakrishnan
- Division of Pediatrics, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Nancy Gordon
- Division of Pediatrics, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Najat C Daw
- Division of Pediatrics, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Nicholas E Navin
- Department of Genetics, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Richard Gorlick
- Division of Pediatrics, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Joseph A Ludwig
- Sarcoma Medical Oncology Department, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA.
| |
Collapse
|
44
|
Sims AA, Gurkar AU. DNA damage-induced stalling of transcription drives aging through gene expression imbalance. DNA Repair (Amst) 2023; 125:103483. [PMID: 36921370 DOI: 10.1016/j.dnarep.2023.103483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 03/07/2023] [Indexed: 03/12/2023]
Abstract
Age-related changes in gene expression have long been examined to understand the biology of aging. The hallmarks of aging are biological processes known to be associated with aging, but whether there is a unifying driver of these attributes, is not well understood. With the advent of technology over the last few years, it is quite clear that aging leads to global decline in transcription. In this Perspective, we highlight a new study in Nature Genetics that aimed to determine why global transcription rate reduces with age and how this phenomenon is the driver that interconnects multiple hallmarks of aging. This study recognizes that age-related accumulation of DNA damage, particularly transcription-blocking lesions, stalls RNA polymerase. This phenomenon affects longer genes leading to a gradual loss of transcription and skewing the transcriptome. In order to design a successful aging intervention, future work will be needed to test how some promising therapies in pre-clinical trials target affect transcriptional rate.
Collapse
Affiliation(s)
- Austin A Sims
- Aging Institute of UPMC and the University of Pittsburgh School of Medicine, 100 Technology Dr, Pittsburgh, PA 15219, USA
| | - Aditi U Gurkar
- Aging Institute of UPMC and the University of Pittsburgh School of Medicine, 100 Technology Dr, Pittsburgh, PA 15219, USA.
| |
Collapse
|
45
|
Ibañez-Solé O, Barrio I, Izeta A. Age or lifestyle-induced accumulation of genotoxicity is associated with a length-dependent decrease in gene expression. iScience 2023; 26:106368. [PMID: 37013186 PMCID: PMC10066539 DOI: 10.1016/j.isci.2023.106368] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 01/26/2023] [Accepted: 03/05/2023] [Indexed: 03/12/2023] Open
Abstract
DNA damage has long been advocated as a molecular driver of aging. DNA damage occurs in a stochastic manner, and is therefore more likely to accumulate in longer genes. The length-dependent accumulation of transcription-blocking damage, unlike that of somatic mutations, should be reflected in gene expression datasets of aging. We analyzed gene expression as a function of gene length in several single-cell RNA sequencing datasets of mouse and human aging. We found a pervasive age-associated length-dependent underexpression of genes across species, tissues, and cell types. Furthermore, we observed length-dependent underexpression associated with UV-radiation and smoke exposure, and in progeroid diseases, Cockayne syndrome, and trichothiodystrophy. Finally, we studied published gene sets showing global age-related changes. Genes underexpressed with aging were significantly longer than overexpressed genes. These data highlight a previously undetected hallmark of aging and show that accumulation of genotoxicity in long genes could lead to reduced RNA polymerase II processivity.
Collapse
Affiliation(s)
- Olga Ibañez-Solé
- Tissue Engineering Group; Biodonostia Health Research Institute, 20014 Donostia-San Sebastián, Spain
| | - Irantzu Barrio
- Department of Mathematics, University of the basque Country UPV/EHU, 48940 Leioa, Spain
- Basque Center for Applied Mathematics, BCAM, 48009 Bilbao, Spain
| | - Ander Izeta
- Tissue Engineering Group; Biodonostia Health Research Institute, 20014 Donostia-San Sebastián, Spain
- Tecnun-University of Navarra, 20018 Donostia-San Sebastián, Spain
| |
Collapse
|
46
|
Mimoso CA, Adelman K. U1 snRNP increases RNA Pol II elongation rate to enable synthesis of long genes. Mol Cell 2023; 83:1264-1279.e10. [PMID: 36965480 PMCID: PMC10135401 DOI: 10.1016/j.molcel.2023.03.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 02/06/2023] [Accepted: 02/28/2023] [Indexed: 03/27/2023]
Abstract
The expansion of introns within mammalian genomes poses a challenge for the production of full-length messenger RNAs (mRNAs), with increasing evidence that these long AT-rich sequences present obstacles to transcription. Here, we investigate RNA polymerase II (RNAPII) elongation at high resolution in mammalian cells and demonstrate that RNAPII transcribes faster across introns. Moreover, we find that this acceleration requires the association of U1 snRNP (U1) with the elongation complex at 5' splice sites. The role of U1 to stimulate elongation rate through introns reduces the frequency of both premature termination and transcriptional arrest, thereby dramatically increasing RNA production. We further show that changes in RNAPII elongation rate due to AT content and U1 binding explain previous reports of pausing or termination at splice junctions and the edge of CpG islands. We propose that U1-mediated acceleration of elongation has evolved to mitigate the risks that long AT-rich introns pose to transcript completion.
Collapse
Affiliation(s)
- Claudia A Mimoso
- Department of Biological Chemistry and Molecular Pharmacology, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Karen Adelman
- Department of Biological Chemistry and Molecular Pharmacology, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA; Ludwig Center at Harvard, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
| |
Collapse
|
47
|
Pasternak AO, Berkhout B. HIV persistence: silence or resistance? Curr Opin Virol 2023; 59:101301. [PMID: 36805974 DOI: 10.1016/j.coviro.2023.101301] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 12/05/2022] [Accepted: 12/19/2022] [Indexed: 02/19/2023]
Abstract
Despite decades of suppressive antiretroviral therapy, human immunodeficiency virus (HIV) reservoirs in infected individuals persist and fuel viral rebound once therapy is interrupted. The persistence of viral reservoirs is the main obstacle to achieving HIV eradication or a long-term remission. The last decade has seen a profound change in our understanding of the mechanisms behind HIV persistence, which appears to be much more complex than originally assumed. In addition to the persistence of transcriptionally silent proviruses in a stable latent reservoir that is invisible to the immune system, HIV is increasingly recognized to persist by resistance to the immune clearance, which appears to play a surprisingly prominent role in shaping the reservoir. In this review, we discuss some emerging insights into the mechanisms of HIV persistence, as well as their implications for the development of strategies towards an HIV cure.
Collapse
Affiliation(s)
- Alexander O Pasternak
- Laboratory of Experimental Virology, Department of Medical Microbiology, Amsterdam UMC, University of Amsterdam, 1105 AZ Amsterdam, the Netherlands.
| | - Ben Berkhout
- Laboratory of Experimental Virology, Department of Medical Microbiology, Amsterdam UMC, University of Amsterdam, 1105 AZ Amsterdam, the Netherlands
| |
Collapse
|
48
|
Stoeger T. The Road Less Traveled: Uncovering the Convergence Toward Specific Pleiotropic Phenotypes in Aging. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.28.534472. [PMID: 37034589 PMCID: PMC10081180 DOI: 10.1101/2023.03.28.534472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]
Abstract
Aging is a complex process influenced by a wide range of environmental and molecular factors. Despite this complexity, individuals tend to age in highly similar ways, leading to the question of what drives this convergence. Recent research, including my own discoveries, suggests that the length of transcript molecules plays a crucial role in age-dependent changes to the transcriptome. Drawing inspiration from the road trip analogy of cellular transcription, I propose that a non-linear scaling law drives convergence towards specific pleiotropic phenotypes in biological aging. This scaling law is based on the notion that molecular changes observed during aging may reflect unspecific damage to cellular physiology. By validating this hypothesis, I can improve our understanding of biological aging and identify new candidate compounds for anti-aging interventions, as well as re-identify one known intervention. This work has actionable implications for improving human health and extending lifespans.
Collapse
Affiliation(s)
- Thomas Stoeger
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
| |
Collapse
|
49
|
Wagner EJ, Tong L, Adelman K. Integrator is a global promoter-proximal termination complex. Mol Cell 2023; 83:416-427. [PMID: 36634676 PMCID: PMC10866050 DOI: 10.1016/j.molcel.2022.11.012] [Citation(s) in RCA: 41] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 11/08/2022] [Accepted: 11/14/2022] [Indexed: 01/13/2023]
Abstract
Integrator is a metazoan-specific protein complex capable of inducing termination at all RNAPII-transcribed loci. Integrator recognizes paused, promoter-proximal RNAPII and drives premature termination using dual enzymatic activities: an endonuclease that cleaves nascent RNA and a protein phosphatase that removes stimulatory phosphorylation associated with RNAPII pause release and productive elongation. Recent breakthroughs in structural biology have revealed the overall architecture of Integrator and provided insights into how multiple Integrator modules are coordinated to elicit termination effectively. Furthermore, functional genomics and biochemical studies have unraveled how Integrator-mediated termination impacts protein-coding and noncoding loci. Here, we review the current knowledge about the assembly and activity of Integrator and describe the role of Integrator in gene regulation, highlighting the importance of this complex for human health.
Collapse
Affiliation(s)
- Eric J Wagner
- Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, NY 14642, USA.
| | - Liang Tong
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA.
| | - Karen Adelman
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
50
|
Gyenis A, Chang J, Demmers JJPG, Bruens ST, Barnhoorn S, Brandt RMC, Baar MP, Raseta M, Derks KWJ, Hoeijmakers JHJ, Pothof J. Genome-wide RNA polymerase stalling shapes the transcriptome during aging. Nat Genet 2023; 55:268-279. [PMID: 36658433 PMCID: PMC9925383 DOI: 10.1038/s41588-022-01279-6] [Citation(s) in RCA: 48] [Impact Index Per Article: 48.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 12/07/2022] [Indexed: 01/21/2023]
Abstract
Gene expression profiling has identified numerous processes altered in aging, but how these changes arise is largely unknown. Here we combined nascent RNA sequencing and RNA polymerase II chromatin immunoprecipitation followed by sequencing to elucidate the underlying mechanisms triggering gene expression changes in wild-type aged mice. We found that in 2-year-old liver, 40% of elongating RNA polymerases are stalled, lowering productive transcription and skewing transcriptional output in a gene-length-dependent fashion. We demonstrate that this transcriptional stress is caused by endogenous DNA damage and explains the majority of gene expression changes in aging in most mainly postmitotic organs, specifically affecting aging hallmark pathways such as nutrient sensing, autophagy, proteostasis, energy metabolism, immune function and cellular stress resilience. Age-related transcriptional stress is evolutionary conserved from nematodes to humans. Thus, accumulation of stochastic endogenous DNA damage during aging deteriorates basal transcription, which establishes the age-related transcriptome and causes dysfunction of key aging hallmark pathways, disclosing how DNA damage functionally underlies major aspects of normal aging.
Collapse
Affiliation(s)
- Akos Gyenis
- Department of Molecular Genetics, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, The Netherlands
- University of Cologne, Faculty of Medicine, Cluster of Excellence for Aging Research, Institute for Genome Stability in Ageing and Disease, Cologne, Germany
| | - Jiang Chang
- Department of Molecular Genetics, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Joris J P G Demmers
- Department of Molecular Genetics, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Serena T Bruens
- Department of Molecular Genetics, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Sander Barnhoorn
- Department of Molecular Genetics, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Renata M C Brandt
- Department of Molecular Genetics, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Marjolein P Baar
- Department of Molecular Genetics, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, The Netherlands
- Center for Molecular Medicine, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Marko Raseta
- Department of Molecular Genetics, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Kasper W J Derks
- Department of Molecular Genetics, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, The Netherlands
- Department of Clinical Genetics and School for Oncology & Developmental Biology, Maastricht University Medical Center, Maastricht, The Netherlands
| | - Jan H J Hoeijmakers
- Department of Molecular Genetics, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, The Netherlands
- University of Cologne, Faculty of Medicine, Cluster of Excellence for Aging Research, Institute for Genome Stability in Ageing and Disease, Cologne, Germany
- Princess Maxima Center for Pediatric Oncology, Oncode Institute, Utrecht, The Netherlands
| | - Joris Pothof
- Department of Molecular Genetics, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, The Netherlands.
| |
Collapse
|