101
|
Woolley SA, Salavati M, Clark EL. Recent advances in the genomic resources for sheep. Mamm Genome 2023; 34:545-558. [PMID: 37752302 PMCID: PMC10627984 DOI: 10.1007/s00335-023-10018-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Accepted: 08/30/2023] [Indexed: 09/28/2023]
Abstract
Sheep (Ovis aries) provide a vital source of protein and fibre to human populations. In coming decades, as the pressures associated with rapidly changing climates increase, breeding sheep sustainably as well as producing enough protein to feed a growing human population will pose a considerable challenge for sheep production across the globe. High quality reference genomes and other genomic resources can help to meet these challenges by: (1) informing breeding programmes by adding a priori information about the genome, (2) providing tools such as pangenomes for characterising and conserving global genetic diversity, and (3) improving our understanding of fundamental biology using the power of genomic information to link cell, tissue and whole animal scale knowledge. In this review we describe recent advances in the genomic resources available for sheep, discuss how these might help to meet future challenges for sheep production, and provide some insight into what the future might hold.
Collapse
Affiliation(s)
- Shernae A Woolley
- The Roslin Institute, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, UK
| | - Mazdak Salavati
- The Roslin Institute, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, UK
- Scotland's Rural College, Parkgate, Barony Campus, Dumfries, DG1 3NE, UK
| | - Emily L Clark
- The Roslin Institute, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, UK.
| |
Collapse
|
102
|
Crowley LM. The genome sequence of a digger wasp, Ectemnius lituratus (Panzer,1805). Wellcome Open Res 2023; 8:552. [PMID: 39188773 PMCID: PMC11345589 DOI: 10.12688/wellcomeopenres.20337.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/02/2023] [Indexed: 08/28/2024] Open
Abstract
We present a genome assembly from an individual female Ectemnius lituratus (a digger wasp; Arthropoda; Insecta; Hymenoptera; Crabronidae). The genome sequence is 235.1 megabases in span. Most of the assembly is scaffolded into 13 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 29.67 kilobases in length. Gene annotation of this assembly on Ensembl identified 9,724 protein coding genes.
Collapse
Affiliation(s)
- Liam M. Crowley
- Department of Biology, University of Oxford, Oxford, England, UK
| | | | | | | | | | | | | | | |
Collapse
|
103
|
Warren WC, Rice ES, Maggs X, Roback E, Keene A, Martin F, Ogeh D, Haggerty L, Carroll RA, McGaugh S, Rohner N. Astyanax mexicanus surface and cavefish chromosome-scale assemblies for trait variation discovery. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.16.567450. [PMID: 38014157 PMCID: PMC10680795 DOI: 10.1101/2023.11.16.567450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
The ability of organisms to adapt to sudden extreme environmental changes produces some of the most drastic examples of rapid phenotypic evolution. The Mexican Tetra, Astyanax mexicanus, is abundant in the surface waters of northeastern Mexico, but repeated colonizations of cave environments have resulted in the independent evolution of troglomorphic phenotypes in several populations. Here, we present three chromosome-scale assemblies of this species, for one surface and two cave populations, enabling the first whole-genome comparisons between independently evolved cave populations to evaluate the genetic basis for the evolution of adaptation to the cave environment. Our assemblies represent the highest quality of sequence completeness with predicted protein-coding and non-coding gene metrics far surpassing prior resources and, to our knowledge, all long-read assembled teleost genomes, including zebrafish. Whole genome synteny alignments show highly conserved gene order among cave forms in contrast to a higher number of chromosomal rearrangements when compared to other phylogenetically close or distant teleost species. By phylogenetically assessing gene orthology across distant branches of amniotes, we discover gene orthogroups unique to A. mexicanus. When compared to a representative surface fish genome, we find a rich amount of structural sequence diversity, defined here as the number and size of insertions and deletions as well as expanding and contracting repeats across cave forms. These new more complete genomic resources ensure higher trait resolution for comparative, functional, developmental, and genetic studies of drastic trait differences within a species.
Collapse
Affiliation(s)
- Wesley C. Warren
- Department of Animal Sciences, Department of Surgery, University of Missouri, Bond Life Sciences Center, Columbia, MO
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO
| | - Edward S. Rice
- Department of Animal Sciences, Department of Surgery, University of Missouri, Bond Life Sciences Center, Columbia, MO
| | - X Maggs
- Department of Animal Sciences, Department of Surgery, University of Missouri, Bond Life Sciences Center, Columbia, MO
| | - Emma Roback
- Department of Ecology, Evolution, and Behavior, University of Minnesota, Saint Paul, MN
| | - Alex Keene
- Department of Biology, Texas AM University, College Station, TX
| | - Fergal Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Denye Ogeh
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Rachel A. Carroll
- Department of Animal Sciences, Department of Surgery, University of Missouri, Bond Life Sciences Center, Columbia, MO
| | - Suzanne McGaugh
- Department of Ecology, Evolution, and Behavior, University of Minnesota, Saint Paul, MN
| | - Nicolas Rohner
- Stowers Institute for Medical Research, Kansas City, MO
- Department of Molecular and Integrative Physiology, KU Medical Center, Kansas City, KS
| |
Collapse
|
104
|
Lees DC. The genome sequence of the Chestnut, Conistra vaccinii (Linnaeus, 1761). Wellcome Open Res 2023; 8:532. [PMID: 38634068 PMCID: PMC11021882 DOI: 10.12688/wellcomeopenres.20346.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/02/2023] [Indexed: 04/19/2024] Open
Abstract
We present a genome assembly from an individual male Conistra vaccinii (the Chestnut; Arthropoda; None; Lepidoptera; Noctuidae). The genome sequence is 720.8 megabases in span. Most of the assembly is scaffolded into 31 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 15.44 kilobases in length. Gene annotation of this assembly on Ensembl identified 13,109 protein coding genes.
Collapse
|
105
|
Boyes D, Blaxter ML. The genome sequence of the Beautiful Hook-tip, Laspeyria flexula (Denis & Schiffermüller, 1775). Wellcome Open Res 2023; 8:529. [PMID: 39309423 PMCID: PMC11413856 DOI: 10.12688/wellcomeopenres.20351.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/02/2023] [Indexed: 09/25/2024] Open
Abstract
We present a genome assembly from an individual male Laspeyria flexula (the Beautiful Hook-tip; Arthropoda; Insecta; Lepidoptera; Erebidae). The genome sequence is 450.9 megabases in span. Most of the assembly is scaffolded into 31 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 15.58 kilobases in length. Gene annotation of this assembly on Ensembl identified 13,281 protein coding genes.
Collapse
Affiliation(s)
- Douglas Boyes
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | - Mark L Blaxter
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | | | | | | | | | | | | |
Collapse
|
106
|
Boyes D. The genome sequence of the Six-belted Clearwing, Bembecia ichneumoniformis (Denis & Schiffermüller, 1775). Wellcome Open Res 2023; 8:515. [PMID: 38813550 PMCID: PMC11134158 DOI: 10.12688/wellcomeopenres.20279.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/18/2023] [Indexed: 05/31/2024] Open
Abstract
We present a genome assembly from an individual male Bembecia ichneumoniformis (the Six-belted Clearwing; Arthropoda; Insecta; Lepidoptera; Sesiidae). The genome sequence is 511.4 megabases in span. Most of the assembly is scaffolded into 31 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 15.32 kilobases in length. Gene annotation of this assembly on Ensembl identified 12,114 protein coding genes.
Collapse
Affiliation(s)
- Douglas Boyes
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | | | | | | | | | | | | |
Collapse
|
107
|
Ballouz S, Kawaguchi RK, Pena MT, Fischer S, Crow M, French L, Knight FM, Adams LB, Gillis J. The transcriptional legacy of developmental stochasticity. Nat Commun 2023; 14:7226. [PMID: 37940702 PMCID: PMC10632366 DOI: 10.1038/s41467-023-43024-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2022] [Accepted: 10/30/2023] [Indexed: 11/10/2023] Open
Abstract
Genetic and environmental variation are key contributors during organism development, but the influence of minor perturbations or noise is difficult to assess. This study focuses on the stochastic variation in allele-specific expression that persists through cell divisions in the nine-banded armadillo (Dasypus novemcinctus). We investigated the blood transcriptome of five wild monozygotic quadruplets over time to explore the influence of developmental stochasticity on gene expression. We identify an enduring signal of autosomal allelic variability that distinguishes individuals within a quadruplet despite their genetic similarity. This stochastic allelic variation, akin to X-inactivation but broader, provides insight into non-genetic influences on phenotype. The presence of stochastically canalized allelic signatures represents a novel axis for characterizing organismal variability, complementing traditional approaches based on genetic and environmental factors. We also developed a model to explain the inconsistent penetrance associated with these stochastically canalized allelic expressions. By elucidating mechanisms underlying the persistence of allele-specific expression, we enhance understanding of development's role in shaping organismal diversity.
Collapse
Affiliation(s)
- Sara Ballouz
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- School of Computer Science and Engineering, Faculty of Engineering, University of New South Wales Sydney, Sydney, NSW, Australia
| | - Risa Karakida Kawaguchi
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- Center for iPS Cell Research and Application, Kyoto University, Kyoto, Japan
| | - Maria T Pena
- US Department of Health and Human Services, Health Resources and Services Administration, Healthcare System Bureau, National Hansen's Disease Program, Baton Rouge, LA, 70803, USA
| | - Stephan Fischer
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- Institut Pasteur, Université Paris Cité, Bioinformatics and Biostatistics Hub, Paris, F-75015, France
| | - Megan Crow
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- Genentech, Inc., South San Francisco, CA, USA
| | - Leon French
- Physiology Department and Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
| | | | - Linda B Adams
- US Department of Health and Human Services, Health Resources and Services Administration, Healthcare System Bureau, National Hansen's Disease Program, Baton Rouge, LA, 70803, USA
| | - Jesse Gillis
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA.
- Physiology Department and Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada.
| |
Collapse
|
108
|
Sivell O, Levey B, Barclay MVL. The genome sequence of a darkling beetle, Lagria hirta (Linnaeus, 1758). Wellcome Open Res 2023; 8:501. [PMID: 38434741 PMCID: PMC10905005 DOI: 10.12688/wellcomeopenres.20232.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/13/2023] [Indexed: 03/05/2024] Open
Abstract
We present a genome assembly from an individual male Lagria hirta (darkling beetle; Arthropoda; Insecta; Coleoptera; Tenebrionidae). The genome sequence is 336.8 megabases in span. Most of the assembly is scaffolded into 12 chromosomal pseudomolecules, including the X and Y sex chromosomes. The mitochondrial genome has also been assembled and is 16.12 kilobases in length. Gene annotation of this assembly on Ensembl identified 12,850protein coding genes.
Collapse
Affiliation(s)
| | - Brian Levey
- Natural History Museum, London, England, UK
- National Museum of Wales, Cardiff, Wales, UK
| | | | | | | | | | | | | | | |
Collapse
|
109
|
Crowley LM. The genome sequence of the Saxon Wasp, Dolichovespula saxonica (Fabricius, 1793). Wellcome Open Res 2023; 8:498. [PMID: 38371738 PMCID: PMC10873546 DOI: 10.12688/wellcomeopenres.20180.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/12/2023] [Indexed: 02/20/2024] Open
Abstract
We present a genome assembly from an individual male Dolichovespula saxonica (the Saxon Wasp; Arthropoda; Insecta; Hymenoptera; Vespidae). The genome sequence is 221.8 megabases in span. Most of the assembly is scaffolded into 26 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 18.97 kilobases in length. Gene annotation of this assembly on Ensembl identified 10,856 protein coding genes.
Collapse
Affiliation(s)
- Liam M. Crowley
- Department of Biology, University of Oxford, Oxford, England, UK
| | | | | | | | | | | | | |
Collapse
|
110
|
Crowley LM. The genome sequence of Fabricius' Nomad Bee, Nomada fabriciana (Linne, 1767). Wellcome Open Res 2023; 8:497. [PMID: 38558924 PMCID: PMC10980859 DOI: 10.12688/wellcomeopenres.20184.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/12/2023] [Indexed: 04/04/2024] Open
Abstract
We present a genome assembly from an individual female Nomada fabriciana (Fabricius' Nomad Bee; Arthropoda; Insecta; Hymenoptera; Apidae). The genome sequence is 233.6 megabases in span. Most of the assembly is scaffolded into 12 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 19.4 kilobases in length. Gene annotation of this assembly on Ensembl identified 9,700 protein coding genes.
Collapse
Affiliation(s)
- Liam M. Crowley
- Department of Biology, University of Oxford, Oxford, England, UK
| | | | | | | | | | | | | |
Collapse
|
111
|
Crowley LM. The genome sequence of a digger wasp, Ectemnius continuus (Fabricius, 1804). Wellcome Open Res 2023; 8:469. [PMID: 38533438 PMCID: PMC10964002 DOI: 10.12688/wellcomeopenres.20138.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/06/2023] [Indexed: 03/28/2024] Open
Abstract
We present a genome assembly from an individual female Ectemnius continuus (digger wasp; Arthropoda; Insecta; Hymenoptera; Crabronidae). The genome sequence is 260.3 megabases in span. Most of the assembly is scaffolded into 14 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 27.05 kilobases in length. Gene annotation of this assembly on Ensembl identified 9,835 protein coding genes.
Collapse
Affiliation(s)
- Liam M. Crowley
- Department of Biology, University of Oxford, Oxford, England, UK
| | | | | | | | | | | | | |
Collapse
|
112
|
Falk S, Green A. The genome sequence of the small wasp-sawfly, Tenthredo distinguenda (R. Stein, 1885). Wellcome Open Res 2023; 8:459. [PMID: 38779059 PMCID: PMC11109533 DOI: 10.12688/wellcomeopenres.19523.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/31/2023] [Indexed: 05/25/2024] Open
Abstract
We present a genome assembly from an individual male Tenthredo distinguenda (the small wasp-sawfly; Arthropoda; Insecta; Hymenoptera; Tenthredinidae). The genome sequence is 229.4 megabases in span. Most of the assembly is scaffolded into 9 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 31.6 kilobases in length. Gene annotation of this assembly on Ensembl identified 11,332 protein coding genes.
Collapse
Affiliation(s)
- Steven Falk
- Independent researcher, Kenilworth, England, UK
| | | | | | | | | | | | | | | |
Collapse
|
113
|
Boyes D, Boyes C. The genome sequence of the Ingrailed Clay, Diarsia mendica (Fabricius, 1775). Wellcome Open Res 2023; 8:448. [PMID: 38333733 PMCID: PMC10850848 DOI: 10.12688/wellcomeopenres.20106.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/21/2023] [Indexed: 02/10/2024] Open
Abstract
We present a genome assembly from an individual male Diarsia mendica (the Ingrailed Clay; Arthropoda; Insecta; Lepidoptera; Noctuidae). The genome sequence is 727.9 megabases in span. Most of the assembly is scaffolded into 32 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 15.37 kilobases in length. Gene annotation of this assembly on Ensembl identified 14,077 protein coding genes.
Collapse
Affiliation(s)
- Douglas Boyes
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | - Clare Boyes
- Independent researcher, Welshpool, Wales, UK
| | | | | | | | | | | | | |
Collapse
|
114
|
Widjaya MA, Liu CH, Lee SD, Cheng WC. Transcriptomics Meta-Analysis Reveals Phagosome and Innate Immune System Dysfunction as Potential Mechanisms in the Cortex of Alzheimer's Disease Mouse Strains. J Mol Neurosci 2023; 73:773-786. [PMID: 37733230 DOI: 10.1007/s12031-023-02152-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 08/30/2023] [Indexed: 09/22/2023]
Abstract
Immune-related pathways can affect the immune system directly, such as the chemokine signaling pathway, or indirectly, such as the phagosome pathway. Alzheimer's disease (AD) is reportedly associated with several immune-related pathways. However, exploring its underlying mechanism is challenging in animal studies because AD mouse strains differentially express immune-related pathway characteristics. To overcome this problem, we performed a meta-analysis to identify significant and consistent immune-related AD pathways that are expressed in different AD mouse strains. Next-generation RNA sequencing (RNA-seq) and microarray datasets for the cortex of AD mice from different strains such as APP/PSEN1, APP/PS2, 3xTg, TREM, and 5xFAD were collected from the NCBI GEO database. Each dataset's quality control and normalization were already processed from each original study source using various methods depending on the high-throughput analysis platform (FastQC, median of ratios, RMA, between array normalization). Datasets were analyzed using DESeq2 for RNA-seq and GEO2R for microarray to identify differentially expressed (DE) genes. Significantly DE genes were meta-analyzed using Stouffer's method, with significant genes further analyzed for functional enrichment. Ten datasets representing 20 conditions were obtained from the NCBI GEO database, comprising 116 control and 120 AD samples. The DE analysis identified 284 significant DE genes. The meta-analysis identified three significantly enriched immune-related AD pathways: phagosome, the complement and coagulation cascade, and chemokine signaling. Phagosomes-related genes correlated with complement and immune system. Meanwhile, phagosomes and chemokine signaling genes overlapped with B cells receptors pathway genes indicating potential correlation between phagosome, chemokines, and adaptive immune system as well. The transcriptomic meta-analysis showed that AD is associated with immune-related pathways in the brain's cortex through the phagosome, complement and coagulation cascade, and chemokine signaling pathways. Interestingly, phagosome and chemokine signaling pathways had potential correlation with B cells receptors pathway.
Collapse
Affiliation(s)
- Michael Anekson Widjaya
- Graduate Institute of Biomedical Sciences, College of Medicine, China Medical University, Taichung, 40402, Taiwan
| | - Chia-Hsin Liu
- Cancer Biology and Precision Therapeutics Center, China Medical University and Academia Sinica China Medical University, Taichung, 40403, Taiwan
| | - Shin-Da Lee
- Department of Physical Therapy, PhD program in Healthcare Science, China Medical University, Taichung, 406040, Taiwan.
| | - Wei-Chung Cheng
- Cancer Biology and Precision Therapeutics Center, China Medical University and Academia Sinica China Medical University, Taichung, 40403, Taiwan.
- Ph.D. Program for Cancer Biology and Drug Discovery, China Medical University and Academia Sinica, Taichung, Taiwan.
| |
Collapse
|
115
|
de Jong TV, Pan Y, Rastas P, Munro D, Tutaj M, Akil H, Benner C, Chen D, Chitre AS, Chow W, Colonna V, Dalgard CL, Demos WM, Doris PA, Garrison E, Geurts AM, Gunturkun HM, Guryev V, Hourlier T, Howe K, Huang J, Kalbfleisch T, Kim P, Li L, Mahaffey S, Martin FJ, Mohammadi P, Ozel AB, Polesskaya O, Pravenec M, Prins P, Sebat J, Smith JR, Solberg Woods LC, Tabakoff B, Tracey A, Uliano-Silva M, Villani F, Wang H, Sharp BM, Telese F, Jiang Z, Saba L, Wang X, Murphy TD, Palmer AA, Kwitek AE, Dwinell MR, Williams RW, Li JZ, Chen H. A revamped rat reference genome improves the discovery of genetic diversity in laboratory rats. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.13.536694. [PMID: 37214860 PMCID: PMC10197727 DOI: 10.1101/2023.04.13.536694] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The seventh iteration of the reference genome assembly for Rattus norvegicus-mRatBN7.2-corrects numerous misplaced segments and reduces base-level errors by approximately 9-fold and increases contiguity by 290-fold compared to its predecessor. Gene annotations are now more complete, significantly improving the mapping precision of genomic, transcriptomic, and proteomics data sets. We jointly analyzed 163 short-read whole genome sequencing datasets representing 120 laboratory rat strains and substrains using mRatBN7.2. We defined ~20.0 million sequence variations, of which 18.7 thousand are predicted to potentially impact the function of 6,677 genes. We also generated a new rat genetic map from 1,893 heterogeneous stock rats and annotated transcription start sites and alternative polyadenylation sites. The mRatBN7.2 assembly, along with the extensive analysis of genomic variations among rat strains, enhances our understanding of the rat genome, providing researchers with an expanded resource for studies involving rats.
Collapse
Affiliation(s)
- Tristan V de Jong
- Department of Pharmacology, Addiction Science, and Toxicology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Yanchao Pan
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Pasi Rastas
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Daniel Munro
- Department of Psychiatry, University of California San Diego, San Diego, CA, USA
- Department of Integrative Structural and Computational Biology, Scripps Research, San Diego, CA, USA
| | - Monika Tutaj
- Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA
- Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Huda Akil
- Michigan Neuroscience Institute, University of Michigan, Ann Arbor, MI, USA
| | - Chris Benner
- Department of Medicine, University of California San Diego, San Diego, CA, USA
| | - Denghui Chen
- Department of Psychiatry, University of California San Diego, San Diego, CA, USA
| | - Apurva S Chitre
- Department of Psychiatry, University of California San Diego, San Diego, CA, USA
| | - William Chow
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK
| | - Vincenza Colonna
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Clifton L Dalgard
- Department of Anatomy, Physiology & Genetics; The American Genome Center, Uniformed Services University of the Health Sciences, Washington DC, USA
| | - Wendy M Demos
- Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA
- Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Peter A Doris
- The Brown Foundation Institute of Molecular Medicine, Center For Human Genetics, University of Texas Health Science Center, Houston, TX, USA
| | - Erik Garrison
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Aron M Geurts
- Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Hakan M Gunturkun
- Department of Pharmacology, Addiction Science, and Toxicology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Victor Guryev
- Genome Structure and Ageing, University of Groningen, UMC Groningen, The Netherlands
| | - Thibaut Hourlier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus in Hinxton, Cambridgeshire, UK
| | - Kerstin Howe
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK
| | - Jun Huang
- Department of Pharmacology, Addiction Science, and Toxicology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Ted Kalbfleisch
- Gluck Equine Research Center, Department of Veterinary Science, University of Kentucky, Louisville, KY, USA
| | - Panjun Kim
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Ling Li
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Center for Proteomics and Metabolomics, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Spencer Mahaffey
- Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus in Hinxton, Cambridgeshire, UK
| | - Pejman Mohammadi
- Center for Immunity and Immunotherapies, Seattle Children’s Research Institute, Seattle, WA, USA
- Department of Pediatrics, University of Washington School of Medicine, Seattle, WA, USA
| | - Ayse Bilge Ozel
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Oksana Polesskaya
- Department of Psychiatry, University of California San Diego, San Diego, CA, USA
| | - Michal Pravenec
- Institute of Physiology, Czech Academy of Sciences, Prague, Czechia
| | - Pjotr Prins
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Jonathan Sebat
- Department of Psychiatry, University of California San Diego, San Diego, CA, USA
| | - Jennifer R Smith
- Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA
- Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Leah C Solberg Woods
- Department of Internal Medicine, Section on Molecular Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Boris Tabakoff
- Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Alan Tracey
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK
| | | | - Flavia Villani
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Hongyang Wang
- Department of Animal Sciences, Washington State University, Pullman, WA, USA
| | - Burt M Sharp
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Francesca Telese
- Department of Psychiatry, University of California San Diego, San Diego, CA, USA
| | - Zhihua Jiang
- Department of Animal Sciences, Washington State University, Pullman, WA, USA
| | - Laura Saba
- Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Xusheng Wang
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Center for Proteomics and Metabolomics, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Terence D Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Abraham A Palmer
- Department of Psychiatry, University of California San Diego, San Diego, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Anne E Kwitek
- Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA
- Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Melinda R Dwinell
- Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA
- Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Robert W Williams
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Jun Z Li
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Hao Chen
- Department of Pharmacology, Addiction Science, and Toxicology, University of Tennessee Health Science Center, Memphis, TN, USA
| |
Collapse
|
116
|
Iskusnykh IY, Fattakhov N, Li Y, Bihannic L, Kirchner MK, Steshina EY, Northcott PA, Chizhikov VV. Lmx1a is a master regulator of the cortical hem. eLife 2023; 12:e84095. [PMID: 37725078 PMCID: PMC10508884 DOI: 10.7554/elife.84095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 09/05/2023] [Indexed: 09/21/2023] Open
Abstract
Development of the nervous system depends on signaling centers - specialized cellular populations that produce secreted molecules to regulate neurogenesis in the neighboring neuroepithelium. In some cases, signaling center cells also differentiate to produce key types of neurons. The formation of a signaling center involves its induction, the maintenance of expression of its secreted molecules, and cell differentiation and migration events. How these distinct processes are coordinated during signaling center development remains unknown. By performing studies in mice, we show that Lmx1a acts as a master regulator to orchestrate the formation and function of the cortical hem (CH), a critical signaling center that controls hippocampus development. Lmx1a co-regulates CH induction, its Wnt signaling, and the differentiation and migration of CH-derived Cajal-Retzius neurons. Combining RNAseq, genetic, and rescue experiments, we identified major downstream genes that mediate distinct Lmx1a-dependent processes. Our work revealed that signaling centers in the mammalian brain employ master regulatory genes and established a framework for analyzing signaling center development.
Collapse
Affiliation(s)
- Igor Y Iskusnykh
- Department of Anatomy and Neurobiology, University of Tennessee Health Science CenterMemphisUnited States
| | - Nikolai Fattakhov
- Department of Anatomy and Neurobiology, University of Tennessee Health Science CenterMemphisUnited States
| | - Yiran Li
- Department of Developmental Neurobiology, St. Jude Children's Research HospitalMemphisUnited States
| | - Laure Bihannic
- Department of Developmental Neurobiology, St. Jude Children's Research HospitalMemphisUnited States
| | - Matthew K Kirchner
- Department of Anatomy and Neurobiology, University of Tennessee Health Science CenterMemphisUnited States
| | - Ekaterina Y Steshina
- Department of Anatomy and Neurobiology, University of Tennessee Health Science CenterMemphisUnited States
| | - Paul A Northcott
- Department of Developmental Neurobiology, St. Jude Children's Research HospitalMemphisUnited States
| | - Victor V Chizhikov
- Department of Anatomy and Neurobiology, University of Tennessee Health Science CenterMemphisUnited States
| |
Collapse
|
117
|
Boyes D. The genome sequence of the Diamondback Moth, Plutella xylostella (Linnaeus, 1758). Wellcome Open Res 2023; 8:404. [PMID: 38779055 PMCID: PMC11109710 DOI: 10.12688/wellcomeopenres.20006.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/06/2023] [Indexed: 05/25/2024] Open
Abstract
We present a genome assembly from an individual male Plutella xylostella (the Diamondback Moth; Arthropoda; Insecta; Lepidoptera; Plutellidae). The genome sequence is 323.3 megabases in span. Most of the assembly is scaffolded into 31 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 35.12 kilobases in length. Gene annotation of this assembly on Ensembl identified 17,190 protein coding genes.
Collapse
Affiliation(s)
- Douglas Boyes
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | | | | | | | | | | | | |
Collapse
|
118
|
Liu B, Li Y, Xu Y, Xue W, Jin Z. Jian Yun Qing Hua Decoction inhibits malignant behaviors of gastric carcinoma cells via COL12A1 mediated ferroptosis signal pathway. Chin Med 2023; 18:118. [PMID: 37700383 PMCID: PMC10496189 DOI: 10.1186/s13020-023-00799-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Accepted: 07/12/2023] [Indexed: 09/14/2023] Open
Abstract
BACKGROUND Jian Yun Qing Hua Decoction (JYQHD), a traditional Chinese medicine decoction, which has been applied in the treatment of gastric cancer (GC). We attempt to confirm the anti-gastric cancer effect of JYQHD and explore the mechanism of JYQHD. METHODS Acute toxicity test was used to understand the toxicity of JYQHD. We studied the expression and prognostic outcome of COL12A1 within GC tissues through the network databases. Using several web-based databases, we analyzed the major components and targets of JYQHD, as well as known therapeutic targets in gastric cancer. The Venn diagram was utilized to obtain the overlapped genes. Lentiviral vector, shRNAs and plasmids, were used to transfect GC cells. Cell counting kit-8 (CCK8), sphere formation, malondialdehyde (MDA), glutathione (GSH), reactive oxygen species (ROS), Fe2+, transmission electron microscopy (TEM), quantitative Real-Time Polymerase Chain Reaction (qRT-PCR), Western-Blot (WB), and immunohistochemical (IHC) assays were employed to investigate the role and mechanism of COL12A1 and JYQHD in GC. RESULTS The results showed that JYQHD was non-toxic and safe. JYQHD inhibited growth and sphere formation ability through inducing the ferroptosis of GC cells, and suppressed the GC cells induced subcutaneous xenograft tumor growth. COL12A1 was highly expressed in gastric cancer tissues, indicating poor prognosis. COL12A1 specifically enhanced GC cell progression and stemness via suppressing ferroptosis. JYQHD down-regulated COL12A1 in order to suppress the stemness of GC cells via inducing ferroptosis. CONCLUSION COL12A1 inhibited ferroptosis and enhanced stemness in GC cells. JYQHD inhibited the development of GC cells by inhibiting cancer cell stemness via the ferroptosis pathway mediated by COL12A1.
Collapse
Affiliation(s)
- Baoxinzi Liu
- Department of Medical Oncology, Jiangsu Province Hospital of Chinese Medicine, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210029, China
| | - Yu Li
- Department of Medical Oncology, Jiangsu Province Hospital of Chinese Medicine, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210029, China
| | - Yuanyuan Xu
- Department of Medical Oncology, Jiangsu Province Hospital of Chinese Medicine, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210029, China
| | - Weiwei Xue
- Department of Medical Oncology, Jiangsu Province Hospital of Chinese Medicine, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210029, China.
| | - Zhichao Jin
- Department of Medical Oncology, Jiangsu Province Hospital of Chinese Medicine, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210029, China.
| |
Collapse
|
119
|
Boyes D, Mulley JF. The genome sequence of the Six-striped Rustic, Xestia sexstrigata (Haworth, 1809). Wellcome Open Res 2023; 8:399. [PMID: 38779048 PMCID: PMC11109539 DOI: 10.12688/wellcomeopenres.19988.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/04/2023] [Indexed: 05/25/2024] Open
Abstract
We present a genome assembly from an individual female Xestia sexstrigata (the Six-striped Rustic; Arthropoda; Insecta; Lepidoptera; Noctuidae). The genome sequence is 638.3 megabases in span. Most of the assembly is scaffolded into 32 chromosomal pseudomolecules, including the W and Z sex chromosomes. The mitochondrial genome has also been assembled and is 15.36 kilobases in length. Gene annotation of this assembly on Ensembl identified 15,104 protein coding genes.
Collapse
Affiliation(s)
- Douglas Boyes
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | | | | | | | | | | | | | | |
Collapse
|
120
|
Crowley LM. The genome sequence of the Orange-tailed Mining Bee, Andrena haemorrhoa (Fabricius, 1781). Wellcome Open Res 2023; 8:396. [PMID: 38813552 PMCID: PMC11134145 DOI: 10.12688/wellcomeopenres.19982.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/04/2023] [Indexed: 05/31/2024] Open
Abstract
We present a genome assembly from an individual female Andrena haemorrhoa (the Orange-tailed Mining Bee; Arthropoda; Insecta; Hymenoptera; Andrenidae). The genome sequence is 330.7 megabases in span. Most of the assembly is scaffolded into 7 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 16.46 kilobases in length. Gene annotation of this assembly on Ensembl identified 10,908 protein coding genes.
Collapse
|
121
|
Falk S, Monks J. The genome sequence of the painted nomad bee, Nomada fucata (Panzer, 1798). Wellcome Open Res 2023; 8:395. [PMID: 38385126 PMCID: PMC10879756 DOI: 10.12688/wellcomeopenres.19927.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/18/2023] [Indexed: 02/23/2024] Open
Abstract
We present a genome assembly from an individual female Nomada fucata (the painted nomad bee; Arthropoda; Insecta; Hymenoptera; Apidae). The genome sequence is 264.1 megabases in span. Most of the assembly is scaffolded into 8 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 22.06 kilobases in length. Gene annotation of this assembly on Ensembl identified 10,173 protein coding genes.
Collapse
Affiliation(s)
- Steven Falk
- Independent researcher, Kenilworth, England, UK
| | | | | | | | | | | | | | | |
Collapse
|
122
|
Williams AS, Wilk EJ, Fisher JL, Lasseigne BN. Evaluating cancer cell line and patient-derived xenograft recapitulation of tumor and non-diseased tissue gene expression profiles in silico. Cancer Rep (Hoboken) 2023; 6:e1874. [PMID: 37533331 PMCID: PMC10480419 DOI: 10.1002/cnr2.1874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 06/15/2023] [Accepted: 07/17/2023] [Indexed: 08/04/2023] Open
Abstract
BACKGROUND Preclinical models like cancer cell lines and patient-derived xenografts (PDXs) are vital for studying disease mechanisms and evaluating treatment options. It is essential that they accurately recapitulate the disease state of interest to generate results that will translate in the clinic. Prior studies have demonstrated that preclinical models do not recapitulate all biological aspects of human tissues, particularly with respect to the tissue of origin gene expression signatures. Therefore, it is critical to assess how well preclinical model gene expression profiles correlate with human cancer tissues to inform preclinical model selection and data analysis decisions. AIMS Here we evaluated how well preclinical models recapitulate human cancer and non-diseased tissue gene expression patterns in silico with respect to the full gene expression profile as well as subsetting by the most variable genes, genes significantly correlated with tumor purity, and tissue-specific genes. METHODS By using publicly available gene expression profiles across multiple sources, we evaluated cancer cell line and patient-derived xenograft recapitulation of tumor and non-diseased tissue gene expression profiles in silico. RESULTS We found that using the full gene set improves correlations between preclinical model and tissue global gene expression profiles, confirmed that glioblastoma (GBM) PDX global gene expression correlation to GBM tumor global gene expression outperforms GBM cell line to GBM tumor global gene expression correlations, and demonstrated that preclinical models in our study often failed to reproduce tissue-specific expression. While including additional genes for global gene expression comparison between cell lines and tissues decreases the overall correlation, it improves the relative rank between a cell line and its tissue of origin compared to other tissues. Our findings underscore the importance of using the full gene expression set measured when comparing preclinical models and tissues and confirm that tissue-specific patterns are better preserved in GBM PDX models than in GBM cell lines. CONCLUSION Future studies can build on these findings to determine the specific pathways and gene sets recapitulated by particular preclinical models to facilitate model selection for a given study design or goal.
Collapse
Affiliation(s)
- Avery S. Williams
- The Department of Cell, Developmental and Integrative Biology, Heersink School of MedicineThe University of Alabama at BirminghamBirminghamAlabamaUSA
| | - Elizabeth J. Wilk
- The Department of Cell, Developmental and Integrative Biology, Heersink School of MedicineThe University of Alabama at BirminghamBirminghamAlabamaUSA
| | - Jennifer L. Fisher
- The Department of Cell, Developmental and Integrative Biology, Heersink School of MedicineThe University of Alabama at BirminghamBirminghamAlabamaUSA
| | - Brittany N. Lasseigne
- The Department of Cell, Developmental and Integrative Biology, Heersink School of MedicineThe University of Alabama at BirminghamBirminghamAlabamaUSA
| |
Collapse
|
123
|
Lawniczak MK, Crowley LM, McAlister E. The genome sequence of the dark-edged bee fly, Bombylius major (Linnaeus, 1758). Wellcome Open Res 2023; 8:379. [PMID: 38533437 PMCID: PMC10964003 DOI: 10.12688/wellcomeopenres.19804.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/27/2023] [Indexed: 03/28/2024] Open
Abstract
We present a genome assembly from an individual male Bombylius major (the dark-edged bee fly; Arthropoda; Insecta; Diptera; Bombyliidae). The genome sequence is 304.3 megabases in span. The whole assembly is scaffolded into 7 chromosomal pseudomolecules, including the X and Y sex chromosomes. The mitochondrial genome has also been assembled and is 17.8 kilobases in length. Gene annotation of this assembly on Ensembl identified 10,852 protein coding genes.
Collapse
Affiliation(s)
| | - Liam M. Crowley
- Department of Biology, University of Oxford, Oxford, England, UK
| | | | - University of Oxford and Wytham Woods Genome Acquisition Lab
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
- Department of Biology, University of Oxford, Oxford, England, UK
- Natural History Museum, London, England, UK
| | - Darwin Tree of Life Barcoding collective
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
- Department of Biology, University of Oxford, Oxford, England, UK
- Natural History Museum, London, England, UK
| | - Wellcome Sanger Institute Tree of Life programme
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
- Department of Biology, University of Oxford, Oxford, England, UK
- Natural History Museum, London, England, UK
| | - Wellcome Sanger Institute Scientific Operations: DNA Pipelines collective
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
- Department of Biology, University of Oxford, Oxford, England, UK
- Natural History Museum, London, England, UK
| | - Tree of Life Core Informatics collective
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
- Department of Biology, University of Oxford, Oxford, England, UK
- Natural History Museum, London, England, UK
| | | |
Collapse
|
124
|
Falk S, Mulley JF. The genome sequence of the short-fringed mining bee, Andrena dorsata (Kirby, 1802). Wellcome Open Res 2023; 8:373. [PMID: 39114811 PMCID: PMC11305455 DOI: 10.12688/wellcomeopenres.19756.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/19/2023] [Indexed: 08/10/2024] Open
Abstract
We present a genome assembly from an individual female Andrena dorsata (the short-fringed mining bee; Arthropoda; Insecta; Hymenoptera; Andrenidae). The genome sequence is 277.3 megabases in span. Most of the assembly is scaffolded into 3 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 16.11 kilobases in length. Gene annotation of this assembly on Ensembl identified 10,916 protein coding genes.
Collapse
Affiliation(s)
- Steven Falk
- Independent researcher, Kenilworth, England, UK
| | | | | | | | | | | | | | | |
Collapse
|
125
|
Boyes D, Eljounaidi K. The genome sequence of the Beautiful Golden Y, Autographa pulchrina (Haworth, 1809). Wellcome Open Res 2023; 8:375. [PMID: 39830091 PMCID: PMC11739698 DOI: 10.12688/wellcomeopenres.19840.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/02/2023] [Indexed: 01/22/2025] Open
Abstract
We present a genome assembly from an individual female Autographa pulchrina (the Beautiful Golden Y; Arthropoda; Insecta; Lepidoptera; Noctuidae). The genome sequence is 426.2 megabases in span. Most of the assembly is scaffolded into 32 chromosomal pseudomolecules, including the Z and W sex chromosomes. The mitochondrial genome has also been assembled and is 15.25 kilobases in length. Gene annotation of this assembly on Ensembl identified 12,916 protein coding genes.
Collapse
Affiliation(s)
- Douglas Boyes
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | | | | | | | | | | | | | | |
Collapse
|
126
|
Short S, Green Etxabe A, Robinson A, Spurgeon D, Kille P. The genome sequence of the red compost earthworm, Lumbricus rubellus (Hoffmeister, 1843). Wellcome Open Res 2023; 8:354. [PMID: 38618197 PMCID: PMC11015115 DOI: 10.12688/wellcomeopenres.19834.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/02/2023] [Indexed: 04/16/2024] Open
Abstract
We present a genome assembly from an individual Lumbricus rubellus (the red compost earthworm; Annelida; Clitellata; Haplotaxida; Lumbricidae). The genome sequence is 787.5 megabases in span. Most of the assembly is scaffolded into 18 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 15.81 kilobases in length. Gene annotation of this assembly on Ensembl identified 33,426 protein coding genes.
Collapse
Affiliation(s)
- Stephen Short
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | | | - Alex Robinson
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | - David Spurgeon
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | | | | | | | | | | |
Collapse
|
127
|
Crowley LM, Ashworth M, Wawman DC. The genome sequence of the Thick-legged Hoverfly, Syritta pipiens (Linnaeus, 1758). Wellcome Open Res 2023; 8:349. [PMID: 39114817 PMCID: PMC11303951 DOI: 10.12688/wellcomeopenres.19848.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/08/2023] [Indexed: 08/10/2024] Open
Abstract
We present a genome assembly from an individual female Syritta pipiens (the Thick-legged Hoverfly; Arthropoda; Insecta; Diptera; Syrphidae). The genome sequence is 318.5 megabases in span. Most of the assembly is scaffolded into 5 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 15.76 kilobases in length. Gene annotation of this assembly on Ensembl identified 18,405 protein coding genes.
Collapse
|
128
|
Hayward A, Lohse K, Vila R, Laetsch DR, Hedlund JS. The genome sequence of the Brown Argus, Aricia agestis (Denis & Schiffermüller, 1775). Wellcome Open Res 2023; 8:336. [PMID: 38404635 PMCID: PMC10884594 DOI: 10.12688/wellcomeopenres.19784.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/24/2023] [Indexed: 02/27/2024] Open
Abstract
We present genome assemblies from two male Aricia agestis specimens (the Brown Argus; Arthropoda; Insecta; Lepidoptera; Lycaenidae). The genome sequences are 435.3 and 437.4 megabases in span. Each assembly is scaffolded into 23 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genomes were assembled and are 15.47 and 15.45 kilobases in length. Gene annotation of these assemblies on Ensembl identified 12,688 and 12,654 protein coding genes.
Collapse
Affiliation(s)
- Alex Hayward
- College of Life and Environmental Sciences, University of Exeter, Exeter, England, UK
| | - Konrad Lohse
- Institute of Ecology and Evolution, The University of Edinburgh, Edinburgh, Scotland, UK
| | - Roger Vila
- Institut de Biologia Evolutiva, CSIC - Universitat Pompeu Fabra, Barcelona, Spain
| | - Dominik R. Laetsch
- Institute of Ecology and Evolution, The University of Edinburgh, Edinburgh, Scotland, UK
| | | | - Wellcome Sanger Institute Tree of Life programme
- College of Life and Environmental Sciences, University of Exeter, Exeter, England, UK
- Institute of Ecology and Evolution, The University of Edinburgh, Edinburgh, Scotland, UK
- Institut de Biologia Evolutiva, CSIC - Universitat Pompeu Fabra, Barcelona, Spain
- Department of Evolutionary Ecology, Lund University, Lund, Sweden
| | - Wellcome Sanger Institute Scientific Operations: DNA Pipelines collective
- College of Life and Environmental Sciences, University of Exeter, Exeter, England, UK
- Institute of Ecology and Evolution, The University of Edinburgh, Edinburgh, Scotland, UK
- Institut de Biologia Evolutiva, CSIC - Universitat Pompeu Fabra, Barcelona, Spain
- Department of Evolutionary Ecology, Lund University, Lund, Sweden
| | - Tree of Life Core Informatics collective
- College of Life and Environmental Sciences, University of Exeter, Exeter, England, UK
- Institute of Ecology and Evolution, The University of Edinburgh, Edinburgh, Scotland, UK
- Institut de Biologia Evolutiva, CSIC - Universitat Pompeu Fabra, Barcelona, Spain
- Department of Evolutionary Ecology, Lund University, Lund, Sweden
| | | |
Collapse
|
129
|
Jung YH, Wang HLV, Ali S, Corces VG, Kremsky I. Characterization of a strain-specific CD-1 reference genome reveals potential inter- and intra-strain functional variability. BMC Genomics 2023; 24:437. [PMID: 37537522 PMCID: PMC10401811 DOI: 10.1186/s12864-023-09523-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Accepted: 07/19/2023] [Indexed: 08/05/2023] Open
Abstract
BACKGROUND CD-1 is an outbred mouse stock that is frequently used in toxicology, pharmacology, and fundamental biomedical research. Although inbred strains are typically better suited for such studies due to minimal genetic variability, outbred stocks confer practical advantages over inbred strains, such as improved breeding performance and low cost. Knowledge of the full genetic variability of CD-1 would make it more useful in toxicology, pharmacology, and fundamental biomedical research. RESULTS We performed deep genomic DNA sequencing of CD-1 mice and used the data to identify genome-wide SNPs, indels, and germline transposable elements relative to the mm10 reference genome. We used multiple genome-wide sequencing data types and previously published CD-1 SNPs to validate our called variants. We used the called variants to construct a strain-specific CD-1 reference genome, which we show can improve mappability and reduce experimental biases from genome-wide sequencing data derived from CD-1 mice. Based on previously published ChIP-seq and ATAC-seq data, we find evidence that genetic variation between CD-1 mice can lead to alterations in transcription factor binding. We also identified a number of variants in the coding region of genes which could have effects on translation of genes. CONCLUSIONS We have identified millions of previously unidentified CD-1 variants with the potential to confound studies involving CD-1. We used the identified variants to construct a CD-1-specific reference genome, which can improve accuracy and reduce bias when aligning genomics data derived from CD-1 mice.
Collapse
Affiliation(s)
- Yoon Hee Jung
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | - Hsiao-Lin V Wang
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | - Samir Ali
- Department of Basic Sciences, Loma Linda University School of Medicine, Loma Linda, CA, 92350, USA
| | - Victor G Corces
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | - Isaac Kremsky
- Department of Basic Sciences, Loma Linda University School of Medicine, Loma Linda, CA, 92350, USA.
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA.
| |
Collapse
|
130
|
Mitchell R, Sivell O. The genome sequence of the Locust Fly, Stomorhina lunata (Fabricius, 1805). Wellcome Open Res 2023; 8:330. [PMID: 38779051 PMCID: PMC11109694 DOI: 10.12688/wellcomeopenres.19532.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/01/2023] [Indexed: 05/25/2024] Open
Abstract
We present a genome assembly from an individual female Stomorhina lunata (the Locust Fly; Arthropoda; Insecta; Diptera; Rhiniidae). The genome sequence is 728.1 megabases in span. Most of the assembly is scaffolded into 6 chromosomal pseudomolecules, including the X sex chromosome. The mitochondrial genome has also been assembled and is 16.49 kilobases in length. Gene annotation of this assembly on Ensembl identified 18,358 protein coding genes.
Collapse
|
131
|
Chen Y, Sim A, Wan YK, Yeo K, Lee JJX, Ling MH, Love MI, Göke J. Context-aware transcript quantification from long-read RNA-seq data with Bambu. Nat Methods 2023; 20:1187-1195. [PMID: 37308696 PMCID: PMC10448944 DOI: 10.1038/s41592-023-01908-w] [Citation(s) in RCA: 40] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Accepted: 05/08/2023] [Indexed: 06/14/2023]
Abstract
Most approaches to transcript quantification rely on fixed reference annotations; however, the transcriptome is dynamic and depending on the context, such static annotations contain inactive isoforms for some genes, whereas they are incomplete for others. Here we present Bambu, a method that performs machine-learning-based transcript discovery to enable quantification specific to the context of interest using long-read RNA-sequencing. To identify novel transcripts, Bambu estimates the novel discovery rate, which replaces arbitrary per-sample thresholds with a single, interpretable, precision-calibrated parameter. Bambu retains the full-length and unique read counts, enabling accurate quantification in presence of inactive isoforms. Compared to existing methods for transcript discovery, Bambu achieves greater precision without sacrificing sensitivity. We show that context-aware annotations improve quantification for both novel and known transcripts. We apply Bambu to quantify isoforms from repetitive HERVH-LTR7 retrotransposons in human embryonic stem cells, demonstrating the ability for context-specific transcript expression analysis.
Collapse
Affiliation(s)
- Ying Chen
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
| | - Andre Sim
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
| | - Yuk Kei Wan
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Republic of Singapore
| | - Keith Yeo
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
| | - Joseph Jing Xian Lee
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
| | - Min Hao Ling
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
| | - Michael I Love
- Department of Biostatistics, University of North Carolina-Chapel Hill, Chapel Hill, NC, USA
- Department of Genetics, University of North Carolina-Chapel Hill, Chapel Hill, NC, USA
| | - Jonathan Göke
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore.
- Department of Statistics and Data Science, National University of Singapore, Singapore, Republic of Singapore.
| |
Collapse
|
132
|
Shuaib M, Adroub S, Mourier T, Mfarrej S, Zhang H, Esau L, Alsomali A, Alofi FS, Ahmad AN, Shamsan A, Khogeer A, Hashem AM, Almontashiri NAM, Hala S, Pain A. Impact of the SARS-CoV-2 nucleocapsid 203K/204R mutations on the inflammatory immune response in COVID-19 severity. Genome Med 2023; 15:54. [PMID: 37475040 PMCID: PMC10360309 DOI: 10.1186/s13073-023-01208-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Accepted: 07/04/2023] [Indexed: 07/22/2023] Open
Abstract
BACKGROUND The excessive inflammatory responses provoked by SARS-CoV-2 infection are critical factors affecting the severity and mortality of COVID-19. Previous work found that two adjacent co-occurring mutations R203K and G204R (KR) on the nucleocapsid (N) protein correlate with increased disease severity in COVID-19 patients. However, links with the host immune response remain unclear. METHODS Here, we grouped nasopharyngeal swab samples of COVID-19 patients into two cohorts based on the presence and absence of SARS-CoV-2 nucleocapsid KR mutations. We performed nasopharyngeal transcriptome analysis of age, gender, and ethnicity-matched COVID-19 patients infected with either SARS-CoV-2 with KR mutations in the N protein (KR patients n = 39) or with the wild-type N protein (RG patients n = 39) and compared to healthy controls (n = 34). The impact of KR mutation on immune response was further characterized experimentally by transcriptomic and proteomic profiling of virus-like-particle (VLP) incubated cells. RESULTS We observed markedly elevated expression of proinflammatory cytokines, chemokines, and interferon-stimulated (ISGs) genes in the KR patients compared to RG patients. Using nasopharyngeal transcriptome data, we found significantly higher levels of neutrophils and neutrophil-to-lymphocyte (NLR) ratio in KR patients than in the RG patients. Furthermore, transcriptomic and proteomic profiling of VLP incubated cells confirmed a similar hyper-inflammatory response mediated by the KR variant. CONCLUSIONS Our data demonstrate an unforeseen connection between nucleocapsid KR mutations and augmented inflammatory immune response in severe COVID-19 patients. These findings provide insights into how mutations in SARS-CoV-2 modulate host immune output and pathogenesis and may contribute to more efficient therapeutics and vaccine development.
Collapse
Affiliation(s)
- Muhammad Shuaib
- Pathogen Genomics Laboratory, Bioscience Program, Biological and Environmental Science and Engineering (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
| | - Sabir Adroub
- Pathogen Genomics Laboratory, Bioscience Program, Biological and Environmental Science and Engineering (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Tobias Mourier
- Pathogen Genomics Laboratory, Bioscience Program, Biological and Environmental Science and Engineering (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Sara Mfarrej
- Pathogen Genomics Laboratory, Bioscience Program, Biological and Environmental Science and Engineering (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Huoming Zhang
- Bioscience Core Laboratory, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Luke Esau
- Bioscience Core Laboratory, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Afrah Alsomali
- Infectious Diseases Department, King Abdullah Medical Complex, Jeddah, MOH, Saudi Arabia
| | - Fadwa S Alofi
- Infectious Diseases Department, King Fahad Hospital, Madinah, MOH, Saudi Arabia
| | - Adeel Nazir Ahmad
- KAUST Health - Fakeeh Care, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Abbas Shamsan
- Dr. Suliman Al-Habib Medical Group, Riyadh, Saudi Arabia
| | - Asim Khogeer
- Plan and Research Department, General Directorate of Health Affairs Makkah Region, Makkah, MOH, Saudi Arabia
| | - Anwar M Hashem
- Vaccines and Immunotherapy Unit, King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia
- Department of Clinical Microbiology and Immunology, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Naif A M Almontashiri
- College of Applied Medical Sciences, Taibah University, Madinah, Saudi Arabia
- Center for Genetics and Inherited Diseases, Taibah University, Almadinah Almunwarah, Saudi Arabia
| | - Sharif Hala
- Pathogen Genomics Laboratory, Bioscience Program, Biological and Environmental Science and Engineering (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- Infectious Disease Research Department, King Abdullah International Medical Research Centre, Ministry of National Guard Health Affairs, Jeddah, Saudi Arabia
- King Saud Bin Abdulaziz University for Health Sciences, Ministry of National Guard Health Affairs, Jeddah, Saudi Arabia
| | - Arnab Pain
- Pathogen Genomics Laboratory, Bioscience Program, Biological and Environmental Science and Engineering (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
- International Institute for Zoonosis Control, Global Institution for Collaborative Research and Education (GI-CoRE), Hokkaido University, Sapporo, 001-0020, Japan.
| |
Collapse
|
133
|
Campbell LI, Nwezeobi J, van Brunschot SL, Kaweesi T, Seal SE, Swamy RAR, Namuddu A, Maslen GL, Mugerwa H, Armean IM, Haggerty L, Martin FJ, Malka O, Santos-Garcia D, Juravel K, Morin S, Stephens ME, Muhindira PV, Kersey PJ, Maruthi MN, Omongo CA, Navas-Castillo J, Fiallo-Olivé E, Mohammed IU, Wang HL, Onyeka J, Alicai T, Colvin J. Comparative evolutionary analyses of eight whitefly Bemisia tabaci sensu lato genomes: cryptic species, agricultural pests and plant-virus vectors. BMC Genomics 2023; 24:408. [PMID: 37468834 DOI: 10.1186/s12864-023-09474-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 06/21/2023] [Indexed: 07/21/2023] Open
Abstract
BACKGROUND The group of > 40 cryptic whitefly species called Bemisia tabaci sensu lato are amongst the world's worst agricultural pests and plant-virus vectors. Outbreaks of B. tabaci s.l. and the associated plant-virus diseases continue to contribute to global food insecurity and social instability, particularly in sub-Saharan Africa and Asia. Published B. tabaci s.l. genomes have limited use for studying African cassava B. tabaci SSA1 species, due to the high genetic divergences between them. Genomic annotations presented here were performed using the 'Ensembl gene annotation system', to ensure that comparative analyses and conclusions reflect biological differences, as opposed to arising from different methodologies underpinning transcript model identification. RESULTS We present here six new B. tabaci s.l. genomes from Africa and Asia, and two re-annotated previously published genomes, to provide evolutionary insights into these globally distributed pests. Genome sizes ranged between 616-658 Mb and exhibited some of the highest coverage of transposable elements reported within Arthropoda. Many fewer total protein coding genes (PCG) were recovered compared to the previously published B. tabaci s.l. genomes and structural annotations generated via the uniform methodology strongly supported a repertoire of between 12.8-13.2 × 103 PCG. An integrative systematics approach incorporating phylogenomic analysis of nuclear and mitochondrial markers supported a monophyletic Aleyrodidae and the basal positioning of B. tabaci Uganda-1 to the sub-Saharan group of species. Reciprocal cross-mating data and the co-cladogenesis pattern of the primary obligate endosymbiont 'Candidatus Portiera aleyrodidarum' from 11 Bemisia genomes further supported the phylogenetic reconstruction to show that African cassava B. tabaci populations consist of just three biological species. We include comparative analyses of gene families related to detoxification, sugar metabolism, vector competency and evaluate the presence and function of horizontally transferred genes, essential for understanding the evolution and unique biology of constituent B. tabaci. s.l species. CONCLUSIONS These genomic resources have provided new and critical insights into the genetics underlying B. tabaci s.l. biology. They also provide a rich foundation for post-genomic research, including the selection of candidate gene-targets for innovative whitefly and virus-control strategies.
Collapse
Affiliation(s)
- Lahcen I Campbell
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK.
| | - Joachim Nwezeobi
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK.
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, UK.
| | - Sharon L van Brunschot
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
- CSIRO Health and Biosecurity, Dutton Park, QLD, Australia
- School of Biological Sciences, The University of Queensland, Brisbane, QLD, Australia
| | - Tadeo Kaweesi
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
- Rwebitaba Zonal Agricultural Research and Development Institute, Fort Portal, Uganda
| | - Susan E Seal
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
| | - Rekha A R Swamy
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
| | - Annet Namuddu
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
- National Crops Resources Research Institute, Kampala, Uganda
| | - Gareth L Maslen
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Imperial College London, South Kensington, London, UK
| | - Habibu Mugerwa
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
- Department of Entomology, University of Georgia, Griffin, GA, USA
| | - Irina M Armean
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Osnat Malka
- Department of Entomology, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Diego Santos-Garcia
- CNRS, Laboratory of Biometry and Evolutionary Biology UMR 5558, University of Lyon, Villeurbanne, France
- Center for Biology and Management of Populations, INRAe UMR1062, Montferrier-sur-Lez, France
| | - Ksenia Juravel
- Department of Entomology, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Shai Morin
- Department of Entomology, The Hebrew University of Jerusalem, Rehovot, Israel
| | | | - Paul Visendi Muhindira
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
- Institute for Molecular Bioscience, The University of Queensland, St Lucia, QLD, Australia
| | - Paul J Kersey
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Royal Botanic Gardens, Kew, London, UK
| | - M N Maruthi
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
| | | | - Jesús Navas-Castillo
- Instituto de Hortofruticultura Subtropical Y Mediterránea "La Mayora" (IHSM-UMA-CSIC), Consejo Superior de Investigaciones Científicas, Málaga, Algarrobo-Costa, Spain
| | - Elvira Fiallo-Olivé
- Instituto de Hortofruticultura Subtropical Y Mediterránea "La Mayora" (IHSM-UMA-CSIC), Consejo Superior de Investigaciones Científicas, Málaga, Algarrobo-Costa, Spain
| | | | - Hua-Ling Wang
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
- College of Forestry, Hebei Agricultural University, Baoding, Hebei, China
| | - Joseph Onyeka
- National Root Crops Research Institute (NRCRI), Umudike, Nigeria
| | - Titus Alicai
- National Crops Resources Research Institute, Kampala, Uganda
| | - John Colvin
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
| |
Collapse
|
134
|
Falk S, Smith MN. The genome sequence of a tachinid fly, Tachina lurida (Fabricius, 1781). Wellcome Open Res 2023; 8:288. [PMID: 37635753 PMCID: PMC10448141 DOI: 10.12688/wellcomeopenres.19635.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/27/2023] [Indexed: 08/29/2023] Open
Abstract
We present a genome assembly from an individual female Tachina lurida (a tachinid fly; Arthropoda; Insecta; Diptera; Tachinidae). The genome sequence is 899.2 megabases in span. Most of the assembly is scaffolded into 6 chromosomal pseudomolecules, including the X sex chromosome. The mitochondrial genome has also been assembled and is 17.3 kilobases in length. Gene annotation of this assembly on Ensembl identified 12,127 protein coding genes.
Collapse
Affiliation(s)
- Steven Falk
- Independent researcher, Kenilworth, England, UK
| | - Matthew N. Smith
- Independent researcher: co-organiser of the UK Tachinid Recording Scheme, Winnersh, England, UK
| | | | | | | | | | | | | |
Collapse
|
135
|
Magallanes-Alba ME, Baricalla A, Rego N, Brun A, Karasov WH, Caviedes-Vidal E. Rapid genome functional annotation pipeline anchored to the house sparrow ( Passer domesticus, Linnaeus 1758) genome reannotation. Biol Methods Protoc 2023; 8:bpad013. [PMID: 37521110 PMCID: PMC10373626 DOI: 10.1093/biomethods/bpad013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Revised: 06/26/2023] [Accepted: 07/05/2023] [Indexed: 08/01/2023] Open
Abstract
The house sparrow (Passer domesticus) is a valuable avian model for studying evolutionary genetics, development, neurobiology, physiology, behavior, and ecology, both in laboratory and field-based settings. The current annotation of the P. domesticus genome available at the Ensembl Rapid Release site is primarily focused on gene set building and lacks functional information. In this study, we present the first comprehensive functional reannotation of the P. domesticus genome using intestinal Illumina RNA sequencing (RNA-Seq) libraries. Our revised annotation provides an expanded view of the genome, encompassing 38592 transcripts compared to the current 23574 transcripts in Ensembl. We also predicted 14717 protein-coding genes, achieving 96.4% completeness for Passeriformes lineage BUSCOs. A substantial improvement in this reannotation is the accurate delineation of untranslated region (UTR) sequences. We identified 82.7% and 93.8% of the transcripts containing 5'- and 3'-UTRs, respectively. These UTR annotations are crucial for understanding post-transcriptional regulatory processes. Our findings underscore the advantages of incorporating additional specific RNA-Seq data into genome annotation, particularly when leveraging fast and efficient data processing capabilities. This functional reannotation enhances our understanding of the P. domesticus genome, providing valuable resources for future investigations in various research fields.
Collapse
Affiliation(s)
- Melisa Eliana Magallanes-Alba
- Instituto Multidisciplinario de Investigaciones Biológicas (IMIBIO-SL), Consejo Nacional de Investigaciones Científicas y Técnicas, San Luis, San Luis 5700, Argentina
- Department of Forest and Wildlife Ecology, University of Wisconsin–Madison, Madison, WI 53706, USA
| | - Agustín Baricalla
- Centro de Investigaciones y Transferencia del Noroeste de la Provincia de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas, Pergamino, Buenos Aires 2700, Argentina
| | - Natalia Rego
- Bioinformatics Unit, Institut Pasteur de Montevideo, Montevideo, Montevideo 11200, Uruguay
- Laboratorio de Genómica Evolutiva, Facultad de Ciencias, Universidad de la República, Montevideo, Montevideo 11400, Uruguay
| | - Antonio Brun
- Instituto Multidisciplinario de Investigaciones Biológicas (IMIBIO-SL), Consejo Nacional de Investigaciones Científicas y Técnicas, San Luis, San Luis 5700, Argentina
- Departamento Kinesiología y Fisiatría. Facultad de Ciencias de la Salud, Universidad Nacional de San Luis, San Luis , San Luis 5700, Argentina
- Departamento de Biología. Facultad de Química, Bioquímica y Farmacia, Universidad Nacional de San Luis, San Luis, San Luis 5700, Argentina
| | - William H Karasov
- Department of Forest and Wildlife Ecology, University of Wisconsin–Madison, Madison, WI 53706, USA
| | - Enrique Caviedes-Vidal
- Correspondence address: Instituto Multidisciplinario de Investigaciones Biológicas de San Luis. Almte. Brown 907, D5700 ANW, San Luis. Argentina.
| |
Collapse
|
136
|
Boyes D, Lewis OT. The genome sequence of the Dark Spectacle, Abrostola triplasia (Linnaeus, 1758). Wellcome Open Res 2023; 8:278. [PMID: 37766851 PMCID: PMC10521094 DOI: 10.12688/wellcomeopenres.19624.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/22/2023] [Indexed: 09/29/2023] Open
Abstract
We present a genome assembly from an individual male Abrostola triplasia (the Dark Spectacle; Arthropoda; Insecta; Lepidoptera; Noctuidae). The genome sequence is 362.7 megabases in span. Most of the assembly is scaffolded into 31 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 15.34 kilobases in length. Gene annotation of this assembly on Ensembl identified 11,532 protein coding genes.
Collapse
Affiliation(s)
- Douglas Boyes
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | | | | | | | | | | | | | | |
Collapse
|
137
|
Chang G, Jones S, Leelakumari S, Ashkani J, Culibrk L, O'Neill K, Tse K, Cheng D, Chuah E, McDonald H, Kirk H, Pandoh P, Pari S, Angelini V, Kyle C, Bertorelle G, Zhao Y, Mungall A, Moore R, Vilaça S, Jones S. The genome sequence of the Loggerhead sea turtle, Caretta caretta Linnaeus 1758. F1000Res 2023; 12:336. [PMID: 37455852 PMCID: PMC10338980 DOI: 10.12688/f1000research.131283.2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/21/2023] [Indexed: 07/18/2023] Open
Abstract
We present a genome assembly of Caretta caretta (the Loggerhead sea turtle; Chordata, Testudines, Cheloniidae), generated from genomic data from two unrelated females. The genome sequence is 2.13 gigabases in size. The assembly has a busco completion score of 96.1% and N50 of 130.95 Mb. The majority of the assembly is scaffolded into 28 chromosomal representations with a remaining 2% of the assembly being excluded from these.
Collapse
Affiliation(s)
- Glenn Chang
- Genome Science and Technology Graduate Program, University of British Columbia, Vancouver, British Columbia, V6T 1Z4, Canada
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Samantha Jones
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Sreeja Leelakumari
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Jahanshah Ashkani
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Luka Culibrk
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Kieran O'Neill
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Kane Tse
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Dean Cheng
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Eric Chuah
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Helen McDonald
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Heather Kirk
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Pawan Pandoh
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Sauro Pari
- Fondazione Cetacea Onlus, Riccione, RN, 47838, Italy
| | | | - Christopher Kyle
- Forensic Science Department, Trent University, Peterborough, Ontario, K9L 0G2, Canada
- Environmental and Life Sciences Graduate Program, Trent University, Peterborough, Ontario, K9L 0G2, Canada
| | - Giorgio Bertorelle
- Department of Life Sciences and Biotechnology, University of Ferrara, Ferrara, FE, 44121, Italy
| | - Yongjun Zhao
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Andrew Mungall
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Richard Moore
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
| | - Sibelle Vilaça
- Environmental and Life Sciences Graduate Program, Trent University, Peterborough, Ontario, K9L 0G2, Canada
| | - Steven Jones
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia, V5Z 4S6, Canada
- Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, V6T 1Z4, Canada
| |
Collapse
|
138
|
Falk S, Sivell O. The genome sequence of the narrow-cheeked clusterfly, Pollenia angustigena (Wainwright, 1940). Wellcome Open Res 2023; 8:270. [PMID: 39346929 PMCID: PMC11437293 DOI: 10.12688/wellcomeopenres.19576.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/14/2023] [Indexed: 10/01/2024] Open
Abstract
We present a genome assembly from an individual female Pollenia angustigena (the narrow-cheeked clusterfly; Arthropoda; Insecta; Diptera; Polleniidae). The genome sequence is 1370.5 megabases in span. Most of the assembly is scaffolded into 6 chromosomal pseudomolecules, including the X sex chromosome. The mitochondrial genome has also been assembled and is 21.01 kilobases in length. Gene annotation of this assembly on Ensembl identified 12,930 protein coding genes.
Collapse
Affiliation(s)
- Steven Falk
- Independent researcher, Kenilworth, England, UK
| | | | | | | | | | | | | | | |
Collapse
|
139
|
Crowley LM, Mulley JF. The genome sequence of the Tawny Mining Bee, Andrena fulva (Müller, 1766). Wellcome Open Res 2023; 8:258. [PMID: 39417201 PMCID: PMC11480741 DOI: 10.12688/wellcomeopenres.19510.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/01/2023] [Indexed: 10/19/2024] Open
Abstract
We present a genome assembly from an individual female Andrena fulva (the Tawny Mining Bee; Arthropoda; Insecta; Hymenoptera; Andrenidae). The genome sequence is 461.7 megabases in span. Most of the assembly is scaffolded into 7 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 14.7 kilobases in length. Gene annotation of this assembly on Ensembl identified 12,011 protein coding genes.
Collapse
Affiliation(s)
| | - John F. Mulley
- School of Natural Sciences, Bangor University, Bangor, Wales, UK
| | | | | | | | | | | | | |
Collapse
|
140
|
Rivera M, Zhang H, Pham J, Isquith J, Zhou QJ, Sasik R, Mark A, Ma W, Holm F, Fisch KM, Kuo DJ, Jamieson C, Jiang Q. Malignant A-to-I RNA editing by ADAR1 drives T-cell acute lymphoblastic leukemia relapse via attenuating dsRNA sensing. RESEARCH SQUARE 2023:rs.3.rs-2444524. [PMID: 37398458 PMCID: PMC10312963 DOI: 10.21203/rs.3.rs-2444524/v2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Leukemia initiating cells (LICs) are regarded as the origin of leukemia relapse and therapeutic resistance. Identifying direct stemness determinants that fuel LIC self-renewal is critical for developing targeted approaches to eliminate LICs and prevent relapse. Here, we show that the RNA editing enzyme ADAR1 is a crucial stemness factor that promotes LIC self-renewal by attenuating aberrant double-stranded RNA (dsRNA) sensing. Elevated adenosine-to-inosine (A-to-I) editing is a common attribute of relapsed T-ALL regardless of molecular subtypes. Consequently, knockdown of ADAR1 severely inhibits LIC self-renewal capacity and prolongs survival in T-ALL PDX models. Mechanistically, ADAR1 directs hyper-editing of immunogenic dsRNA and retains unedited nuclear dsRNA to avoid detection by the innate immune sensor MDA5. Moreover, we uncovered that the cell intrinsic level of MDA5 dictates the dependency on ADAR1-MDA5 axis in T-ALL. Collectively, our results show that ADAR1 functions as a self-renewal factor that limits the sensing of endogenous dsRNA. Thus, targeting ADAR1 presents a safe and effective therapeutic strategy for eliminating T-ALL LICs.
Collapse
Affiliation(s)
- Maria Rivera
- Division of Regenerative Medicine, Department of Medicine, University of California, San Diego, La Jolla, CA, USA
- Moores Cancer Center, La Jolla, CA 92037, USA
| | - Haoran Zhang
- Division of Regenerative Medicine, Department of Medicine, University of California, San Diego, La Jolla, CA, USA
- Moores Cancer Center, La Jolla, CA 92037, USA
| | - Jessica Pham
- Division of Regenerative Medicine, Department of Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Jane Isquith
- Division of Regenerative Medicine, Department of Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Qingchen Jenny Zhou
- Division of Regenerative Medicine, Department of Medicine, University of California, San Diego, La Jolla, CA, USA
- Moores Cancer Center, La Jolla, CA 92037, USA
| | - Roman Sasik
- Center for Computational Biology & Bioinformatics (CCBB), University of California, San Diego, La Jolla, 92093-0681
| | - Adam Mark
- Center for Computational Biology & Bioinformatics (CCBB), University of California, San Diego, La Jolla, 92093-0681
| | - Wenxue Ma
- Division of Regenerative Medicine, Department of Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Frida Holm
- Department of Women’s and Children’s Health, Division of Pediatric Oncology and Surgery, Karolinska Institutet, Sweden
| | - Kathleen M Fisch
- Center for Computational Biology & Bioinformatics (CCBB), University of California, San Diego, La Jolla, 92093-0681
- Department of Obstetrics, Gynecology & Reproductive Sciences, University of California, San Diego, La Jolla, CA
| | - Dennis John Kuo
- Moores Cancer Center, La Jolla, CA 92037, USA
- Division of Pediatric Hematology-Oncology, Rady Children’s Hospital San Diego, University of California, San Diego, CA
| | - Catriona Jamieson
- Division of Regenerative Medicine, Department of Medicine, University of California, San Diego, La Jolla, CA, USA
- Moores Cancer Center, La Jolla, CA 92037, USA
| | - Qingfei Jiang
- Division of Regenerative Medicine, Department of Medicine, University of California, San Diego, La Jolla, CA, USA
- Moores Cancer Center, La Jolla, CA 92037, USA
| |
Collapse
|
141
|
Williams AS, Wilk EJ, Fisher JL, Lasseigne BN. Evaluating cancer cell line and patient-derived xenograft recapitulation of tumor and non-diseased tissue gene expression profiles in silico. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.11.536431. [PMID: 37090499 PMCID: PMC10120639 DOI: 10.1101/2023.04.11.536431] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
Preclinical models like cancer cell lines and patient-derived xenografts (PDXs) are vital for studying disease mechanisms and evaluating treatment options. It is essential that they accurately recapitulate the disease state of interest to generate results that will translate in the clinic. Prior studies have demonstrated that preclinical models do not recapitulate all biological aspects of human tissues, particularly with respect to the tissue of origin gene expression signatures. Therefore, it is critical to assess how well preclinical model gene expression profiles correlate with human cancer tissues to inform preclinical model selection and data analysis decisions. Here we evaluated how well preclinical models recapitulate human cancer and non-diseased tissue gene expression patterns in silico with respect to the full gene expression profile as well as subsetting by the most variable genes, genes significantly correlated with tumor purity, and tissue-specific genes by using publicly available gene expression profiles across multiple sources. We found that using the full gene set improves correlations between preclinical model and tissue global gene expression profiles, confirmed that GBM PDX global gene expression correlation to GBM tumor global gene expression outperforms GBM cell line to GBM tumor global gene expression correlations, and demonstrated that preclinical models in our study often failed to reproduce tissue-specific expression. While including additional genes for global gene expression comparison between cell lines and tissues decreases the overall correlation, it improves the relative rank between a cell line and its tissue of origin compared to other tissues. Our findings underscore the importance of using the full gene expression set measured when comparing preclinical models and tissues and confirm that tissue-specific patterns are better preserved in GBM PDX models than in GBM cell lines. Future studies can build on these findings to determine the specific pathways and gene sets recapitulated by particular preclinical models to facilitate model selection for a given study design or goal.
Collapse
Affiliation(s)
- Avery S. Williams
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Elizabeth J. Wilk
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Jennifer L. Fisher
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Brittany N. Lasseigne
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, Alabama, USA
| |
Collapse
|
142
|
Sivell O, Sivell D. The genome sequence of a carabid beetle, Nebria salina (Fairmaire & Laboulbène, 1854). Wellcome Open Res 2023; 8:247. [PMID: 38196772 PMCID: PMC10774627 DOI: 10.12688/wellcomeopenres.19372.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/11/2023] [Indexed: 01/11/2024] Open
Abstract
We present a genome assembly from an individual female Nebria salina (a carabid beetle; Arthropoda; Insecta; Coleoptera; Carabidae). The genome sequence is 256.7 megabases in span. Most of the assembly is scaffolded into 21 chromosomal pseudomolecules, including the assembled X sex chromosome. The mitochondrial genome has also been assembled and is 24.7 kilobases in length. Gene annotation of this assembly on Ensembl identified 10,671 protein coding genes.
Collapse
|
143
|
Porquet F, Weidong L, Jehasse K, Gazon H, Kondili M, Blacher S, Massotte L, Di Valentin E, Furling D, Gillet NA, Klein AF, Seutin V, Willems L. Specific DMPK-promoter targeting by CRISPRi reverses myotonic dystrophy type 1-associated defects in patient muscle cells. MOLECULAR THERAPY. NUCLEIC ACIDS 2023; 32:857-871. [PMID: 37273786 PMCID: PMC10238591 DOI: 10.1016/j.omtn.2023.05.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Accepted: 05/10/2023] [Indexed: 06/06/2023]
Abstract
Myotonic dystrophy type 1 (DM1) is a neuromuscular disease that originates from an expansion of CTG microsatellites in the 3' untranslated region of the DMPK gene, thus leading to the expression of transcripts containing expanded CUG repeats (CUGexp). The pathophysiology is explained by a toxic RNA gain of function where CUGexp RNAs form nuclear aggregates that sequester and alter the function of MBNL splicing factors, triggering splicing misregulation linked to the DM1 symptoms. There is currently no cure for DM1, and most therapeutic strategies aim at eliminating CUGexp-DMPK transcripts. Here, we investigate a DMPK-promoter silencing strategy using CRISPR interference as a new alternative approach. Different sgRNAs targeting the DMPK promoter are evaluated in DM1 patient muscle cells. The most effective guides allowed us to reduce the level of DMPK transcripts and CUGexp-RNA aggregates up to 80%. The CUGexp-DMPK repression corrects the overall transcriptome, including spliceopathy, and reverses a physiological parameter in DM1 muscle cells. Its action is specific and restricted to the DMPK gene, as confirmed by genome-wide expression analysis. Altogether, our findings highlight DMPK-promoter silencing by CRISPRi as a promising therapeutic approach for DM1.
Collapse
Affiliation(s)
- Florent Porquet
- Laboratory of Molecular and Cellular Epigenetics, GIGA-Cancer, ULiège, 4000 Liège, Belgium
- Laboratory of Neurophysiology, GIGA-Neurosciences, ULiège, 4000 Liège, Belgium
- Sorbonne Université, INSERM, Institut de Myologie, Centre de Recherche en Myologie, 75013 Paris, France
| | - Lin Weidong
- Laboratory of Molecular and Cellular Epigenetics, GIGA-Cancer, ULiège, 4000 Liège, Belgium
| | - Kévin Jehasse
- Laboratory of Neurophysiology, GIGA-Neurosciences, ULiège, 4000 Liège, Belgium
| | - Hélène Gazon
- Laboratory of Molecular and Cellular Epigenetics, GIGA-Cancer, ULiège, 4000 Liège, Belgium
| | - Maria Kondili
- Sorbonne Université, INSERM, Institut de Myologie, Centre de Recherche en Myologie, 75013 Paris, France
| | - Silvia Blacher
- Laboratory of Biology of Tumor and Development, GIGA-Cancer, ULiège, 4000 Liège, Belgium
| | - Laurent Massotte
- Laboratory of Neurophysiology, GIGA-Neurosciences, ULiège, 4000 Liège, Belgium
| | | | - Denis Furling
- Sorbonne Université, INSERM, Institut de Myologie, Centre de Recherche en Myologie, 75013 Paris, France
| | - Nicolas Albert Gillet
- Namur Research Institute for Life Sciences (NARILIS), Integrated Veterinary Research Unit (URVI), University of Namur, 5000 Namur, Belgium
| | - Arnaud François Klein
- Sorbonne Université, INSERM, Institut de Myologie, Centre de Recherche en Myologie, 75013 Paris, France
| | - Vincent Seutin
- Laboratory of Neurophysiology, GIGA-Neurosciences, ULiège, 4000 Liège, Belgium
| | - Luc Willems
- Laboratory of Molecular and Cellular Epigenetics, GIGA-Cancer, ULiège, 4000 Liège, Belgium
| |
Collapse
|
144
|
Bista I, Wood JMD, Desvignes T, McCarthy SA, Matschiner M, Ning Z, Tracey A, Torrance J, Sims Y, Chow W, Smith M, Oliver K, Haggerty L, Salzburger W, Postlethwait JH, Howe K, Clark MS, William Detrich H, Christina Cheng CH, Miska EA, Durbin R. Genomics of cold adaptations in the Antarctic notothenioid fish radiation. Nat Commun 2023; 14:3412. [PMID: 37296119 PMCID: PMC10256766 DOI: 10.1038/s41467-023-38567-6] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 05/05/2023] [Indexed: 06/12/2023] Open
Abstract
Numerous novel adaptations characterise the radiation of notothenioids, the dominant fish group in the freezing seas of the Southern Ocean. To improve understanding of the evolution of this iconic fish group, here we generate and analyse new genome assemblies for 24 species covering all major subgroups of the radiation, including five long-read assemblies. We present a new estimate for the onset of the radiation at 10.7 million years ago, based on a time-calibrated phylogeny derived from genome-wide sequence data. We identify a two-fold variation in genome size, driven by expansion of multiple transposable element families, and use the long-read data to reconstruct two evolutionarily important, highly repetitive gene family loci. First, we present the most complete reconstruction to date of the antifreeze glycoprotein gene family, whose emergence enabled survival in sub-zero temperatures, showing the expansion of the antifreeze gene locus from the ancestral to the derived state. Second, we trace the loss of haemoglobin genes in icefishes, the only vertebrates lacking functional haemoglobins, through complete reconstruction of the two haemoglobin gene clusters across notothenioid families. Both the haemoglobin and antifreeze genomic loci are characterised by multiple transposon expansions that may have driven the evolutionary history of these genes.
Collapse
Affiliation(s)
- Iliana Bista
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK.
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK.
- Wellcome/CRUK Gurdon Institute, University of Cambridge, Tennis Court Rd, Cambridge, CB2 1QN, UK.
- Naturalis Biodiversity Center, Leiden, 2333 CR, the Netherlands.
| | - Jonathan M D Wood
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Thomas Desvignes
- University of Oregon, Institute of Neuroscience, 1254 University of Oregon, 13th Avenue, Eugene, OR, 97403, USA
| | - Shane A McCarthy
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK
| | - Michael Matschiner
- University of Oslo, Natural History Museum, University of Oslo, Sars' gate 1, 0562, Oslo, Norway
- University of Zurich, Department of Palaeontology and Museum, University of Zurich, Karl-Schmid-Strasse 4, 8006, Zurich, Switzerland
| | - Zemin Ning
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Alan Tracey
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - James Torrance
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Ying Sims
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - William Chow
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Michelle Smith
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Karen Oliver
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Walter Salzburger
- University of Basel, Zoological Institute, Department of Environmental Sciences, Vesalgasse 1, 4051, Basel, Switzerland
| | - John H Postlethwait
- University of Oregon, Institute of Neuroscience, 1254 University of Oregon, 13th Avenue, Eugene, OR, 97403, USA
| | - Kerstin Howe
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Melody S Clark
- British Antarctic Survey, High Cross, Madingley Road, Cambridge, CB3 0ET, UK
| | - H William Detrich
- Northeastern University, Department of Marine and Environmental Sciences, Marine Science Centre, 430 Nahant Rd., Nahant, MA, 01908, USA
| | - C-H Christina Cheng
- Department of Evolution, Ecology, and Behaviour, University of Illinois, Urbana-Champaign, IL, 61801, USA
| | - Eric A Miska
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
- Wellcome/CRUK Gurdon Institute, University of Cambridge, Tennis Court Rd, Cambridge, CB2 1QN, UK
| | - Richard Durbin
- Wellcome Sanger Institute, Tree of Life, Wellcome Genome Campus, Hinxton, CB10 1SA, UK.
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK.
| |
Collapse
|
145
|
Song Y, Burns GW, Joshi NR, Arora R, Kim JJ, Fazleabas AT. Spheroids as a model for endometriotic lesions. JCI Insight 2023; 8:e160815. [PMID: 37104033 PMCID: PMC10393231 DOI: 10.1172/jci.insight.160815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 04/20/2023] [Indexed: 04/28/2023] Open
Abstract
The development and progression of endometriotic lesions are poorly understood, but immune cell dysfunction and inflammation are closely associated with the pathophysiology of endometriosis. There is a need for 3D in vitro models to permit the study of interactions between cell types and the microenvironment. To address this, we developed endometriotic spheroids (ES) to explore the role of epithelial-stromal interactions and model peritoneal invasion associated with lesion development. Using a nonadherent microwell culture system, spheroids were generated with immortalized endometriotic epithelial cells (12Z) combined with endometriotic stromal (iEc-ESC) or uterine stromal (iHUF) cell lines. Transcriptomic analysis found 4,522 differentially expressed genes in ES compared with spheroids containing uterine stromal cells. The top increased gene sets were inflammation-related pathways, and an overlap with baboon endometriotic lesions was highly significant. Finally, to mimic invasion of endometrial tissue into the peritoneum, a model was developed with human peritoneal mesothelial cells in an extracellular matrix. Invasion was increased in the presence of estradiol or pro-inflammatory macrophages and suppressed by a progestin. Taken together, our results strongly support the concept that ES are an appropriate model for dissecting mechanisms that contribute to endometriotic lesion development.
Collapse
Affiliation(s)
- Yong Song
- Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University, Grand Rapids, Michigan, USA
| | - Gregory W. Burns
- Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University, Grand Rapids, Michigan, USA
| | - Niraj R. Joshi
- Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University, Grand Rapids, Michigan, USA
| | - Ripla Arora
- Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University, Grand Rapids, Michigan, USA
| | - J. Julie Kim
- Department of Obstetrics and Gynecology, Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
| | - Asgerally T. Fazleabas
- Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University, Grand Rapids, Michigan, USA
| |
Collapse
|
146
|
Falk S, Mulley JF. The genome sequence of the variegated flesh fly, Sarcophaga variegata (Scopoli, 1763). Wellcome Open Res 2023; 8:234. [PMID: 38510269 PMCID: PMC10951566 DOI: 10.12688/wellcomeopenres.19483.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/12/2023] [Indexed: 03/22/2024] Open
Abstract
We present a genome assembly from an individual male Sarcophaga variegata (the variegated flesh fly; Arthropoda; Insecta; Diptera; Sarcophagidae). The genome sequence is 718.5 megabases in span. Most of the assembly is scaffolded into 7 chromosomal pseudomolecules including the X and Y sex chromosomes. The mitochondrial genome has also been assembled and is 18.7 kilobases in length. Gene annotation of this assembly on Ensembl identified 16,660 protein coding genes.
Collapse
Affiliation(s)
- Steven Falk
- Independent researcher, Kenilworth, England, UK
| | | | | | | | | | | | | | | |
Collapse
|
147
|
Crowley LM, Hammond J. The genome sequence of the Lichen Button, Acleris literana (Linnaeus, 1758). Wellcome Open Res 2023; 8:232. [PMID: 38249958 PMCID: PMC10799227 DOI: 10.12688/wellcomeopenres.19481.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/12/2023] [Indexed: 01/23/2024] Open
Abstract
We present a genome assembly from an individual male Acleris literana (the Lichen Button; Arthropoda; Insecta; Lepidoptera; Tortricidae). The genome sequence is 674.9 megabases in span. Most of the assembly is scaffolded into 30 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 16.4 kilobases in length. Gene annotation of this assembly on Ensembl identified 12,577 protein coding genes.
Collapse
|
148
|
Aryankalayil MJ, Bylicky MA, Martello S, Chopra S, Sproull M, May JM, Shankardass A, MacMillan L, Vanpouille-Box C, Eke I, Scott KMK, Dalo J, Coleman CN. Microarray analysis of hub genes, non-coding RNAs and pathways in lung after whole body irradiation in a mouse model. Int J Radiat Biol 2023; 99:1702-1715. [PMID: 37212632 PMCID: PMC10615684 DOI: 10.1080/09553002.2023.2214205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 05/05/2023] [Indexed: 05/23/2023]
Abstract
PURPOSE Previous research has highlighted the impact of radiation damage, with cancer patients developing acute disorders including radiation induced pneumonitis or chronic disorders including pulmonary fibrosis months after radiation therapy ends. We sought to discover biomarkers that predict these injuries and develop treatments that mitigate this damage and improve quality of life. MATERIALS AND METHODS Six- to eight-week-old female C57BL/6 mice received 1, 2, 4, 8, 12 Gy or sham whole body irradiation. Animals were euthanized 48 h post exposure and lungs removed, snap frozen and underwent RNA isolation. Microarray analysis was performed to determine dysregulation of messenger RNA (mRNA), microRNA (miRNA), and long non-coding RNA (lncRNA) after radiation injury. RESULTS We observed sustained dysregulation of specific RNA markers including: mRNAs, lncRNAs, and miRNAs across all doses. We also identified significantly upregulated genes that can indicate high dose exposure, including Cpt1c, Pdk4, Gdf15, and Eda2r, which are markers of senescence and fibrosis. Only three miRNAs were significantly dysregulated across all radiation doses: miRNA-142-3p and miRNA-142-5p were downregulated and miRNA-34a-5p was upregulated. IPA analysis predicted inhibition of several molecular pathways with increasing doses of radiation, including: T cell development, Quantity of leukocytes, Quantity of lymphocytes, and Cell viability. CONCLUSIONS These RNA biomarkers might be highly relevant in the development of treatments and in predicting normal tissue injury in patients undergoing radiation treatment. We are conducting further experiments in our laboratory, which includes a human lung-on-a-chip model, to develop a decision tree model using RNA biomarkers.
Collapse
Affiliation(s)
- Molykutty J Aryankalayil
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Michelle A Bylicky
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Shannon Martello
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sunita Chopra
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Mary Sproull
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Jared M May
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Aman Shankardass
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Iris Eke
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, CA, USA
| | - Kevin M K Scott
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Juan Dalo
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - C Norman Coleman
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Radiation Research Program, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| |
Collapse
|
149
|
Holmes A. The genome sequence of the surf clam, Spisula solida (Linnaeus, 1758). Wellcome Open Res 2023; 8:227. [PMID: 38764973 PMCID: PMC11101916 DOI: 10.12688/wellcomeopenres.19486.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/12/2023] [Indexed: 05/21/2024] Open
Abstract
We present a genome assembly from an individual Spisula solida (the surf clam; Mollusca; Bivalvia; Venerida; Mactridae). The genome sequence is 932.1 megabases in span. Most of the assembly is scaffolded into 19 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 19.3 kilobases in length. Gene annotation of this assembly on Ensembl identified 13,833 protein coding genes.
Collapse
|
150
|
Zhu W, Liao X. LCAT: an isoform-sensitive error correction for transcriptome sequencing long reads. Front Genet 2023; 14:1166975. [PMID: 37292144 PMCID: PMC10245045 DOI: 10.3389/fgene.2023.1166975] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Accepted: 05/04/2023] [Indexed: 06/10/2023] Open
Abstract
As the carrier of genetic information, RNA carries the information from genes to proteins. Transcriptome sequencing technology is an important way to obtain transcriptome sequences, and it is also the basis for transcriptome research. With the development of third-generation sequencing, long reads can cover full-length transcripts and reflect the composition of different isoforms. However, the high error rate of third-generation sequencing affects the accuracy of long reads and downstream analysis. The current error correction methods seldom consider the existence of different isoforms in RNA, which makes the diversity of isoforms a serious loss. Here, we introduce LCAT (long-read error correction algorithm for transcriptome sequencing data), a wrapper algorithm of MECAT, to reduce the loss of isoform diversity while keeping MECAT's error correction performance. The experimental results show that LCAT can not only improve the quality of transcriptome sequencing long reads but also retain the diversity of isoforms.
Collapse
Affiliation(s)
- Wufei Zhu
- Department of Endocrinology, Yichang Central People’s Hospital, The First College of Clinical Medical Science, China Three Gorges University, Yichang, China
| | - Xingyu Liao
- Computer, Electrical and Mathematical Sciences, and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| |
Collapse
|