1
|
Galli M, Chen Z, Ghandour T, Chaudhry A, Gregory J, Feng F, Li M, Schleif N, Zhang X, Dong Y, Song G, Walley JW, Chuck G, Whipple C, Kaeppler HF, Huang SSC, Gallavotti A. Transcription factor binding divergence drives transcriptional and phenotypic variation in maize. NATURE PLANTS 2025:10.1038/s41477-025-02007-8. [PMID: 40506505 DOI: 10.1038/s41477-025-02007-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/11/2024] [Accepted: 04/22/2025] [Indexed: 06/18/2025]
Abstract
Regulatory elements are essential components of plant genomes that have shaped the domestication and improvement of modern crops. However, their identity, function and diversity remain poorly characterized, limiting our ability to harness their full power for agricultural advances using induced or natural variation. Here we mapped transcription factor (TF) binding for 200 TFs from 30 families in two distinct maize inbred lines historically used in maize breeding. TF binding comparison revealed widespread differences between inbreds, driven largely by structural variation, that correlated with gene expression changes and explained complex quantitative trait loci such as Vgt1, an important determinant of flowering time, and DICE, an herbivore resistance enhancer. CRISPR-Cas9 editing of TF binding regions validated the function and structure of regulatory regions at various loci controlling plant architecture and biotic resistance. Our maize TF binding catalogue identifies functional regulatory regions and enables collective and comparative analysis, highlighting its value for agricultural improvement.
Collapse
Affiliation(s)
- Mary Galli
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, USA
| | - Zongliang Chen
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, USA
| | - Tara Ghandour
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA
| | - Amina Chaudhry
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, USA
| | - Jason Gregory
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, USA
| | - Fan Feng
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, USA
| | - Miaomiao Li
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | | | - Xuan Zhang
- Department of Genetics, University of Georgia, Athens, GA, USA
| | - Yinxin Dong
- Department of Genetics, University of Georgia, Athens, GA, USA
| | - Gaoyuan Song
- Department of Plant Pathology, Entomology and Microbiology, Iowa State University, Ames, IA, USA
| | - Justin W Walley
- Department of Plant Pathology, Entomology and Microbiology, Iowa State University, Ames, IA, USA
| | - George Chuck
- University of California, Berkeley, Berkeley, CA, USA
| | - Clinton Whipple
- Department of Biology, Brigham Young University, Provo, UT, USA
| | - Heidi F Kaeppler
- Department of Agronomy, University of Wisconsin, Madison, WI, USA
- Wisconsin Crop Innovation Center, University of Wisconsin, Middleton, WI, USA
| | - Shao-Shan Carol Huang
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA.
| | - Andrea Gallavotti
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, USA.
- Department of Plant Biology, Rutgers University, New Brunswick, NJ, USA.
| |
Collapse
|
2
|
Lucotte EA, Jay P, Rougemont Q, Boyer L, Cornille A, Snirc A, Labat A, Chahine E, Duhamel M, Namias A, Gendelman J, Ma WJ, Hayes RK, Baruri S, Ham JP, Perlin MH, Hood ME, Rodríguez de la Vega RC, Giraud T. Repeated loss of function at HD mating-type genes and of recombination in anther-smut fungi. Nat Commun 2025; 16:4962. [PMID: 40436846 PMCID: PMC12119880 DOI: 10.1038/s41467-025-60222-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Accepted: 05/14/2025] [Indexed: 06/01/2025] Open
Abstract
Basidiomycete fungi typically have two mating-type loci controlling mating compatibility, HD and PR, residing on different chromosomes. Loss-of-function in mating compatibility has been reported at the PR genes in a few heterothallic basidiomycetes, but not for the HD genes. In Microbotryum anther-smut fungi, there have been repeated linkage events between the HD and PR loci through chromosome fusions, leading to non-recombining regions. Here, we found that two sister Microbotryum species parasitizing Dianthus plants, M. superbum and M. shykoffianum, as well as the distantly related M. scorzonerae, have their HD and PR loci on different chromosomes, but with the PR chromosome fused with a part of the ancestral HD chromosome. In addition, recombination suppression has extended stepwise, generating evolutionary strata. In all three species, the HD genes lost their function in mating compatibility, natural diploid strains being often homozygous at the HD locus. Strains could be homozygous for a disrupted HD2 gene, that was hardly expressed during mating. Mating tests confirmed that a single genetic factor controlled mating compatibility and that haploid strains with identical HD alleles could mate and produce hyphae. This study shows that a unifactorial mating-type determinism can evolve, repeatedly, from a bifactorial system, by different mechanisms.
Collapse
Affiliation(s)
- Elise A Lucotte
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Société Evolution, Gif sur Yvette, France.
| | - Paul Jay
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Société Evolution, Gif sur Yvette, France
- Center for GeoGenetics, University of Copenhagen, Copenhagen, Denmark
| | - Quentin Rougemont
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Société Evolution, Gif sur Yvette, France
| | - Loreleï Boyer
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Société Evolution, Gif sur Yvette, France
| | - Amandine Cornille
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE-Le Moulon, Gif-sur-Yvette, France
| | - Alodie Snirc
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Société Evolution, Gif sur Yvette, France
| | - Amandine Labat
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Société Evolution, Gif sur Yvette, France
| | - Elizabeth Chahine
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Société Evolution, Gif sur Yvette, France
| | - Marine Duhamel
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Société Evolution, Gif sur Yvette, France
| | - Alice Namias
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Société Evolution, Gif sur Yvette, France
| | | | - Wen-Juan Ma
- Department of Biology, Amherst College, Amherst, MA, USA
- Department of Biology, Research group of Ecology, Evolution and Genetics, Vrije Universiteit Brussel, Brussels, Belgium
| | - Roxanne K Hayes
- Department of Biology, Program on Disease Evolution, University of Louisville, Louisville, KY, USA
| | - Shikhi Baruri
- Department of Biology, Program on Disease Evolution, University of Louisville, Louisville, KY, USA
| | - Joseph P Ham
- Department of Biology, Program on Disease Evolution, University of Louisville, Louisville, KY, USA
| | - Michael H Perlin
- Department of Biology, Program on Disease Evolution, University of Louisville, Louisville, KY, USA
| | - Michael E Hood
- Department of Biology, Amherst College, Amherst, MA, USA
| | | | - Tatiana Giraud
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Société Evolution, Gif sur Yvette, France.
| |
Collapse
|
3
|
Broad GR, Januszczak I, Fletcher C, Natural History Museum Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the Heath Bumblebee, Bombus jonellus (Kirby, 1802). Wellcome Open Res 2025; 10:269. [PMID: 40520151 PMCID: PMC12166357 DOI: 10.12688/wellcomeopenres.24256.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/13/2025] [Indexed: 06/18/2025] Open
Abstract
We present a genome assembly from a female specimen of Bombus jonellus (Heath Bumblebee; Arthropoda; Insecta; Hymenoptera; Apidae). The genome sequence has a total length of 357.90 megabases. Most of the assembly (78.06%) is scaffolded into 18 chromosomal pseudomolecules. The mitochondrial genome has also been assembled, with a length of 24.83 kilobases.
Collapse
|
4
|
Maldonado M, Pita L, Hentschel U, Erpenbeck D, Oatley G, Sinclair E, Aunin E, Gettle N, Santos C, Paulini M, Niu H, McKenna V, O’Brien R, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory Team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics Team, EBI Aquatic Symbiosis Genomics Data Portal Team, Aquatic Symbiosis Genomics Project Leadership. The chromosomal genome sequence of the sponge Crambe crambe (Schmidt, 1862) and its associated microbial metagenome sequences. Wellcome Open Res 2025; 10:275. [PMID: 40520149 PMCID: PMC12166358 DOI: 10.12688/wellcomeopenres.24154.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/25/2025] [Indexed: 06/18/2025] Open
Abstract
We present a genome assembly from an individual Crambe crambe (Porifera; Demospongiae; Poecilosclerida; Crambeidae). The host genome sequence is 143.20 megabases in span. Most of the assembly is scaffolded into 18 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 19.53 kilobases in length. Several symbiotic prokaryotic genomes were assembled as MAGs, including two relevant sponge symbionts, the Candidatus Beroebacter blanensis/ AqS2 clade (Tethybacterales, Gammaproteobacteria) of LMA sponges, and the widely distributed archaeal Nitrosopumilus sp. clade.
Collapse
Affiliation(s)
- Manuel Maldonado
- Center for Advanced Studies of Blanes (CEAB-CSIC), Girona, Spain
| | - Lucia Pita
- Institute of Marine Sciences – CSIC, Barcelona, Spain
- Integrated Marine Ecology group, Institute of Marine Research IIM-CSIC, Vigo, Spain
| | - Ute Hentschel
- GEOMAR Helmholtz Centre for Ocean Research Kiel, Kiel, Germany
| | - Dirk Erpenbeck
- Ludwig-Maximilians University of Munich, Munich, Germany
| | - Graeme Oatley
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | | | - Eerik Aunin
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - Noah Gettle
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - Camilla Santos
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - Michael Paulini
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - Haoyu Niu
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | | | - Rebecca O’Brien
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory Team
- Center for Advanced Studies of Blanes (CEAB-CSIC), Girona, Spain
- Institute of Marine Sciences – CSIC, Barcelona, Spain
- Integrated Marine Ecology group, Institute of Marine Research IIM-CSIC, Vigo, Spain
- GEOMAR Helmholtz Centre for Ocean Research Kiel, Kiel, Germany
- Ludwig-Maximilians University of Munich, Munich, Germany
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - Wellcome Sanger Institute Scientific Operations: Sequencing Operations
- Center for Advanced Studies of Blanes (CEAB-CSIC), Girona, Spain
- Institute of Marine Sciences – CSIC, Barcelona, Spain
- Integrated Marine Ecology group, Institute of Marine Research IIM-CSIC, Vigo, Spain
- GEOMAR Helmholtz Centre for Ocean Research Kiel, Kiel, Germany
- Ludwig-Maximilians University of Munich, Munich, Germany
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - Wellcome Sanger Institute Tree of Life Core Informatics Team
- Center for Advanced Studies of Blanes (CEAB-CSIC), Girona, Spain
- Institute of Marine Sciences – CSIC, Barcelona, Spain
- Integrated Marine Ecology group, Institute of Marine Research IIM-CSIC, Vigo, Spain
- GEOMAR Helmholtz Centre for Ocean Research Kiel, Kiel, Germany
- Ludwig-Maximilians University of Munich, Munich, Germany
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - EBI Aquatic Symbiosis Genomics Data Portal Team
- Center for Advanced Studies of Blanes (CEAB-CSIC), Girona, Spain
- Institute of Marine Sciences – CSIC, Barcelona, Spain
- Integrated Marine Ecology group, Institute of Marine Research IIM-CSIC, Vigo, Spain
- GEOMAR Helmholtz Centre for Ocean Research Kiel, Kiel, Germany
- Ludwig-Maximilians University of Munich, Munich, Germany
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | | |
Collapse
|
5
|
Merkulov P, Latypova A, Tiurin K, Serganova M, Kirov I. DNA Methylation and Alternative Splicing Safeguard Genome and Transcriptome After a Retrotransposition Burst in Arabidopsis thaliana. Int J Mol Sci 2025; 26:4816. [PMID: 40429956 PMCID: PMC12112155 DOI: 10.3390/ijms26104816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2025] [Revised: 05/10/2025] [Accepted: 05/14/2025] [Indexed: 05/29/2025] Open
Abstract
Transposable elements (TEs) are major drivers of plant genome plasticity, but the immediate molecular consequences of new TE insertions remain poorly understood. In this study, we generated a wild-type Arabidopsis thaliana population with novel insertions of ONSEN retrotransposon to investigate early epigenomic and transcriptomic changes using whole-genome and cDNA nanopore sequencing. We found that novel ONSEN insertions were distributed non-randomly, with a strong preference for genic regions, particularly in chromatin enriched for H2A.Z, H3K27me3, and H3K4me2. Most full-length ONSEN insertions within genes were rapidly recognized and spliced out as new introns (intronization), thereby mitigating potential deleterious effects on transcript isoforms. In some cases, ONSEN insertions provided alternative transcription start or termination sites, generating novel transcript isoforms. Genome-wide methylation analysis revealed that new ONSEN copies were efficiently and precisely targeted by DNA methylation. Independently on the location of the original ONSEN element, the euchromatic and heterochromatic insertions display distinct methylation signatures, reflecting the action of different epigenetic pathways. In conclusion, our results demonstrate that DNA methylation and alternative splicing are effective control mechanisms safeguarding the plant genome and transcriptome integrity after retrotransposition burst.
Collapse
Affiliation(s)
- Pavel Merkulov
- All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya Str. 42, 127550 Moscow, Russia; (P.M.); (A.L.); (K.T.); (M.S.)
- Moscow Center for Advanced Studies, Kulakova Str. 20, 123592 Moscow, Russia
| | - Anastasiia Latypova
- All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya Str. 42, 127550 Moscow, Russia; (P.M.); (A.L.); (K.T.); (M.S.)
- Moscow Center for Advanced Studies, Kulakova Str. 20, 123592 Moscow, Russia
| | - Kirill Tiurin
- All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya Str. 42, 127550 Moscow, Russia; (P.M.); (A.L.); (K.T.); (M.S.)
- Moscow Center for Advanced Studies, Kulakova Str. 20, 123592 Moscow, Russia
| | - Melania Serganova
- All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya Str. 42, 127550 Moscow, Russia; (P.M.); (A.L.); (K.T.); (M.S.)
- Moscow Center for Advanced Studies, Kulakova Str. 20, 123592 Moscow, Russia
| | - Ilya Kirov
- All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya Str. 42, 127550 Moscow, Russia; (P.M.); (A.L.); (K.T.); (M.S.)
- Moscow Center for Advanced Studies, Kulakova Str. 20, 123592 Moscow, Russia
| |
Collapse
|
6
|
Chapdelaine-Trépanier V, Shenoy S, Masud W, Minju-Op A, Bérubé MA, Schönherr S, Forer L, Fradet-Turcotte A, Taliun D, Cuella-Martin R. CRISPR-BEasy: a free web-based service for designing sgRNA tiling libraries for CRISPR-dependent base editing screens. Nucleic Acids Res 2025:gkaf382. [PMID: 40377102 DOI: 10.1093/nar/gkaf382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2025] [Revised: 04/15/2025] [Accepted: 04/24/2025] [Indexed: 05/18/2025] Open
Abstract
CRISPR-dependent base editing (BE) enables the modeling and correction of genetic mutations at single-base resolution. Base editing screens, where point mutations are queried en masse, are powerful tools to systematically draw genotype-phenotype associations and characterise the function of genes and other genomic elements. However, the lack of user-friendly web-based tools for designing base editing screens can hinder broad technology adoption. Here, we introduce CRISPR-BEasy (https://crispr-beasy.cerc-genomic-medicine.ca), a free, automated web-based server that streamlines the creation of single guide (sg)RNA tiling libraries for base editing screens. Researchers can provide their genes or genomic features of interest, their base editors of choice, and target sequences to act as positive and negative controls. The server designs and annotates sgRNA libraries by integrating custom code with publicly available tools such as crisprVerse and Ensembl's Variant Effect Predictor. CRISPR-BEasy provides downloadable results, including sgRNA on/off-target scores, predicted mutational outcomes per base editor, and intuitive interactive visualizations for data quality assessment. CRISPR-BEasy also provides a separate tool that assembles sgRNA libraries into oligonucleotides for cloning following the detailed protocol documented in the searchable web server manual. Together, CRISPR-BEasy ensures the seamless design of cloning-ready sgRNA libraries, seeking to democratise access to base editing screening technologies.
Collapse
Affiliation(s)
- Vincent Chapdelaine-Trépanier
- Department of Human Genetics, McGill University, Montreal, QC,H3A 0G1, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine, McGill University, Montreal, QC,H3A 0G1, Canada
| | - Shamika Shenoy
- Department of Human Genetics, McGill University, Montreal, QC,H3A 0G1, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine, McGill University, Montreal, QC,H3A 0G1, Canada
| | - Wardah Masud
- Department of Human Genetics, McGill University, Montreal, QC,H3A 0G1, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine, McGill University, Montreal, QC,H3A 0G1, Canada
| | - Amisha Minju-Op
- Department of Human Genetics, McGill University, Montreal, QC,H3A 0G1, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine, McGill University, Montreal, QC,H3A 0G1, Canada
| | - Marie-Anne Bérubé
- Department of Molecular Biology, Medical Biochemistry and Pathology, Faculty of Medicine, Université Laval, Québec City, QC,G1V 0A6, Canada
- Oncology Division, Centre Hospitalier Universitaire (CHU) de Québec-Université Laval Research Centre, Québec City, QC,G1R 2J6, Canada
- Université Laval Cancer Research Center, Université Laval, Québec City, QC,G1R 3S3, Canada
| | - Sebastian Schönherr
- Institute of Genetic Epidemiology, Department of Genetics, Medical University of Innsbruck, Innsbruck,6020, Austria
| | - Lukas Forer
- Institute of Genetic Epidemiology, Department of Genetics, Medical University of Innsbruck, Innsbruck,6020, Austria
| | - Amélie Fradet-Turcotte
- Department of Molecular Biology, Medical Biochemistry and Pathology, Faculty of Medicine, Université Laval, Québec City, QC,G1V 0A6, Canada
- Oncology Division, Centre Hospitalier Universitaire (CHU) de Québec-Université Laval Research Centre, Québec City, QC,G1R 2J6, Canada
- Université Laval Cancer Research Center, Université Laval, Québec City, QC,G1R 3S3, Canada
| | - Daniel Taliun
- Department of Human Genetics, McGill University, Montreal, QC,H3A 0G1, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine, McGill University, Montreal, QC,H3A 0G1, Canada
| | - Raquel Cuella-Martin
- Department of Human Genetics, McGill University, Montreal, QC,H3A 0G1, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine, McGill University, Montreal, QC,H3A 0G1, Canada
| |
Collapse
|
7
|
Feiner N, Uller T, de la Cruz F, Garcia-Roa R, Meier J, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective. The genome sequence of the Ibiza wall lizard, Podarcis pityusensis (Boscá, 1883). Wellcome Open Res 2025; 10:235. [PMID: 40520148 PMCID: PMC12163369 DOI: 10.12688/wellcomeopenres.24143.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/16/2025] [Indexed: 06/18/2025] Open
Abstract
Eukaryota; Opisthokonta; Metazoa; Eumetazoa; Bilateria; Deuterostomia; Chordata; Craniata; Vertebrata; Gnathostomata; Teleostomi; Euteleostomi; Sarcopterygii; Dipnotetrapodomorpha; Tetrapoda; Amniota; Sauropsida; Sauria; Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Laterata; Lacertibaenia; Lacertidae; Lacertinae; Podarcis; Podarcis pityusensis (Boscá, 1883) (NCBI:txid74359).
Collapse
Affiliation(s)
- Nathalie Feiner
- Max Planck Institute for Evolutionary Biology, Plön, Germany
- Department of Biology, Lund University, Lund, Sweden
| | - Tobias Uller
- Department of Biology, Lund University, Lund, Sweden
| | - Ferran de la Cruz
- CIBIO Research Centre in Biodiversity and Genetic Resources, InBIO Associate Laboratory, Universidade do Porto, Porto, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Vairão, Portugal
- Departamento de Biologia, Universidade do Porto Faculdade de Ciencias, Porto, Portugal
| | | | - Joana Meier
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team
- Max Planck Institute for Evolutionary Biology, Plön, Germany
- Department of Biology, Lund University, Lund, Sweden
- CIBIO Research Centre in Biodiversity and Genetic Resources, InBIO Associate Laboratory, Universidade do Porto, Porto, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Vairão, Portugal
- Departamento de Biologia, Universidade do Porto Faculdade de Ciencias, Porto, Portugal
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - Wellcome Sanger Institute Scientific Operations: Sequencing Operations
- Max Planck Institute for Evolutionary Biology, Plön, Germany
- Department of Biology, Lund University, Lund, Sweden
- CIBIO Research Centre in Biodiversity and Genetic Resources, InBIO Associate Laboratory, Universidade do Porto, Porto, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Vairão, Portugal
- Departamento de Biologia, Universidade do Porto Faculdade de Ciencias, Porto, Portugal
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - Wellcome Sanger Institute Tree of Life Core Informatics team
- Max Planck Institute for Evolutionary Biology, Plön, Germany
- Department of Biology, Lund University, Lund, Sweden
- CIBIO Research Centre in Biodiversity and Genetic Resources, InBIO Associate Laboratory, Universidade do Porto, Porto, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Vairão, Portugal
- Departamento de Biologia, Universidade do Porto Faculdade de Ciencias, Porto, Portugal
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | - Tree of Life Core Informatics collective
- Max Planck Institute for Evolutionary Biology, Plön, Germany
- Department of Biology, Lund University, Lund, Sweden
- CIBIO Research Centre in Biodiversity and Genetic Resources, InBIO Associate Laboratory, Universidade do Porto, Porto, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Vairão, Portugal
- Departamento de Biologia, Universidade do Porto Faculdade de Ciencias, Porto, Portugal
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| |
Collapse
|
8
|
Kim IV, Navarrete C, Grau-Bové X, Iglesias M, Elek A, Zolotarov G, Bykov NS, Montgomery SA, Ksiezopolska E, Cañas-Armenteros D, Soto-Angel JJ, Leys SP, Burkhardt P, Suga H, de Mendoza A, Marti-Renom MA, Sebé-Pedrós A. Chromatin loops are an ancestral hallmark of the animal regulatory genome. Nature 2025:10.1038/s41586-025-08960-w. [PMID: 40335694 DOI: 10.1038/s41586-025-08960-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 03/31/2025] [Indexed: 05/09/2025]
Abstract
In bilaterian animals, gene regulation is shaped by a combination of linear and spatial regulatory information. Regulatory elements along the genome are integrated into gene regulatory landscapes through chromatin compartmentalization1,2, insulation of neighbouring genomic regions3,4 and chromatin looping that brings together distal cis-regulatory sequences5. However, the evolution of these regulatory features is unknown because the three-dimensional genome architecture of most animal lineages remains unexplored6,7. To trace the evolutionary origins of animal genome regulation, here we characterized the physical organization of the genome in non-bilaterian animals (sponges, ctenophores, placozoans and cnidarians)8,9 and their closest unicellular relatives (ichthyosporeans, filastereans and choanoflagellates)10 by combining high-resolution chromosome conformation capture11,12 with epigenomic marks and gene expression data. Our comparative analysis showed that chromatin looping is a conserved feature of genome architecture in ctenophores, placozoans and cnidarians. These sequence-determined distal contacts involve both promoter-enhancer and promoter-promoter interactions. By contrast, chromatin loops are absent in the unicellular relatives of animals. Our findings indicate that spatial genome regulation emerged early in animal evolution. This evolutionary innovation introduced regulatory complexity, ultimately facilitating the diversification of animal developmental programmes and cell type repertoires.
Collapse
Affiliation(s)
- Iana V Kim
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain.
- Centre Nacional d'Anàlisis Genòmic (CNAG), Barcelona, Spain.
| | - Cristina Navarrete
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Xavier Grau-Bové
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Marta Iglesias
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Anamaria Elek
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Grygoriy Zolotarov
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | | | - Sean A Montgomery
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Ewa Ksiezopolska
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Didac Cañas-Armenteros
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | | | - Sally P Leys
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta, Canada
| | | | - Hiroshi Suga
- Department of Life and Environmental Sciences, Faculty of Bioresource Sciences, Prefectural University of Hiroshima, Shobara, Japan
| | - Alex de Mendoza
- School of Biological and Behavioral Sciences, Queen Mary University of London, London, UK
| | - Marc A Marti-Renom
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Centre Nacional d'Anàlisis Genòmic (CNAG), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- ICREA, Barcelona, Spain
| | - Arnau Sebé-Pedrós
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain.
- Universitat Pompeu Fabra (UPF), Barcelona, Spain.
- ICREA, Barcelona, Spain.
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK.
| |
Collapse
|
9
|
Zhao K, Xue H, Li G, Chitikineni A, Fan Y, Cao Z, Dong X, Lu H, Zhao K, Zhang L, Qiu D, Ren R, Gong F, Li Z, Ma X, Wan S, Varshney RK, Wei C, Yin D. Pangenome analysis reveals structural variation associated with seed size and weight traits in peanut. Nat Genet 2025; 57:1250-1261. [PMID: 40295880 PMCID: PMC12081311 DOI: 10.1038/s41588-025-02170-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Accepted: 03/17/2025] [Indexed: 04/30/2025]
Abstract
Peanut (Arachis hypogaea L.) is an important oilseed and food legume crop, with seed size and weight being critical traits for domestication and breeding. However, genomic rearrangements like structural variations (SVs) underlying seed size and weight remain unclear. Here we present a comprehensive pangenome analysis utilizing eight high-quality genomes (two diploid wild, two tetraploid wild and four tetraploid cultivated peanuts) and resequencing data of 269 accessions with diverse seed sizes. We identified 22,222 core or soft-core, 22,232 distributed and 5,643 private gene families. The frequency of SVs in subgenome A is higher than in subgenome B. We identified 1,335 domestication-related SVs and 190 SVs associated with seed size or weight. Notably, a 275-bp deletion in gene AhARF2-2 results in loss of interaction with AhIAA13 and TOPLESS, reducing the inhibitory effect on AhGRF5 and promoting seed expansion. This high-quality pangenome serves as a fundamental resource for the genetic enhancement of peanuts and other legume crops.
Collapse
Affiliation(s)
- Kunkun Zhao
- College of Agronomy, Henan Agricultural University, Zhengzhou, China
| | - Hongzhang Xue
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Guowei Li
- Institute of Crop Germplasm Resources, Shandong Academy of Agricultural Sciences, Ji'nan, China
| | - Annapurna Chitikineni
- WA State Biotechnology Centre, Centre for Crop and Food Innovation, Food Futures Institute, Murdoch University, Murdoch, Western Australia, Australia
| | - Yi Fan
- College of Agronomy, Henan Agricultural University, Zhengzhou, China
| | - Zenghui Cao
- College of Agronomy, Henan Agricultural University, Zhengzhou, China
| | - Xiaorui Dong
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Huimin Lu
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Kai Zhao
- College of Agronomy, Henan Agricultural University, Zhengzhou, China
| | - Lin Zhang
- College of Agronomy, Henan Agricultural University, Zhengzhou, China
| | - Ding Qiu
- College of Agronomy, Henan Agricultural University, Zhengzhou, China
| | - Rui Ren
- College of Agronomy, Henan Agricultural University, Zhengzhou, China
| | - Fangping Gong
- College of Agronomy, Henan Agricultural University, Zhengzhou, China
| | - Zhongfeng Li
- College of Agronomy, Henan Agricultural University, Zhengzhou, China
| | - Xingli Ma
- College of Agronomy, Henan Agricultural University, Zhengzhou, China
| | - Shubo Wan
- Institute of Crop Germplasm Resources, Shandong Academy of Agricultural Sciences, Ji'nan, China
| | - Rajeev K Varshney
- WA State Biotechnology Centre, Centre for Crop and Food Innovation, Food Futures Institute, Murdoch University, Murdoch, Western Australia, Australia.
| | - Chaochun Wei
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China.
| | - Dongmei Yin
- College of Agronomy, Henan Agricultural University, Zhengzhou, China.
| |
Collapse
|
10
|
Crowley LM, University of Oxford and Wytham Woods Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the Vestal Cuckoo Bee, Bombus vestalis (Geoffroy, 1785). Wellcome Open Res 2025; 10:218. [PMID: 40416837 PMCID: PMC12102655 DOI: 10.12688/wellcomeopenres.24004.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/25/2025] [Indexed: 05/27/2025] Open
Abstract
We present a genome assembly from a haploid male specimen of Bombus vestalis (Vestal Cuckoo Bee; Arthropoda; Insecta; Hymenoptera; Apidae). The genome sequence has a total length of 280.01 megabases. Most of the assembly (91.96%) is scaffolded into 25 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 24.37 kilobases in length. Gene annotation of this assembly on Ensembl identified 11,600 protein-coding genes.
Collapse
|
11
|
Liesner D, Cossard GG, Zheng M, Godfroy O, Barrera-Redondo J, Haas FB, Coelho SM. Developmental pathways underlying sexual differentiation in the U/V sex chromosome system of giant kelp. Dev Cell 2025; 60:1142-1152.e6. [PMID: 39793585 DOI: 10.1016/j.devcel.2024.12.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2024] [Revised: 06/04/2024] [Accepted: 12/11/2024] [Indexed: 01/13/2025]
Abstract
In many multicellular organisms, sexual development is not determined by XX/XY or ZW/ZZ systems but by U/V sex chromosomes. In U/V systems, sex determination occurs in the haploid phase, with U chromosomes in females and V chromosomes in males. Here, we explore several male, female, and partially sex-reversed male lines of giant kelp to decipher how U/V sex chromosomes and autosomes initiate male versus female development. We identify a key set of genes on the sex chromosomes involved in triggering sexual development and characterize autosomal effector genes underlying sexual differentiation. We show that male, but not female, development involves large-scale transcriptome reorganization with pervasive enrichment in regulatory genes, faster evolutionary rates, and high species-specificity of male-biased genes. Our observations imply that a female-like phenotype is the "ground state", which is complemented by the presence of a U-chromosome but overridden by a dominant male developmental program triggered by the V-chromosome.
Collapse
Affiliation(s)
- Daniel Liesner
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, Max-Planck-Ring 5, 72076 Tübingen, Germany.
| | - Guillaume G Cossard
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, Max-Planck-Ring 5, 72076 Tübingen, Germany
| | - Min Zheng
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, Max-Planck-Ring 5, 72076 Tübingen, Germany
| | - Olivier Godfroy
- Station Biologique de Roscoff CNRS, UMR8227 Laboratory of Integrative Biology of Marine Models, Sorbonne Université, Station Biologique de Roscoff, Roscoff, France
| | - Josué Barrera-Redondo
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, Max-Planck-Ring 5, 72076 Tübingen, Germany
| | - Fabian B Haas
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, Max-Planck-Ring 5, 72076 Tübingen, Germany
| | - Susana M Coelho
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, Max-Planck-Ring 5, 72076 Tübingen, Germany.
| |
Collapse
|
12
|
Li Z, Pollet N. Impact of a horizontally transferred Helitron family on genome evolution in Xenopus laevis. Mob DNA 2025; 16:19. [PMID: 40241130 PMCID: PMC12001565 DOI: 10.1186/s13100-025-00356-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2024] [Accepted: 03/26/2025] [Indexed: 04/18/2025] Open
Abstract
BACKGROUND Within eukaryotes, most horizontal transfer of genetic material involves mobile DNA sequences and such events are called horizontal transposable element transfer (HTT). Although thousands of HTT examples have been reported, the transfer mechanisms and their impacts on host genomes remain elusive. RESULTS In this work, we carefully annotated three Helitron families within several Xenopus frog genomes. One of the Helitron family, Heli1Xen1, is recurrently involved in capturing and shuffling Xenopus laevis genes required in early embryonic development. Remarkably, we found that Heli1Xen1 is seemingly expressed in X. laevis and has produced multiple genomic polymorphisms within the X. laevis population. To identify the origin of Heli1Xen1, we searched its consensus sequence against available genome assemblies. We found highly similar copies in the genomes of another 13 vertebrate species from divergent vertebrate lineages, including reptiles, ray-finned fishes and amphibians. Further phylogenetic analysis provides evidence showing that Heli1Xen1 invaded these lineages via HTT quite recently, around 0.58-10.74 million years ago. CONCLUSIONS The frequently Heli1Xen1-involved HTT events among reptiles, fishes and amphibians could provide insights into possible vectors for transfer, such as shared viruses across lineages. Furthermore, we propose that the Heli1Xen1 sequence could be an ideal candidate for studying the mechanism and genomic impact of Helitron transposition.
Collapse
Affiliation(s)
- Zhen Li
- UMR Évolution, Génomes, Comportement et Écologie, Université Paris-Saclay, CNRS, IRD, Gif-sur-Yvette, 91198, France
| | - Nicolas Pollet
- UMR Évolution, Génomes, Comportement et Écologie, Université Paris-Saclay, CNRS, IRD, Gif-sur-Yvette, 91198, France.
| |
Collapse
|
13
|
Davison NJ, Morin PA, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the harbour porpoise, Phocoena phocoena (Linnaeus, 1758). Wellcome Open Res 2025; 10:181. [PMID: 40256618 PMCID: PMC12006785 DOI: 10.12688/wellcomeopenres.24011.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/25/2025] [Indexed: 04/22/2025] Open
Abstract
We present a genome assembly from a female specimen of Phocoena phocoena (harbour porpoise; Chordata; Mammalia; Artiodactyla; Phocoenidae). The genome sequence has a total length of 2,512.71 megabases. Most of the assembly (94.41%) is scaffolded into 22 chromosomal pseudomolecules, including the X sex chromosome. The mitochondrial genome has also been assembled, with a length of 16.38 kilobases.
Collapse
Affiliation(s)
- Nicholas J. Davison
- Scottish Marine Animal Stranding Scheme, University of Glasgow Institute of Biodiversity Animal Health and Comparative Medicine, Glasgow, Scotland, UK
| | - Phillip A. Morin
- NOAA Fisheries Southwest Fisheries Science Center, La Jolla, California, USA
| | | | | | | | - Tree of Life Core Informatics collective
- Scottish Marine Animal Stranding Scheme, University of Glasgow Institute of Biodiversity Animal Health and Comparative Medicine, Glasgow, Scotland, UK
- NOAA Fisheries Southwest Fisheries Science Center, La Jolla, California, USA
| | | |
Collapse
|
14
|
Rammou DL, Youlatos D, Triantafyllidis A, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective. The genome sequence of the European ground squirrel, Spermophilus citellus (Linnaeus, 1766). Wellcome Open Res 2025; 10:184. [PMID: 40433625 PMCID: PMC12107237 DOI: 10.12688/wellcomeopenres.23974.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/20/2025] [Indexed: 05/29/2025] Open
Abstract
We present a genome assembly from a female Spermophilus citellus (European ground squirrel; Chordata; Mammalia; Rodentia; Sciuridae). The genome sequence has a total length of 3,090.03 megabases. Most of the assembly (95.47%) is scaffolded into 20 chromosomal pseudomolecules, including the X sex chromosome. The mitochondrial genome has also been assembled, with a length of 16.45 kilobases.
Collapse
Affiliation(s)
- Dimitra-Lida Rammou
- Department of Zoology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Dionisios Youlatos
- Department of Zoology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece
- International Center for Biodiversity and Primate Conservation, Dali University, Dali, Yunnan, China
| | - Alexandros Triantafyllidis
- Department of Genetics, Development & Molecular Biology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece
- Genomics and Epigenomics Translational Research, Center for Interdisciplinary Research and Innovation, Balkan Center, Thessaloniki, Greece
| | | | - Wellcome Sanger Institute Scientific Operations: Sequencing Operations
- Department of Zoology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece
- International Center for Biodiversity and Primate Conservation, Dali University, Dali, Yunnan, China
- Department of Genetics, Development & Molecular Biology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece
- Genomics and Epigenomics Translational Research, Center for Interdisciplinary Research and Innovation, Balkan Center, Thessaloniki, Greece
| | - Wellcome Sanger Institute Tree of Life Core Informatics team
- Department of Zoology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece
- International Center for Biodiversity and Primate Conservation, Dali University, Dali, Yunnan, China
- Department of Genetics, Development & Molecular Biology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece
- Genomics and Epigenomics Translational Research, Center for Interdisciplinary Research and Innovation, Balkan Center, Thessaloniki, Greece
| | - Tree of Life Core Informatics collective
- Department of Zoology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece
- International Center for Biodiversity and Primate Conservation, Dali University, Dali, Yunnan, China
- Department of Genetics, Development & Molecular Biology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece
- Genomics and Epigenomics Translational Research, Center for Interdisciplinary Research and Innovation, Balkan Center, Thessaloniki, Greece
| |
Collapse
|
15
|
Davison NJ, Morin PA, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of long-finned pilot whale, Globicephala melas (Traill, 1809). Wellcome Open Res 2025; 10:180. [PMID: 40256617 PMCID: PMC12009481 DOI: 10.12688/wellcomeopenres.23919.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/11/2025] [Indexed: 04/22/2025] Open
Abstract
We present a genome assembly from a male specimen of Globicephala melas (long-finned pilot whale; Chordata; Mammalia; Artiodactyla; Delphinidae). The genome sequence has a total length of 2,651.28 megabases. Most of the assembly (89.15%) is scaffolded into 23 chromosomal pseudomolecules, including the X and Y sex chromosomes. The mitochondrial genome has also been assembled, with a length of 16.39 kilobases. Gene annotation of this assembly on Ensembl identified 17,911 protein-coding genes.
Collapse
Affiliation(s)
| | - Phillip A. Morin
- Southwest Fisheries Science Center, National Marine Fisheries Service, NOAA, La Jolla, California, USA
| | | | | | | | | | | |
Collapse
|
16
|
Crowley LM, Sivell O, Mitchell R, Sivell D, University of Oxford and Wytham Woods Genome Acquisition Lab, Natural History Museum Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of a snail-killing fly, Dichetophora obliterata (Fabricius, 1805). Wellcome Open Res 2025; 10:176. [PMID: 40433623 PMCID: PMC12107236 DOI: 10.12688/wellcomeopenres.23993.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/21/2025] [Indexed: 05/29/2025] Open
Abstract
We present a genome assembly from a female specimen of Dichetophora obliterata (snail-killing fly; Arthropoda; Insecta; Diptera; Sciomyzidae). The genome sequence has a total length of 1,312.79 megabases. Most of the assembly (99.78%) is scaffolded into 6 chromosomal pseudomolecules. The mitochondrial genome has also been assembled, with a length of 21.36 kilobases. Gene annotation of this assembly on Ensembl identified 15,139 protein-coding genes.
Collapse
Affiliation(s)
| | | | - Ryan Mitchell
- Independent researcher, Sligo, County Sligo, Ireland
| | | | | | | | | | - Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team
- University of Oxford, Oxford, England, UK
- Natural History Museum, London, England, UK
- Independent researcher, Sligo, County Sligo, Ireland
| | | | | | | | | |
Collapse
|
17
|
Davison NJ, Morin PA, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the short-beaked common dolphin, Delphinus delphis Linnaeus, 1758. Wellcome Open Res 2025; 10:178. [PMID: 40290354 PMCID: PMC12022549 DOI: 10.12688/wellcomeopenres.23918.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/11/2025] [Indexed: 04/30/2025] Open
Abstract
We present a genome assembly from a male specimen of Delphinus delphis (short-beaked common dolphin; Chordata; Mammalia; Artiodactyla; Delphinidae). The genome sequence has a total length of 2,663.52 megabases. Most of the assembly (88.76%) is scaffolded into 23 chromosomal pseudomolecules, including the X and Y sex chromosomes. The mitochondrial genome has also been assembled, with a length of 16.39 kilobases. Gene annotation of this assembly at Ensembl identified 17,797 protein-coding genes.
Collapse
Affiliation(s)
| | - Phillip A. Morin
- Southwest Fisheries Science Center, National Marine Fisheries Service, NOAA, La Jolla, California, USA
| | | | | | | | | | | |
Collapse
|
18
|
Albornos L, Iriondo P, Rodríguez-Marcos S, Farelo P, Sobrino-Mengual G, Muñoz-Centeno LM, Martín I, Dopico B. A Comprehensive Analysis of Short Specific Tissue (SST) Proteins, a New Group of Proteins from PF10950 That May Give Rise to Cyclopeptide Alkaloids. PLANTS (BASEL, SWITZERLAND) 2025; 14:1117. [PMID: 40219186 PMCID: PMC11991032 DOI: 10.3390/plants14071117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2025] [Revised: 03/24/2025] [Accepted: 03/26/2025] [Indexed: 04/14/2025]
Abstract
Proteins of the PF10950 family feature the DUF2775 domain of unknown function. The most studied are specific tissue (ST) proteins with tandem repeats, which are putative precursors of cyclopeptide alkaloids. Here, we study uncharacterised short ST (SST) proteins with the DUFF2775 domain by analysing 194 sequences from 120 species of 39 taxonomic families in silico. SST proteins have a signal peptide and their size and several other characteristics depend on their individual taxonomic family. Sequence analyses revealed that SST proteins contain two well-conserved regions, one resembling the ST repeat, which could constitute the core of cyclopeptide alkaloids. We studied the unique SST1 gene of Arabidopsis thaliana, which is adjacent to and co-expressed with a gene encoding a protein with a BURP domain, associated with cyclopeptide production. The empirical analysis indicated that the SST1 promoter is mainly activated in the roots, where most of the transcripts accumulate, and that the SST1 protein accumulates in the root vascular cambium. At the cellular level, SST fused to GFP appears in vesicles that co-localise with the endoplasmic reticulum and the vacuole. Thus, SSTs are a new type of PF10950 protein found in core eudicots with two conserved regions that could be involved in root biology.
Collapse
Affiliation(s)
- Lucía Albornos
- Department of Botany and Plant Physiology, University of Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain; (P.I.); (S.R.-M.); (P.F.); (G.S.-M.); (L.M.M.-C.); (I.M.)
- Institute for Agribiotechnology Research (CIALE), University of Salamanca, Campus Villamayor-Parque Científico, 37185 Villamayor, Spain
| | - Paula Iriondo
- Department of Botany and Plant Physiology, University of Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain; (P.I.); (S.R.-M.); (P.F.); (G.S.-M.); (L.M.M.-C.); (I.M.)
- Institute for Agribiotechnology Research (CIALE), University of Salamanca, Campus Villamayor-Parque Científico, 37185 Villamayor, Spain
| | - Silvia Rodríguez-Marcos
- Department of Botany and Plant Physiology, University of Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain; (P.I.); (S.R.-M.); (P.F.); (G.S.-M.); (L.M.M.-C.); (I.M.)
| | - Patricia Farelo
- Department of Botany and Plant Physiology, University of Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain; (P.I.); (S.R.-M.); (P.F.); (G.S.-M.); (L.M.M.-C.); (I.M.)
| | - Guillermo Sobrino-Mengual
- Department of Botany and Plant Physiology, University of Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain; (P.I.); (S.R.-M.); (P.F.); (G.S.-M.); (L.M.M.-C.); (I.M.)
| | - Luz María Muñoz-Centeno
- Department of Botany and Plant Physiology, University of Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain; (P.I.); (S.R.-M.); (P.F.); (G.S.-M.); (L.M.M.-C.); (I.M.)
| | - Ignacio Martín
- Department of Botany and Plant Physiology, University of Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain; (P.I.); (S.R.-M.); (P.F.); (G.S.-M.); (L.M.M.-C.); (I.M.)
- Institute for Agribiotechnology Research (CIALE), University of Salamanca, Campus Villamayor-Parque Científico, 37185 Villamayor, Spain
| | - Berta Dopico
- Department of Botany and Plant Physiology, University of Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain; (P.I.); (S.R.-M.); (P.F.); (G.S.-M.); (L.M.M.-C.); (I.M.)
- Institute for Agribiotechnology Research (CIALE), University of Salamanca, Campus Villamayor-Parque Científico, 37185 Villamayor, Spain
| |
Collapse
|
19
|
Kogay R, Wolf YI, Koonin EV. Horizontal Transfer of Bacterial Operons into Eukaryote Genomes. Genome Biol Evol 2025; 17:evaf055. [PMID: 40111106 PMCID: PMC11965790 DOI: 10.1093/gbe/evaf055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2025] [Revised: 03/12/2025] [Accepted: 03/16/2025] [Indexed: 03/22/2025] Open
Abstract
In prokaryotes, functionally linked genes are typically clustered into operons, which are transcribed into a single mRNA, providing for the coregulation of the production of the respective proteins, whereas eukaryotes generally lack operons. We explored the possibility that some prokaryotic operons persist in eukaryotic genomes after horizontal gene transfer (HGT) from bacteria. Extensive comparative analysis of prokaryote and eukaryote genomes revealed 33 gene pairs originating from bacterial operons, mostly encoding enzymes of the same metabolic pathways, and represented in distinct clades of fungi or amoebozoa. This amount of HGT is about an order of magnitude less than that observed for the respective individual genes. These operon fragments appear to be relatively recent acquisitions as indicated by their narrow phylogenetic spread and low intron density. In 20 of the 33 horizontally acquired operonic gene pairs, the genes are fused in the respective group of eukaryotes so that the encoded proteins become domains of a multifunctional protein ensuring coregulation and correct stoichiometry. We hypothesize that bacterial operons acquired via HGT initially persist in eukaryotic genomes under a neutral evolution regime and subsequently are either disrupted by genome rearrangement or undergo gene fusion which is then maintained by selection.
Collapse
Affiliation(s)
- Roman Kogay
- Computational Biology Branch, Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Yuri I Wolf
- Computational Biology Branch, Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Eugene V Koonin
- Computational Biology Branch, Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
20
|
Hostetler T, Chen TY, Chang WC. Bioinformatic, structural, and biochemical analysis leads to the discovery of novel isonitrilases and decodes their substrate selectivity. RSC Chem Biol 2025; 6:583-589. [PMID: 39944535 PMCID: PMC11811631 DOI: 10.1039/d4cb00304g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2024] [Accepted: 01/28/2025] [Indexed: 04/04/2025] Open
Abstract
Bacterial species, such as Mycobacterium tuberculosis, utilize isonitrile-containing peptides (INPs) for trace metal trafficking, e.g., copper or zinc. Despite their importance, very few INP structures have been characterized to date. Reported INPs consist of a peptide backbone and β-isonitrile amide moieties. While the peptide backbone can be annotated using an adenylation domain predictor of non-ribosomal peptide synthetase (NRPS), determining the alkyl chain of β-isonitrile amide moieties remains challenging via conventional analytical techniques. In this study, we focus on non-heme iron and 2-oxoglutarate (Fe/2OG) dependent isonitrilases that exhibit inherent selectivity toward the alkyl chain length of the substrate, thus enabling the structural elucidation of INPs. Based on two known isonitrilase structures, we identified eight residue positions that control substrate selectivity. Using a custom Python program that we developed, BioSynthNexus, over 350 Fe/2OG isonitrilase genes were identified. One of these enzymes was engineered through mutations at eight selected positions, effectively modifying its substrate preference to favor either a shorter or a longer alkyl chain. Furthermore, by examining several annotated isonitrilases at eight selected positions, substrate preferences of several isonitrilases were predicted and validated through biochemical assays. Together, these findings allow for effective identification of isonitrilases and INPs, and establish a predictive framework for determining the preferred alkyl chain of β-isonitrile amide moieties.
Collapse
Affiliation(s)
- Tyler Hostetler
- Department of Chemistry, North Carolina State University Raleigh NC 27695 USA
| | - Tzu-Yu Chen
- Department of Chemistry, North Carolina State University Raleigh NC 27695 USA
| | - Wei-Chen Chang
- Department of Chemistry, North Carolina State University Raleigh NC 27695 USA
| |
Collapse
|
21
|
Yano R, Li F, Hiraga S, Takeshima R, Kobayashi M, Toda K, Umehara Y, Kajiya-Kanegae H, Iwata H, Kaga A, Ishimoto M. The genomic landscape of gene-level structural variations in Japanese and global soybean Glycine max cultivars. Nat Genet 2025; 57:973-985. [PMID: 40033060 PMCID: PMC11985339 DOI: 10.1038/s41588-025-02113-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 01/30/2025] [Indexed: 03/05/2025]
Abstract
Japanese soybeans are traditionally bred to produce soy foods such as tofu, miso and boiled soybeans. Here, to investigate their distinctive genomic features, including genomic structural variations (SVs), we constructed 11 nanopore-based genome references for Japanese and other soybean lines. Our assembly-based comparative method, designated 'Asm2sv', identified gene-level SVs comprehensively, enabling pangenome analysis of 462 worldwide cultivars and varieties. Based on these, we identified selective sweeps between Japanese and US soybeans, one of which was the pod-shattering resistance gene PDH1. Genome-wide association studies further identified several quantitative trait loci that accounted for large-seed phenotypes of Japanese soybean lines, some of which were also close to regions of the selective sweeps, including PDH1. Notably, specific combinations of alleles, including SVs, were found to increase the seed size of some Japanese landraces. In addition to the differences in cultivation environments, distinct food processing usages might result in changes in Japanese soybean genomes.
Collapse
Affiliation(s)
- Ryoichi Yano
- Research Center for Advanced Analysis, National Agriculture and Food Research Organization (NARO), Tsukuba, Japan.
| | - Feng Li
- Institute of Crop Science, NARO, Tsukuba, Japan
| | | | | | | | - Kyoko Toda
- Research Center of Genetic Resources, NARO, Tsukuba, Japan
| | - Yosuke Umehara
- Institute of Agrobiological Sciences, NARO, Tsukuba, Japan
| | - Hiromi Kajiya-Kanegae
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Hiroyoshi Iwata
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Akito Kaga
- Institute of Crop Science, NARO, Tsukuba, Japan
| | | |
Collapse
|
22
|
Saxena AS, Baer CF. High rate of mutation and efficient removal by selection of structural variants from natural populations of Caenorhabditis elegans. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.22.644739. [PMID: 40196536 PMCID: PMC11974759 DOI: 10.1101/2025.03.22.644739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
The importance of genomic structural variants (SVs) is well-appreciated, but much less is known about their mutational properties than of single nucleotide variants (SNVs) and short indels. The reason is simple: the longer the mutation, the less likely it will be covered by a single sequencing read, thus the harder it is to map unambiguously to a unique genomic location. Here we report SV mutation rate estimates from six mutation accumulation (MA) lines from two strains of C. elegans (N2 and PB306) using long-read (PacBio) sequencing. The inferred SV mutation rate ~1/10 the SNV rate and ~1/4 the short indel rate. We identified 40 mutations, and removed 52 false positives (FP) by manual inspection of each SV call. Excluding one atypical line (5 mutations, 35 FPs), the signal (inferred mutant) to noise (FP) ratio is approximately 2:1. False negative rates were determined by simulating variants in the reference genome, and observing 'recall'. Recall rate ranges from >90% for short indels and declines as SV length increases. Small deletions have nearly the same recall rate as small insertions (~100bp), but deletions have higher recall rates than insertions as size increases. The reported SV mutation rate is likely an underestimate. A quarter of identified SV mutations occur in SV hotspots that harbor pre-existing low complexity repeat variation. By comparison of the spectrum of spontaneous SVs to wild isolates, we infer that natural selection is not only efficient at removing SVs in exons, but also removes roughly half of SVs in intergenic regions.
Collapse
Affiliation(s)
- Ayush Shekhar Saxena
- Department of Biology, University of Florida, Gainesville, FL, USA
- Present address – Regeneron Pharmaceuticals Inc., Tarrytown, NY, USA
| | - Charles F. Baer
- Department of Biology, University of Florida, Gainesville, FL, USA
- University of Florida Genetics Institute, Gainesville, FL, USA
| |
Collapse
|
23
|
Delić T, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Tree of Life Core Informatics collective, Wellcome Sanger Institute Tree of Life Core Informatics team. The genome sequence of a cave beetle, Leptodirus hochenwartii F.J.Schmidt, 1832. Wellcome Open Res 2025; 10:159. [PMID: 40225906 PMCID: PMC11992515 DOI: 10.12688/wellcomeopenres.23959.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/17/2025] [Indexed: 04/15/2025] Open
Abstract
We present a genome assembly from a male specimen of Leptodirus hochenwartii (cave beetle; Arthropoda; Insecta; Coleoptera; Leiodidae). The genome sequence has a total length of 492.36 megabases. Most of the assembly (98.03%) is scaffolded into 14 chromosomal pseudomolecules, including the X and Y sex chromosomes. The mitochondrial genome has also been assembled and is 22.01 kilobases in length.
Collapse
Affiliation(s)
- Teo Delić
- SubBio Laboratory, Department of Biology, University of Ljubljana, Ljubljana, Slovenia
| | | | | | | | | |
Collapse
|
24
|
Hutchinson F, Crowley LM, University of Oxford and Wytham Woods Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the Four-spotted Footman moth, Lithosia quadra (Linnaeus, 1758). Wellcome Open Res 2025; 10:146. [PMID: 40302899 PMCID: PMC12038344 DOI: 10.12688/wellcomeopenres.23788.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/12/2025] [Indexed: 05/02/2025] Open
Abstract
We present a genome assembly from a male Lithosia quadra (Four-spotted Footman; Arthropoda; Insecta; Lepidoptera; Erebidae). The genome sequence has a total length of 456.27 megabases. Most of the assembly (99.91%) is scaffolded into 31 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 15.38 kilobases in length.
Collapse
|
25
|
Broad GR, Lees DC, Boyes D, Natural History Museum Genome Acquisition Lab, University of Oxford and Wytham Woods Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the Straw-barred Pearl moth, Pyrausta despicata Scopoli, 1763. Wellcome Open Res 2025; 10:151. [PMID: 40443799 PMCID: PMC12120437 DOI: 10.12688/wellcomeopenres.23902.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/06/2025] [Indexed: 06/02/2025] Open
Abstract
We present a genome assembly from a male specimen of Pyrausta despicata (Straw-barred Pearl; Arthropoda; Insecta; Lepidoptera; Crambidae). The genome sequence has a total length of 481.83 megabases. Most of the assembly (99.61%) is scaffolded into 31 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled, with a length of 15.29 kilobases.
Collapse
Affiliation(s)
| | | | - Douglas Boyes
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | | | | | | | | | | | | | | | | |
Collapse
|
26
|
Sivell O, Taylor SC, Barclay MVL, Skipp S, Geiser MF, Natural History Museum Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of a beetle, Pycnomerus fuliginosus Erichson, 1842. Wellcome Open Res 2025; 10:144. [PMID: 40256616 PMCID: PMC12006784 DOI: 10.12688/wellcomeopenres.23770.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/11/2025] [Indexed: 04/22/2025] Open
Abstract
We present a genome assembly from a female Pycnomerus fuliginosus (beetle; Arthropoda; Insecta; Coleoptera; Zopheridae). The genome sequence has a total length of 359.22 megabases. Most of the assembly (95.81%) is scaffolded into 11 chromosomal pseudomolecules, including the X sex chromosome. The mitochondrial genome has also been assembled and is 17.21 kilobases in length. Gene annotation of this assembly on Ensembl identified 11,547 protein-coding genes.
Collapse
|
27
|
O’Brien MF, Lopez Colom R, Natural History Museum Genome Acquisition Lab, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the Common Pochard, Aythya ferina (Linnaeus, 1758). Wellcome Open Res 2025; 10:152. [PMID: 40256619 PMCID: PMC12009480 DOI: 10.12688/wellcomeopenres.23904.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/06/2025] [Indexed: 04/22/2025] Open
Abstract
We present a genome assembly from a female specimen of Aythya ferina (Common Pochard; Chordata; Aves; Anseriformes; Anatidae). The assembly contains two haplotypes with total lengths of 1,252.30 megabases and 1,103.59 megabases. Most of haplotype 1 (92.13%) is scaffolded into 41 chromosomal pseudomolecules, including the W and Z sex chromosomes. Haplotype 2 was assembled to scaffold level. The mitochondrial genome has also been assembled, with a length of 16.6 kilobases.
Collapse
|
28
|
Hutchinson F, Crowley LM, University of Oxford and Wytham Woods Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the Maple Pug moth, Eupithecia inturbata (Hübner, 1817). Wellcome Open Res 2025; 10:141. [PMID: 40225904 PMCID: PMC11992513 DOI: 10.12688/wellcomeopenres.23796.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/12/2025] [Indexed: 04/15/2025] Open
Abstract
We present a genome assembly from a female Eupithecia inturbata (Maple Pug; Arthropoda; Insecta; Lepidoptera; Geometridae). The genome sequence has a total length of 427.76 megabases. Most of the assembly (99.94%) is scaffolded into 31 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled, with a length of 15.33 kilobases. Gene annotation of this assembly on Ensembl identified 12,386 protein-coding genes.
Collapse
|
29
|
Davis J, Griffiths A. The genome sequence of the True Lover's Knot moth, Lycophotia porphyrea (Denis & Schiffermüller), 1775. Wellcome Open Res 2025; 9:469. [PMID: 40115046 PMCID: PMC11923537 DOI: 10.12688/wellcomeopenres.22897.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/14/2025] [Indexed: 03/22/2025] Open
Abstract
We present a genome assembly from an individual male Lycophotia porphyrea (the True Lover's Knot; Arthropoda; Insecta; Lepidoptera; Noctuidae). The genome sequence has a total length of 542.40 megabases. Most of the assembly is scaffolded into 31 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 15.39 kilobases in length. Gene annotation of this assembly at Ensembl identified 17,907 protein-coding genes.
Collapse
Affiliation(s)
| | - Andy Griffiths
- Wellcome Sanger Institute, Hinxton, England, UK
- Royal Botanic Garden Edinburgh, Edinburgh, Scotland, UK
| |
Collapse
|
30
|
Barclay MVL, Turner T, Telfer MG, Geiser MF, Natural History Museum Genome Acquisition Lab, University of Oxford and Wytham Woods Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of a bark-dwelling beetle, Silvanus unidentatus (Olivier, 1790). Wellcome Open Res 2025; 10:136. [PMID: 40206407 PMCID: PMC11979581 DOI: 10.12688/wellcomeopenres.23771.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/11/2025] [Indexed: 04/11/2025] Open
Abstract
We present a genome assembly from a specimen of Silvanus unidentatus (bark-dwelling beetle; Arthropoda; Insecta; Coleoptera; Silvanidae). The genome sequence has a total length of 187.15 megabases. Most of the assembly (84.24%) is scaffolded into 7 chromosomal pseudomolecules, including the X sex chromosome. The mitochondrial genome has also been assembled and is 17.15 kilobases in length. Gene annotation of this assembly on Ensembl identified 11,364 protein-coding genes.
Collapse
Affiliation(s)
| | | | - Mark G. Telfer
- Independent researcher, Ventnor, Isle of Wight, England, UK
| | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Broad GR, Crowley LM, Januszczak I, Fletcher C, Barnes I, McCulloch J, De Ketelaere A, Natural History Museum Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of an ichneumonid wasp, Polytribax perspicillator (Gravenhorst, 1807). Wellcome Open Res 2025; 10:140. [PMID: 40271159 PMCID: PMC12015431 DOI: 10.12688/wellcomeopenres.23769.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/11/2025] [Indexed: 04/25/2025] Open
Abstract
We present a genome assembly from a male Polytribax perspicillator (ichneumonid wasp; Arthropoda; Insecta; Hymenoptera; Ichneumonidae). The genome sequence has a total length of 314.21 megabases. Most of the assembly (99.94%) is scaffolded into 8 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 25.68 kilobases in length.
Collapse
Affiliation(s)
| | | | | | | | - Ian Barnes
- Natural History Museum, London, England, UK
| | - James McCulloch
- University of Oxford, Oxford, England, UK
- Wellcome Sanger Institute, Hinxton, England, UK
| | | | - Natural History Museum Genome Acquisition Lab
- Natural History Museum, London, England, UK
- University of Oxford, Oxford, England, UK
- Wellcome Sanger Institute, Hinxton, England, UK
- Independent researcher, Beernem, Belgium
| | - Darwin Tree of Life Barcoding collective
- Natural History Museum, London, England, UK
- University of Oxford, Oxford, England, UK
- Wellcome Sanger Institute, Hinxton, England, UK
- Independent researcher, Beernem, Belgium
| | - Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team
- Natural History Museum, London, England, UK
- University of Oxford, Oxford, England, UK
- Wellcome Sanger Institute, Hinxton, England, UK
- Independent researcher, Beernem, Belgium
| | - Wellcome Sanger Institute Scientific Operations: Sequencing Operations
- Natural History Museum, London, England, UK
- University of Oxford, Oxford, England, UK
- Wellcome Sanger Institute, Hinxton, England, UK
- Independent researcher, Beernem, Belgium
| | - Wellcome Sanger Institute Tree of Life Core Informatics team
- Natural History Museum, London, England, UK
- University of Oxford, Oxford, England, UK
- Wellcome Sanger Institute, Hinxton, England, UK
- Independent researcher, Beernem, Belgium
| | - Tree of Life Core Informatics collective
- Natural History Museum, London, England, UK
- University of Oxford, Oxford, England, UK
- Wellcome Sanger Institute, Hinxton, England, UK
- Independent researcher, Beernem, Belgium
| | | |
Collapse
|
32
|
Engel SR, Aleksander S, Nash RS, Wong ED, Weng S, Miyasato SR, Sherlock G, Cherry JM. Saccharomyces Genome Database: advances in genome annotation, expanded biochemical pathways, and other key enhancements. Genetics 2025; 229:iyae185. [PMID: 39530598 PMCID: PMC11912841 DOI: 10.1093/genetics/iyae185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2024] [Revised: 10/29/2024] [Accepted: 11/02/2024] [Indexed: 11/16/2024] Open
Abstract
Budding yeast (Saccharomyces cerevisiae) is the most extensively characterized eukaryotic model organism and has long been used to gain insight into the fundamentals of genetics, cellular biology, and the functions of specific genes and proteins. The Saccharomyces Genome Database (SGD) is a scientific resource that provides information about the genome and biology of S. cerevisiae. For more than 30 years, SGD has maintained the genetic nomenclature, chromosome maps, and functional annotation for budding yeast along with search and analysis tools to explore these data. Here, we describe recent updates at SGD, including the 2 most recent reference genome annotation updates, expanded biochemical pathway representation, changes to SGD search and data files, and other enhancements to the SGD website and user interface. These activities are part of our continuing effort to promote insights gained from yeast to enable the discovery of functional relationships between sequence and gene products in fungi and higher eukaryotes.
Collapse
Affiliation(s)
- Stacia R Engel
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - Suzi Aleksander
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - Robert S Nash
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - Edith D Wong
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - Shuai Weng
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - Stuart R Miyasato
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - Gavin Sherlock
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - J Michael Cherry
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| |
Collapse
|
33
|
Hamilton JP, Brose J, Buell CR. SpudDB: a database for accessing potato genomic data. Genetics 2025; 229:iyae205. [PMID: 39657689 PMCID: PMC11912843 DOI: 10.1093/genetics/iyae205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2024] [Revised: 11/08/2024] [Accepted: 11/14/2024] [Indexed: 12/12/2024] Open
Abstract
Potato is a key food crop with a complex, polyploid genome. Advancements in sequencing technologies coupled with improvements in genome assembly algorithms have enabled generation of phased, chromosome-scale genome assemblies for cultivated tetraploid potato. The SpudDB database houses potato genome sequence and annotation, with the doubled monoploid DM 1-3 516 R44 (hereafter DM) genome serving as the reference genome and haplotype. Diverse annotation data types for DM genes are provided through a suite of Gene Report Pages including gene expression profiles across 438 potato samples. To further annotate potato genes based on expression, 65 gene co-expression modules were constructed that permit the identification of tightly co-regulated genes within DM across development and responses to wounding, abiotic stress, and biotic stress. Genome browser views of DM and 28 other potato genomes are provided along with a download page for genome sequence and annotation. To link syntenic genes within and between haplotypes, syntelogs were identified across 25 cultivated potato genomes. Through access to potato genome sequences and associated annotations, SpudDB can enable potato biologists, geneticists, and breeders to continue to improve this key food crop.
Collapse
Affiliation(s)
- John P Hamilton
- Department of Crop & Soil Sciences, University of Georgia, Athens, GA 30602, USA
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA 30602, USA
| | - Julia Brose
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA 30602, USA
| | - C Robin Buell
- Department of Crop & Soil Sciences, University of Georgia, Athens, GA 30602, USA
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA 30602, USA
- Institute of Plant Breeding, Genetics & Genomics, University of Georgia, Athens, GA 30602, USA
- The Plant Center, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
34
|
Crowley LM, University of Oxford and Wytham Woods Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of a chironomid fly, Chironomus tentans Fabricius, 1805. Wellcome Open Res 2025; 10:133. [PMID: 40302898 PMCID: PMC12038343 DOI: 10.12688/wellcomeopenres.23907.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/06/2025] [Indexed: 05/02/2025] Open
Abstract
We present a genome assembly from a specimen of Chironomus tentans (chironomid fly; Arthropoda; Insecta; Diptera; Chironomidae). The genome sequence has a total length of 185.47 megabases. Most of the assembly (99.7%) is scaffolded into 4 chromosomal pseudomolecules. The mitochondrial genome has also been assembled, with a length of 16.09 kilobases. Gene annotation of this assembly on Ensembl identified 12,575 protein-coding genes.
Collapse
|
35
|
Fletcher C, Alexander D, Reed B, Natural History Museum Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the European smelt, Osmerus eperlanus (Linnaeus, 1758). Wellcome Open Res 2025; 10:118. [PMID: 40183113 PMCID: PMC11966098 DOI: 10.12688/wellcomeopenres.23789.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/12/2025] [Indexed: 04/05/2025] Open
Abstract
We present a genome assembly from a specimen of Osmerus eperlanus (European smelt; Chordata; Actinopteri; Osmeriformes; Osmeridae). The genome sequence has a total length of 508.70 megabases. Most of the assembly (95.79%) is scaffolded into 28 chromosomal pseudomolecules. The mitochondrial genome has also been assembled, with a length of 16.61 kilobases.
Collapse
|
36
|
O’Brien MF, Lopez Colom R, Natural History Museum Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the common crane, Grus grus (Linnaeus, 1758). Wellcome Open Res 2025; 10:119. [PMID: 40162188 PMCID: PMC11953615 DOI: 10.12688/wellcomeopenres.23797.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/12/2025] [Indexed: 04/02/2025] Open
Abstract
We present a genome assembly from a male specimen of Grus grus (common crane; Chordata; Aves; Gruiformes; Gruidae). The assembly contains two haplotypes with total lengths of 1,352.26 megabases and 1,291.08 megabases. Most of haplotype 1 (91.85%) is scaffolded into 40 chromosomal pseudomolecules, including the Z sex chromosome. Haplotype 2 was assembled to scaffold level. The mitochondrial genome has also been assembled and is 18.9 kilobases in length.
Collapse
|
37
|
Zhao S, Qiao D, Zhang R, Jing T, An Y. CsMPDB 1.0: An interactive web application for visualizing and exploring the microRNAs and phasiRNAs of tea plant (Camellia sinensis var. sinensis 'Shuchazao'). Int J Biol Macromol 2025; 293:139209. [PMID: 39746417 DOI: 10.1016/j.ijbiomac.2024.139209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Revised: 12/20/2024] [Accepted: 12/24/2024] [Indexed: 01/04/2025]
Abstract
The microRNAs and phasiRNAs of plant are small non-coding RNAs with important functions through regulating gene expression at the post-transcriptional level. However, identifying miRNAs, phasiRNAs and their target genes from numerous sequencing raw data requires multiple software and command-line operations, which are time-consuming and labor-intensive for non-model plants. Therefore, we present CsMPDB (miRNAs and phasiRNAs database of Camellia sinensis), an interactive web application with multiple analysis modules developed to visualize and explore miRNA and phasiRNA in tea plants based on 259 sRNA-seq samples and 24 degradome-seq samples in NCBI. The source code for the CsMPDB was written in R/shiny. It is compatible, extendable, and portable to be easily set up on different operating systems, and can be accessed at http://myshiny.cpolar.io/CsMPDB. This application plays an important role in accelerating the functional study of sRNAs in the transcriptional regulation of tea plants, and has important reference value for the development of sRNA databases of other species.
Collapse
Affiliation(s)
- Shiqi Zhao
- School of Fishery, Zhejiang Ocean University, Zhoushan 316022, China
| | - Dahe Qiao
- Tea Research Institute, Guizhou Academy of Agricultural Sciences, Guiyang 550025, Guizhou, China
| | - Runqi Zhang
- School of Fishery, Zhejiang Ocean University, Zhoushan 316022, China
| | - Tingting Jing
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China.
| | - Yanlin An
- Department of Food Science and Engineering, Moutai Institute, Renhuai 564507, China.
| |
Collapse
|
38
|
Righi E, Guigó R. The BioGenome Portal: a web-based platform for biodiversity genomics data management. NAR Genom Bioinform 2025; 7:lqaf020. [PMID: 40124712 PMCID: PMC11928930 DOI: 10.1093/nargab/lqaf020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 02/10/2025] [Accepted: 02/24/2025] [Indexed: 03/25/2025] Open
Abstract
Biodiversity genomics projects are underway with the aim of sequencing the genomes of all eukaryotic species on Earth. Here we describe the BioGenome Portal, a web-based application to facilitate organization and access to the data produced by biodiversity genomics projects. The portal integrates user-generated data with data deposited in public repositories. The portal generates sequence status reports that can be eventually ingested by designated metadata tracking systems, facilitating the coordination task of these systems. The portal is open-source and fully customizable. It can be deployed at any site with minimum effort, contributing to the democratization of biodiversity genomics projects. We illustrate the features of the BioGenome Portal through a number of specific instances. One such instance is being used as the reference portal for the Catalan Initiative for the Earth Biogenome Project, a regional project aiming to sequencing the genomes of the species of the Catalan linguistic area.
Collapse
Affiliation(s)
- Emilio Righi
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, C/ Dr. Aiguader 88, Barcelona 08003 Catalonia, Spain
| | - Roderic Guigó
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, C/ Dr. Aiguader 88, Barcelona 08003 Catalonia, Spain
- Departament of Medicine and Life Sciences, Universitat Pompeu Fabra (UPF), C/ Dr. Aiguader 80, Barcelona 08003, Catalonia, Spain
| |
Collapse
|
39
|
Ray A, Yang C, Stelloh C, Tutaj M, Liu P, Liu Y, Qiu Q, Auer PL, Lin CW, Widlansky ME, Geurts AM, Cowley AW, Liang M, Kwitek AE, Greene AS, Rao S. Chromatin State Maps of Blood Pressure-Relevant Renal Segments Reveal Potential Regulatory Role for SNPs. Hypertension 2025; 82:476-488. [PMID: 39723540 DOI: 10.1161/hypertensionaha.124.23873] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2024] [Accepted: 12/09/2024] [Indexed: 12/28/2024]
Abstract
BACKGROUND Hypertension or elevated blood pressure (BP) is a worldwide clinical challenge and the leading primary risk factor for kidney dysfunctions, heart failure, and cerebrovascular disease. The kidney is a central regulator of BP by maintaining sodium-water balance. Multiple genome-wide association studies revealed that BP is a heritable quantitative trait, modulated by several genetic, epigenetic, and environmental factors. The SNPs identified in genome-wide association studies predominantly (>95%) reside within noncoding genomic regions, making it difficult to understand how they regulate BP. Given the central role of the kidney in regulating BP, we hypothesized that chromatin-accessible regions in renal tissue would be enriched for BP-associated single nucleotide polymorphisms. METHODS We manually dissected 2 important kidney segments that maintain the sodium-water balance: proximal tubules and medullary thick ascending limbs from the human and rat kidneys. To delineate their chromatin and transcriptomic profiles, we performed the assay for transposase-accessible chromatin and RNA sequencing, respectively. RESULTS The chromatin accessibility maps revealed the shared and unique cis-regulatory elements that modulate the chromatin accessibility in proximal tubule and medullary thick ascending limbs of humans and rats. We developed a visualization tool to compare the cross-species epigenomic maps to identify potential regulatory targets for hypertension pathogenesis. We also identified a significant enrichment of BP-associated single nucleotide polymorphisms (1064 for human proximal tubule and 1172 for human medullary thick ascending limbs) within accessible chromatin regions of both segments, including rs1173771 and rs1421811 at the NPR3 locus and rs1800470 at the TGFb1 locus. CONCLUSIONS Collectively, this study lays a foundation for interrogating how intergenic single nucleotide polymorphisms may regulate polygenic traits such as BP.
Collapse
Affiliation(s)
- Atrayee Ray
- Versiti Blood Research Institute, Milwaukee, WI (A.R., C.S., S.R.)
| | - Chun Yang
- Department of Physiology (C.Y., M.T., A.M.G., A.W.C., A.E.K.), Medical College of Wisconsin, Milwaukee
| | - Cary Stelloh
- Versiti Blood Research Institute, Milwaukee, WI (A.R., C.S., S.R.)
| | - Monika Tutaj
- Department of Physiology (C.Y., M.T., A.M.G., A.W.C., A.E.K.), Medical College of Wisconsin, Milwaukee
| | - Pengyuan Liu
- Department of Physiology, University of Arizona, Tucson (P.L., Y.L., Q.Q., M.L.)
| | - Yong Liu
- Department of Physiology, University of Arizona, Tucson (P.L., Y.L., Q.Q., M.L.)
| | - Qiongzi Qiu
- Department of Physiology, University of Arizona, Tucson (P.L., Y.L., Q.Q., M.L.)
| | - Paul L Auer
- The Institute for Health and Equity (P.L.A.), Medical College of Wisconsin, Milwaukee
| | - Chien-Wei Lin
- Division of Biostatistics, Data Science Institute (C.-W.L.), Medical College of Wisconsin, Milwaukee
| | | | - Aron M Geurts
- Department of Physiology (C.Y., M.T., A.M.G., A.W.C., A.E.K.), Medical College of Wisconsin, Milwaukee
| | - Allen W Cowley
- Department of Physiology (C.Y., M.T., A.M.G., A.W.C., A.E.K.), Medical College of Wisconsin, Milwaukee
| | - Mingyu Liang
- Department of Physiology, University of Arizona, Tucson (P.L., Y.L., Q.Q., M.L.)
| | - Anne E Kwitek
- Department of Physiology (C.Y., M.T., A.M.G., A.W.C., A.E.K.), Medical College of Wisconsin, Milwaukee
| | | | - Sridhar Rao
- Versiti Blood Research Institute, Milwaukee, WI (A.R., C.S., S.R.)
- Department of Pediatrics, Section of Hematology/Oncology/Transplantation (S.R.), Medical College of Wisconsin, Milwaukee
- Department of Cell Biology, Neurobiology, and Anatomy (S.R.), Medical College of Wisconsin, Milwaukee
| |
Collapse
|
40
|
Margalit S, Tulpová Z, Detinis Zur T, Michaeli Y, Deek J, Nifker G, Haldar R, Gnatek Y, Omer D, Dekel B, Baris Feldman H, Grunwald A, Ebenstein Y. Long-read structural and epigenetic profiling of a kidney tumor-matched sample with nanopore sequencing and optical genome mapping. NAR Genom Bioinform 2025; 7:lqae190. [PMID: 39781516 PMCID: PMC11704781 DOI: 10.1093/nargab/lqae190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2024] [Revised: 12/12/2024] [Accepted: 12/31/2024] [Indexed: 01/12/2025] Open
Abstract
Carcinogenesis often involves significant alterations in the cancer genome, marked by large structural variants (SVs) and copy number variations (CNVs) that are difficult to capture with short-read sequencing. Traditionally, cytogenetic techniques are applied to detect such aberrations, but they are limited in resolution and do not cover features smaller than several hundred kilobases. Optical genome mapping (OGM) and nanopore sequencing [Oxford Nanopore Technologies (ONT)] bridge this resolution gap and offer enhanced performance for cytogenetic applications. Additionally, both methods can capture epigenetic information as they profile native, individual DNA molecules. We compared the effectiveness of the two methods in characterizing the structural, copy number and epigenetic landscape of a clear cell renal cell carcinoma tumor. Both methods provided comparable results for basic karyotyping and CNVs, but differed in their ability to detect SVs of different sizes and types. ONT outperformed OGM in detecting small SVs, while OGM excelled in detecting larger SVs, including translocations. Differences were also observed among various ONT SV callers. Additionally, both methods provided insights into the tumor's methylome and hydroxymethylome. While ONT was superior in methylation calling, hydroxymethylation reports can be further optimized. Our findings underscore the importance of carefully selecting the most appropriate platform based on specific research questions.
Collapse
Affiliation(s)
- Sapir Margalit
- School of Chemistry, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
- Department of Biomedical Engineering, Tel Aviv University, 6997801 Tel Aviv, Israel
| | - Zuzana Tulpová
- School of Chemistry, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
- Department of Biomedical Engineering, Tel Aviv University, 6997801 Tel Aviv, Israel
- Institute of Experimental Botany of the Czech Academy of Sciences, Olomouc, Czech Republic
| | - Tahir Detinis Zur
- School of Chemistry, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
- Department of Biomedical Engineering, Tel Aviv University, 6997801 Tel Aviv, Israel
| | - Yael Michaeli
- School of Chemistry, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
- Department of Biomedical Engineering, Tel Aviv University, 6997801 Tel Aviv, Israel
| | - Jasline Deek
- School of Chemistry, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
- Department of Biomedical Engineering, Tel Aviv University, 6997801 Tel Aviv, Israel
| | - Gil Nifker
- School of Chemistry, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
- Department of Biomedical Engineering, Tel Aviv University, 6997801 Tel Aviv, Israel
| | - Rita Haldar
- School of Chemistry, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
- Department of Biomedical Engineering, Tel Aviv University, 6997801 Tel Aviv, Israel
| | - Yehudit Gnatek
- Pediatric Stem Cell Research Institute, Edmond and Lily Safra Children’s Hospital, Sheba Medical Center, 52621 Ramat Gan, Israel
| | - Dorit Omer
- Pediatric Stem Cell Research Institute, Edmond and Lily Safra Children’s Hospital, Sheba Medical Center, 52621 Ramat Gan, Israel
| | - Benjamin Dekel
- Pediatric Stem Cell Research Institute, Edmond and Lily Safra Children’s Hospital, Sheba Medical Center, 52621 Ramat Gan, Israel
- Pediatric Nephrology Unit, The Edmond and Lily Safra Children’s Hospital, Sheba Medical Center, 52621 Ramat Gan, Israel
- School of Medicine, Faculty of Medical and Health Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
| | - Hagit Baris Feldman
- School of Medicine, Faculty of Medical and Health Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
- The Genetics Institute and Genomics Center, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
| | - Assaf Grunwald
- School of Chemistry, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
- Department of Biomedical Engineering, Tel Aviv University, 6997801 Tel Aviv, Israel
| | - Yuval Ebenstein
- School of Chemistry, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
- Department of Biomedical Engineering, Tel Aviv University, 6997801 Tel Aviv, Israel
| |
Collapse
|
41
|
Falk S, Crowley LM, McCulloch J, Akinmusola RY. The genome sequence of a long-legged fly, Scellus notatus (Fabricius, 1781). Wellcome Open Res 2025; 10:117. [PMID: 40104438 PMCID: PMC11914874 DOI: 10.12688/wellcomeopenres.23764.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/10/2025] [Indexed: 03/20/2025] Open
Abstract
We present a genome assembly from a female Scellus notatus (long-legged fly; Arthropoda; Insecta; Diptera; Dolichopodidae). The genome sequence has a total length of 446.83 megabases. Most of the assembly (99.72%) is scaffolded into 5 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 17.7 kilobases in length.
Collapse
Affiliation(s)
- Steven Falk
- Independent researcher, Kenilworth, England, UK
| | - Liam M Crowley
- Department of Biology, University of Oxford, Oxford, England, UK
| | - James McCulloch
- Department of Biology, University of Oxford, Oxford, England, UK
- Wellcome Sanger Institute, Hinxton, England, UK
| | | |
Collapse
|
42
|
McCulloch J, Crowley LM, University of Oxford and Wytham Woods Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of a mirid bug, Stenodema calcarata (Fallen, 1807). Wellcome Open Res 2025; 10:115. [PMID: 40162190 PMCID: PMC11950723 DOI: 10.12688/wellcomeopenres.23763.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/10/2025] [Indexed: 04/02/2025] Open
Abstract
We present a genome assembly from a specimen of Stenodema calcarata (mirid bug; Arthropoda; Insecta; Hemiptera; Miridae). The genome sequence has a total length of 575.35 megabases. Most of the assembly (98.42%) is scaffolded into 18 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 19.31 kilobases in length.
Collapse
Affiliation(s)
- James McCulloch
- University of Oxford, Oxford, England, UK
- Wellcome Sanger Institute, Hinxton, England, UK
| | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Broad GR, Holt S, Sivess L, Boyes D, Natural History Museum Genome Acquisition Lab, University of Oxford and Wytham Woods Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the Scorched Wing moth, Plagodis dolabraria (Linnaeus, 1767). Wellcome Open Res 2025; 10:111. [PMID: 40125538 PMCID: PMC11926530 DOI: 10.12688/wellcomeopenres.23761.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/10/2025] [Indexed: 03/25/2025] Open
Abstract
We present a genome assembly from a male specimen of Plagodis dolabraria (Scorched Wing; Arthropoda; Insecta; Lepidoptera; Geometridae). The genome sequence has a total length of 939.07 megabases. Most of the assembly (99.59%) is scaffolded into 31 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 16.97 kilobases in length.
Collapse
Affiliation(s)
| | - Steph Holt
- Natural History Museum, London, England, UK
| | | | - Douglas Boyes
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Hesson JC, Haba Y, McBride CS, Sheerin E, Mathers TC, Paulini M, Pointon DLB, Torrance JW, Sadasivan Baby C, Wood JMD, McCarthy SA, Lawniczak MKN, Makunin A. A chromosomal reference genome sequence for the northern house mosquito, Culex pipiens form pipiens, Linnaeus, 1758. Wellcome Open Res 2025; 10:107. [PMID: 40115327 PMCID: PMC11923538 DOI: 10.12688/wellcomeopenres.23767.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/11/2025] [Indexed: 03/23/2025] Open
Abstract
We present a genome assembly from an individual female Culex pipiens sensu stricto (the northern house mosquito; Arthropoda; Insecta; Diptera; Culicidae), from a wild population in Sweden. The genome sequence is 533 megabases in span. Most of the assembly is scaffolded into three chromosomal pseudomolecules. The complete mitochondrial genome was also assembled and is 15.6 kilobases in length.
Collapse
Affiliation(s)
- Jenny C Hesson
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
- Biologisk Myggkontroll, Nedre Dalälvens Utvecklings AB, Gysinge, Sweden
| | - Yuki Haba
- Princeton University Department of Ecology and Evolutionary Biology, Princeton, New Jersey, USA
- Zuckerman Mind Brain Behavior Institute, Columbia University, New York, New York, USA
| | - Carolyn S McBride
- Princeton University Department of Ecology and Evolutionary Biology, Princeton, New Jersey, USA
| | - Edel Sheerin
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | | | - Michael Paulini
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| | | | | | | | | | - Shane A McCarthy
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
- University of Cambridge Department of Genetics, Cambridge, England, UK
| | | | - Alex Makunin
- Tree of Life, Wellcome Sanger Institute, Hinxton, England, UK
| |
Collapse
|
45
|
Fletcher C, Lees DC, Natural History Museum Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the Dotted Footman moth, Pelosia muscerda (Hufnagel, 1767). Wellcome Open Res 2025; 10:106. [PMID: 40078957 PMCID: PMC11897693 DOI: 10.12688/wellcomeopenres.23766.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/10/2025] [Indexed: 03/14/2025] Open
Abstract
We present a genome assembly from a male Pelosia muscerda (Dotted Footman; Arthropoda; Insecta; Lepidoptera; Erebidae). The genome sequence has a total length of 500.49 megabases. Most of the assembly (99.98%) is scaffolded into 30 chromosomal pseudomolecules, including the Z sex chromosome. The mitochondrial genome has also been assembled and is 15.42 kilobases in length.
Collapse
|
46
|
Willink B, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective. The genome sequence of the Tropical Bluetail Damselfly, Ischnura senegalensis (Rambur, 1842). Wellcome Open Res 2025; 10:104. [PMID: 40290353 PMCID: PMC12033981 DOI: 10.12688/wellcomeopenres.23747.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/07/2025] [Indexed: 04/30/2025] Open
Abstract
We present a genome assembly from a specimen of Ischnura senegalensis (Tropical Bluetail; Arthropoda; Insecta; Odonata; Coenagrionidae). The assembly contains two haplotypes with total lengths of 1,599.82 megabases and 1,602.78 megabases. Most of haplotype 1 (96.41%) is scaffolded into 14 chromosomal pseudomolecules, including the X sex chromosome, which haplotype 2 is a scaffold-level assembly. The mitochondrial genome has also been assembled and is 18.11 kilobases in length.
Collapse
Affiliation(s)
- Beatriz Willink
- Department of Biological Science, National University of Singapore, Singapore 117558, Singapore
- Department of Entomology, Cornell University, Ithaca, New York, USA
| | | | | | | | - Tree of Life Core Informatics collective
- Department of Biological Science, National University of Singapore, Singapore 117558, Singapore
- Department of Entomology, Cornell University, Ithaca, New York, USA
| |
Collapse
|
47
|
Sivell D, Telnov D, Geiser MF, Barclay MVL, Natural History Museum Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the click beetle, Ampedus sanguinolentus sanguinolentus (Schrank, 1776). Wellcome Open Res 2025; 10:96. [PMID: 40134828 PMCID: PMC11933788 DOI: 10.12688/wellcomeopenres.23712.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/05/2025] [Indexed: 03/27/2025] Open
Abstract
We present a genome assembly from a female specimen of Ampedus sanguinolentus sanguinolentus (click beetle; Arthropoda; Insecta; Coleoptera; Elateridae). The assembly contains two haplotypes with total lengths of 1,574.76 megabases and 1,572.87 megabases. Most of haplotype 1 (97.13%) is scaffolded into 10 chromosomal pseudomolecules, while haplotype 2 is a scaffold-level assembly. The mitochondrial genome has also been assembled and is 15.99 kilobases in length.
Collapse
Affiliation(s)
| | - Dmitry Telnov
- Natural History Museum, London, England, UK
- Daugavpils University, Daugavpils, Latvia
- Institute of Biology, University of Latvia, Rīga, Latvia
| | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Adkins P, Bishop J, Harley J, Holland PWH. The genome sequence of the amphioxus, Branchiostoma lanceolatum (Pallas, 1774). Wellcome Open Res 2025; 10:95. [PMID: 40093594 PMCID: PMC11907188 DOI: 10.12688/wellcomeopenres.23671.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/31/2025] [Indexed: 03/19/2025] Open
Abstract
We present a genome assembly from a specimen of Branchiostoma lanceolatum (Amphioxus; Chordata; Leptocardii; Amphioxiformes; Branchiostomatidae). The assembly contains two haplotypes with total lengths of 468.40 megabases and 465.81 megabases, respectively. Most of haplotype 1 (99.34%) is scaffolded into 19 chromosomal pseudomolecules. Haplotype 2 is a scaffold level assembly. The mitochondrial genome has also been assembled and is 15.14 kilobases in length.
Collapse
Affiliation(s)
- Patrick Adkins
- The Marine Biological Association, Plymouth, England, UK
| | - John Bishop
- The Marine Biological Association, Plymouth, England, UK
| | - Joanna Harley
- The Marine Biological Association, Plymouth, England, UK
| | | |
Collapse
|
49
|
Boyes D, Hutchinson F, Crowley LM, University of Oxford and Wytham Woods Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the Large Birch Bell moth, Large Birch Roller, Epinotia brunnichana (Linnaeus, 1767). Wellcome Open Res 2025; 10:101. [PMID: 40171521 PMCID: PMC11959263 DOI: 10.12688/wellcomeopenres.23749.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/07/2025] [Indexed: 04/03/2025] Open
Abstract
We present a genome assembly from a female Epinotia brunnichana (Large Birch Bell, Large Birch Roller; Arthropoda; Insecta; Lepidoptera; Tortricidae). The genome sequence has a total length of 943.10 megabases. Most of the assembly (99.68%) is scaffolded into 29 chromosomal pseudomolecules, including the W and Z sex chromosomes. The mitochondrial genome has also been assembled and is 15.7 kilobases in length. Gene annotation of this assembly on Ensembl identified 12,003 protein-coding genes.
Collapse
Affiliation(s)
- Douglas Boyes
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
| | | | | | | | | | - Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team
- UK Centre for Ecology & Hydrology, Wallingford, England, UK
- University of Essex, Penryn, England, UK
- University of Oxford, Oxford, England, UK
| | | | | | | | | |
Collapse
|
50
|
Falk S, Monks J, University of Oxford and Wytham Woods Genome Acquisition Lab, Darwin Tree of Life Barcoding collective, Wellcome Sanger Institute Tree of Life Management, Samples and Laboratory team, Wellcome Sanger Institute Scientific Operations: Sequencing Operations, Wellcome Sanger Institute Tree of Life Core Informatics team, Tree of Life Core Informatics collective, Darwin Tree of Life Consortium. The genome sequence of the Coppice Mining Bee, Andrena helvola (Linnaeus, 1758). Wellcome Open Res 2025; 10:102. [PMID: 40078958 PMCID: PMC11897692 DOI: 10.12688/wellcomeopenres.23746.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/07/2025] [Indexed: 03/14/2025] Open
Abstract
We present a genome assembly from a female Andrena helvola (Coppice Mining Bee; Arthropoda; Insecta; Hymenoptera; Andrenidae). The genome sequence has a total length of 442.47 megabases. Most of the assembly (91.93%) is scaffolded into 7 chromosomal pseudomolecules. The mitochondrial genome has also been assembled and is 16.61 kilobases in length.
Collapse
Affiliation(s)
- Steven Falk
- Independent researcher, Kenilworth, Warwickshire, England, UK
| | | | | | | | | | | | | | | | | |
Collapse
|