1
Kalendar R, Kairov U. Genome-Wide Tool for Sensitive de novo Identification and Visualisation of Interspersed and Tandem Repeats. Bioinform Biol Insights 2024; 18:11779322241306391. [PMID: 39703748 PMCID: PMC11656428 DOI: 10.1177/11779322241306391] [Received: 07/02/2024] [Accepted: 11/25/2024] [Indexed: 12/21/2024] Open
Abstract
Genomic repeats are functionally ubiquitous structural units found in all genomes. Studying these repeats of different origins is essential for understanding the evolution and adaptation of a given organism. These repeating patterns have manifold signatures and structures with varying degrees of homology, making their identification challenging. To address this challenge, we developed a new algorithm and software that can rapidly and accurately detect any repeated sequences de novo with varying degrees of homology in genomic sequences in interspersed or clustered repeats. Numerous forms of repeated sequences and complex patterns can be identified, even for complex sequence variants and implicit or mixed types of repeat blocks. Direct and inverted-repeat elements, perfect and imperfect microsatellite repeats, and any short or long tandem repeat belonging to a wide range of higher-order repeat structures of telomeres or large satellite sequences can be detected. By combining precision and versatility, our tool contributes significantly to elucidating the intricate landscape of genomic repeats.
Affiliation(s)
- Ruslan Kalendar
  - Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
  - Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Astana, Kazakhstan
- Ulykbek Kairov
  - Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Astana, Kazakhstan
2
Parisot N, Ribeiro Lopes M, Peignier S, Baa-Puyoulet P, Charles H, Calevro F, Callaerts P. Annotation of transcription factors, chromatin-associated factors, and basal transcription machinery in the pea aphid, Acyrthosiphon pisum, and development of the ATFdb database, a resource for studies of transcriptional regulation. Insect Biochemistry and Molecular Biology 2024; 177:104217. [PMID: 39579797 DOI: 10.1016/j.ibmb.2024.104217] [Received: 03/29/2024] [Revised: 10/15/2024] [Accepted: 11/19/2024] [Indexed: 11/25/2024]
Abstract
The pea aphid, Acyrthosiphon pisum, is an emerging model system in functional and comparative genomics, in part due to the availability of new genomic approaches and the different sequencing and annotation efforts that the community has dedicated to this important crop pest insect. The pea aphid is also used as a model to study fascinating biological traits of aphids, such as their extensive polyphenisms, their bacteriocyte-confined nutritional symbiosis, or their adaptation to the highly unbalanced diet represented by phloem sap. To get insights into the molecular basis of all these processes, it is important to have an appropriate annotation of transcription factors (TFs), which would enable the reconstruction/inference of gene regulatory networks in aphids. Using the latest version of the A. pisum genome assembly and annotation, which represents the first chromosome-level pea aphid genome, we annotated the complete repertoire of A. pisum TFs and complemented this information by annotating genes encoding chromatin-associated and basal transcription machinery proteins. These annotations were done combining information from the model Drosophila melanogaster, for which we also provide a revisited list of these proteins, and de novo prediction. The comparison between the two model systems allowed the identification of major losses or expansions in each genome, while a deeper analysis was made of ZNF TFs (with certain families expanded in the pea aphid), and the Hox gene cluster (showing reorganization in gene position in the pea aphid compared to D. melanogaster). All annotations are available to the community through the Aphid Transcription Factors database (ATFdb), consolidating the various annotations we generated. ATFdb serves as a valuable resource for gene regulation studies in aphids.
Affiliation(s)
- Nicolas Parisot
  - INSA Lyon, INRAE, BF2I, UMR0203, F-69621, Villeurbanne, France
- Sergio Peignier
  - INSA Lyon, INRAE, BF2I, UMR0203, F-69621, Villeurbanne, France
- Hubert Charles
  - INSA Lyon, INRAE, BF2I, UMR0203, F-69621, Villeurbanne, France
- Patrick Callaerts
  - KU Leuven, University of Leuven, Department of Human Genetics, Laboratory of Behavioral and Developmental Genetics, B-3000, Leuven, Belgium
3
Wattad H, Molcho J, Manor R, Weil S, Aflalo ED, Chalifa-Caspi V, Sagi A. Roadmap and Considerations for Genome Editing in a Non-Model Organism: Genetic Variations and Off-Target Profiling. Int J Mol Sci 2024; 25:12530. [PMID: 39684244 DOI: 10.3390/ijms252312530] [Received: 09/29/2024] [Revised: 11/14/2024] [Accepted: 11/18/2024] [Indexed: 12/18/2024] Open
Abstract
The CRISPR/Cas genome editing approach in non-model organisms poses challenges that remain to be resolved. Here, we demonstrated a generalized roadmap for a de novo genome annotation approach applied to the non-model organism Macrobrachium rosenbergii. We also addressed the typical genome editing challenges arising from genetic variations, such as a high frequency of single nucleotide polymorphisms, differences in sex chromosomes, and repetitive sequences that can lead to off-target events. For the genome editing of M. rosenbergii, our laboratory recently adapted the CRISPR/Cas genome editing approach to embryos and the embryonic primary cell culture. In this continuation study, an annotation pipeline was trained to predict the gene models by leveraging the available genomic, transcriptomic, and proteomic data, and enabling accurate gene prediction and guide design for knock-outs. A next-generation sequencing analysis demonstrated a high frequency of genetic variations in genes on both autosomal and sex chromosomes, which have been shown to affect the accuracy of editing analyses. To enable future applications based on the CRISPR/Cas tool in non-model organisms, we also verified the reliability of editing efficiency and tracked off-target frequencies. Despite the lack of comprehensive information on non-model organisms, this study provides an example of the feasibility of selecting and editing specific genes with a high degree of certainty.
Affiliation(s)
- Hanin Wattad
  - Department of Life Sciences, Ben-Gurion University of the Negev, P.O. Box 653, Beer-Sheva 8410501, Israel
- Jonathan Molcho
  - Department of Life Sciences, Ben-Gurion University of the Negev, P.O. Box 653, Beer-Sheva 8410501, Israel
- Rivka Manor
  - Department of Life Sciences, Ben-Gurion University of the Negev, P.O. Box 653, Beer-Sheva 8410501, Israel
  - The National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, P.O. Box 653, Beer-Sheva 8410501, Israel
- Simy Weil
  - Department of Life Sciences, Ben-Gurion University of the Negev, P.O. Box 653, Beer-Sheva 8410501, Israel
- Eliahu D Aflalo
  - Department of Life Sciences, Ben-Gurion University of the Negev, P.O. Box 653, Beer-Sheva 8410501, Israel
  - Department of Life Sciences, Achva Academic College, Arugot 7980400, Israel
- Vered Chalifa-Caspi
  - Bioinformatics Core Facility, Ilse Katz Institute for Nanoscale Science & Technology, Ben-Gurion University of the Negev, Beer-Sheva 8410501, Israel
- Amir Sagi
  - Department of Life Sciences, Ben-Gurion University of the Negev, P.O. Box 653, Beer-Sheva 8410501, Israel
  - The National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, P.O. Box 653, Beer-Sheva 8410501, Israel
4
Yuan H, Liu XJ, Liu XZ, Zhao LN, Mao SL, Huang Y. The evolutionary dynamics of genome sizes and repetitive elements in Ensifera (Insecta: Orthoptera). BMC Genomics 2024; 25:1041. [PMID: 39501135 PMCID: PMC11539627 DOI: 10.1186/s12864-024-10949-0] [Received: 06/07/2024] [Accepted: 10/24/2024] [Indexed: 11/08/2024] Open
Abstract
BACKGROUND In evolutionary biology, identifying and quantifying inter-lineage genome size variation and elucidating the underlying causes of that variation have long been goals. Repetitive elements (REs) have been proposed and confirmed as being among the most important contributors to genome size variation. However, the evolutionary implications of genome size variation and RE dynamics are not well understood. RESULTS A total of 35 Ensifera insects were collected from different areas in China, including nine species of crickets and 26 species of katydids. The genome sizes of seven species were then determined using flow cytometry. The RepeatExplorer2 pipeline was employed to retrieve the repeated sequences for each species, based on low-coverage (0.1 X) high-throughput Illumina unassembled short reads. The genome sizes of the 35 Ensifera insects exhibited a considerable degree of variation, ranging from 1.00 to 18.34 pg. This variation was more than 18-fold. Similarly, the RE abundances exhibited considerable variation, ranging from 13.66 to 61.16%. In addition, the Tettigonioidea had larger genomes and contained significantly more REs than did the Grylloidea genomes. Analysis of the correlation between RE abundance and the genome size of 35 Ensifera insects revealed that the abundance of REs, transposable elements (TEs), long terminal repeats (LTRs), and long interspersed nuclear elements (LINEs) are significantly correlated with genome size. Notably, there is an inflection point in this correlation, where species with increasingly large genomes (e.g., > 5-10 pg) have repeats that contribute less to genome expansion than expected. Furthermore, this study revealed contrasting evolutionary directions between the Tettigonioidea and Grylloidea clades in terms of the expansion of REs. Tettigonioidea species exhibit a gradual increase in ancestral genome size and RE abundance as they diverge, while Grylloidea species experience sustained genome contraction. 
CONCLUSIONS This study reveals extensive variation in genome size and RE abundance in Ensifera insects, with distinct evolutionary patterns across two major groups, Tettigonioidea and Grylloidea. This provides valuable insights into the variation in genome size and RE abundance in Ensifera insects, offering a comprehensive understanding of their evolutionary history.
Affiliation(s)
- Hao Yuan
  - School of Basic Medical Sciences, Xi'an Medical University, Xi'an, China
- Xiao-Jing Liu
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Xuan-Zeng Liu
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Li-Na Zhao
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Shao-Li Mao
  - Xi'an Botanical Garden of Shaanxi Province, Institute of Botany of Shaanxi Province, Shaanxi Engineering Research Centre for Conservation and Utilization of Botanical Resources, Xi'an, China
- Yuan Huang
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
5
Martelossi J, Iannello M, Ghiselli F, Luchetti A. Widespread HCD-tRNA derived SINEs in bivalves rely on multiple LINE partners and accumulate in genic regions. Mob DNA 2024; 15:22. [PMID: 39415259 PMCID: PMC11481361 DOI: 10.1186/s13100-024-00332-x] [Received: 02/14/2024] [Accepted: 10/03/2024] [Indexed: 10/18/2024] Open
Abstract
BACKGROUND Short interspersed nuclear elements (SINEs) are non-autonomous non-LTR retrotransposons widespread across eukaryotes. They exist both as lineage-specific, fast-evolving elements and as ubiquitous superfamilies characterized by highly conserved domains (HCD). Several of these superfamilies have been described in bivalves; however, their overall distribution and impact on host genome evolution are still unknown due to the extreme scarcity of transposon libraries for the clade. In this study, we examined more than 40 bivalve genomes to uncover the distribution of HCD-tRNA-related SINEs, discover novel SINE-LINE partnerships, and understand their possible role in shaping bivalve genome evolution. RESULTS We found that bivalve HCD SINEs have an ancient origin, and they can rely on at least four different LINE clades. According to a "mosaic" evolutionary scenario, multiple LINE partners can promote the amplification of the same HCD SINE superfamilies, while homologous LINE-derived tails are present between different superfamilies. Multiple SINEs were found to be highly similar between phylogenetically related species but separated by extremely long evolutionary timescales, up to ~400 million years. Studying their genomic distribution in a subset of five species, we observed different patterns of SINE enrichment in various genomic compartments as well as differences in the tendency of SINEs to form tandem-like and palindromic structures, also within intronic sequences. Despite these differences, we observed that SINEs, especially older ones, tend to accumulate preferentially within genes, or in their close proximity, consistent with a model of survival bias for less harmful, short non-coding transposons in euchromatic genomic regions.
CONCLUSION Here we conducted a wide characterization of tRNA-related SINEs in bivalves, revealing their taxonomic distribution and LINE partnerships across the clade. Moreover, through the study of their genomic distribution in five species, we highlighted commonalities and differences with other previously studied eukaryotes, thus extending our understanding of SINE evolution across the tree of life.
Affiliation(s)
- Jacopo Martelossi
  - Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
- Mariangela Iannello
  - Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
- Fabrizio Ghiselli
  - Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
- Andrea Luchetti
  - Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
6
Weng YM, Lopez-Cacacho I, Foquet B, Martinez JI, Plotkin D, Sourakov A, Frandsen PB, Kawahara AY. A near chromosome-level genome assembly of a ghost moth (Lepidoptera, Hepialidae). Sci Data 2024; 11:1139. [PMID: 39414832 PMCID: PMC11484951 DOI: 10.1038/s41597-024-03783-2] [Received: 02/26/2024] [Accepted: 08/14/2024] [Indexed: 10/18/2024] Open
Abstract
Ghost moths are an unusual family of primitive moths (Lepidoptera: Hepialidae) known for their large body size and crepuscular adult activity. These moths represent an ancient lineage, frequently have soil dwelling larvae, and are adapted to high elevations, deserts, and other extreme environments. Despite being rather speciose with more than 700 species, there is a dearth of genomic resources for the family. Here, we present the first high quality, publicly available hepialid genome, generated from an Andean species of ghost moth, Druceiella hillmani. Our genome assembly has a length of 2,586 Mbp with contig N50 of 28.1 Mb and N50 of 29, and BUSCO completeness of 97.1%, making it one of the largest genomes in the order Lepidoptera. Our assembly is a vital resource for future research on ghost moth genomics.
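The contiguity figures quoted in this abstract follow the usual assembly conventions: N50 is a length (the contig length at which the sorted contigs first cover half of the assembly), while a bare count such as 29 is conventionally reported as L50 (the number of contigs needed to reach that point). Reading the second figure as L50 is an assumption, not something the abstract states. A minimal sketch of both statistics, using hypothetical toy contig lengths rather than data from the study:

```python
def n50_l50(lengths):
    """Return (N50, L50) for an iterable of contig lengths."""
    ordered = sorted(lengths, reverse=True)  # longest contigs first
    half = sum(ordered) / 2                  # half of the total assembly length
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            # 'length' is the N50; 'count' contigs were needed, which is the L50
            return length, count
    raise ValueError("lengths must be non-empty")


# Hypothetical toy assembly (arbitrary units), not values from the paper
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
n50, l50 = n50_l50(toy_contigs)
print(f"N50 = {n50}, L50 = {l50}")
```

With the toy lengths above, the two longest contigs already cover half of the total, so the N50 is the length of the second contig and the L50 is 2.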
Affiliation(s)
- Yi-Ming Weng
  - McGuire Center for Lepidoptera & Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
  - Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
- Isabel Lopez-Cacacho
  - McGuire Center for Lepidoptera & Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
- Bert Foquet
  - McGuire Center for Lepidoptera & Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
- Jose I Martinez
  - McGuire Center for Lepidoptera & Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
- David Plotkin
  - McGuire Center for Lepidoptera & Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
- Andrei Sourakov
  - McGuire Center for Lepidoptera & Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
- Paul B Frandsen
  - Department of Plant and Wildlife Sciences, Brigham Young University, Provo, Utah, USA
- Akito Y Kawahara
  - McGuire Center for Lepidoptera & Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
7
Cicconardi F, Morris BJ, Martelossi J, Ray DA, Montgomery SH. Novel Sex-Specific Genes and Diverse Interspecific Expression in the Antennal Transcriptomes of Ithomiine Butterflies. Genome Biol Evol 2024; 16:evae218. [PMID: 39373182 PMCID: PMC11500719 DOI: 10.1093/gbe/evae218] [Received: 06/11/2024] [Revised: 09/23/2024] [Accepted: 09/24/2024] [Indexed: 10/08/2024] Open
Abstract
The olfactory sense is crucial for organisms, facilitating environmental recognition and interindividual communication. Ithomiini butterflies exemplify this importance not only because they rely strongly on olfactory cues for both inter- and intra-sexual behaviors, but also because they show convergent evolution of specialized structures within the antennal lobe, called macroglomerular complexes (MGCs). These structures, widely absent in butterflies, are present in moths where they enable heightened sensitivity to, and integration of, information from various types of pheromones. In this study, we investigate chemosensory evolution across six Ithomiini species and identify possible links between expression profiles and neuroanatomy. To enable this, we sequenced four new high-quality genome assemblies and six sex-specific antennal transcriptomes for three of these species with different MGC morphologies. With extensive genomic analyses, we found that the expression of antennal transcriptomes across species exhibits profound divergence, and identified highly expressed ORs, which we hypothesize may be associated with MGCs, as highly expressed ORs are absent in Methona, an Ithomiini lineage which also lacks MGCs. More broadly, we show how antennal sexual dimorphism is prevalent in both chemosensory genes and non-chemosensory genes, with possible relevance for behavior. As an example, we show how lipid-related genes exhibit consistent sexual dimorphism, potentially linked to lipid transport or host selection. In this study, we investigate the antennal chemosensory adaptations, suggesting a link between genetic diversity, ecological specialization, and sensory perception with the convergent evolution of MGCs. Insights into chemosensory gene evolution, expression patterns, and potential functional implications enhance our knowledge of sensory adaptations and sexual dimorphisms in butterflies, laying the foundation for future investigations into the genetic drivers of insect behavior, adaptation, and speciation.
Affiliation(s)
- Francesco Cicconardi
  - School of Biological Sciences, Bristol University, 24 Tyndall Ave, Bristol BS8 1TQ, UK
- Billy J Morris
  - Department of Zoology, University of Cambridge, Downing Street, Cambridge CB2 3EJ, UK
- Jacopo Martelossi
  - Department of Biological Geological and Environmental Science, University of Bologna, Via Selmi 3, 40126 Bologna, Italy
- David A Ray
  - Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA
- Stephen H Montgomery
  - School of Biological Sciences, Bristol University, 24 Tyndall Ave, Bristol BS8 1TQ, UK
8
Ye X, Yang Y, Zhao X, Fang Q, Ye G. The state of parasitoid wasp genomics. Trends Parasitol 2024; 40:914-929. [PMID: 39227194 DOI: 10.1016/j.pt.2024.08.003] [Received: 06/26/2024] [Revised: 08/12/2024] [Accepted: 08/12/2024] [Indexed: 09/05/2024]
Abstract
Parasitoid wasps represent a group of parasitic insects with high species diversity that have played a pivotal role in biological control and evolutionary studies. Over the past 20 years, developments in genomics have greatly enhanced our understanding of the biology of these species. Technological leaps in sequencing have facilitated the improvement of genome quality and quantity, leading to the availability of hundreds of parasitoid wasp genomes. Here, we summarize recent progress in parasitoid wasp genomics, focusing on the evolution of genome size (GS) and the genomic basis of several key traits. We also discuss the contributions of genomics in studying venom evolution and endogenization of viruses. Finally, we advocate for increased sequencing and functional research to better understand parasitoid biology and enhance biological control.
Affiliation(s)
- Xinhai Ye
  - College of Advanced Agriculture Sciences, Zhejiang A&F University, Hangzhou, China
- Yi Yang
  - State Key Laboratory of Rice Biology and Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
- Xianxin Zhao
  - State Key Laboratory of Rice Biology and Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
- Qi Fang
  - State Key Laboratory of Rice Biology and Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
- Gongyin Ye
  - State Key Laboratory of Rice Biology and Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
9
Betancourt AJ, Wei KHC, Huang Y, Lee YCG. Causes and Consequences of Varying Transposable Element Activity: An Evolutionary Perspective. Annu Rev Genomics Hum Genet 2024; 25:1-25. [PMID: 38603565 DOI: 10.1146/annurev-genom-120822-105708] [Indexed: 04/13/2024]
Abstract
Transposable elements (TEs) are genomic parasites found in nearly all eukaryotes, including humans. This evolutionary success of TEs is due to their replicative activity, involving insertion into new genomic locations. TE activity varies at multiple levels, from between taxa to within individuals. The rapidly accumulating evidence of the influence of TE activity on human health, as well as the rapid growth of new tools to study it, motivated an evaluation of what we know about TE activity thus far. Here, we discuss why TE activity varies, and the consequences of this variation, from an evolutionary perspective. By studying TE activity in nonhuman organisms in the context of evolutionary theories, we can shed light on the factors that affect TE activity. While the consequences of TE activity are usually deleterious, some have lasting evolutionary impacts by conferring benefits on the host or affecting other evolutionary processes.
Affiliation(s)
- Andrea J Betancourt
  - Institute of Infection, Veterinary, and Ecological Sciences, University of Liverpool, Liverpool, United Kingdom
- Kevin H-C Wei
  - Department of Zoology, University of British Columbia, Vancouver, British Columbia, Canada
- Yuheng Huang
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, California, USA
- Yuh Chwen G Lee
  - Center for Complex Biological Systems, University of California, Irvine, California, USA
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, California, USA
10
Majid M, Khan H, Liu X, Shaheer M, Huang Y. Evolutionary Dynamics of Satellite DNA Repeats across the Tettigoniidae Family: Insights from Genomic Analysis. Biomolecules 2024; 14:915. [PMID: 39199303 PMCID: PMC11352069 DOI: 10.3390/biom14080915] [Received: 05/17/2024] [Revised: 07/23/2024] [Accepted: 07/26/2024] [Indexed: 09/01/2024] Open
Abstract
Satellite DNA repeats are repetitive DNA sequences found in eukaryotic genomes, typically consisting of short DNA motifs repeated in tandem arrays. Despite the vast body of literature on satellite DNA repeats in other taxa, investigations specifically targeting Tettigoniidae remain conspicuously absent. Our study aims to fill a critical gap in our understanding of the satellitome evolutionary processes shaping Tettigoniidae genomes. Repeatome analysis revealed that repetitive sequences make up 92% of the Meconema thalassinum genome, whereas Phryganogryllacris superangulata had the lowest value, 34%, with an average of 67% in the other Tettigoniidae species. The analysis reveals significant variation in the number of satellite DNA families across species of the Tettigoniidae family, with M. thalassinum exhibiting 246 families, the highest count reported in insects to date, and Pholidoptera griseoptera the lowest count, 10. Ruspolia dubia and Ruspolia yunnana, which are congeneric species, show distinct counts of 104 and 84 families, respectively. Satellite DNA repeats in R. dubia exhibit the highest abundance, constituting 17.2% of the total genome, while the lowest abundance was reported in P. griseoptera, at 5.65%. The genome size correlates weakly with the satellite DNA family count (rs = 0.42, p = 0.29), but a strong correlation exists between satellite abundance and family number (rs = 0.73, p = 0.03). Moreover, the analysis of satellite DNA gain and loss patterns provides insights into the amplification and homogenization of satellite DNA families within the genome, with species-specific repeats exhibiting a positive trend toward amplification. In M. thalassinum, the highest satellite accumulation was observed on Chr12, Chr01, and Chr04, constituting 17.79%, 17.4%, and 17.22% of the total chromosome size, respectively. The chromosome-specific propagation of satellite DNA families was evident, with MthSat01 found solely on chromosome 1 and MthSat170 on chromosome 2, sharing 1.64% and 2.33%, respectively. The observed conservation and variations in satellite DNA number and abundances, along with distinct patterns of gain and loss, indicate the influence of potentially diverse evolutionary processes shaping the genomic landscape of these insects, which requires further investigation. Furthermore, the differential accumulation of satellite DNA on specific chromosomes implies that potential chromosome-specific functions or structural features influence the retention and proliferation of satellite sequences.
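The rs values quoted in this abstract are Spearman rank correlation coefficients, which measure how consistently two quantities rise together by correlating their ranks rather than their raw values. A minimal standard-library sketch of the coefficient follows; the input lists are hypothetical toy numbers, not data from the study, and the helper names are illustrative only. It does not compute p-values.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sum((a - mean_x) ** 2 for a in x) ** 0.5
    sd_y = sum((b - mean_y) ** 2 for b in y) ** 0.5
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman's rs: the Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical toy data: genome sizes vs. satellite family counts
toy_sizes = [1.0, 2.0, 3.0, 4.0, 5.0]
toy_counts = [20, 10, 40, 30, 50]
print(round(spearman(toy_sizes, toy_counts), 2))
```

Because only ranks enter the calculation, the coefficient is unaffected by any monotonic rescaling of either variable, which is one reason it is commonly chosen for small comparative datasets such as a handful of species.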
Affiliation(s)
- Muhammad Majid
  - College of Life Sciences, Shaanxi Normal University, Xi’an 710119, China
- Hashim Khan
  - College of Life Sciences, Shaanxi Normal University, Xi’an 710119, China
- Xuanzeng Liu
  - College of Life Sciences, Shaanxi Normal University, Xi’an 710119, China
- Muhammad Shaheer
  - Department of Entomology, MNS Agriculture University, Multan 66000, Pakistan
- Yuan Huang
  - College of Life Sciences, Shaanxi Normal University, Xi’an 710119, China
11
Schweizer RM, Meidt CG, Benavides LR, Wilson JS, Griswold TL, Sim SB, Geib SM, Branstetter MG. Reference genome for the Mojave poppy bee (Perdita meconis), a specialist pollinator of conservation concern. J Hered 2024; 115:470-479. [PMID: 38088446 PMCID: PMC11235129 DOI: 10.1093/jhered/esad076] [Received: 09/15/2023] [Revised: 12/04/2023] [Accepted: 12/07/2023] [Indexed: 07/11/2024] Open
Abstract
The Mojave poppy bee, Perdita meconis Griswold (Hymenoptera: Anthophila: Andrenidae), is a species of conservation concern that is restricted to the eastern Mojave Desert of North America. It is a specialist pollinator of two poppy genera, Arctomecon and Argemone (Papaveraceae), and is being considered for listing under the US Endangered Species Act along with one of its pollinator hosts, the Las Vegas bearpoppy (Arctomecon californica). Here, we present a near chromosome-level genome of the Mojave poppy bee to provide a genomic resource that will aid conservation efforts and future research. We isolated DNA from a single, small (<7 mm), male specimen collected using non-ideal preservation methods and then performed whole-genome sequencing using PacBio HiFi technology. After quality and contaminant filtering, the final draft genome assembly is 327 Mb, with an N50 length of 17.5 Mb. Annotated repetitive elements compose 37.3% of the genome, although a large proportion (24.87%) of those are unclassified repeats. Additionally, we annotated 18,245 protein-coding genes and 19,433 transcripts. This genome represents one of only a few genomes from the large bee family Andrenidae and one of only a few genomes for pollinator specialists. We highlight both the potential of this genome as a resource for future research, and how high-quality genomes generated from small, non-ideal (in terms of preservation) specimens could facilitate biodiversity genomics.
Collapse
Affiliation(s)
- Rena M Schweizer
- U.S. Department of Agriculture, Agricultural Research Service (USDA-ARS), Pollinating Insects Research Unit, Utah State University, Logan, UT, United States
- Division of Biological Sciences, University of Montana, Missoula, MT, United States
| | - Colleen G Meidt
- U.S. Department of Agriculture, Agricultural Research Service (USDA-ARS), Pollinating Insects Research Unit, Utah State University, Logan, UT, United States
- Department of Biology, Utah State University, Logan, UT, United States
| | - Ligia R Benavides
- U.S. Department of Agriculture, Agricultural Research Service (USDA-ARS), Pollinating Insects Research Unit, Utah State University, Logan, UT, United States
| | - Joseph S Wilson
- Department of Biology, Utah State University-Tooele, Tooele, UT, United States
| | - Terry L Griswold
- U.S. Department of Agriculture, Agricultural Research Service (USDA-ARS), Pollinating Insects Research Unit, Utah State University, Logan, UT, United States
| | - Sheina B Sim
- U.S. Department of Agriculture, Agricultural Research Service, U.S. Pacific Basin Agricultural Research Center, Tropical Pest Genetics and Molecular Biology Research Unit, Hilo, HI, United States
| | - Scott M Geib
- U.S. Department of Agriculture, Agricultural Research Service, U.S. Pacific Basin Agricultural Research Center, Tropical Pest Genetics and Molecular Biology Research Unit, Hilo, HI, United States
| | - Michael G Branstetter
- U.S. Department of Agriculture, Agricultural Research Service (USDA-ARS), Pollinating Insects Research Unit, Utah State University, Logan, UT, United States
|
12
|
Wang Q, Zhang J, Liu C, Ru C, Qian Q, Yang M, Yan S, Liu W, Wang G. Identification of antennal alternative splicing by combining genome and full-length transcriptome analysis in Bactrocera dorsalis. Front Physiol 2024; 15:1384426. [PMID: 38952867 PMCID: PMC11215311 DOI: 10.3389/fphys.2024.1384426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/09/2024] [Accepted: 05/29/2024] [Indexed: 07/03/2024] Open
Abstract
Alternative splicing is an essential post-transcriptional regulatory mechanism that diversifies gene function by generating multiple protein isoforms from a single gene and plays a crucial role in insect environmental adaptation. Olfaction, a key sense for insect adaptation, relies heavily on the antennae, which are the primary olfactory organs expressing most of the olfactory genes. Although advances in high-throughput sequencing have enabled extensive annotation of olfactory genes in insect antennal tissues, systematic analyses of alternative splicing remain relatively scarce. In this study, we focused on the oriental fruit fly (Bactrocera dorsalis), a significant pest of fruit crops. We performed a detailed analysis of alternative splicing in its antennae by utilizing the full-length transcriptome of its antennal tissue and the insect's genome. The results revealed 8,600 non-redundant full-length transcripts in the oriental fruit fly antennal full-length transcriptome, spanning 4,145 gene loci. Over 40% of these loci exhibited multiple isoforms. Among these, 161 genes showed sex-biased isoform switching, involving seven different types of alternative splicing. Notably, events involving alternative transcription start sites (ATSS) and alternative transcription termination sites (ATTS) were the most common. Of all the genes undergoing ATSS and ATTS alternative splicing between males and females, 32 genes were alternatively spliced in protein-coding regions, potentially affecting protein function. These genes were categorized based on the length of the sex-biased isoforms, with the highest difference in isoform fraction (dIF) associated with the ATSS type, including genes such as BdorABCA13, BdorCAT2, and BdorTSN3. Additionally, transcription factor binding sites for doublesex were identified upstream of both BdorABCA13 and BdorCAT2.
Besides being expressed in the antennal tissues, BdorABCA13 and BdorCAT2 are also expressed in the mouthparts, legs, and genitalia of both female and male adults, suggesting their functional diversity. This study reveals alternative splicing events in the antennae of Bactrocera dorsalis from two aspects: odorant receptor genes and other types of genes expressed in the antennae. This study not only provides a research foundation for understanding the regulation of gene function by alternative splicing in the oriental fruit fly but also offers new insights for utilizing olfaction-based behavioral manipulation techniques to manage this pest.
Affiliation(s)
- Qi Wang
- Key Laboratory of Sustainable Forest Ecosystem Management-Ministry of Education, Northeast Forestry University, Harbin, China
| | - Jie Zhang
- Key Laboratory of Sustainable Forest Ecosystem Management-Ministry of Education, Northeast Forestry University, Harbin, China
| | - Chenhao Liu
- Key Laboratory of Sustainable Forest Ecosystem Management-Ministry of Education, Northeast Forestry University, Harbin, China
| | - Chuanjian Ru
- Key Laboratory of Sustainable Forest Ecosystem Management-Ministry of Education, Northeast Forestry University, Harbin, China
| | - Qian Qian
- Key Laboratory of Sustainable Forest Ecosystem Management-Ministry of Education, Northeast Forestry University, Harbin, China
| | - Minghuan Yang
- Key Laboratory of Sustainable Forest Ecosystem Management-Ministry of Education, Northeast Forestry University, Harbin, China
| | - Shanchun Yan
- Key Laboratory of Sustainable Forest Ecosystem Management-Ministry of Education, Northeast Forestry University, Harbin, China
| | - Wei Liu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Guirong Wang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
|
13
|
Gable SM, Bushroe N, Mendez J, Wilson A, Pinto B, Gamble T, Tollis M. Differential Conservation and Loss of CR1 Retrotransposons in Squamates Reveals Lineage-Specific Genome Dynamics across Reptiles. bioRxiv 2024:2024.02.09.579686. [PMID: 38405926 PMCID: PMC10888918 DOI: 10.1101/2024.02.09.579686] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/27/2024]
Abstract
Transposable elements (TEs) are repetitive DNA sequences that create mutations and generate genetic diversity across the tree of life. In amniotic vertebrates, TEs have been mainly studied in mammals and birds, whose genomes generally display low TE diversity. Squamates (Order Squamata; ~11,000 extant species of lizards and snakes) show as much variation in TE abundance and activity as they do in species and phenotypes. Despite this high TE activity, squamate genomes are remarkably uniform in size. We hypothesize that novel, lineage-specific dynamics have evolved over the course of squamate evolution to constrain genome size across the order. Thus, squamates may represent a prime model for investigations into TE diversity and evolution. To understand the interplay between TEs and host genomes, we analyzed the evolutionary history of the CR1 retrotransposon, a TE family found in most tetrapod genomes. We compared 113 squamate genomes to the genomes of turtles, crocodilians, and birds, and used ancestral state reconstruction to identify shifts in the rate of CR1 copy number evolution across reptiles. We analyzed the repeat landscapes of CR1 in squamate genomes and determined that shifts in the rate of CR1 copy number evolution are associated with lineage-specific variation in CR1 activity. We then used phylogenetic reconstruction of CR1 subfamilies across amniotes to reveal both recent and ancient CR1 subclades across the squamate tree of life. The patterns of CR1 evolution in squamates contrast with those in other amniotes, suggesting key differences in how TEs interact with different host genomes and at different points across evolutionary history.
Affiliation(s)
- Simone M. Gable
- School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA
| | - Nicholas Bushroe
- School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA
| | - Jasmine Mendez
- School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA
| | - Adam Wilson
- School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA
| | - Brendan Pinto
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ, USA
- Department of Zoology, Milwaukee Public Museum, Milwaukee, WI, USA
| | - Tony Gamble
- Department of Zoology, Milwaukee Public Museum, Milwaukee, WI, USA
- Department of Biological Sciences, Marquette University, Milwaukee, WI, USA
- Bell Museum of Natural History, University of Minnesota, St. Paul, MN, USA
| | - Marc Tollis
- School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA
|
14
|
Feldmeyer B, Bornberg-Bauer E, Dohmen E, Fouks B, Heckenhauer J, Huylmans AK, Jones ARC, Stolle E, Harrison MC. Comparative Evolutionary Genomics in Insects. Methods Mol Biol 2024; 2802:473-514. [PMID: 38819569 DOI: 10.1007/978-1-0716-3838-5_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/01/2024]
Abstract
Genome sequencing quality, in terms of both read length and accuracy, is constantly improving. By combining long-read sequencing technologies with various scaffolding techniques, chromosome-level genome assemblies are now achievable at an affordable price for non-model organisms. Insects represent an exciting taxon for studying the genomic underpinnings of evolutionary innovations, due to ancient origins, immense species-richness, and broad phenotypic diversity. Here we summarize some of the most important methods for carrying out a comparative genomics study on insects. We describe available tools and offer concrete tips on all stages of such an endeavor from DNA extraction through genome sequencing, annotation, and several evolutionary analyses. Along the way we describe important insect-specific aspects, such as DNA extraction difficulties or gene families that are particularly difficult to annotate, and offer solutions. We describe results from several examples of comparative genomics analyses on insects to illustrate the fascinating questions that can now be addressed in this new age of genomics research.
Affiliation(s)
- Barbara Feldmeyer
- Senckenberg Biodiversity and Climate Research Centre (SBiK-F), Molecular Ecology, Frankfurt, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Elias Dohmen
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Bertrand Fouks
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Jacqueline Heckenhauer
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Frankfurt, Germany
- Department of Terrestrial Zoology, Senckenberg Research Institute and Natural History Museum Frankfurt, Frankfurt, Germany
| | - Ann Kathrin Huylmans
- Institute of Organismic and Molecular Evolution, Johannes Gutenberg University, Mainz, Germany
| | - Alun R C Jones
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Eckart Stolle
- Museum Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB), Bonn, Germany
| | - Mark C Harrison
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
|