1
|
Ballesio F, Pepe G, Ausiello G, Novelletto A, Helmer-Citterich M, Gherardini PF. Human lncRNAs harbor conserved modules embedded in different sequence contexts. Noncoding RNA Res 2024; 9:1257-1270. [PMID: 39040814 PMCID: PMC11261117 DOI: 10.1016/j.ncrna.2024.06.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Revised: 06/11/2024] [Accepted: 06/19/2024] [Indexed: 07/24/2024] Open
Abstract
We analyzed the structure of human long non-coding RNA (lncRNAs) genes to investigate whether the non-coding transcriptome is organized in modular domains, as is the case for protein-coding genes. To this aim, we compared all known human lncRNA exons and identified 340 pairs of exons with high sequence and/or secondary structure similarity but embedded in a dissimilar sequence context. We grouped these pairs in 106 clusters based on their reciprocal similarities. These shared modules are highly conserved between humans and the four great ape species, display evidence of purifying selection and likely arose as a result of recent segmental duplications. Our analysis contributes to the understanding of the mechanisms driving the evolution of the non-coding genome and suggests additional strategies towards deciphering the functional complexity of this class of molecules.
Collapse
Affiliation(s)
- Francesco Ballesio
- PhD Program in Cellular and Molecular Biology, Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| | - Gerardo Pepe
- Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| | - Gabriele Ausiello
- Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| | - Andrea Novelletto
- Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| | | | | |
Collapse
|
2
|
Karageorgiou C, Gokcumen O, Dennis MY. Deciphering the role of structural variation in human evolution: a functional perspective. Curr Opin Genet Dev 2024; 88:102240. [PMID: 39121701 DOI: 10.1016/j.gde.2024.102240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2024] [Revised: 06/27/2024] [Accepted: 07/23/2024] [Indexed: 08/12/2024]
Abstract
Advances in sequencing technologies have enabled the comparison of high-quality genomes of diverse primate species, revealing vast amounts of divergence due to structural variation. Given their large size, structural variants (SVs) can simultaneously alter the function and regulation of multiple genes. Studies estimate that collectively more than 3.5% of the genome is divergent in humans versus other great apes, impacting thousands of genes. Functional genomics and gene-editing tools in various model systems recently emerged as an exciting frontier - investigating the wide-ranging impacts of SVs on molecular, cellular, and systems-level phenotypes. This review examines existing research and identifies future directions to broaden our understanding of the functional roles of SVs on phenotypic innovations and diversity impacting uniquely human features, ranging from cognition to metabolic adaptations.
Collapse
Affiliation(s)
- Charikleia Karageorgiou
- Department of Biological Sciences, University at Buffalo, 109 Cooke Hall, Buffalo, NY 14260, USA. https://twitter.com/@evobioclio
| | - Omer Gokcumen
- Department of Biological Sciences, University at Buffalo, 109 Cooke Hall, Buffalo, NY 14260, USA
| | - Megan Y Dennis
- Department of Biochemistry & Molecular Medicine, Genome Center, and MIND Institute, University of California, Davis, CA 95616, USA.
| |
Collapse
|
3
|
Johansson PA, Palmer JM, McGrath L, Warrier S, Hamilton HR, Beckman T, D'Mellow MG, Brooks KM, Glasson W, Hayward NK, Pritchard AL. Germline Variants in Patients Affected by Both Uveal and Cutaneous Melanoma. Pigment Cell Melanoma Res 2024. [PMID: 39315505 DOI: 10.1111/pcmr.13199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2024] [Revised: 08/15/2024] [Accepted: 09/03/2024] [Indexed: 09/25/2024]
Abstract
Uveal melanoma (UM) and nonacral cutaneous melanoma (CM) are distinct entities with varied genetic landscapes despite both arising from melanocytes. There are, however, similarities in that they most frequently affect people of European ancestry, and high penetrance germline variants in BAP1, POT1 and CDKN2A have been shown to predispose to both UM and CM. This study aims to further explore germline variants in patients affected by both UM and CM, shedding light on the underlying genetic mechanism causing these diseases. Using exome sequencing we analysed germline DNA samples from a cohort of 83 Australian patients diagnosed with both UM and CM. Eight (10%) patients were identified that carried pathogenic mutations in known melanoma predisposition genes POT1, MITF, OCA2, SLC45A2 and TYR. Three (4%) patients carried pathogenic variants in genes previously linked with other cancer syndromes (ATR, BRIP1 and MSH6) and another three cases carried monoallelic pathogenic variants in recessive cancer genes (xeroderma pigmentosum and Fanconi anaemia), indicating that reduced penetrance of phenotype in these individuals may contribute to the development of both UM and CM. These findings highlight the need for further studies characterising the role of these genes in melanoma susceptibility.
Collapse
Affiliation(s)
- Peter A Johansson
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
- University of Queensland, Brisbane, Queensland, Australia
| | - Jane M Palmer
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Lindsay McGrath
- Queensland Ocular Oncology Service, The Terrace Eye Centre, Brisbane, Queensland, Australia
| | - Sunil Warrier
- Queensland Ocular Oncology Service, The Terrace Eye Centre, Brisbane, Queensland, Australia
| | - Hayley R Hamilton
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Timothy Beckman
- Queensland Ocular Oncology Service, The Terrace Eye Centre, Brisbane, Queensland, Australia
| | - Matthew G D'Mellow
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Kelly M Brooks
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
- University of Queensland, Brisbane, Queensland, Australia
| | - William Glasson
- Queensland Ocular Oncology Service, The Terrace Eye Centre, Brisbane, Queensland, Australia
| | - Nicholas K Hayward
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Antonia L Pritchard
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
- Department of Genetics and Immunology, Division of Biomedical Science, University of the Highlands and Islands, Inverness, Scotland, UK
| |
Collapse
|
4
|
Chundru VK, Zhang Z, Walter K, Lindsay SJ, Danecek P, Eberhardt RY, Gardner EJ, Malawsky DS, Wigdor EM, Torene R, Retterer K, Wright CF, Ólafsdóttir H, Guillen Sacoto MJ, Ayaz A, Akbeyaz IH, Türkdoğan D, Al Balushi AI, Bertoli-Avella A, Bauer P, Szenker-Ravi E, Reversade B, McWalter K, Sheridan E, Firth HV, Hurles ME, Samocha KE, Ustach VD, Martin HC. Federated analysis of autosomal recessive coding variants in 29,745 developmental disorder patients from diverse populations. Nat Genet 2024:10.1038/s41588-024-01910-8. [PMID: 39313616 DOI: 10.1038/s41588-024-01910-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Accepted: 08/14/2024] [Indexed: 09/25/2024]
Abstract
Autosomal recessive coding variants are well-known causes of rare disorders. We quantified the contribution of these variants to developmental disorders in a large, ancestrally diverse cohort comprising 29,745 trios, of whom 20.4% had genetically inferred non-European ancestries. The estimated fraction of patients attributable to exome-wide autosomal recessive coding variants ranged from ~2-19% across genetically inferred ancestry groups and was significantly correlated with average autozygosity. Established autosomal recessive developmental disorder-associated (ARDD) genes explained 84.0% of the total autosomal recessive coding burden, and 34.4% of the burden in these established genes was explained by variants not already reported as pathogenic in ClinVar. Statistical analyses identified two novel ARDD genes: KBTBD2 and ZDHHC16. This study expands our understanding of the genetic architecture of developmental disorders across diverse genetically inferred ancestry groups and suggests that improving strategies for interpreting missense variants in known ARDD genes may help diagnose more patients than discovering the remaining genes.
Collapse
Affiliation(s)
- V Kartik Chundru
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
- Department of Clinical and Biomedical Sciences, University of Exeter Medical School, Royal Devon and Exeter Hospital, Exeter, UK
| | - Zhancheng Zhang
- GeneDx, Gaithersburg, MD, USA
- Deka Biosciences, Germantown, MD, USA
| | - Klaudia Walter
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Sarah J Lindsay
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Petr Danecek
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | | | - Eugene J Gardner
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
- MRC Epidemiology Unit, Cambridge, UK
| | | | - Emilie M Wigdor
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
- Institute of Developmental and Regenerative Medicine, Department of Paediatrics, University of Oxford, Oxford, UK
| | - Rebecca Torene
- GeneDx, Gaithersburg, MD, USA
- Geisinger, Danville, PA, USA
| | - Kyle Retterer
- GeneDx, Gaithersburg, MD, USA
- Geisinger, Danville, PA, USA
| | - Caroline F Wright
- Department of Clinical and Biomedical Sciences, University of Exeter Medical School, Royal Devon and Exeter Hospital, Exeter, UK
| | | | | | - Akif Ayaz
- Istanbul Medipol University, Medical School, Department of Medical Genetics, Istanbul, Turkey
| | - Ismail Hakki Akbeyaz
- Marmara University Medical Faculty, Pendik Training and Research Hospital, Department of Pediatric Neurology, Istanbul, Turkey
| | - Dilşad Türkdoğan
- Marmara University Medical Faculty, Pendik Training and Research Hospital, Department of Pediatric Neurology, Istanbul, Turkey
| | | | | | - Peter Bauer
- Medical Genetics, CENTOGENE GmbH, Rostock, Germany
- Clinic of Internal Medicine, Department of Hematology, Oncology, and Palliative Medicine, University Medicine Rostock, Rostock, Germany
| | | | - Bruno Reversade
- Laboratory of Human Genetics & Therapeutics, BESE, KAUST, Thuwal, Saudi Arabia
| | | | - Eamonn Sheridan
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
- Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds, UK
- Yorkshire Regional Genetics Service, Chapel Allerton Hospital, Leeds, UK
| | - Helen V Firth
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
- Cambridge University Hospitals Foundation Trust, Addenbrooke's Hospital, Cambridge, UK
| | | | - Kaitlin E Samocha
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | | | - Hilary C Martin
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK.
| |
Collapse
|
5
|
Bravo JI, Zhang L, Benayoun BA. Multi-ancestry GWAS reveals loci linked to human variation in LINE-1- and Alu-copy numbers. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.10.612283. [PMID: 39314493 PMCID: PMC11419044 DOI: 10.1101/2024.09.10.612283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
Long INterspersed Element-1 (LINE-1; L1) and Alu are two families of transposable elements (TEs) occupying ~17% and ~11% of the human genome, respectively. Though only a small fraction of L1 copies is able to produce the machinery to mobilize autonomously, Alu elements and degenerate L1 copies can hijack their functional machinery and mobilize in trans. The expression and subsequent copy number expansion of L1 and Alu can exert pathological effects on their hosts, promoting genome instability, inflammation, and cell cycle alterations. These features have made L1 and Alu promising focus subjects in studies of aging and aging diseases where they can become active. However, the mechanisms regulating variation in their expression and copy number remain incompletely characterized. Moreover, the relevance of known mechanisms to diverse human populations remains unclear, as mechanisms are often characterized in isogenic cell culture models. To address these gaps, we leveraged genomic data from the 1000 Genomes Project to carry out a trans-ethnic GWAS of L1 and Alu insertion global singletons. These singletons are rare insertions observed only once in a population, potentially reflecting recently acquired L1 and Alu integrants or structural variants, and which we used as proxies for L1/Alu-associated copy number variation. Our computational approach identified single nucleotide variants in genomic regions containing genes with potential and known TE regulatory properties, and it enriched for single nucleotide variants in regions containing known regulators of L1 expression. Moreover, we identified many reference TE copies and polymorphic structural variants that were associated with L1/Alu singletons, suggesting their potential contribution to TE copy number variation through transposition-dependent or transposition-independent mechanisms. Finally, a transcriptional analysis of lymphoblastoid cells highlighted potential cell cycle alterations in a subset of samples harboring L1/Alu singletons. Collectively, our results (i) suggest that known TE regulatory mechanisms may also play regulatory roles in diverse human populations, (ii) expand the list of genic and repetitive genomic loci implicated in TE copy number variation, and (iii) reinforce the links between TEs and disease.
Collapse
Affiliation(s)
- Juan I. Bravo
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
| | - Lucia Zhang
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
- Quantitative and Computational Biology Department, USC Dornsife College of Letters, Arts and Sciences, Los Angeles, California, USA
| | - Bérénice A. Benayoun
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
- Molecular and Computational Biology Department, USC Dornsife College of Letters, Arts and Sciences, Los Angeles, CA 90089, USA
- Biochemistry and Molecular Medicine Department, USC Keck School of Medicine, Los Angeles, CA 90089, USA
- USC Norris Comprehensive Cancer Center, Epigenetics and Gene Regulation, Los Angeles, CA 90089, USA
- USC Stem Cell Initiative, Los Angeles, CA 90089, USA
| |
Collapse
|
6
|
Uribe-Salazar JM, Kaya G, Weyenberg K, Radke B, Hino K, Soto DC, Shiu JL, Zhang W, Ingamells C, Haghani NK, Xu E, Rosas J, Simó S, Miesfeld J, Glaser T, Baraban SC, Jao LE, Dennis MY. Zebrafish models of human-duplicated gene SRGAP2 reveal novel functions in microglia and visual system development. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.11.612570. [PMID: 39314374 PMCID: PMC11418993 DOI: 10.1101/2024.09.11.612570] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
Recent expansion of duplicated genes unique in the Homo lineage likely contributed to brain evolution and other human-specific traits. One hallmark example is the expansion of the human SRGAP2 family, resulting in a human-specific paralog SRGAP2C . Introduction of SRGAP2C in mouse models is associated with altering cortical neuronal migration, axon guidance, synaptogenesis, and sensory-task performance. Truncated, human-specific SRGAP2C heterodimerizes with the full-length ancestral gene product SRGAP2A and antagonizes its functions. However, the significance of SRGAP2 duplication beyond neocortex development has not been elucidated due to the embryonic lethality of complete Srgap2 knockout in mice. Using zebrafish, we showed that srgap2 knockout results in viable offspring that phenocopy "humanized" SRGAP2C larvae. Specifically, human SRGAP2C protein interacts with zebrafish Srgap2, demonstrating similar Srgap2 functional antagonism observed in mice. Shared traits between knockout and humanized zebrafish larvae include altered morphometric features (i.e., reduced body length and inter-eye distance) and differential expression of synapse-, axogenesis-, vision-related genes. Through single-cell transcriptome analysis, we further observed a skewed balance of excitatory and inhibitory neurons that likely contributes to increased susceptibility to seizures displayed by Srgap2 mutant larvae, a phenotype resembling SRGAP2 loss-of-function in a child with early infantile epileptic encephalopathy. Single-cell data also pointed to strong microglia expression of srgap2 with mutants exhibiting altered membrane dynamics and likely delayed maturation of microglial cells. srgap2 -expressing microglia cells were also detected in the developing eye together with altered expression of genes related to axogenesis and synaptogenesis in mutant retinal cells. Consistent with the perturbed gene expression in the retina, we found that SRGAP2 mutant larvae exhibited increased sensitivity to broad and fine visual cues. Finally, comparing the transcriptomes of relevant cell types between human (+ SRGAP2C ) and non-human primates (- SRGAP2C ) revealed significant overlaps of gene alterations with mutant cells in our zebrafish models; this suggests that SRGAP2C plays similar roles altering microglia and the visual system in modern humans. Together, our functional characterization of zebrafish Srgap2 and human SRGAP2C in zebrafish uncovered novel gene functions and highlights the strength of cross-species analysis in understanding the development of human-specific features.
Collapse
|
7
|
Engelbrecht E, Rodriguez OL, Watson CT. Addressing Technical Pitfalls in Pursuit of Molecular Factors That Mediate Immunoglobulin Gene Regulation. JOURNAL OF IMMUNOLOGY (BALTIMORE, MD. : 1950) 2024; 213:651-662. [PMID: 39007649 PMCID: PMC11333172 DOI: 10.4049/jimmunol.2400131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Accepted: 06/13/2024] [Indexed: 07/16/2024]
Abstract
The expressed Ab repertoire is a critical determinant of immune-related phenotypes. Ab-encoding transcripts are distinct from other expressed genes because they are transcribed from somatically rearranged gene segments. Human Abs are composed of two identical H and L chain polypeptides derived from genes in IGH locus and one of two L chain loci. The combinatorial diversity that results from Ab gene rearrangement and the pairing of different H and L chains contributes to the immense diversity of the baseline Ab repertoire. During rearrangement, Ab gene selection is mediated by factors that influence chromatin architecture, promoter/enhancer activity, and V(D)J recombination. Interindividual variation in the composition of the Ab repertoire associates with germline variation in IGH, implicating polymorphism in Ab gene regulation. Determining how IGH variants directly mediate gene regulation will require integration of these variants with other functional genomic datasets. In this study, we argue that standard approaches using short reads have limited utility for characterizing regulatory regions in IGH at haplotype resolution. Using simulated and chromatin immunoprecipitation sequencing reads, we define features of IGH that limit use of short reads and a single reference genome, namely 1) the highly duplicated nature of the DNA sequence in IGH and 2) structural polymorphisms that are frequent in the population. We demonstrate that personalized diploid references enhance performance of short-read data for characterizing mappable portions of the locus, while also showing that long-read profiling tools will ultimately be needed to fully resolve functional impacts of IGH germline variation on expressed Ab repertoires.
Collapse
Affiliation(s)
- Eric Engelbrecht
- Department of Biochemistry and Molecular Genetics, University of Louisville, Louisville, KY
| | - Oscar L Rodriguez
- Department of Biochemistry and Molecular Genetics, University of Louisville, Louisville, KY
| | - Corey T Watson
- Department of Biochemistry and Molecular Genetics, University of Louisville, Louisville, KY
| |
Collapse
|
8
|
Loh CA, Shields DA, Schwing A, Evrony GD. High-fidelity, large-scale targeted profiling of microsatellites. Genome Res 2024; 34:1008-1026. [PMID: 39013593 PMCID: PMC11368184 DOI: 10.1101/gr.278785.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 07/11/2024] [Indexed: 07/18/2024]
Abstract
Microsatellites are highly mutable sequences that can serve as markers for relationships among individuals or cells within a population. The accuracy and resolution of reconstructing these relationships depends on the fidelity of microsatellite profiling and the number of microsatellites profiled. However, current methods for targeted profiling of microsatellites incur significant "stutter" artifacts that interfere with accurate genotyping, and sequencing costs preclude whole-genome microsatellite profiling of a large number of samples. We developed a novel method for accurate and cost-effective targeted profiling of a panel of more than 150,000 microsatellites per sample, along with a computational tool for designing large-scale microsatellite panels. Our method addresses the greatest challenge for microsatellite profiling-"stutter" artifacts-with a low-temperature hybridization capture that significantly reduces these artifacts. We also developed a computational tool for accurate genotyping of the resulting microsatellite sequencing data that uses an ensemble approach integrating three microsatellite genotyping tools, which we optimize by analysis of de novo microsatellite mutations in human trios. Altogether, our suite of experimental and computational tools enables high-fidelity, large-scale profiling of microsatellites, which may find utility in diverse applications such as lineage tracing, population genetics, ecology, and forensics.
Collapse
Affiliation(s)
- Caitlin A Loh
- Center for Human Genetics and Genomics, New York University Grossman School of Medicine, New York, New York 10016, USA
- Department of Pediatrics, Department of Neuroscience & Physiology, Institute for Systems Genetics, Perlmutter Cancer Center, and Neuroscience Institute, New York University Grossman School of Medicine, New York, New York 10016, USA
| | - Danielle A Shields
- Center for Human Genetics and Genomics, New York University Grossman School of Medicine, New York, New York 10016, USA
- Department of Pediatrics, Department of Neuroscience & Physiology, Institute for Systems Genetics, Perlmutter Cancer Center, and Neuroscience Institute, New York University Grossman School of Medicine, New York, New York 10016, USA
| | - Adam Schwing
- Center for Human Genetics and Genomics, New York University Grossman School of Medicine, New York, New York 10016, USA
- Department of Pediatrics, Department of Neuroscience & Physiology, Institute for Systems Genetics, Perlmutter Cancer Center, and Neuroscience Institute, New York University Grossman School of Medicine, New York, New York 10016, USA
| | - Gilad D Evrony
- Center for Human Genetics and Genomics, New York University Grossman School of Medicine, New York, New York 10016, USA;
- Department of Pediatrics, Department of Neuroscience & Physiology, Institute for Systems Genetics, Perlmutter Cancer Center, and Neuroscience Institute, New York University Grossman School of Medicine, New York, New York 10016, USA
| |
Collapse
|
9
|
Rossetto IH, Ludington AJ, Simões BF, Van Cao N, Sanders KL. Dynamic Expansions and Retinal Expression of Spectrally Distinct Short-Wavelength Opsin Genes in Sea Snakes. Genome Biol Evol 2024; 16:evae150. [PMID: 38985750 PMCID: PMC11316226 DOI: 10.1093/gbe/evae150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 07/02/2024] [Accepted: 07/04/2024] [Indexed: 07/12/2024] Open
Abstract
The photopigment-encoding visual opsin genes that mediate color perception show great variation in copy number and adaptive function across vertebrates. An open question is how this variation has been shaped by the interaction of lineage-specific structural genomic architecture and ecological selection pressures. We contribute to this issue by investigating the expansion dynamics and expression of the duplicated Short-Wavelength-Sensitive-1 opsin (SWS1) in sea snakes (Elapidae). We generated one new genome, 45 resequencing datasets, 10 retinal transcriptomes, and 81 SWS1 exon sequences for sea snakes, and analyzed these alongside 16 existing genomes for sea snakes and their terrestrial relatives. Our analyses revealed multiple independent transitions in SWS1 copy number in the marine Hydrophis clade, with at least three lineages having multiple intact SWS1 genes: the previously studied Hydrophis cyanocinctus and at least two close relatives of this species; Hydrophis atriceps and Hydrophis fasciatus; and an individual Hydrophis curtus. In each lineage, gene copy divergence at a key spectral tuning site resulted in distinct UV and Violet/Blue-sensitive SWS1 subtypes. Both spectral variants were simultaneously expressed in the retinae of H. cyanocinctus and H. atriceps, providing the first evidence that these SWS1 expansions confer novel phenotypes. Finally, chromosome annotation for nine species revealed shared structural features in proximity to SWS1 regardless of copy number. If these features are associated with SWS1 duplication, expanded opsin complements could be more common in snakes than is currently recognized. Alternatively, selection pressures specific to aquatic environments could favor improved chromatic distinction in just some lineages.
Collapse
Affiliation(s)
- Isaac H Rossetto
- School of Biological Sciences, The University of Adelaide, Adelaide, South Australia 5005, Australia
| | - Alastair J Ludington
- School of Biological Sciences, The University of Adelaide, Adelaide, South Australia 5005, Australia
| | - Bruno F Simões
- School of Biological Sciences, The University of Adelaide, Adelaide, South Australia 5005, Australia
- School of Biological and Marine Sciences, University of Plymouth, Plymouth PL4 8AA, UK
| | - Nguyen Van Cao
- Department of Aquaculture Biotechnology, Vietnamese Academy of Science and Technology, Institute of Oceanography, Nha Trang, Khánh Hòa, Vietnam
| | - Kate L Sanders
- School of Biological Sciences, The University of Adelaide, Adelaide, South Australia 5005, Australia
| |
Collapse
|
10
|
Yoo D, Rhie A, Hebbar P, Antonacci F, Logsdon GA, Solar SJ, Antipov D, Pickett BD, Safonova Y, Montinaro F, Luo Y, Malukiewicz J, Storer JM, Lin J, Sequeira AN, Mangan RJ, Hickey G, Anez GM, Balachandran P, Bankevich A, Beck CR, Biddanda A, Borchers M, Bouffard GG, Brannan E, Brooks SY, Carbone L, Carrel L, Chan AP, Crawford J, Diekhans M, Engelbrecht E, Feschotte C, Formenti G, Garcia GH, de Gennaro L, Gilbert D, Green RE, Guarracino A, Gupta I, Haddad D, Han J, Harris RS, Hartley GA, Harvey WT, Hiller M, Hoekzema K, Houck ML, Jeong H, Kamali K, Kellis M, Kille B, Lee C, Lee Y, Lees W, Lewis AP, Li Q, Loftus M, Loh YHE, Loucks H, Ma J, Mao Y, Martinez JFI, Masterson P, McCoy RC, McGrath B, McKinney S, Meyer BS, Miga KH, Mohanty SK, Munson KM, Pal K, Pennell M, Pevzner PA, Porubsky D, Potapova T, Ringeling FR, Rocha JL, Ryder OA, Sacco S, Saha S, Sasaki T, Schatz MC, Schork NJ, Shanks C, Smeds L, Son DR, Steiner C, Sweeten AP, Tassia MG, Thibaud-Nissen F, Torres-González E, Trivedi M, Wei W, Wertz J, Yang M, Zhang P, Zhang S, Zhang Y, Zhang Z, Zhao SA, Zhu Y, Jarvis ED, Gerton JL, Rivas-González I, Paten B, Szpiech ZA, Huber CD, Lenz TL, Konkel MK, Yi SV, Canzar S, Watson CT, Sudmant PH, Molloy E, Garrison E, Lowe CB, Ventura M, O’Neill RJ, Koren S, Makova KD, Phillippy AM, Eichler EE. Complete sequencing of ape genomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.31.605654. [PMID: 39131277 PMCID: PMC11312596 DOI: 10.1101/2024.07.31.605654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
We present haplotype-resolved reference genomes and comparative analyses of six ape species, namely: chimpanzee, bonobo, gorilla, Bornean orangutan, Sumatran orangutan, and siamang. We achieve chromosome-level contiguity with unparalleled sequence accuracy (<1 error in 500,000 base pairs), completely sequencing 215 gapless chromosomes telomere-to-telomere. We resolve challenging regions, such as the major histocompatibility complex and immunoglobulin loci, providing more in-depth evolutionary insights. Comparative analyses, including human, allow us to investigate the evolution and diversity of regions previously uncharacterized or incompletely studied without bias from mapping to the human reference. This includes newly minted gene families within lineage-specific segmental duplications, centromeric DNA, acrocentric chromosomes, and subterminal heterochromatin. This resource should serve as a definitive baseline for all future evolutionary studies of humans and our closest living ape relatives.
Collapse
Affiliation(s)
- DongAhn Yoo
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Arang Rhie
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Prajna Hebbar
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95060, USA
| | - Francesca Antonacci
- Department of Biosciences, Biotechnology and Environment, University of Bari, Bari, 70124, Italy
| | - Glennis A. Logsdon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Department of Genetics, Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19103, USA
| | - Steven J. Solar
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Dmitry Antipov
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Brandon D. Pickett
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Yana Safonova
- Computer Science and Engineering Department, Huck Institutes of Life Sciences, Pennsylvania State University, State College, PA 16801, USA
| | - Francesco Montinaro
- Department of Biosciences, Biotechnology and Environment, University of Bari, Bari, 70124, Italy
- Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Yanting Luo
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA
| | - Joanna Malukiewicz
- Research Unit for Evolutionary Immunogenomics, Department of Biology, University of Hamburg, 20146 Hamburg, Germany
| | - Jessica M. Storer
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA
| | - Jiadong Lin
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Abigail N. Sequeira
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Riley J. Mangan
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Genetics Training Program, Harvard Medical School, Boston, MA 02115, USA
| | - Glenn Hickey
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95060, USA
| | | | | | - Anton Bankevich
- Computer Science and Engineering Department, Huck Institutes of Life Sciences, Pennsylvania State University, State College, PA 16801, USA
| | - Christine R. Beck
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT, USA
| | - Arjun Biddanda
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Matthew Borchers
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Gerard G. Bouffard
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Emry Brannan
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Shelise Y. Brooks
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Lucia Carbone
- Department of Medicine, KCVI, Oregon Health Sciences University, Portland, OR, USA
- Division of Genetics, Oregon National Primate Research Center, Beaverton, OR, USA
| | - Laura Carrel
- PSU Medical School, Penn State University School of Medicine, Hershey, PA, USA
| | - Agnes P. Chan
- The Translational Genomics Research Institute, a part of the City of Hope National Medical Center, Phoenix, AZ, USA
| | - Juyun Crawford
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Mark Diekhans
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95060, USA
| | - Eric Engelbrecht
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Louisville, Louisville, KY, USA
| | - Cedric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Giulio Formenti
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY 10021, USA
| | - Gage H. Garcia
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Luciana de Gennaro
- Department of Biosciences, Biotechnology and Environment, University of Bari, Bari, 70124, Italy
| | - David Gilbert
- San Diego Biomedical Research Institute, San Diego, CA, USA
| | | | - Andrea Guarracino
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | - Ishaan Gupta
- Department of Computer Science and Engineering, University of California San Diego, CA, USA
| | - Diana Haddad
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Junmin Han
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China
| | - Robert S. Harris
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Gabrielle A. Hartley
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA
| | - William T. Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Michael Hiller
- LOEWE Centre for Translational Biodiversity Genomics, Senckenberg Research Institute, Goethe University, Frankfurt, Germany
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Marlys L. Houck
- San Diego Zoo Wildlife Alliance, Escondido, CA, 92027-7000, USA
| | - Hyeonsoo Jeong
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Kaivan Kamali
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Manolis Kellis
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Bryce Kille
- Department of Computer Science, Rice University, Houston, TX 77005, USA
| | - Chul Lee
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | - Youngho Lee
- Laboratory of bioinformatics and population genetics, Interdisciplinary program in bioinformatics, Seoul National University, Republic of Korea
| | - William Lees
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Louisville, Louisville, KY, USA
- Bioengineering Program, Faculty of Engineering, Bar-Ilan University, Ramat Gan, Israel
| | - Alexandra P. Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Qiuhui Li
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Mark Loftus
- Department of Genetics & Biochemistry, Clemson University, Clemson, SC, USA
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
| | - Yong Hwee Eddie Loh
- Neuroscience Research Institute, University of California, Santa Barbara, CA, USA
| | - Hailey Loucks
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95060, USA
| | - Jian Ma
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, PA, USA
| | - Yafei Mao
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China
- Center for Genomic Research, International Institutes of Medicine, Fourth Affiliated Hospital, Zhejiang University, Yiwu, Zhejiang, China
- Shanghai Jiao Tong University Chongqing Research Institute, Chongqing, China
| | - Juan F. I. Martinez
- Computer Science and Engineering Department, Huck Institutes of Life Sciences, Pennsylvania State University, State College, PA 16801, USA
| | - Patrick Masterson
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Rajiv C. McCoy
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Barbara McGrath
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Sean McKinney
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Britta S. Meyer
- Research Unit for Evolutionary Immunogenomics, Department of Biology, University of Hamburg, 20146 Hamburg, Germany
| | - Karen H. Miga
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95060, USA
| | - Saswat K. Mohanty
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Katherine M. Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Karol Pal
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Matt Pennell
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Pavel A. Pevzner
- Department of Computer Science and Engineering, University of California San Diego, CA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Tamara Potapova
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA
| | - Francisca R. Ringeling
- Faculty of Informatics and Data Science, University of Regensburg, 93053 Regensburg, Germany
| | - Joana L. Rocha
- Department of Integrative Biology, University of California, Berkeley, Berkeley, USA
| | - Oliver A. Ryder
- San Diego Zoo Wildlife Alliance, Escondido, CA, 92027-7000, USA
| | - Samuel Sacco
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - Swati Saha
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Louisville, Louisville, KY, USA
| | - Takayo Sasaki
- San Diego Biomedical Research Institute, San Diego, CA, USA
| | - Michael C. Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Nicholas J. Schork
- The Translational Genomics Research Institute, a part of the City of Hope National Medical Center, Phoenix, AZ, USA
| | - Cole Shanks
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95060, USA
| | - Linnéa Smeds
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Dongmin R. Son
- Department of Ecology, Evolution and Marine Biology, Neuroscience Research Institute, University of California, Santa Barbara, CA, USA
| | - Cynthia Steiner
- San Diego Zoo Wildlife Alliance, Escondido, CA, 92027-7000, USA
| | - Alexander P. Sweeten
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Michael G. Tassia
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | - Mihir Trivedi
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Wenjie Wei
- School of Life Sciences, Westlake University, Hangzhou 310024, China
- National Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, 430070, Wuhan, China
| | - Julie Wertz
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Muyu Yang
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, PA, USA
| | - Panpan Zhang
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Shilong Zhang
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China
| | - Yang Zhang
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, PA, USA
| | - Zhenmiao Zhang
- Department of Computer Science and Engineering, University of California San Diego, CA, USA
| | - Sarah A. Zhao
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Yixin Zhu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Erich D. Jarvis
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | | | - Iker Rivas-González
- Department of Primate Behavior and Evolution, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95060, USA
| | - Zachary A. Szpiech
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Christian D. Huber
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Tobias L. Lenz
- Research Unit for Evolutionary Immunogenomics, Department of Biology, University of Hamburg, 20146 Hamburg, Germany
| | - Miriam K. Konkel
- Department of Genetics & Biochemistry, Clemson University, Clemson, SC, USA
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
| | - Soojin V. Yi
- Department of Ecology, Evolution and Marine Biology, Department of Molecular, Cellular and Developmental Biology, Neuroscience Research Institute, University of California, Santa Barbara, CA, USA
| | - Stefan Canzar
- Faculty of Informatics and Data Science, University of Regensburg, 93053 Regensburg, Germany
| | - Corey T. Watson
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Louisville, Louisville, KY, USA
| | - Peter H. Sudmant
- Department of Integrative Biology, University of California, Berkeley, Berkeley, USA
- Center for Computational Biology, University of California, Berkeley, Berkeley, USA
| | - Erin Molloy
- Department of Computer Science, University of Maryland, College Park, MD 20742, USA
| | - Erik Garrison
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | - Craig B. Lowe
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA
| | - Mario Ventura
- Department of Biosciences, Biotechnology and Environment, University of Bari, Bari, 70124, Italy
| | - Rachel J. O’Neill
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA
- Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT, USA
- Departments of Molecular and Cell Biology, UConn Storrs, CT, USA
| | - Sergey Koren
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Kateryna D. Makova
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Adam M. Phillippy
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| |
Collapse
|
11
|
Isakov O, Marek-Yagel D, Greenberg R, Naftali M, Ben-Shachar S. PANGEN: an online platform for the comparison and creation of diagnostic gene panels. Database (Oxford) 2024; 2024:baae065. [PMID: 39043627 PMCID: PMC11265858 DOI: 10.1093/database/baae065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 06/23/2024] [Accepted: 07/19/2024] [Indexed: 07/25/2024]
Abstract
Targeted gene panel sequencing is used to limit the search for causative genetic variants solely to genes with an established association with the phenotype. The design of gene panels is challenging due to the lack of consensus regarding phenotypic associations for some genes, which results in high variation in gene composition for the same panel offered by different laboratories. We developed PANGEN, a platform that provides a centralized resource for gene panel information, with the ability to compare and generate new intelligent diagnostic panels. Gene-phenotype associations were collected from 12 public and commercial sources (Blueprint, Cegat, Centogene, ClinGen, Fulgent, GeneDx, Health in Code, Human Phenotype Ontology, Invitae, PanelApp, Prevention genetics, and Pronto diagnostics). Gene-phenotype associations are categorized into tiers according to categories derived from the original source panel. Pairwise panel similarity was calculated by dividing the number of common genes by the total number of genes in both panels. Regions with extreme guanine-cytosine (GC) content were collected from the Genome in a Bottle stratifications dataset, and putative genomic duplications were retrieved from the University of Santa Cruz database. Overall, 1533 panels, 9759 phenotypes, and 6979 genes were collected. The platform provides an interface to (i) explore and compare collected panels, (ii) find similar panels, (iii) identify genes with high GC content or duplication levels, (iv) generate gene panels by combining panels from various sources, and (v) stratify a generated panel into genes with a strong phenotype association ('core') and those with a weaker association ('extended'). The presented platform represents a unique resource for gene panel exploration and comparison that facilitates the generation of tailored diagnostic panels through a public online web server. Database URL: https://c-gc.shinyapps.io/PANGEN/.
Collapse
Affiliation(s)
- Ofer Isakov
- Raphael Recanati Genetic Institute, Rabin Medical Center-Beilinson Hospital, Zeev Jabotinsky 39, Petach Tikva 4941492, Israel
- Clalit Research Institute, Clalit Health Services, Tuval 40, Ramat Gan 5252247, Israel
- The Ivan and Francesca Berkowitz Family Living Laboratory Collaboration, Harvard Medical School and Clalit Research Institute, 10 Shattuck Street, Suite 514, Boston, MA 02115, USA
- Faculty of Medicine, Tel Aviv University, Klachkin 35, Tel Aviv 6997801, Israel
| | - Dina Marek-Yagel
- Clalit Research Institute, Clalit Health Services, Tuval 40, Ramat Gan 5252247, Israel
| | - Rotem Greenberg
- Clalit Research Institute, Clalit Health Services, Tuval 40, Ramat Gan 5252247, Israel
| | - Michal Naftali
- Clalit Research Institute, Clalit Health Services, Tuval 40, Ramat Gan 5252247, Israel
| | - Shay Ben-Shachar
- Clalit Research Institute, Clalit Health Services, Tuval 40, Ramat Gan 5252247, Israel
- The Ivan and Francesca Berkowitz Family Living Laboratory Collaboration, Harvard Medical School and Clalit Research Institute, 10 Shattuck Street, Suite 514, Boston, MA 02115, USA
- Faculty of Medicine, Tel Aviv University, Klachkin 35, Tel Aviv 6997801, Israel
| |
Collapse
|
12
|
Chung TH, Zhuravskaya A, Makeyev EV. Regulation potential of transcribed simple repeated sequences in developing neurons. Hum Genet 2024; 143:875-895. [PMID: 38153590 PMCID: PMC11294396 DOI: 10.1007/s00439-023-02626-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 11/28/2023] [Indexed: 12/29/2023]
Abstract
Simple repeated sequences (SRSs), defined as tandem iterations of microsatellite- to satellite-sized DNA units, occupy a substantial part of the human genome. Some of these elements are known to be transcribed in the context of repeat expansion disorders. Mounting evidence suggests that the transcription of SRSs may also contribute to normal cellular functions. Here, we used genome-wide bioinformatics approaches to systematically examine SRS transcriptional activity in cells undergoing neuronal differentiation. We identified thousands of long noncoding RNAs containing >200-nucleotide-long SRSs (SRS-lncRNAs), with hundreds of these transcripts significantly upregulated in the neural lineage. We show that SRS-lncRNAs often originate from telomere-proximal regions and that they have a strong potential to form multivalent contacts with a wide range of RNA-binding proteins. Our analyses also uncovered a cluster of neurally upregulated SRS-lncRNAs encoded in a centromere-proximal part of chromosome 9, which underwent an evolutionarily recent segmental duplication. Using a newly established in vitro system for rapid neuronal differentiation of induced pluripotent stem cells, we demonstrate that at least some of the bioinformatically predicted SRS-lncRNAs, including those encoded in the segmentally duplicated part of chromosome 9, indeed increase their expression in developing neurons to readily detectable levels. These and other lines of evidence suggest that many SRSs may be expressed in a cell type and developmental stage-specific manner, providing a valuable resource for further studies focused on the functional consequences of SRS-lncRNAs in the normal development of the human brain, as well as in the context of neurodevelopmental disorders.
Collapse
Affiliation(s)
- Tek Hong Chung
- Centre for Developmental Neurobiology, New Hunt's House, King's College London, London, SE1 1UL, UK
| | - Anna Zhuravskaya
- Centre for Developmental Neurobiology, New Hunt's House, King's College London, London, SE1 1UL, UK
| | - Eugene V Makeyev
- Centre for Developmental Neurobiology, New Hunt's House, King's College London, London, SE1 1UL, UK.
| |
Collapse
|
13
|
Paterson AH, Queitsch C. Genome organization and botanical diversity. THE PLANT CELL 2024; 36:1186-1204. [PMID: 38382084 PMCID: PMC11062460 DOI: 10.1093/plcell/koae045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 02/07/2024] [Accepted: 02/07/2024] [Indexed: 02/23/2024]
Abstract
The rich diversity of angiosperms, both the planet's dominant flora and the cornerstone of agriculture, is integrally intertwined with a distinctive evolutionary history. Here, we explore the interplay between angiosperm genome organization and botanical diversity, empowered by genomic approaches ranging from genetic linkage mapping to analysis of gene regulation. Commonality in the genetic hardware of plants has enabled robust comparative genomics that has provided a broad picture of angiosperm evolution and implicated both general processes and specific elements in contributing to botanical diversity. We argue that the hardware of plant genomes-both in content and in dynamics-has been shaped by selection for rather substantial differences in gene regulation between plants and animals such as maize and human, organisms of comparable genome size and gene number. Their distinctive genome content and dynamics may reflect in part the indeterminate development of plants that puts strikingly different demands on gene regulation than in animals. Repeated polyploidization of plant genomes and multiplication of individual genes together with extensive rearrangement and differential retention provide rich raw material for selection of morphological and/or physiological variations conferring fitness in specific niches, whether natural or artificial. These findings exemplify the burgeoning information available to employ in increasing knowledge of plant biology and in modifying selected plants to better meet human needs.
Collapse
Affiliation(s)
- Andrew H Paterson
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, USA
| | - Christine Queitsch
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| |
Collapse
|
14
|
Xing L, Gkini V, Nieminen AI, Zhou HC, Aquilino M, Naumann R, Reppe K, Tanaka K, Carmeliet P, Heikinheimo O, Pääbo S, Huttner WB, Namba T. Functional synergy of a human-specific and an ape-specific metabolic regulator in human neocortex development. Nat Commun 2024; 15:3468. [PMID: 38658571 PMCID: PMC11043075 DOI: 10.1038/s41467-024-47437-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Accepted: 04/02/2024] [Indexed: 04/26/2024] Open
Abstract
Metabolism has recently emerged as a major target of genes implicated in the evolutionary expansion of human neocortex. One such gene is the human-specific gene ARHGAP11B. During human neocortex development, ARHGAP11B increases the abundance of basal radial glia, key progenitors for neocortex expansion, by stimulating glutaminolysis (glutamine-to-glutamate-to-alpha-ketoglutarate) in mitochondria. Here we show that the ape-specific protein GLUD2 (glutamate dehydrogenase 2), which also operates in mitochondria and converts glutamate-to-αKG, enhances ARHGAP11B's ability to increase basal radial glia abundance. ARHGAP11B + GLUD2 double-transgenic bRG show increased production of aspartate, a metabolite essential for cell proliferation, from glutamate via alpha-ketoglutarate and the TCA cycle. Hence, during human evolution, a human-specific gene exploited the existence of another gene that emerged during ape evolution, to increase, via concerted changes in metabolism, progenitor abundance and neocortex size.
Collapse
Affiliation(s)
- Lei Xing
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.
- Department of Biological Sciences, University of Manitoba, Winnipeg, MB, Canada.
| | - Vasiliki Gkini
- Neuroscience Center, HiLIFE - Helsinki Institute of Life Science, University of Helsinki, Helsinki, Finland
| | - Anni I Nieminen
- FIMM Metabolomics Unit, Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | - Hui-Chao Zhou
- Center for Cancer Biology (CCB), VIB-KU Leuven, B-3000, Leuven, Belgium
| | - Matilde Aquilino
- Neuroscience Center, HiLIFE - Helsinki Institute of Life Science, University of Helsinki, Helsinki, Finland
| | - Ronald Naumann
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Katrin Reppe
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Kohichi Tanaka
- Laboratory of Molecular Neuroscience, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan
| | - Peter Carmeliet
- Laboratory of Angiogenesis and Vascular Metabolism, Department of Oncology, KU Leuven, B-3000, Leuven, Belgium
- Laboratory of Angiogenesis and Vascular Metabolism, Center for Cancer Biology, VIB, B-3000, Leuven, Belgium
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Oskari Heikinheimo
- Department of Obstetrics and Gynecology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Svante Pääbo
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Human Evolutionary Genomics Unit, Okinawa Institute of Science and Technology, Okinawa, Onna-son, Japan
| | - Wieland B Huttner
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.
| | - Takashi Namba
- Neuroscience Center, HiLIFE - Helsinki Institute of Life Science, University of Helsinki, Helsinki, Finland.
| |
Collapse
|
15
|
Tajeddin N, Arabfard M, Alizadeh S, Salesi M, Khamse S, Delbari A, Ohadi M. Novel islands of GGC and GCC repeats coincide with human evolution. Gene 2024; 902:148194. [PMID: 38262548 DOI: 10.1016/j.gene.2024.148194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 10/29/2023] [Accepted: 01/18/2024] [Indexed: 01/25/2024]
Abstract
BACKGROUND Because of high mutation rate, overrepresentation in genic regions, and link with various neurological, neurodegenerative, and movement disorders, GGC and GCC short tandem repeats (STRs) are prone to natural selection. Among a number of lacking data, the 3-repeats of these STRs remain widely unexplored. RESULTS In a genome-wide search in human, here we mapped GGC and GCC STRs of ≥3-repeats, and found novel islands of up to 45 of those STRs, populating spans of 1 to 2 kb of genomic DNA. RGPD4 and NOC4L harbored the densest (GGC)3 (probability 3.09061E-71) and (GCC)3 (probability 1.72376E-61) islands, respectively, and were human-specific. We also found prime instances of directional incremented density of STRs at specific loci in human versus other species, including the FOXK2 and SKI GGC islands. The genes containing those islands significantly diverged in expression in human versus other species, and the proteins encoded by those genes interact closely in a physical interaction network, consequence of which may be human-specific characteristics such as higher order brain functions. CONCLUSION We report novel islands of GGC and GCC STRs of evolutionary relevance to human. The density, and in some instances, periodicity of these islands support them as a novel genomic entity, which need to be further explored in evolutionary, mechanistic, and functional platforms.
Collapse
Affiliation(s)
- N Tajeddin
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Arabfard
- Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
| | - S Alizadeh
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Salesi
- Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
| | - S Khamse
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - A Delbari
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Ohadi
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
| |
Collapse
|
16
|
Vervoort L, Dierckxsens N, Santos MS, Meynants S, Souche E, Cools R, Heung T, Devriendt K, Peeters H, McDonald-McGinn DM, Swillen A, Breckpot J, Emanuel BS, Van Esch H, Bassett AS, Vermeesch JR. Multiple paralogues and recombination mechanisms drive the high incidence of 22q11.2 Deletion Syndrome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.14.585046. [PMID: 38562770 PMCID: PMC10983858 DOI: 10.1101/2024.03.14.585046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
The 22q11.2 deletion syndrome (22q11.2DS) is the most common microdeletion disorder. Why the incidence of 22q11.2DS is much greater than that of other genomic disorders remains unknown. Short read sequencing cannot resolve the complex segmental duplicon structure to provide direct confirmation of the hypothesis that the rearrangements are caused by non-allelic homologous recombination between the low copy repeats on chromosome 22 (LCR22s). To enable haplotype-specific assembly and rearrangement mapping in LCR22 regions, we combined fiber-FISH optical mapping with whole genome (ultra-)long read sequencing or rearrangement-specific long-range PCR on 24 duos (22q11.2DS patient and parent-of-origin) comprising several different LCR22-mediated rearrangements. Unexpectedly, we demonstrate that not only different paralogous segmental duplicon but also palindromic AT-rich repeats (PATRR) are driving 22q11.2 rearrangements. In addition, we show the existence of two different inversion polymorphisms preceding rearrangement, and somatic mosaicism. The existence of different recombination sites and mechanisms in paralogues and PATRRs which are copy number expanding in the human population are a likely explanation for the high 22q11.2DS incidence.
Collapse
|
17
|
Arabfard M, Tajeddin N, Alizadeh S, Salesi M, Bayat H, Khorram Khorshid HR, Khamse S, Delbari A, Ohadi M. Dyads of GGC and GCC form hotspot colonies that coincide with the evolution of human and other great apes. BMC Genom Data 2024; 25:21. [PMID: 38383300 PMCID: PMC10880355 DOI: 10.1186/s12863-024-01207-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 02/11/2024] [Indexed: 02/23/2024] Open
Abstract
BACKGROUND GGC and GCC short tandem repeats (STRs) are of various evolutionary, biological, and pathological implications. However, the fundamental two-repeats (dyads) of these STRs are widely unexplored. RESULTS On a genome-wide scale, we mapped (GGC)2 and (GCC)2 dyads in human, and found monumental colonies (distance between each dyad < 500 bp) of extraordinary density, and in some instances periodicity. The largest (GCC)2 and (GGC)2 colonies were intergenic, homogeneous, and human-specific, consisting of 219 (GCC)2 on chromosome 2 (probability < 1.545E-219) and 70 (GGC)2 on chromosome 9 (probability = 1.809E-148). We also found that several colonies were shared in other great apes, and directionally increased in density and complexity in human, such as a colony of 99 (GCC)2 on chromosome 20, that specifically expanded in great apes, and reached maximum complexity in human (probability 1.545E-220). Numerous other colonies of evolutionary relevance in human were detected in other largely overlooked regions of the genome, such as chromosome Y and pseudogenes. Several of the genes containing or nearest to those colonies were divergently expressed in human. CONCLUSION In conclusion, (GCC)2 and (GGC)2 form unprecedented genomic colonies that coincide with the evolution of human and other great apes. The extent of the genomic rearrangements leading to those colonies support overlooked recombination hotspots, shared across great apes. The identified colonies deserve to be studied in mechanistic, evolutionary, and functional platforms.
Collapse
Affiliation(s)
- M Arabfard
- Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
| | - N Tajeddin
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- Department of Biology, Central Tehran Branch, Islamic Azad University, Tehran, Iran
| | - S Alizadeh
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Salesi
- Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
- Research Center for Prevention of Oral and Dental Diseases, Baqiyatallah University of Medical Sciences, Tehran, Iran
| | - H Bayat
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - H R Khorram Khorshid
- Personalized Medicine and Genometabolomics Research Center, Hope Generation Foundation, Tehran, Iran
| | - S Khamse
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - A Delbari
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Ohadi
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
| |
Collapse
|
18
|
Audano PA, Beck CR. Small polymorphisms are a source of ancestral bias in structural variant breakpoint placement. Genome Res 2024; 34:7-19. [PMID: 38176712 PMCID: PMC10904011 DOI: 10.1101/gr.278203.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 01/02/2024] [Indexed: 01/06/2024]
Abstract
High-quality genome assemblies and sophisticated algorithms have increased sensitivity for a wide range of variant types, and breakpoint accuracy for structural variants (SVs, ≥50 bp) has improved to near base pair precision. Despite these advances, many SV breakpoint locations are subject to systematic bias affecting variant representation. To understand why SV breakpoints are inconsistent across samples, we reanalyzed 64 phased haplotypes constructed from long-read assemblies released by the Human Genome Structural Variation Consortium (HGSVC). We identify 882 SV insertions and 180 SV deletions with variable breakpoints not anchored in tandem repeats (TRs) or segmental duplications (SDs). SVs called from aligned sequencing reads increase breakpoint disagreements by 2×-16×. Sequence accuracy had a minimal impact on breakpoints, but we observe a strong effect of ancestry. We confirm that SNP and indel polymorphisms are enriched at shifted breakpoints and are also absent from variant callsets. Breakpoint homology increases the likelihood of imprecise SV calls and the distance they are shifted, and tandem duplications are the most heavily affected SVs. Because graph genome methods normalize SV calls across samples, we investigated graphs generated by two different methods and find the resulting breakpoints are subject to other technical biases affecting breakpoint accuracy. The breakpoint inconsistencies we characterize affect ∼5% of the SVs called in a human genome and can impact variant interpretation and annotation. These limitations underscore a need for algorithm development to improve SV databases, mitigate the impact of ancestry on breakpoints, and increase the value of callsets for investigating breakpoint features.
Collapse
Affiliation(s)
- Peter A Audano
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA;
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| |
Collapse
|
19
|
Gaudó P, de Tomás-Mateo E, Garrido-Pérez N, Santana A, Ruiz-Pesini E, Montoya J, Bayona-Bafaluy P. "ATAD3C regulates ATAD3A assembly and function in the mitochondrial membrane". Free Radic Biol Med 2024; 211:114-126. [PMID: 38092275 DOI: 10.1016/j.freeradbiomed.2023.12.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 11/28/2023] [Accepted: 12/07/2023] [Indexed: 12/21/2023]
Abstract
Mitochondrial ATAD3A is an ATPase Associated with diverse cellular Activities (AAA) domain containing enzyme, involved in the structural organization of the inner mitochondrial membrane and of increasing importance in childhood disease. In humans, two ATAD3A paralogs arose by gene duplication during evolution: ATAD3B and ATAD3C. Here we investigate the cellular activities of the ATAD3C paralog that has been considered a pseudogene. We detected unique ATAD3C peptides in HEK 293T cells, with expression similar to that in human tissues, and showed that it is an integral membrane protein that exposes its carboxy-terminus to the intermembrane space. Overexpression of ATAD3C, but not of ATAD3A, in fibroblasts caused a decrease in cell proliferation and oxygen consumption rate, and an increase of cellular ROS. This was due to the incorporation of ATAD3C monomers in ATAD3A complex in the mitochondrial membrane reducing its size. Consistent with a negative regulation of ATAD3A function in mitochondrial membrane organization, ATAD3C expression led to increased accumulation of respiratory chain dimeric CIII in the inner membrane, to the detriment to that assembled in respiratory supercomplexes. Our results demonstrate a negative dominant role of the ATAD3C paralog with implications for mitochondrial OXPHOS function and suggest that its expression regulates ATAD3A in the cell.
Collapse
Affiliation(s)
- Paula Gaudó
- Biochemistry and Molecular Biology Department. Universidad de Zaragoza, 50009- and 50013, Zaragoza, Spain
| | - Elena de Tomás-Mateo
- Biochemistry and Molecular Biology Department. Universidad de Zaragoza, 50009- and 50013, Zaragoza, Spain
| | - Nuria Garrido-Pérez
- Biochemistry and Molecular Biology Department. Universidad de Zaragoza, 50009- and 50013, Zaragoza, Spain; Institute for Health Research (IIS) de Aragón, 50009, Zaragoza, Spain; Rare Diseases Networking Biomedical Research Centre (CIBERER), 28029, Madrid, Spain; Institute for Biocomputation and Physics of Complex Systems, University of Zaragoza, 50018, Zaragoza, Spain
| | - Alfredo Santana
- Research Institute of Biomedical and Health Sciences (IUIBS), University of Las Palmas de Gran Canaria, 35001, Las Palmas de Gran Canaria, Spain; Clinical Genetics Unit, Complejo Hospitarlario Universitario Insular-Materno Infantil de Las Palamas de Gran Canaria, 35016, Las Palmas de Gran Canaria, Spain
| | - Eduardo Ruiz-Pesini
- Institute for Health Research (IIS) de Aragón, 50009, Zaragoza, Spain; Rare Diseases Networking Biomedical Research Centre (CIBERER), 28029, Madrid, Spain.
| | - Julio Montoya
- Biochemistry and Molecular Biology Department. Universidad de Zaragoza, 50009- and 50013, Zaragoza, Spain; Institute for Health Research (IIS) de Aragón, 50009, Zaragoza, Spain; Rare Diseases Networking Biomedical Research Centre (CIBERER), 28029, Madrid, Spain
| | - Pilar Bayona-Bafaluy
- Biochemistry and Molecular Biology Department. Universidad de Zaragoza, 50009- and 50013, Zaragoza, Spain; Institute for Health Research (IIS) de Aragón, 50009, Zaragoza, Spain; Rare Diseases Networking Biomedical Research Centre (CIBERER), 28029, Madrid, Spain; Institute for Biocomputation and Physics of Complex Systems, University of Zaragoza, 50018, Zaragoza, Spain.
| |
Collapse
|
20
|
Maeng JH, Jang HJ, Du AY, Tzeng SC, Wang T. Using long-read CAGE sequencing to profile cryptic-promoter-derived transcripts and their contribution to the immunopeptidome. Genome Res 2023; 33:2143-2155. [PMID: 38065624 PMCID: PMC10760525 DOI: 10.1101/gr.277061.122] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2023] [Accepted: 11/13/2023] [Indexed: 01/04/2024]
Abstract
Recent studies have shown that the noncoding genome can produce unannotated proteins as antigens that induce immune response. One major source of this activity is the aberrant epigenetic reactivation of transposable elements (TEs). In tumors, TEs often provide cryptic or alternate promoters, which can generate transcripts that encode tumor-specific unannotated proteins. Thus, TE-derived transcripts (TE transcripts) have the potential to produce tumor-specific, but recurrent, antigens shared among many tumors. Identification of TE-derived tumor antigens holds the promise to improve cancer immunotherapy approaches; however, current genomics and computational tools are not optimized for their detection. Here we combined CAGE technology with full-length long-read transcriptome sequencing (long-read CAGE, or LRCAGE) and developed a suite of computational tools to significantly improve immunopeptidome detection by incorporating TE and other tumor transcripts into the proteome database. By applying our methods to human lung cancer cell line H1299 data, we show that long-read technology significantly improves mapping of promoters with low mappability scores and that LRCAGE guarantees accurate construction of uncharacterized 5' transcript structure. Augmenting a reference proteome database with newly characterized transcripts enabled us to detect noncanonical antigens from HLA-pulldown LC-MS/MS data. Lastly, we show that epigenetic treatment increased the number of noncanonical antigens, particularly those encoded by TE transcripts, which might expand the pool of targetable antigens for cancers with low mutational burden.
Collapse
Affiliation(s)
- Ju Heon Maeng
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - H Josh Jang
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Alan Y Du
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Shin-Cheng Tzeng
- Donald Danforth Plant Science Center, St. Louis, Missouri 63132, USA
| | - Ting Wang
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA;
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| |
Collapse
|
21
|
Chaisson MJP, Sulovari A, Valdmanis PN, Miller DE, Eichler EE. Advances in the discovery and analyses of human tandem repeats. Emerg Top Life Sci 2023; 7:361-381. [PMID: 37905568 PMCID: PMC10806765 DOI: 10.1042/etls20230074] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 10/18/2023] [Accepted: 10/18/2023] [Indexed: 11/02/2023]
Abstract
Long-read sequencing platforms provide unparalleled access to the structure and composition of all classes of tandemly repeated DNA from STRs to satellite arrays. This review summarizes our current understanding of their organization within the human genome, their importance with respect to disease, as well as the advances and challenges in understanding their genetic diversity and functional effects. Novel computational methods are being developed to visualize and associate these complex patterns of human variation with disease, expression, and epigenetic differences. We predict accurate characterization of this repeat-rich form of human variation will become increasingly relevant to both basic and clinical human genetics.
Collapse
Affiliation(s)
- Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, U.S.A
- The Genomic and Epigenomic Regulation Program, USC Norris Cancer Center, University of Southern California, Los Angeles, CA 90089, U.S.A
| | - Arvis Sulovari
- Computational Biology, Cajal Neuroscience Inc, Seattle, WA 98102, U.S.A
| | - Paul N Valdmanis
- Division of Medical Genetics, Department of Medicine, University of Washington School of Medicine, Seattle, WA 98195, U.S.A
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, U.S.A
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, U.S.A
| | - Danny E Miller
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, U.S.A
- Brotman Baty Institute for Precision Medicine, University of Washington, Seattle, WA 98195, U.S.A
- Department of Pediatrics, University of Washington, Seattle, WA 98195, U.S.A
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, U.S.A
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, U.S.A
| |
Collapse
|
22
|
Lu M, Cao M, Yang J, Swenson NG. Comparative transcriptomics reveals divergence in pathogen response gene families amongst 20 forest tree species. G3 (BETHESDA, MD.) 2023; 13:jkad233. [PMID: 37812763 PMCID: PMC10700026 DOI: 10.1093/g3journal/jkad233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 09/22/2023] [Accepted: 09/26/2023] [Indexed: 10/11/2023]
Abstract
Forest trees provide critical ecosystem services for humanity that are under threat due to ongoing global change. Measuring and characterizing genetic diversity are key to understanding adaptive potential and developing strategies to mitigate negative consequences arising from climate change. In the area of forest genetic diversity, genetic divergence caused by large-scale changes at the chromosomal level has been largely understudied. In this study, we used the RNA-seq data of 20 co-occurring forest trees species from genera including Acer, Alnus, Amelanchier, Betula, Cornus, Corylus, Dirca, Fraxinus, Ostrya, Populus, Prunus, Quercus, Ribes, Tilia, and Ulmus sampled from Upper Peninsula of Michigan. These data were used to infer the origin and maintenance of gene family variation, species divergence time, as well as gene family expansion and contraction. We identified a signal of common whole genome duplication events shared by core eudicots. We also found rapid evolution, namely fast expansion or fast contraction of gene families, in plant-pathogen interaction genes amongst the studied diploid species. Finally, the results lay the foundation for further research on the genetic diversity and adaptive capacity of forest trees, which will inform forest management and conservation policies.
Collapse
Affiliation(s)
- Mengmeng Lu
- Department of Biological Sciences, University of Notre Dame, 100 Galvin Life Sciences, Notre Dame, IN 46556, USA
| | - Min Cao
- CAS Key Laboratory of Tropical Forest Ecology, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla, Yunnan 666303, China
| | - Jie Yang
- CAS Key Laboratory of Tropical Forest Ecology, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla, Yunnan 666303, China
| | - Nathan G Swenson
- Department of Biological Sciences, University of Notre Dame, 100 Galvin Life Sciences, Notre Dame, IN 46556, USA
- University of Notre Dame Environmental Research Center (UNDERC), 736 Flanner Hall, Notre Dame, IN 46556, USA
| |
Collapse
|
23
|
Klussmeier A, Putke K, Klasberg S, Kohler M, Sauter J, Schefzyk D, Schöfl G, Massalski C, Schäfer G, Schmidt AH, Roers A, Lange V. High population frequencies of MICA copy number variations originate from independent recombination events. Front Immunol 2023; 14:1297589. [PMID: 38035108 PMCID: PMC10684724 DOI: 10.3389/fimmu.2023.1297589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 10/24/2023] [Indexed: 12/02/2023] Open
Abstract
MICA is a stress-induced ligand of the NKG2D receptor that stimulates NK and T cell responses and was identified as a key determinant of anti-tumor immunity. The MICA gene is located inside the MHC complex and is in strong linkage disequilibrium with HLA-B. While an HLA-B*48-linked MICA deletion-haplotype was previously described in Asian populations, little is known about other MICA copy number variations. Here, we report the genotyping of more than two million individuals revealing high frequencies of MICA duplications (1%) and MICA deletions (0.4%). Their prevalence differs between ethnic groups and can rise to 2.8% (Croatia) and 9.2% (Mexico), respectively. Targeted sequencing of more than 70 samples indicates that these copy number variations originate from independent nonallelic homologous recombination events between segmental duplications upstream of MICA and MICB. Overall, our data warrant further investigation of disease associations and consideration of MICA copy number data in oncological study protocols.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Axel Roers
- Institute for Immunology, Medical Faculty Carl Gustav Carus, University of Technology (TU) Dresden, Dresden, Germany
- Institute for Immunology, University Hospital Heidelberg, Heidelberg, Germany
| | | |
Collapse
|
24
|
Feng LY, Lin PF, Xu RJ, Kang HQ, Gao LZ. Comparative Genomic Analysis of Asian Cultivated Rice and Its Wild Progenitor ( Oryza rufipogon) Has Revealed Evolutionary Innovation of the Pentatricopeptide Repeat Gene Family through Gene Duplication. Int J Mol Sci 2023; 24:16313. [PMID: 38003501 PMCID: PMC10671101 DOI: 10.3390/ijms242216313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 11/10/2023] [Accepted: 11/12/2023] [Indexed: 11/26/2023] Open
Abstract
The pentatricopeptide repeat (PPR) gene family is one of the largest gene families in land plants. However, current knowledge about the evolution of the PPR gene family remains largely limited. In this study, we performed a comparative genomic analysis of the PPR gene family in O. sativa and its wild progenitor, O. rufipogon, and outlined a comprehensive landscape of gene duplications. Our findings suggest that the majority of PPR genes originated from dispersed duplications. Although segmental duplications have only expanded approximately 11.30% and 13.57% of the PPR gene families in the O. sativa and O. rufipogon genomes, we interestingly obtained evidence that segmental duplication promotes the structural diversity of PPR genes through incomplete gene duplications. In the O. sativa and O. rufipogon genomes, 10 (~33.33%) and 22 pairs of gene duplications (~45.83%) had non-PPR paralogous genes through incomplete gene duplication. Segmental duplications leading to incomplete gene duplications might result in the acquisition of domains, thus promoting functional innovation and structural diversification of PPR genes. This study offers a unique perspective on the evolution of PPR gene structures and underscores the potential role of segmental duplications in PPR gene structural diversity.
Collapse
Affiliation(s)
- Li-Ying Feng
- Institution of Genomics and Bioinformatics, South China Agricultural University, Guangzhou 510642, China; (L.-Y.F.); (P.-F.L.)
| | - Pei-Fan Lin
- Institution of Genomics and Bioinformatics, South China Agricultural University, Guangzhou 510642, China; (L.-Y.F.); (P.-F.L.)
| | - Rong-Jing Xu
- Tropical Biodiversity and Genomics Research Center, Hainan University, Haikou 570228, China; (R.-J.X.); (H.-Q.K.)
| | - Hai-Qi Kang
- Tropical Biodiversity and Genomics Research Center, Hainan University, Haikou 570228, China; (R.-J.X.); (H.-Q.K.)
| | - Li-Zhi Gao
- Institution of Genomics and Bioinformatics, South China Agricultural University, Guangzhou 510642, China; (L.-Y.F.); (P.-F.L.)
- Tropical Biodiversity and Genomics Research Center, Hainan University, Haikou 570228, China; (R.-J.X.); (H.-Q.K.)
| |
Collapse
|
25
|
Pezzi PH, Gonçalves LT, Deprá M, Freitas LBD. Evolution and diversification of the O-methyltransferase (OMT) gene family in Solanaceae. Genet Mol Biol 2023; 46:e20230121. [PMID: 37948506 PMCID: PMC10637433 DOI: 10.1590/1678-4685-gmb-2023-0121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/30/2023] [Indexed: 11/12/2023] Open
Abstract
O-methyltransferases (OMTs) are a group of enzymes involved in several fundamental biological processes in plants, including lignin biosynthesis, pigmentation, and aroma production. Despite the intensive investigation of the role of OMTs in plant secondary metabolism, the evolution and diversification of this gene family in Solanaceae remain poorly understood. Here, we conducted a genome-wide survey of OMT genes in six Solanaceae species, reconstructing gene phylogenetic trees, predicting the potential involvement in biological processes, and investigating the exon/intron structure and chromosomal location. We identified 57 caffeoyl-CoA OMTs (CCoAOMTs) and 196 caffeic acid OMTs (COMTs) in the studied species. We observed a conserved gene block on chromosome 2 that consisted of tandem duplicated copies of OMT genes. Our results suggest that the expansion of the OMT gene family in Solanaceae was driven by whole genome duplication, segmental duplication, and tandem duplication, with multiple genes being retained by neofunctionalization and subfunctionalization. This study represents an essential first step in unraveling the evolutionary history of OMTs in Solanaceae. Our findings deepen our understanding of the crucial role of OMTs in several biological processes and highlight their significance as potential biotechnological targets.
Collapse
Affiliation(s)
- Pedro Henrique Pezzi
- Universidade Federal do Rio Grande do Sul, Departamento de Genética, Porto Alegre, RS, Brazil
| | | | - Maríndia Deprá
- Universidade Federal do Rio Grande do Sul, Departamento de Genética, Porto Alegre, RS, Brazil
| | | |
Collapse
|
26
|
Desgraupes S, Etienne L, Arhel NJ. RANBP2 evolution and human disease. FEBS Lett 2023; 597:2519-2533. [PMID: 37795679 DOI: 10.1002/1873-3468.14749] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 09/23/2023] [Accepted: 09/25/2023] [Indexed: 10/06/2023]
Abstract
Ran-binding protein 2 (RANBP2)/Nup358 is a nucleoporin and a key component of the nuclear pore complex. Through its multiple functions (e.g., SUMOylation, regulation of nucleocytoplasmic transport) and subcellular localizations (e.g., at the nuclear envelope, kinetochores, annulate lamellae), it is involved in many cellular processes. RANBP2 dysregulation or mutation leads to the development of human pathologies, such as acute necrotizing encephalopathy 1, cancer, neurodegenerative diseases, and it is also involved in viral infections. The chromosomal region containing the RANBP2 gene is highly dynamic, with high structural variation and recombination events that led to the appearance of a gene family called RANBP2 and GCC2 Protein Domains (RGPD), with multiple gene loss/duplication events during ape evolution. Although RGPD homoplasy and maintenance during evolution suggest they might confer an advantage to their hosts, their functions are still unknown and understudied. In this review, we discuss the appearance and importance of RANBP2 in metazoans and its function-related pathologies, caused by an alteration of its expression levels (through promotor activity, post-transcriptional, or post-translational modifications), its localization, or genetic mutations.
Collapse
Affiliation(s)
- Sophie Desgraupes
- Institut de Recherche en Infectiologie de Montpellier (IRIM), University of Montpellier, France
| | - Lucie Etienne
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR 5308, ENS de Lyon, Université de Lyon, France
| | - Nathalie J Arhel
- Institut de Recherche en Infectiologie de Montpellier (IRIM), University of Montpellier, France
| |
Collapse
|
27
|
Du X, Yu H, Wang Y, Liu J, Zhang Q. Comparative Studies on Duplicated foxl2 Paralogs in Spotted Knifejaw Oplegnathus punctatus Show Functional Diversification. Genes (Basel) 2023; 14:1847. [PMID: 37895196 PMCID: PMC10606028 DOI: 10.3390/genes14101847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 09/18/2023] [Accepted: 09/21/2023] [Indexed: 10/29/2023] Open
Abstract
As a member of the forkhead box L gene family, foxl2 plays a significant role in gonadal development and the regulation of reproduction. During the evolution of deuterostome, whole genome duplication (WGD)-enriched lineage diversifications and regulation mechanisms occurs. However, only limited research exists on foxl2 duplication in teleost or other vertebrate species. In this study, two foxl2 paralogs, foxl2 and foxl2l, were identified in the transcriptome of spotted knifejaw (Oplegnathus punctatus), which had varying expressions in the gonads. The foxl2 was expressed higher in the ovary, while foxl2l was expressed higher in the testis. Phylogenetic reconstruction, synteny analysis, and the molecular evolution test confirmed that foxl2 and foxl2l likely originated from the first two WGD. The expression patterns test using qRT-PCR and ISH as well as motif scan analysis revealed evidence of potentially functional divergence between the foxl2 and foxl2l paralogs in spotted knifejaw. Our results indicate that foxl2 and foxl2l may originate from the first two WGD, be active in transcription, and have undergone functional divergence. These results shed new light on the evolutionary trajectories of foxl2 and foxl2l and highlights the need for further detailed functional analysis of these two duplicated paralogs.
Collapse
Affiliation(s)
- Xinxin Du
- School of Life Science and Bioengineering, Jining University, Jining 273155, China; (X.D.); (H.Y.)
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, College of Marine Life Sciences, Ocean University of China, No. 5 Yushan Road, Qingdao 266003, China; (Y.W.); (J.L.)
| | - Haiyang Yu
- School of Life Science and Bioengineering, Jining University, Jining 273155, China; (X.D.); (H.Y.)
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, College of Marine Life Sciences, Ocean University of China, No. 5 Yushan Road, Qingdao 266003, China; (Y.W.); (J.L.)
| | - Yujue Wang
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, College of Marine Life Sciences, Ocean University of China, No. 5 Yushan Road, Qingdao 266003, China; (Y.W.); (J.L.)
| | - Jinxiang Liu
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, College of Marine Life Sciences, Ocean University of China, No. 5 Yushan Road, Qingdao 266003, China; (Y.W.); (J.L.)
| | - Quanqi Zhang
- Key Laboratory of Marine Genetics and Breeding, Ministry of Education, College of Marine Life Sciences, Ocean University of China, No. 5 Yushan Road, Qingdao 266003, China; (Y.W.); (J.L.)
| |
Collapse
|
28
|
Huang X, Lu Z, Jiang X, Zhang Z, Yan K, Yu G. Single-cell RNA sequencing reveals distinct tumor microenvironment of ground glass nodules and solid nodules in lung adenocarcinoma. Front Cell Dev Biol 2023; 11:1198338. [PMID: 37745301 PMCID: PMC10513029 DOI: 10.3389/fcell.2023.1198338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2023] [Accepted: 08/04/2023] [Indexed: 09/26/2023] Open
Abstract
Introduction: Lung adenocarcinoma (LUAD) is the most prevalent lung cancer. LUAD presents as ground glass nodules (GGN) and solid nodules (SN) in imaging studies. GGN is an early type of LUAD with good prognosis. However, SN exhibits a more malignant behavior than GGN, including worse pathological staging and tumor prognosis. The mechanism leading to the different malignancy levels of GGN and SN remains elusive. Methods: Three patients with GGN and three patients with SN diagnosed with early LUAD were enrolled. The tumor samples were digested to a single-cell suspension and analyzed using 10× Genomic Single-cell ribonucleic acid sequences (scRNA-seq) techniques. Results: A total of 15,902 cells were obtained and classified into nine major types. The tumor microenvironment (TME) was subsequently described in detail. ScRNA-seq revealed that ribosome-related pathways and cell adhesion played similar but distinct roles in the two groups. SN also had more active cell proliferation, enriched cell cycle regulatory pathways, and severe inflammatory responses. Conclusion: We observed changes in the cellular composition and transcriptomic profile of GGN and SN. The study improved the understanding of the underlying mechanisms of lung carcinogenesis and contributed to lung cancer prevention and treatment.
Collapse
Affiliation(s)
| | | | | | | | | | - Guiping Yu
- Department of Cardiothoracic Surgery, Jiangyin Clinical College of Xuzhou Medical University, Jiangyin, China
| |
Collapse
|
29
|
Wang L, Dong B, Xie Y, Kang H, Wu Y. The molecular mechanisms of recombinant chromosome 18 with parental pericentric inversions and a review of the literature. J Hum Genet 2023; 68:625-634. [PMID: 37161033 DOI: 10.1038/s10038-023-01157-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Revised: 04/07/2023] [Accepted: 04/26/2023] [Indexed: 05/11/2023]
Abstract
Chromosomal rearrangements mostly result from non-allelic homologous recombination mediated by low-copy repeats (LCRs) or segmental duplications (SDs). Recent studies on recombinant chromosome 18 (rec (18)) have focused on diagnoses and clinical phenotypes. We diagnosed two cases of prenatal rec (18) and identified precise breakpoint intervals using karyotype and chromosomal microarray analyses. We analyzed the distribution characteristics of breakpoint repetitive elements to infer rearrangement mechanisms and reviewed relevant literature to identify genetic trends. Among the 12 families with 25 pregnancies analyzed, 68% rec (18), 24% spontaneous abortions, and 8% normal births were reported. In the 17 rec (18) cases, 65% presented maternal origin and 35% were paternal. Short-arm breakpoints at p11.31 were reported in 10 cases, whereas the long-arm breakpoints were located at q21.3 (6 cases) and q12 (4 cases). Breakpoints of pericentric inversions on chromosome 18 are concentrated in p11.31, q21.3, and q12 regions. Rearrangements at 18p11.31 are non-recurrent events. ALUs, LINE1s, and MIRs were enriched at the breakpoint regions (1.85 to 3.42-fold enrichment over the entire chromosome 18), while SDs and LCRs were absent. ALU subfamilies had sequence identities of 85.94% and 83.01% between two pair breakpoints. Small repetitive elements may mediate recombination-coupled DNA repair processes, facilitating rearrangements on chromosome 18. Maternal inversion carriers are more prone to abnormal recombination in prenatal families with rec (18). Recombinant chromosomes may present preferential segregation during gamete formation.
Collapse
Affiliation(s)
- Lingxi Wang
- Prenatal Diagnosis Center, Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 611731, China.
| | - Bing Dong
- Department of Eugenics, Meishan Women and Children's Hospital, Alliance Hospital of West China Second University Hospital, Sichuan University, Meishan, 620000, China
| | - Yamei Xie
- Prenatal Diagnosis Center, Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 611731, China
| | - Han Kang
- Prenatal Diagnosis Center, Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 611731, China
| | - Yong Wu
- Prenatal Diagnosis Center, Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 611731, China
| |
Collapse
|
30
|
Li B, Gschwend AR. Vitis labrusca genome assembly reveals diversification between wild and cultivated grapevine genomes. FRONTIERS IN PLANT SCIENCE 2023; 14:1234130. [PMID: 37719220 PMCID: PMC10501149 DOI: 10.3389/fpls.2023.1234130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/03/2023] [Accepted: 08/03/2023] [Indexed: 09/19/2023]
Abstract
Wild grapevines are important genetic resources in breeding programs to confer adaptive fitness traits and unique fruit characteristics, but the genetics underlying these traits, and their evolutionary origins, are largely unknown. To determine the factors that contributed to grapevine genome diversification, we performed comprehensive intragenomic and intergenomic analyses with three cultivated European (including the PN40024 reference genome) and two wild North American grapevine genomes, including our newly released Vitis labrusca genome. We found the heterozygosity of the cultivated grapevine genomes was twice as high as the wild grapevine genomes studied. Approximately 30% of V. labrusca and 48% of V. vinifera Chardonnay genes were heterozygous or hemizygous and a considerable number of collinear genes between Chardonnay and V. labrusca had different gene zygosity. Our study revealed evidence that supports gene gain-loss events in parental genomes resulted in the inheritance of hemizygous genes in the Chardonnay genome. Thousands of segmental duplications supplied source material for genome-specific genes, further driving diversification of the genomes studied. We found an enrichment of recently duplicated, adaptive genes in similar functional pathways, but differential retention of environment-specific adaptive genes within each genome. For example, large expansions of NLR genes were discovered in the two wild grapevine genomes studied. Our findings support variation in transposable elements contributed to unique traits in grapevines. Our work revealed gene zygosity, segmental duplications, gene gain-and-loss variations, and transposable element polymorphisms can be key driving forces for grapevine genome diversification.
Collapse
Affiliation(s)
| | - Andrea R. Gschwend
- Department of Horticulture and Crop Science, The Ohio State University, Columbus, OH, United States
| |
Collapse
|
31
|
Wang H, Makowski C, Zhang Y, Qi A, Kaufmann T, Smeland OB, Fiecas M, Yang J, Visscher PM, Chen CH. Chromosomal inversion polymorphisms shape human brain morphology. Cell Rep 2023; 42:112896. [PMID: 37505983 PMCID: PMC10508191 DOI: 10.1016/j.celrep.2023.112896] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 06/27/2023] [Accepted: 07/13/2023] [Indexed: 07/30/2023] Open
Abstract
The impact of chromosomal inversions on human brain morphology remains underexplored. We studied 35 common inversions classified from genotypes of 33,018 adults with European ancestry. The inversions at 2p22.3, 16p11.2, and 17q21.31 reach genome-wide significance, followed by 8p23.1 and 6p21.33, in their association with cortical and subcortical morphology. The 17q21.31, 8p23.1, and 16p11.2 regions comprise the LRRC37, OR7E, and NPIP duplicated gene families. We find the 17q21.31 MAPT inversion region, known for harboring neurological risk, to be the most salient locus among common variants for shaping and patterning the cortex. Overall, we observe the inverted orientations decreasing brain size, with the exception that the 2p22.3 inversion is associated with increased subcortical volume and the 8p23.1 inversion is associated with increased motor cortex. These significant inversions are in the genomic hotspots of neuropsychiatric loci. Our findings are generalizable to 3,472 children and demonstrate inversions as essential genetic variation to understand human brain phenotypes.
Collapse
Affiliation(s)
- Hao Wang
- Center for Multimodal Imaging and Genetics, University of California San Diego, La Jolla, CA 92093, USA
| | - Carolina Makowski
- Center for Multimodal Imaging and Genetics, University of California San Diego, La Jolla, CA 92093, USA
| | - Yanxiao Zhang
- Ludwig Institute for Cancer Research, La Jolla, CA 92093, USA; School of Life Sciences, Westlake University, Hangzhou, Zhejiang 310024, China; Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang 310024, China
| | - Anna Qi
- Center for Multimodal Imaging and Genetics, University of California San Diego, La Jolla, CA 92093, USA
| | - Tobias Kaufmann
- Department of Psychiatry and Psychotherapy, Tübingen Center for Mental Health, University of Tübingen, 72076 Tübingen, Germany; Norwegian Centre for Mental Disorders Research, Oslo University Hospital and University of Oslo, 0450 Oslo, Norway
| | - Olav B Smeland
- Norwegian Centre for Mental Disorders Research, Oslo University Hospital and University of Oslo, 0450 Oslo, Norway
| | - Mark Fiecas
- Division of Biostatistics, University of Minnesota School of Public Health, Minneapolis, MN 55455, USA
| | - Jian Yang
- School of Life Sciences, Westlake University, Hangzhou, Zhejiang 310024, China; Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang 310024, China
| | - Peter M Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Chi-Hua Chen
- Center for Multimodal Imaging and Genetics, University of California San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
32
|
Abstract
DNA sequencing has revolutionized medicine over recent decades. However, analysis of large structural variation and repetitive DNA, a hallmark of human genomes, has been limited by short-read technology, with read lengths of 100-300 bp. Long-read sequencing (LRS) permits routine sequencing of human DNA fragments tens to hundreds of kilobase pairs in size, using both real-time sequencing by synthesis and nanopore-based direct electronic sequencing. LRS permits analysis of large structural variation and haplotypic phasing in human genomes and has enabled the discovery and characterization of rare pathogenic structural variants and repeat expansions. It has also recently enabled the assembly of a complete, gapless human genome that includes previously intractable regions, such as highly repetitive centromeres and homologous acrocentric short arms. With the addition of protocols for targeted enrichment, direct epigenetic DNA modification detection, and long-range chromatin profiling, LRS promises to launch a new era of understanding of genetic diversity and pathogenic mutations in human populations.
Collapse
Affiliation(s)
- Peter E Warburton
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA; ,
- Center for Advanced Genomics Technology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Robert P Sebra
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA; ,
- Center for Advanced Genomics Technology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Black Family Stem Cell Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| |
Collapse
|
33
|
Purcell RH, Sefik E, Werner E, King AT, Mosley TJ, Merritt-Garza ME, Chopra P, McEachin ZT, Karne S, Raj N, Vaglio BJ, Sullivan D, Firestein BL, Tilahun K, Robinette MI, Warren ST, Wen Z, Faundez V, Sloan SA, Bassell GJ, Mulle JG. Cross-species analysis identifies mitochondrial dysregulation as a functional consequence of the schizophrenia-associated 3q29 deletion. SCIENCE ADVANCES 2023; 9:eadh0558. [PMID: 37585521 PMCID: PMC10431714 DOI: 10.1126/sciadv.adh0558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 07/12/2023] [Indexed: 08/18/2023]
Abstract
The 1.6-megabase deletion at chromosome 3q29 (3q29Del) is the strongest identified genetic risk factor for schizophrenia, but the effects of this variant on neurodevelopment are not well understood. We interrogated the developing neural transcriptome in two experimental model systems with complementary advantages: isogenic human cortical organoids and isocortex from the 3q29Del mouse model. We profiled transcriptomes from isogenic cortical organoids that were aged for 2 and 12 months, as well as perinatal mouse isocortex, all at single-cell resolution. Systematic pathway analysis implicated dysregulation of mitochondrial function and energy metabolism. These molecular signatures were supported by analysis of oxidative phosphorylation protein complex expression in mouse brain and assays of mitochondrial function in engineered cell lines, which revealed a lack of metabolic flexibility and a contribution of the 3q29 gene PAK2. Together, these data indicate that metabolic disruption is associated with 3q29Del and is conserved across species.
Collapse
Affiliation(s)
- Ryan H. Purcell
- Laboratory of Translational Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
- Department of Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
| | - Esra Sefik
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | - Erica Werner
- Department of Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
| | - Alexia T. King
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | - Trenell J. Mosley
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | | | - Pankaj Chopra
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | - Zachary T. McEachin
- Laboratory of Translational Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
- Department of Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
| | - Sridhar Karne
- Department of Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
| | - Nisha Raj
- Laboratory of Translational Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
- Department of Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
| | - Brandon J. Vaglio
- Department of Cell Biology and Neuroscience, Rutgers University, Piscataway, NJ, USA
| | - Dylan Sullivan
- Department of Cell Biology and Neuroscience, Rutgers University, Piscataway, NJ, USA
| | - Bonnie L. Firestein
- Department of Cell Biology and Neuroscience, Rutgers University, Piscataway, NJ, USA
| | - Kedamawit Tilahun
- Department of Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
| | - Maxine I. Robinette
- Department of Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
| | - Stephen T. Warren
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | - Zhexing Wen
- Department of Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
- Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta, GA, USA
| | - Victor Faundez
- Department of Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
| | - Steven A. Sloan
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | - Gary J. Bassell
- Laboratory of Translational Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
- Department of Cell Biology, Emory University School of Medicine, Atlanta, GA, USA
| | - Jennifer G. Mulle
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| |
Collapse
|
34
|
Hanssen R, Auwerx C, Jõeloo M, Sadler MC, Henning E, Keogh J, Bounds R, Smith M, Firth HV, Kutalik Z, Farooqi IS, Reymond A, Lawler K. Chromosomal deletions on 16p11.2 encompassing SH2B1 are associated with accelerated metabolic disease. Cell Rep Med 2023; 4:101155. [PMID: 37586323 PMCID: PMC10439272 DOI: 10.1016/j.xcrm.2023.101155] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 06/08/2023] [Accepted: 07/18/2023] [Indexed: 08/18/2023]
Abstract
New approaches are needed to treat people whose obesity and type 2 diabetes (T2D) are driven by specific mechanisms. We investigate a deletion on chromosome 16p11.2 (breakpoint 2-3 [BP2-3]) encompassing SH2B1, a mediator of leptin and insulin signaling. Phenome-wide association scans in the UK (N = 502,399) and Estonian (N = 208,360) biobanks show that deletion carriers have increased body mass index (BMI; p = 1.3 × 10-10) and increased rates of T2D. Compared with BMI-matched controls, deletion carriers have an earlier onset of T2D, with poorer glycemic control despite higher medication usage. Cystatin C, a biomarker of kidney function, is significantly elevated in deletion carriers, suggesting increased risk of renal impairment. In a Mendelian randomization study, decreased SH2B1 expression increases T2D risk (p = 8.1 × 10-6). We conclude that people with 16p11.2 BP2-3 deletions have early, complex obesity and T2D and may benefit from therapies that enhance leptin and insulin signaling.
Collapse
Affiliation(s)
- Ruth Hanssen
- University of Cambridge Metabolic Research Laboratories, Wellcome-MRC Institute of Metabolic Science and NIHR Cambridge Biomedical Research Centre, Addenbrooke's Hospital, Cambridge CB2 0QQ, UK
| | - Chiara Auwerx
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland; Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland; University Center for Primary Care and Public Health, 1010 Lausanne, Switzerland
| | - Maarja Jõeloo
- Institute of Molecular and Cell Biology, University of Tartu, 51010 Tartu, Estonia; Estonian Genome Centre, Institute of Genomics, University of Tartu, 51010 Tartu, Estonia
| | - Marie C Sadler
- Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland; University Center for Primary Care and Public Health, 1010 Lausanne, Switzerland
| | - Elana Henning
- University of Cambridge Metabolic Research Laboratories, Wellcome-MRC Institute of Metabolic Science and NIHR Cambridge Biomedical Research Centre, Addenbrooke's Hospital, Cambridge CB2 0QQ, UK
| | - Julia Keogh
- University of Cambridge Metabolic Research Laboratories, Wellcome-MRC Institute of Metabolic Science and NIHR Cambridge Biomedical Research Centre, Addenbrooke's Hospital, Cambridge CB2 0QQ, UK
| | - Rebecca Bounds
- University of Cambridge Metabolic Research Laboratories, Wellcome-MRC Institute of Metabolic Science and NIHR Cambridge Biomedical Research Centre, Addenbrooke's Hospital, Cambridge CB2 0QQ, UK
| | - Miriam Smith
- University of Cambridge Metabolic Research Laboratories, Wellcome-MRC Institute of Metabolic Science and NIHR Cambridge Biomedical Research Centre, Addenbrooke's Hospital, Cambridge CB2 0QQ, UK
| | - Helen V Firth
- Department of Clinical Genetics, Cambridge University Hospitals NHS Foundation Trust & Wellcome Sanger Institute, Cambridge, UK
| | - Zoltán Kutalik
- Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland; University Center for Primary Care and Public Health, 1010 Lausanne, Switzerland
| | - I Sadaf Farooqi
- University of Cambridge Metabolic Research Laboratories, Wellcome-MRC Institute of Metabolic Science and NIHR Cambridge Biomedical Research Centre, Addenbrooke's Hospital, Cambridge CB2 0QQ, UK.
| | - Alexandre Reymond
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland.
| | - Katherine Lawler
- University of Cambridge Metabolic Research Laboratories, Wellcome-MRC Institute of Metabolic Science and NIHR Cambridge Biomedical Research Centre, Addenbrooke's Hospital, Cambridge CB2 0QQ, UK.
| |
Collapse
|
35
|
Wilton R, Szalay AS. Short-read aligner performance in germline variant identification. Bioinformatics 2023; 39:btad480. [PMID: 37527006 PMCID: PMC10421969 DOI: 10.1093/bioinformatics/btad480] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 06/01/2023] [Accepted: 07/31/2023] [Indexed: 08/03/2023] Open
Abstract
MOTIVATION Read alignment is an essential first step in the characterization of DNA sequence variation. The accuracy of variant-calling results depends not only on the quality of read alignment and variant-calling software but also on the interaction between these complex software tools. RESULTS In this review, we evaluate short-read aligner performance with the goal of optimizing germline variant-calling accuracy. We examine the performance of three general-purpose short-read aligners-BWA-MEM, Bowtie 2, and Arioc-in conjunction with three germline variant callers: DeepVariant, FreeBayes, and GATK HaplotypeCaller. We discuss the behavior of the read aligners with regard to the data elements on which the variant callers rely, and illustrate how the runtime configurations of these software tools combine to affect variant-calling performance. AVAILABILITY AND IMPLEMENTATION The quick brown fox jumps over the lazy dog.
Collapse
Affiliation(s)
- Richard Wilton
- Department of Physics and Astronomy, Johns Hopkins University, Baltimore, MD 21218, United States
| | - Alexander S Szalay
- Department of Physics and Astronomy, Johns Hopkins University, Baltimore, MD 21218, United States
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, United States
| |
Collapse
|
36
|
Soto DC, Uribe-Salazar JM, Shew CJ, Sekar A, McGinty S, Dennis MY. Genomic structural variation: A complex but important driver of human evolution. AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY 2023; 181 Suppl 76:118-144. [PMID: 36794631 PMCID: PMC10329998 DOI: 10.1002/ajpa.24713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Revised: 01/21/2023] [Accepted: 02/05/2023] [Indexed: 02/17/2023]
Abstract
Structural variants (SVs)-including duplications, deletions, and inversions of DNA-can have significant genomic and functional impacts but are technically difficult to identify and assay compared with single-nucleotide variants. With the aid of new genomic technologies, it has become clear that SVs account for significant differences across and within species. This phenomenon is particularly well-documented for humans and other primates due to the wealth of sequence data available. In great apes, SVs affect a larger number of nucleotides than single-nucleotide variants, with many identified SVs exhibiting population and species specificity. In this review, we highlight the importance of SVs in human evolution by (1) how they have shaped great ape genomes resulting in sensitized regions associated with traits and diseases, (2) their impact on gene functions and regulation, which subsequently has played a role in natural selection, and (3) the role of gene duplications in human brain evolution. We further discuss how to incorporate SVs in research, including the strengths and limitations of various genomic approaches. Finally, we propose future considerations in integrating existing data and biospecimens with the ever-expanding SV compendium propelled by biotechnology advancements.
Collapse
Affiliation(s)
- Daniela C. Soto
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - José M. Uribe-Salazar
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - Colin J. Shew
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - Aarthi Sekar
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - Sean McGinty
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - Megan Y. Dennis
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| |
Collapse
|
37
|
Laufer VA, Glover TW, Wilson TE. Applications of advanced technologies for detecting genomic structural variation. MUTATION RESEARCH. REVIEWS IN MUTATION RESEARCH 2023; 792:108475. [PMID: 37931775 PMCID: PMC10792551 DOI: 10.1016/j.mrrev.2023.108475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 09/07/2023] [Accepted: 11/02/2023] [Indexed: 11/08/2023]
Abstract
Chromosomal structural variation (SV) encompasses a heterogenous class of genetic variants that exerts strong influences on human health and disease. Despite their importance, many structural variants (SVs) have remained poorly characterized at even a basic level, a discrepancy predicated upon the technical limitations of prior genomic assays. However, recent advances in genomic technology can identify and localize SVs accurately, opening new questions regarding SV risk factors and their impacts in humans. Here, we first define and classify human SVs and their generative mechanisms, highlighting characteristics leveraged by various SV assays. We next examine the first-ever gapless assembly of the human genome and the technical process of assembling it, which required third-generation sequencing technologies to resolve structurally complex loci. The new portions of that "telomere-to-telomere" and subsequent pangenome assemblies highlight aspects of SV biology likely to develop in the near-term. We consider the strengths and limitations of the most promising new SV technologies and when they or longstanding approaches are best suited to meeting salient goals in the study of human SV in population-scale genomics research, clinical, and public health contexts. It is a watershed time in our understanding of human SV when new approaches are expected to fundamentally change genomic applications.
Collapse
Affiliation(s)
- Vincent A Laufer
- Department of Pathology, University of Michigan Medical School, Ann Arbor, MI 48109, USA.
| | - Thomas W Glover
- Department of Pathology, University of Michigan Medical School, Ann Arbor, MI 48109, USA; Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI 48109, USA.
| | - Thomas E Wilson
- Department of Pathology, University of Michigan Medical School, Ann Arbor, MI 48109, USA; Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI 48109, USA.
| |
Collapse
|
38
|
Prodanov T, Bansal V. A multilocus approach for accurate variant calling in low-copy repeats using whole-genome sequencing. Bioinformatics 2023; 39:i279-i287. [PMID: 37387146 PMCID: PMC10311303 DOI: 10.1093/bioinformatics/btad268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023] Open
Abstract
MOTIVATION Low-copy repeats (LCRs) or segmental duplications are long segments of duplicated DNA that cover > 5% of the human genome. Existing tools for variant calling using short reads exhibit low accuracy in LCRs due to ambiguity in read mapping and extensive copy number variation. Variants in more than 150 genes overlapping LCRs are associated with risk for human diseases. METHODS We describe a short-read variant calling method, ParascopyVC, that performs variant calling jointly across all repeat copies and utilizes reads independent of mapping quality in LCRs. To identify candidate variants, ParascopyVC aggregates reads mapped to different repeat copies and performs polyploid variant calling. Subsequently, paralogous sequence variants that can differentiate repeat copies are identified using population data and used for estimating the genotype of variants for each repeat copy. RESULTS On simulated whole-genome sequence data, ParascopyVC achieved higher precision (0.997) and recall (0.807) than three state-of-the-art variant callers (best precision = 0.956 for DeepVariant and best recall = 0.738 for GATK) in 167 LCR regions. Benchmarking of ParascopyVC using the genome-in-a-bottle high-confidence variant calls for HG002 genome showed that it achieved a very high precision of 0.991 and a high recall of 0.909 across LCR regions, significantly better than FreeBayes (precision = 0.954 and recall = 0.822), GATK (precision = 0.888 and recall = 0.873) and DeepVariant (precision = 0.983 and recall = 0.861). ParascopyVC demonstrated a consistently higher accuracy (mean F1 = 0.947) than other callers (best F1 = 0.908) across seven human genomes. AVAILABILITY AND IMPLEMENTATION ParascopyVC is implemented in Python and is freely available at https://github.com/tprodanov/ParascopyVC.
Collapse
Affiliation(s)
- Timofey Prodanov
- Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA 92093, United States
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf 40225, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf 40225, Germany
| | - Vikas Bansal
- School of Medicine, University of California San Diego, La Jolla, CA 92093, United States
| |
Collapse
|
39
|
Audano PA, Beck CR. Small allelic variants are a source of ancestral bias in structural variant breakpoint placement. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.25.546295. [PMID: 37425850 PMCID: PMC10327140 DOI: 10.1101/2023.06.25.546295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
High-quality genome assemblies and sophisticated algorithms have increased sensitivity for a wide range of variant types, and breakpoint accuracy for structural variants (SVs, ≥ 50 bp) has improved to near basepair precision. Despite these advances, many SVs in unique regions of the genome are subject to systematic bias that affects breakpoint location. This ambiguity leads to less accurate variant comparisons across samples, and it obscures true breakpoint features needed for mechanistic inferences. To understand why SVs are not consistently placed, we reanalyzed 64 phased haplotypes constructed from long-read assemblies released by the Human Genome Structural Variation Consortium (HGSVC). We identified variable breakpoints for 882 SV insertions and 180 SV deletions not anchored in tandem repeats (TRs) or segmental duplications (SDs). While this is unexpectedly high for genome assemblies in unique loci, we find read-based callsets from the same sequencing data yielded 1,566 insertions and 986 deletions with inconsistent breakpoints also not anchored in TRs or SDs. When we investigated causes for breakpoint inaccuracy, we found sequence and assembly errors had minimal impact, but we observed a strong effect of ancestry. We confirmed that polymorphic mismatches and small indels are enriched at shifted breakpoints and that these polymorphisms are generally lost when breakpoints shift. Long tracts of homology, such as SVs mediated by transposable elements, increase the likelihood of imprecise SV calls and the distance they are shifted. Tandem Duplication (TD) breakpoints are the most heavily affected SV class with 14% of TDs placed at different locations across haplotypes. While graph genome methods normalize SV calls across many samples, the resulting breakpoints are sometimes incorrect, highlighting a need to tune graph methods for breakpoint accuracy. The breakpoint inconsistencies we characterize collectively affect ~5% of the SVs called in a human genome and underscore a need for algorithm development to improve SV databases, mitigate the impact of ancestry on breakpoint placement, and increase the value of callsets for investigating mutational processes.
Collapse
Affiliation(s)
- Peter A Audano
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, CT, USA
| |
Collapse
|
40
|
Shao Y, Zhou L, Li F, Zhao L, Zhang BL, Shao F, Chen JW, Chen CY, Bi X, Zhuang XL, Zhu HL, Hu J, Sun Z, Li X, Wang D, Rivas-González I, Wang S, Wang YM, Chen W, Li G, Lu HM, Liu Y, Kuderna LFK, Farh KKH, Fan PF, Yu L, Li M, Liu ZJ, Tiley GP, Yoder AD, Roos C, Hayakawa T, Marques-Bonet T, Rogers J, Stenson PD, Cooper DN, Schierup MH, Yao YG, Zhang YP, Wang W, Qi XG, Zhang G, Wu DD. Phylogenomic analyses provide insights into primate evolution. Science 2023; 380:913-924. [PMID: 37262173 DOI: 10.1126/science.abn6919] [Citation(s) in RCA: 28] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Accepted: 01/26/2023] [Indexed: 06/03/2023]
Abstract
Comparative analysis of primate genomes within a phylogenetic context is essential for understanding the evolution of human genetic architecture and primate diversity. We present such a study of 50 primate species spanning 38 genera and 14 families, including 27 genomes first reported here, with many from previously less well represented groups, the New World monkeys and the Strepsirrhini. Our analyses reveal heterogeneous rates of genomic rearrangement and gene evolution across primate lineages. Thousands of genes under positive selection in different lineages play roles in the nervous, skeletal, and digestive systems and may have contributed to primate innovations and adaptations. Our study reveals that many key genomic innovations occurred in the Simiiformes ancestral node and may have had an impact on the adaptive radiation of the Simiiformes and human evolution.
Collapse
Affiliation(s)
- Yong Shao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| | - Long Zhou
- Center of Evolutionary & Organismal Biology, and Women's Hospital at Zhejiang University School of Medicine, Hangzhou 310058, China
| | - Fang Li
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, DK-2100 Copenhagen, Denmark
- Institute of Animal Sex and Development, ZhejiangWanli University, Ningbo 315100, China
| | - Lan Zhao
- Shaanxi Key Laboratory for Animal Conservation, College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Bao-Lin Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| | - Feng Shao
- Key Laboratory of Freshwater Fish Reproduction and Development (Ministry of Education), Southwest University School of Life Sciences, Chongqing 400715, China
| | | | - Chun-Yan Chen
- School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China
| | - Xupeng Bi
- Center of Evolutionary & Organismal Biology, and Women's Hospital at Zhejiang University School of Medicine, Hangzhou 310058, China
| | - Xiao-Lin Zhuang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- Kunming College of Life Science, University of the Chinese Academy of Sciences, Kunming 650204, China
| | | | - Jiang Hu
- Grandomics Biosciences, Beijing 102206, China
| | - Zongyi Sun
- Grandomics Biosciences, Beijing 102206, China
| | - Xin Li
- Grandomics Biosciences, Beijing 102206, China
| | - Depeng Wang
- Grandomics Biosciences, Beijing 102206, China
| | | | - Sheng Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| | - Yun-Mei Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| | - Wu Chen
- Guangzhou Zoo & Guangzhou Wildlife Research Center, Guangzhou 510070, China
| | - Gang Li
- College of Life Sciences, Shaanxi Normal University, Xi'an 710119, China
| | - Hui-Meng Lu
- School of Life Sciences, Northwestern Polytechnical University, Xi'an 710072, China
| | - Yang Liu
- College of Life Sciences, Shaanxi Normal University, Xi'an 710119, China
| | - Lukas F K Kuderna
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain
- Illumina Artificial Intelligence Laboratory, Illumina Inc, San Diego, CA 92122, USA
| | - Kyle Kai-How Farh
- Illumina Artificial Intelligence Laboratory, Illumina Inc, San Diego, CA 92122, USA
| | - Peng-Fei Fan
- School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Li Yu
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, School of Life Sciences, Yunnan University, Kunming 650091, China
| | - Ming Li
- CAS Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhi-Jin Liu
- College of Life Sciences, Capital Normal University, Beijing 100048, China
| | - George P Tiley
- Department of Biology, Duke University, Durham, NC 27708, USA
| | - Anne D Yoder
- Department of Biology, Duke University, Durham, NC 27708, USA
| | - Christian Roos
- Gene Bank of Primates and Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research, 37077 Göttingen, Germany
| | - Takashi Hayakawa
- Faculty of Environmental Earth Science, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
- Japan Monkey Centre, Inuyama, Aichi 484-0081, Japan
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010 Barcelona, Spain
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193 Cerdanyola del Vallès, Barcelona, Spain
| | - Jeffrey Rogers
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Peter D Stenson
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, UK
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, UK
| | | | - Yong-Gang Yao
- Kunming College of Life Science, University of the Chinese Academy of Sciences, Kunming 650204, China
- Key Laboratory of Animal Models and Human Disease Mechanisms of Chinese Academy of Sciences & Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650201, China
- National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China
| | - Ya-Ping Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650201, China
- National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China
| | - Wen Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650201, China
| | - Xiao-Guang Qi
- Shaanxi Key Laboratory for Animal Conservation, College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Guojie Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- Center of Evolutionary & Organismal Biology, and Women's Hospital at Zhejiang University School of Medicine, Hangzhou 310058, China
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, DK-2100 Copenhagen, Denmark
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou 311121, China
| | - Dong-Dong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650201, China
- National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China
- KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650204, China
| |
Collapse
|
41
|
Purcell RH, Sefik E, Werner E, King AT, Mosley TJ, Merritt-Garza ME, Chopra P, McEachin ZT, Karne S, Raj N, Vaglio BJ, Sullivan D, Firestein BL, Tilahun K, Robinette MI, Warren ST, Wen Z, Faundez V, Sloan SA, Bassell GJ, Mulle JG. Cross-species transcriptomic analysis identifies mitochondrial dysregulation as a functional consequence of the schizophrenia-associated 3q29 deletion. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.27.525748. [PMID: 36747819 PMCID: PMC9901184 DOI: 10.1101/2023.01.27.525748] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Recent advances in the genetics of schizophrenia (SCZ) have identified rare variants that confer high disease risk, including a 1.6 Mb deletion at chromosome 3q29 with a staggeringly large effect size (O.R. > 40). Understanding the impact of the 3q29 deletion (3q29Del) on the developing CNS may therefore lead to insights about the pathobiology of schizophrenia. To gain clues about the molecular and cellular perturbations caused by the 3q29 deletion, we interrogated transcriptomic effects in two experimental model systems with complementary advantages: isogenic human forebrain cortical organoids and isocortex from the 3q29Del mouse model. We first created isogenic lines by engineering the full 3q29Del into an induced pluripotent stem cell line from a neurotypical individual. We profiled transcriptomes from isogenic cortical organoids that were aged for 2 months and 12 months, as well as day p7 perinatal mouse isocortex, all at single cell resolution. Differential expression analysis by genotype in each cell-type cluster revealed that more than half of the differentially expressed genes identified in mouse cortex were also differentially expressed in human cortical organoids, and strong correlations were observed in mouse-human differential gene expression across most major cell-types. We systematically filtered differentially expressed genes to identify changes occurring in both model systems. Pathway analysis on this filtered gene set implicated dysregulation of mitochondrial function and energy metabolism, although the direction of the effect was dependent on developmental timepoint. Transcriptomic changes were validated at the protein level by analysis of oxidative phosphorylation protein complexes in mouse brain tissue. Assays of mitochondrial function in human heterologous cells further confirmed robust mitochondrial dysregulation in 3q29Del cells, and these effects are partially recapitulated by ablation of the 3q29Del gene PAK2 . Taken together these data indicate that metabolic disruption is associated with 3q29Del and is conserved across species. These results converge with data from other rare SCZ-associated variants as well as idiopathic schizophrenia, suggesting that mitochondrial dysfunction may be a significant but overlooked contributing factor to the development of psychotic disorders. This cross-species scRNA-seq analysis of the SCZ-associated 3q29 deletion reveals that this copy number variant may produce early and persistent changes in cellular metabolism that are relevant to human neurodevelopment.
Collapse
|
42
|
Vollger MR, Dishuck PC, Harvey WT, DeWitt WS, Guitart X, Goldberg ME, Rozanski AN, Lucas J, Asri M, Munson KM, Lewis AP, Hoekzema K, Logsdon GA, Porubsky D, Paten B, Harris K, Hsieh P, Eichler EE. Increased mutation and gene conversion within human segmental duplications. Nature 2023; 617:325-334. [PMID: 37165237 PMCID: PMC10172114 DOI: 10.1038/s41586-023-05895-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 02/28/2023] [Indexed: 05/12/2023]
Abstract
Single-nucleotide variants (SNVs) in segmental duplications (SDs) have not been systematically assessed because of the limitations of mapping short-read sequencing data1,2. Here we constructed 1:1 unambiguous alignments spanning high-identity SDs across 102 human haplotypes and compared the pattern of SNVs between unique and duplicated regions3,4. We find that human SNVs are elevated 60% in SDs compared to unique regions and estimate that at least 23% of this increase is due to interlocus gene conversion (IGC) with up to 4.3 megabase pairs of SD sequence converted on average per human haplotype. We develop a genome-wide map of IGC donors and acceptors, including 498 acceptor and 454 donor hotspots affecting the exons of about 800 protein-coding genes. These include 171 genes that have 'relocated' on average 1.61 megabase pairs in a subset of human haplotypes. Using a coalescent framework, we show that SD regions are slightly evolutionarily older when compared to unique sequences, probably owing to IGC. SNVs in SDs, however, show a distinct mutational spectrum: a 27.1% increase in transversions that convert cytosine to guanine or the reverse across all triplet contexts and a 7.6% reduction in the frequency of CpG-associated mutations when compared to unique DNA. We reason that these distinct mutational properties help to maintain an overall higher GC content of SD DNA compared to that of unique DNA, probably driven by GC-biased conversion between paralogous sequences5,6.
Collapse
Affiliation(s)
- Mitchell R Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Division of Medical Genetics, University of Washington School of Medicine, Seattle, WA, USA
| | - Philip C Dishuck
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - William S DeWitt
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA, USA
| | - Xavi Guitart
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Michael E Goldberg
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Allison N Rozanski
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Julian Lucas
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Mobin Asri
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Glennis A Logsdon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Kelley Harris
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - PingHsun Hsieh
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD, USA.
| |
Collapse
|
43
|
Guo F, Liu Z, Long G, Zhang B, Dong X, Liu D, Yu S. High-resolution genotyping of 58 STRs in 635 Northern Han Chinese with MiSeq FGx ® Forensic Genomics System. Forensic Sci Int Genet 2023; 65:102879. [PMID: 37150076 DOI: 10.1016/j.fsigen.2023.102879] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2022] [Revised: 04/16/2023] [Accepted: 04/22/2023] [Indexed: 05/09/2023]
Abstract
Sequence polymorphisms were characterized at 27 autosomal STRs (A-STRs), 7 X chromosomal STRs (X-STRs), and 24 Y chromosomal STRs (Y-STRs) in 635 Northern Han Chinese with the ForenSeq DNA Signature Prep Kit on the MiSeq FGx Forensic Genomics System. Since repeat region (RR) and flanking region (FR) variation can be detected by massively parallel sequencing (MPS), the increase in the number of unique alleles and the average of gene diversity was 78.18% and 3.51% between sequence and length, respectively. A total of 74 novel RR variants were identified at 33 STRs compared with STRSeq and previous studies, and 13 FR variants (rs1770275883, rs2053373277, rs2082557941, rs1925525766, rs1926380862, rs1569322793, rs2051848492, rs2051848696, rs2016239814, rs2053269960, rs2044518192, rs2044536444, and rs2089968964) were first submitted to dbSNP. Also, 99.94% of alleles were concordant between the ForenSeq DNA Signature Prep Kit and commercial CE kits. Discordance resulted from the low performance at D22S1045 and occasionally at DYS392, flanking region deletions at D7S820 and DXS10074, and the strict alignment algorithm at DXS7132. Null alleles at DYS505 and DYS448 and multialleles at DYS387S1a/b, DYS385a/b, DYS448, DYS505, DXS7132, and HPRTB were validated with other MPS and CE kits. Thus, a high-resolution sequence-based (SB) and length-based (LB) allele frequencies dataset from Northern Han Chinese has been established already. As expected, forensic parameters increased significantly on combined power of discrimination (PD) and combined power of exclusion (PE) at A-STRs, mildly on combined PD and combined mean exclusion chance (MEC) at X-STRs, and barely on discrimination capacity (DC) at Y-STRs. Additionally, MiSeq FGx quality metrics and MPS performance were evaluated in this study, which presented the high-quality of the dataset at 20 consecutive runs, such as ≥ 60% bases with a quality score of 20 or higher (%≥ Q20), > 60% of effective reads, > 2000 × of depth of coverage (DoC), ≥ 60% of allele coverage ratio (ACR) or heterozygote balance, ≥ 70% of inter-locus balance, and ≤ 0.4 of the absolute value of observed minus expected heterozygosity (|Hexp - Hobs|). In conclusion, MiSeq FGx can help us generate a high-resolution and high-quality dataset for human identification and population genetic studies.
Collapse
Affiliation(s)
- Fei Guo
- School of Forensic Science and Technology, Criminal Investigation Police University of China, Shenyang, Liaoning 110854, PR China.
| | - Ze Liu
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning 110002, PR China
| | - Guannan Long
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning 110002, PR China
| | - Biao Zhang
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning 110002, PR China
| | - Xinyu Dong
- School of Forensic Medicine, Shanxi Medical University, Jinzhong, Shanxi 030619, PR China
| | - Dahua Liu
- Department of Forensic Medicine, Jinzhou Medical University, Jinzhou, Liaoning 121001, PR China
| | - Shaobo Yu
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning 110002, PR China.
| |
Collapse
|
44
|
Losilla M, Gallant JR. Molecular evolution of the ependymin-related gene epdl2 in African weakly electric fish. G3 (BETHESDA, MD.) 2023; 13:6931758. [PMID: 36529459 PMCID: PMC9997568 DOI: 10.1093/g3journal/jkac331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 11/01/2022] [Accepted: 11/16/2022] [Indexed: 12/23/2022]
Abstract
Gene duplication and subsequent molecular evolution can give rise to taxon-specific gene specializations. In previous work, we found evidence that African weakly electric fish (Mormyridae) may have as many as three copies of the epdl2 gene, and the expression of two epdl2 genes is correlated with electric signal divergence. Epdl2 belongs to the ependymin-related family (EPDR), a functionally diverse family of secretory glycoproteins. In this study, we first describe vertebrate EPDR evolution and then present a detailed evolutionary history of epdl2 in Mormyridae with emphasis on the speciose genus Paramormyrops. Using Sanger sequencing, we confirm three apparently functional epdl2 genes in Paramormyrops kingsleyae. Next, we developed a nanopore-based amplicon sequencing strategy and bioinformatics pipeline to obtain and classify full-length epdl2 gene sequences (N = 34) across Mormyridae. Our phylogenetic analysis proposes three or four epdl2 paralogs dating from early Paramormyrops evolution. Finally, we conducted selection tests which detected positive selection around the duplication events and identified ten sites likely targeted by selection in the resulting paralogs. These sites' locations in our modeled 3D protein structure involve four sites in ligand binding and six sites in homodimer formation. Together, these findings strongly imply an evolutionary mechanism whereby epdl2 genes underwent selection-driven functional specialization after tandem duplications in the rapidly speciating Paramormyrops. Considering previous evidence, we propose that epdl2 may contribute to electric signal diversification in mormyrids, an important aspect of species recognition during mating.
Collapse
Affiliation(s)
- Mauricio Losilla
- Department of Integrative Biology, Michigan State University, East Lansing, MI 48824, USA.,Graduate Program in Ecology, Evolution and Behavior, Michigan State University, East Lansing, MI 48824, USA
| | - Jason R Gallant
- Department of Integrative Biology, Michigan State University, East Lansing, MI 48824, USA.,Graduate Program in Ecology, Evolution and Behavior, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
45
|
Dachs N, Upadhyay M, Hannemann E, Hauser A, Krebs S, Seichter D, Russ I, Gehrke LJ, Thaller G, Medugorac I. Quantitative trait locus for calving traits on Bos taurus autosome 18 in Holstein cattle is embedded in a complex genomic region. J Dairy Sci 2023; 106:1925-1941. [PMID: 36710189 DOI: 10.3168/jds.2021-21625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 10/10/2022] [Indexed: 01/31/2023]
Abstract
Although the quantitative trait locus (QTL) on chromosome 18 (BTA18) associated with paternal calving ease and stillbirth in Holstein Friesian cattle and its cross has been known for over 20 years, to our knowledge, the exact causal genetic sequence has yet escaped identification. The aim of this study was to re-examine the region of the published QTL on BTA18 and to investigate the possible reasons behind this elusiveness. For this purpose, we carried out a combined linkage disequilibrium and linkage analysis using genotyping data of 2,697 German Holstein Friesian (HF) animals and subsequent whole-genome sequencing (WGS) data analyses and genome assembly of HF samples. We confirmed the known QTL in the 95% confidence interval of 1.089 Mbp between 58.34 and 59.43 Mbp on BTA18. Additionally, these 4 SNPs in the near-perfect linkage disequilibrium with the QTL haplotype were identified: rs381577268 (on 57,816,137 bp, C/T), rs381878735 (on 59,574,329 bp, A/T), rs464221818 (on 59,329,176 bp, C/T), and rs472502785 (on 59,345,689 bp, T/C). Search for the causal mutation using short and long-read sequences, and methylation data of the BTA18 QTL region did not reveal any candidates though. The assembly showed problems in the region, as well as an abundance of segmental duplications within and around the region. Taking the QTL of BTA18 in Holstein cattle as an example, the data presented in this study comprehensively characterize the genomic features that could also be relevant for other such elusive QTL in various other cattle breeds and livestock species as well.
Collapse
Affiliation(s)
- Nina Dachs
- Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152 Martinsried, Germany; Tierzuchtforschung e.V. München, Senator-Gerauer-Str, 23, 85586 Poing, Germany
| | - Maulik Upadhyay
- Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152 Martinsried, Germany
| | - Elisabeth Hannemann
- Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152 Martinsried, Germany
| | - Andreas Hauser
- Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Feodor-Lynen-Straße 25, 81377 Munich, Germany
| | - Stefan Krebs
- Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Feodor-Lynen-Straße 25, 81377 Munich, Germany
| | - Doris Seichter
- Tierzuchtforschung e.V. München, Senator-Gerauer-Str, 23, 85586 Poing, Germany
| | - Ingolf Russ
- Tierzuchtforschung e.V. München, Senator-Gerauer-Str, 23, 85586 Poing, Germany
| | - Lilian Johanna Gehrke
- Institute of Animal Breeding and Husbandry, Christian-Albrechts-University Kiel, Olshausenstraße 40, 24098 Kiel, Germany; Vereinigte Informationssysteme Tierhaltung w.V. (vit) Verden, Heinrich-Schröder-Weg 1, 27283 Verden (Aller), Germany
| | - Georg Thaller
- Institute of Animal Breeding and Husbandry, Christian-Albrechts-University Kiel, Olshausenstraße 40, 24098 Kiel, Germany
| | - Ivica Medugorac
- Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152 Martinsried, Germany.
| |
Collapse
|
46
|
Johansson PA, Palmer JM, Hamilton HR, Whiteman DC, Pritchard AL, Hayward NK. Germline Variants in Childhood Cutaneous Melanoma. J Invest Dermatol 2023:S0022-202X(23)00155-0. [PMID: 36863448 DOI: 10.1016/j.jid.2023.02.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Revised: 02/14/2023] [Accepted: 02/20/2023] [Indexed: 03/04/2023]
Affiliation(s)
- Peter A Johansson
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Jane M Palmer
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Hayley R Hamilton
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - David C Whiteman
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Antonia L Pritchard
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia; Genetics and Immunology Group, University of the Highlands and Islands, Inverness, United Kingdom
| | - Nicholas K Hayward
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia.
| |
Collapse
|
47
|
Sun YH, Cui H, Song C, Shen JT, Zhuo X, Wang RH, Yu X, Ndamba R, Mu Q, Gu H, Wang D, Murthy GG, Li P, Liang F, Liu L, Tao Q, Wang Y, Orlowski S, Xu Q, Zhou H, Jagne J, Gokcumen O, Anthony N, Zhao X, Li XZ. Amniotes co-opt intrinsic genetic instability to protect germ-line genome integrity. Nat Commun 2023; 14:812. [PMID: 36781861 PMCID: PMC9925758 DOI: 10.1038/s41467-023-36354-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 01/27/2023] [Indexed: 02/15/2023] Open
Abstract
Unlike PIWI-interacting RNA (piRNA) in other species that mostly target transposable elements (TEs), >80% of piRNAs in adult mammalian testes lack obvious targets. However, mammalian piRNA sequences and piRNA-producing loci evolve more rapidly than the rest of the genome for unknown reasons. Here, through comparative studies of chickens, ducks, mice, and humans, as well as long-read nanopore sequencing on diverse chicken breeds, we find that piRNA loci across amniotes experience: (1) a high local mutation rate of structural variations (SVs, mutations ≥ 50 bp in size); (2) positive selection to suppress young and actively mobilizing TEs commencing at the pachytene stage of meiosis during germ cell development; and (3) negative selection to purge deleterious SV hotspots. Our results indicate that genetic instability at pachytene piRNA loci, while producing certain pathogenic SVs, also protects genome integrity against TE mobilization by driving the formation of rapid-evolving piRNA sequences.
Collapse
Affiliation(s)
- Yu H Sun
- Center for RNA Biology: From Genome to Therapeutics, Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, NY, 14642, USA
| | - Hongxiao Cui
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi, 712100, China
| | - Chi Song
- College of Public Health, Division of Biostatistics, The Ohio State University, Columbus, OH, 43210, USA
| | - Jiafei Teng Shen
- International Institutes of Medicine, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, Zhejiang, 322000, China
| | - Xiaoyu Zhuo
- Department of Genetics, The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, 63110, USA
| | - Ruoqiao Huiyi Wang
- Center for RNA Biology: From Genome to Therapeutics, Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, NY, 14642, USA
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi, 712100, China
| | - Xiaohui Yu
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi, 712100, China
| | - Rudo Ndamba
- Center for RNA Biology: From Genome to Therapeutics, Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, NY, 14642, USA
| | - Qian Mu
- Center for RNA Biology: From Genome to Therapeutics, Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, NY, 14642, USA
| | - Hanwen Gu
- Center for RNA Biology: From Genome to Therapeutics, Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, NY, 14642, USA
| | - Duolin Wang
- Center for RNA Biology: From Genome to Therapeutics, Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, NY, 14642, USA
| | - Gayathri Guru Murthy
- Center for RNA Biology: From Genome to Therapeutics, Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, NY, 14642, USA
| | - Pidong Li
- Grandomics Biosciences Co., Ltd, Beijing, 102206, China
| | - Fan Liang
- Grandomics Biosciences Co., Ltd, Beijing, 102206, China
| | - Lei Liu
- Grandomics Biosciences Co., Ltd, Beijing, 102206, China
| | - Qing Tao
- Grandomics Biosciences Co., Ltd, Beijing, 102206, China
| | - Ying Wang
- Department of Animal Science, University of California, Davis, CA, 95616, USA
| | - Sara Orlowski
- Department of Poultry Science, University of Arkansas, Fayetteville, AR, 72701, USA
| | - Qi Xu
- Department of Animal Science, McGill University, Quebec, H9X 3V9, Canada
| | - Huaijun Zhou
- Department of Animal Science, University of California, Davis, CA, 95616, USA
| | - Jarra Jagne
- Animal Health Diagnostic Center, Cornell University College of Veterinary Medicine, Ithaca, NY, 14850, USA
| | - Omer Gokcumen
- Department of Biological Sciences, University at Buffalo, State University of New York, Buffalo, NY, 14260, USA
| | - Nick Anthony
- Department of Poultry Science, University of Arkansas, Fayetteville, AR, 72701, USA
| | - Xin Zhao
- Department of Animal Science, McGill University, Quebec, H9X 3V9, Canada.
| | - Xin Zhiguo Li
- Center for RNA Biology: From Genome to Therapeutics, Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, NY, 14642, USA.
| |
Collapse
|
48
|
Paulin LF, Raveendran M, Harris RA, Rogers J, von Haeseler A, Sedlazeck FJ. SVhound: detection of regions that harbor yet undetected structural variation. BMC Bioinformatics 2023; 24:23. [PMID: 36670361 PMCID: PMC9854228 DOI: 10.1186/s12859-022-05046-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 11/08/2022] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Recent population studies are ever growing in number of samples to investigate the diversity of a population or species. These studies reveal new polymorphism that lead to important insights into the mechanisms of evolution, but are also important for the interpretation of these variations. Nevertheless, while the full catalog of variations across entire species remains unknown, we can predict which regions harbor additional not yet detected variations and investigate their properties, thereby enhancing the analysis for potentially missed variants. RESULTS To achieve this we developed SVhound ( https://github.com/lfpaulin/SVhound ), which based on a population level SVs dataset can predict regions that harbor unseen SV alleles. We tested SVhound using subsets of the 1000 genomes project data and showed that its correlation (average correlation of 2800 tests r = 0.7136) is high to the full data set. Next, we utilized SVhound to investigate potentially missed or understudied regions across 1KGP and CCDG. Lastly we also apply SVhound on a small and novel SV call set for rhesus macaque (Macaca mulatta) and discuss the impact and choice of parameters for SVhound. CONCLUSIONS SVhound is a unique method to identify potential regions that harbor hidden diversity in model and non model organisms and can also be potentially used to ensure high quality of SV call sets.
Collapse
Affiliation(s)
- Luis F Paulin
- Center for Integrative Bioinformatics Vienna, Max Perutz Labs, University of Vienna, Medical University of Vienna, Vienna, Austria.
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.
| | - Muthuswamy Raveendran
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - R Alan Harris
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Jeffrey Rogers
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Arndt von Haeseler
- Center for Integrative Bioinformatics Vienna, Max Perutz Labs, University of Vienna, Medical University of Vienna, Vienna, Austria
- Faculty of Computer Science, University of Vienna, Vienna, Austria
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.
| |
Collapse
|
49
|
Maskalenka K, Alagöz G, Krueger F, Wright J, Rostovskaya M, Nakhuda A, Bendall A, Krueger C, Walker S, Scally A, Rugg-Gunn PJ. NANOGP1, a tandem duplicate of NANOG, exhibits partial functional conservation in human naïve pluripotent stem cells. Development 2023; 150:286291. [PMID: 36621005 PMCID: PMC10110494 DOI: 10.1242/dev.201155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 12/16/2022] [Indexed: 01/10/2023]
Abstract
Gene duplication events can drive evolution by providing genetic material for new gene functions, and they create opportunities for diverse developmental strategies to emerge between species. To study the contribution of duplicated genes to human early development, we examined the evolution and function of NANOGP1, a tandem duplicate of the transcription factor NANOG. We found that NANOGP1 and NANOG have overlapping but distinct expression profiles, with high NANOGP1 expression restricted to early epiblast cells and naïve-state pluripotent stem cells. Sequence analysis and epitope-tagging revealed that NANOGP1 is protein coding with an intact homeobox domain. The duplication that created NANOGP1 occurred earlier in primate evolution than previously thought and has been retained only in great apes, whereas Old World monkeys have disabled the gene in different ways, including homeodomain point mutations. NANOGP1 is a strong inducer of naïve pluripotency; however, unlike NANOG, it is not required to maintain the undifferentiated status of human naïve pluripotent cells. By retaining expression, sequence and partial functional conservation with its ancestral copy, NANOGP1 exemplifies how gene duplication and subfunctionalisation can contribute to transcription factor activity in human pluripotency and development.
Collapse
Affiliation(s)
| | - Gökberk Alagöz
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
| | - Felix Krueger
- Bioinformatics Group, Babraham Institute, Cambridge CB22 3AT, UK
| | - Joshua Wright
- Epigenetics Programme, Babraham Institute, Cambridge CB22 3AT, UK
| | | | - Asif Nakhuda
- Gene Targeting Facility, Babraham Institute, Cambridge CB22 3AT, UK
| | - Adam Bendall
- Epigenetics Programme, Babraham Institute, Cambridge CB22 3AT, UK
| | - Christel Krueger
- Epigenetics Programme, Babraham Institute, Cambridge CB22 3AT, UK
| | - Simon Walker
- Imaging Facility, Babraham Institute, Cambridge CB22 3AT, UK
| | - Aylwyn Scally
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
| | - Peter J Rugg-Gunn
- Epigenetics Programme, Babraham Institute, Cambridge CB22 3AT, UK
- Wellcome-MRC Cambridge Stem Cell Institute, Cambridge CB2 0AW, UK
- Centre for Trophoblast Research, University of Cambridge, Cambridge CB2 3EG, UK
| |
Collapse
|
50
|
Tanwar UK, Stolarska E, Rudy E, Paluch-Lubawa E, Grabsztunowicz M, Arasimowicz-Jelonek M, Sobieszczuk-Nowicka E. Metal tolerance gene family in barley: an in silico comprehensive analysis. J Appl Genet 2022; 64:197-215. [PMID: 36586056 PMCID: PMC10076399 DOI: 10.1007/s13353-022-00744-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 12/20/2022] [Accepted: 12/21/2022] [Indexed: 01/01/2023]
Abstract
Metal-tolerance proteins (MTPs) are divalent cation transporters that play critical roles in metal tolerance and ion homeostasis in plants. However, a comprehensive study of MTPs is still lacking in crop plants. The current study aimed to comprehensively identify and characterize the MTP gene family in barley (Hordeum vulgare, Hv), an important crop. In total, 12 HvMTPs were identified in the barley genome in this study. They were divided into three phylogenetic groups (Zn-cation diffusion facilitator proteins [CDFs], Fe/Zn-CDFs, and Mn-CDFs) and further subdivided into seven groups (G1, G5, G6, G7, G8, G9, and G12). The majority of MTPs were hydrophobic proteins found in the vacuolar membrane. Gene duplication analysis of HvMTPs revealed one pair of segmental-like duplications in the barley genome. Evolutionary analysis suggested that barley MTPs underwent purifying natural selection. Additionally, the HvMTPs were analyzed in the pan-genome sequences of barley (20 accessions), which suggests that HvMTPs are highly conserved in barley evolution. Cis-acting regulatory elements, microRNA target sites, and protein-protein interaction analysis indicated the role of HvMTPs in a variety of biological processes. Expression profiling suggests that HvMTPs play an active role in maintaining barley nutrient homeostasis throughout its life cycle, and their expression levels were not significantly altered by abiotic stresses like cold, drought, or heat. The expression of barley HvMTP genes in the presence of heavy metals such as Zn2+, Cu2+, As3+, and Cd2+ revealed that these MTPs were induced by at least one metal ion, implying their involvement in metal tolerance or transportation. The identification and comprehensive investigation of MTP gene family members will provide important gene resources for the genetic improvement of crops for metal tolerance, bioremediation, or biofortification of staple crops.
Collapse
Affiliation(s)
- Umesh Kumar Tanwar
- Department of Plant Physiology, Faculty of Biology, Adam Mickiewicz University, ul. Uniwersytetu Poznańskiego 6, 61-614, Poznań, Poland.
| | - Ewelina Stolarska
- Department of Plant Physiology, Faculty of Biology, Adam Mickiewicz University, ul. Uniwersytetu Poznańskiego 6, 61-614, Poznań, Poland
| | - Elżbieta Rudy
- Department of Plant Physiology, Faculty of Biology, Adam Mickiewicz University, ul. Uniwersytetu Poznańskiego 6, 61-614, Poznań, Poland
| | - Ewelina Paluch-Lubawa
- Department of Plant Physiology, Faculty of Biology, Adam Mickiewicz University, ul. Uniwersytetu Poznańskiego 6, 61-614, Poznań, Poland
| | - Magda Grabsztunowicz
- Department of Plant Physiology, Faculty of Biology, Adam Mickiewicz University, ul. Uniwersytetu Poznańskiego 6, 61-614, Poznań, Poland
| | - Magdalena Arasimowicz-Jelonek
- Department of Plant Ecophysiology, Faculty of Biology, Adam Mickiewicz University, ul. Uniwersytetu Poznańskiego 6, 61-614, Poznań, Poland
| | - Ewa Sobieszczuk-Nowicka
- Department of Plant Physiology, Faculty of Biology, Adam Mickiewicz University, ul. Uniwersytetu Poznańskiego 6, 61-614, Poznań, Poland.
| |
Collapse
|