1
|
Xu H, Wang Z, Sa S, Yang Y, Zhang X, Li D. Identification of novel compound heterozygous variants of the ALMS1 gene in a child with Alström syndrome by whole genome sequencing. Gene 2024; 929:148827. [PMID: 39122231 DOI: 10.1016/j.gene.2024.148827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 07/24/2024] [Accepted: 08/06/2024] [Indexed: 08/12/2024]
Abstract
BACKGROUND Alström syndrome (ALMS), a rare recessively inherited ciliopathy caused by mutations in ALMS1, is characterized by retinal dystrophy, childhood obesity, sensorineural hearing loss, and type 2 diabetes mellitus. The majority of pathogenic variants in ALMS1 are nonsense and frameshift mutations, which would lead to premature protein truncation, whereas copy number variants are seldom reported. METHODS Herein, we present a 10-year-old Chinese girl with ALMS. The potential causative genetic variant was confirmed through whole genome sequencing, quantitative real-time PCR analysis, and Sanger sequencing. Additionally, breakpoint analysis was performed to determine the exact breakpoint site of the large deletion and elucidate its probable formation mechanism. RESULTS The patient had a cor triatriatum sinister (CTS) structure. Genetic analysis identified novel compound heterozygous variants in the patient, consisting of a frameshift variant c.4414_4415delGT (p.V1472Nfs*26) in ALMS1 and a novel large deletion at chr2:73,612,355-73,626,339, which encompasses exon 1 of the ALMS1 gene. Moreover, breakpoint analysis revealed that the large deletion probably formed through the microhomology-mediated end joining (MMEJ) mechanism due to the 6-bp microhomologies (TCCTTC) observed at both ends of the breakpoints. CONCLUSIONS In this study, novel compound heterozygous variants in the ALMS1 gene were identified in an ALMS patient with a CTS structure. The molecular confirmation of these variants expands the mutational spectrum of ALMS1, while the manifestation of ALMS in the patient provides additional clinical insights into this syndrome.
Collapse
Affiliation(s)
- Haikun Xu
- Department of Radiology, China-Japan Union Hospital of Jilin University, Changchun 130033, P.R. China
| | - Ziju Wang
- Department of Pediatrics, China-Japan Union Hospital of Jilin University, Changchun 130033, P.R. China
| | - Sha Sa
- Department of Radiology, China-Japan Union Hospital of Jilin University, Changchun 130033, P.R. China
| | - Ying Yang
- Department of Ultrasound, China-Japan Union Hospital of Jilin University, Changchun 130033, P.R. China
| | - Xiaofei Zhang
- Department of Pediatrics, China-Japan Union Hospital of Jilin University, Changchun 130033, P.R. China.
| | - Dejun Li
- Center for Reproductive Medicine and Prenatal Diagnosis, The First Hospital of Jilin University, Changchun 130021, P.R. China.
| |
Collapse
|
2
|
Ciantar J, Marttila S, Rajić S, Kostiniuk D, Mishra PP, Lyytikäinen LP, Mononen N, Kleber ME, März W, Kähönen M, Raitakari O, Lehtimäki T, Raitoharju E. Identification and functional characterisation of DNA methylation differences between East- and West-originating Finns. Epigenetics 2024; 19:2397297. [PMID: 39217505 PMCID: PMC11382697 DOI: 10.1080/15592294.2024.2397297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 08/14/2024] [Accepted: 08/22/2024] [Indexed: 09/04/2024] Open
Abstract
Eastern and Western Finns show a striking difference in coronary heart disease-related mortality; genetics is a known contributor for this discrepancy. Here, we discuss the potential role of DNA methylation in mediating the discrepancy in cardiometabolic disease-risk phenotypes between the sub-populations. We used data from the Young Finns Study (n = 969) to compare the genome-wide DNA methylation levels of East- and West-originating Finns. We identified 21 differentially methylated loci (FDR < 0.05; Δβ >2.5%) and 7 regions (smoothed FDR < 0.05; CpGs ≥ 5). Methylation at all loci and regions associates with genetic variants (p < 5 × 10-8). Independently of genetics, methylation at 11 loci and 4 regions associates with transcript expression, including genes encoding zinc finger proteins. Similarly, methylation at 5 loci and 4 regions associates with cardiometabolic disease-risk phenotypes including triglycerides, glucose, cholesterol, as well as insulin treatment. This analysis was also performed in LURIC (n = 2371), a German cardiovascular patient cohort, and results replicated for the association of methylation at cg26740318 and DMR_11p15 with diabetes-related phenotypes and methylation at DMR_22q13 with triglyceride levels. Our results indicate that DNA methylation differences between East and West Finns may have a functional role in mediating the cardiometabolic disease discrepancy between the sub-populations.
Collapse
Affiliation(s)
- Joanna Ciantar
- Molecular Epidemiology (MOLE), Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Saara Marttila
- Molecular Epidemiology (MOLE), Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
- Gerontology Research Center, Tampere University, Tampere, Finland
- Tays Research Services, Wellbeing Services County of Pirkanmaa, Tampere University Hospital, Tampere, Finland
| | - Sonja Rajić
- Molecular Epidemiology (MOLE), Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Daria Kostiniuk
- Molecular Epidemiology (MOLE), Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Pashupati P Mishra
- Department of Clinical Chemistry, Tays Research Services, Fimlab Laboratories, and Finnish Cardiovascular Research Center, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Leo-Pekka Lyytikäinen
- Department of Clinical Chemistry, Tays Research Services, Fimlab Laboratories, and Finnish Cardiovascular Research Center, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Nina Mononen
- Department of Clinical Chemistry, Tays Research Services, Fimlab Laboratories, and Finnish Cardiovascular Research Center, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Marcus E Kleber
- Vth Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, Heidelberg University, Mannheim, Germany
- SYNLAB MVZ Humangenetik Mannheim, Mannheim, Germany
| | - Winfried März
- Vth Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, Heidelberg University, Mannheim, Germany
- Synlab Academy, SYNLAB Holding Deutschland GmbH, Mannheim, Germany
| | - Mika Kähönen
- Department of Clinical Physiology, Tampere University Hospital and Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Olli Raitakari
- Centre for Population Health Research, University of Turku and Turku University Hospital, Turku, Finland
- Research Centre of Applied and Preventive Cardiovascular Medicine, University of Turku, Turku, Finland
- Department of Clinical Physiology and Nuclear Medicine, Turku University Hospital, Turku, Finland
| | - Terho Lehtimäki
- Department of Clinical Chemistry, Tays Research Services, Fimlab Laboratories, and Finnish Cardiovascular Research Center, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
- Finnish Cardiovascular Research Center Tampere, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Emma Raitoharju
- Molecular Epidemiology (MOLE), Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
- Finnish Cardiovascular Research Center Tampere, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
- Fimlab Laboratories, Tampere, Finland
| |
Collapse
|
3
|
Tao J, Luo J, Li K, Yang R, Lin Y, Ge J. Comprehensive genetic analysis uncovers the mutational spectrum of MFRP and its genotype-phenotype correlation in a large cohort of Chinese microphthalmia patients. Gene 2024; 926:148647. [PMID: 38848879 DOI: 10.1016/j.gene.2024.148647] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 05/21/2024] [Accepted: 06/03/2024] [Indexed: 06/09/2024]
Abstract
PURPOSE Microphthalmia is a severe congenital ocular disease featured by abnormal ocular development. The aim of this study was to detail the genetic and clinical characteristics of a large cohort of Chinese patients with microphthalmia related to MFRP variants, focusing on uncovering genotype-phenotype correlations. METHODS Fifty microphthalmia patients from 44 unrelated Chinese families were recruited. Whole-exome sequencing (WES) was conducted to analyze the coding regions and adjacent intronic regions of MFRP. Axial lengths (AL) were measured for all probands and available family members. Protein structures of mutations with high frequency in our cohort were predicted. The genotype-phenotype correlations were explored by statistical analysis. RESULTS Sixteen MFRP variants were detected in 17 families, accounting for 38.64 % of all microphthalmia families. There were 9 novel mutations (c.427+1G>C, c.428-2A>C, c.561_575del:p.A188_E192del, c.836G>A:p.C279Y, c.1010_1021del:p.H337_E340del:p.Y479*, c.1516_1517del:p.S506Pfs*66, c.1561T>G:p.C521G, c.1616G>A:p.R539H, and c.1735C>T:p.P579S) and six previously reported variants in MFRP, with p.E496K and p.H337_E340del being highly frequent, found in eight (47.06 %) and two families (11.76 %), respectively. Seven variants (43.75 %) were located in the C-terminal cysteine-rich frizzled-related domain (CRD) (7/16, 43.75 %). Protein prediction implicated p.E496K and p.H337_E340del mutations might lead to a destabilization of the MFRP protein. The average AL of all 42 eyes was 16.02 ± 1.05 mm, and 78.36 % of eyes with AL < 16 mm harbored p.E496K variant. Twenty-six eyes with variant variant had shorter AL than that of the other 16 eyes without this variant (p = 0.006), highlighting a novel genotype-phenotype correlation. CONCLUSIONS In this largest cohort of Chinese patients with microphthalmia, the 9 novel variants, high frequency of p.E496W, and mutation hotspots in CRD reveals unique insights into the MFRP mutation spectrum among Chinese patients, indicating ethnic variability. A new genotype-phenotype correlation that p.E496K variant associated with a shorter AL is unveiled. Our findings enhance the current knowledge of MFRP-associated microphthalmia and provide valuable information for prenatal diagnosis as well as future therapy.
Collapse
Affiliation(s)
- Jing Tao
- Beijing Tongren Eye Center, Beijing Tongren Hospital, Capital Medical University, Beijing Key Laboratory of Ophthalmology and Visual Sciences, Beijing 100730, China
| | - Jingyi Luo
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, Guangdong 510000, China
| | - Kaijing Li
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, Guangdong 510000, China
| | - Runcai Yang
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, Guangdong 510000, China
| | - Yixiu Lin
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, Guangdong 510000, China
| | - Jian Ge
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, Guangdong 510000, China.
| |
Collapse
|
4
|
Li S, Gui J, Passarelli MN, Andrew AS, Sullivan KM, Cornell KA, Traynor BJ, Stark A, Chia R, Kuenzler RM, Pioro EP, Bradley WG, Stommel EW. Genome-Wide and Transcriptome-Wide Association Studies on Northern New England and Ohio Amyotrophic Lateral Sclerosis Cohorts. Neurol Genet 2024; 10:e200188. [PMID: 39246739 PMCID: PMC11380502 DOI: 10.1212/nxg.0000000000200188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Accepted: 07/23/2024] [Indexed: 09/10/2024]
Abstract
Background and Objectives Amyotrophic lateral sclerosis (ALS) is an age-associated, fatal neurodegenerative disorder causing progressive paralysis and respiratory failure. The genetic architecture of ALS is still largely unknown. Methods We performed a genome-wide association study (GWAS) and transcriptome-wide association study (TWAS) to understand genetic risk factors for ALS using a population-based case-control study of 435 ALS cases and 279 controls from Northern New England and Ohio. Single nucleotide polymorphism (SNP) genotyping was conducted using the Illumina NeuroChip array. Odds ratios were estimated using covariate-adjusted logistic regression. We also performed a genome-wide SNP-smoking interaction screening. TWAS analyses used PrediXcan to estimate associations between predicted gene expression levels across 15 tissues (13 brain tissues, skeletal muscle, and whole blood) and ALS risk. Results GWAS analyses identified the p.A382T missense variant (rs367543041, p = 3.95E-6) in the TARDBP gene, which has previously been reported in association with increased ALS risk and was found to share a close affinity with the Sardinian haplotype. Both GWAS and TWAS analyses suggested that ZNF235 is associated with decreased ALS risk. Discussion Our results support the need for future evaluation to clarify the role of these potential genetic risk factors for ALS and to understand genetic susceptibility to environmental risk factors.
Collapse
Affiliation(s)
- Siting Li
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| | - Jiang Gui
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| | - Michael N Passarelli
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| | - Angeline S Andrew
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| | - Kathleen M Sullivan
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| | - Kevin A Cornell
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| | - Bryan J Traynor
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| | - Ali Stark
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| | - Ruth Chia
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| | - Rebecca M Kuenzler
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| | - Erik P Pioro
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| | - Walter G Bradley
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| | - Elijah W Stommel
- From the Departments of Biomedical Data Science (S.L., J.G.), Epidemiology (S.L., M.N.P.), and Neurology, Geisel School of Medicine at Dartmouth (E.W.S.), Dartmouth College, Hanover; Dartmouth Health (A.S.A., K.M.S., K.A.C., E.W.S.), Lebanon, NH; Neuromuscular Diseases Research Section (B.J.T., A.S., R.C.), National Institute on Aging; National Institute of Neurological Disorders and Stroke (B.J.T.), National Institutes of Health, Bethesda; RNA Therapeutics Laboratory (B.J.T.), National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD; Cleveland Clinic (R.M.K.), OH; Department of Medicine (E.P.P.), University of British Columbia, Vancouver, BC, Canada; and University of Miami Miller School of Medicine (W.G.B.), FL
| |
Collapse
|
5
|
Raingeval M, Leduque B, Baduel P, Edera A, Roux F, Colot V, Quadrana L. Retrotransposon-driven environmental regulation of FLC leads to adaptive response to herbicide. NATURE PLANTS 2024:10.1038/s41477-024-01807-8. [PMID: 39333353 DOI: 10.1038/s41477-024-01807-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Accepted: 09/05/2024] [Indexed: 09/29/2024]
Abstract
The mobilization of transposable elements is a potent source of mutations. In plants, several stransposable elements respond to external cues, fuelling the hypothesis that natural transposition can create environmentally sensitive alleles for adaptation. Here we report on the detailed characterization of a retrotransposon insertion within the first intron of the Arabidopsis floral-repressor gene FLOWERING LOCUS C (FLC) and the discovery of its role for adaptation. The insertion mutation augments the environmental sensitivity of FLC by affecting the balance between coding and non-coding transcripts in response to stress, thus expediting flowering. This balance is modulated by DNA methylation and orchestrated by IBM2, a factor involved in the processing of intronic heterochromatic sequences. The stress-sensitive allele of FLC has spread across populations subjected to recurrent chemical weeding, and we show that retrotransposon-driven acceleration of the life cycle represents a rapid response to herbicide application. Our work provides a compelling example of a transposable element-driven environmentally sensitive allele that confers an adaptive response in nature.
Collapse
Affiliation(s)
- Mathieu Raingeval
- Institute of Plant Sciences Paris-Saclay (IPS2), Centre National de la Recherche Scientifique, Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Université Evry, Université Paris-Saclay, Gif sur Yvette, France
- Department of Molecular Mechanisms of Disease, University of Zurich, Zurich, Switzerland
| | - Basile Leduque
- Institute of Plant Sciences Paris-Saclay (IPS2), Centre National de la Recherche Scientifique, Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Université Evry, Université Paris-Saclay, Gif sur Yvette, France
| | - Pierre Baduel
- Institut de Biologie de l'Ecole Normale Supérieure, Centre National de la Recherche Scientifique, Institut National de la Santé et de la Recherche Médicale, Ecole Normale Supérieure, PSL Research University, Paris, France
| | - Alejandro Edera
- Institute of Plant Sciences Paris-Saclay (IPS2), Centre National de la Recherche Scientifique, Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Université Evry, Université Paris-Saclay, Gif sur Yvette, France
| | - Fabrice Roux
- Laboratoire des Interactions Plantes-Microbes-Environnement, Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Centre National de la Recherche Scientifique, Université de Toulouse, Castanet-Tolosan, France
| | - Vincent Colot
- Institut de Biologie de l'Ecole Normale Supérieure, Centre National de la Recherche Scientifique, Institut National de la Santé et de la Recherche Médicale, Ecole Normale Supérieure, PSL Research University, Paris, France
| | - Leandro Quadrana
- Institute of Plant Sciences Paris-Saclay (IPS2), Centre National de la Recherche Scientifique, Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Université Evry, Université Paris-Saclay, Gif sur Yvette, France.
| |
Collapse
|
6
|
Lee AS, Ayers LJ, Kosicki M, Chan WM, Fozo LN, Pratt BM, Collins TE, Zhao B, Rose MF, Sanchis-Juan A, Fu JM, Wong I, Zhao X, Tenney AP, Lee C, Laricchia KM, Barry BJ, Bradford VR, Jurgens JA, England EM, Lek M, MacArthur DG, Lee EA, Talkowski ME, Brand H, Pennacchio LA, Engle EC. A cell type-aware framework for nominating non-coding variants in Mendelian regulatory disorders. Nat Commun 2024; 15:8268. [PMID: 39333082 DOI: 10.1038/s41467-024-52463-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Accepted: 09/04/2024] [Indexed: 09/29/2024] Open
Abstract
Unsolved Mendelian cases often lack obvious pathogenic coding variants, suggesting potential non-coding etiologies. Here, we present a single cell multi-omic framework integrating embryonic mouse chromatin accessibility, histone modification, and gene expression assays to discover cranial motor neuron (cMN) cis-regulatory elements and subsequently nominate candidate non-coding variants in the congenital cranial dysinnervation disorders (CCDDs), a set of Mendelian disorders altering cMN development. We generate single cell epigenomic profiles for ~86,000 cMNs and related cell types, identifying ~250,000 accessible regulatory elements with cognate gene predictions for ~145,000 putative enhancers. We evaluate enhancer activity for 59 elements using an in vivo transgenic assay and validate 44 (75%), demonstrating that single cell accessibility can be a strong predictor of enhancer activity. Applying our cMN atlas to 899 whole genome sequences from 270 genetically unsolved CCDD pedigrees, we achieve significant reduction in our variant search space and nominate candidate variants predicted to regulate known CCDD disease genes MAFB, PHOX2A, CHN1, and EBF3 - as well as candidates in recurrently mutated enhancers through peak- and gene-centric allelic aggregation. This work delivers non-coding variant discoveries of relevance to CCDDs and a generalizable framework for nominating non-coding variants of potentially high functional impact in other Mendelian disorders.
Collapse
Affiliation(s)
- Arthur S Lee
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA.
- Kirby Neurobiology Center, Boston Children's Hospital, Boston, MA, USA.
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Lauren J Ayers
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
| | - Michael Kosicki
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Wai-Man Chan
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Lydia N Fozo
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
| | - Brandon M Pratt
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
| | - Thomas E Collins
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
| | - Boxun Zhao
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Matthew F Rose
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Kirby Neurobiology Center, Boston Children's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Pathology, Boston Children's Hospital, Boston, MA, USA
- Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Medical Genetics Training Program, Harvard Medical School, Boston, MA, USA
| | - Alba Sanchis-Juan
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Jack M Fu
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Isaac Wong
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Xuefang Zhao
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Alan P Tenney
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Kirby Neurobiology Center, Boston Children's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Cassia Lee
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Harvard College, Cambridge, MA, USA
| | - Kristen M Laricchia
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Brenda J Barry
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Victoria R Bradford
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
| | - Julie A Jurgens
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Kirby Neurobiology Center, Boston Children's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Eleina M England
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Monkol Lek
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Daniel G MacArthur
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Centre for Population Genomics, Garvan Institute of Medical Research and UNSW Sydney, Sydney, NSW, Australia
- Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, VIC, Australia
| | - Eunjung Alice Lee
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Michael E Talkowski
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Harrison Brand
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Pediatric Surgical Research Laboratories, Massachusetts General Hospital, Boston, MA, USA
| | - Len A Pennacchio
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Elizabeth C Engle
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA.
- Kirby Neurobiology Center, Boston Children's Hospital, Boston, MA, USA.
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD, USA.
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA.
- Medical Genetics Training Program, Harvard Medical School, Boston, MA, USA.
- Department of Ophthalmology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
7
|
Dodge TO, Kim BY, Baczenas JJ, Banerjee SM, Gunn TR, Donny AE, Given LA, Rice AR, Haase Cox SK, Weinstein ML, Cross R, Moran BM, Haber K, Haghani NB, Machin Kairuz JA, Gellert HR, Du K, Aguillon SM, Tudor MS, Gutiérrez-Rodríguez C, Rios-Cardenas O, Morris MR, Schartl M, Powell DL, Schumer M. Structural genomic variation and behavioral interactions underpin a balanced sexual mimicry polymorphism. Curr Biol 2024:S0960-9822(24)01168-0. [PMID: 39326413 DOI: 10.1016/j.cub.2024.08.053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Revised: 07/15/2024] [Accepted: 08/29/2024] [Indexed: 09/28/2024]
Abstract
How phenotypic diversity originates and persists within populations are classic puzzles in evolutionary biology. While balanced polymorphisms segregate within many species, it remains rare for both the genetic basis and the selective forces to be known, leading to an incomplete understanding of many classes of traits under balancing selection. Here, we uncover the genetic architecture of a balanced sexual mimicry polymorphism and identify behavioral mechanisms that may be involved in its maintenance in the swordtail fish Xiphophorus birchmanni. We find that ∼40% of X. birchmanni males develop a "false gravid spot," a melanic pigmentation pattern that mimics the "pregnancy spot" associated with sexual maturity in female live-bearing fish. Using genome-wide association mapping, we detect a single intergenic region associated with variation in the false gravid spot phenotype, which is upstream of kitlga, a melanophore patterning gene. By performing long-read sequencing within and across populations, we identify complex structural rearrangements between alternate alleles at this locus. The false gravid spot haplotype drives increased allele-specific expression of kitlga, which provides a mechanistic explanation for the increased melanophore abundance that causes the spot. By studying social interactions in the laboratory and in nature, we find that males with the false gravid spot experience less aggression; however, they also receive increased attention from other males and are disdained by females. These behavioral interactions may contribute to the maintenance of this phenotypic polymorphism in natural populations. We speculate that structural variants affecting gene regulation may be an underappreciated driver of balanced polymorphisms across diverse species.
Collapse
Affiliation(s)
- Tristram O Dodge
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA; Centro de Investigaciones Científicas de las Huastecas "Aguazarca" A.C., 16 de Septiembre, 392 Barrio Aguazarca, Calnali, Hidalgo 43240, México.
| | - Bernard Y Kim
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA
| | - John J Baczenas
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA
| | - Shreya M Banerjee
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA; Centro de Investigaciones Científicas de las Huastecas "Aguazarca" A.C., 16 de Septiembre, 392 Barrio Aguazarca, Calnali, Hidalgo 43240, México; Center for Population Biology and Department of Evolution and Ecology, University of California, Davis, 475 Storer Mall, Davis, CA 95616, USA
| | - Theresa R Gunn
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA; Centro de Investigaciones Científicas de las Huastecas "Aguazarca" A.C., 16 de Septiembre, 392 Barrio Aguazarca, Calnali, Hidalgo 43240, México
| | - Alex E Donny
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA; Centro de Investigaciones Científicas de las Huastecas "Aguazarca" A.C., 16 de Septiembre, 392 Barrio Aguazarca, Calnali, Hidalgo 43240, México
| | - Lyle A Given
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA
| | - Andreas R Rice
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA
| | - Sophia K Haase Cox
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA
| | - M Luke Weinstein
- Department of Biological Sciences, Ohio University, 7 Depot St., Athens, OH 45701, USA
| | - Ryan Cross
- Department of Biological Sciences, Ohio University, 7 Depot St., Athens, OH 45701, USA
| | - Benjamin M Moran
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA; Centro de Investigaciones Científicas de las Huastecas "Aguazarca" A.C., 16 de Septiembre, 392 Barrio Aguazarca, Calnali, Hidalgo 43240, México
| | - Kate Haber
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA; Berkeley High School, 1980 Allston Way, Berkeley, CA 94704, USA
| | - Nadia B Haghani
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA; Centro de Investigaciones Científicas de las Huastecas "Aguazarca" A.C., 16 de Septiembre, 392 Barrio Aguazarca, Calnali, Hidalgo 43240, México
| | | | - Hannah R Gellert
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA
| | - Kang Du
- Xiphophorus Genetic Stock Center, Texas State University, San Marcos, 601 University Drive, San Marcos, TX 78666, USA
| | - Stepfanie M Aguillon
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA; Centro de Investigaciones Científicas de las Huastecas "Aguazarca" A.C., 16 de Septiembre, 392 Barrio Aguazarca, Calnali, Hidalgo 43240, México; Department of Ecology and Evolutionary Biology, University of California, Los Angeles, 612 Charles E. Young Drive South, Los Angeles, CA 90095, USA
| | - M Scarlett Tudor
- Cooperative Extension and Aquaculture Research Institute, University of Maine, 33 Salmon Farm Road, Franklin, ME 04634, USA
| | - Carla Gutiérrez-Rodríguez
- Red de Biología Evolutiva, Instituto de Ecología, A.C., Carretera antigua a Coatepec 351, Col. El Haya, Xalapa, Veracruz 91073, México
| | - Oscar Rios-Cardenas
- Red de Biología Evolutiva, Instituto de Ecología, A.C., Carretera antigua a Coatepec 351, Col. El Haya, Xalapa, Veracruz 91073, México
| | - Molly R Morris
- Department of Biological Sciences, Ohio University, 7 Depot St., Athens, OH 45701, USA
| | - Manfred Schartl
- Xiphophorus Genetic Stock Center, Texas State University, San Marcos, 601 University Drive, San Marcos, TX 78666, USA; Developmental Biochemistry, Biocenter, University of Würzburg, Am Hubland, 97074 Wuerzburg, Germany
| | - Daniel L Powell
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA; Centro de Investigaciones Científicas de las Huastecas "Aguazarca" A.C., 16 de Septiembre, 392 Barrio Aguazarca, Calnali, Hidalgo 43240, México; Department of Biology, Louisiana State University, 202 Life Science Building, Baton Rouge, LA 70803, USA
| | - Molly Schumer
- Department of Biology, Stanford University, 327 Campus Drive, Stanford, CA 94305, USA; Centro de Investigaciones Científicas de las Huastecas "Aguazarca" A.C., 16 de Septiembre, 392 Barrio Aguazarca, Calnali, Hidalgo 43240, México; Howard Hughes Medical Institute, 327 Campus Drive, Stanford, CA 94305, USA.
| |
Collapse
|
8
|
Gretzinger J, Gibbon VE, Penske SE, Sealy JC, Rohrlach AB, Salazar-García DC, Krause J, Schiffels S. 9,000 years of genetic continuity in southernmost Africa demonstrated at Oakhurst rockshelter. Nat Ecol Evol 2024:10.1038/s41559-024-02532-3. [PMID: 39300260 DOI: 10.1038/s41559-024-02532-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Accepted: 08/02/2024] [Indexed: 09/22/2024]
Abstract
Southern Africa has one of the longest records of fossil hominins and harbours the largest human genetic diversity in the world. Yet, despite its relevance for human origins and spread around the globe, the formation and processes of its gene pool in the past are still largely unknown. Here, we present a time transect of genome-wide sequences from nine individuals recovered from a single site in South Africa, Oakhurst Rockshelter. Spanning the whole Holocene, the ancient DNA of these individuals allows us to reconstruct the demographic trajectories of the indigenous San population and their ancestors during the last 10,000 years. We show that, in contrast to most regions around the world, the population history of southernmost Africa was not characterized by several waves of migration, replacement and admixture but by long-lasting genetic continuity from the early Holocene to the end of the Later Stone Age. Although the advent of pastoralism and farming substantially transformed the gene pool in most parts of southern Africa after 1,300 BP, we demonstrate using allele-frequency and identity-by-descent segment-based methods that the ‡Khomani San and Karretjiemense from South Africa still show direct signs of relatedness to the Oakhurst hunter-gatherers, a pattern obscured by recent, extensive non-Southern African admixture. Yet, some southern San in South Africa still preserve this ancient, Pleistocene-derived genetic signature, extending the period of genetic continuity until today.
Collapse
Affiliation(s)
- Joscha Gretzinger
- Max Planck Institute for Evolutionary Anthropology, Department of Archaeogenetics, Leipzig, Germany
| | - Victoria E Gibbon
- Division of Clinical Anatomy and Biological Anthropology, Department of Human Biology, University of Cape Town, Cape Town, South Africa.
| | - Sandra E Penske
- Max Planck Institute for Evolutionary Anthropology, Department of Archaeogenetics, Leipzig, Germany
| | - Judith C Sealy
- Department of Archaeology, University of Cape Town, Cape Town, South Africa
| | - Adam B Rohrlach
- Max Planck Institute for Evolutionary Anthropology, Department of Archaeogenetics, Leipzig, Germany
- School of Computer and Mathematical Sciences, University of Adelaide, Adelaide, South Australia, Australia
| | - Domingo C Salazar-García
- Department of Geological Sciences, University of Cape Town, Cape Town, South Africa
- Departament de Prehistòria, Arqueologia i Història Antiga, Universitat de València, València, Spain
| | - Johannes Krause
- Max Planck Institute for Evolutionary Anthropology, Department of Archaeogenetics, Leipzig, Germany
| | - Stephan Schiffels
- Max Planck Institute for Evolutionary Anthropology, Department of Archaeogenetics, Leipzig, Germany.
| |
Collapse
|
9
|
Bravo JI, Zhang L, Benayoun BA. Multi-ancestry GWAS reveals loci linked to human variation in LINE-1- and Alu-copy numbers. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.10.612283. [PMID: 39314493 PMCID: PMC11419044 DOI: 10.1101/2024.09.10.612283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
Long INterspersed Element-1 (LINE-1; L1) and Alu are two families of transposable elements (TEs) occupying ~17% and ~11% of the human genome, respectively. Though only a small fraction of L1 copies is able to produce the machinery to mobilize autonomously, Alu elements and degenerate L1 copies can hijack their functional machinery and mobilize in trans. The expression and subsequent copy number expansion of L1 and Alu can exert pathological effects on their hosts, promoting genome instability, inflammation, and cell cycle alterations. These features have made L1 and Alu promising focus subjects in studies of aging and aging diseases where they can become active. However, the mechanisms regulating variation in their expression and copy number remain incompletely characterized. Moreover, the relevance of known mechanisms to diverse human populations remains unclear, as mechanisms are often characterized in isogenic cell culture models. To address these gaps, we leveraged genomic data from the 1000 Genomes Project to carry out a trans-ethnic GWAS of L1 and Alu insertion global singletons. These singletons are rare insertions observed only once in a population, potentially reflecting recently acquired L1 and Alu integrants or structural variants, and which we used as proxies for L1/Alu-associated copy number variation. Our computational approach identified single nucleotide variants in genomic regions containing genes with potential and known TE regulatory properties, and it enriched for single nucleotide variants in regions containing known regulators of L1 expression. Moreover, we identified many reference TE copies and polymorphic structural variants that were associated with L1/Alu singletons, suggesting their potential contribution to TE copy number variation through transposition-dependent or transposition-independent mechanisms. Finally, a transcriptional analysis of lymphoblastoid cells highlighted potential cell cycle alterations in a subset of samples harboring L1/Alu singletons. Collectively, our results (i) suggest that known TE regulatory mechanisms may also play regulatory roles in diverse human populations, (ii) expand the list of genic and repetitive genomic loci implicated in TE copy number variation, and (iii) reinforce the links between TEs and disease.
Collapse
Affiliation(s)
- Juan I. Bravo
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
| | - Lucia Zhang
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
- Quantitative and Computational Biology Department, USC Dornsife College of Letters, Arts and Sciences, Los Angeles, California, USA
| | - Bérénice A. Benayoun
- Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA 90089, USA
- Molecular and Computational Biology Department, USC Dornsife College of Letters, Arts and Sciences, Los Angeles, CA 90089, USA
- Biochemistry and Molecular Medicine Department, USC Keck School of Medicine, Los Angeles, CA 90089, USA
- USC Norris Comprehensive Cancer Center, Epigenetics and Gene Regulation, Los Angeles, CA 90089, USA
- USC Stem Cell Initiative, Los Angeles, CA 90089, USA
| |
Collapse
|
10
|
Märkle H, John S, Metzger L, Ansari MA, Pedergnana V, Tellier A. Inference of Host-Pathogen Interaction Matrices from Genome-Wide Polymorphism Data. Mol Biol Evol 2024; 41:msae176. [PMID: 39172738 DOI: 10.1093/molbev/msae176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 07/04/2024] [Accepted: 08/20/2024] [Indexed: 08/24/2024] Open
Abstract
Host-pathogen coevolution is defined as the reciprocal evolutionary changes in both species due to genotype × genotype (G×G) interactions at the genetic level determining the outcome and severity of infection. While co-analyses of hosts and pathogen genomes (co-genome-wide association studies) allow us to pinpoint the interacting genes, these do not reveal which host genotype(s) is/are resistant to which pathogen genotype(s). The knowledge of this so-called infection matrix is important for agriculture and medicine. Building on established theories of host-pathogen interactions, we here derive four novel indices capturing the characteristics of the infection matrix. These indices can be computed from full genome polymorphism data of randomly sampled uninfected hosts, as well as infected hosts and their pathogen strains. We use these indices in an approximate Bayesian computation method to pinpoint loci with relevant G×G interactions and to infer their underlying interaction matrix. In a combined single nucleotide polymorphism dataset of 451 European humans and their infecting hepatitis C virus (HCV) strains and 503 uninfected individuals, we reveal a new human candidate gene for resistance to HCV and new virus mutations matching human genes. For two groups of significant human-HCV (G×G) associations, we infer a gene-for-gene infection matrix, which is commonly assumed to be typical of plant-pathogen interactions. Our model-based inference framework bridges theoretical models of G×G interactions with host and pathogen genomic data. It, therefore, paves the way for understanding the evolution of key G×G interactions underpinning HCV adaptation to the European human population after a recent expansion.
Collapse
Affiliation(s)
- Hanna Märkle
- Population Genetics, Department of Life Science Systems, School of Life Sciences, Technical University of Munich, Freising 85354Germany
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
| | - Sona John
- Population Genetics, Department of Life Science Systems, School of Life Sciences, Technical University of Munich, Freising 85354Germany
| | - Lukas Metzger
- Population Genetics, Department of Life Science Systems, School of Life Sciences, Technical University of Munich, Freising 85354Germany
| | - M Azim Ansari
- Nuffield Department of Medicine, Peter Medawar Building for Pathogen Research, University of Oxford, Oxford, UK
| | - Vincent Pedergnana
- Laboratoire MIVEGEC (UMR CNRS 5290, UR IRD 224, UM), Montpellier, France
| | - Aurélien Tellier
- Population Genetics, Department of Life Science Systems, School of Life Sciences, Technical University of Munich, Freising 85354Germany
| |
Collapse
|
11
|
Mirus T, Lohmayer R, Döhring C, Halldórsson BV, Kehr B. GGTyper: genotyping complex structural variants using short-read sequencing data. Bioinformatics 2024; 40:ii11-ii19. [PMID: 39230689 PMCID: PMC11373317 DOI: 10.1093/bioinformatics/btae391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/05/2024] Open
Abstract
MOTIVATION Complex structural variants (SVs) are genomic rearrangements that involve multiple segments of DNA. They contribute to human diversity and have been shown to cause Mendelian disease. Nevertheless, our abilities to analyse complex SVs are very limited. As opposed to deletions and other canonical types of SVs, there are no established tools that have explicitly been designed for analysing complex SVs. RESULTS Here, we describe a new computational approach that we specifically designed for genotyping complex SVs in short-read sequenced genomes. Given a variant description, our approach computes genotype-specific probability distributions for observing aligned read pairs with a wide range of properties. Subsequently, these distributions can be used to efficiently determine the most likely genotype for any set of aligned read pairs observed in a sequenced genome. In addition, we use these distributions to compute a genotyping difficulty for a given variant, which predicts the amount of data needed to achieve a reliable call. Careful evaluation confirms that our approach outperforms other genotypers by making reliable genotype predictions across both simulated and real data. On up to 7829 human genomes, we achieve high concordance with population-genetic assumptions and expected inheritance patterns. On simulated data, we show that precision correlates well with our prediction of genotyping difficulty. This together with low memory and time requirements makes our approach well-suited for application in biomedical studies involving small to very large numbers of short-read sequenced genomes. AVAILABILITY AND IMPLEMENTATION Source code is available at https://github.com/kehrlab/Complex-SV-Genotyping.
Collapse
Affiliation(s)
- Tim Mirus
- AG Algorithmic Bioinformatics, Leibniz-Institut für Immuntherapie, Regensburg 93053, Germany
| | - Robert Lohmayer
- AG Algorithmic Bioinformatics, Leibniz-Institut für Immuntherapie, Regensburg 93053, Germany
| | - Clementine Döhring
- AG Algorithmic Bioinformatics, Leibniz-Institut für Immuntherapie, Regensburg 93053, Germany
| | - Bjarni V Halldórsson
- deCODE genetics/Amgen Inc, Reykjavik 101, Iceland
- School of Technology, Reykjavik University, Reykjavic 102, Iceland
| | - Birte Kehr
- AG Algorithmic Bioinformatics, Leibniz-Institut für Immuntherapie, Regensburg 93053, Germany
- Fakultät für Informatik und Data Science, Universität Regensburg, Regensburg 93053, Germany
| |
Collapse
|
12
|
Li C, Zhao W, Zhou H, Wu J, Huo Y, Jiang D, Ji X, Liu K, Xu Q, Li W. Functional Mutations in the microRNA-155 Promoter Modulate its Transcription Efficiency and Expression. Mol Biotechnol 2024; 66:2262-2272. [PMID: 37624482 DOI: 10.1007/s12033-023-00857-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 08/07/2023] [Indexed: 08/26/2023]
Abstract
Limited research has been conducted on porcine miR-155 promoters, and previous study from our group have identified two haplotypes (TT and CC) in different pig breeds, each associated with five fully linked mutation sites within or near the miR-155 gene (Li et al. Dev Comp Immunol 39(1):110-116, 2013). In this study, the promoter region of porcine miR-155 was screened, and two important transcription factors, Foxp3 and RelA, were identified. The binding ability of Foxp3 protein was found to be affected by the first mutation site (A/C) using EMSA analysis. In vitro experiments revealed that the expression level of miR-155 was significantly higher in the C haplotype compared to the T haplotype. Additionally, northern blotting assays indicated that both the first mutation site (A/C) and the fourth mutation site (G/T) had a significant impact on miR-155 expression levels. These findings provide further insights into the transcriptional regulation of porcine miR-155 and identify crucial mutation sites that influence miR-155 expression. This knowledge can serve as a basis for identifying potential molecular markers associated with disease resistance in swine.
Collapse
Affiliation(s)
- Congcong Li
- College of Animal Science and Technology, Henan University of Animal Husbandry and Economy, No. 6 Longzi North Road, Zhengdong New District, Zhengzhou, 450046, Henan, China.
| | - Wanxia Zhao
- College of Animal Science and Technology, Henan University of Animal Husbandry and Economy, No. 6 Longzi North Road, Zhengdong New District, Zhengzhou, 450046, Henan, China
| | - Huijie Zhou
- College of Animal Science and Technology, Henan University of Animal Husbandry and Economy, No. 6 Longzi North Road, Zhengdong New District, Zhengzhou, 450046, Henan, China
| | - Jiao Wu
- College of Animal Science and Technology, Henan University of Animal Husbandry and Economy, No. 6 Longzi North Road, Zhengdong New District, Zhengzhou, 450046, Henan, China
| | - Yong Huo
- College of Animal Science and Technology, Henan University of Animal Husbandry and Economy, No. 6 Longzi North Road, Zhengdong New District, Zhengzhou, 450046, Henan, China
| | - Dongfeng Jiang
- College of Animal Science and Technology, Henan University of Animal Husbandry and Economy, No. 6 Longzi North Road, Zhengdong New District, Zhengzhou, 450046, Henan, China
| | - Xiangbo Ji
- Henan Key Laboratory of Unconventional Feed Resources Innovative Utilization, Henan University of Animal Husbandry and Economy, Zhengzhou, Henan, China
| | - Kun Liu
- Henan Key Laboratory of Unconventional Feed Resources Innovative Utilization, Henan University of Animal Husbandry and Economy, Zhengzhou, Henan, China
| | - Qiuliang Xu
- College of Animal Science and Technology, Henan University of Animal Husbandry and Economy, No. 6 Longzi North Road, Zhengdong New District, Zhengzhou, 450046, Henan, China
| | - Wantao Li
- College of Animal Science and Technology, Henan University of Animal Husbandry and Economy, No. 6 Longzi North Road, Zhengdong New District, Zhengzhou, 450046, Henan, China
| |
Collapse
|
13
|
Gao Y, Zhou Y, Lin Z, Chen F, Wu H, Peng C, Xie Y. Prioritizing drug targets in systemic lupus erythematosus from a genetic perspective: a druggable genome-wide Mendelian randomization study. Clin Rheumatol 2024; 43:2843-2856. [PMID: 38997544 PMCID: PMC11330408 DOI: 10.1007/s10067-024-07059-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 06/05/2024] [Accepted: 07/04/2024] [Indexed: 07/14/2024]
Abstract
OBJECTIVES Systemic lupus erythematosus (SLE) is a heterogeneous autoimmune disease with an unsatisfactory state of treatment. We aim to explore novel targets for SLE from a genetic standpoint. METHODS Cis-expression quantitative trait loci (eQTLs) for whole blood from 31,684 samples provided by the eQTLGen Consortium as well as two large SLE cohorts were utilized for screening and validating genes causally associated with SLE. Colocalization analysis was employed to further investigate whether changes in the expression of risk genes, as indicated by GWAS signals, influence the occurrence and development of SLE. Targets identified for drug development were evaluated for potential side effects using a phenome-wide association study (PheWAS). Based on the multiple databases, we explored the interactions between drugs and genes for drug prediction and the assessment of current medications. RESULTS The analysis comprised 5427 druggable genes in total. The two-sample Mendelian randomization (MR) in the discovery phase identified 20 genes causally associated with SLE and validated 8 genes in the replication phase. Colocalization analysis ultimately identified five genes (BLK, HIST1H3H, HSPA1A, IL12A, NEU1) with PPH4 > 0.8. PheWAS further indicated that drugs acting on BLK and IL12A are less likely to have potential side effects, while HSPA1A and NEU1 were associated with other traits. Four genes (BLK, HSPA1A, IL12A, NEU1) have been targeted for drug development in autoimmune diseases and other conditions. CONCLUSIONS .This study identified five genes as therapeutic targets for SLE. Repurposing and developing drugs targeting these genes is anticipated to improve the existing treatment state for SLE. Key Points • We identified five gene targets of priority for the treatment of SLE, with BLK and IL12A indicating fewer side effects. • Among the existing drugs that target these candidate genes, Ustekinumab, Ebdarokimab, and Briakinumab (targeting the IL12 gene) and CD24FC (targeting HSPA1A) may potentially be repurposed for the treatment of SLE.
Collapse
Affiliation(s)
- Yuan Gao
- The First Clinical College, Guangzhou Medical University, Guangzhou, China
| | - Youtao Zhou
- The First Clinical College, Guangzhou Medical University, Guangzhou, China
| | - Zikai Lin
- Nanshan College, Guangzhou Medical University, Guangzhou, China
| | - Fengzhen Chen
- The First Clinical College, Guangzhou Medical University, Guangzhou, China
| | - Haiyang Wu
- The Second Clinical College, Guangzhou Medical University, Guangzhou, China
| | - Chusheng Peng
- The First Clinical College, Guangzhou Medical University, Guangzhou, China
| | - Yingying Xie
- Department of Rheumatology and Immunology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou Medical University, No. 151, Yanjiang West Road, Yuexiu District, Guangzhou, China.
| |
Collapse
|
14
|
Wu XR, Wu BS, Kang JJ, Chen LM, Deng YT, Chen SD, Dong Q, Feng JF, Cheng W, Yu JT. Contribution of copy number variations to education, socioeconomic status and cognition from a genome-wide study of 305,401 subjects. Mol Psychiatry 2024:10.1038/s41380-024-02717-z. [PMID: 39215183 DOI: 10.1038/s41380-024-02717-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/04/2024] [Revised: 08/19/2024] [Accepted: 08/22/2024] [Indexed: 09/04/2024]
Abstract
Educational attainment (EA), socioeconomic status (SES) and cognition are phenotypically and genetically linked to health outcomes. However, the role of copy number variations (CNVs) in influencing EA/SES/cognition remains unclear. Using a large-scale (n = 305,401) genome-wide CNV-level association analysis, we discovered 33 CNV loci significantly associated with EA/SES/cognition, 20 of which were novel (deletions at 2p22.2, 2p16.2, 2p12, 3p25.3, 4p15.2, 5p15.33, 5q21.1, 8p21.3, 9p21.1, 11p14.3, 13q12.13, 17q21.31, and 20q13.33, as well as duplications at 3q12.2, 3q23, 7p22.3, 8p23.1, 8p23.2, 17q12 (105 kb), and 19q13.32). The genes identified in gene-level tests were enriched in biological pathways such as neurodegeneration, telomere maintenance and axon guidance. Phenome-wide association studies further identified novel associations of EA/SES/cognition-associated CNVs with mental and physical diseases, such as 6q27 duplication with upper respiratory disease and 17q12 (105 kb) duplication with mood disorders. Our findings provide a genome-wide CNV profile for EA/SES/cognition and bridge their connections to health. The expanded candidate CNVs database and the residing genes would be a valuable resource for future studies aimed at uncovering the biological mechanisms underlying cognitive function and related clinical phenotypes.
Collapse
Affiliation(s)
- Xin-Rui Wu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
| | - Bang-Sheng Wu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
| | - Ju-Jiao Kang
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
| | - Li-Min Chen
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
| | - Yue-Ting Deng
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
| | - Shi-Dong Chen
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
| | - Qiang Dong
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
| | - Jian-Feng Feng
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- Department of Computer Science, University of Warwick, Coventry, CV4 7AL, UK
| | - Wei Cheng
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China.
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China.
| | - Jin-Tai Yu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China.
| |
Collapse
|
15
|
Ma W, Chaisson MJ. Genotyping sequence-resolved copy-number variation using pangenomes reveals paralog-specific global diversity and expression divergence of duplicated genes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.11.607269. [PMID: 39149335 PMCID: PMC11326217 DOI: 10.1101/2024.08.11.607269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
Human pangenomes contain assemblies of non-reference copy-number variable (CNV) genes. We developed a new method, ctyper, to identify the copy-number of specific alleles of CNV genes cataloged in pangenomes with NGS datasets. Applying ctyper to the 1000-genomes samples revealed population stratification of paralogs and two classes of CNVs: recent CNVs due to ongoing duplications, and polymorphic CNVs from non-reference ancient paralogs. Expression quantitative trait locus analysis determined allele-specific expression within gene families, revealing that 7.94% of paralogs and 3.28% orthologs had significantly divergent expression. Case studies of individual genes include finding lower expression on SMN-1 copies that arose from conversion from SMN-2, and increased expression on a form of AMY2B that has undergone a translocation. Moreover, 4.7% of paralogs and 1.2% of orthologs had different most-expressed tissues. Furthermore, the genotypes explain more expression variance than known eQTL variants. Overall, ctyper enables biobank-scale genotyping of sequence-resolved CNVs.
Collapse
Affiliation(s)
- Walfred Ma
- Quantitative and Computational Biology, University of Southern California, CA, USA
| | - Mark Jp Chaisson
- Quantitative and Computational Biology, University of Southern California, CA, USA
- The Genomic and Epigenomic Regulation Program, USC Norris Cancer Center, University of Southern California, Los Angeles, California 90033, USA
| |
Collapse
|
16
|
Negi S, Stenton SL, Berger SI, McNulty B, Violich I, Gardner J, Hillaker T, O'Rourke SM, O'Leary MC, Carbonell E, Austin-Tse C, Lemire G, Serrano J, Mangilog B, VanNoy G, Kolmogorov M, Vilain E, O'Donnell-Luria A, Délot E, Miga KH, Monlong J, Paten B. Advancing long-read nanopore genome assembly and accurate variant calling for rare disease detection. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.08.22.24312327. [PMID: 39228712 PMCID: PMC11370519 DOI: 10.1101/2024.08.22.24312327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
More than 50% of families with suspected rare monogenic diseases remain unsolved after whole genome analysis by short read sequencing (SRS). Long-read sequencing (LRS) could help bridge this diagnostic gap by capturing variants inaccessible to SRS, facilitating long-range mapping and phasing, and providing haplotype-resolved methylation profiling. To evaluate LRS's additional diagnostic yield, we sequenced a rare disease cohort of 98 samples, including 41 probands and some family members, using nanopore sequencing, achieving per sample ∼36x average coverage and 32 kilobase (kb) read N50 from a single flow cell. Our Napu pipeline generated assemblies, phased variants, and methylation calls. LRS covered, on average, coding exons in ∼280 genes and ∼5 known Mendelian disease genes that were not covered by SRS. In comparison to SRS, LRS detected additional rare, functionally annotated variants, including SVs and tandem repeats, and completely phased 87% of protein-coding genes. LRS detected additional de novo variants, and could be used to distinguish postzygotic mosaic variants from prezygotic de novos . Eleven probands were solved, with diverse underlying genetic causes including de novo and compound heterozygous variants, large-scale SVs, and epigenetic modifications. Our study demonstrates LRS's potential to enhance diagnostic yield for rare monogenic diseases, implying utility in future clinical genomics workflows.
Collapse
|
17
|
Vialle RA, de Paiva Lopes K, Li Y, Ng B, Schneider JA, Buchman AS, Wang Y, Farfel JM, Barnes LL, Wingo AP, Wingo TS, Seyfried NT, De Jager PL, Gaiteri C, Tasaki S, Bennett DA. Structural variants linked to Alzheimer's Disease and other common age-related clinical and neuropathologic traits. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.08.12.24311887. [PMID: 39185527 PMCID: PMC11343262 DOI: 10.1101/2024.08.12.24311887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]
Abstract
Advances have led to a greater understanding of the genetics of Alzheimer's Disease (AD). However, the gap between the predicted and observed genetic heritability estimates when using single nucleotide polymorphisms (SNPs) and small indel data remains. Large genomic rearrangements, known as structural variants (SVs), have the potential to account for this missing genetic heritability. By leveraging data from two ongoing cohort studies of aging and dementia, the Religious Orders Study and Rush Memory and Aging Project (ROS/MAP), we performed genome-wide association analysis testing around 20,000 common SVs from 1,088 participants with whole genome sequencing (WGS) data. A range of Alzheimer's Disease and Related Disorders (AD/ADRD) clinical and pathologic traits were examined. Given the limited sample size, no genome-wide significant association was found, but we mapped SVs across 81 AD risk loci and discovered 22 SVs in linkage disequilibrium (LD) with GWAS lead variants and directly associated with AD/ADRD phenotypes (nominal P < 0.05). The strongest association was a deletion of an Alu element in the 3'UTR of the TMEM106B gene. This SV was in high LD with the respective AD GWAS locus and was associated with multiple AD/ADRD phenotypes, including tangle density, TDP-43, and cognitive resilience. The deletion of this element was also linked to lower TMEM106B protein abundance. We also found a 22 kb deletion associated with depression in ROSMAP and bearing similar association patterns as AD GWAS SNPs at the IQCK locus. In addition, genome-wide scans allowed the identification of 7 SVs, with no LD with SNPs and nominally associated with AD/ADRD traits. This result suggests potentially new ADRD risk loci not discoverable using SNP data. Among these findings, we highlight a 5.6 kb duplication of coding regions of the gene C1orf186 at chromosome 1 associated with indices of cognitive impairment, decline, and resilience. While further replication in independent datasets is needed to validate these findings, our results support the potential roles of common structural variations in the pathogenesis of AD/ADRD.
Collapse
Affiliation(s)
- Ricardo A Vialle
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - Katia de Paiva Lopes
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - Yan Li
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - Bernard Ng
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - Julie A Schneider
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - Aron S Buchman
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - Yanling Wang
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - Jose M Farfel
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - Lisa L Barnes
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - Aliza P Wingo
- Department of Psychiatry, University of California, Davis CA, USA
- VA Northern California Health Care System, McClellan Park, CA, USA
| | - Thomas S Wingo
- Department of Neurology, University of California, Davis, CA, USA
| | - Nicholas T Seyfried
- Goizueta Alzheimer's Disease Research Center, Department of Neurology and Department of Biochemistry, Emory University School of Medicine, Atlanta, Georgia, USA
| | - Philip L De Jager
- Department of Neurology, College of Physicians and Surgeons, Columbia University and the New York Presbyterian Hospital, New York, NY, USA
| | - Chris Gaiteri
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
- Department of Psychiatry, SUNY Upstate Medical University, Syracuse, NY, USA
| | - Shinya Tasaki
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| | - David A Bennett
- Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL, USA
| |
Collapse
|
18
|
Luo C, Liu YH, Zhou XM. VolcanoSV enables accurate and robust structural variant calling in diploid genomes from single-molecule long read sequencing. Nat Commun 2024; 15:6956. [PMID: 39138168 PMCID: PMC11322167 DOI: 10.1038/s41467-024-51282-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Accepted: 07/31/2024] [Indexed: 08/15/2024] Open
Abstract
Structural variants (SVs) significantly contribute to human genome diversity and play a crucial role in precision medicine. Although advancements in single-molecule long-read sequencing offer a groundbreaking resource for SV detection, identifying SV breakpoints and sequences accurately and robustly remains challenging. We introduce VolcanoSV, an innovative hybrid SV detection pipeline that utilizes both a reference genome and local de novo assembly to generate a phased diploid assembly. VolcanoSV uses phased SNPs and unique k-mer similarity analysis, enabling precise haplotype-resolved SV discovery. VolcanoSV is adept at constructing comprehensive genetic maps encompassing SNPs, small indels, and all types of SVs, making it well-suited for human genomics studies. Our extensive experiments demonstrate that VolcanoSV surpasses state-of-the-art assembly-based tools in the detection of insertion and deletion SVs, exhibiting superior recall, precision, F1 scores, and genotype accuracy across a diverse range of datasets, including low-coverage (10x) datasets. VolcanoSV outperforms assembly-based tools in the identification of complex SVs, including translocations, duplications, and inversions, in both simulated and real cancer data. Moreover, VolcanoSV is robust to various evaluation parameters and accurately identifies breakpoints and SV sequences.
Collapse
Affiliation(s)
- Can Luo
- Department of Biomedical Engineering, Vanderbilt University, Nashville, TN, USA
| | - Yichen Henry Liu
- Department of Computer Science, Vanderbilt University, Nashville, TN, USA
| | - Xin Maizie Zhou
- Department of Biomedical Engineering, Vanderbilt University, Nashville, TN, USA.
- Department of Computer Science, Vanderbilt University, Nashville, TN, USA.
- Data Science Institute, Vanderbilt University, Nashville, TN, USA.
| |
Collapse
|
19
|
Schultz LM, Knighton A, Huguet G, Saci Z, Jean-Louis M, Mollon J, Knowles EEM, Glahn DC, Jacquemont S, Almasy L. Copy-number variants differ in frequency across genetic ancestry groups. HGG ADVANCES 2024; 5:100340. [PMID: 39138864 PMCID: PMC11401192 DOI: 10.1016/j.xhgg.2024.100340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 08/07/2024] [Accepted: 08/07/2024] [Indexed: 08/15/2024] Open
Abstract
Copy-number variants (CNVs) have been implicated in a variety of neuropsychiatric and cognitive phenotypes. We found that deleterious CNVs are less prevalent in non-European ancestry groups than they are in European ancestry groups of both the UK Biobank (UKBB) and a US replication cohort (SPARK). We also identified specific recurrent CNVs that consistently differ in frequency across ancestry groups in both the UKBB and SPARK. These ancestry-related differences in CNV prevalence present in both an unselected community population and a family cohort enriched with individuals diagnosed with autism spectrum disorder (ASD) strongly suggest that genetic ancestry should be considered when probing associations between CNVs and health outcomes.
Collapse
Affiliation(s)
- Laura M Schultz
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA.
| | - Alexys Knighton
- School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, USA
| | | | - Zohra Saci
- CHU Sainte-Justine, Montréal, QC, Canada
| | | | - Josephine Mollon
- Department of Psychiatry and Behavioral Sciences, Boston Children's Hospital, Boston, MA, USA; Department of Psychiatry, Harvard Medical School, Boston, MA, USA
| | - Emma E M Knowles
- Department of Psychiatry and Behavioral Sciences, Boston Children's Hospital, Boston, MA, USA; Department of Psychiatry, Harvard Medical School, Boston, MA, USA
| | - David C Glahn
- Department of Psychiatry and Behavioral Sciences, Boston Children's Hospital, Boston, MA, USA; Department of Psychiatry, Harvard Medical School, Boston, MA, USA
| | - Sébastien Jacquemont
- CHU Sainte-Justine, Montréal, QC, Canada; Department of Pediatrics, Université de Montréal, Montréal, QC, Canada
| | - Laura Almasy
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
20
|
Foltz SM, Li Y, Yao L, Terekhanova NV, Weerasinghe A, Gao Q, Dong G, Schindler M, Cao S, Sun H, Jayasinghe RG, Fulton RS, Fronick CC, King J, Kohnen DR, Fiala MA, Chen K, DiPersio JF, Vij R, Ding L. Somatic mutation phasing and haplotype extension using linked-reads in multiple myeloma. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.09.607342. [PMID: 39149342 PMCID: PMC11326269 DOI: 10.1101/2024.08.09.607342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
Somatic mutation phasing informs our understanding of cancer-related events, like driver mutations. We generated linked-read whole genome sequencing data for 23 samples across disease stages from 14 multiple myeloma (MM) patients and systematically assigned somatic mutations to haplotypes using linked-reads. Here, we report the reconstructed cancer haplotypes and phase blocks from several MM samples and show how phase block length can be extended by integrating samples from the same individual. We also uncover phasing information in genes frequently mutated in MM, including DIS3, HIST1H1E, KRAS, NRAS, and TP53, phasing 79.4% of 20,705 high-confidence somatic mutations. In some cases, this enabled us to interpret clonal evolution models at higher resolution using pairs of phased somatic mutations. For example, our analysis of one patient suggested that two NRAS hotspot mutations occurred on the same haplotype but were independent events in different subclones. Given sufficient tumor purity and data quality, our framework illustrates how haplotype-aware analysis of somatic mutations in cancer can be beneficial for some cancer cases.
Collapse
Affiliation(s)
- Steven M. Foltz
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Yize Li
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Lijun Yao
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Nadezhda V. Terekhanova
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Amila Weerasinghe
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Qingsong Gao
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Guanlan Dong
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Moses Schindler
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Song Cao
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Hua Sun
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Reyka G. Jayasinghe
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Robert S. Fulton
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Catrina C. Fronick
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
| | - Justin King
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
| | - Daniel R. Kohnen
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
| | - Mark A. Fiala
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
| | - Ken Chen
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - John F. DiPersio
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, 63110, USA
| | - Ravi Vij
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, 63110, USA
| | - Li Ding
- Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
- McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
- Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University in St. Louis, St. Louis, MO, 63110, USA
| |
Collapse
|
21
|
Gretzinger J, Schmitt F, Mötsch A, Carlhoff S, Lamnidis TC, Huang Y, Ringbauer H, Knipper C, Francken M, Mandt F, Hansen L, Freund C, Posth C, Rathmann H, Harvati K, Wieland G, Granehäll L, Maixner F, Zink A, Schier W, Krausse D, Krause J, Schiffels S. Evidence for dynastic succession among early Celtic elites in Central Europe. Nat Hum Behav 2024; 8:1467-1480. [PMID: 38831077 PMCID: PMC11343710 DOI: 10.1038/s41562-024-01888-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 04/15/2024] [Indexed: 06/05/2024]
Abstract
The early Iron Age (800 to 450 BCE) in France, Germany and Switzerland, known as the 'West-Hallstattkreis', stands out as featuring the earliest evidence for supra-regional organization north of the Alps. Often referred to as 'early Celtic', suggesting tentative connections to later cultural phenomena, its societal and population structure remain enigmatic. Here we present genomic and isotope data from 31 individuals from this context in southern Germany, dating between 616 and 200 BCE. We identify multiple biologically related groups spanning three elite burials as far as 100 km apart, supported by trans-regional individual mobility inferred from isotope data. These include a close biological relationship between two of the richest burial mounds of the Hallstatt culture. Bayesian modelling points to an avuncular relationship between the two individuals, which may suggest a practice of matrilineal dynastic succession in early Celtic elites. We show that their ancestry is shared on a broad geographic scale from Iberia throughout Central-Eastern Europe, undergoing a decline after the late Iron Age (450 BCE to ~50 CE).
Collapse
Affiliation(s)
- Joscha Gretzinger
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Felicitas Schmitt
- Landesamt für Denkmalpflege im Regierungspräsidium Stuttgart, Esslingen, Germany
| | - Angela Mötsch
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Selina Carlhoff
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | | | - Yilei Huang
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Harald Ringbauer
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Corina Knipper
- Curt Engelhorn Zentrum Archäometrie gGmbH, Mannheim, Germany
| | - Michael Francken
- Landesamt für Denkmalpflege im Regierungspräsidium Stuttgart, Esslingen, Germany
| | - Franziska Mandt
- Landesamt für Denkmalpflege im Regierungspräsidium Stuttgart, Esslingen, Germany
| | - Leif Hansen
- Landesamt für Denkmalpflege im Regierungspräsidium Stuttgart, Esslingen, Germany
| | - Cäcilia Freund
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Cosimo Posth
- Institute for Archaeological Sciences, Department of Geosciences, Eberhard Karls University of Tübingen, Tübingen, Germany
- Senckenberg Centre for Human Evolution and Palaeoenvironment, Eberhard Karls University of Tübingen, Tübingen, Germany
| | - Hannes Rathmann
- Institute for Archaeological Sciences, Department of Geosciences, Eberhard Karls University of Tübingen, Tübingen, Germany
- Senckenberg Centre for Human Evolution and Palaeoenvironment, Eberhard Karls University of Tübingen, Tübingen, Germany
| | - Katerina Harvati
- Institute for Archaeological Sciences, Department of Geosciences, Eberhard Karls University of Tübingen, Tübingen, Germany
- Senckenberg Centre for Human Evolution and Palaeoenvironment, Eberhard Karls University of Tübingen, Tübingen, Germany
- DFG Center for Advanced Studies in the Humanities 'Words, Bones, Genes, Tools: Tracking Linguistic, Cultural and Biological Trajectories of the Human Past', Eberhard Karls University of Tübingen, Tübingen, Germany
| | - Günther Wieland
- Landesamt für Denkmalpflege im Regierungspräsidium Stuttgart, Esslingen, Germany
| | - Lena Granehäll
- Institute for Mummy Studies, EURAC Research, Bolzano, Italy
| | - Frank Maixner
- Institute for Mummy Studies, EURAC Research, Bolzano, Italy
| | - Albert Zink
- Institute for Mummy Studies, EURAC Research, Bolzano, Italy
| | - Wolfram Schier
- Institut für Prähistorische Archäologie, Freie Universität Berlin, Berlin, Germany
| | - Dirk Krausse
- Landesamt für Denkmalpflege im Regierungspräsidium Stuttgart, Esslingen, Germany.
| | - Johannes Krause
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany.
| | - Stephan Schiffels
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany.
| |
Collapse
|
22
|
L Rocha J, Lou RN, Sudmant PH. Structural variation in humans and our primate kin in the era of telomere-to-telomere genomes and pangenomics. Curr Opin Genet Dev 2024; 87:102233. [PMID: 39042999 DOI: 10.1016/j.gde.2024.102233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Revised: 07/02/2024] [Accepted: 07/05/2024] [Indexed: 07/25/2024]
Abstract
Structural variants (SVs) account for the majority of base pair differences both within and between primate species. However, our understanding of inter- and intra-species SV has been historically hampered by the quality of draft primate genomes and the absence of genome resources for key taxa. Recently, advances in long-read sequencing and genome assembly have begun to radically reshape our understanding of SVs. Two landmark achievements include the publication of a human telomere-to-telomere (T2T) genome as well as the development of the first human pangenome reference. In this review, we first look back to the major works laying the foundation for these projects. We then examine the ways in which T2T genome assemblies and pangenomes are transforming our understanding of and approach to primate SV. Finally, we discuss what the future of primate SV research may look like in the era of T2T genomes and pangenomics.
Collapse
Affiliation(s)
- Joana L Rocha
- Department of Integrative Biology, University of California, Berkeley, Berkeley, USA. https://twitter.com/@joanocha
| | - Runyang N Lou
- Department of Integrative Biology, University of California, Berkeley, Berkeley, USA. https://twitter.com/@NicolasLou10
| | - Peter H Sudmant
- Department of Integrative Biology, University of California, Berkeley, Berkeley, USA; Center for Computational Biology, University of California, Berkeley, Berkeley, USA.
| |
Collapse
|
23
|
Betancourt AJ, Wei KHC, Huang Y, Lee YCG. Causes and Consequences of Varying Transposable Element Activity: An Evolutionary Perspective. Annu Rev Genomics Hum Genet 2024; 25:1-25. [PMID: 38603565 DOI: 10.1146/annurev-genom-120822-105708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2024]
Abstract
Transposable elements (TEs) are genomic parasites found in nearly all eukaryotes, including humans. This evolutionary success of TEs is due to their replicative activity, involving insertion into new genomic locations. TE activity varies at multiple levels, from between taxa to within individuals. The rapidly accumulating evidence of the influence of TE activity on human health, as well as the rapid growth of new tools to study it, motivated an evaluation of what we know about TE activity thus far. Here, we discuss why TE activity varies, and the consequences of this variation, from an evolutionary perspective. By studying TE activity in nonhuman organisms in the context of evolutionary theories, we can shed light on the factors that affect TE activity. While the consequences of TE activity are usually deleterious, some have lasting evolutionary impacts by conferring benefits on the host or affecting other evolutionary processes.
Collapse
Affiliation(s)
- Andrea J Betancourt
- Institute of Infection, Veterinary, and Ecological Sciences, University of Liverpool, Liverpool, United Kingdom
| | - Kevin H-C Wei
- Department of Zoology, University of British Columbia, Vancouver, British Columbia, Canada
| | - Yuheng Huang
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California, USA
| | - Yuh Chwen G Lee
- Center for Complex Biological Systems, University of California, Irvine, California, USA;
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California, USA
| |
Collapse
|
24
|
Frost B, Dubnau J. The Role of Retrotransposons and Endogenous Retroviruses in Age-Dependent Neurodegenerative Disorders. Annu Rev Neurosci 2024; 47:123-143. [PMID: 38663088 DOI: 10.1146/annurev-neuro-082823-020615] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/09/2024]
Abstract
Over 40% of the human genome is composed of retrotransposons, DNA species that hold the potential to replicate via an RNA intermediate and are evolutionarily related to retroviruses. Retrotransposons are most studied for their ability to jump within a genome, which can cause DNA damage and novel insertional mutations. Retrotransposon-encoded products, including viral-like proteins, double-stranded RNAs, and extrachromosomal circular DNAs, can also be potent activators of the innate immune system. A growing body of evidence suggests that retrotransposons are activated in age-related neurodegenerative disorders and that such activation causally contributes to neurotoxicity. Here we provide an overview of retrotransposon biology and outline evidence of retrotransposon activation in age-related neurodegenerative disorders, with an emphasis on those involving TAR-DNA binding protein-43 (TDP-43) and tau. Studies to date provide the basis for ongoing clinical trials and hold promise for innovative strategies to ameliorate the adverse effects of retrotransposon dysregulation in neurodegenerative disorders.
Collapse
Affiliation(s)
- Bess Frost
- Sam and Ann Barshop Institute for Longevity and Aging Studies, Glenn Biggs Institute for Alzheimer's and Neurodegenerative Diseases, and Department of Cell Systems and Anatomy, University of Texas Health San Antonio, San Antonio, Texas, USA;
| | - Josh Dubnau
- Department of Anesthesiology and Department of Neurobiology and Behavior, Stony Brook School of Medicine, Stony Brook, New York, USA;
| |
Collapse
|
25
|
Zhao Z, Liu A, Citu C, Enduru N, Chen X, Manuel A, Sinha T, Gorski D, Fernandes B, Yu M, Schulz P, Simon L, Soto C. Single-nucleus multiomics reveals the disrupted regulatory programs in three brain regions of sporadic early-onset Alzheimer's disease. RESEARCH SQUARE 2024:rs.3.rs-4622123. [PMID: 39149497 PMCID: PMC11326379 DOI: 10.21203/rs.3.rs-4622123/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
Sporadic early-onset Alzheimer's disease (sEOAD) represents a significant but less-studied subtype of Alzheimer's disease (AD). Here, we generated a single-nucleus multiome atlas derived from the postmortem prefrontal cortex, entorhinal cortex, and hippocampus of nine individuals with or without sEOAD. Comprehensive analyses were conducted to delineate cell type-specific transcriptomic changes and linked candidate cis-regulatory elements (cCREs) across brain regions. We prioritized seven conservative transcription factors in glial cells in multiple brain regions, including RFX4 in astrocytes and IKZF1 in microglia, which are implicated in regulating sEOAD-associated genes. Moreover, we identified the top 25 altered intercellular signaling between glial cells and neurons, highlighting their regulatory potential on gene expression in receiver cells. We reported 38 cCREs linked to sEOAD-associated genes overlapped with late-onset AD risk loci, and sEOAD cCREs enriched in neuropsychiatric disorder risk loci. This atlas helps dissect transcriptional and chromatin dynamics in sEOAD, providing a key resource for AD research.
Collapse
Affiliation(s)
- Zhongming Zhao
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston
- Faillace Department of Psychiatry and Behavioral Sciences, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Andi Liu
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston
- Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Citu Citu
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston
| | - Nitesh Enduru
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston
- Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Xian Chen
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston
| | - Astrid Manuel
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston
| | - Tirthankar Sinha
- Mitchell Center for Alzheimer's Disease and Related Brain Disorders, Department of Neurology, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Damian Gorski
- Mitchell Center for Alzheimer's Disease and Related Brain Disorders, Department of Neurology, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Brisa Fernandes
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston
| | - Meifang Yu
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston
| | - Paul Schulz
- Department of Neurology, McGovern School of Medicine, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Lukas Simon
- Therapeutic Innovation Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Claudio Soto
- Mitchell Center for Alzheimer's Disease and Related Brain Disorders, Department of Neurology, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| |
Collapse
|
26
|
Burt CH. Polygenic Indices (a.k.a. Polygenic Scores) in Social Science: A Guide for Interpretation and Evaluation. SOCIOLOGICAL METHODOLOGY 2024; 54:300-350. [PMID: 39091537 PMCID: PMC11293310 DOI: 10.1177/00811750241236482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
Polygenic indices (PGI)-the new recommended label for polygenic scores (PGS) in social science-are genetic summary scales often used to represent an individual's liability for a disease, trait, or behavior based on the additive effects of measured genetic variants. Enthusiasm for linking genetic data with social outcomes and the inclusion of premade PGIs in social science datasets have facilitated increased uptake of PGIs in social science research-a trend that will likely continue. Yet, most social scientists lack the expertise to interpret and evaluate PGIs in social science research. Here, we provide a primer on PGIs for social scientists focusing on key concepts, unique statistical genetic considerations, and best practices in calculation, estimation, reporting, and interpretation. We summarize our recommended best practices as a checklist to aid social scientists in evaluating and interpreting studies with PGIs. We conclude by discussing the similarities between PGIs and standard social science scales and unique interpretative considerations.
Collapse
|
27
|
Sun Y, Dong H, Sun C, Du D, Gao R, Voevoda M, Knyazev R, Wu N. Investigating the association between gut microbiome and aortic aneurysm diseases: a bidirectional two-sample Mendelian randomization analysis. Front Cell Infect Microbiol 2024; 14:1406845. [PMID: 39139765 PMCID: PMC11319299 DOI: 10.3389/fcimb.2024.1406845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Accepted: 07/11/2024] [Indexed: 08/15/2024] Open
Abstract
Objective This study aims to investigate the associations between specific bacterial taxa of the gut microbiome and the development of aortic aneurysm diseases, utilizing Mendelian Randomization (MR) to explore these associations and overcome the confounding factors commonly present in observational studies. Methods Employing the largest available gut microbiome and aortic aneurysm Genome-Wide Association Study databases, including MiBioGen, Dutch Microbiome Project, FinnGen, UK Biobank, and Michigan Genomics Initiative, this study performs two-sample bidirectional MR analyses. Instrumental variables, linked to microbiome taxa at significant levels, were selected for identifying relationships with abdominal aortic aneurysms (AAA), thoracic aortic aneurysms (TAA), and aortic dissection (AD). Methods like inverse variance weighted, MR-PRESSO, MR-Egger, weighted median, simple mode, and mode-based estimate were used for MR analysis. Heterogeneity was assessed with the Cochran Q test. MR-Egger regression and MR-PRESSO addressed potential unbalanced horizontal pleiotropy. Results The analysis did not find any evidence of statistically significant associations between the gut microbiome and aortic aneurysm diseases after adjusting for the false discovery rate (FDR). Specifically, while initial results suggested correlations between 19 taxa and AAA, 25 taxa and TAA, and 13 taxa with AD, these suggested associations did not hold statistical significance post-FDR correction. Therefore, the role of individual gut microbial taxa as independent factors in the development and progression of aortic aneurysm diseases remains inconclusive. This finding underscores the necessity for larger sample sizes and more comprehensive studies to further investigate these potential links. Conclusion The study emphasizes the complex relationship between the gut microbiome and aortic aneurysm diseases. Although no statistically significant associations were found after FDR correction, the findings provide valuable insights and highlight the importance of considering gut microbiota in aortic aneurysm diseases research. Understanding these interactions may eventually contribute to identifying new therapeutic and preventive strategies for aortic aneurysm diseases.
Collapse
Affiliation(s)
- Yaodong Sun
- Department of Cardiovascular Surgery, Tianjin Medical University General Hospital, Tianjin, China
| | - Haoju Dong
- Department of Cardiovascular Surgery, Tianjin Medical University General Hospital, Tianjin, China
- Pediatric Cardiac Surgery, Fuwai Central China Cardiovascular Hospital, Zhengzhou, China
- Department of Cardiovascular Surgery, Henan Provincial People’s Hospital, Zhengzhou, China
| | - Chao Sun
- Department of Orthopedic Surgery, Tianjin Medical University General Hospital, Tianjin, China
| | - Dongdong Du
- Department of Cardiovascular Surgery, Tianjin Medical University General Hospital, Tianjin, China
- Department of Cardiovascular Surgery, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, Shandong, China
| | - Ruirong Gao
- Department of Cardiovascular Surgery, Tianjin Medical University General Hospital, Tianjin, China
| | - Mikhail Voevoda
- Federal Research Center of Fundamental and Translational Medicine (FRC FTM), Novosibirsk, Russia
| | - Roman Knyazev
- Federal Research Center of Fundamental and Translational Medicine (FRC FTM), Novosibirsk, Russia
| | - Naishi Wu
- Department of Cardiovascular Surgery, Tianjin Medical University General Hospital, Tianjin, China
| |
Collapse
|
28
|
Ormond C, Ryan NM, Hedman AM, Cannon TD, Sullivan PF, Gill M, Hultman C, Heron EA, Johansson V, Corvin A. Whole genome sequencing study of identical twins discordant for psychosis. Transl Psychiatry 2024; 14:313. [PMID: 39080272 PMCID: PMC11289105 DOI: 10.1038/s41398-024-02982-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 06/14/2024] [Accepted: 06/18/2024] [Indexed: 08/02/2024] Open
Abstract
Monozygotic (MZ) twins are often thought to have identical genomes, but recent work has shown that early post-zygotic events can result in a spectrum of DNA variants that are different between MZ twins. Such variants may explain phenotypic discordance and contribute to disease etiology. Here we performed whole genome sequencing in 17 pairs of MZ twins discordant for a psychotic disorder (schizophrenia, schizoaffective disorder or bipolar disorder). We examined various classes of rare variants that are discordant within a twin pair. We identified four genes harboring rare, predicted deleterious missense variants that were private to an affected individual in the cohort. Variants in FOXN1 and FLOT2 would have been categorized as damaging from recent schizophrenia and bipolar exome sequencing studies. Additionally, we identified four rare genic copy number variants (CNVs) private to an affected sample, two of which overlapped genes that have shown evidence for association with schizophrenia or bipolar disorder. One such CNV was a 3q29 duplication previously implicated in autism and developmental delay. We have performed the largest MZ twin study for discordant psychotic phenotypes to date. These findings warrant further investigation using other analytical approaches.
Collapse
Affiliation(s)
- Cathal Ormond
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity College Dublin, Dublin, Ireland
| | - Niamh M Ryan
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity College Dublin, Dublin, Ireland
| | - Anna M Hedman
- Department of Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Tyrone D Cannon
- Departments of Psychology and Psychiatry, Yale University School of Medicine, New Haven, CT, USA
| | - Patrick F Sullivan
- Department of Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Departments of Genetics and Psychiatry, University of North Carolina, Chapel Hill, NC, USA
| | - Michael Gill
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity College Dublin, Dublin, Ireland
| | - Christina Hultman
- Department of Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Elizabeth A Heron
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity College Dublin, Dublin, Ireland
| | - Viktoria Johansson
- Department of Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Department of Clinical Sciences, Psychiatry Unit, Umeå University, Umeå, Sweden
| | - Aiden Corvin
- Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity College Dublin, Dublin, Ireland.
| |
Collapse
|
29
|
Yuan N, Jia P. Comprehensive assessment of long-read sequencing platforms and calling algorithms for detection of copy number variation. Brief Bioinform 2024; 25:bbae441. [PMID: 39256200 PMCID: PMC11387058 DOI: 10.1093/bib/bbae441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 07/09/2024] [Accepted: 08/25/2024] [Indexed: 09/12/2024] Open
Abstract
Copy number variations (CNVs) play pivotal roles in disease susceptibility and have been intensively investigated in human disease studies. Long-read sequencing technologies offer opportunities for comprehensive structural variation (SV) detection, and numerous methodologies have been developed recently. Consequently, there is a pressing need to assess these methods and aid researchers in selecting appropriate techniques for CNV detection using long-read sequencing. Hence, we conducted an evaluation of eight CNV calling methods across 22 datasets from nine publicly available samples and 15 simulated datasets, covering multiple sequencing platforms. The overall performance of CNV callers varied substantially and was influenced by the input dataset type, sequencing depth, and CNV type, among others. Specifically, the PacBio CCS sequencing platform outperformed PacBio CLR and Nanopore platforms regarding CNV detection recall rates. A sequencing depth of 10x demonstrated the capability to identify 85% of the CNVs detected in a 50x dataset. Moreover, deletions were more generally detectable than duplications. Among the eight benchmarked methods, cuteSV, Delly, pbsv, and Sniffles2 demonstrated superior accuracy, while SVIM exhibited high recall rates.
Collapse
Affiliation(s)
- Na Yuan
- National Genomics Data Center, China National Center for Bioinformation, Beichen West Road, Chaoyang District, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beichen West Road, Chaoyang District, Beijing 100101, China
| | - Peilin Jia
- National Genomics Data Center, China National Center for Bioinformation, Beichen West Road, Chaoyang District, Beijing 100101, China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beichen West Road, Chaoyang District, Beijing 100101, China
| |
Collapse
|
30
|
Sarwal V, Lee S, Yang J, Sankararaman S, Chaisson M, Eskin E, Mangul S. VISTA: an integrated framework for structural variant discovery. Brief Bioinform 2024; 25:bbae462. [PMID: 39297879 PMCID: PMC11411772 DOI: 10.1093/bib/bbae462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 08/27/2024] [Accepted: 09/07/2024] [Indexed: 09/26/2024] Open
Abstract
Structural variation (SV) refers to insertions, deletions, inversions, and duplications in human genomes. SVs are present in approximately 1.5% of the human genome. Still, this small subset of genetic variation has been implicated in the pathogenesis of psoriasis, Crohn's disease and other autoimmune disorders, autism spectrum and other neurodevelopmental disorders, and schizophrenia. Since identifying structural variants is an important problem in genetics, several specialized computational techniques have been developed to detect structural variants directly from sequencing data. With advances in whole-genome sequencing (WGS) technologies, a plethora of SV detection methods have been developed. However, dissecting SVs from WGS data remains a challenge, with the majority of SV detection methods prone to a high false-positive rate, and no existing method able to precisely detect a full range of SVs present in a sample. Previous studies have shown that none of the existing SV callers can maintain high accuracy across various SV lengths and genomic coverages. Here, we report an integrated structural variant calling framework, Variant Identification and Structural Variant Analysis (VISTA), that leverages the results of individual callers using a novel and robust filtering and merging algorithm. In contrast to existing consensus-based tools which ignore the length and coverage, VISTA overcomes this limitation by executing various combinations of top-performing callers based on variant length and genomic coverage to generate SV events with high accuracy. We evaluated the performance of VISTA on comprehensive gold-standard datasets across varying organisms and coverage. We benchmarked VISTA using the Genome-in-a-Bottle gold standard SV set, haplotype-resolved de novo assemblies from the Human Pangenome Reference Consortium, along with an in-house polymerase chain reaction (PCR)-validated mouse gold standard set. VISTA maintained the highest F1 score among top consensus-based tools measured using a comprehensive gold standard across both mouse and human genomes. VISTA also has an optimized mode, where the calls can be optimized for precision or recall. VISTA-optimized can attain 100% precision and the highest sensitivity among other variant callers. In conclusion, VISTA represents a significant advancement in structural variant calling, offering a robust and accurate framework that outperforms existing consensus-based tools and sets a new standard for SV detection in genomic research.
Collapse
Affiliation(s)
- Varuni Sarwal
- Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, United States
| | - Seungmo Lee
- Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, United States
| | - Jianzhi Yang
- Department of Quantitative and Computational Biology, Dana and David Dornsife College of Letters, Arts and Sciences University of Southern California, 3540 S Figueroa St, Los Angeles, California 90089, United States
| | - Sriram Sankararaman
- Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, United States
| | - Mark Chaisson
- Department of Quantitative and Computational Biology, Dana and David Dornsife College of Letters, Arts and Sciences University of Southern California, 3540 S Figueroa St, Los Angeles, California 90089, United States
| | - Eleazar Eskin
- Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA 90095, United States
| | - Serghei Mangul
- Department of Quantitative and Computational Biology, Dana and David Dornsife College of Letters, Arts and Sciences University of Southern California, 3540 S Figueroa St, Los Angeles, California 90089, United States
- Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy, University of Southern California, 1540 Alcazar Street, Los Angeles, CA 90033, United States
| |
Collapse
|
31
|
Wang C, Liu H, Li XY, Ma J, Gu Z, Feng X, Xie S, Tang BS, Chen S, Wang W, Wang J, Zhang J, Chan P. High-depth whole-genome sequencing identifies structure variants, copy number variants and short tandem repeats associated with Parkinson's disease. NPJ Parkinsons Dis 2024; 10:134. [PMID: 39043730 PMCID: PMC11266557 DOI: 10.1038/s41531-024-00722-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Accepted: 05/10/2024] [Indexed: 07/25/2024] Open
Abstract
While numerous single nucleotide variants and small indels have been identified in Parkinson's disease (PD), the contribution of structural variants (SVs), copy number variants (CNVs), and short tandem repeats (STRs) remains poorly understood. Here we investigated the association using the high-depth whole-genome sequencing data from 466 Chinese PD patients and 513 controls. Totally, we identified 29,561 SVs, 32,153 CNVs, and 174,905 STRs, and found that CNV deletions were significantly enriched in the end-proportion of autosomal chromosomes in PD. After genome-wide association analysis and replication in an external cohort of 352 cases and 547 controls, we validated that the 1.6 kb-deletion neighboring MUC19, 12.4kb-deletion near RXFP1 and GGGAAA repeats in SLC2A13 were significantly associated with PD. Moreover, the MUC19 deletion and the SLC2A13 5-copy repeat reduced the penetrance of the LRRK2 G2385R variant. Moreover, genes with these variants were dosage-sensitive. These data provided novel insights into the genetic architecture of PD.
Collapse
Affiliation(s)
- Chaodong Wang
- Department of Neurology & Neurobiology, Xuanwu Hospital of Capital Medical University, National Clinical Research Center for Geriatric Diseases, Beijing, 100053, China
| | - Hankui Liu
- BGI-Shenzhen, Beishan Industrial Zone, Shenzhen, 518083, China
| | - Xu-Ying Li
- Department of Neurology & Neurobiology, Xuanwu Hospital of Capital Medical University, National Clinical Research Center for Geriatric Diseases, Beijing, 100053, China
| | - Jinghong Ma
- Department of Neurology & Neurobiology, Xuanwu Hospital of Capital Medical University, National Clinical Research Center for Geriatric Diseases, Beijing, 100053, China
| | - Zhuqin Gu
- Department of Neurology & Neurobiology, Xuanwu Hospital of Capital Medical University, National Clinical Research Center for Geriatric Diseases, Beijing, 100053, China
| | - Xiuli Feng
- National Human Genome Center in Beijing, Beijing Economic-Technological Development Zone, Beijing, 100176, China
| | - Shu Xie
- National Human Genome Center in Beijing, Beijing Economic-Technological Development Zone, Beijing, 100176, China
| | - Bei-Sha Tang
- Department of Neurology, Xiangya Hospital, Central South University, State Key Laboratory of Medical Genetics, Changsha, China
| | - Shengdi Chen
- Department of Neurology, Ruijin Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Wei Wang
- BGI-Shenzhen, Beishan Industrial Zone, Shenzhen, 518083, China
| | - Jian Wang
- BGI-Shenzhen, Beishan Industrial Zone, Shenzhen, 518083, China
| | - Jianguo Zhang
- BGI-Shenzhen, Beishan Industrial Zone, Shenzhen, 518083, China.
- Hebei Industrial Technology Research Institute of Genomics in Maternal & Child Health, Shijiazhuang, 050000, China.
| | - Piu Chan
- Department of Neurology & Neurobiology, Xuanwu Hospital of Capital Medical University, National Clinical Research Center for Geriatric Diseases, Beijing, 100053, China.
- Advanced Innovation Center for Human Brain Protection, Capital Medical University, Beijing, China.
- Clinical Center for Parkinson's Disease, Capital Medical University, Key Laboratory for Neurodegenerative Disease of the Ministry of Education, Beijing Key Laboratory for Parkinson's Disease, Beijing, China.
- Beijing Institute of Brain Disorders, Collaborative Innovation Center for Brain Disorders, Capital Medical University, Beijing, China.
| |
Collapse
|
32
|
Lin HJ, Huang YT, Liao WL, Huang YC, Chang YW, Weng AL, Tsai FJ. Developing a Polygenic Risk Score with Age and Sex to Identify High-Risk Myopia in Taiwan. Biomedicines 2024; 12:1619. [PMID: 39062192 PMCID: PMC11274619 DOI: 10.3390/biomedicines12071619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Revised: 07/12/2024] [Accepted: 07/15/2024] [Indexed: 07/28/2024] Open
Abstract
Myopia is the leading cause of impaired vision, and its prevalence is increasing among Asian populations. This study aimed to develop a polygenic risk score (PRS) followed by replication to predict myopia in the Taiwanese population. In total, 23,688 participants with cycloplegic autorefraction-measured mean spherical equivalent (SE), genetic, and demographic data were included. The myopia PRS was generated based on genome-wide association study (GWAS) outcomes in a Taiwanese population and previously published GWAS reports. The results demonstrated that the inclusion of age and sex in the PRS had an area under the curve (AUC) of 0.80, 0.78, and 0.73 (p < 0.001) for participants aged >18 years with high (SE < -6.0 diopters (D); n = 1089), moderate (-6.0 D < SE ≤ -3.0 D; n = 3929), and mild myopia (-3.0 D < SE ≤ -1.0 D; n = 2241), respectively. Participants in the top PRS quartile had a 1.30-fold greater risk of high myopia (95% confidence interval = 1.09-1.55, p = 0.003) compared with that in the remaining participants. Further, a higher PRS significantly increased the risk of high myopia (SE ≤ -2.0 D) in children ≤6 years of age (p = 0.027). In conclusion, including the PRS, age, and sex improved the prediction of high myopia risk in the Taiwanese population.
Collapse
Affiliation(s)
- Hui-Ju Lin
- Department of Ophthalmology, China Medical University Hospital, Taichung 404327, Taiwan; (H.-J.L.); (Y.-T.H.)
- School of Chinese Medicine, China Medical University, Taichung 404328, Taiwan;
| | - Yu-Te Huang
- Department of Ophthalmology, China Medical University Hospital, Taichung 404327, Taiwan; (H.-J.L.); (Y.-T.H.)
| | - Wen-Ling Liao
- Center for Personalized Medicine, China Medical University Hospital, Taichung 404327, Taiwan;
- Graduate Institute of Integrated Medicine, China Medical University, Taichung 404328, Taiwan
| | - Yu-Chuen Huang
- School of Chinese Medicine, China Medical University, Taichung 404328, Taiwan;
- Genetic Center, Department of Medical Research, China Medical University Hospital, Taichung 404327, Taiwan;
| | - Ya-Wen Chang
- Genetic Center, Department of Medical Research, China Medical University Hospital, Taichung 404327, Taiwan;
| | - Angel L. Weng
- American School in Taichung, Taichung 406051, Taiwan;
| | - Fuu-Jen Tsai
- School of Chinese Medicine, China Medical University, Taichung 404328, Taiwan;
- Department of Medical Genetics, China Medical University Hospital, Taichung 404327, Taiwan
- Children’s Hospital of China Medical University, Taichung 404327, Taiwan
| |
Collapse
|
33
|
Junjun R, Zhengqian Z, Ying W, Jialiang W, Yongzhuang L. A comprehensive review of deep learning-based variant calling methods. Brief Funct Genomics 2024; 23:303-313. [PMID: 38366908 DOI: 10.1093/bfgp/elae003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 01/14/2024] [Accepted: 01/18/2023] [Indexed: 02/18/2024] Open
Abstract
Genome sequencing data have become increasingly important in the field of personalized medicine and diagnosis. However, accurately detecting genomic variations remains a challenging task. Traditional variation detection methods rely on manual inspection or predefined rules, which can be time-consuming and prone to errors. Consequently, deep learning-based approaches for variation detection have gained attention due to their ability to automatically learn genomic features that distinguish between variants. In our review, we discuss the recent advancements in deep learning-based algorithms for detecting small variations and structural variations in genomic data, as well as their advantages and limitations.
Collapse
Affiliation(s)
- Ren Junjun
- Harbin Institute of Technology, School of Computer Science and Technology, Harbin 150001, China
| | - Zhang Zhengqian
- Harbin Institute of Technology, School of Computer Science and Technology, Harbin 150001, China
| | - Wu Ying
- Harbin Institute of Technology, School of Computer Science and Technology, Harbin 150001, China
| | - Wang Jialiang
- Harbin Institute of Technology, School of Computer Science and Technology, Harbin 150001, China
| | - Liu Yongzhuang
- Harbin Institute of Technology, School of Computer Science and Technology, Harbin 150001, China
| |
Collapse
|
34
|
Subramanian K, Chopra M, Kahali B. Landscape of genomic structural variations in Indian population-based cohorts: Deeper insights into their prevalence and clinical relevance. HGG ADVANCES 2024; 5:100285. [PMID: 38521976 PMCID: PMC11007539 DOI: 10.1016/j.xhgg.2024.100285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 03/13/2024] [Accepted: 03/20/2024] [Indexed: 03/25/2024] Open
Abstract
Structural variations (SV) are large (>50 base pairs) genomic rearrangements comprising deletions, duplications, insertions, inversions, and translocations. Studying SVs is important because they play active and critical roles in regulating gene expression, determining disease predispositions, and identifying population-specific differences among individuals of diverse ancestries. However, SV discoveries in the Indian population using whole-genome sequencing (WGS) have been limited. In this study, using short-read WGS having an average 42X depth of coverage, we identify and characterize 36,210 SVs from 529 individuals enrolled in population-based cohorts in India. These SVs include 24,574 deletions, 2,913 duplications, 8,710 insertions, and 13 inversions; 1.26% (456 out of 36,210) of the identified SVs can potentially impact the coding regions of genes. Furthermore, 56 of these SVs are highly intolerant to loss-of-function changes to the mapped genes, and five SVs impacting ADAMTS17, CCDC40, and RHCE are common in our study individuals. Seven rare SVs significantly impact dosage sensitivity of genes known to be associated with various clinical phenotypes. Most of the SVs in our study are rare and heterozygous. This fine-scale SV discovery in the underrepresented Indian population provides valuable insights that extend beyond Eurocentric human genetic studies.
Collapse
Affiliation(s)
- Krithika Subramanian
- Centre for Brain Research, Indian Institute of Science, Bangalore 560012, India; Manipal Academy of Higher Education, Manipal, Karnataka 576104, India
| | - Mehak Chopra
- Centre for Brain Research, Indian Institute of Science, Bangalore 560012, India
| | - Bratati Kahali
- Centre for Brain Research, Indian Institute of Science, Bangalore 560012, India.
| |
Collapse
|
35
|
Pei Y, Tanguy M, Giess A, Dixit A, Wilson LC, Gibbons RJ, Twigg SRF, Elgar G, Wilkie AOM. A Comparison of Structural Variant Calling from Short-Read and Nanopore-Based Whole-Genome Sequencing Using Optical Genome Mapping as a Benchmark. Genes (Basel) 2024; 15:925. [PMID: 39062704 PMCID: PMC11276380 DOI: 10.3390/genes15070925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Revised: 07/03/2024] [Accepted: 07/11/2024] [Indexed: 07/28/2024] Open
Abstract
The identification of structural variants (SVs) in genomic data represents an ongoing challenge because of difficulties in reliable SV calling leading to reduced sensitivity and specificity. We prepared high-quality DNA from 9 parent-child trios, who had previously undergone short-read whole-genome sequencing (Illumina platform) as part of the Genomics England 100,000 Genomes Project. We reanalysed the genomes using both Bionano optical genome mapping (OGM; 8 probands and one trio) and Nanopore long-read sequencing (Oxford Nanopore Technologies [ONT] platform; all samples). To establish a "truth" dataset, we asked whether rare proband SV calls (n = 234) made by the Bionano Access (version 1.6.1)/Solve software (version 3.6.1_11162020) could be verified by individual visualisation using the Integrative Genomics Viewer with either or both of the Illumina and ONT raw sequence. Of these, 222 calls were verified, indicating that Bionano OGM calls have high precision (positive predictive value 95%). We then asked what proportion of the 222 true Bionano SVs had been identified by SV callers in the other two datasets. In the Illumina dataset, sensitivity varied according to variant type, being high for deletions (115/134; 86%) but poor for insertions (13/58; 22%). In the ONT dataset, sensitivity was generally poor using the original Sniffles variant caller (48% overall) but improved substantially with use of Sniffles2 (36/40; 90% and 17/23; 74% for deletions and insertions, respectively). In summary, we show that the precision of OGM is very high. In addition, when applying the Sniffles2 caller, the sensitivity of SV calling using ONT long-read sequence data outperforms Illumina sequencing for most SV types.
Collapse
Affiliation(s)
- Yang Pei
- Clinical Genetics Group, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK; (Y.P.); (S.R.F.T.)
| | - Melanie Tanguy
- Genomics England Limited, One Canada Square, London E14 5AB, UK
| | - Adam Giess
- Genomics England Limited, One Canada Square, London E14 5AB, UK
| | - Abhijit Dixit
- Clinical Genetics Service, Nottingham University Hospitals NHS Foundation Trust, City Hospital, Nottingham NG5 1PB, UK
| | - Louise C. Wilson
- North East Thames Regional Genetics Service, Great Ormond Street Hospital for Children NHS Foundation Trust, Great Ormond Street Hospital, London WC1N 3JH, UK
| | - Richard J. Gibbons
- MRC Molecular Haematology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK
| | - Stephen R. F. Twigg
- Clinical Genetics Group, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK; (Y.P.); (S.R.F.T.)
| | - Greg Elgar
- Genomics England Limited, One Canada Square, London E14 5AB, UK
| | - Andrew O. M. Wilkie
- Clinical Genetics Group, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK; (Y.P.); (S.R.F.T.)
| |
Collapse
|
36
|
Liu Z, Xie Z, Li M. Comprehensive and deep evaluation of structural variation detection pipelines with third-generation sequencing data. Genome Biol 2024; 25:188. [PMID: 39010145 PMCID: PMC11247875 DOI: 10.1186/s13059-024-03324-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 06/26/2024] [Indexed: 07/17/2024] Open
Abstract
BACKGROUND Structural variation (SV) detection methods using third-generation sequencing data are widely employed, yet accurately detecting SVs remains challenging. Different methods often yield inconsistent results for certain SV types, complicating tool selection and revealing biases in detection. RESULTS This study comprehensively evaluates 53 SV detection pipelines using simulated and real data from PacBio (CLR: Continuous Long Read, CCS: Circular Consensus Sequencing) and Nanopore (ONT) platforms. We assess their performance in detecting various sizes and types of SVs, breakpoint biases, and genotyping accuracy with various sequencing depths. Notably, pipelines such as Minimap2-cuteSV2, NGMLR-SVIM, PBMM2-pbsv, Winnowmap-Sniffles2, and Winnowmap-SVision exhibit comparatively higher recall and precision. Our findings also show that combining multiple pipelines with the same aligner, like pbmm2 or winnowmap, can significantly enhance performance. The individual pipelines' detailed ranking and performance metrics can be viewed in a dynamic table: http://pmglab.top/SVPipelinesRanking . CONCLUSIONS This study comprehensively characterizes the strengths and weaknesses of numerous pipelines, providing valuable insights that can improve SV detection in third-generation sequencing data and inform SV annotation and function prediction.
Collapse
Affiliation(s)
- Zhi Liu
- Program in Bioinformatics, Zhongshan School of Medicine, The Fifth Affiliated Hospital, Sun Yat-Sen University, Guangzhou, China
- Key Laboratory of Tropical Disease Control (Sun Yat-Sen University), Ministry of Education, Guangzhou, China
| | - Zhi Xie
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Miaoxin Li
- Program in Bioinformatics, Zhongshan School of Medicine, The Fifth Affiliated Hospital, Sun Yat-Sen University, Guangzhou, China.
- Key Laboratory of Tropical Disease Control (Sun Yat-Sen University), Ministry of Education, Guangzhou, China.
- Center for Precision Medicine, Sun Yat-Sen University, Guangzhou, China.
- Department of Psychiatry, The University of Hong Kong, Hong Kong, SAR, China.
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-Sen University, Zhuhai, China.
| |
Collapse
|
37
|
Sun G, Huang W, Wang L, Wu J, Zhao G, Ren H, Liu L, Kong X. Molecular findings in patients for whole exome sequencing and mitochondrial genome assessment. Clin Chim Acta 2024; 561:119774. [PMID: 38852791 DOI: 10.1016/j.cca.2024.119774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2024] [Revised: 05/20/2024] [Accepted: 06/06/2024] [Indexed: 06/11/2024]
Abstract
OBJECTIVE Whole exome sequencing (WES) is becoming more widely used as a diagnostic tool in the field of medicine. In this article, we reported the diagnostic yield of WES and mitochondrial genome assessment in 2226 consecutive cases in a single clinical laboratory. MATERIALS AND METHODS We retrospectively analyzed consecutive WES reports from 2226 patients with various genetic disorders. WES-process was focused exclusively on the probands and aimed at a higher diagnostic capacity. We determined the diagnostic rate of WES overall and by phenotypic category, mode of inheritance, mitochondrial genome variant, and copy number variants (CNVs). RESULTS Among the 2226 patients who had diagnostic WES proband-only, the overall diagnostic yield of WES was 34.59% (770/2226). The highest diagnostic yield was observed in autosomal dominant disorders, at 45.58% (351/770), followed by autosomal recessive at 31.95%(246/770), X-linked disorder at 9.61%(74/770), and mitochondrial diseases at a notably lower 0.65%(5/770). The 12.21% (94/770) diagnoses were based on a total of 94 copy number variants reported from WES data. CNVs in children accounted for 67.02% of the total CNVs. While majority of the molecular diagnoses were related to nuclear genes, the inclusion of mitochondrial genome sequencing in the WES test contributed to five diagnoses. all mitochondrial diseases were identified in adults. CONCLUSIONS The proband-only WES provided a definitive molecular diagnosis for 34.59% of a large cohort of patients while analysis of WES simultaneously analyzed the SNVs, exons, mitochondrial genome, and CNVs, thereby improving the diagnostic yield significantly compared to the single-detection WES method; and facilitating the identification of novel candidate genes.
Collapse
Affiliation(s)
- Gege Sun
- Genetics and Prenatal Diagnosis Center, Department of Obstetrics and Gynecology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Wei Huang
- Genetics and Prenatal Diagnosis Center, Department of Obstetrics and Gynecology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Li Wang
- Genetics and Prenatal Diagnosis Center, Department of Obstetrics and Gynecology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Jinlin Wu
- Genetics and Prenatal Diagnosis Center, Department of Obstetrics and Gynecology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Ganye Zhao
- Genetics and Prenatal Diagnosis Center, Department of Obstetrics and Gynecology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Huanan Ren
- Genetics and Prenatal Diagnosis Center, Department of Obstetrics and Gynecology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Lina Liu
- Genetics and Prenatal Diagnosis Center, Department of Obstetrics and Gynecology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Xiangdong Kong
- Genetics and Prenatal Diagnosis Center, Department of Obstetrics and Gynecology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China.
| |
Collapse
|
38
|
Hong H, Mocci E, Kamp K, Zhu S, Cain KC, Burr RL, Perry JA, Heitkemper MM, Weaver-Toedtman KR, Dorsey SG. Genetic Variations in TrkB.T1 Isoform and Their Association With Somatic and Psychological Symptoms in Individuals With IBS. THE JOURNAL OF PAIN 2024:104634. [PMID: 39004388 DOI: 10.1016/j.jpain.2024.104634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Revised: 06/13/2024] [Accepted: 06/19/2024] [Indexed: 07/16/2024]
Abstract
Irritable bowel syndrome (IBS), a disorder of gut-brain interaction, is often comorbid with somatic pain and psychological disorders. Dysregulated signaling of brain-derived neurotrophic factor (BDNF) and its receptor, tropomyosin-related kinase B (TrkB), has been implicated in somatic-psychological symptoms in individuals with IBS. We investigated the association of 10 single-nucleotide polymorphisms (SNPs) in the regulatory 3' untranslated region of neurotrophic receptor tyrosine kinase-2 (NTRK2) kinase domain-deficient truncated isoform (TrkB.T1) and BDNF Val66Met SNP with somatic and psychological symptoms and quality-of-life (QoL) in a cohort from the United States (IBS, n = 464; healthy controls, n = 156). We found that the homozygous recessive genotype (G/G) of rs2013566 in individuals with IBS is associated with worsened somatic symptoms, including headache, back pain, joint pain, muscle pain, and somatization as well as diminished sleep quality, energy level, and overall QoL. Validation using United Kingdom BioBank data confirmed the association of rs2013566 with an increased likelihood of headache. Several SNPs (rs1627784, rs1624327, and rs1147198) showed significant associations with muscle pain in our U.S. cohort. These 4 SNPs are predominantly located in H3K4Me1-enriched regions, suggesting their enhancer and/or transcription regulation potential. Our findings suggest that genetic variation within the 3' untranslated region region of the TrkB.T1 isoform may contribute to comorbid conditions in individuals with IBS, resulting in a spectrum of somatic and psychological symptoms impacting their QoL. These findings advance our understanding of the genetic interaction between BDNF/TrkB pathways and somatic-psychological symptoms in IBS, highlighting the importance of further exploring this interaction for potential clinical applications. PERSPECTIVE: This study aims to understand the genetic effects on IBS-related symptoms across somatic, psychological, and quality-of-life (QoL) domains, validated by United Kingdom BioBank data. The rs2013566 homozygous recessive genotype correlates with worsened somatic symptoms and reduced QoL, emphasizing its clinical significance.
Collapse
Affiliation(s)
- Hyejeong Hong
- Department of Biobehavioral Health Sciences, University of Pennsylvania School of Nursing, Philadelphia, PA
| | - Evelina Mocci
- Department of Pain and Translational Symptom Science, University of Maryland School of Nursing, Baltimore, MD
| | - Kendra Kamp
- Department of Biobehavioral Nursing and Health Informatics, University of Washington School of Nursing, Seattle, WA
| | - Shijun Zhu
- Department of Organizational Systems and Adult Health, University of Maryland School of Nursing, Baltimore, MD
| | - Kevin C Cain
- Department of Biostatistics, University of Washington School of Nursing, Seattle, WA
| | - Robert L Burr
- Department of Biobehavioral Nursing and Health Informatics, University of Washington School of Nursing, Seattle, WA
| | - James A Perry
- Department of Medicine, University of Maryland School of Medicine, Baltimore, MD
| | - Margaret M Heitkemper
- Department of Biobehavioral Nursing and Health Informatics, University of Washington School of Nursing, Seattle, WA
| | - Kristen R Weaver-Toedtman
- Department of Biobehavioral Health and Nursing Science, University of South Carolina College of Nursing, Columbia, SC
| | - Susan G Dorsey
- Department of Pain and Translational Symptom Science, University of Maryland School of Nursing, Baltimore, MD.
| |
Collapse
|
39
|
Zhang Z, Liu Y, Li X, Liu Y, Wang Y, Jiang T. HapKled: a haplotype-aware structural variant calling approach for Oxford nanopore sequencing data. Front Genet 2024; 15:1435087. [PMID: 39045321 PMCID: PMC11263161 DOI: 10.3389/fgene.2024.1435087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2024] [Accepted: 06/13/2024] [Indexed: 07/25/2024] Open
Abstract
Introduction: Structural Variants (SVs) are a type of variation that can significantly influence phenotypes and cause diseases. Thus, the accurate detection of SVs is a vital part of modern genetic analysis. The advent of long-read sequencing technology ushers in a new era of more accurate and comprehensive SV calling, and many tools have been developed to call SVs using long-read data. Haplotype-tagging is a procedure that can tag haplotype information on reads and can thus potentially improve the SV detection; nevertheless, few methods make use of this information. In this article, we introduce HapKled, a new SV detection tool that can accurately detect SVs from Oxford Nanopore Technologies (ONT) long-read alignment data. Methods: HapKled utilizes haplotype information underlying alignment data by conducting haplotype-tagging using Whatshap on the reads to improve the detection performance, with three unique calling mechanics including altering clustering conditions according to haplotype information of signatures, determination of similar SVs based on haplotype information, and slack filtering conditions based on haplotype quality. Results: In our evaluations, HapKled outperformed state-of-the-art tools and can deliver better SV detection results on both simulated and real sequencing data. The code and experiments of HapKled can be obtained from https://github.com/CoREse/HapKled. Discussion: With the superb SV detection performance that HapKled can deliver, HapKled could be useful in bioinformatics research, clinical diagnosis, and medical research and development.
Collapse
Affiliation(s)
- Zhendong Zhang
- Faculty of Computing, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Yue Liu
- Faculty of Computing, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Xin Li
- Faculty of Computing, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Yadong Liu
- Faculty of Computing, Harbin Institute of Technology, Harbin, Heilongjiang, China
- Zhengzhou Research Institute, Harbin Institute of Technology, Zhengzhou, Henan, China
| | - Yadong Wang
- Faculty of Computing, Harbin Institute of Technology, Harbin, Heilongjiang, China
- Zhengzhou Research Institute, Harbin Institute of Technology, Zhengzhou, Henan, China
| | - Tao Jiang
- Faculty of Computing, Harbin Institute of Technology, Harbin, Heilongjiang, China
- Zhengzhou Research Institute, Harbin Institute of Technology, Zhengzhou, Henan, China
| |
Collapse
|
40
|
Wiener E, Cottino L, Botha G, Nyangiri O, Noyes H, McLeod A, Jakubosky D, Adebamowo C, Awadalla P, Landouré G, Matshaba M, Matovu E, Ramsay M, Simo G, Simuunza M, Tiemessen C, Wonkam A, Sahibdeen V, Krause A, Lombard Z, Hazelhurst S. An assessment of the genomic structural variation landscape in Sub-Saharan African populations. RESEARCH SQUARE 2024:rs.3.rs-4485126. [PMID: 39041024 PMCID: PMC11261963 DOI: 10.21203/rs.3.rs-4485126/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/24/2024]
Abstract
Structural variants are responsible for a large part of genomic variation between individuals and play a role in both common and rare diseases. Databases cataloguing structural variants notably do not represent the full spectrum of global diversity, particularly missing information from most African populations. To address this representation gap, we analysed 1,091 high-coverage African genomes, 545 of which are public data sets, and 546 which have been analysed for structural variants for the first time. Variants were called using five different tools and datasets merged and jointly called using SURVIVOR. We identified 67,795 structural variants throughout the genome, with 10,421 genes having at least one variant. Using a conservative overlap in merged data, 6,414 of the structural variants (9.5%) are novel compared to the Database of Genomic Variants. This study contributes to knowledge of the landscape of structural variant diversity in Africa and presents a reliable dataset for potential applications in population genetics and health-related research.
Collapse
Affiliation(s)
- Emma Wiener
- Sydney Brenner Institute for Molecular Bioscience, University of the Witwatersrand, Johannesburg, South Africa
- Division of Human Genetics, National Health Laboratory Service and School of Pathology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Laura Cottino
- Sydney Brenner Institute for Molecular Bioscience, University of the Witwatersrand, Johannesburg, South Africa
- Division of Human Genetics, National Health Laboratory Service and School of Pathology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Gerrit Botha
- Computational Biology Unit, University of Cape Town, Cape Town, South Africa
| | - Oscar Nyangiri
- College of Veterinary Medicine, Animal Resources and Biosecurity Makerere University, Kampala, Uganda
| | - Harry Noyes
- Centre for Genomic Research, University of Liverpool, Liverpool, United Kingdom
| | - Annette McLeod
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, United Kingdom
| | - David Jakubosky
- Department of Biomedical Informatics, University of California, San Diego, United States of America
- Institute of Genomic Medicine, University of California, San Diego, United States of America
| | - Clement Adebamowo
- Department of Epidemiology and Public Health and Greenebaum Comprehensive Cancer Center University of Maryland School of Medicine, Baltimore, United States of America
| | - Phillip Awadalla
- Ontario Institute for Cancer Research, Toronto, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Canada
- Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
| | - Guida Landouré
- Faculty of Medicine and Odontostomatology University of Sciences, Techniques and Technology of Bamako, Bamako Mali
- Neurology Department Point ”G” University Hospital, Bamako, Mali
| | - Mogomotsi Matshaba
- Botswana-Baylor Children’s Clinical Center of Excellence, Gaborone, Botswana
- Baylor College of Medicine, Houston, United States
| | - Enock Matovu
- College of Veterinary Medicine, Animal Resources and Biosecurity Makerere University, Kampala, Uganda
| | - Michèle Ramsay
- Sydney Brenner Institute for Molecular Bioscience, University of the Witwatersrand, Johannesburg, South Africa
| | - Gustave Simo
- Molecular Parasitology and Entomology Unit, Department of Biochemistry University of Dschang, Dschang, Cameroon
| | - Martin Simuunza
- Department of Disease Control, School of Veterinary Medicine University of Zambia, Lusaka, Zambia
| | - Caroline Tiemessen
- Centre for HIV and STIs, National Institute for Communicable Diseases, National Health Laboratory Services and Faculty of Health Sciences University of the Witwatersrand, Johannesburg, South Africa
| | - Ambroise Wonkam
- McKusick-Nathans Institute and Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, United States of America
- Division of Human Genetics, Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa
| | - Venesa Sahibdeen
- Division of Human Genetics, National Health Laboratory Service and School of Pathology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Amanda Krause
- Division of Human Genetics, National Health Laboratory Service and School of Pathology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Zané Lombard
- Division of Human Genetics, National Health Laboratory Service and School of Pathology, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Scott Hazelhurst
- Sydney Brenner Institute for Molecular Bioscience, University of the Witwatersrand, Johannesburg, South Africa
- School of Electrical & Information Engineering, University of the Witwatersrand, Johannesburg, South Africa
| | | |
Collapse
|
41
|
Kramer M, Goodwin S, Wappel R, Borio M, Offit K, Feldman DR, Stadler ZK, McCombie WR. Exploring the genetic and epigenetic underpinnings of early-onset cancers: Variant prioritization for long read whole genome sequencing from family cancer pedigrees. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.27.601096. [PMID: 39005350 PMCID: PMC11244929 DOI: 10.1101/2024.06.27.601096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Despite significant advances in our understanding of genetic cancer susceptibility, known inherited cancer predisposition syndromes explain at most 20% of early-onset cancers. As early-onset cancer prevalence continues to increase, the need to assess previously inaccessible areas of the human genome, harnessing a trio or quad family-based architecture for variant filtration, may reveal further insights into cancer susceptibility. To assess a broader spectrum of variation than can be ascertained by multi-gene panel sequencing, or even whole genome sequencing with short reads, we employed long read whole genome sequencing using an Oxford Nanopore Technology (ONT) PromethION of 3 families containing an early-onset cancer proband using a trio or quad family architecture. Analysis included 2 early-onset colorectal cancer family trios and one quad consisting of two siblings with testicular cancer, all with unaffected parents. Structural variants (SVs), epigenetic profiles and single nucleotide variants (SNVs) were determined for each individual, and a filtering strategy was employed to refine and prioritize candidate variants based on the family architecture. The family architecture enabled us to focus on inapposite variants while filtering variants shared with the unaffected parents, significantly decreasing background variation that can hamper identification of potentially disease causing differences. Candidate d e novo and compound heterozygous variants were identified in this way. Gene expression, in matched neoplastic and pre-neoplastic lesions, was assessed for one trio. Our study demonstrates the feasibility of a streamlined analysis of genomic variants from long read ONT whole genome sequencing and a way to prioritize key variants for further evaluation of pathogenicity, while revealing what may be missing from panel based analyses.
Collapse
|
42
|
Xiu Z, Sun L, Liu K, Cao H, Qu HQ, Glessner JT, Ding Z, Zheng G, Wang N, Xia Q, Li J, Li MJ, Hakonarson H, Liu W, Li J. Shared molecular mechanisms and transdiagnostic potential of neurodevelopmental disorders and immune disorders. Brain Behav Immun 2024; 119:767-780. [PMID: 38677625 DOI: 10.1016/j.bbi.2024.04.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Revised: 02/27/2024] [Accepted: 04/22/2024] [Indexed: 04/29/2024] Open
Abstract
The co-occurrence and familial clustering of neurodevelopmental disorders and immune disorders suggest shared genetic risk factors. Based on genome-wide association summary statistics from five neurodevelopmental disorders and four immune disorders, we conducted genome-wide, local genetic correlation and polygenic overlap analysis. We further performed a cross-trait GWAS meta-analysis. Pleotropic loci shared between the two categories of diseases were mapped to candidate genes using multiple algorithms and approaches. Significant genetic correlations were observed between neurodevelopmental disorders and immune disorders, including both positive and negative correlations. Neurodevelopmental disorders exhibited higher polygenicity compared to immune disorders. Around 50%-90% of genetic variants of the immune disorders were shared with neurodevelopmental disorders. The cross-trait meta-analysis revealed 154 genome-wide significant loci, including 8 novel pleiotropic loci. Significant associations were observed for 30 loci with both types of diseases. Pathway analysis on the candidate genes at these loci revealed common pathways shared by the two types of diseases, including neural signaling, inflammatory response, and PI3K-Akt signaling pathway. In addition, 26 of the 30 lead SNPs were associated with blood cell traits. Neurodevelopmental disorders exhibit complex polygenic architecture, with a subset of individuals being at a heightened genetic risk for both neurodevelopmental and immune disorders. The identification of pleiotropic loci has important implications for exploring opportunities for drug repurposing, enabling more accurate patient stratification, and advancing genomics-informed precision in the medical field of neurodevelopmental disorders.
Collapse
Affiliation(s)
- Zhanjie Xiu
- Department of Cell Biology, The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Key Laboratory of Immune Microenvironment and Disease (Ministry of Education), Tianjin Key Laboratory of Medical Epigenetics, Tianjin Institute of Immunology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China; Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Ling Sun
- Department of Child and Adolescent Psychology, Tianjin Anding Hospital, Mental Health Center of Tianjin Medical University, Tianjin, China
| | - Kunlun Liu
- Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Haiyan Cao
- Department of Child and Adolescent Psychology, Tianjin Anding Hospital, Mental Health Center of Tianjin Medical University, Tianjin, China
| | - Hui-Qi Qu
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, United States
| | - Joseph T Glessner
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, United States; Division of Human Genetics, Children's Hospital of Philadelphia, Philadelphia, PA, United States; Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Zhiyong Ding
- Mills Institute for Personalized Cancer Care, Fynn Biotechnologies Ltd., Jinan, China
| | - Gang Zheng
- National Supercomputer Center in Tianjin (NSCC-TJ), Tianjin, China
| | - Nan Wang
- Mills Institute for Personalized Cancer Care, Fynn Biotechnologies Ltd., Jinan, China
| | - Qianghua Xia
- Department of Cell Biology, The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Key Laboratory of Immune Microenvironment and Disease (Ministry of Education), Tianjin Key Laboratory of Medical Epigenetics, Tianjin Institute of Immunology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China; Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Jie Li
- Laboratory of Biological Psychiatry, Institute of Mental Health, Tianjin Anding Hospital, Tianjin Medical University, Tianjin, China
| | - Mulin Jun Li
- Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Hakon Hakonarson
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, United States; Division of Human Genetics, Children's Hospital of Philadelphia, Philadelphia, PA, United States; Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.
| | - Wei Liu
- Tianjin Children's Hospital (Tianjin University Children's Hospital), Tianjin, China; Tianjin Key Laboratory of Birth Defects for Prevention and Treatment, Tianjin, China.
| | - Jin Li
- Department of Cell Biology, The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Key Laboratory of Immune Microenvironment and Disease (Ministry of Education), Tianjin Key Laboratory of Medical Epigenetics, Tianjin Institute of Immunology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China; Department of Rheumatology and Immunology, Tianjin Medical University General Hospital, Tianjin, China.
| |
Collapse
|
43
|
Zhang T, Peng W, Xiao H, Cao S, Chen Z, Su X, Luo Y, Liu Z, Peng Y, Yang X, Jiang GF, Xu X, Ma Z, Zhou Y. Population genomics highlights structural variations in local adaptation to saline coastal environments in woolly grape. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2024; 66:1408-1426. [PMID: 38578160 DOI: 10.1111/jipb.13653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Accepted: 03/04/2024] [Indexed: 04/06/2024]
Abstract
Structural variations (SVs) are a feature of plant genomes that has been largely unexplored despite their significant impact on plant phenotypic traits and local adaptation to abiotic and biotic stress. In this study, we employed woolly grape (Vitis retordii), a species native to the tropical and subtropical regions of East Asia with both coastal and inland habitats, as a valuable model for examining the impact of SVs on local adaptation. We assembled a haplotype-resolved chromosomal reference genome for woolly grape, and conducted population genetic analyses based on whole-genome sequencing (WGS) data from coastal and inland populations. The demographic analyses revealed recent bottlenecks in all populations and asymmetric gene flow from the inland to the coastal population. In total, 1,035 genes associated with plant adaptive regulation for salt stress, radiation, and environmental adaptation were detected underlying local selection by SVs and SNPs in the coastal population, of which 37.29% and 65.26% were detected by SVs and SNPs, respectively. Candidate genes such as FSD2, RGA1, and AAP8 associated with salt tolerance were found to be highly differentiated and selected during the process of local adaptation to coastal habitats in SV regions. Our study highlights the importance of SVs in local adaptation; candidate genes related to salt stress and climatic adaptation to tropical and subtropical environments are important genomic resources for future breeding programs of grapevine and its rootstocks.
Collapse
Affiliation(s)
- Tianhao Zhang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning, 530004, China
- Guangxi Key Laboratory of Forest Ecology and Conservation, Guangxi Colleges and Universities Key Laboratory for Cultivation and Utilization of Subtropical Forest Plantation, College of Forestry, Guangxi University, Nanning, 530004, China
- College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Wenjing Peng
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning, 530004, China
- Guangxi Key Laboratory of Sugarcane Biology, College of Agriculture, Guangxi University, Nanning, 530004, China
| | - Hua Xiao
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
| | - Shuo Cao
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
- Key Laboratory of Horticultural Plant Biology Ministry of Education, Huazhong Agricultural University, Wuhan, 430070, China
| | - Zhuyifu Chen
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
| | - Xiangnian Su
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning, 530004, China
- Guangxi Key Laboratory of Forest Ecology and Conservation, Guangxi Colleges and Universities Key Laboratory for Cultivation and Utilization of Subtropical Forest Plantation, College of Forestry, Guangxi University, Nanning, 530004, China
| | - Yuanyuan Luo
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, 450009, China
| | - Zhongjie Liu
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
| | - Yanling Peng
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
| | - Xiping Yang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning, 530004, China
- Guangxi Key Laboratory of Sugarcane Biology, College of Agriculture, Guangxi University, Nanning, 530004, China
| | - Guo-Feng Jiang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning, 530004, China
- Guangxi Key Laboratory of Forest Ecology and Conservation, Guangxi Colleges and Universities Key Laboratory for Cultivation and Utilization of Subtropical Forest Plantation, College of Forestry, Guangxi University, Nanning, 530004, China
| | - Xiaodong Xu
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
| | - Zhiyao Ma
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
| | - Yongfeng Zhou
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
- National Key Laboratory of Tropical Crop Breeding, Tropical Crops Genetic Resources Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou, 571101, China
| |
Collapse
|
44
|
Ou J, Wang J, Sun J, Ni M, Meng Q, Ding J, Fan H, Feng S, Huang Y, Li H, Fei J. Analysis of Preimplantation and Clinical Outcomes of Two Cases by Oxford Nanopore Sequencing. Reprod Sci 2024; 31:2123-2134. [PMID: 38347380 DOI: 10.1007/s43032-024-01470-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Accepted: 01/19/2024] [Indexed: 07/03/2024]
Abstract
It is challenging to distinguish embryos with a balanced translocation karyotype from a normal karyotype by existing conventional genetic testing methods. However, in germ-cell gamete generation, chromosome exchange and separation through cell meiosis form a different proportion of unbalanced gametes. Adverse birth events may occur, such as repeated miscarriages and fetal birth defects. In this study, the exact breakpoints of structural variation (SV) from two balanced translocation carrier families by using Nanopore long reads sequencing technology were obtained, and haplotype analysis and Sanger verified the accuracy of the detection results, confirming the application value of the Nanopore sequencing technology in the detection of balanced translocation before embryo implantation. Nanopore long-read sequencing was performed to find the precise breakpoint of chromosome-balanced translocation carriers. The breakpoints were subsequently verified by designing primers across the breakpoints and Sanger sequencing. Haplotype linkage analysis of SNPs which can be linked by a read block of families around the breakpoint regions was followed. After frozen (-thawed) embryo transfer (FET), prenatal cytogenetic analysis of amniotic fluid cells confirmed the predicted karyotypes from the transferred embryos. The presence of breakpoints was detected in three embryos of patient 1. No breakpoints were detected in either embryo of patient 2. One balanced translocated embryo from patient 1 and one normal euploid embryo from patient 2 were transplanted back into the patients, and amniotic fluid cells were analyzed for the karyotype of fetuses. The results were entirely consistent with the fetal karyotype. And through late follow-up, both patients successfully had a live birth fetus. The breakpoint location of the balanced chromosome translocation can be accurately found by Nanopore sequencing. The haplotype of carriers can be successfully constructed by Nanopore and sanger sequencing confirmed that the results were accurate. This is very advantageous for preimplantation genetic testing for chromosomal structural rearrangements (PGT-SR) detection in the families without proband.
Collapse
Affiliation(s)
- Jian Ou
- Center for Reproduction and Genetics, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Suzhou, China
| | | | - Jian Sun
- Center for Reproduction and Genetics, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Suzhou, China
| | - Mengxia Ni
- Center for Reproduction and Genetics, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Suzhou, China
| | - QingXia Meng
- Center for Reproduction and Genetics, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Suzhou, China
| | - Jie Ding
- Center for Reproduction and Genetics, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Suzhou, China
| | - Haiyang Fan
- Peking Jabrehoo Med-Tech Co., Ltd, Beijing, China
| | - Shaohua Feng
- Peking Jabrehoo Med-Tech Co., Ltd, Beijing, China
| | - Yining Huang
- Peking Jabrehoo Med-Tech Co., Ltd, Beijing, China
| | - Hong Li
- Center for Reproduction and Genetics, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Suzhou, China.
| | - Jia Fei
- Peking Jabrehoo Med-Tech Co., Ltd, Beijing, China.
| |
Collapse
|
45
|
Liu A, Citu C, Enduru N, Chen X, Manuel AM, Sinha T, Gorski D, Fernandes BS, Yu M, Schulz PE, Simon LM, Soto C, Zhao Z. Single-nucleus multiomics reveals the disrupted regulatory programs in three brain regions of sporadic early-onset Alzheimer's disease. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.25.600720. [PMID: 38979371 PMCID: PMC11230393 DOI: 10.1101/2024.06.25.600720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Sporadic early-onset Alzheimer's disease (sEOAD) represents a significant but less-studied subtype of Alzheimer's disease (AD). Here, we generated a single-nucleus multiome atlas derived from the postmortem prefrontal cortex, entorhinal cortex, and hippocampus of nine individuals with or without sEOAD. Comprehensive analyses were conducted to delineate cell type-specific transcriptomic changes and linked candidate cis- regulatory elements (cCREs) across brain regions. We prioritized seven conservative transcription factors in glial cells in multiple brain regions, including RFX4 in astrocytes and IKZF1 in microglia, which are implicated in regulating sEOAD-associated genes. Moreover, we identified the top 25 altered intercellular signaling between glial cells and neurons, highlighting their regulatory potential on gene expression in receiver cells. We reported 38 cCREs linked to sEOAD-associated genes overlapped with late-onset AD risk loci, and sEOAD cCREs enriched in neuropsychiatric disorder risk loci. This atlas helps dissect transcriptional and chromatin dynamics in sEOAD, providing a key resource for AD research.
Collapse
|
46
|
Li W, Xu M, Zhang Z, Liang J, Fu R, Lin W, Luo W, Zhang X, Ren T. Regulatory Effects of 198-bp Structural Variants in the GSTA2 Promoter Region on Adipogenesis in Chickens. Int J Mol Sci 2024; 25:7155. [PMID: 39000259 PMCID: PMC11241197 DOI: 10.3390/ijms25137155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Revised: 06/20/2024] [Accepted: 06/24/2024] [Indexed: 07/16/2024] Open
Abstract
Molecular breeding accelerates animal breeding and improves efficiency by utilizing genetic mutations. Structural variations (SVs), a significant source of genetic mutations, have a greater impact on phenotypic variation than SNPs. Understanding SV functional mechanisms and obtaining precise information are crucial for molecular breeding. In this study, association analysis revealed significant correlations between 198-bp SVs in the GSTA2 promoter region and abdominal fat weight, intramuscular fat content, and subcutaneous fat thickness in chickens. High expression of GSTA2 in adipose tissue was positively correlated with the abdominal fat percentage, and different genotypes of GSTA2 exhibited varied expression patterns in the liver. The 198-bp SVs regulate GSTA2 expression by binding to different transcription factors. Overexpression of GSTA2 promoted preadipocyte proliferation and differentiation, while interference had the opposite effect. Mechanistically, the 198-bp fragment contains binding sites for transcription factors such as C/EBPα that regulate GSTA2 expression and fat synthesis. These SVs are significantly associated with chicken fat traits, positively influencing preadipocyte development by regulating cell proliferation and differentiation. Our work provides compelling evidence for the use of 198-bp SVs in the GSTA2 promoter region as molecular markers for poultry breeding and offers new insights into the pivotal role of the GSTA2 gene in fat generation.
Collapse
Affiliation(s)
- Wangyu Li
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China (R.F.); (W.L.); (W.L.)
- Guangdong Key Laboratory of Genome and Molecular Breeding of Agricultural Animals and Key Laboratory of Chicken Genetic Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
- State Key Laboratory of Swine and Poultry Breeding Industry, Guangzhou 510642, China
| | - Meng Xu
- College of Veterinary Medicine, Jilin University, Changchun 130062, China
| | - Zihao Zhang
- College of Coastal Agricultural Sciences, Guangdong Ocean University, Zhanjiang 524088, China;
| | - Jiaying Liang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China (R.F.); (W.L.); (W.L.)
- Guangdong Key Laboratory of Genome and Molecular Breeding of Agricultural Animals and Key Laboratory of Chicken Genetic Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
- State Key Laboratory of Swine and Poultry Breeding Industry, Guangzhou 510642, China
| | - Rong Fu
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China (R.F.); (W.L.); (W.L.)
- Guangdong Key Laboratory of Genome and Molecular Breeding of Agricultural Animals and Key Laboratory of Chicken Genetic Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
- State Key Laboratory of Swine and Poultry Breeding Industry, Guangzhou 510642, China
| | - Wujian Lin
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China (R.F.); (W.L.); (W.L.)
- Guangdong Key Laboratory of Genome and Molecular Breeding of Agricultural Animals and Key Laboratory of Chicken Genetic Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
- State Key Laboratory of Swine and Poultry Breeding Industry, Guangzhou 510642, China
| | - Wen Luo
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China (R.F.); (W.L.); (W.L.)
- Guangdong Key Laboratory of Genome and Molecular Breeding of Agricultural Animals and Key Laboratory of Chicken Genetic Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
- State Key Laboratory of Swine and Poultry Breeding Industry, Guangzhou 510642, China
| | - Xiquan Zhang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China (R.F.); (W.L.); (W.L.)
- Guangdong Key Laboratory of Genome and Molecular Breeding of Agricultural Animals and Key Laboratory of Chicken Genetic Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
- State Key Laboratory of Swine and Poultry Breeding Industry, Guangzhou 510642, China
| | - Tuanhui Ren
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China (R.F.); (W.L.); (W.L.)
- Guangdong Key Laboratory of Genome and Molecular Breeding of Agricultural Animals and Key Laboratory of Chicken Genetic Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
- State Key Laboratory of Swine and Poultry Breeding Industry, Guangzhou 510642, China
- College of Veterinary Medicine, Jilin University, Changchun 130062, China
| |
Collapse
|
47
|
Matsuo T, Hamasaki I, Kamatani Y, Kawaguchi T, Yamaguchi I, Matsuda F, Saito A, Nakazono K, Kamitsuji S. Genome-Wide Association Study with Three Control Cohorts of Japanese Patients with Esotropia and Exotropia of Comitant Strabismus and Idiopathic Superior Oblique Muscle Palsy. Int J Mol Sci 2024; 25:6986. [PMID: 39000095 PMCID: PMC11241339 DOI: 10.3390/ijms25136986] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Revised: 06/10/2024] [Accepted: 06/15/2024] [Indexed: 07/16/2024] Open
Abstract
Esotropia and exotropia in the entity of comitant strabismus are multifactorial diseases with both genetic and environmental backgrounds. Idiopathic superior oblique muscle palsy, as the predominant entity of non-comitant (paralytic) strabismus, also has a genetic background, as evidenced by varying degrees of muscle hypoplasia. A genome-wide association study (GWAS) was conducted of 711 Japanese patients with esotropia (n= 253), exotropia (n = 356), and idiopathic superior oblique muscle palsy (n = 102). The genotypes of single nucleotide polymorphisms (SNPs) were determined by Infinium Asian Screening Array. Three control cohorts from the Japanese population were used: two cohorts from BioBank Japan (BBJ) and the Nagahama Cohort. BBJ (180K) was genotyped by a different array, Illumina Infinium OmniExpressExome or HumanOmniExpress, while BBJ (ASA) and the Nagahama Cohort were genotyped by the same Asian array. After quality control of SNPs and individuals, common SNPs between the case cohort and the control cohort were chosen in the condition of genotyping by different arrays, while all SNPs genotyped by the same array were used for SNP imputation. The SNPs imputed with R-square values ≥ 0.3 were used to compare the case cohort of each entity or the combined entity with the control cohort. In comparison with BBJ (180K), the esotropia group and the exotropia group showed CDCA7 and HLA-F, respectively, as candidate genes at a significant level of p < 5 × 10-8, while the idiopathic superior oblique muscle palsy group showed DAB1 as a candidate gene which is involved in neuronal migration. DAB1 was also detected as a candidate in comparison with BBJ (ASA) and the Nagahama Cohort at a weak level of significance of p < 1 × 10-6. In comparison with BBJ (180K), RARB (retinoic acid receptor-β) was detected as a candidate at a significant level of p < 5 × 10-8 in the combined group of esotropia, exotropia, and idiopathic superior oblique muscle palsy. In conclusion, a series of GWASs with three different control cohorts would be an effective method with which to search for candidate genes for multifactorial diseases such as strabismus.
Collapse
Affiliation(s)
- Toshihiko Matsuo
- Graduate School of Interdisciplinary Science and Engineering in Health Systems, Okayama University, Okayama City 700-8558, Japan
- Department of Ophthalmology, Okayama University Hospital, Okayama City 700-8558, Japan
| | - Ichiro Hamasaki
- Department of Ophthalmology, Okayama University Hospital, Okayama City 700-8558, Japan
| | - Yoichiro Kamatani
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo 108-8639, Japan;
| | - Takahisa Kawaguchi
- Center for Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto 606-8507, Japan (F.M.)
| | - Izumi Yamaguchi
- Center for Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto 606-8507, Japan (F.M.)
| | - Fumihiko Matsuda
- Center for Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto 606-8507, Japan (F.M.)
| | - Akira Saito
- StaGen Co., Ltd., Tokyo 111-0051, Japan (S.K.)
| | | | | |
Collapse
|
48
|
Koenig Z, Yohannes MT, Nkambule LL, Zhao X, Goodrich JK, Kim HA, Wilson MW, Tiao G, Hao SP, Sahakian N, Chao KR, Walker MA, Lyu Y, Rehm HL, Neale BM, Talkowski ME, Daly MJ, Brand H, Karczewski KJ, Atkinson EG, Martin AR. A harmonized public resource of deeply sequenced diverse human genomes. Genome Res 2024; 34:796-809. [PMID: 38749656 PMCID: PMC11216312 DOI: 10.1101/gr.278378.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 05/07/2024] [Indexed: 05/18/2024]
Abstract
Underrepresented populations are often excluded from genomic studies owing in part to a lack of resources supporting their analyses. The 1000 Genomes Project (1kGP) and Human Genome Diversity Project (HGDP), which have recently been sequenced to high coverage, are valuable genomic resources because of the global diversity they capture and their open data sharing policies. Here, we harmonized a high-quality set of 4094 whole genomes from 80 populations in the HGDP and 1kGP with data from the Genome Aggregation Database (gnomAD) and identified over 153 million high-quality SNVs, indels, and SVs. We performed a detailed ancestry analysis of this cohort, characterizing population structure and patterns of admixture across populations, analyzing site frequency spectra, and measuring variant counts at global and subcontinental levels. We also show substantial added value from this data set compared with the prior versions of the component resources, typically combined via liftOver and variant intersection; for example, we catalog millions of new genetic variants, mostly rare, compared with previous releases. In addition to unrestricted individual-level public release, we provide detailed tutorials for conducting many of the most common quality-control steps and analyses with these data in a scalable cloud-computing environment and publicly release this new phased joint callset for use as a haplotype resource in phasing and imputation pipelines. This jointly called reference panel will serve as a key resource to support research of diverse ancestry populations.
Collapse
Affiliation(s)
- Zan Koenig
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Mary T Yohannes
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Lethukuthula L Nkambule
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Xuefang Zhao
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts 02114, USA
| | - Julia K Goodrich
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Heesu Ally Kim
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Michael W Wilson
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Grace Tiao
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Stephanie P Hao
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts 02114, USA
| | - Nareh Sahakian
- Broad Genomics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, 02141, USA
| | - Katherine R Chao
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Mark A Walker
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Yunfei Lyu
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA
| | - Heidi L Rehm
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Benjamin M Neale
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Michael E Talkowski
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts 02114, USA
| | - Mark J Daly
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Department of Medicine, Harvard Medical School, Boston, Massachusetts 02115, USA
- Institute for Molecular Medicine Finland, 00290 Helsinki, Finland
| | - Harrison Brand
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts 02114, USA
| | - Konrad J Karczewski
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Elizabeth G Atkinson
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Alicia R Martin
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA;
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| |
Collapse
|
49
|
Pathak AK, Simonian H, Ibrahim IAA, Hrechdakian P, Behar DM, Ayub Q, Arsanov P, Metspalu E, Yepiskoposyan L, Rootsi S, Endicott P, Villems R, Sahakyan H. Human Y chromosome haplogroup L1-M22 traces Neolithic expansion in West Asia and supports the Elamite and Dravidian connection. iScience 2024; 27:110016. [PMID: 38883810 PMCID: PMC11177204 DOI: 10.1016/j.isci.2024.110016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 04/06/2024] [Accepted: 05/14/2024] [Indexed: 06/18/2024] Open
Abstract
West and South Asian populations profoundly influenced Eurasian genetic and cultural diversity. We investigate the genetic history of the Y chromosome haplogroup L1-M22, which, while prevalent in these regions, lacks in-depth study. Robust Bayesian analyses of 165 high-coverage Y chromosomes favor a West Asian origin for L1-M22 ∼20.6 thousand years ago (kya). Moreover, this haplogroup parallels the genome-wide genetic ancestry of hunter-gatherers from the Iranian Plateau and the Caucasus. We characterized two L1-M22 harboring population groups during the Early Holocene. One expanded with the West Asian Neolithic transition. The other moved to South Asia ∼8-6 kya but showed no expansion. This group likely participated in the spread of Dravidian languages. These South Asian L1-M22 lineages expanded ∼4-3 kya, coinciding with the Steppe ancestry introduction. Our findings advance the current understanding of Eurasian historical dynamics, emphasizing L1-M22's West Asian origin, associated population movements, and possible linguistic impacts.
Collapse
Affiliation(s)
- Ajai Kumar Pathak
- Estonian Biocentre, Institute of Genomics, University of Tartu, 51010 Tartu, Estonia
- Department of Human Genetics, KU Leuven, 3000 Leuven, Belgium
| | - Hovann Simonian
- Armenian DNA Project at Family Tree DNA, Houston, TX 77008, USA
| | - Ibrahim Abdel Aziz Ibrahim
- Department of Pharmacology and Toxicology, Faculty of Medicine, Umm Al-Qura University, Makkah 21955, Saudi Arabia
| | | | - Doron M Behar
- Estonian Biocentre, Institute of Genomics, University of Tartu, 51010 Tartu, Estonia
| | - Qasim Ayub
- Monash University Malaysia Genomics Platform, School of Science, Monash University, Bandar Sunway, Selangor Darul Ehsan 47500, Malaysia
| | - Pakhrudin Arsanov
- Chechen-Noahcho DNA Project at Family Tree DNA, Kostanay 110008, Kazakhstan
| | - Ene Metspalu
- Estonian Biocentre, Institute of Genomics, University of Tartu, 51010 Tartu, Estonia
| | - Levon Yepiskoposyan
- Laboratory of Evolutionary Genomics, Institute of Molecular Biology of National Academy of Sciences of the Republic of Armenia, Yerevan 0014, Armenia
| | - Siiri Rootsi
- Estonian Biocentre, Institute of Genomics, University of Tartu, 51010 Tartu, Estonia
| | - Phillip Endicott
- Estonian Biocentre, Institute of Genomics, University of Tartu, 51010 Tartu, Estonia
- Department of Archaeology and Anthropology, Bournemouth University, Fern Barrow, Poole, Dorset BH12 5BB, UK
- Department of Linguistics, University of Hawai'i at Mānoa, Honolulu, Hawai'i 96822, USA
- DFG Center for Advanced Studies, University of Tübingen, 72074 Tübingen, Germany
| | - Richard Villems
- Estonian Biocentre, Institute of Genomics, University of Tartu, 51010 Tartu, Estonia
| | - Hovhannes Sahakyan
- Estonian Biocentre, Institute of Genomics, University of Tartu, 51010 Tartu, Estonia
- Laboratory of Evolutionary Genomics, Institute of Molecular Biology of National Academy of Sciences of the Republic of Armenia, Yerevan 0014, Armenia
| |
Collapse
|
50
|
Wang Q, Han R, Xing H, Li H. A consensus genome of sika deer (Cervus nippon) and transcriptome analysis provided novel insights on the regulation mechanism of transcript factor in antler development. BMC Genomics 2024; 25:617. [PMID: 38890595 PMCID: PMC11186158 DOI: 10.1186/s12864-024-10522-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 06/13/2024] [Indexed: 06/20/2024] Open
Abstract
BACKGROUND Sika deer (Cervus nippon) holds significance among cervids, with three genomes recently published. However, these genomes still contain hundreds of gaps and display significant discrepancies in continuity and accuracy. This poses challenges to functional genomics research and the selection of an appropriate reference genome. Thus, obtaining a high-quality reference genome is imperative to delve into functional genomics effectively. FINDINGS Here we report a high-quality consensus genome of male sika deer. All 34 chromosomes are assembled into single-contig pseudomolecules without any gaps, which is the most complete assembly. The genome size is 2.7G with 23,284 protein-coding genes. Comparative genomics analysis found that the genomes of sika deer and red deer are highly conserved, an approximately 2.4G collinear regions with up to 99% sequence similarity. Meanwhile, we observed the fusion of red deer's Chr23 and Chr4 during evolution, forming sika deer's Chr1. Additionally, we identified 607 transcription factors (TFs) that are involved in the regulation of antler development, including RUNX2, SOX6, SOX8, SOX9, PAX8, SIX2, SIX4, SIX6, SPI1, NFAC1, KLHL8, ZN710, JDP2, and TWST2, based on this consensus reference genome. CONCLUSIONS Our results indicated that we acquired a high-quality consensus reference genome. That provided valuable resources for understanding functional genomics. In addition, discovered the genetic basis of sika-red hybrid fertility and identified 607 significant TFs that impact antler development.
Collapse
Affiliation(s)
- Qianghui Wang
- College of Wildlife and Protected Area, Northeast Forestry University, Harbin, 150040, China
| | - Ruobing Han
- College of Wildlife and Protected Area, Northeast Forestry University, Harbin, 150040, China
| | - Haihua Xing
- College of Wildlife and Protected Area, Northeast Forestry University, Harbin, 150040, China
| | - Heping Li
- College of Wildlife and Protected Area, Northeast Forestry University, Harbin, 150040, China.
| |
Collapse
|