26
|
Hazra S, Chaudhuri AG, Tiwary BK, Chakrabarti N. Integrated network-based multiple computational analyses for identification of co-expressed candidate genes associated with neurological manifestations of COVID-19. Sci Rep 2022; 12:17141. [PMID: 36229517 PMCID: PMC9558001 DOI: 10.1038/s41598-022-21109-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Accepted: 09/22/2022] [Indexed: 01/04/2023] Open
Abstract
'Tripartite network' (TN) and 'combined gene network' (CGN) were constructed and their hub-bottleneck and driver nodes (44 genes) were evaluated as 'target genes' (TG) to identify 21 'candidate genes' (CG) and their relationship with neurological manifestations of COVID-19. TN was developed using neurological symptoms of COVID-19 found in literature. Under query genes (TG of TN), co-expressed genes were identified using pair-wise mutual information to genes available in RNA-Seq autopsy data of frontal cortex of COVID-19 victims. CGN was constructed with genes selected from TN and co-expressed in COVID-19. TG and their connecting genes of respective networks underwent functional analyses through findings of their enrichment terms and pair-wise 'semantic similarity scores' (SSS). A new integrated 'weighted harmonic mean score' was formulated assimilating values of SSS and STRING-based 'combined score' of the selected TG-pairs, which provided CG-pairs with properties of CGs as co-expressed and 'indispensable nodes' in CGN. Finally, six pairs sharing seven 'prevalent CGs' (ADAM10, ADAM17, AKT1, CTNNB1, ESR1, PIK3CA, FGFR1) showed linkages with the phenotypes (a) directly under neurodegeneration, neurodevelopmental diseases, tumour/cancer and cellular signalling, and (b) indirectly through other CGs under behavioural/cognitive and motor dysfunctions. The pathophysiology of 'prevalent CGs' has been discussed to interpret neurological phenotypes of COVID-19.
Collapse
|
27
|
Abstract
One core goal of genetics is to systematically understand the mapping between the DNA sequence of an organism (genotype) and its measurable characteristics (phenotype). Understanding this mapping is often challenging because of interactions between mutations, where the result of combining several different mutations can be very different than the sum of their individual effects. Here we provide a statistical framework for modeling complex genetic interactions of this type. The key idea is to ask how fast the effects of mutations change when introducing the same mutation in increasingly distant genetic backgrounds. We then propose a model for phenotypic prediction that takes into account this tendency for the effects of mutations to be more similar in nearby genetic backgrounds. Contemporary high-throughput mutagenesis experiments are providing an increasingly detailed view of the complex patterns of genetic interaction that occur between multiple mutations within a single protein or regulatory element. By simultaneously measuring the effects of thousands of combinations of mutations, these experiments have revealed that the genotype–phenotype relationship typically reflects not only genetic interactions between pairs of sites but also higher-order interactions among larger numbers of sites. However, modeling and understanding these higher-order interactions remains challenging. Here we present a method for reconstructing sequence-to-function mappings from partially observed data that can accommodate all orders of genetic interaction. The main idea is to make predictions for unobserved genotypes that match the type and extent of epistasis found in the observed data. This information on the type and extent of epistasis can be extracted by considering how phenotypic correlations change as a function of mutational distance, which is equivalent to estimating the fraction of phenotypic variance due to each order of genetic interaction (additive, pairwise, three-way, etc.). Using these estimated variance components, we then define an empirical Bayes prior that in expectation matches the observed pattern of epistasis and reconstruct the genotype–phenotype mapping by conducting Gaussian process regression under this prior. To demonstrate the power of this approach, we present an application to the antibody-binding domain GB1 and also provide a detailed exploration of a dataset consisting of high-throughput measurements for the splicing efficiency of human pre-mRNA 5′ splice sites, for which we also validate our model predictions via additional low-throughput experiments.
Collapse
|
28
|
Fóthi Á, Pintér C, Pollner P, Lőrincz A. Peripheral gene interactions define interpretable clusters of core ASD genes in a network-based investigation of the omnigenic theory. NPJ Syst Biol Appl 2022; 8:28. [PMID: 35948596 PMCID: PMC9365765 DOI: 10.1038/s41540-022-00240-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 07/20/2022] [Indexed: 11/23/2022] Open
Abstract
According to the recently proposed omnigenic theory, all expressed genes in a relevant tissue are contributing directly or indirectly to the manifestation of complex disorders such as autism. Thus, holistic approaches can be complementary in studying genetics of these complex disorders to focusing on a limited number of candidate genes. Gene interaction networks can be used for holistic studies of the omnigenic nature of autism. We used Louvain clustering on tissue-specific gene interaction networks and their subgraphs exclusively containing autism-related genes to study the effects of peripheral gene interactions. We observed that the autism gene clusters are significantly weaker connected to each other and the peripheral genes in non-neuronal tissues than in brain-related tissues. The biological functions of the brain clusters correlated well with previous findings on autism, such as synaptic signaling, regulation of DNA methylation, or regulation of lymphocyte activation, however, on the other tissues they did not enrich as significantly. Furthermore, ASD subjects with disruptive mutations in specific gene clusters show phenotypical differences compared to other disruptive variants carrying ASD individuals. Our results strengthen the omnigenic theory and can advance our understanding of the genetic background of autism.
Collapse
|
29
|
Cuomo ASE, Heinen T, Vagiaki D, Horta D, Marioni JC, Stegle O. CellRegMap: a statistical framework for mapping context-specific regulatory variants using scRNA-seq. Mol Syst Biol 2022; 18:e10663. [PMID: 35972065 PMCID: PMC9380406 DOI: 10.15252/msb.202110663] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 06/28/2022] [Accepted: 07/01/2022] [Indexed: 11/11/2022] Open
Abstract
Single‐cell RNA sequencing (scRNA‐seq) enables characterizing the cellular heterogeneity in human tissues. Recent technological advances have enabled the first population‐scale scRNA‐seq studies in hundreds of individuals, allowing to assay genetic effects with single‐cell resolution. However, existing strategies to analyze these data remain based on principles established for the genetic analysis of bulk RNA‐seq. In particular, current methods depend on a priori definitions of discrete cell types, and hence cannot assess allelic effects across subtle cell types and cell states. To address this, we propose the Cell Regulatory Map (CellRegMap), a statistical framework to test for and quantify genetic effects on gene expression in individual cells. CellRegMap provides a principled approach to identify and characterize genotype–context interactions of known eQTL variants using scRNA‐seq data. This model‐based approach resolves allelic effects across cellular contexts of different granularity, including genetic effects specific to cell subtypes and continuous cell transitions. We validate CellRegMap using simulated data and apply it to previously identified eQTL from two recent studies of differentiating iPSCs, where we uncover hundreds of eQTL displaying heterogeneity of genetic effects across cellular contexts. Finally, we identify fine‐grained genetic regulation in neuronal subtypes for eQTL that are colocalized with human disease variants.
Collapse
|
30
|
Westerman KE, Majarian TD, Giulianini F, Jang DK, Miao J, Florez JC, Chen H, Chasman DI, Udler MS, Manning AK, Cole JB. Variance-quantitative trait loci enable systematic discovery of gene-environment interactions for cardiometabolic serum biomarkers. Nat Commun 2022; 13:3993. [PMID: 35810165 PMCID: PMC9271055 DOI: 10.1038/s41467-022-31625-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 06/24/2022] [Indexed: 11/29/2022] Open
Abstract
Gene-environment interactions represent the modification of genetic effects by environmental exposures and are critical for understanding disease and informing personalized medicine. These often induce differential phenotypic variance across genotypes; these variance-quantitative trait loci can be prioritized in a two-stage interaction detection strategy to greatly reduce the computational and statistical burden and enable testing of a broader range of exposures. We perform genome-wide variance-quantitative trait locus analysis for 20 serum cardiometabolic biomarkers by multi-ancestry meta-analysis of 350,016 unrelated participants in the UK Biobank, identifying 182 independent locus-biomarker pairs (p < 4.5×10-9). Most are concentrated in a small subset (4%) of loci with genome-wide significant main effects, and 44% replicate (p < 0.05) in the Women's Genome Health Study (N = 23,294). Next, we test each locus-biomarker pair for interaction across 2380 exposures, identifying 847 significant interactions (p < 2.4×10-7), of which 132 are independent (p < 0.05) after accounting for correlation between exposures. Specific examples demonstrate interaction of triglyceride-associated variants with distinct body mass- versus body fat-related exposures as well as genotype-specific associations between alcohol consumption and liver stress at the ADH1B gene. Our catalog of variance-quantitative trait loci and gene-environment interactions is publicly available in an online portal.
Collapse
|
31
|
Selemenakis P, Sharma N, Uhrig ME, Katz J, Kwon Y, Sung P, Wiese C. RAD51AP1 and RAD54L Can Underpin Two Distinct RAD51-Dependent Routes of DNA Damage Repair via Homologous Recombination. Front Cell Dev Biol 2022; 10:866601. [PMID: 35652094 PMCID: PMC9149245 DOI: 10.3389/fcell.2022.866601] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 04/20/2022] [Indexed: 11/17/2022] Open
Abstract
Homologous recombination DNA repair (HR) is a complex DNA damage repair pathway and an attractive target of inhibition in anti-cancer therapy. To help guide the development of efficient HR inhibitors, it is critical to identify compensatory HR sub-pathways. In this study, we describe a novel synthetic interaction between RAD51AP1 and RAD54L, two structurally unrelated proteins that function downstream of the RAD51 recombinase in HR. We show that concomitant deletion of RAD51AP1 and RAD54L further sensitizes human cancer cell lines to treatment with olaparib, a Poly (adenosine 5′-diphosphate-ribose) polymerase inhibitor, to the DNA inter-strand crosslinking agent mitomycin C, and to hydroxyurea, which induces DNA replication stress. We also show that the RAD54L paralog RAD54B compensates for RAD54L deficiency, although, surprisingly, less extensively than RAD51AP1. These results, for the first time, delineate RAD51AP1- and RAD54L-dependent sub-pathways and will guide the development of inhibitors that target HR stimulators of strand invasion.
Collapse
|
32
|
Hogan AM, Cardona ST. Gradients in gene essentiality reshape antibacterial research. FEMS Microbiol Rev 2022; 46:fuac005. [PMID: 35104846 PMCID: PMC9075587 DOI: 10.1093/femsre/fuac005] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Revised: 01/14/2022] [Accepted: 01/24/2022] [Indexed: 02/03/2023] Open
Abstract
Essential genes encode the processes that are necessary for life. Until recently, commonly applied binary classifications left no space between essential and non-essential genes. In this review, we frame bacterial gene essentiality in the context of genetic networks. We explore how the quantitative properties of gene essentiality are influenced by the nature of the encoded process, environmental conditions and genetic background, including a strain's distinct evolutionary history. The covered topics have important consequences for antibacterials, which inhibit essential processes. We argue that the quantitative properties of essentiality can thus be used to prioritize antibacterial cellular targets and desired spectrum of activity in specific infection settings. We summarize our points with a case study on the core essential genome of the cystic fibrosis pathobiome and highlight avenues for targeted antibacterial development.
Collapse
|
33
|
Zhang B, Ma L, Wu B, Xing Y, Qiu X. Introgression Lines: Valuable Resources for Functional Genomics Research and Breeding in Rice ( Oryza sativa L.). FRONTIERS IN PLANT SCIENCE 2022; 13:863789. [PMID: 35557720 PMCID: PMC9087921 DOI: 10.3389/fpls.2022.863789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Accepted: 04/01/2022] [Indexed: 05/14/2023]
Abstract
The narrow base of genetic diversity of modern rice varieties is mainly attributed to the overuse of the common backbone parents that leads to the lack of varied favorable alleles in the process of breeding new varieties. Introgression lines (ILs) developed by a backcross strategy combined with marker-assisted selection (MAS) are powerful prebreeding tools for broadening the genetic base of existing cultivars. They have high power for mapping quantitative trait loci (QTLs) either with major or minor effects, and are used for precisely evaluating the genetic effects of QTLs and detecting the gene-by-gene or gene-by-environment interactions due to their low genetic background noise. ILs developed from multiple donors in a fixed background can be used as an IL platform to identify the best alleles or allele combinations for breeding by design. In the present paper, we reviewed the recent achievements from ILs in rice functional genomics research and breeding, including the genetic dissection of complex traits, identification of elite alleles and background-independent and epistatic QTLs, analysis of genetic interaction, and genetic improvement of single and multiple target traits. We also discussed how to develop ILs for further identification of new elite alleles, and how to utilize IL platforms for rice genetic improvement.
Collapse
|
34
|
Köferle A, Schlattl A, Hörmann A, Thatikonda V, Popa A, Spreitzer F, Ravichandran MC, Supper V, Oberndorfer S, Puchner T, Wieshofer C, Corcokovic M, Reiser C, Wöhrle S, Popow J, Pearson M, Martinez J, Weitzer S, Mair B, Neumüller RA. Interrogation of cancer gene dependencies reveals paralog interactions of autosome and sex chromosome-encoded genes. Cell Rep 2022; 39:110636. [PMID: 35417719 DOI: 10.1016/j.celrep.2022.110636] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Revised: 12/22/2021] [Accepted: 03/16/2022] [Indexed: 02/07/2023] Open
Abstract
Genetic networks are characterized by extensive buffering. During tumor evolution, disruption of functional redundancies can create de novo vulnerabilities that are specific to cancer cells. Here, we systematically search for cancer-relevant paralog interactions using CRISPR screens and publicly available loss-of-function datasets. Our analysis reveals >2,000 candidate dependencies, several of which we validate experimentally, including CSTF2-CSTF2T, DNAJC15-DNAJC19, FAM50A-FAM50B, and RPP25-RPP25L. We provide evidence that RPP25L can physically and functionally compensate for the absence of RPP25 as a member of the RNase P/MRP complexes in tRNA processing. Our analysis also reveals unexpected redundancies between sex chromosome genes. We show that chrX- and chrY-encoded paralogs, such as ZFX-ZFY, DDX3X-DDX3Y, and EIF1AX-EIF1AY, are functionally linked. Tumor cell lines from male patients with loss of chromosome Y become dependent on the chrX-encoded gene. We propose targeting of chrX-encoded paralogs as a general therapeutic strategy for human tumors that have lost the Y chromosome.
Collapse
|
35
|
Priolo M, Palermo V, Aiello F, Ciolfi A, Pannone L, Muto V, Motta M, Mancini C, Radio FC, Niceta M, Leoni C, Pintomalli L, Carrozzo R, Rajola G, Mammì C, Zampino G, Martinelli S, Dallapiccola B, Pichierri P, Tartaglia M. SHP2's gain-of-function in Werner syndrome causes childhood disease onset likely resulting from negative genetic interaction. Clin Genet 2022; 102:12-21. [PMID: 35396703 DOI: 10.1111/cge.14140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 03/21/2022] [Accepted: 04/05/2022] [Indexed: 11/03/2022]
Abstract
Prompt diagnosis of complex phenotypes is a challenging task in clinical genetics. Whole exome sequencing has proved to be effective in solving such conditions. Here, we report on an unpredictable presentation of Werner Syndrome (WRNS) in a 12 year-old girl carrying a homozygous truncating variant in RECQL2, the gene mutated in WRNS, and a de novo activating missense change in PTPN11, the major Noonan syndrome gene, encoding SHP2, a protein tyrosine phosphatase positively controlling RAS function and MAPK signaling, which have tightly been associated with senescence in primary cells. All the major WRNS clinical criteria were present with an extreme precocious onset and were associated with mild intellectual disability, severe growth retardation and facial dysmorphism. Compared to primary fibroblasts from adult subjects with WRNS, proband's fibroblasts showed a dramatically reduced proliferation rate and competence, and a more accelerated senescence, in line with the anticipated WRNS features occurring in the child. In vitro functional characterization of the SHP2 mutant documented its hyperactive behavior and a significantly enhanced activation of the MAPK pathway. Based on the functional interaction of WRN and MAPK signaling in processes relevant to replicative senescence, these findings disclose a unique phenotype likely resulting from negative genetic interaction. This article is protected by copyright. All rights reserved.
Collapse
|
36
|
Preciado J, Begcy K, Liu T. The Arabidopsis HDZIP class II transcription factor ABA INSENSITIVE TO GROWTH 1 functions in leaf development. JOURNAL OF EXPERIMENTAL BOTANY 2022; 73:1978-1991. [PMID: 34849741 DOI: 10.1093/jxb/erab523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Accepted: 11/26/2021] [Indexed: 06/13/2023]
Abstract
Leaf laminar growth and adaxial-abaxial boundary formation are fundamental outcomes of plant development. Boundary and laminar growth coordinate the further patterning and growth of the leaf, directing the differentiation of cell types within the top and bottom domains and promoting initiation of lateral organs along their adaxial or abaxial axis. Leaf adaxial-abaxial polarity specification and laminar outgrowth are regulated by two transcription factors, REVOLUTA (REV) and KANADI (KAN). ABA INSENSITIVE TO GROWTH 1 (ABIG1) encodes a HOMEODOMAIN-LEUCINE ZIPPER (HD-ZIP) class II transcription factor and is a direct target of the adaxial-abaxial regulators REV and KAN. To investigate the role of ABIG1 in leaf development and in the establishment of polarity, we examined the phenotypes of both gain-of-function and loss-of-function mutants. Through genetic interaction analysis with REV and KAN mutants, we determined that ABIG1 plays a role in leaf laminar growth as well as in adaxial-abaxial polarity establishment. Genetic and physical interaction assays showed that ABIG1 interacts with the transcriptional TOPLESS corepressor. This study provides new evidence that ABIG1, another HD-ZIP II, facilitates growth through the corepressor TOPLESS.
Collapse
|
37
|
Lamaze A, Chen C, Leleux S, Xu M, George R, Stanewsky R. A natural timeless polymorphism allowing circadian clock synchronization in "white nights". Nat Commun 2022; 13:1724. [PMID: 35361756 PMCID: PMC8971440 DOI: 10.1038/s41467-022-29293-6|] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2020] [Accepted: 03/08/2022] [Indexed: 06/19/2023] Open
Abstract
Daily temporal organisation offers a fitness advantage and is determined by an interplay between environmental rhythms and circadian clocks. While light:dark cycles robustly synchronise circadian clocks, it is not clear how animals experiencing only weak environmental cues deal with this problem. Like humans, Drosophila originate in sub-Saharan Africa and spread North up to the polar circle, experiencing long summer days or even constant light (LL). LL disrupts clock function, due to constant activation of CRYPTOCHROME, which induces degradation of the clock protein TIMELESS (TIM), but temperature cycles are able to overcome these deleterious effects of LL. We show here that for this to occur a recently evolved natural timeless allele (ls-tim) is required, encoding the less light-sensitive L-TIM in addition to S-TIM, the only form encoded by the ancient s-tim allele. We show that only ls-tim flies can synchronise their behaviour to semi-natural conditions typical for Northern European summers, suggesting that this functional gain is driving the Northward ls-tim spread.
Collapse
|
38
|
Yamsri S, Prommetta S, Srivorakun H, Taweenan W, Sanchaisuriya K, Chaibunruang A, Fucharoen G, Fucharoen S. α 0-thalassemia in affected fetuses with hemoglobin E-β 0-thalassemia disease in a high-risk population in Thailand. Am J Transl Res 2022; 14:1315-1323. [PMID: 35273733 PMCID: PMC8902522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 01/31/2022] [Indexed: 06/14/2023]
Abstract
OBJECTIVES A co-inheritance of α0-thalassemia can ameliorate the clinical severity of the hemoglobin (Hb) E-β-thalassemia disease. This information should be provided at prenatal diagnosis. Identification of α0-thalassemia in an affected fetus is therefore valuable. We have explored this genetic interaction in a large cohort of affected fetuses with hemoglobin (Hb) E-β-thalassemia in northeast Thailand. METHODS A study was done retrospectively on 1,592 couples at risk of having fetuses with Hb E-β0-thalassemia, encountered from January 2011 to December 2019. A total of 415 left-over DNA specimens of the affected fetuses with Hb E-β0-thalassemia disease were further investigated. Examination of α0-thalassemia was done using gap-PCR or a multiplex PCR assay for simultaneous detection of Hb E and α0-thalassemia mutations. RESULTS Of the 415 affected fetuses, the two most common β0-thalassemia genes found were the codons 41/42 (-TTCT) (199/415; 48.0%) and codon 17 (A-T) (115/415; 27.7%). α0-thalassemia was found unexpectedly in 21 (5.1%) fetuses. Hematologic phenotypes of the parents indicated that it was impossible to differentiate a pure β0-thalassemia carrier from a double β0-thalassemia/α0-thalassemia heterozygote unless DNA analysis is performed. In contrast, a reduced level of Hb E in the Hb E carrier (<25%) is a valuable marker for predicting double heterozygosity for Hb E/α0-thalassemia. This could be further confirmed using a multiplex PCR assay. CONCLUSIONS There is a high prevalence of co-inheritance of α0-thalassemia in fetuses with Hb E-β0-thalassemia disease. In a high-risk population such as Thailand, we recommend screening for α0-thalassemia in all affected fetuses with Hb E-β0-thalassemia disease and providing complete genetic information to the parents to make appropriate decisions at prenatal diagnosis and genetic counseling.
Collapse
|
39
|
Mao G, Zheng Y, Lin S, Ma L, Zhou Z, Zhang S. Bioinformatic Analysis of Prognostic Value, Genetic Interaction, and Immune Infiltration of Chromobox Family Proteins in Breast Cancer. Int J Gen Med 2021; 14:9181-9191. [PMID: 34880657 PMCID: PMC8647335 DOI: 10.2147/ijgm.s343948] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Accepted: 11/23/2021] [Indexed: 11/23/2022] Open
Abstract
Introduction Breast cancer (BC) has become the malignant tumor with the highest incidence worldwide. As a critical components of epigenetic regulation complexes, chromobox (CBX) family members inhibit the transcription of target genes through chromatin modification, leading to the progression of various human diseases and cancers. So far, little is known about the role of different CBX members in BC, especially their association with immune cells. Methods We conducted the analysis of differential expression of CBXs using Oncomine and GEPIA, prognostic value of CBXs using GEPIA and Kaplan-Meier, genetic interaction of CBXs using cBioPortal and GeneMANIA, and immune cell infiltration of CBXs in BC patients using TIMER. Results The CBX2/3/4/8 expression levels were increased significantly, while the CBX6/7 expression levels were decreased. We found that CBX3 was significantly correlated with clinicopathological staging and short DFS in BC patients. High CBX3/5 expression was correlated with short OS in BC patients, while high expression of CBX4 was correlated with long OS in BC patients. In addition, the functions of CBXs family members mainly focus on methylated histone residue binding and chromatin organization. The CBXs expressions were closely related to the infiltration level of a variety of immune cells, including CD4/8+ T cells, B cells, neutrophils, macrophages and dendritic cells in BC cancers. The correlation between CBXs and immune cell infiltration was more common in Luminal BC than in Basal and Her-2 type. Conclusion This study may provide a new understanding for selection of molecular typing, therapeutic and prognostic biomarkers of CBX family in BC.
Collapse
|
40
|
Buphamalai P, Kokotovic T, Nagy V, Menche J. Network analysis reveals rare disease signatures across multiple levels of biological organization. Nat Commun 2021; 12:6306. [PMID: 34753928 PMCID: PMC8578255 DOI: 10.1038/s41467-021-26674-1] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Accepted: 10/19/2021] [Indexed: 01/26/2023] Open
Abstract
Rare genetic diseases are typically caused by a single gene defect. Despite this clear causal relationship between genotype and phenotype, identifying the pathobiological mechanisms at various levels of biological organization remains a practical and conceptual challenge. Here, we introduce a network approach for evaluating the impact of rare gene defects across biological scales. We construct a multiplex network consisting of over 20 million gene relationships that are organized into 46 network layers spanning six major biological scales between genotype and phenotype. A comprehensive analysis of 3,771 rare diseases reveals distinct phenotypic modules within individual layers. These modules can be exploited to mechanistically dissect the impact of gene defects and accurately predict rare disease gene candidates. Our results show that the disease module formalism can be applied to rare diseases and generalized beyond physical interaction networks. These findings open up new venues to apply network-based tools for cross-scale data integration.
Collapse
|
41
|
Precise Characterization of Genetic Interactions in Cancer via Molecular Network Refining Processes. Int J Mol Sci 2021; 22:ijms222011114. [PMID: 34681774 PMCID: PMC8540220 DOI: 10.3390/ijms222011114] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 10/05/2021] [Accepted: 10/13/2021] [Indexed: 12/13/2022] Open
Abstract
Genetic interactions (GIs), such as the synthetic lethal interaction, are promising therapeutic targets in precision medicine. However, despite extensive efforts to characterize GIs by large-scale perturbation screening, considerable false positives have been reported in multiple studies. We propose a new computational approach for improved precision in GI identification by applying constraints that consider actual biological phenomena. In this study, GIs were characterized by assessing mutation, loss of function, and expression profiles in the DEPMAP database. The expression profiles were used to exclude loss-of-function data for nonexpressed genes in GI characterization. More importantly, the characterized GIs were refined based on Kyoto Encyclopedia of Genes and Genomes (KEGG) or protein–protein interaction (PPI) networks, under the assumption that genes genetically interacting with a certain mutated gene are adjacent in the networks. As a result, the initial GIs characterized with CRISPR and RNAi screenings were refined to 65 and 23 GIs based on KEGG networks and to 183 and 142 GIs based on PPI networks. The evaluation of refined GIs showed improved precision with respect to known synthetic lethal interactions. The refining process also yielded a synthetic partner network (SPN) for each mutated gene, which provides insight into therapeutic strategies for the mutated genes; specifically, exploring the SPN of mutated BRAF revealed ELAVL1 as a potential target for treating BRAF-mutated cancer, as validated by previous research. We expect that this work will advance cancer therapeutic research.
Collapse
|
42
|
Kolosov N, Daly MJ, Artomov M. Prioritization of disease genes from GWAS using ensemble-based positive-unlabeled learning. Eur J Hum Genet 2021; 29:1527-1535. [PMID: 34276057 PMCID: PMC8484264 DOI: 10.1038/s41431-021-00930-w] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Revised: 05/23/2021] [Accepted: 06/21/2021] [Indexed: 02/07/2023] Open
Abstract
A primary challenge in understanding disease biology from genome-wide association studies (GWAS) arises from the inability to directly implicate causal genes from association data. Integration of multiple-omics data sources potentially provides important functional links between associated variants and candidate genes. Machine-learning is well-positioned to take advantage of a variety of such data and provide a solution for the prioritization of disease genes. Yet, classical positive-negative classifiers impose strong limitations on the gene prioritization procedure, such as a lack of reliable non-causal genes for training. Here, we developed a novel gene prioritization tool-Gene Prioritizer (GPrior). It is an ensemble of five positive-unlabeled bagging classifiers (Logistic Regression, Support Vector Machine, Random Forest, Decision Tree, Adaptive Boosting), that treats all genes of unknown relevance as an unlabeled set. GPrior selects an optimal composition of algorithms to tune the model for each specific phenotype. Altogether, GPrior fills an important niche of methods for GWAS data post-processing, significantly improving the ability to pinpoint disease genes compared to existing solutions.
Collapse
|
43
|
Xiao L, Man L, Yang L, Zhang J, Liu B, Quan M, Lu W, Fang Y, Wang D, Du Q, Zhang D. Association Study and Mendelian Randomization Analysis Reveal Effects of the Genetic Interaction Between PtoMIR403b and PtoGT31B-1 on Wood Formation in Populus tomentosa. FRONTIERS IN PLANT SCIENCE 2021; 12:704941. [PMID: 34527007 PMCID: PMC8435637 DOI: 10.3389/fpls.2021.704941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Accepted: 08/04/2021] [Indexed: 06/13/2023]
Abstract
MicroRNAs (miRNAs), important posttranscriptional regulators of gene expression, play a crucial role in plant growth and development. A single miRNA can regulate numerous target genes, making the determination of its function and interaction with targets challenging. We identified PtomiR403b target to PtoGT31B-1, which encodes a galactosyltransferase responsible for the biosynthesis of cell wall polysaccharides. We performed an association study and epistasis and Mendelian randomization (MR) analyses to explore how the genetic interaction between PtoMIR403b and its target PtoGT31B-1 underlies wood formation. Single nucleotide polymorphism (SNP)-based association studies identified 25 significant associations (P < 0.01, Q < 0.05), and PtoMIR403b and PtoGT31B-1 were associated with five traits, suggesting a role for PtomiR403b and PtoGT31B-1 in wood formation. Epistasis analysis identified 93 significant pairwise epistatic associations with 10 wood formation traits, and 37.89% of the SNP-SNP pairs indicated interactions between PtoMIR403b and PtoGT31B-1. We performed an MR analysis to demonstrate the causality of the relationships between SNPs in PtoMIR403b and wood property traits and that PtoMIR403b modulates wood formation by regulating expression of PtoGT31B-1. Therefore, our findings will facilitate dissection of the functions and interactions with miRNA-targets.
Collapse
|
44
|
Cheruiyot EK, Haile-Mariam M, Cocks BG, MacLeod IM, Xiang R, Pryce JE. New loci and neuronal pathways for resilience to heat stress in cattle. Sci Rep 2021; 11:16619. [PMID: 34404823 PMCID: PMC8371109 DOI: 10.1038/s41598-021-95816-8] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2021] [Accepted: 07/30/2021] [Indexed: 02/07/2023] Open
Abstract
While understanding the genetic basis of heat tolerance is crucial in the context of global warming's effect on humans, livestock, and wildlife, the specific genetic variants and biological features that confer thermotolerance in animals are still not well characterized. We used dairy cows as a model to study heat tolerance because they are lactating, and therefore often prone to thermal stress. The data comprised almost 0.5 million milk records (milk, fat, and proteins) of 29,107 Australian Holsteins, each having around 15 million imputed sequence variants. Dairy animals often reduce their milk production when temperature and humidity rise; thus, the phenotypes used to measure an individual's heat tolerance were defined as the rate of milk production decline (slope traits) with a rising temperature-humidity index. With these slope traits, we performed a genome-wide association study (GWAS) using different approaches, including conditional analyses, to correct for the relationship between heat tolerance and level of milk production. The results revealed multiple novel loci for heat tolerance, including 61 potential functional variants at sites highly conserved across 100 vertebrate species. Moreover, it was interesting that specific candidate variants and genes are related to the neuronal system (ITPR1, ITPR2, and GRIA4) and neuroactive ligand-receptor interaction functions for heat tolerance (NPFFR2, CALCR, and GHR), providing a novel insight that can help to develop genetic and management approaches to combat heat stress.
Collapse
|
45
|
Golovina E, Fadason T, Lints TJ, Walker C, Vickers MH, O’Sullivan JM. Understanding the impact of SNPs associated with autism spectrum disorder on biological pathways in the human fetal and adult cortex. Sci Rep 2021; 11:15867. [PMID: 34354167 PMCID: PMC8342620 DOI: 10.1038/s41598-021-95447-z] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 07/26/2021] [Indexed: 02/06/2023] Open
Abstract
Autism spectrum disorder (ASD) is a neurodevelopmental disorder characterized by significant and complex genetic etiology. GWAS studies have identified genetic variants associated with ASD, but the functional impacts of these variants remain unknown. Here, we integrated four distinct levels of biological information (GWAS, eQTL, spatial genome organization and protein-protein interactions) to identify potential regulatory impacts of ASD-associated SNPs (p < 5 × 10-8) on biological pathways within fetal and adult cortical tissues. We found 80 and 58 SNPs that mark regulatory regions (i.e. expression quantitative trait loci or eQTLs) in the fetal and adult cortex, respectively. These eQTLs were also linked to other psychiatric disorders (e.g. schizophrenia, ADHD, bipolar disorder). Functional annotation of ASD-associated eQTLs revealed that they are involved in diverse regulatory processes. In particular, we found significant enrichment of eQTLs within regions repressed by Polycomb proteins in the fetal cortex compared to the adult cortex. Furthermore, we constructed fetal and adult cortex-specific protein-protein interaction networks and identified that ASD-associated regulatory SNPs impact on immune pathways, fatty acid metabolism, ribosome biogenesis, aminoacyl-tRNA biosynthesis and spliceosome in the fetal cortex. By contrast, in the adult cortex they largely affect immune pathways. Overall, our findings highlight potential regulatory mechanisms and pathways important for the etiology of ASD in early brain development and adulthood. This approach, in combination with clinical studies on ASD, will contribute to individualized mechanistic understanding of ASD development.
Collapse
|
46
|
Rosas-Madrigal S, Villarreal-Molina MT, Flores-Rivera J, Rivas-Alonso V, Macias-Kauffer LR, Ordoñez G, Chima-Galán MDC, Acuña-Alonzo V, Macín-Pérez G, Barquera R, Granados J, Valle-Rios R, Corona T, Carnevale A, Romero-Hidalgo S. Interaction of HLA Class II rs9272219 and TMPO rs17028450 (Arg690Cys) Variants Affects Neuromyelitis Optica Spectrum Disorder Susceptibility in an Admixed Mexican Population. Front Genet 2021; 12:647343. [PMID: 34335680 PMCID: PMC8320513 DOI: 10.3389/fgene.2021.647343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Accepted: 06/23/2021] [Indexed: 12/02/2022] Open
Abstract
Neuromyelitis Optica Spectrum Disorder (NMOSD) is a demyelinating autoimmune disease of the central nervous system, more prevalent in individuals of non-European ancestry. Few studies have analyzed genetic risk factors in NMOSD, and HLA class II gene variation has been associated NMOSD risk in various populations including Mexicans. Thymopoietin (TMPO) has not been tested as a candidate gene for NMOSD or other autoimmune disease, however, experimental evidence suggests this gene may be involved in negative selection of autoreactive T cells and autoimmunity. We thus investigated whether the missense TMPO variant rs17028450 (Arg630Cys, frequent in Latin America) is associated with NMOSD, and whether this variant shows an interaction with HLA-class II rs9272219, previously associated with NMOSD risk. A total of 119 Mexican NMOSD patients, 1208 controls and 357 Native Mexican individuals were included. The HLA rs9272219 “T” risk allele frequency ranged from 21 to 68%, while the rs17028450 “T” minor allele frequency was as high as 18% in Native Mexican groups. Both rs9272219 and rs17028450 were significantly associated with NMOSD risk under additive models (OR = 2.48; p = 8 × 10–10 and OR = 1.59; p = 0.0075, respectively), and a significant interaction between both variants was identified with logistic regression models (p = 0.048). Individuals bearing both risk alleles had an estimated 3.9-fold increased risk of NMOSD. To our knowledge, this is the first study reporting an association of TMPO gene variation with an autoimmune disorder and the interaction of specific susceptibility gene variants, that may contribute to the genetic architecture of NMOSD in admixed Latin American populations.
Collapse
|
47
|
Bowen ME, Mulligan AS, Sorayya A, Attardi LD. Puma- and Caspase9-mediated apoptosis is dispensable for p53-driven neural crest-based developmental defects. Cell Death Differ 2021; 28:2083-2094. [PMID: 33574585 PMCID: PMC8257737 DOI: 10.1038/s41418-021-00738-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2020] [Revised: 01/12/2021] [Accepted: 01/14/2021] [Indexed: 01/20/2023] Open
Abstract
Inappropriate activation of the p53 transcription factor is thought to contribute to the developmental phenotypes in a range of genetic syndromes. Whether p53 activation drives these developmental phenotypes by triggering apoptosis, cell cycle arrest, or other p53 cellular responses, however, has remained elusive. As p53 hyperactivation in embryonic neural crest cells (NCCs) drives a number of phenotypes, including abnormal craniofacial and neuronal development, we investigate the basis for p53 action in this context. We show that p53-driven developmental defects are associated with the induction of a robust pro-apoptotic transcriptional signature. Intriguingly, however, deleting Puma or Caspase9, which encode key components of the intrinsic apoptotic pathway, does not rescue craniofacial, neuronal or pigmentation defects triggered by p53 hyperactivation in NCCs. Immunostaining analyses for two key apoptosis markers confirm that deleting Puma or Caspase9 does indeed impair p53-hyperactivation-induced apoptosis in NCCs. Furthermore, we demonstrate that p53 hyperactivation does not trigger a compensatory dampening of cell cycle progression in NCCs upon inactivation of apoptotic pathways. Together, our results indicate that p53-driven craniofacial, neuronal and pigmentation defects can arise in the absence of apoptosis and cell cycle arrest, suggesting that p53 hyperactivation can act via alternative pathways to trigger developmental phenotypes.
Collapse
|
48
|
Genetic diversity of Colletotrichum lupini and its virulence on white and Andean lupin. Sci Rep 2021; 11:13547. [PMID: 34188142 PMCID: PMC8242092 DOI: 10.1038/s41598-021-92953-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2021] [Accepted: 06/14/2021] [Indexed: 02/06/2023] Open
Abstract
Lupin cultivation worldwide is threatened by anthracnose, a destructive disease caused by the seed- and air-borne fungal pathogen Colletotrichum lupini. In this study we explored the intraspecific diversity of 39 C. lupini isolates collected from different lupin cultivating regions around the world, and representative isolates were screened for their pathogenicity and virulence on white and Andean lupin. Multi-locus phylogeny and morphological characterizations showed intraspecific diversity to be greater than previously shown, distinguishing a total of six genetic groups and ten distinct morphotypes. Highest diversity was found across South America, indicating it as the center of origin of C. lupini. The isolates that correspond to the current pandemic belong to a genetic and morphological uniform group, were globally widespread, and showed high virulence on tested white and Andean lupin accessions. Isolates belonging to the other five genetic groups were mostly found locally and showed distinct virulence patterns. Two highly virulent strains were shown to overcome resistance of advanced white lupin breeding material. This stresses the need to be careful with international seed transports in order to prevent spread of currently confined but potentially highly virulent strains. This study improves our understanding of the diversity, phylogeography and pathogenicity of a member of one of the world's top 10 plant pathogen genera, providing valuable information for breeding programs and future disease management.
Collapse
|
49
|
Braz CU, Rowan TN, Schnabel RD, Decker JE. Genome-wide association analyses identify genotype-by-environment interactions of growth traits in Simmental cattle. Sci Rep 2021; 11:13335. [PMID: 34172761 PMCID: PMC8233360 DOI: 10.1038/s41598-021-92455-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Accepted: 06/07/2021] [Indexed: 02/06/2023] Open
Abstract
Understanding genotype-by-environment interactions (G × E) is crucial to understand environmental adaptation in mammals and improve the sustainability of agricultural production. Here, we present an extensive study investigating the interaction of genome-wide SNP markers with a vast assortment of environmental variables and searching for SNPs controlling phenotypic variance (vQTL) using a large beef cattle dataset. We showed that G × E contribute 10.1%, 3.8%, and 2.8% of the phenotypic variance of birth weight, weaning weight, and yearling weight, respectively. G × E genome-wide association analysis (GWAA) detected a large number of G × E loci affecting growth traits, which the traditional GWAA did not detect, showing that functional loci may have non-additive genetic effects regardless of differences in genotypic means. Further, variance-heterogeneity GWAA detected loci enriched with G × E effects without requiring prior knowledge of the interacting environmental factors. Functional annotation and pathway analysis of G × E genes revealed biological mechanisms by which cattle respond to changes in their environment, such as neurotransmitter activity, hypoxia-induced processes, keratinization, hormone, thermogenic and immune pathways. We unraveled the relevance and complexity of the genetic basis of G × E underlying growth traits, providing new insights into how different environmental conditions interact with specific genes influencing adaptation and productivity in beef cattle and potentially across mammals.
Collapse
|
50
|
Chen K, Xu H, Lei Y, Lio P, Li Y, Guo H, Ali Moni M. Integration and interplay of machine learning and bioinformatics approach to identify genetic interaction related to ovarian cancer chemoresistance. Brief Bioinform 2021; 22:6272796. [PMID: 33971668 DOI: 10.1093/bib/bbab100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 03/04/2021] [Accepted: 03/06/2021] [Indexed: 11/15/2022] Open
Abstract
Although chemotherapy is the first-line treatment for ovarian cancer (OCa) patients, chemoresistance (CR) decreases their progression-free survival. This paper investigates the genetic interaction (GI) related to OCa-CR. To decrease the complexity of establishing gene networks, individual signature genes related to OCa-CR are identified using a gradient boosting decision tree algorithm. Additionally, the genetic interaction coefficient (GIC) is proposed to measure the correlation of two signature genes quantitatively and explain their joint influence on OCa-CR. Gene pair that possesses high GIC is identified as signature pair. A total of 24 signature gene pairs are selected that include 10 individual signature genes and the influence of signature gene pairs on OCa-CR is explored. Finally, a signature gene pair-based prediction of OCa-CR is identified. The area under curve (AUC) is a widely used performance measure for machine learning prediction. The AUC of signature gene pair reaches 0.9658, whereas the AUC of individual signature gene-based prediction is 0.6823 only. The identified signature gene pairs not only build an efficient GI network of OCa-CR but also provide an interesting way for OCa-CR prediction. This improvement shows that our proposed method is a useful tool to investigate GI related to OCa-CR.
Collapse
|