1
|
Dong Q, Wu J, Zhang H, Luo L, Wu W. The causal role of circulating inflammatory markers in osteoporosis: a bidirectional Mendelian randomized study. Front Immunol 2024; 15:1412298. [PMID: 39091505 PMCID: PMC11291241 DOI: 10.3389/fimmu.2024.1412298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Accepted: 07/02/2024] [Indexed: 08/04/2024] Open
Abstract
Background Osteoporosis (OP) associated with aging exerts substantial clinical and fiscal strains on societal structures. An increasing number of research studies have suggested a bidirectional relationship between circulating inflammatory markers (CIMs) and OP. However, observational studies are susceptible to perturbations in confounding variables. In contrast, Mendelian randomization (MR) offers a robust methodological framework to circumvent such confounders, facilitating a more accurate assessment of causality. Our study aimed to evaluate the causal relationships between CIMs and OP, identifying new approaches and strategies for the prevention, diagnosis and treatment of OP. Methods We analyzed publicly available GWAS summary statistics to investigate the causal relationships between CIMs and OP. Causal estimates were calculated via a systematic analytical framework, including bidirectional MR analysis and Bayesian colocalization analysis. Results Genetically determined levels of CXCL11 (OR = 0.91, 95% CI = 0.85-0.98, P = 0.008, PFDR = 0.119), IL-18 (OR = 0.88, 95% CI = 0.83-0.94, P = 8.66×10-5, PFDR = 0.008), and LIF (OR = 0.86, 95% CI = 0.76-0.96, P = 0.008, PFDR = 0.119) were linked to a reduced risk of OP. Conversely, higher levels of ARTN (OR = 1.11, 95% CI = 1.02-1.20, P = 0.012, PFDR = 0.119) and IFNG (OR = 1.16, 95% CI = 1.03-1.30, P = 0.013, PFDR = 0.119) were associated with an increased risk of OP. Bayesian colocalization analysis revealed no evidence of shared causal variants. Conclusion Despite finding no overall association between CIMs and OP, five CIMs demonstrated a potentially significant association with OP. These findings could pave the way for future mechanistic studies aimed at discovering new treatments for this disease. Additionally, we are the first to suggest a unidirectional causal relationship between ARTN and OP. This novel insight introduces new avenues for research into diagnostic and therapeutic strategies for OP.
Collapse
Affiliation(s)
- Qiu Dong
- Department of Bone and Joint Surgery, First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, China
| | - Jiayang Wu
- Medical Imaging Centre, First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, China
| | - Huaguo Zhang
- Department of Ultrasonography, First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, China
| | - Liangping Luo
- Medical Imaging Centre, First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, China
- Medical Imaging Center, The Fifth Affiliated Hospital of Jinan University, Heyuan, Guangdong, China
| | - Wenrui Wu
- Department of Bone and Joint Surgery, First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, China
- Department of Orthopedics, Chaoshan Hospital, The First Affiliated Hospital of Jinan University, Chaozhou, Guangdong, China
| |
Collapse
|
2
|
Ajay A, Begum T, Arya A, Kumar K, Ahmad S. Global and local genomic features together modulate the spontaneous single nucleotide mutation rate. Comput Biol Chem 2024; 112:108107. [PMID: 38875896 DOI: 10.1016/j.compbiolchem.2024.108107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 04/23/2024] [Accepted: 05/17/2024] [Indexed: 06/16/2024]
Abstract
Spontaneous mutations are evolutionary engines as they generate variants for the evolutionary downstream processes that give rise to speciation and adaptation. Single nucleotide mutations (SNM) are the most abundant type of mutations among them. Here, we perform a meta-analysis to quantify the influence of selected global genomic parameters (genome size, genomic GC content, genomic repeat fraction, number of coding genes, gene count, and strand bias in prokaryotes) and local genomic features (local GC content, repeat content, CpG content and the number of SNM at CpG islands) on spontaneous SNM rates across the tree of life (prokaryotes, unicellular eukaryotes, multicellular eukaryotes) using wild-type sequence data in two different taxon classification systems. We find that the spontaneous SNM rates in our data are correlated with many genomic features in prokaryotes and unicellular eukaryotes irrespective of their sample sizes. On the other hand, only the number of coding genes was correlated with the spontaneous SNM rates in multicellular eukaryotes primarily contributed by vertebrates data. Considering local features, we notice that local GC content and CpG content significantly were correlated with the spontaneous SNM rates in the unicellular eukaryotes, while local repeat fraction is an important feature in prokaryotes and certain specific uni- and multi-cellular eukaryotes. Such predictive features of the spontaneous SNM rates often support non-linear models as the best fit compared to the linear model. We also observe that the strand asymmetry in prokaryotes plays an important role in determining the spontaneous SNM rates but the SNM spectrum does not.
Collapse
Affiliation(s)
- Akash Ajay
- School of Environmental Sciences, Jawaharlal Nehru University, New Delhi 110067, India; School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| | - Tina Begum
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi 110067, India.
| | - Ajay Arya
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| | - Krishan Kumar
- School of Environmental Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| | - Shandar Ahmad
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi 110067, India.
| |
Collapse
|
3
|
Gafurov A, Vinar T, Medvedev P, Brejova B. Efficient Analysis of Annotation Colocalization Accounting for Genomic Contexts. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.22.568259. [PMID: 38045397 PMCID: PMC10690252 DOI: 10.1101/2023.11.22.568259] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
An annotation is a set of genomic intervals sharing a particular function or property. Examples include genes or their exons, evolutionarily conserved elements, and regions with a particular epigenetic state. A common task is to compare two annotations to determine if one is enriched or depleted in the regions covered by the other. We study the problem of assigning statistical significance to such a comparison based on a null model representing two random unrelated annotations. To incorporate more background information into such analyses,we propose a new null model based on a Markov chain which differentiates among several genomic contexts. These contexts can capture various confounding factors, such as GC content or assembly gaps. We then develop a new algorithm for estimating p-values by computing the exact expectation and variance of the test statistics and then estimating the p-value using a normal approximation. Compared to the previous algorithm by Gafurov et al., the new algorithm provides three advances: (1) the running time is improved from quadratic to linear or quasi-linear, (2) the algorithm can handle two different test statistics, and (3) the algorithm can handle both simple and context-dependent Markov chain null models. We demonstrate the efficiency and accuracy of our algorithm on synthetic and real data sets, including the recent human telomere-to-telomere assembly. In particular, our algorithm computed p-values for 450 pairs of human genome annotations using 24 threads in under three hours. Moreover, the use of genomic contexts to correct for GC bias resulted in the reversal of some previously published findings.
Collapse
|
4
|
Liu X, Yuan J, Wang X, Tang M, Meng X, Zhang L, Wang S, Zhang H. Association between rheumatoid arthritis and autoimmune thyroid disease: evidence from complementary genetic methods. Endocrine 2024; 84:171-178. [PMID: 37884826 DOI: 10.1007/s12020-023-03571-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Accepted: 10/09/2023] [Indexed: 10/28/2023]
Abstract
OBJECTIVES To assess the causal association of Rheumatoid Arthritis (RA) with Autoimmune thyroid disease (AITD). METHOD Complementary genetic approaches, including genetic correlation, Mendelian randomization (MR) and colocalization analysis, were conducted to assess the potential causal association between RA and AITD using summary statistics from large-scale genome-wide association studies (GWASs). Various sensitivity analyses had been conducted to assess the robustness and the consistency of the findings. RESULTS The linkage disequilibrium score regression revealed a shared genetic structure between RA and AITD, with the significant genetic correlation between RA and autoimmune hyperthyroidism and autoimmune hypothyroidism estimated to be 0.3945 (P = 2.83 × 10-6) and 0.2771 (P = 1.04 × 10-6) respectively. The results of MR analysis showed that RA had a positive causal relationship with autoimmune hypothyroidism and autoimmune hyperthyroidism. The odds ratio (OR) were 1.29 (95% CI, 1.17-1.42; P = 1.08 × 10-7) and 1.47 (95% CI, 1.25-1.72; P = 1.85 × 10-6), respectively. In reverse MR analysis, autoimmune hypothyroidism had a positive causal relationship with RA, OR was 1.51 (95% CI, 1.37-1.66; P = 1.10 × 10-16); autoimmune hyperthyroidism had no causal relationship with RA relationship (P = 0.22). Similar results were found using different MR methods. In addition, colocalization analysis suggested that shared causal variants existed between RA and AITD. CONCLUSIONS Our study suggested a potentially causal effect of genetically predicted RA on autoimmune hyperthyroidism and a bidirectional causal relationship between RA and autoimmune hypothyroidism was also observed with complementary genetic approaches, which supports the importance and necessity of thyroid function screening and monitoring in RA patient management in clinical practice.
Collapse
Affiliation(s)
- Xue Liu
- Department of Endocrinology, Shandong Provincial Hospital, Shandong University, Jinan, 250021, Shandong, China
| | - Jie Yuan
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250021, Shandong, China
| | - Xinhui Wang
- Department of Endocrinology, Shandong Provincial Hospital, Shandong University, Jinan, 250021, Shandong, China
| | - Mulin Tang
- Department of Endocrinology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, 250021, Shandong, China
| | - Xue Meng
- Department of Endocrinology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, 250021, Shandong, China
| | - Li Zhang
- Department of Vascular Surgery, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, 250021, Shandong, China
| | - Shukang Wang
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250021, Shandong, China.
- Institute for Medical Dataology, Cheeloo College of Medicine, Shandong University, Jinan, 250012, Shandong, China.
| | - Haiqing Zhang
- Department of Endocrinology, Shandong Provincial Hospital, Shandong University, Jinan, 250021, Shandong, China.
- Department of Endocrinology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, 250021, Shandong, China.
- Shandong Clinical Medical Center of Endocrinology and Metabolism, Jinan, 250021, China.
- Institute of Endocrinology and Metabolism, Shandong Academy of Clinical Medicine, Jinan, 250021, China.
| |
Collapse
|
5
|
Xu Q, del Mundo IMA, Zewail-Foote M, Luke BT, Vasquez KM, Kowalski J. MoCoLo: a testing framework for motif co-localization. Brief Bioinform 2024; 25:bbae019. [PMID: 38521050 PMCID: PMC10960634 DOI: 10.1093/bib/bbae019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 01/08/2024] [Accepted: 01/09/2024] [Indexed: 03/25/2024] Open
Abstract
Sequence-level data offers insights into biological processes through the interaction of two or more genomic features from the same or different molecular data types. Within motifs, this interaction is often explored via the co-occurrence of feature genomic tracks using fixed-segments or analytical tests that respectively require window size determination and risk of false positives from over-simplified models. Moreover, methods for robustly examining the co-localization of genomic features, and thereby understanding their spatial interaction, have been elusive. We present a new analytical method for examining feature interaction by introducing the notion of reciprocal co-occurrence, define statistics to estimate it and hypotheses to test for it. Our approach leverages conditional motif co-occurrence events between features to infer their co-localization. Using reverse conditional probabilities and introducing a novel simulation approach that retains motif properties (e.g. length, guanine-content), our method further accounts for potential confounders in testing. As a proof-of-concept, motif co-localization (MoCoLo) confirmed the co-occurrence of histone markers in a breast cancer cell line. As a novel analysis, MoCoLo identified significant co-localization of oxidative DNA damage within non-B DNA-forming regions that significantly differed between non-B DNA structures. Altogether, these findings demonstrate the potential utility of MoCoLo for testing spatial interactions between genomic features via their co-localization.
Collapse
Affiliation(s)
- Qi Xu
- Department of Molecular Biosciences, College of Natural Sciences, The University of Texas at Austin, Austin, TX, 78712, USA
- Department of Oncology, Dell Medical School, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Imee M A del Mundo
- Dell Pediatric Research Institute, Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin, Austin, Texas, 78723, USA
| | - Maha Zewail-Foote
- Department of Chemistry and Biochemistry, Southwestern University, Georgetown, TX, 78626, USA
| | - Brian T Luke
- Bioinformatics and Computational Science, Frederick National Laboratory for Cancer Research, Frederick, Maryland, 21701, USA
| | - Karen M Vasquez
- Dell Pediatric Research Institute, Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin, Austin, Texas, 78723, USA
| | - Jeanne Kowalski
- Department of Oncology, Dell Medical School, The University of Texas at Austin, Austin, TX, 78712, USA
| |
Collapse
|
6
|
Cai Y, Xie S, Jia X, Chen D, Wu D, Bao W, Cai J, Mao J, Ye J. Integrated analysis of Mendelian Randomization and Bayesian colocalization reveals bidirectional causal association between inflammatory bowel disease and psoriasis. Ann Med 2023; 55:2281658. [PMID: 37988718 PMCID: PMC10836255 DOI: 10.1080/07853890.2023.2281658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 11/03/2023] [Indexed: 11/23/2023] Open
Abstract
BACKGROUND Observational studies have suggested an association between inflammatory bowel disease [IBD] and psoriasis. However, the detailed genetic basis, causality, and direction of this association remain unclear. METHODS Bidirectional two-sample Mendelian Randomization [MR] analysis was conducted using summary statistics from published genome-wide association studies. Bayesian Colocalization and multivariable MR [MVMR] analyses were performed to identify candidate variants and risk genes involved in the shared genetic basis between IBD, psoriasis, and their subtypes. RESULTS Genetically predicted IBD and Crohn's disease [CD] were associated with an increased risk of psoriasis, psoriasis vulgaris [PsV], and psoriatic arthritis [PsA] (IBD on psoriasis: pooled odds ratio [OR] 1.09, 95% confidence interval [CI] 1.04-1.14, p = .0001; CD on psoriasis: pooled OR 1.10, 95% CI 1.06-1.15, p < .0001) and vice versa (psoriasis on IBD: pooled OR 1.11, 95%CI 1.02-1.21), whereas CD only exhibited a unidirectional association with psoriasis. Colocalization analysis revealed eight candidate genetic variants and risk genes (including LINC00824, CDKAL1, IL10, IL23R, DNAJC27, LPP, RUNX3, and RGS14) associated with a shared genetic basis. Among these, IL23R, DNAJC27, LPP, and RGS14 were further validated by MVMR analysis. CONCLUSION Our findings indicated bidirectional causal associations between IBD and psoriasis (including PsV and PsA), which were attributed primarily to CD rather than Ulcerative colitis [UC]. Furthermore, we identified several candidate variants and risk genes involved in the shared genetic basis of IBD and psoriasis. Acquiring a better understanding of the shared genetic architecture underlying IBD and psoriasis would help improve clinical strategies.
Collapse
Affiliation(s)
- Yangke Cai
- Department of Gastroenterology, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China
| | - Siyuan Xie
- Department of Gastroenterology, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China
| | - Xuan Jia
- Department of Gastroenterology, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China
| | - Delong Chen
- Department of Cardiology, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China
| | - Dehao Wu
- Cancer Institute (Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education), The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China
| | - Wenwen Bao
- Department of Gastroenterology, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China
| | - Jianting Cai
- Department of Gastroenterology, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China
| | - Jianshan Mao
- Department of Gastroenterology, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China
| | - Jun Ye
- Department of Gastroenterology, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China
| |
Collapse
|
7
|
Georgiou AN, Zagkos L, Markozannes G, Chalitsios CV, Asimakopoulos AG, Xu W, Wang L, Mesa‐Eguiagaray I, Zhou X, Loizidou EM, Kretsavos N, Theodoratou E, Gill D, Burgess S, Evangelou E, Tsilidis KK, Tzoulaki I. Appraising the Causal Role of Risk Factors in Coronary Artery Disease and Stroke: A Systematic Review of Mendelian Randomization Studies. J Am Heart Assoc 2023; 12:e029040. [PMID: 37804188 PMCID: PMC7615320 DOI: 10.1161/jaha.122.029040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 06/27/2023] [Indexed: 10/09/2023]
Abstract
BACKGROUND Mendelian randomization (MR) offers a powerful approach to study potential causal associations between exposures and health outcomes by using genetic variants associated with an exposure as instrumental variables. In this systematic review, we aimed to summarize previous MR studies and to evaluate the evidence for causality for a broad range of exposures in relation to coronary artery disease and stroke. METHODS AND RESULTS MR studies investigating the association of any genetically predicted exposure with coronary artery disease or stroke were identified. Studies were classified into 4 categories built on the significance of the main MR analysis results and its concordance with sensitivity analyses, namely, robust, probable, suggestive, and insufficient. Studies reporting associations that did not perform any sensitivity analysis were classified as nonevaluable. We identified 2725 associations eligible for evaluation, examining 535 distinct exposures. Of them, 141 were classified as robust, 353 as probable, 110 as suggestive, and 926 had insufficient evidence. The most robust associations were observed for anthropometric traits, lipids, and lipoproteins and type 2 diabetes with coronary artery; disease and clinical measurements with coronary artery disease and stroke; and thrombotic factors with stroke. CONCLUSIONS Despite the large number of studies that have been conducted, only a limited number of associations were supported by robust evidence. Approximately half of the studies reporting associations presented an MR sensitivity analysis along with the main analysis that further supported the causality of associations. Future research should focus on more thorough assessments of sensitivity MR analyses and further assessments of mediation effects or nonlinearity of associations.
Collapse
Affiliation(s)
- Andrea N. Georgiou
- Department of Hygiene and EpidemiologyUniversity of Ioannina School of MedicineIoanninaGreece
| | - Loukas Zagkos
- Department of Epidemiology and BiostatisticsSchool of Public Health, Imperial College LondonLondonUK
| | - Georgios Markozannes
- Department of Hygiene and EpidemiologyUniversity of Ioannina School of MedicineIoanninaGreece
- Department of Epidemiology and BiostatisticsSchool of Public Health, Imperial College LondonLondonUK
| | - Christos V. Chalitsios
- Department of Hygiene and EpidemiologyUniversity of Ioannina School of MedicineIoanninaGreece
| | | | - Wei Xu
- Centre for Global Health, Usher InstituteThe University of EdinburghEdinburghUK
| | - Lijuan Wang
- Centre for Global Health, Usher InstituteThe University of EdinburghEdinburghUK
| | | | - Xuan Zhou
- Centre for Global Health, Usher InstituteThe University of EdinburghEdinburghUK
| | - Eleni M. Loizidou
- Department of Hygiene and EpidemiologyUniversity of Ioannina School of MedicineIoanninaGreece
- Biobank Cyprus Center of Excellence in Biobanking and Biomedical ResearchUniversity of CyprusNicosiaCyprus
| | - Nikolaos Kretsavos
- Department of Hygiene and EpidemiologyUniversity of Ioannina School of MedicineIoanninaGreece
| | - Evropi Theodoratou
- Centre for Global Health, Usher InstituteThe University of EdinburghEdinburghUK
- Cancer Research UK Edinburgh Centre, Institute of Genetics and CancerThe University of EdinburghEdinburghUK
| | - Dipender Gill
- Department of Epidemiology and BiostatisticsSchool of Public Health, Imperial College LondonLondonUK
- Medical Research Council Biostatistics UnitUniversity of CambridgeCambridgeUK
| | - Stephen Burgess
- Medical Research Council Biostatistics UnitUniversity of CambridgeCambridgeUK
- Cardiovascular Epidemiology UnitUniversity of CambridgeCambridgeUK
| | - Evangelos Evangelou
- Department of Hygiene and EpidemiologyUniversity of Ioannina School of MedicineIoanninaGreece
- Department of Epidemiology and BiostatisticsSchool of Public Health, Imperial College LondonLondonUK
- Department of Biomedical Research, Institute of Molecular Biology and BiotechnologyFoundation for Research and Technology‐HellasIoanninaGreece
| | - Konstantinos K. Tsilidis
- Department of Hygiene and EpidemiologyUniversity of Ioannina School of MedicineIoanninaGreece
- Department of Epidemiology and BiostatisticsSchool of Public Health, Imperial College LondonLondonUK
| | - Ioanna Tzoulaki
- Department of Epidemiology and BiostatisticsSchool of Public Health, Imperial College LondonLondonUK
- Centre for Systems Biology, Biomedical Research FoundationAcademy of AthensAthensGreece
| |
Collapse
|
8
|
Wang Y, Wei Z, Su J, Coenen F, Meng J. RgnTX: Colocalization analysis of transcriptome elements in the presence of isoform heterogeneity and ambiguity. Comput Struct Biotechnol J 2023; 21:4110-4117. [PMID: 37671241 PMCID: PMC10475473 DOI: 10.1016/j.csbj.2023.08.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 08/13/2023] [Accepted: 08/23/2023] [Indexed: 09/07/2023] Open
Abstract
Colocalization analysis of genomic region sets has been widely adopted to unveil potential functional interactions between corresponding biological attributes, which often serves as the basis for further investigation. A number of methods have been developed for colocalization analysis of genomic elements. However, none of them explicitly considered the transcriptome heterogeneity and isoform ambiguity, making them less appropriate for analyzing transcriptome elements. Here, we developed RgnTX, an R/Bioconductor tool for the colocalization analysis of transcriptome elements with permutation tests. Different from existing approaches, RgnTX directly takes advantage of transcriptome annotation, and offers high flexibility in the null model to simulate realistic transcriptome-wide background, such as the complex alternative splicing patterns. Importantly, it supports the testing of transcriptome elements without clear isoform association, which is often the real scenario due to technical limitations. Proposed package offers a wide selection of pre-defined functions, easy to be utilized by users for visualizing permutation results, calculating shifted z-scores and conducting multiple hypothesis testing under Benjamini-Hochberg correction. Moreover, with synthetic and real datasets, we show that RgnTX novel testing modes return distinct and more significant results compared to existing genome-based methods. We believe RgnTX should make a useful tool to characterize the randomness of the transcriptome, and for conducting statistical association analysis for genomic region sets within the heterogeneous transcriptome. The package now has been accepted by Bioconductor and is freely available at: https://bioconductor.org/packages/RgnTX.
Collapse
Affiliation(s)
- Yue Wang
- Department of Mathematical Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Department of Computer Science, University of Liverpool, L69 7ZB Liverpool, United Kingdom
| | - Zhen Wei
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L69 7ZB Liverpool, United Kingdom
| | - Jionglong Su
- School of AI and Advanced Computing, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Frans Coenen
- Department of Computer Science, University of Liverpool, L69 7ZB Liverpool, United Kingdom
| | - Jia Meng
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- AI University Research Centre, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L69 7ZB Liverpool, United Kingdom
| |
Collapse
|
9
|
Ciochetti NP, Lugli-Moraes B, da Silva BS, Rovaris DL. Genome-wide association studies: utility and limitations for research in physiology. J Physiol 2023; 601:2771-2799. [PMID: 37208942 PMCID: PMC10527550 DOI: 10.1113/jp284241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Accepted: 05/10/2023] [Indexed: 05/21/2023] Open
Abstract
Physiological systems are subject to interindividual variation encoded by genetics. Genome-wide association studies (GWAS) operate by surveying thousands of genetic variants from a substantial number of individuals and assessing their association to a trait of interest, be it a physiological variable, a molecular phenotype (e.g. gene expression), or even a disease or condition. Through a myriad of methods, GWAS downstream analyses then explore the functional consequences of each variant and attempt to ascertain a causal relationship to the phenotype of interest, as well as to delve into its links to other traits. This type of investigation allows mechanistic insights into physiological functions, pathological disturbances and shared biological processes between traits (i.e. pleiotropy). An exciting example is the discovery of a new thyroid hormone transporter (SLC17A4) and hormone metabolising enzyme (AADAT) from a GWAS on free thyroxine levels. Therefore, GWAS have substantially contributed with insights into physiology and have been shown to be useful in unveiling the genetic control underlying complex traits and pathological conditions; they will continue to do so with global collaborations and advances in genotyping technology. Finally, the increasing number of trans-ancestry GWAS and initiatives to include ancestry diversity in genomics will boost the power for discoveries, making them also applicable to non-European populations.
Collapse
Affiliation(s)
- Nicolas Pereira Ciochetti
- Laboratory of Physiological Genomics of Mental Health (PhysioGen Lab), Instituto de Ciencias Biomedicas Universidade de Sao Paulo, São Paulo, Brazil
| | - Beatriz Lugli-Moraes
- Laboratory of Physiological Genomics of Mental Health (PhysioGen Lab), Instituto de Ciencias Biomedicas Universidade de Sao Paulo, São Paulo, Brazil
| | - Bruna Santos da Silva
- Laboratory of Physiological Genomics of Mental Health (PhysioGen Lab), Instituto de Ciencias Biomedicas Universidade de Sao Paulo, São Paulo, Brazil
- Laboratory of Developmental Psychiatry, Center of Experimental Research, Hospital de Clínicas de Porto Alegre, Porto Alegre, Rio Grande do Sul, Brazil
| | - Diego Luiz Rovaris
- Laboratory of Physiological Genomics of Mental Health (PhysioGen Lab), Instituto de Ciencias Biomedicas Universidade de Sao Paulo, São Paulo, Brazil
| |
Collapse
|
10
|
Mu W, Davis ES, Lee S, Dozmorov MG, Phanstiel DH, Love MI. bootRanges: flexible generation of null sets of genomic ranges for hypothesis testing. Bioinformatics 2023; 39:btad190. [PMID: 37042725 PMCID: PMC10159650 DOI: 10.1093/bioinformatics/btad190] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 01/06/2023] [Accepted: 03/28/2023] [Indexed: 04/13/2023] Open
Abstract
MOTIVATION Enrichment analysis is a widely utilized technique in genomic analysis that aims to determine if there is a statistically significant association between two sets of genomic features. To conduct this type of hypothesis testing, an appropriate null model is typically required. However, the null distribution that is commonly used can be overly simplistic and may result in inaccurate conclusions. RESULTS bootRanges provides fast functions for generation of block bootstrapped genomic ranges representing the null hypothesis in enrichment analysis. As part of a modular workflow, bootRanges offers greater flexibility for computing various test statistics leveraging other Bioconductor packages. We show that shuffling or permutation schemes may result in overly narrow test statistic null distributions and over-estimation of statistical significance, while creating new range sets with a block bootstrap preserves local genomic correlation structure and generates more reliable null distributions. It can also be used in more complex analyses, such as accessing correlations between cis-regulatory elements (CREs) and genes across cell types or providing optimized thresholds, e.g. log fold change (logFC) from differential analysis. AVAILABILITY AND IMPLEMENTATION bootRanges is freely available in the R/Bioconductor package nullranges hosted at https://bioconductor.org/packages/nullranges.
Collapse
Affiliation(s)
- Wancen Mu
- Department of Biostatistics, University of North Carolina, Chapel Hill 27514, United States
| | - Eric S Davis
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill 27514, United States
| | - Stuart Lee
- Genentech, South San Francisco, Western California 94080, United States
| | - Mikhail G Dozmorov
- Department of Biostatistics, Virginia Commonwealth University, Richmond, VA 23284, United States
- Department of Pathology, Virginia Commonwealth University, Richmond, VA 23284, United States
| | - Douglas H Phanstiel
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill 27514, United States
- Thurston Arthritis Research Center, University of North Carolina, Chapel Hill 27514, United States
- Department of Cell Biology and Physiology, University of North Carolina, Chapel Hill 27514, United States
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill 27514, United States
- Curriculum in Genetics and Molecular Biology, University of North Carolina, Chapel Hill 27514, United States
| | - Michael I Love
- Department of Biostatistics, University of North Carolina, Chapel Hill 27514, United States
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill 27514, United States
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill 27514, United States
- Department of Genetics, University of North Carolina, Chapel Hill 27514, United States
| |
Collapse
|
11
|
A comparison of the genes and genesets identified by GWAS and EWAS of fifteen complex traits. Nat Commun 2022; 13:7816. [PMID: 36535946 PMCID: PMC9763500 DOI: 10.1038/s41467-022-35037-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Accepted: 11/16/2022] [Indexed: 12/23/2022] Open
Abstract
Identifying genomic regions pertinent to complex traits is a common goal of genome-wide and epigenome-wide association studies (GWAS and EWAS). GWAS identify causal genetic variants, directly or via linkage disequilibrium, and EWAS identify variation in DNA methylation associated with a trait. While GWAS in principle will only detect variants due to causal genes, EWAS can also identify genes via confounding, or reverse causation. We systematically compare GWAS (N > 50,000) and EWAS (N > 4500) results of 15 complex traits. We evaluate if the genes or gene ontology terms flagged by GWAS and EWAS overlap, and find substantial overlap for diastolic blood pressure, (gene overlap P = 5.2 × 10-6; term overlap P = 0.001). We superimpose our empirical findings against simulated models of varying genetic and epigenetic architectures and observe that in most cases GWAS and EWAS are likely capturing distinct genesets. Our results indicate that GWAS and EWAS are capturing different aspects of the biology of complex traits.
Collapse
|
12
|
Levin MG, Tsao NL, Singhal P, Liu C, Vy HMT, Paranjpe I, Backman JD, Bellomo TR, Bone WP, Biddinger KJ, Hui Q, Dikilitas O, Satterfield BA, Yang Y, Morley MP, Bradford Y, Burke M, Reza N, Charest B, Judy RL, Puckelwartz MJ, Hakonarson H, Khan A, Kottyan LC, Kullo I, Luo Y, McNally EM, Rasmussen-Torvik LJ, Day SM, Do R, Phillips LS, Ellinor PT, Nadkarni GN, Ritchie MD, Arany Z, Cappola TP, Margulies KB, Aragam KG, Haggerty CM, Joseph J, Sun YV, Voight BF, Damrauer SM. Genome-wide association and multi-trait analyses characterize the common genetic architecture of heart failure. Nat Commun 2022; 13:6914. [PMID: 36376295 PMCID: PMC9663424 DOI: 10.1038/s41467-022-34216-6] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 10/17/2022] [Indexed: 11/16/2022] Open
Abstract
Heart failure is a leading cause of cardiovascular morbidity and mortality. However, the contribution of common genetic variation to heart failure risk has not been fully elucidated, particularly in comparison to other common cardiometabolic traits. We report a multi-ancestry genome-wide association study meta-analysis of all-cause heart failure including up to 115,150 cases and 1,550,331 controls of diverse genetic ancestry, identifying 47 risk loci. We also perform multivariate genome-wide association studies that integrate heart failure with related cardiac magnetic resonance imaging endophenotypes, identifying 61 risk loci. Gene-prioritization analyses including colocalization and transcriptome-wide association studies identify known and previously unreported candidate cardiomyopathy genes and cellular processes, which we validate in gene-expression profiling of failing and healthy human hearts. Colocalization, gene expression profiling, and Mendelian randomization provide convergent evidence for the roles of BCKDHA and circulating branch-chain amino acids in heart failure and cardiac structure. Finally, proteome-wide Mendelian randomization identifies 9 circulating proteins associated with heart failure or quantitative imaging traits. These analyses highlight similarities and differences among heart failure and associated cardiovascular imaging endophenotypes, implicate common genetic variation in the pathogenesis of heart failure, and identify circulating proteins that may represent cardiomyopathy treatment targets.
Collapse
Affiliation(s)
- Michael G Levin
- Division of Cardiovascular Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Corporal Michael J. Crescenz VA Medical Center, Philadelphia, PA, USA
| | - Noah L Tsao
- Department of Surgery, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Pankhuri Singhal
- Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Chang Liu
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, USA
| | - Ha My T Vy
- The Charles Bronfman Institute of Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ishan Paranjpe
- Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Tiffany R Bellomo
- Department of Surgery, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - William P Bone
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Kiran J Biddinger
- Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics and Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
| | - Qin Hui
- Emory University School of Public Health, Atlanta, GA, USA
- Atlanta VA Health Care System, Decatur, GA, USA
| | - Ozan Dikilitas
- Departments of Internal Medicine and Cardiovascular Medicine, and Mayo Clinician-Investigator Training Program, Mayo Clinic, Rochester, MN, USA
| | | | - Yifan Yang
- Cardiovascular Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael P Morley
- Cardiovascular Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Yuki Bradford
- Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Megan Burke
- Division of Cardiovascular Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Nosheen Reza
- Division of Cardiovascular Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Brian Charest
- Massachusetts Veterans Epidemiology Research and Information Center, VA Boston Healthcare System, Boston, MA, USA
| | - Renae L Judy
- Department of Surgery, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Megan J Puckelwartz
- Department of Pharmacology, Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Hakon Hakonarson
- Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Atlas Khan
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY, USA
| | - Leah C Kottyan
- Department of Pediatrics, Division of Human Genetics and Center for Autoimmune Genomics and Etiology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Iftikhar Kullo
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, USA
| | - Yuan Luo
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Elizabeth M McNally
- Center for Genetic Medicine, Bluhm Cardiovascular Institute, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Laura J Rasmussen-Torvik
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Sharlene M Day
- Division of Cardiovascular Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Cardiovascular Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Ron Do
- The Charles Bronfman Institute for Personalized Medicine, BioMe Phenomics Center, and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Lawrence S Phillips
- Atlanta VA Health Care System, Decatur, GA, USA
- Division of Endocrinology, Emory University School of Medicine, Atlanta, GA, USA
| | - Patrick T Ellinor
- Program in Medical and Population Genetics and Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Cardiovascular Research Center and Cardiac Arrhythmia Service, Massachusetts General Hospital, Boston, MA, USA
| | - Girish N Nadkarni
- Division of Nephrology, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Marylyn D Ritchie
- Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
- Institute for Biomedical Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Zoltan Arany
- Division of Cardiovascular Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Cardiovascular Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Thomas P Cappola
- Division of Cardiovascular Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Cardiovascular Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Kenneth B Margulies
- Division of Cardiovascular Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Cardiovascular Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Krishna G Aragam
- Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics and Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Christopher M Haggerty
- Department of Translational Data Science and Informatics and Heart Institute, Geisinger, Danville, PA, USA
| | - Jacob Joseph
- Massachusetts Veterans Epidemiology Research and Information Center, VA Boston Healthcare System, Boston, MA, USA
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Yan V Sun
- Emory University School of Public Health, Atlanta, GA, USA
- Atlanta VA Health Care System, Decatur, GA, USA
| | - Benjamin F Voight
- Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
- Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
- Institute of Translational Medicine and Therapeutics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Scott M Damrauer
- Corporal Michael J. Crescenz VA Medical Center, Philadelphia, PA, USA.
- Department of Surgery, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
| |
Collapse
|
13
|
Markozannes G, Kanellopoulou A, Dimopoulou O, Kosmidis D, Zhang X, Wang L, Theodoratou E, Gill D, Burgess S, Tsilidis KK. Systematic review of Mendelian randomization studies on risk of cancer. BMC Med 2022; 20:41. [PMID: 35105367 PMCID: PMC8809022 DOI: 10.1186/s12916-022-02246-y] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 01/10/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND We aimed to map and describe the current state of Mendelian randomization (MR) literature on cancer risk and to identify associations supported by robust evidence. METHODS We searched PubMed and Scopus up to 06/10/2020 for MR studies investigating the association of any genetically predicted risk factor with cancer risk. We categorized the reported associations based on a priori designed levels of evidence supporting a causal association into four categories, namely robust, probable, suggestive, and insufficient, based on the significance and concordance of the main MR analysis results and at least one of the MR-Egger, weighed median, MRPRESSO, and multivariable MR analyses. Associations not presenting any of the aforementioned sensitivity analyses were not graded. RESULTS We included 190 publications reporting on 4667 MR analyses. Most analyses (3200; 68.6%) were not accompanied by any of the assessed sensitivity analyses. Of the 1467 evaluable analyses, 87 (5.9%) were supported by robust, 275 (18.7%) by probable, and 89 (6.1%) by suggestive evidence. The most prominent robust associations were observed for anthropometric indices with risk of breast, kidney, and endometrial cancers; circulating telomere length with risk of kidney, lung, osteosarcoma, skin, thyroid, and hematological cancers; sex steroid hormones and risk of breast and endometrial cancer; and lipids with risk of breast, endometrial, and ovarian cancer. CONCLUSIONS Despite the large amount of research on genetically predicted risk factors for cancer risk, limited associations are supported by robust evidence for causality. Most associations did not present a MR sensitivity analysis and were thus non-evaluable. Future research should focus on more thorough assessment of sensitivity MR analyses and on more transparent reporting.
Collapse
Affiliation(s)
- Georgios Markozannes
- Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece
- Department of Epidemiology and Biostatistics, St. Mary's Campus, School of Public Health, Imperial College London, Norfolk Place, London, W2 1PG, UK
| | - Afroditi Kanellopoulou
- Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece
| | | | - Dimitrios Kosmidis
- Department of Hygiene, Epidemiology and Medical Statistics, Medical School, National and Kapodistrian University of Athens, Athens, Greece
| | - Xiaomeng Zhang
- Centre for Global Health, Usher Institute, The University of Edinburgh, Edinburgh, UK
| | - Lijuan Wang
- Centre for Global Health, Usher Institute, The University of Edinburgh, Edinburgh, UK
| | - Evropi Theodoratou
- Centre for Global Health, Usher Institute, The University of Edinburgh, Edinburgh, UK
- CRUK Edinburgh Centre, Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK
| | - Dipender Gill
- Department of Epidemiology and Biostatistics, St. Mary's Campus, School of Public Health, Imperial College London, Norfolk Place, London, W2 1PG, UK
| | - Stephen Burgess
- Medical Research Council Biostatistics Unit, University of Cambridge, Cambridge, UK
- Cardiovascular Epidemiology Unit, University of Cambridge, Cambridge, UK
| | - Konstantinos K Tsilidis
- Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece.
- Department of Epidemiology and Biostatistics, St. Mary's Campus, School of Public Health, Imperial College London, Norfolk Place, London, W2 1PG, UK.
| |
Collapse
|
14
|
Bouras E, Karhunen V, Gill D, Huang J, Haycock PC, Gunter MJ, Johansson M, Brennan P, Key T, Lewis SJ, Martin RM, Murphy N, Platz EA, Travis R, Yarmolinsky J, Zuber V, Martin P, Katsoulis M, Freisling H, Nøst TH, Schulze MB, Dossus L, Hung RJ, Amos CI, Ahola-Olli A, Palaniswamy S, Männikkö M, Auvinen J, Herzig KH, Keinänen-Kiukaanniemi S, Lehtimäki T, Salomaa V, Raitakari O, Salmi M, Jalkanen S, Jarvelin MR, Dehghan A, Tsilidis KK. Circulating inflammatory cytokines and risk of five cancers: a Mendelian randomization analysis. BMC Med 2022; 20:3. [PMID: 35012533 PMCID: PMC8750876 DOI: 10.1186/s12916-021-02193-0] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Accepted: 11/18/2021] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Epidemiological and experimental evidence has linked chronic inflammation to cancer aetiology. It is unclear whether associations for specific inflammatory biomarkers are causal or due to bias. In order to examine whether altered genetically predicted concentration of circulating cytokines are associated with cancer development, we performed a two-sample Mendelian randomisation (MR) analysis. METHODS Up to 31,112 individuals of European descent were included in genome-wide association study (GWAS) meta-analyses of 47 circulating cytokines. Single nucleotide polymorphisms (SNPs) robustly associated with the cytokines, located in or close to their coding gene (cis), were used as instrumental variables. Inverse-variance weighted MR was used as the primary analysis, and the MR assumptions were evaluated in sensitivity and colocalization analyses and a false discovery rate (FDR) correction for multiple comparisons was applied. Corresponding germline GWAS summary data for five cancer outcomes (breast, endometrial, lung, ovarian, and prostate), and their subtypes were selected from the largest cancer-specific GWASs available (cases ranging from 12,906 for endometrial to 133,384 for breast cancer). RESULTS There was evidence of inverse associations of macrophage migration inhibitory factor with breast cancer (OR per SD = 0.88, 95% CI 0.83 to 0.94), interleukin-1 receptor antagonist with endometrial cancer (0.86, 0.80 to 0.93), interleukin-18 with lung cancer (0.87, 0.81 to 0.93), and beta-chemokine-RANTES with ovarian cancer (0.70, 0.57 to 0.85) and positive associations of monokine induced by gamma interferon with endometrial cancer (3.73, 1.86 to 7.47) and cutaneous T-cell attracting chemokine with lung cancer (1.51, 1.22 to 1.87). These associations were similar in sensitivity analyses and supported in colocalization analyses. CONCLUSIONS Our study adds to current knowledge on the role of specific inflammatory biomarker pathways in cancer aetiology. Further validation is needed to assess the potential of these cytokines as pharmacological or lifestyle targets for cancer prevention.
Collapse
Affiliation(s)
- Emmanouil Bouras
- Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece
| | - Ville Karhunen
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, St Mary's Campus, London, W2 1PG, UK
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
- Research Unit of Mathematical Sciences, University of Oulu, Oulu, Finland
| | - Dipender Gill
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, St Mary's Campus, London, W2 1PG, UK
- Novo Nordisk Research Centre Oxford, Old Road Campus, Oxford, UK
- Clinical Pharmacology Group, Pharmacy and Medicines Directorate, St George's University Hospitals NHS Foundation Trust, London, UK
- Clinical Pharmacology and Therapeutics Section, Institute for Infection and Immunity, St George's, University of London, London, UK
| | - Jian Huang
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, St Mary's Campus, London, W2 1PG, UK
- Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Philip C Haycock
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Marc J Gunter
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, World Health Organization, Lyon, France
| | - Mattias Johansson
- Genomics Branch, International Agency for Research on Cancer, World Health Organization, Lyon, France
| | - Paul Brennan
- Genomics Branch, International Agency for Research on Cancer, World Health Organization, Lyon, France
| | - Tim Key
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Sarah J Lewis
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Department of Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Richard M Martin
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Department of Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
- National Institute for Health Research (NIHR) Bristol Biomedical Research Centre, University Hospitals Bristol NHS Foundation Trust and the University of Bristol, Bristol, UK
| | - Neil Murphy
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, World Health Organization, Lyon, France
| | - Elizabeth A Platz
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Ruth Travis
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - James Yarmolinsky
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
| | - Verena Zuber
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, St Mary's Campus, London, W2 1PG, UK
| | - Paul Martin
- School of Biochemistry, University of Bristol, Bristol, UK
| | - Michail Katsoulis
- Institute of Health Informatics, University College London, London, UK
- Health Data Research UK, London, UK
| | - Heinz Freisling
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, World Health Organization, Lyon, France
| | - Therese Haugdahl Nøst
- Department of Community Medicine, Faculty of Health Sciences, Arctic University of Norway, Tromsø, Norway
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, Norwegian University of Science and Technology, Trondheim, Norway
| | - Matthias B Schulze
- Department of Molecular Epidemiology, German Institute of Human Nutrition Potsdam-Rehbruecke, Nutehtal, Germany
- Institute of Nutritional Science, University of Potsdam, Potsdam, Germany
| | - Laure Dossus
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, World Health Organization, Lyon, France
| | - Rayjean J Hung
- Prosserman Centre for Population Health Research, Lunenfeld-Tanenbaum Research Institute of Sinai Health System, Toronto, Canada
- Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
| | | | - Ari Ahola-Olli
- The Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | - Saranya Palaniswamy
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, St Mary's Campus, London, W2 1PG, UK
| | - Minna Männikkö
- Northern Finland Birth Cohorts, Infrastructure for Population Studies, Faculty of Medicine, University of Oulu, Oulu, Finland
| | - Juha Auvinen
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
| | - Karl-Heinz Herzig
- Research Unit of Biomedicine, Medical Research Center, Faculty of Medicine, University of Oulu, and Oulu University Hospital, Oulu, Finland
| | | | - Terho Lehtimäki
- Department of Clinical Chemistry, Finnish Cardiovascular Research Center - Tampere, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Veikko Salomaa
- Finnish Institute for Health and Welfare, Helsinki, Finland
| | - Olli Raitakari
- Research Centre of Applied and Preventive Cardiovascular Medicine, University of Turku, Turku, Finland
- Department of Clinical Physiology and Nuclear Medicine, Turku University Hospital, Turku, Finland
| | - Marko Salmi
- MediCity Research Laboratory, University of Turku, Turku, Finland
- Institute of Biomedicine, University of Turku, Turku, Finland
| | - Sirpa Jalkanen
- MediCity Research Laboratory, University of Turku, Turku, Finland
- Institute of Biomedicine, University of Turku, Turku, Finland
| | - Marjo-Riitta Jarvelin
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, St Mary's Campus, London, W2 1PG, UK
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
- Unit of Primary Care, Oulu University Hospital, Oulu, Finland
| | - Abbas Dehghan
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, St Mary's Campus, London, W2 1PG, UK
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
- UK Dementia Research Institute at Imperial College London, London, UK
| | - Konstantinos K Tsilidis
- Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece.
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, St Mary's Campus, London, W2 1PG, UK.
| |
Collapse
|
15
|
Ferré Q, Capponi C, Puthier D. OLOGRAM-MODL: mining enriched n-wise combinations of genomic features with Monte Carlo and dictionary learning. NAR Genom Bioinform 2022; 3:lqab114. [PMID: 34988437 PMCID: PMC8693575 DOI: 10.1093/nargab/lqab114] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 11/08/2021] [Accepted: 11/23/2021] [Indexed: 02/06/2023] Open
Abstract
Most epigenetic marks, such as Transcriptional Regulators or histone marks, are biological objects known to work together in n-wise complexes. A suitable way to infer such functional associations between them is to study the overlaps of the corresponding genomic regions. However, the problem of the statistical significance of n-wise overlaps of genomic features is seldom tackled, which prevent rigorous studies of n-wise interactions. We introduce OLOGRAM-MODL, which considers overlaps between n ≥ 2 sets of genomic regions, and computes their statistical mutual enrichment by Monte Carlo fitting of a Negative Binomial distribution, resulting in more resolutive P-values. An optional machine learning method is proposed to find complexes of interest, using a new itemset mining algorithm based on dictionary learning which is resistant to noise inherent to biological assays. The overall approach is implemented through an easy-to-use CLI interface for workflow integration, and a visual tree-based representation of the results suited for explicability. The viability of the method is experimentally studied using both artificial and biological data. This approach is accessible through the command line interface of the pygtftk toolkit, available on Bioconda and from https://github.com/dputhier/pygtftk
Collapse
Affiliation(s)
- Quentin Ferré
- Aix Marseille Univ, INSERM, UMR U1090, TAGC, Marseille, France
| | - Cécile Capponi
- Aix Marseille Univ, CNRS, UMR 7020, LIS, Qarma, Marseille, France
| | - Denis Puthier
- Aix Marseille Univ, INSERM, UMR U1090, TAGC, Marseille, France
| |
Collapse
|
16
|
Gafurov A, Brejová B, Medvedev P. OUP accepted manuscript. Bioinformatics 2022; 38:i203-i211. [PMID: 35758770 PMCID: PMC9235476 DOI: 10.1093/bioinformatics/btac255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Motivation Genome annotations are a common way to represent genomic features such as genes, regulatory elements or epigenetic modifications. The amount of overlap between two annotations is often used to ascertain if there is an underlying biological connection between them. In order to distinguish between true biological association and overlap by pure chance, a robust measure of significance is required. One common way to do this is to determine if the number of intervals in the reference annotation that intersect the query annotation is statistically significant. However, currently employed statistical frameworks are often either inefficient or inaccurate when computing P-values on the scale of the whole human genome. Results We show that finding the P-values under the typically used ‘gold’ null hypothesis is NP-hard. This motivates us to reformulate the null hypothesis using Markov chains. To be able to measure the fidelity of our Markovian null hypothesis, we develop a fast direct sampling algorithm to estimate the P-value under the gold null hypothesis. We then present an open-source software tool MCDP that computes the P-values under the Markovian null hypothesis in O(m2+n) time and O(m) memory, where m and n are the numbers of intervals in the reference and query annotations, respectively. Notably, MCDP runtime and memory usage are independent from the genome length, allowing it to outperform previous approaches in runtime and memory usage by orders of magnitude on human genome annotations, while maintaining the same level of accuracy. Availability and implementation The software is available at https://github.com/fmfi-compbio/mc-overlaps. All data for reproducibility are available at https://github.com/fmfi-compbio/mc-overlaps-reproducibility. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Broňa Brejová
- Department of Computer Science, Comenius University, Bratislava 84248, Slovakia
| | - Paul Medvedev
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, PA 16802, USA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|
17
|
Adam Y, Samtal C, Brandenburg JT, Falola O, Adebiyi E. Performing post-genome-wide association study analysis: overview, challenges and recommendations. F1000Res 2021; 10:1002. [PMID: 35222990 PMCID: PMC8847724 DOI: 10.12688/f1000research.53962.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/22/2021] [Indexed: 12/17/2022] Open
Abstract
Genome-wide association studies (GWAS) provide huge information on statistically significant single-nucleotide polymorphisms (SNPs) associated with various human complex traits and diseases. By performing GWAS studies, scientists have successfully identified the association of hundreds of thousands to millions of SNPs to a single phenotype. Moreover, the association of some SNPs with rare diseases has been intensively tested. However, classic GWAS studies have not yet provided solid, knowledgeable insight into functional and biological mechanisms underlying phenotypes or mechanisms of diseases. Therefore, several post-GWAS (pGWAS) methods have been recommended. Currently, there is no simple scientific document to provide a quick guide for performing pGWAS analysis. pGWAS is a crucial step for a better understanding of the biological machinery beyond the SNPs. Here, we provide an overview to performing pGWAS analysis and demonstrate the challenges behind each method. Furthermore, we direct readers to key articles for each pGWAS method and present the overall issues in pGWAS analysis. Finally, we include a custom pGWAS pipeline to guide new users when performing their research.
Collapse
Affiliation(s)
- Yagoub Adam
- Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Ogun, 112233, Nigeria
| | - Chaimae Samtal
- Laboratory of Biotechnology, Environment, Agri-food and Health, Sidi Mohammed Ben Abdellah University, Fez, Fez-Meknes, 30000, Morocco
| | - Jean-tristan Brandenburg
- Sydney Brenner Institute for Molecular Bioscience (SBIMB), University of the Witwatersrand, Johannesburg, South Africa
| | - Oluwadamilare Falola
- Laboratory of Biotechnology, Environment, Agri-food and Health, Sidi Mohammed Ben Abdellah University, Fez, Fez-Meknes, 30000, Morocco
| | - Ezekiel Adebiyi
- Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Ogun, 112233, Nigeria
- Computer & Information Sciences, Covenant University, Ota, Ogun, 112233, Nigeria
- Covenant Applied Informatics and Communication Africa Centre of Excellence, Covenant University, Ota, Ogun, 112233, Nigeria
- Applied Bioinformatics Division, German Cancer Center DKFZ - Heidelberg University, Heidelberg, Baden-Württemberg, 69120, Germany
| |
Collapse
|
18
|
Gu A, Cho HJ, Sheffield NC. Bedshift: perturbation of genomic interval sets. Genome Biol 2021; 22:238. [PMID: 34416909 PMCID: PMC8379854 DOI: 10.1186/s13059-021-02440-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 07/26/2021] [Indexed: 12/25/2022] Open
Abstract
Functional genomics experiments, like ChIP-Seq or ATAC-Seq, produce results that are summarized as a region set. There is no way to objectively evaluate the effectiveness of region set similarity metrics. We present Bedshift, a tool for perturbing BED files by randomly shifting, adding, and dropping regions from a reference file. The perturbed files can be used to benchmark similarity metrics, as well as for other applications. We highlight differences in behavior between metrics, such as that the Jaccard score is most sensitive to added or dropped regions, while coverage score is most sensitive to shifted regions.
Collapse
Affiliation(s)
- Aaron Gu
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Computer Science, University of Virginia School of Engineering, Charlottesville, VA, USA
| | - Hyun Jae Cho
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Computer Science, University of Virginia School of Engineering, Charlottesville, VA, USA
| | - Nathan C Sheffield
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA.
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA.
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA.
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA.
| |
Collapse
|
19
|
Pandey P, Gao Y, Kingsford C. VariantStore: an index for large-scale genomic variant search. Genome Biol 2021; 22:231. [PMID: 34412679 PMCID: PMC8375130 DOI: 10.1186/s13059-021-02442-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Accepted: 07/27/2021] [Indexed: 11/18/2022] Open
Abstract
Efficiently scaling genomic variant search indexes to thousands of samples is computationally challenging due to the presence of multiple coordinate systems to avoid reference biases. We present VariantStore, a system that indexes genomic variants from multiple samples using a variation graph and enables variant queries across any sample-specific coordinate system. We show the scalability of VariantStore by indexing genomic variants from the TCGA project in 4 h and the 1000 Genomes project in 3 h. Querying for variants in a gene takes between 0.002 and 3 seconds using memory only 10% of the size of the full representation.
Collapse
Affiliation(s)
- Prashant Pandey
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, USA
| | - Yinjie Gao
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, USA
| | - Carl Kingsford
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, USA
| |
Collapse
|
20
|
Gharavi E, Gu A, Zheng G, Smith JP, Cho HJ, Zhang A, Brown DE, Sheffield NC. Embeddings of genomic region sets capture rich biological associations in lower dimensions. Bioinformatics 2021; 37:4299-4306. [PMID: 34156475 PMCID: PMC8652032 DOI: 10.1093/bioinformatics/btab439] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2020] [Revised: 06/07/2021] [Accepted: 06/15/2021] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Genomic region sets summarize functional genomics data and define locations of interest in the genome such as regulatory regions or transcription factor binding sites. The number of publicly available region sets has increased dramatically, leading to challenges in data analysis. RESULTS We propose a new method to represent genomic region sets as vectors, or embeddings, using an adapted word2vec approach. We compared our approach to two simpler methods based on interval unions or term frequency-inverse document frequency and evaluated the methods in three ways: First, by classifying the cell line, antibody, or tissue type of the region set; second, by assessing whether similarity among embeddings can reflect simulated random perturbations of genomic regions; and third, by testing robustness of the proposed representations to different signal thresholds for calling peaks. Our word2vec-based region set embeddings reduce dimensionality from more than a hundred thousand to 100 without significant loss in classification performance. The vector representation could identify cell line, antibody, and tissue type with over 90% accuracy. We also found that the vectors could quantitatively summarize simulated random perturbations to region sets and are more robust to subsampling the data derived from different peak calling thresholds. Our evaluations demonstrate that the vectors retain useful biological information in relatively lower-dimensional spaces. We propose that vector representation of region sets is a promising approach for efficient analysis of genomic region data. AVAILABILITY https://github.com/databio/regionset-embedding.
Collapse
Affiliation(s)
- Erfaneh Gharavi
- Center for Public Health Genomics, University of Virginia.,School of Data Science, University of Virginia
| | - Aaron Gu
- Center for Public Health Genomics, University of Virginia.,Department of Computer Science, University of Virginia
| | | | - Jason P Smith
- Center for Public Health Genomics, University of Virginia.,Department of Biochemistry and Molecular Genetics, University of Virginia
| | - Hyun Jae Cho
- Center for Public Health Genomics, University of Virginia.,Department of Computer Science, University of Virginia
| | - Aidong Zhang
- Department of Computer Science, University of Virginia
| | | | - Nathan C Sheffield
- Center for Public Health Genomics, University of Virginia.,Department of Public Health Sciences, University of Virginia.,Department of Biomedical Engineering, University of Virginia.,Department of Biochemistry and Molecular Genetics, University of Virginia.,School of Data Science, University of Virginia
| |
Collapse
|
21
|
Jiang JC, Rothnagel JA, Upton KR. Widespread Exaptation of L1 Transposons for Transcription Factor Binding in Breast Cancer. Int J Mol Sci 2021; 22:5625. [PMID: 34070697 PMCID: PMC8199441 DOI: 10.3390/ijms22115625] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 05/18/2021] [Accepted: 05/20/2021] [Indexed: 12/29/2022] Open
Abstract
L1 transposons occupy 17% of the human genome and are widely exapted for the regulation of human genes, particularly in breast cancer, where we have previously shown abundant cancer-specific transcription factor (TF) binding sites within the L1PA2 subfamily. In the current study, we performed a comprehensive analysis of TF binding activities in primate-specific L1 subfamilies and identified pervasive exaptation events amongst these evolutionarily related L1 transposons. By motif scanning, we predicted diverse and abundant TF binding potentials within the L1 transposons. We confirmed substantial TF binding activities in the L1 subfamilies using TF binding sites consolidated from an extensive collection of publicly available ChIP-seq datasets. Young L1 subfamilies (L1HS, L1PA2 and L1PA3) contributed abundant TF binding sites in MCF7 cells, primarily via their 5' UTR. This is expected as the L1 5' UTR hosts cis-regulatory elements that are crucial for L1 replication and mobilisation. Interestingly, the ancient L1 subfamilies, where 5' truncation was common, displayed comparable TF binding capacity through their 3' ends, suggesting an alternative exaptation mechanism in L1 transposons that was previously unnoticed. Overall, primate-specific L1 transposons were extensively exapted for TF binding in MCF7 breast cancer cells and are likely prominent genetic players modulating breast cancer transcriptional regulation.
Collapse
Affiliation(s)
| | | | - Kyle R. Upton
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, QLD 4072, Australia; (J.-C.J.); (J.A.R.)
| |
Collapse
|
22
|
Gundersen S, Boddu S, Capella-Gutierrez S, Drabløs F, Fernández JM, Kompova R, Taylor K, Titov D, Zerbino D, Hovig E. Recommendations for the FAIRification of genomic track metadata. F1000Res 2021; 10. [PMID: 34249331 PMCID: PMC8226415 DOI: 10.12688/f1000research.28449.1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/17/2021] [Indexed: 01/25/2023] Open
Abstract
Background: Many types of data from genomic analyses can be represented as genomic tracks,
i.e. features linked to the genomic coordinates of a reference genome. Examples of such data are epigenetic DNA methylation data, ChIP-seq peaks, germline or somatic DNA variants, as well as RNA-seq expression levels. Researchers often face difficulties in locating, accessing and combining relevant tracks from external sources, as well as locating the raw data, reducing the value of the generated information. Description of work: We propose to advance the application of FAIR data principles (Findable, Accessible, Interoperable, and Reusable) to produce searchable metadata for genomic tracks. Findability and Accessibility of metadata can then be ensured by a track search service that integrates globally identifiable metadata from various track hubs in the Track Hub Registry and other relevant repositories. Interoperability and Reusability need to be ensured by the specification and implementation of a basic set of recommendations for metadata. We have tested this concept by developing such a specification in a JSON Schema, called FAIRtracks, and have integrated it into a novel track search service, called TrackFind. We demonstrate practical usage by importing datasets through TrackFind into existing examples of relevant analytical tools for genomic tracks: EPICO and the GSuite HyperBrowser. Conclusion: We here provide a first iteration of a draft standard for genomic track metadata, as well as the accompanying software ecosystem. It can easily be adapted or extended to future needs of the research community regarding data, methods and tools, balancing the requirements of both data submitters and analytical end-users.
Collapse
Affiliation(s)
| | - Sanjay Boddu
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | | | - Finn Drabløs
- Department of Clinical and Molecular Medicine, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
| | - José M Fernández
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain
| | - Radmila Kompova
- Center for Bioinformatics, University of Oslo (UiO), Oslo, Norway
| | - Kieron Taylor
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Dmytro Titov
- Center for Bioinformatics, University of Oslo (UiO), Oslo, Norway
| | - Daniel Zerbino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Eivind Hovig
- Center for Bioinformatics, University of Oslo (UiO), Oslo, Norway.,Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital (OUH), Oslo, Norway
| |
Collapse
|
23
|
Kanduri C, Sandve GK, Hovig E, De S, Layer RM. Editorial: Genomic Colocalization and Enrichment Analyses. Front Genet 2021; 11:617876. [PMID: 33574832 PMCID: PMC7870795 DOI: 10.3389/fgene.2020.617876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Accepted: 11/24/2020] [Indexed: 12/03/2022] Open
Affiliation(s)
| | | | - Eivind Hovig
- Department of Informatics, University of Oslo, Oslo, Norway.,Department of Tumor Biology, Institute for Cancer Research, Radium Hospital, Oslo University Hospital, Oslo, Norway
| | - Subhajyoti De
- Rutgers Cancer Institute of New Jersey, New Brunswick, GA, United States
| | - Ryan M Layer
- Computer Science Department, University of Colorado, Boulder, CO, United States.,BioFrontiers Institute, University of Colorado, Boulder, CO, United States
| |
Collapse
|
24
|
Feng J, Sheffield NC. IGD: high-performance search for large-scale genomic interval datasets. Bioinformatics 2020; 37:118-120. [PMID: 33367484 DOI: 10.1093/bioinformatics/btaa1062] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2020] [Revised: 10/19/2020] [Accepted: 12/15/2020] [Indexed: 01/04/2023] Open
Abstract
SUMMARY Databases of large-scale genome projects now contain thousands of genomic interval datasets. These data are a critical resource for understanding the function of DNA. However, our ability to examine and integrate interval data of this scale is limited. Here, we introduce the integrated genome database (IGD), a method and tool for searching genome interval datasets more than three orders of magnitude faster than existing approaches, while using only one hundredth of the memory. IGD uses a novel linear binning method that allows us to scale analysis to billions of genomic regions. AVAILABILITY https://github.com/databio/IGD.
Collapse
Affiliation(s)
- Jianglin Feng
- Center for Public Health Genomics, University of Virginia
| | - Nathan C Sheffield
- Center for Public Health Genomics, University of Virginia.,Department of Public Health Sciences, University of Virginia.,Department of Biomedical Engineering, University of Virginia.,Department of Biochemistry and Molecular Genetics, University of Virginia
| |
Collapse
|
25
|
Huang D, Yi X, Zhou Y, Yao H, Xu H, Wang J, Zhang S, Nong W, Wang P, Shi L, Xuan C, Li M, Wang J, Li W, Kwan HS, Sham PC, Wang K, Li MJ. Ultrafast and scalable variant annotation and prioritization with big functional genomics data. Genome Res 2020; 30:1789-1801. [PMID: 33060171 PMCID: PMC7706736 DOI: 10.1101/gr.267997.120] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2020] [Accepted: 09/22/2020] [Indexed: 02/06/2023]
Abstract
The advances of large-scale genomics studies have enabled compilation of cell type–specific, genome-wide DNA functional elements at high resolution. With the growing volume of functional annotation data and sequencing variants, existing variant annotation algorithms lack the efficiency and scalability to process big genomic data, particularly when annotating whole-genome sequencing variants against a huge database with billions of genomic features. Here, we develop VarNote to rapidly annotate genome-scale variants in large and complex functional annotation resources. Equipped with a novel index system and a parallel random-sweep searching algorithm, VarNote shows substantial performance improvements (two to three orders of magnitude) over existing algorithms at different scales. It supports both region-based and allele-specific annotations and introduces advanced functions for the flexible extraction of annotations. By integrating massive base-wise and context-dependent annotations in the VarNote framework, we introduce three efficient and accurate pipelines to prioritize the causal regulatory variants for common diseases, Mendelian disorders, and cancers.
Collapse
Affiliation(s)
- Dandan Huang
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China.,Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China.,Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Xianfu Yi
- School of Biomedical Engineering, Tianjin Medical University, Tianjin 300070, China
| | - Yao Zhou
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Hongcheng Yao
- School of Biomedical Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR 999077, China
| | - Hang Xu
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China.,School of Biomedical Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR 999077, China
| | - Jianhua Wang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Shijie Zhang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Wenyan Nong
- School of Life Sciences, The Chinese University of Hong Kong, Hong Kong SAR 999077, China
| | - Panwen Wang
- Department of Health Sciences Research and Center for Individualized Medicine, Mayo Clinic, Scottsdale, Arizona 85259, USA
| | - Lei Shi
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Chenghao Xuan
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Miaoxin Li
- Center for Genome Research, Center for Precision Medicine, Zhongshan School of Medicine, First Affiliated Hospital, Sun Yat-Sen University, Guangzhou 510080, China
| | - Junwen Wang
- Department of Health Sciences Research and Center for Individualized Medicine, Mayo Clinic, Scottsdale, Arizona 85259, USA
| | - Weidong Li
- Department of Genetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Hoi Shan Kwan
- School of Life Sciences, The Chinese University of Hong Kong, Hong Kong SAR 999077, China
| | - Pak Chung Sham
- Centre of Genomics Sciences, Departments of Psychiatry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR 999077, China
| | - Kai Wang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA
| | - Mulin Jun Li
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China.,Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China.,Department of Epidemiology and Biostatistics, Tianjin Key Laboratory of Molecular Cancer Epidemiology, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| |
Collapse
|
26
|
Viñuela A, Varshney A, van de Bunt M, Prasad RB, Asplund O, Bennett A, Boehnke M, Brown AA, Erdos MR, Fadista J, Hansson O, Hatem G, Howald C, Iyengar AK, Johnson P, Krus U, MacDonald PE, Mahajan A, Manning Fox JE, Narisu N, Nylander V, Orchard P, Oskolkov N, Panousis NI, Payne A, Stitzel ML, Vadlamudi S, Welch R, Collins FS, Mohlke KL, Gloyn AL, Scott LJ, Dermitzakis ET, Groop L, Parker SCJ, McCarthy MI. Genetic variant effects on gene expression in human pancreatic islets and their implications for T2D. Nat Commun 2020; 11:4912. [PMID: 32999275 PMCID: PMC7528108 DOI: 10.1038/s41467-020-18581-8] [Citation(s) in RCA: 72] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Accepted: 08/12/2020] [Indexed: 02/08/2023] Open
Abstract
Most signals detected by genome-wide association studies map to non-coding sequence and their tissue-specific effects influence transcriptional regulation. However, key tissues and cell-types required for functional inference are absent from large-scale resources. Here we explore the relationship between genetic variants influencing predisposition to type 2 diabetes (T2D) and related glycemic traits, and human pancreatic islet transcription using data from 420 donors. We find: (a) 7741 cis-eQTLs in islets with a replication rate across 44 GTEx tissues between 40% and 73%; (b) marked overlap between islet cis-eQTL signals and active regulatory sequences in islets, with reduced eQTL effect size observed in the stretch enhancers most strongly implicated in GWAS signal location; (c) enrichment of islet cis-eQTL signals with T2D risk variants identified in genome-wide association studies; and (d) colocalization between 47 islet cis-eQTLs and variants influencing T2D or glycemic traits, including DGKB and TCF7L2. Our findings illustrate the advantages of performing functional and regulatory studies in disease relevant tissues.
Collapse
Affiliation(s)
- Ana Viñuela
- grid.8591.50000 0001 2322 4988Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland ,grid.8591.50000 0001 2322 4988Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, 1211 Geneva, Switzerland ,grid.419765.80000 0001 2223 3006Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland ,grid.1006.70000 0001 0462 7212Biosciences Institute, Faculty of Medical Sciences, Newcastle University, NE1 4EP Newcastle, UK
| | - Arushi Varshney
- grid.214458.e0000000086837370Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109 USA
| | - Martijn van de Bunt
- grid.4991.50000 0004 1936 8948Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN UK ,grid.4991.50000 0004 1936 8948Oxford Centre for Diabetes, Endocrinology and Metabolism, Radcliffe Department of Medicine, University of Oxford, Oxford, OX3 7LE UK ,grid.410556.30000 0001 0440 1440Oxford NIHR Biomedical Research Centre, Oxford University Hospitals Trust, Oxford, OX3 7LE UK
| | - Rashmi B. Prasad
- Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden
| | - Olof Asplund
- Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden
| | - Amanda Bennett
- grid.4991.50000 0004 1936 8948Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN UK
| | - Michael Boehnke
- grid.214458.e0000000086837370Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109 USA
| | - Andrew A. Brown
- grid.8591.50000 0001 2322 4988Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland ,grid.8591.50000 0001 2322 4988Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, 1211 Geneva, Switzerland ,grid.419765.80000 0001 2223 3006Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland ,grid.8241.f0000 0004 0397 2876Population Health and Genomics, University of Dundee, Dundee, Scotland, DD1 9SY UK
| | - Michael R. Erdos
- grid.280128.10000 0001 2233 9230Medical Genomics and Metabolic Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892 USA
| | - João Fadista
- Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden ,grid.6203.70000 0004 0417 4147Department of Epidemiology Research, Statens Serum Institut, Copenhagen, DK 2300 Denmark ,grid.7737.40000 0004 0410 2071Finnish Institute for Molecular Medicine (FIMM), University of Helsinki, Helsinki, Finland
| | - Ola Hansson
- Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden ,grid.7737.40000 0004 0410 2071Finnish Institute for Molecular Medicine (FIMM), University of Helsinki, Helsinki, Finland
| | - Gad Hatem
- Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden
| | - Cédric Howald
- grid.8591.50000 0001 2322 4988Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland ,grid.8591.50000 0001 2322 4988Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, 1211 Geneva, Switzerland ,grid.419765.80000 0001 2223 3006Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland
| | - Apoorva K. Iyengar
- grid.410711.20000 0001 1034 1720Department of Genetics, University of North Carolina, Chapel Hill, NC 27599 USA
| | - Paul Johnson
- grid.4991.50000 0004 1936 8948Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN UK
| | - Ulrika Krus
- Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden
| | - Patrick E. MacDonald
- grid.17089.37Department of Pharmacology and Alberta Diabetes Institute, University of Alberta, Edmonton, Alberta Canada
| | - Anubha Mahajan
- grid.4991.50000 0004 1936 8948Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN UK ,grid.418158.10000 0004 0534 4718Present Address: Human Genetics, Genentech, 1 DNA Way, South San Francisco, CA 94080 USA
| | - Jocelyn E. Manning Fox
- grid.17089.37Department of Pharmacology and Alberta Diabetes Institute, University of Alberta, Edmonton, Alberta Canada
| | - Narisu Narisu
- grid.280128.10000 0001 2233 9230Medical Genomics and Metabolic Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892 USA
| | - Vibe Nylander
- grid.4991.50000 0004 1936 8948Oxford Centre for Diabetes, Endocrinology and Metabolism, Radcliffe Department of Medicine, University of Oxford, Oxford, OX3 7LE UK
| | - Peter Orchard
- grid.214458.e0000000086837370Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI 48109 USA
| | - Nikolay Oskolkov
- Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden
| | - Nikolaos I. Panousis
- grid.8591.50000 0001 2322 4988Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland ,grid.8591.50000 0001 2322 4988Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, 1211 Geneva, Switzerland ,grid.419765.80000 0001 2223 3006Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland
| | - Anthony Payne
- grid.4991.50000 0004 1936 8948Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN UK
| | - Michael L. Stitzel
- grid.249880.f0000 0004 0374 0039The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032 USA ,grid.63054.340000 0001 0860 4915Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut, Farmington, CT 06032 USA
| | - Swarooparani Vadlamudi
- grid.410711.20000 0001 1034 1720Department of Genetics, University of North Carolina, Chapel Hill, NC 27599 USA
| | - Ryan Welch
- grid.214458.e0000000086837370Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109 USA
| | - Francis S. Collins
- grid.280128.10000 0001 2233 9230Medical Genomics and Metabolic Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892 USA
| | - Karen L. Mohlke
- grid.410711.20000 0001 1034 1720Department of Genetics, University of North Carolina, Chapel Hill, NC 27599 USA
| | - Anna L. Gloyn
- grid.4991.50000 0004 1936 8948Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN UK ,grid.4991.50000 0004 1936 8948Oxford Centre for Diabetes, Endocrinology and Metabolism, Radcliffe Department of Medicine, University of Oxford, Oxford, OX3 7LE UK ,grid.410556.30000 0001 0440 1440Oxford NIHR Biomedical Research Centre, Oxford University Hospitals Trust, Oxford, OX3 7LE UK ,grid.168010.e0000000419368956Department of Pediatrics, Division of Endocrinology, Stanford School of Medicine, Stanford University, Stanford, CA USA
| | - Laura J. Scott
- grid.214458.e0000000086837370Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109 USA
| | - Emmanouil T. Dermitzakis
- grid.8591.50000 0001 2322 4988Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland ,grid.8591.50000 0001 2322 4988Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, 1211 Geneva, Switzerland ,grid.419765.80000 0001 2223 3006Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland
| | - Leif Groop
- Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden ,grid.7737.40000 0004 0410 2071Finnish Institute for Molecular Medicine (FIMM), University of Helsinki, Helsinki, Finland
| | - Stephen C. J. Parker
- grid.214458.e0000000086837370Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109 USA ,grid.214458.e0000000086837370Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI 48109 USA
| | - Mark I. McCarthy
- grid.4991.50000 0004 1936 8948Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN UK ,grid.4991.50000 0004 1936 8948Oxford Centre for Diabetes, Endocrinology and Metabolism, Radcliffe Department of Medicine, University of Oxford, Oxford, OX3 7LE UK ,grid.410556.30000 0001 0440 1440Oxford NIHR Biomedical Research Centre, Oxford University Hospitals Trust, Oxford, OX3 7LE UK ,grid.418158.10000 0004 0534 4718Present Address: Human Genetics, Genentech, 1 DNA Way, South San Francisco, CA 94080 USA
| |
Collapse
|
27
|
Levitsky V, Zemlyanskaya E, Oshchepkov D, Podkolodnaya O, Ignatieva E, Grosse I, Mironova V, Merkulova T. A single ChIP-seq dataset is sufficient for comprehensive analysis of motifs co-occurrence with MCOT package. Nucleic Acids Res 2020; 47:e139. [PMID: 31750523 PMCID: PMC6868382 DOI: 10.1093/nar/gkz800] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2019] [Revised: 08/12/2019] [Accepted: 09/09/2019] [Indexed: 01/20/2023] Open
Abstract
Recognition of composite elements consisting of two transcription factor binding sites gets behind the studies of tissue-, stage- and condition-specific transcription. Genome-wide data on transcription factor binding generated with ChIP-seq method facilitate an identification of composite elements, but the existing bioinformatics tools either require ChIP-seq datasets for both partner transcription factors, or omit composite elements with motifs overlapping. Here we present an universal Motifs Co-Occurrence Tool (MCOT) that retrieves maximum information about overrepresented composite elements from a single ChIP-seq dataset. This includes homo- and heterotypic composite elements of four mutual orientations of motifs, separated with a spacer or overlapping, even if recognition of motifs within composite element requires various stringencies. Analysis of 52 ChIP-seq datasets for 18 human transcription factors confirmed that for over 60% of analyzed datasets and transcription factors predicted co-occurrence of motifs implied experimentally proven protein-protein interaction of respecting transcription factors. Analysis of 164 ChIP-seq datasets for 57 mammalian transcription factors showed that abundance of predicted composite elements with an overlap of motifs compared to those with a spacer more than doubled; and they had 1.5-fold increase of asymmetrical pairs of motifs with one more conservative 'leading' motif and another one 'guided'.
Collapse
Affiliation(s)
- Victor Levitsky
- Department of Systems Biology, Institute of Cytology and Genetics, Novosibirsk 630090, Russia.,Department of Natural Science, Novosibirsk State University, Novosibirsk 630090, Russia
| | - Elena Zemlyanskaya
- Department of Systems Biology, Institute of Cytology and Genetics, Novosibirsk 630090, Russia.,Department of Natural Science, Novosibirsk State University, Novosibirsk 630090, Russia
| | - Dmitry Oshchepkov
- Department of Systems Biology, Institute of Cytology and Genetics, Novosibirsk 630090, Russia
| | - Olga Podkolodnaya
- Department of Systems Biology, Institute of Cytology and Genetics, Novosibirsk 630090, Russia
| | - Elena Ignatieva
- Department of Systems Biology, Institute of Cytology and Genetics, Novosibirsk 630090, Russia.,Department of Natural Science, Novosibirsk State University, Novosibirsk 630090, Russia
| | - Ivo Grosse
- Department of Natural Science, Novosibirsk State University, Novosibirsk 630090, Russia.,Institute of Computer Science, Martin Luther University Halle-Wittenberg, Halle (Saale), Germany.,German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Leipzig, Germany
| | - Victoria Mironova
- Department of Systems Biology, Institute of Cytology and Genetics, Novosibirsk 630090, Russia.,Department of Natural Science, Novosibirsk State University, Novosibirsk 630090, Russia
| | - Tatyana Merkulova
- Department of Natural Science, Novosibirsk State University, Novosibirsk 630090, Russia.,Department of Molecular Genetics, Institute of Cytology and Genetics, Novosibirsk 630090, Russia
| |
Collapse
|
28
|
George G, Gan S, Huang Y, Appleby P, Nar AS, Venkatesan R, Mohan V, Palmer CNA, Doney ASF. PheGWAS: a new dimension to visualize GWAS across multiple phenotypes. Bioinformatics 2020; 36:2500-2505. [PMID: 31860083 PMCID: PMC7178436 DOI: 10.1093/bioinformatics/btz944] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 11/23/2019] [Accepted: 12/18/2019] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION PheGWAS was developed to enhance exploration of phenome-wide pleiotropy at the genome-wide level through the efficient generation of a dynamic visualization combining Manhattan plots from GWAS with PheWAS to create a 3D 'landscape'. Pleiotropy in sub-surface GWAS significance strata can be explored in a sectional view plotted within user defined levels. Further complexity reduction is achieved by confining to a single chromosomal section. Comprehensive genomic and phenomic coordinates can be displayed. RESULTS PheGWAS is demonstrated using summary data from Global Lipids Genetics Consortium GWAS across multiple lipid traits. For single and multiple traits PheGWAS highlighted all 88 and 69 loci, respectively. Further, the genes and SNPs reported in Global Lipids Genetics Consortium were identified using additional functions implemented within PheGWAS. Not only is PheGWAS capable of identifying independent signals but also provides insights to local genetic correlation (verified using HESS) and in identifying the potential regions that share causal variants across phenotypes (verified using colocalization tests). AVAILABILITY AND IMPLEMENTATION The PheGWAS software and code are freely available at (https://github.com/georgeg0/PheGWAS). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Gittu George
- NIHR Global Health Research Unit on Global Diabetes Outcomes Research, Division of Population Health and Genomics, University of Dundee, Ninewells Hospital and Medical School, Dundee, UK
| | - Sushrima Gan
- NIHR Global Health Research Unit on Global Diabetes Outcomes Research, Division of Population Health and Genomics, University of Dundee, Ninewells Hospital and Medical School, Dundee, UK
| | - Yu Huang
- NIHR Global Health Research Unit on Global Diabetes Outcomes Research, Division of Population Health and Genomics, University of Dundee, Ninewells Hospital and Medical School, Dundee, UK
| | - Philip Appleby
- NIHR Global Health Research Unit on Global Diabetes Outcomes Research, Division of Population Health and Genomics, University of Dundee, Ninewells Hospital and Medical School, Dundee, UK
| | - A S Nar
- NIHR Global Health Research Unit on Global Diabetes Outcomes Research, Division of Population Health and Genomics, University of Dundee, Ninewells Hospital and Medical School, Dundee, UK
| | - Radha Venkatesan
- NIHR Global Health Research Unit on Global Diabetes Outcomes Research, Division of Population Health and Genomics, University of Dundee, Ninewells Hospital and Medical School, Dundee, UK
| | - Viswanathan Mohan
- NIHR Global Health Research Unit on Global Diabetes Outcomes Research, Division of Population Health and Genomics, University of Dundee, Ninewells Hospital and Medical School, Dundee, UK
| | - Colin N A Palmer
- NIHR Global Health Research Unit on Global Diabetes Outcomes Research, Division of Population Health and Genomics, University of Dundee, Ninewells Hospital and Medical School, Dundee, UK
| | - Alex S F Doney
- NIHR Global Health Research Unit on Global Diabetes Outcomes Research, Division of Population Health and Genomics, University of Dundee, Ninewells Hospital and Medical School, Dundee, UK
| |
Collapse
|
29
|
Nicholls HL, John CR, Watson DS, Munroe PB, Barnes MR, Cabrera CP. Reaching the End-Game for GWAS: Machine Learning Approaches for the Prioritization of Complex Disease Loci. Front Genet 2020; 11:350. [PMID: 32351543 PMCID: PMC7174742 DOI: 10.3389/fgene.2020.00350] [Citation(s) in RCA: 71] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 03/23/2020] [Indexed: 12/21/2022] Open
Abstract
Genome-wide association studies (GWAS) have revealed thousands of genetic loci that underpin the complex biology of many human traits. However, the strength of GWAS - the ability to detect genetic association by linkage disequilibrium (LD) - is also its limitation. Whilst the ever-increasing study size and improved design have augmented the power of GWAS to detect effects, differentiation of causal variants or genes from other highly correlated genes associated by LD remains the real challenge. This has severely hindered the biological insights and clinical translation of GWAS findings. Although thousands of disease susceptibility loci have been reported, causal genes at these loci remain elusive. Machine learning (ML) techniques offer an opportunity to dissect the heterogeneity of variant and gene signals in the post-GWAS analysis phase. ML models for GWAS prioritization vary greatly in their complexity, ranging from relatively simple logistic regression approaches to more complex ensemble models such as random forests and gradient boosting, as well as deep learning models, i.e., neural networks. Paired with functional validation, these methods show important promise for clinical translation, providing a strong evidence-based approach to direct post-GWAS research. However, as ML approaches continue to evolve to meet the challenge of causal gene identification, a critical assessment of the underlying methodologies and their applicability to the GWAS prioritization problem is needed. This review investigates the landscape of ML applications in three parts: selected models, input features, and output model performance, with a focus on prioritizations of complex disease associated loci. Overall, we explore the contributions ML has made towards reaching the GWAS end-game with consequent wide-ranging translational impact.
Collapse
Affiliation(s)
- Hannah L. Nicholls
- Clinical Pharmacology, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
- Centre for Translational Bioinformatics, William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
| | - Christopher R. John
- Centre for Translational Bioinformatics, William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
- Centre for Experimental Medicine and Rheumatology, William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
| | - David S. Watson
- Centre for Translational Bioinformatics, William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
- Oxford Internet Institute, University of Oxford, Oxford, United Kingdom
| | - Patricia B. Munroe
- Clinical Pharmacology, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
- NIHR Barts Biomedical Research Centre, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
| | - Michael R. Barnes
- Clinical Pharmacology, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
- Centre for Translational Bioinformatics, William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
- NIHR Barts Biomedical Research Centre, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
- The Alan Turing Institute, British Library, London, United Kingdom
| | - Claudia P. Cabrera
- Clinical Pharmacology, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
- Centre for Translational Bioinformatics, William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
- NIHR Barts Biomedical Research Centre, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
| |
Collapse
|
30
|
Xu D, Wang C, Kiryluk K, Buxbaum JD, Ionita-Laza I. Co-localization between Sequence Constraint and Epigenomic Information Improves Interpretation of Whole-Genome Sequencing Data. Am J Hum Genet 2020; 106:513-524. [PMID: 32243819 PMCID: PMC7118583 DOI: 10.1016/j.ajhg.2020.03.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2019] [Accepted: 03/09/2020] [Indexed: 10/24/2022] Open
Abstract
The identification of functional regions in the noncoding human genome is difficult but critical in order to gain understanding of the role noncoding variation plays in gene regulation in human health and disease. We describe here a co-localization approach that aims to identify constrained sequences that co-localize with tissue- or cell-type-specific regulatory regions, and we show that the resulting score is particularly well suited for the identification of rare regulatory variants. For 127 tissues and cell types in the ENCODE/Roadmap Epigenomics Project, we provide catalogs of putative tissue- or cell-type-specific regulatory regions under sequence constraint. We use the newly developed co-localization score for brain tissues to score de novo mutations in whole genomes from 1,902 individuals affected with autism spectrum disorder (ASD) and their unaffected siblings in the Simons Simplex Collection. We show that noncoding de novo mutations near genes co-expressed in midfetal brain with high confidence ASD risk genes, and near FMRP gene targets are more likely to be in co-localized regions if they occur in ASD probands versus in their unaffected siblings. We also observed a similar enrichment for mutations near lincRNAs, previously shown to co-express with ASD risk genes. Additionally, we provide strong evidence that prioritized de novo mutations in autism probands point to a small set of well-known ASD genes, the disruption of which produces relevant mouse phenotypes such as abnormal social investigation and abnormal discrimination/associative learning, unlike the de novo mutations in unaffected siblings. The genome-wide co-localization results are available online.
Collapse
Affiliation(s)
- Danqing Xu
- Department of Biostatistics, Columbia University, New York, NY 10032, USA
| | - Chen Wang
- Department of Biostatistics, Columbia University, New York, NY 10032, USA; Department of Medicine, Columbia University, New York, NY 10032, USA
| | - Krzysztof Kiryluk
- Department of Medicine, Columbia University, New York, NY 10032, USA
| | - Joseph D Buxbaum
- Departments of Psychiatry, Neuroscience, and Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Friedman Brain Institute and Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | | |
Collapse
|
31
|
Ronzio M, Zambelli F, Dolfini D, Mantovani R, Pavesi G. Integrating Peak Colocalization and Motif Enrichment Analysis for the Discovery of Genome-Wide Regulatory Modules and Transcription Factor Recruitment Rules. Front Genet 2020; 11:72. [PMID: 32153638 PMCID: PMC7046753 DOI: 10.3389/fgene.2020.00072] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2019] [Accepted: 01/22/2020] [Indexed: 12/14/2022] Open
Abstract
Chromatin immunoprecipitation followed by next-generation sequencing (ChIP-Seq) has opened new avenues of research in the genome-wide characterization of regulatory DNA-protein interactions at the genetic and epigenetic level. As a consequence, it has become the de facto standard for studies on the regulation of transcription, and literally thousands of data sets for transcription factors and cofactors in different conditions and species are now available to the scientific community. However, while pipelines and best practices have been established for the analysis of a single experiment, there is still no consensus on the best way to perform an integrated analysis of multiple datasets in the same condition, in order to identify the most relevant and widespread regulatory modules composed by different transcription factors and cofactors. We present here a computational pipeline for this task, that integrates peak summit colocalization, a novel statistical framework for the evaluation of its significance, and motif enrichment analysis. We show examples of its application to ENCODE data, that led to the identification of relevant regulatory modules composed of different factors, as well as the organization on DNA of the binding motifs responsible for their recruitment.
Collapse
Affiliation(s)
- Mirko Ronzio
- Dipartimento di Bioscienze, Università di Milano, Milan, Italy
| | | | - Diletta Dolfini
- Dipartimento di Bioscienze, Università di Milano, Milan, Italy
| | | | - Giulio Pavesi
- Dipartimento di Bioscienze, Università di Milano, Milan, Italy
| |
Collapse
|
32
|
Zhou Y, Sun Y, Huang D, Li MJ. epiCOLOC: Integrating Large-Scale and Context-Dependent Epigenomics Features for Comprehensive Colocalization Analysis. Front Genet 2020; 11:53. [PMID: 32117461 PMCID: PMC7029718 DOI: 10.3389/fgene.2020.00053] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2019] [Accepted: 01/17/2020] [Indexed: 12/18/2022] Open
Abstract
High-throughput genome-wide epigenomic assays, such as ChIP-seq, DNase-seq and ATAC-seq, have profiled a huge number of functional elements across numerous human tissues/cell types, which provide an unprecedented opportunity to interpret human genome and disease in context-dependent manner. Colocalization analysis determines whether genomic features are functionally related to a given search and will facilitate identifying the underlying biological functions characterizing intricate relationships with queries for genomic regions. Existing colocalization methods leveraged diverse assumptions and background models to assess the significance of enrichment, however, they only provided limited and predefined sets of epigenomic features. Here, we comprehensively collected and integrated over 44,385 bulk or single-cell epigenomic assays across 53 human tissues/cell types, such as transcription factor binding, histone modification, open chromatin and transcriptional event. By classifying these profiles into hierarchy of tissue/cell type, we developed a web portal, epiCOLOC (http://mulinlab.org/epicoloc or http://mulinlab.tmu.edu.cn/epicoloc), for users to perform context-dependent colocalization analysis in a convenient way.
Collapse
Affiliation(s)
- Yao Zhou
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| | - Yongzheng Sun
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| | - Dandan Huang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| | - Mulin Jun Li
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China.,Collaborative Innovation Center of Tianjin for Medical Epigenetics, Tianjin Key Laboratory of Medical Epigenetics, Tianjin Medical University, Tianjin, China
| |
Collapse
|
33
|
Richardson TG, Hemani G, Gaunt TR, Relton CL, Davey Smith G. A transcriptome-wide Mendelian randomization study to uncover tissue-dependent regulatory mechanisms across the human phenome. Nat Commun 2020; 11:185. [PMID: 31924771 PMCID: PMC6954187 DOI: 10.1038/s41467-019-13921-9] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Accepted: 11/26/2019] [Indexed: 11/09/2022] Open
Abstract
Developing insight into tissue-specific transcriptional mechanisms can help improve our understanding of how genetic variants exert their effects on complex traits and disease. In this study, we apply the principles of Mendelian randomization to systematically evaluate transcriptome-wide associations between gene expression (across 48 different tissue types) and 395 complex traits. Our findings indicate that variants which influence gene expression levels in multiple tissues are more likely to influence multiple complex traits. Moreover, detailed investigations of our results highlight tissue-specific associations, drug validation opportunities, insight into the likely causal pathways for trait-associated variants and also implicate putative associations at loci yet to be implicated in disease susceptibility. Similar evaluations can be conducted at http://mrcieu.mrsoftware.org/Tissue_MR_atlas/.
Collapse
Affiliation(s)
- Tom G Richardson
- MRC Integrative Epidemiology Unit (IEU), Population Health Sciences, Bristol Medical School, University of Bristol, Oakfield House, Oakfield Grove, Bristol, BS8 2BN, UK.
| | - Gibran Hemani
- MRC Integrative Epidemiology Unit (IEU), Population Health Sciences, Bristol Medical School, University of Bristol, Oakfield House, Oakfield Grove, Bristol, BS8 2BN, UK
| | - Tom R Gaunt
- MRC Integrative Epidemiology Unit (IEU), Population Health Sciences, Bristol Medical School, University of Bristol, Oakfield House, Oakfield Grove, Bristol, BS8 2BN, UK
| | - Caroline L Relton
- MRC Integrative Epidemiology Unit (IEU), Population Health Sciences, Bristol Medical School, University of Bristol, Oakfield House, Oakfield Grove, Bristol, BS8 2BN, UK
| | - George Davey Smith
- MRC Integrative Epidemiology Unit (IEU), Population Health Sciences, Bristol Medical School, University of Bristol, Oakfield House, Oakfield Grove, Bristol, BS8 2BN, UK
| |
Collapse
|
34
|
Ben-Elazar S, Chor B, Yakhini Z. The Functional 3D Organization of Unicellular Genomes. Sci Rep 2019; 9:12734. [PMID: 31484964 PMCID: PMC6726614 DOI: 10.1038/s41598-019-48798-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Accepted: 08/12/2019] [Indexed: 11/09/2022] Open
Abstract
Genome conformation capture techniques permit a systematic investigation into the functional spatial organization of genomes, including functional aspects like assessing the co-localization of sets of genomic elements. For example, the co-localization of genes targeted by a transcription factor (TF) within a transcription factory. We quantify spatial co-localization using a rigorous statistical model that measures the enrichment of a subset of elements in neighbourhoods inferred from Hi-C data. We also control for co-localization that can be attributed to genomic order. We systematically apply our open-sourced framework, spatial-mHG, to search for spatial co-localization phenomena in multiple unicellular Hi-C datasets with corresponding genomic annotations. Our biological findings shed new light on the functional spatial organization of genomes, including: In C. crescentus, DNA replication genes reside in two genomic clusters that are spatially co-localized. Furthermore, these clusters contain similar gene copies and lay in genomic vicinity to the ori and ter sequences. In S. cerevisae, Ty5 retrotransposon family element spatially co-localize at a spatially adjacent subset of telomeres. In N. crassa, both Proteasome lid subcomplex genes and protein refolding genes jointly spatially co-localize at a shared location. An implementation of our algorithms is available online.
Collapse
|
35
|
Bomsztyk K, Wang Y. Genome-bound enzymes as epigenetic drug targets in cancer. Epigenomics 2019; 11:1463-1467. [PMID: 31536377 DOI: 10.2217/epi-2019-0197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Affiliation(s)
- Karol Bomsztyk
- UW Medicine South Lake Union, University of Washington, Seattle, WA 98109, USA.,Institute for Stem Cell & Regenerative Medicine, University of Washington, Seattle, WA 98109, USA
| | - Yuliang Wang
- Institute for Stem Cell & Regenerative Medicine, University of Washington, Seattle, WA 98109, USA.,Paul G Allen School of Computer Science & Engineering, University of Washington, WA 98195, USA
| |
Collapse
|