1
|
Xie H, Cao X, Zhang S, Sha Q. Joint analysis of multiple phenotypes for extremely unbalanced case-control association studies using multi-layer network. Bioinformatics 2023; 39:btad707. [PMID: 37991852 PMCID: PMC10697735 DOI: 10.1093/bioinformatics/btad707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2023] [Revised: 09/29/2023] [Accepted: 11/21/2023] [Indexed: 11/24/2023] Open
Abstract
MOTIVATION Genome-wide association studies is an essential tool for analyzing associations between phenotypes and single nucleotide polymorphisms (SNPs). Most of binary phenotypes in large biobanks are extremely unbalanced, which leads to inflated type I error rates for many widely used association tests for joint analysis of multiple phenotypes. In this article, we first propose a novel method to construct a Multi-Layer Network (MLN) using individuals with at least one case status among all phenotypes. Then, we introduce a computationally efficient community detection method to group phenotypes into disjoint clusters based on the MLN. Finally, we propose a novel approach, MLN with Omnibus (MLN-O), to jointly analyse the association between phenotypes and a SNP. MLN-O uses the score test to test the association of each merged phenotype in a cluster and a SNP, then uses the Omnibus test to obtain an overall test statistic to test the association between all phenotypes and a SNP. RESULTS We conduct extensive simulation studies to reveal that the proposed approach can control type I error rates and is more powerful than some existing methods. Meanwhile, we apply the proposed method to a real data set in the UK Biobank. Using phenotypes in Chapter XIII (Diseases of the musculoskeletal system and connective tissue) in the UK Biobank, we find that MLN-O identifies more significant SNPs than other methods we compare with. AVAILABILITY AND IMPLEMENTATION https://github.com/Hongjing-Xie/Multi-Layer-Network-with-Omnibus-MLN-O.
Collapse
Affiliation(s)
- Hongjing Xie
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI 49931, United States
| | - Xuewei Cao
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI 49931, United States
| | - Shuanglin Zhang
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI 49931, United States
| | - Qiuying Sha
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI 49931, United States
| |
Collapse
|
2
|
Pace NP, Mintoff D, Borg I. The Genomic Architecture of Hidradenitis Suppurativa-A Systematic Review. Front Genet 2022; 13:861241. [PMID: 35401657 PMCID: PMC8986338 DOI: 10.3389/fgene.2022.861241] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 03/07/2022] [Indexed: 12/11/2022] Open
Abstract
Hidradenitis suppurativa is a chronic, suppurative condition of the pilosebaceous unit manifesting as painful nodules, abscesses, and sinus tracts mostly in, but not limited to, intertriginous skin. Great strides have been made at elucidating the pathophysiology of hidradenitis suppurativa, which appears to be the product of hyperkeratinization and inflammation brought about by environmental factors and a genetic predisposition. The identification of familial hidradenitis suppurativa has sparked research aimed at identifying underlying pathogenic variants in patients who harbor them. The objective of this review is to provide a broad overview of the role of genetics in various aspects of hidradenitis suppurativa, specifically the pathophysiology, diagnosis, and clinical application.
Collapse
Affiliation(s)
- Nikolai Paul Pace
- Center for Molecular Medicine and Biobanking, University of Malta, Msida, Malta
| | - Dillon Mintoff
- Department of Pathology, Faculty of Medicine and Surgery, University of Malta, Msida, Malta
- Department of Dermatology, Mater Dei Hospital, Msida, Malta
| | - Isabella Borg
- Center for Molecular Medicine and Biobanking, University of Malta, Msida, Malta
- Department of Pathology, Faculty of Medicine and Surgery, University of Malta, Msida, Malta
- Department of Pathology, Mater Dei Hospital, Msida, Malta
| |
Collapse
|
3
|
Krieger MS, Moreau JM, Zhang H, Chien M, Zehnder JL, Craig M. A Blueprint for Identifying Phenotypes and Drug Targets in Complex Disorders with Empirical Dynamics. PATTERNS (NEW YORK, N.Y.) 2020; 1:100138. [PMID: 33336196 PMCID: PMC7733879 DOI: 10.1016/j.patter.2020.100138] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 09/25/2020] [Accepted: 10/12/2020] [Indexed: 02/06/2023]
Abstract
A central challenge in medicine is translating from observational understanding to mechanistic understanding, where some observations are recognized as causes for the others. This can lead not only to new treatments and understanding, but also to recognition of novel phenotypes. Here, we apply a collection of mathematical techniques (empirical dynamics), which infer mechanistic networks in a model-free manner from longitudinal data, to hematopoiesis. Our study consists of three subjects with markers for cyclic thrombocytopenia, in which multiple cells and proteins undergo abnormal oscillations. One subject has atypical markers and may represent a rare phenotype. Our analyses support this contention, and also lend new evidence to a theory for the cause of this disorder. Simulations of an intervention yield encouraging results, even when applied to patient data outside our three subjects. These successes suggest that this blueprint has broader applicability in understanding and treating complex disorders.
Collapse
Affiliation(s)
- Madison S. Krieger
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Joshua M. Moreau
- Department of Dermatology, University of California, San Francisco, CA, USA
| | - Haiyu Zhang
- Department of Pathology, Stanford School of Medicine, Stanford, CA, USA
| | - May Chien
- Department of Medicine (Hematology), Stanford, CA, USA
| | - James L. Zehnder
- Department of Pathology, Stanford School of Medicine, Stanford, CA, USA
- Department of Medicine (Hematology), Stanford, CA, USA
| | - Morgan Craig
- Département de Mathématiques et de Statistique, Université de Montréal, Montréal, QC, Canada
- CHU Sainte-Justine Research Centre, 3175 Chemin de la Côte-Sainte-Catherine, Montréal, QC H3T 1C5, Canada
| |
Collapse
|
4
|
Arpón A, Milagro FI, Ramos-Lopez O, Mansego ML, Riezu-Boj JI, Martínez JA. Methylome-Wide Association Study in Peripheral White Blood Cells Focusing on Central Obesity and Inflammation. Genes (Basel) 2019; 10:E444. [PMID: 31212707 PMCID: PMC6627499 DOI: 10.3390/genes10060444] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2019] [Revised: 06/03/2019] [Accepted: 06/07/2019] [Indexed: 12/13/2022] Open
Abstract
Epigenetic signatures such as DNA methylation may be associated with specific obesity traits in different tissues. The onset and development of some obesity-related complications are often linked to visceral fat accumulation. The aim of this study was to explore DNA methylation levels in peripheral white blood cells to identify epigenetic methylation marks associated with waist circumference (WC). DNA methylation levels were assessed using Infinium Human Methylation 450K and MethylationEPIC beadchip (Illumina) to search for putative associations with WC values of 473 participants from the Methyl Epigenome Network Association (MENA) project. Statistical analysis and Ingenuity Pathway Analysis (IPA) were employed for assessing the relationship between methylation and WC. A total of 669 CpGs were statistically associated with WC (FDR < 0.05, slope ≥ |0.1|). From these CpGs, 375 CpGs evidenced a differential methylation pattern between females with WC ≤ 88 and > 88 cm, and 95 CpGs between males with WC ≤ 102 and > 102 cm. These differentially methylated CpGs are located in genes related to inflammation and obesity according to IPA. Receiver operating characteristic (ROC) curves of the top four significant differentially methylated CpGs separated by sex discriminated individuals with presence or absence of abdominal fat. ROC curves of all the CpGs from females and one CpG from males were validated in an independent sample (n = 161). These methylation results add further insights about the relationships between obesity, adiposity-associated comorbidities, and DNA methylation where inflammation processes may be involved.
Collapse
Affiliation(s)
- Ana Arpón
- Department of Nutrition, Food Sciences and Physiology, University of Navarra, Irunlarrea 1,31008 Pamplona, Spain.
- Centre for Nutrition Research, University of Navarra, Irunlarrea 1, 31008, Pamplona, Spain.
| | - Fermín I Milagro
- Department of Nutrition, Food Sciences and Physiology, University of Navarra, Irunlarrea 1,31008 Pamplona, Spain.
- Centre for Nutrition Research, University of Navarra, Irunlarrea 1, 31008, Pamplona, Spain.
- Centro de Investigación Biomédica en Red Fisiopatología de la Obesidad y Nutrición (CIBERobn), Instituto de Salud Carlos III, 28029, Madrid, Spain.
- Navarra Institute for Health Research (IdiSNa), 31008, Pamplona, Spain.
| | - Omar Ramos-Lopez
- Department of Nutrition, Food Sciences and Physiology, University of Navarra, Irunlarrea 1,31008 Pamplona, Spain.
- Centre for Nutrition Research, University of Navarra, Irunlarrea 1, 31008, Pamplona, Spain.
| | - Maria L Mansego
- Department of Bioinformatics, Making Genetics S.L., 31002, Pamplona, Spain.
| | - José-Ignacio Riezu-Boj
- Department of Nutrition, Food Sciences and Physiology, University of Navarra, Irunlarrea 1,31008 Pamplona, Spain.
- Centre for Nutrition Research, University of Navarra, Irunlarrea 1, 31008, Pamplona, Spain.
- Navarra Institute for Health Research (IdiSNa), 31008, Pamplona, Spain.
| | - J Alfredo Martínez
- Department of Nutrition, Food Sciences and Physiology, University of Navarra, Irunlarrea 1,31008 Pamplona, Spain.
- Centre for Nutrition Research, University of Navarra, Irunlarrea 1, 31008, Pamplona, Spain.
- Centro de Investigación Biomédica en Red Fisiopatología de la Obesidad y Nutrición (CIBERobn), Instituto de Salud Carlos III, 28029, Madrid, Spain.
- Navarra Institute for Health Research (IdiSNa), 31008, Pamplona, Spain.
- Precision Nutrition and Cardiometabolic Health Program, Madrid Institute for Advanced Studies (IMDEA), IMDEA Food, 28049, Madrid, Spain.
| |
Collapse
|
5
|
Archer R, Hock E, Hamilton J, Stevens J, Essat M, Poku E, Clowes M, Pandor A, Stevenson M. Assessing prognosis and prediction of treatment response in early rheumatoid arthritis: systematic reviews. Health Technol Assess 2019; 22:1-294. [PMID: 30501821 DOI: 10.3310/hta22660] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Rheumatoid arthritis (RA) is a chronic, debilitating disease associated with reduced quality of life and substantial costs. It is unclear which tests and assessment tools allow the best assessment of prognosis in people with early RA and whether or not variables predict the response of patients to different drug treatments. OBJECTIVE To systematically review evidence on the use of selected tests and assessment tools in patients with early RA (1) in the evaluation of a prognosis (review 1) and (2) as predictive markers of treatment response (review 2). DATA SOURCES Electronic databases (e.g. MEDLINE, EMBASE, The Cochrane Library, Web of Science Conference Proceedings; searched to September 2016), registers, key websites, hand-searching of reference lists of included studies and key systematic reviews and contact with experts. STUDY SELECTION Review 1 - primary studies on the development, external validation and impact of clinical prediction models for selected outcomes in adult early RA patients. Review 2 - primary studies on the interaction between selected baseline covariates and treatment (conventional and biological disease-modifying antirheumatic drugs) on salient outcomes in adult early RA patients. RESULTS Review 1 - 22 model development studies and one combined model development/external validation study reporting 39 clinical prediction models were included. Five external validation studies evaluating eight clinical prediction models for radiographic joint damage were also included. c-statistics from internal validation ranged from 0.63 to 0.87 for radiographic progression (different definitions, six studies) and 0.78 to 0.82 for the Health Assessment Questionnaire (HAQ). Predictive performance in external validations varied considerably. Three models [(1) Active controlled Study of Patients receiving Infliximab for the treatment of Rheumatoid arthritis of Early onset (ASPIRE) C-reactive protein (ASPIRE CRP), (2) ASPIRE erythrocyte sedimentation rate (ASPIRE ESR) and (3) Behandelings Strategie (BeSt)] were externally validated using the same outcome definition in more than one population. Results of the random-effects meta-analysis suggested substantial uncertainty in the expected predictive performance of models in a new sample of patients. Review 2 - 12 studies were identified. Covariates examined included anti-citrullinated protein/peptide anti-body (ACPA) status, smoking status, erosions, rheumatoid factor status, C-reactive protein level, erythrocyte sedimentation rate, swollen joint count (SJC), body mass index and vascularity of synovium on power Doppler ultrasound (PDUS). Outcomes examined included erosions/radiographic progression, disease activity, physical function and Disease Activity Score-28 remission. There was statistical evidence to suggest that ACPA status, SJC and PDUS status at baseline may be treatment effect modifiers, but not necessarily that they are prognostic of response for all treatments. Most of the results were subject to considerable uncertainty and were not statistically significant. LIMITATIONS The meta-analysis in review 1 was limited by the availability of only a small number of external validation studies. Studies rarely investigated the interaction between predictors and treatment. SUGGESTED RESEARCH PRIORITIES Collaborative research (including the use of individual participant data) is needed to further develop and externally validate the clinical prediction models. The clinical prediction models should be validated with respect to individual treatments. Future assessments of treatment by covariate interactions should follow good statistical practice. CONCLUSIONS Review 1 - uncertainty remains over the optimal prediction model(s) for use in clinical practice. Review 2 - in general, there was insufficient evidence that the effect of treatment depended on baseline characteristics. STUDY REGISTRATION This study is registered as PROSPERO CRD42016042402. FUNDING The National Institute for Health Research Health Technology Assessment programme.
Collapse
Affiliation(s)
- Rachel Archer
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Emma Hock
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Jean Hamilton
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - John Stevens
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Munira Essat
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Edith Poku
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Mark Clowes
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Abdullah Pandor
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Matt Stevenson
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| |
Collapse
|
6
|
Gordon D, Londono D, Patel P, Kim W, Finch SJ, Heiman GA. An Analytic Solution to the Computation of Power and Sample Size for Genetic Association Studies under a Pleiotropic Mode of Inheritance. Hum Hered 2017; 81:194-209. [PMID: 28315880 DOI: 10.1159/000457135] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Accepted: 01/20/2017] [Indexed: 01/14/2023] Open
Abstract
Our motivation here is to calculate the power of 3 statistical tests used when there are genetic traits that operate under a pleiotropic mode of inheritance and when qualitative phenotypes are defined by use of thresholds for the multiple quantitative phenotypes. Specifically, we formulate a multivariate function that provides the probability that an individual has a vector of specific quantitative trait values conditional on having a risk locus genotype, and we apply thresholds to define qualitative phenotypes (affected, unaffected) and compute penetrances and conditional genotype frequencies based on the multivariate function. We extend the analytic power and minimum-sample-size-necessary (MSSN) formulas for 2 categorical data-based tests (genotype, linear trend test [LTT]) of genetic association to the pleiotropic model. We further compare the MSSN of the genotype test and the LTT with that of a multivariate ANOVA (Pillai). We approximate the MSSN for statistics by linear models using a factorial design and ANOVA. With ANOVA decomposition, we determine which factors most significantly change the power/MSSN for all statistics. Finally, we determine which test statistics have the smallest MSSN. In this work, MSSN calculations are for 2 traits (bivariate distributions) only (for illustrative purposes). We note that the calculations may be extended to address any number of traits. Our key findings are that the genotype test usually has lower MSSN requirements than the LTT. More inclusive thresholds (top/bottom 25% vs. top/bottom 10%) have higher sample size requirements. The Pillai test has a much larger MSSN than both the genotype test and the LTT, as a result of sample selection. With these formulas, researchers can specify how many subjects they must collect to localize genes for pleiotropic phenotypes.
Collapse
Affiliation(s)
- Derek Gordon
- Department of Genetics, The State University of New Jersey, Piscataway, NJ, USA
| | | | | | | | | | | |
Collapse
|
7
|
Schillert A, Konigorski S. Joint analysis of multiple phenotypes: summary of results and discussions from the Genetic Analysis Workshop 19. BMC Genet 2016; 17 Suppl 2:7. [PMID: 26866608 PMCID: PMC4895558 DOI: 10.1186/s12863-015-0317-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
For Genetic Analysis Workshop 19, 2 extensive data sets were provided, including whole genome and whole exome sequence data, gene expression data, and longitudinal blood pressure outcomes, together with nongenetic covariates. These data sets gave researchers the chance to investigate different aspects of more complex relationships within the data, and the contributions in our working group focused on statistical methods for the joint analysis of multiple phenotypes, which is part of the research field of data integration. The analysis of data from different sources poses challenges to researchers but provides the opportunity to model the real-life situation more realistically.Our 4 contributions all used the provided real data to identify genetic predictors for blood pressure. In the contributions, novel multivariate rare variant tests, copula models, structural equation models and a sparse matrix representation variable selection approach were applied. Each of these statistical models can be used to investigate specific hypothesized relationships, which are described together with their biological assumptions.The results showed that all methods are ready for application on a genome-wide scale and can be used or extended to include multiple omics data sets. The results provide potentially interesting genetic targets for future investigation and replication. Furthermore, all contributions demonstrated that the analysis of complex data sets could benefit from modeling correlated phenotypes jointly as well as by adding further bioinformatics information.
Collapse
Affiliation(s)
- Arne Schillert
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Ratzeburger Allee 160, 23562, Lübeck, Germany.
| | - Stefan Konigorski
- Molecular Epidemiology Research Group, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Robert-Rössle-Straße 10, 13125, Berlin-Buch, Germany.
| |
Collapse
|
8
|
Feng Q, Vickers KC, Anderson MP, Levin MG, Chen W, Harrison DG, Wilke RA. A common functional promoter variant links CNR1 gene expression to HDL cholesterol level. Nat Commun 2013; 4:1973. [PMID: 23748922 PMCID: PMC3873874 DOI: 10.1038/ncomms2973] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2012] [Accepted: 05/08/2013] [Indexed: 12/17/2022] Open
Abstract
CB1 receptor blockers increase HDL-C levels. Although genetic variation in the CB1 receptor – encoded by the CNR1 gene – is known to influence HDL-C level as well, human studies conducted to date have been limited to genetic markers such as haplotype tagging SNPs. Here we identify rs806371 in the CNR1 promoter as the causal variant. We resequenced the CNR1 gene and genotype all variants in a DNA biobank linked to comprehensive electronic medical records. By testing each variant for association with HDL-C level in a clinical practice-based setting, we localize a putative functional allele to a 100bp window in the 5′-flanking region. Assessment of variants in this window for functional impact on electrophoretic mobility shift assay identified rs806371 as a novel regulatory binding element. Reporter gene assays confirm that rs806371 reduces HDL-C gene expression, thereby linking CNR1 gene variation to HDL-C level in humans.
Collapse
Affiliation(s)
- Q Feng
- Department of Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA.
| | | | | | | | | | | | | |
Collapse
|
9
|
Imboden M, Probst-Hensch NM. Biobanking across the phenome - at the center of chronic disease research. BMC Public Health 2013; 13:1094. [PMID: 24274136 PMCID: PMC4222669 DOI: 10.1186/1471-2458-13-1094] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2012] [Accepted: 09/25/2013] [Indexed: 11/10/2022] Open
Abstract
Background Recognized public health relevant risk factors such as obesity, physical inactivity, smoking or air pollution are common to many non-communicable diseases (NCDs). NCDs cluster and co-morbidities increase in parallel to age. Pleiotropic genes and genetic variants have been identified by genome-wide association studies (GWAS) linking NCD entities hitherto thought to be distant in etiology. These different lines of evidence suggest that NCD disease mechanisms are in part shared. Discussion Identification of common exogenous and endogenous risk patterns may promote efficient prevention, an urgent need in the light of the global NCD epidemic. The prerequisite to investigate causal risk patterns including biologic, genetic and environmental factors across different NCDs are well characterized cohorts with associated biobanks. Prospectively collected data and biospecimen from subjects of various age, sociodemographic, and cultural groups, both healthy and affected by one or more NCD, are essential for exploring biologic mechanisms and susceptibilities interlinking different environmental and lifestyle exposures, co-morbidities, as well as cellular senescence and aging. A paradigm shift in the research activities can currently be observed, moving from focused investigations on the effect of a single risk factor on an isolated health outcome to a more comprehensive assessment of risk patterns and a broader phenome approach. Though important methodological and analytical challenges need to be resolved, the ongoing international efforts to establish large-scale population-based biobank cohorts are a critical basis for moving NCD disease etiology forward. Summary Future epidemiologic and public health research should aim at sustaining a comprehensive systems view on health and disease. The political and public discussions about the utilitarian aspect of investing in and contributing to cohort and biobank research are essential and are indirectly linked to the achievement of public health programs effectively addressing the global NCD epidemic.
Collapse
Affiliation(s)
- Medea Imboden
- Swiss Tropical and Public Health Institute, Basel, Switzerland.
| | | |
Collapse
|
10
|
Abstract
In gene prediction, studying phenotypes is highly valuable for reducing the number of locus candidates in association studies and to aid disease gene candidate prioritization. This is due to the intrinsic nature of phenotypes to visibly reflect genetic activity, making them potentially one of the most useful data types for functional studies. However, systematic use of these data has begun only recently. 'Comparative phenomics' is the analysis of genotype-phenotype associations across species and experimental methods. This is an emerging research field of utmost importance for gene discovery and gene function annotation. In this chapter, we review the use of phenotype data in the biomedical field. We will give an overview of phenotype resources, focusing on PhenomicDB--a cross-species genotype-phenotype database--which is the largest available collection of phenotype descriptions across species and experimental methods. We report on its latest extension by which genotype-phenotype relationships can be viewed as graphical representations of similar phenotypes clustered together ('phenoclusters'), supplemented with information from protein-protein interactions and Gene Ontology terms. We show that such 'phenoclusters' represent a novel approach to group genes functionally and to predict novel gene functions with high precision. We explain how these data and methods can be used to supplement the results of gene discovery approaches. The aim of this chapter is to assist researchers interested in understanding how phenotype data can be used effectively in the gene discovery field.
Collapse
|
11
|
Zhou Y, Jie SH. Hereditary hyperbilirubinemia and its molecular diagnosis. Shijie Huaren Xiaohua Zazhi 2011; 19:2346-2352. [DOI: 10.11569/wcjd.v19.i22.2346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Hereditary hyperbilirubinemia is caused by genetic defects in the enzymes that control bilirubin metabolism. It includes Gilbert syndrome (GS), Crigler-Najjar syndrome (CNS), Lucey-Driscoll syndrome (LDS), Dubin-Johnson syndrome (DJS), Rotor syndrome (RS) and progressive familial intrahepatic cholestasis (PFIC). This literature review covers the molecular basis of and laboratory detection methods for hereditary hyperbilirubinemia.
Collapse
|
12
|
Evangelou E, Fellay J, Colombo S, Martinez-Picado J, Obel N, Goldstein DB, Telenti A, Ioannidis JPA. Impact of phenotype definition on genome-wide association signals: empirical evaluation in human immunodeficiency virus type 1 infection. Am J Epidemiol 2011; 173:1336-42. [PMID: 21490045 DOI: 10.1093/aje/kwr024] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Discussion on improving the power of genome-wide association studies to identify candidate variants and genes is generally centered on issues of maximizing sample size; less attention is given to the role of phenotype definition and ascertainment. The authors used genome-wide data from patients infected with human immunodeficiency virus type 1 (HIV-1) to assess whether differences in type of population (622 seroconverters vs. 636 seroprevalent subjects) or the number of measurements available for defining the phenotype resulted in differences in the effect sizes of associations between single nucleotide polymorphisms and the phenotype, HIV-1 viral load at set point. The effect estimate for the top 100 single nucleotide polymorphisms was 0.092 (95% confidence interval: 0.074, 0.110) log(10) viral load (log(10) copies of HIV-1 per mL of blood) greater in seroconverters than in seroprevalent subjects. The difference was even larger when the authors focused on chromosome 6 variants (0.153 log(10) viral load) or on variants that achieved genome-wide significance (0.232 log(10) viral load). The estimates of the genetic effects tended to be slightly larger when more viral load measurements were available, particularly among seroconverters and for variants that achieved genome-wide significance. Differences in phenotype definition and ascertainment may affect the estimated magnitude of genetic effects and should be considered in optimizing power for discovering new associations.
Collapse
Affiliation(s)
- Evangelos Evangelou
- Institute of Microbiology, University Hospital Center, University of Lausanne, Lausanne, Switzerland
| | | | | | | | | | | | | | | |
Collapse
|