1
|
A phenome-wide association study of 26 mendelian genes reveals phenotypic expressivity of common and rare variants within the general population. PLoS Genet 2020; 16:e1008802. [PMID: 33226994 PMCID: PMC7735621 DOI: 10.1371/journal.pgen.1008802] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Revised: 12/14/2020] [Accepted: 04/27/2020] [Indexed: 02/06/2023] Open
Abstract
The clinical evaluation of a genetic syndrome relies upon recognition of a characteristic pattern of signs or symptoms to guide targeted genetic testing for confirmation of the diagnosis. However, individuals displaying a single phenotype of a complex syndrome may not meet criteria for clinical diagnosis or genetic testing. Here, we present a phenome-wide association study (PheWAS) approach to systematically explore the phenotypic expressivity of common and rare alleles in genes associated with four well-described syndromic diseases (Alagille (AS), Marfan (MS), DiGeorge (DS), and Noonan (NS) syndromes) in the general population. Using human phenotype ontology (HPO) terms, we systematically mapped 60 phenotypes related to AS, MS, DS and NS in 337,198 unrelated white British from the UK Biobank (UKBB) based on their hospital admission records, self-administrated questionnaires, and physiological measurements. We performed logistic regression adjusting for age, sex, and the first 5 genetic principal components, for each phenotype and each variant in the target genes (JAG1, NOTCH2 FBN1, PTPN1 and RAS-opathy genes, and genes in the 22q11.2 locus) and performed a gene burden test. Overall, we observed multiple phenotype-genotype correlations, such as the association between variation in JAG1, FBN1, PTPN11 and SOS2 with diastolic and systolic blood pressure; and pleiotropy among multiple variants in syndromic genes. For example, rs11066309 in PTPN11 was significantly associated with a lower body mass index, an increased risk of hypothyroidism and a smaller size for gestational age, all in concordance with NS-related phenotypes. Similarly, rs589668 in FBN1 was associated with an increase in body height and blood pressure, and a reduced body fat percentage as observed in Marfan syndrome. Our findings suggest that the spectrum of associations of common and rare variants in genes involved in syndromic diseases can be extended to individual phenotypes within the general population. Standard medical evaluation of genetic syndromes relies upon recognizing a characteristic pattern of signs or symptoms to guide targeted genetic testing for confirmation of the diagnosis. This may lead to missing diagnoses in patients with silent or a low expressed form of the syndrome. Here we take advantage of a rich electronic health record, various phenotypic measurements, and genetic information in 337,198 unrelated white British from the UKBB, to study the relation between single syndromic disease phenotypes and genes related to syndromic disease. We show multiple phenotype-genotype associations in concordance with phenotypes variations found in syndromic diseases. For example, we show that a commonly found variant in FBN1 was associated with high standing/sitting height ratio and reduced body fat percentage as observed in individuals with Marfan syndrome. Our findings suggest that common and rare alleles in syndromic disease genes are causative of individual component phenotypes present in a general population; further research is needed to characterize the pleiotropic effect of alleles in syndromic genes in persons without the syndromic disease.
Collapse
|
2
|
Wang ZT, Tan CC, Tan L, Yu JT. Systems biology and gene networks in Alzheimer’s disease. Neurosci Biobehav Rev 2019; 96:31-44. [DOI: 10.1016/j.neubiorev.2018.11.007] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Revised: 11/18/2018] [Accepted: 11/18/2018] [Indexed: 12/25/2022]
|
3
|
Han J, Li J, Achour I, Pesce L, Foster I, Li H, Lussier YA. Convergent downstream candidate mechanisms of independent intergenic polymorphisms between co-classified diseases implicate epistasis among noncoding elements. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018; 23:524-535. [PMID: 29218911 PMCID: PMC5730078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Eighty percent of DNA outside protein coding regions was shown biochemically functional by the ENCODE project, enabling studies of their interactions. Studies have since explored how convergent downstream mechanisms arise from independent genetic risks of one complex disease. However, the cross-talk and epistasis between intergenic risks associated with distinct complex diseases have not been comprehensively characterized. Our recent integrative genomic analysis unveiled downstream biological effectors of disease-specific polymorphisms buried in intergenic regions, and we then validated their genetic synergy and antagonism in distinct GWAS. We extend this approach to characterize convergent downstream candidate mechanisms of distinct intergenic SNPs across distinct diseases within the same clinical classification. We construct a multipartite network consisting of 467 diseases organized in 15 classes, 2,358 disease-associated SNPs, 6,301 SNPassociated mRNAs by eQTL, and mRNA annotations to 4,538 Gene Ontology mechanisms. Functional similarity between two SNPs (similar SNP pairs) is imputed using a nested information theoretic distance model for which p-values are assigned by conservative scale-free permutation of network edges without replacement (node degrees constant). At FDR≤5%, we prioritized 3,870 intergenic SNP pairs associated, among which 755 are associated with distinct diseases sharing the same disease class, implicating 167 intergenic SNPs, 14 classes, 230 mRNAs, and 134 GO terms. Co-classified SNP pairs were more likely to be prioritized as compared to those of distinct classes confirming a noncoding genetic underpinning to clinical classification (odds ratio ∼3.8; p≤10-25). The prioritized pairs were also enriched in regions bound to the same/interacting transcription factors and/or interacting in long-range chromatin interactions suggestive of epistasis (odds ratio ∼ 2,500; p≤10-25). This prioritized network implicates complex epistasis between intergenic polymorphisms of co-classified diseases and offers a roadmap for a novel therapeutic paradigm: repositioning medications that target proteins within downstream mechanisms of intergenic disease-associated SNPs. Supplementary information and software: http://lussiergroup.org/publications/disease_class.
Collapse
Affiliation(s)
- Jiali Han
- Center for Biomedical Informatics and Biostatistics (CB2) and Departments of Medicine and of Systems and Industrial Engineering, The University of Arizona, Tucson, AZ 85721, USA,
| | | | | | | | | | | | | |
Collapse
|
4
|
Li Q, Schissler AG, Gardeux V, Berghout J, Achour I, Kenost C, Li H, Zhang HH, Lussier YA. kMEn: Analyzing noisy and bidirectional transcriptional pathway responses in single subjects. J Biomed Inform 2017; 66:32-41. [PMID: 28007582 PMCID: PMC5316373 DOI: 10.1016/j.jbi.2016.12.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2016] [Revised: 10/28/2016] [Accepted: 12/16/2016] [Indexed: 10/20/2022]
Abstract
MOTIVATION Understanding dynamic, patient-level transcriptomic response to therapy is an important step forward for precision medicine. However, conventional transcriptome analysis aims to discover cohort-level change, lacking the capacity to unveil patient-specific response to therapy. To address this gap, we previously developed two N-of-1-pathways methods, Wilcoxon and Mahalanobis distance, to detect unidirectionally responsive transcripts within a pathway using a pair of samples from a single subject. Yet, these methods cannot recognize bidirectionally (up and down) responsive pathways. Further, our previous approaches have not been assessed in presence of background noise and are not designed to identify differentially expressed mRNAs between two samples of a patient taken in different contexts (e.g. cancer vs non cancer), which we termed responsive transcripts (RTs). METHODS We propose a new N-of-1-pathways method, k-Means Enrichment (kMEn), that detects bidirectionally responsive pathways, despite background noise, using a pair of transcriptomes from a single patient. kMEn identifies transcripts responsive to the stimulus through k-means clustering and then tests for an over-representation of the responsive genes within each pathway. The pathways identified by kMEn are mechanistically interpretable pathways significantly responding to a stimulus. RESULTS In ∼9000 simulations varying six parameters, superior performance of kMEn over previous single-subject methods is evident by: (i) improved precision-recall at various levels of bidirectional response and (ii) lower rates of false positives (1-specificity) when more than 10% of genes in the genome are differentially expressed (background noise). In a clinical proof-of-concept, personal treatment-specific pathways identified by kMEn correlate with therapeutic response (p-value<0.01). CONCLUSION Through improved single-subject transcriptome dynamics of bidirectionally-regulated signals, kMEn provides a novel approach to identify mechanism-level biomarkers.
Collapse
Affiliation(s)
- Qike Li
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA; Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ 85721, USA
| | - A Grant Schissler
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA; Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ 85721, USA
| | - Vincent Gardeux
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA
| | - Joanne Berghout
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA
| | - Ikbel Achour
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA
| | - Colleen Kenost
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA
| | - Haiquan Li
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA.
| | - Hao Helen Zhang
- Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ 85721, USA; Department of Mathematics, The University of Arizona, Tucson, AZ 85721, USA.
| | - Yves A Lussier
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA; Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ 85721, USA; University of Arizona Cancer Center, The University of Arizona, Tucson, AZ 85721, USA; Institute for Genomics and Systems Biology, The University of Chicago, IL 60637, USA.
| |
Collapse
|
5
|
Integrative genomics analyses unveil downstream biological effectors of disease-specific polymorphisms buried in intergenic regions. NPJ Genom Med 2016; 1. [PMID: 27482468 PMCID: PMC4966659 DOI: 10.1038/npjgenmed.2016.6] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Functionally altered biological mechanisms arising from disease-associated polymorphisms, remain difficult to characterise when those variants are intergenic, or, fall between genes. We sought to identify shared downstream mechanisms by which inter- and intragenic single-nucleotide polymorphisms (SNPs) contribute to a specific physiopathology. Using computational modelling of 2 million pairs of disease-associated SNPs drawn from genome-wide association studies (GWAS), integrated with expression Quantitative Trait Loci (eQTL) and Gene Ontology functional annotations, we predicted 3,870 inter–intra and inter–intra SNP pairs with convergent biological mechanisms (FDR<0.05). These prioritised SNP pairs with overlapping messenger RNA targets or similar functional annotations were more likely to be associated with the same disease than unrelated pathologies (OR>12). We additionally confirmed synergistic and antagonistic genetic interactions for a subset of prioritised SNP pairs in independent studies of Alzheimer’s disease (entropy P=0.046), bladder cancer (entropy P=0.039), and rheumatoid arthritis (PheWAS case–control P<10−4). Using ENCODE data sets, we further statistically validated that the biological mechanisms shared within prioritised SNP pairs are frequently governed by matching transcription factor binding sites and long-range chromatin interactions. These results provide a ‘roadmap’ of disease mechanisms emerging from GWAS and further identify candidate therapeutic targets among downstream effectors of intergenic SNPs.
Collapse
|
6
|
Schissler AG, Gardeux V, Li Q, Achour I, Li H, Piegorsch WW, Lussier YA. Dynamic changes of RNA-sequencing expression for precision medicine: N-of-1-pathways Mahalanobis distance within pathways of single subjects predicts breast cancer survival. Bioinformatics 2015; 31:i293-302. [PMID: 26072495 PMCID: PMC4765863 DOI: 10.1093/bioinformatics/btv253] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Motivation: The conventional approach to personalized medicine relies on molecular data analytics across multiple patients. The path to precision medicine lies with molecular data analytics that can discover interpretable single-subject signals (N-of-1). We developed a global framework, N-of-1-pathways, for a mechanistic-anchored approach to single-subject gene expression data analysis. We previously employed a metric that could prioritize the statistical significance of a deregulated pathway in single subjects, however, it lacked in quantitative interpretability (e.g. the equivalent to a gene expression fold-change). Results: In this study, we extend our previous approach with the application of statistical Mahalanobis distance (MD) to quantify personal pathway-level deregulation. We demonstrate that this approach, N-of-1-pathways Paired Samples MD (N-OF-1-PATHWAYS-MD), detects deregulated pathways (empirical simulations), while not inflating false-positive rate using a study with biological replicates. Finally, we establish that N-OF-1-PATHWAYS-MD scores are, biologically significant, clinically relevant and are predictive of breast cancer survival (P < 0.05, n = 80 invasive carcinoma; TCGA RNA-sequences). Conclusion: N-of-1-pathways MD provides a practical approach towards precision medicine. The method generates the magnitude and the biological significance of personal deregulated pathways results derived solely from the patient’s transcriptome. These pathways offer the opportunities for deriving clinically actionable decisions that have the potential to complement the clinical interpretability of personal polymorphisms obtained from DNA acquired or inherited polymorphisms and mutations. In addition, it offers an opportunity for applicability to diseases in which DNA changes may not be relevant, and thus expand the ‘interpretable ‘omics’ of single subjects (e.g. personalome). Availability and implementation:http://www.lussierlab.net/publications/N-of-1-pathways. Contact:yves@email.arizona.edu or piegorsch@math.arizona.edu Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- A Grant Schissler
- University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA
| | - Vincent Gardeux
- University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA
| | - Qike Li
- University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA
| | - Ikbel Achour
- University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA
| | - Haiquan Li
- University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA
| | - Walter W Piegorsch
- University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA
| | - Yves A Lussier
- University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA University of Arizona Center for Biomedical Informatics and Biostatistics (CB2), Tucson, AZ, USA, Graduate Interdisciplinary Program in Statistics, Department of Medicine and BIO5 Institute, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
7
|
Rollo JL, Banihashemi N, Vafaee F, Crawford JW, Kuncic Z, Holsinger RMD. Unraveling the mechanistic complexity of Alzheimer's disease through systems biology. Alzheimers Dement 2015; 12:708-18. [PMID: 26703952 DOI: 10.1016/j.jalz.2015.10.010] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Revised: 08/18/2015] [Accepted: 10/21/2015] [Indexed: 11/16/2022]
Abstract
Alzheimer's disease (AD) is a complex, multifactorial disease that has reached global epidemic proportions. The challenge remains to fully identify its underlying molecular mechanisms that will enable development of accurate diagnostic tools and therapeutics. Conventional experimental approaches that target individual or small sets of genes or proteins may overlook important parts of the regulatory network, which limits the opportunity of identifying multitarget interventions. Our perspective is that a more complete insight into potential treatment options for AD will only be made possible through studying the disease as a system. We propose an integrative systems biology approach that we argue has been largely untapped in AD research. We present key publications to demonstrate the value of this approach and discuss the potential to intensify research efforts in AD through transdisciplinary collaboration. We highlight challenges and opportunities for significant breakthroughs that could be made if a systems biology approach is fully exploited.
Collapse
Affiliation(s)
- Jennifer L Rollo
- Charles Perkins Centre, The University of Sydney, Sydney, NSW, Australia; Laboratory of Molecular Neuroscience, Brain and Mind Centre, The University of Sydney, Sydney, NSW, Australia; Department of Molecular Neuroscience, Institute of Neurology, University College of London, London, UK.
| | - Nahid Banihashemi
- Charles Perkins Centre, The University of Sydney, Sydney, NSW, Australia
| | - Fatemeh Vafaee
- Charles Perkins Centre, The University of Sydney, Sydney, NSW, Australia; School of Mathematics and Statistics, University of Sydney, Sydney, NSW, Australia
| | | | - Zdenka Kuncic
- Charles Perkins Centre, The University of Sydney, Sydney, NSW, Australia; School of Physics, The University of Sydney, Sydney, NSW, Australia
| | - R M Damian Holsinger
- Laboratory of Molecular Neuroscience, Brain and Mind Centre, The University of Sydney, Sydney, NSW, Australia; Discipline of Biomedical Science, School of Medical Sciences, Sydney Medical School, The University of Sydney, Lidcombe, NSW, Australia
| |
Collapse
|
8
|
Li H, Pouladi N, Achour I, Gardeux V, Li J, Li Q, Zhang HH, Martinez FD, 'Skip' Garcia JGN, Lussier YA. eQTL networks unveil enriched mRNA master integrators downstream of complex disease-associated SNPs. J Biomed Inform 2015; 58:226-234. [PMID: 26524128 DOI: 10.1016/j.jbi.2015.10.010] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2015] [Revised: 10/15/2015] [Accepted: 10/20/2015] [Indexed: 01/19/2023]
Abstract
The causal and interplay mechanisms of Single Nucleotide Polymorphisms (SNPs) associated with complex diseases (complex disease SNPs) investigated in genome-wide association studies (GWAS) at the transcriptional level (mRNA) are poorly understood despite recent advancements such as discoveries reported in the Encyclopedia of DNA Elements (ENCODE) and Genotype-Tissue Expression (GTex). Protein interaction network analyses have successfully improved our understanding of both single gene diseases (Mendelian diseases) and complex diseases. Whether the mRNAs downstream of complex disease genes are central or peripheral in the genetic information flow relating DNA to mRNA remains unclear and may be disease-specific. Using expression Quantitative Trait Loci (eQTL) that provide DNA to mRNA associations and network centrality metrics, we hypothesize that we can unveil the systems properties of information flow between SNPs and the transcriptomes of complex diseases. We compare different conditions such as naïve SNP assignments and stringent linkage disequilibrium (LD) free assignments for transcripts to remove confounders from LD. Additionally, we compare the results from eQTL networks between lymphoblastoid cell lines and liver tissue. Empirical permutation resampling (p<0.001) and theoretic Mann-Whitney U test (p<10(-30)) statistics indicate that mRNAs corresponding to complex disease SNPs via eQTL associations are likely to be regulated by a larger number of SNPs than expected. We name this novel property mRNA hubness in eQTL networks, and further term mRNAs with high hubness as master integrators. mRNA master integrators receive and coordinate the perturbation signals from large numbers of polymorphisms and respond to the personal genetic architecture integratively. This genetic signal integration contrasts with the mechanism underlying some Mendelian diseases, where a genetic polymorphism affecting a single protein hub produces a divergent signal that affects a large number of downstream proteins. Indeed, we verify that this property is independent of the hubness in protein networks for which these mRNAs are transcribed. Our findings provide novel insights into the pleiotropy of mRNAs targeted by complex disease polymorphisms and the architecture of the information flow between the genetic polymorphisms and transcriptomes of complex diseases.
Collapse
Affiliation(s)
- Haiquan Li
- Department of Medicine, University of Arizona, Tucson, AZ, USA; Bio5 Institute, University of Arizona, Tucson, AZ, USA
| | - Nima Pouladi
- Department of Medicine, University of Arizona, Tucson, AZ, USA; Bio5 Institute, University of Arizona, Tucson, AZ, USA
| | - Ikbel Achour
- Department of Medicine, University of Arizona, Tucson, AZ, USA; Bio5 Institute, University of Arizona, Tucson, AZ, USA
| | - Vincent Gardeux
- Department of Medicine, University of Arizona, Tucson, AZ, USA; Bio5 Institute, University of Arizona, Tucson, AZ, USA
| | - Jianrong Li
- Department of Medicine, University of Arizona, Tucson, AZ, USA; Cancer Center, University of Arizona, Tucson, AZ, USA
| | - Qike Li
- Department of Medicine, University of Arizona, Tucson, AZ, USA; Interdisciplinary Program in Statistics, University of Arizona, Tucson, AZ, USA
| | - Hao Helen Zhang
- Interdisciplinary Program in Statistics, University of Arizona, Tucson, AZ, USA; Department of Mathematics, University of Arizona, Tucson, AZ, USA
| | - Fernando D Martinez
- Bio5 Institute, University of Arizona, Tucson, AZ, USA; Department of Pediatrics, University of Arizona, Tucson, AZ, USA
| | - Joe G N 'Skip' Garcia
- Department of Medicine, University of Arizona, Tucson, AZ, USA; Cancer Center, University of Arizona, Tucson, AZ, USA
| | - Yves A Lussier
- Department of Medicine, University of Arizona, Tucson, AZ, USA; Bio5 Institute, University of Arizona, Tucson, AZ, USA; Cancer Center, University of Arizona, Tucson, AZ, USA; Interdisciplinary Program in Statistics, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
9
|
Talwar P, Sinha J, Grover S, Rawat C, Kushwaha S, Agarwal R, Taneja V, Kukreti R. Dissecting Complex and Multifactorial Nature of Alzheimer's Disease Pathogenesis: a Clinical, Genomic, and Systems Biology Perspective. Mol Neurobiol 2015; 53:4833-64. [PMID: 26351077 DOI: 10.1007/s12035-015-9390-0] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2015] [Accepted: 08/11/2015] [Indexed: 01/14/2023]
Abstract
Alzheimer's disease (AD) is a progressive neurodegenerative disorder characterized by loss of memory and other cognitive functions. AD can be classified into familial AD (FAD) and sporadic AD (SAD) based on heritability and into early onset AD (EOAD) and late onset AD (LOAD) based on age of onset. LOAD cases are more prevalent with genetically complex architecture. In spite of significant research focused on understanding the etiological mechanisms, search for diagnostic biomarker(s) and disease-modifying therapy is still on. In this article, we aim to comprehensively review AD literature on established etiological mechanisms including role of beta-amyloid and apolipoprotein E (APOE) along with promising newer etiological factors such as epigenetic modifications that have been associated with AD suggesting its multifactorial nature. As genomic studies have recently played a significant role in elucidating AD pathophysiology, a systematic review of findings from genome-wide linkage (GWL), genome-wide association (GWA), genome-wide expression (GWE), and epigenome-wide association studies (EWAS) was conducted. The availability of multi-dimensional genomic data has further coincided with the advent of computational and network biology approaches in recent years. Our review highlights the importance of integrative approaches involving genomics and systems biology perspective in elucidating AD pathophysiology. The promising newer approaches may provide reliable means of early and more specific diagnosis and help identify therapeutic interventions for LOAD.
Collapse
Affiliation(s)
- Puneet Talwar
- Academy of Scientific and Innovative Research (AcSIR), CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB) Campus, New Delhi, India.,Genomics and Molecular Medicine Unit, CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), Mall Road, Delhi, 110 007, India
| | - Juhi Sinha
- Genomics and Molecular Medicine Unit, CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), Mall Road, Delhi, 110 007, India
| | - Sandeep Grover
- Genomics and Molecular Medicine Unit, CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), Mall Road, Delhi, 110 007, India.,Department of Paediatrics, Division of Pneumonology-Immunology, Charité University Medical Centre, Berlin, Germany
| | - Chitra Rawat
- Academy of Scientific and Innovative Research (AcSIR), CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB) Campus, New Delhi, India.,Genomics and Molecular Medicine Unit, CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), Mall Road, Delhi, 110 007, India
| | - Suman Kushwaha
- Institute of Human Behaviour and Allied Sciences (IHBAS), Delhi, India
| | - Rachna Agarwal
- Institute of Human Behaviour and Allied Sciences (IHBAS), Delhi, India
| | - Vibha Taneja
- Department of Research, Sir Ganga Ram Hospital, New Delhi, India
| | - Ritushree Kukreti
- Academy of Scientific and Innovative Research (AcSIR), CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB) Campus, New Delhi, India. .,Genomics and Molecular Medicine Unit, CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), Mall Road, Delhi, 110 007, India.
| |
Collapse
|
10
|
Gardeux V, Bosco A, Li J, Halonen MJ, Jackson D, Martinez FD, Lussier YA. Towards a PBMC "virogram assay" for precision medicine: Concordance between ex vivo and in vivo viral infection transcriptomes. J Biomed Inform 2015; 55:94-103. [PMID: 25797143 DOI: 10.1016/j.jbi.2015.03.003] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2014] [Revised: 02/25/2015] [Accepted: 03/13/2015] [Indexed: 12/31/2022]
Abstract
BACKGROUND Understanding individual patient host-response to viruses is key to designing optimal personalized therapy. Unsurprisingly, in vivo human experimentation to understand individualized dynamic response of the transcriptome to viruses are rarely studied because of the obvious limitations stemming from ethical considerations of the clinical risk. OBJECTIVE In this rhinovirus study, we first hypothesized that ex vivo human cells response to virus can serve as a proxy for otherwise controversial in vivo human experimentation. We further hypothesized that the N-of-1-pathways framework, previously validated in cancer, can be effective in understanding the more subtle individual transcriptomic response to viral infection. METHOD N-of-1-pathways computes a significance score for a given list of gene sets at the patient level, using merely the 'omics profiles of two paired samples as input. We extracted the peripheral blood mononuclear cells (PBMC) of four human subjects, aliquoted in two paired samples, one subjected to ex vivo rhinovirus infection. Their dysregulated genes and pathways were then compared to those of 9 human subjects prior and after intranasal inoculation in vivo with rhinovirus. Additionally, we developed the Similarity Venn Diagram, a novel visualization method that goes beyond conventional overlap to show the similarity between two sets of qualitative measures. RESULTS We evaluated the individual N-of-1-pathways results using two established cohort-based methods: GSEA and enrichment of differentially expressed genes. Similarity Venn Diagrams and individual patient ROC curves illustrate and quantify that the in vivo dysregulation is recapitulated ex vivo both at the gene and pathway level (p-values⩽0.004). CONCLUSION We established the first evidence that an interpretable dynamic transcriptome metric, conducted as an ex vivo assays for a single subject, has the potential to predict individualized response to infectious disease without the clinical risks otherwise associated to in vivo challenges. These results serve as a foundational work for personalized "virograms".
Collapse
Affiliation(s)
- Vincent Gardeux
- Department of Medicine, University of Arizona, Tucson, AZ, USA
| | - Anthony Bosco
- Telethon Institute for Child Health Research, Perth, Australia
| | - Jianrong Li
- Department of Medicine, University of Arizona, Tucson, AZ, USA
| | | | - Daniel Jackson
- Department of Pediatrics, School of Medicine and Public Health, University of Wisconsin, Madison, WI, USA; The Childhood Asthma Research and Education Network (CARE)
| | - Fernando D Martinez
- The Childhood Asthma Research and Education Network (CARE); Department of Pediatrics, University of Arizona, Tucson, AZ, USA; BIO5 Institute, University of Arizona, Tucson, AZ, USA.
| | - Yves A Lussier
- Department of Medicine, University of Arizona, Tucson, AZ, USA; BIO5 Institute, University of Arizona, Tucson, AZ, USA; UA Cancer Center, University of Arizona, Tucson, AZ, USA.
| |
Collapse
|
11
|
Romano JD, Tharp WG, Sarkar IN. Adapting simultaneous analysis phylogenomic techniques to study complex disease gene relationships. J Biomed Inform 2015; 54:10-38. [PMID: 25592479 DOI: 10.1016/j.jbi.2015.01.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2014] [Revised: 01/02/2015] [Accepted: 01/05/2015] [Indexed: 12/11/2022]
Abstract
The characterization of complex diseases remains a great challenge for biomedical researchers due to the myriad interactions of genetic and environmental factors. Network medicine approaches strive to accommodate these factors holistically. Phylogenomic techniques that can leverage available genomic data may provide an evolutionary perspective that may elucidate knowledge for gene networks of complex diseases and provide another source of information for network medicine approaches. Here, an automated method is presented that leverages publicly available genomic data and phylogenomic techniques, resulting in a gene network. The potential of approach is demonstrated based on a case study of nine genes associated with Alzheimer Disease, a complex neurodegenerative syndrome. The developed technique, which is incorporated into an update to a previously described Perl script called "ASAP," was implemented through a suite of Ruby scripts entitled "ASAP2," first compiles a list of sequence-similarity based orthologues using PSI-BLAST and a recursive NCBI BLAST+ search strategy, then constructs maximum parsimony phylogenetic trees for each set of nucleotide and protein sequences, and calculates phylogenetic metrics (Incongruence Length Difference between orthologue sets, partitioned Bremer support values, combined branch scores, and Robinson-Foulds distance) to provide an empirical assessment of evolutionary conservation within a given genetic network. In addition to the individual phylogenetic metrics, ASAP2 provides results in a way that can be used to generate a gene network that represents evolutionary similarity based on topological similarity (the Robinson-Foulds distance). The results of this study demonstrate the potential for using phylogenomic approaches that enable the study of multiple genes simultaneously to provide insights about potential gene relationships that can be studied within a network medicine framework that may not have been apparent using traditional, single-gene methods. Furthermore, the results provide an initial integrated evolutionary history of an Alzheimer Disease gene network and identify potentially important co-evolutionary clustering that may warrant further investigation.
Collapse
Affiliation(s)
- Joseph D Romano
- Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, VT 05405, USA
| | - William G Tharp
- Department of Medicine, Endocrinology Unit, University of Vermont, Burlington, VT 05405, USA
| | - Indra Neil Sarkar
- Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, VT 05405, USA; Center for Clinical and Translational Science, University of Vermont, Burlington, VT 05405, USA; Department of Computer Science, University of Vermont, Burlington, VT 05405, USA.
| |
Collapse
|
12
|
Lee YJ, Boyd AD, Li JJ, Gardeux V, Kenost C, Saner D, Li H, Abraham I, Krishnan JA, Lussier YA. COPD Hospitalization Risk Increased with Distinct Patterns of Multiple Systems Comorbidities Unveiled by Network Modeling. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2014; 2014:855-64. [PMID: 25954392 PMCID: PMC4419951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Earlier studies on hospitalization risk are largely based on regression models. To our knowledge, network modeling of multiple comorbidities is novel and inherently enables multidimensional scoring and unbiased feature reduction. Network modeling was conducted using an independent validation design starting from 38,695 patients, 1,446,581 visits, and 430 distinct clinical facilities/hospitals. Odds ratios (OR) were calculated for every pair of comorbidity using patient counts and compared their tendency with hospitalization rates and ED visits. Network topology analyses were performed, defining significant comorbidity associations as having OR≥5 & False-Discovery-Rate≤10(-7). Four COPD-associated comorbidity sub-networks emerged, incorporating multiple clinical systems: (i) metabolic syndrome, (ii) substance abuse and mental disorder, (iii) pregnancy-associated conditions, and (iv) fall-related injury. The latter two have not been reported yet. Features prioritized from the network are predictive of hospitalizations in an independent set (p<0.004). Therefore, we suggest that network topology is a scalable and generalizable method predictive of hospitalization.
Collapse
Affiliation(s)
- Young Ji Lee
- Department of Medicine, University of Illinois at Chicago, Chicago, IL
| | - Andrew D Boyd
- Institute for Translational Health Informatics, University of Illinois at Chicago, Chicago, IL ; Departments of Biomedical and Health Information Sciences, University of Illinois at Chicago, Chicago, IL ; University of Illinois Hospital and Health Science System, University of Illinois at Chicago, Chicago, IL
| | - Jianrong John Li
- Department of Medicine, The University of Arizona, Tucson, AZ, USA
| | - Vincent Gardeux
- Department of Medicine, The University of Arizona, Tucson, AZ, USA
| | - Colleen Kenost
- Department of Medicine, The University of Arizona, Tucson, AZ, USA ; Biomedical Informatics Service Group, Arizona Health Science Center, The University of Arizona, Tucson, AZ, USA
| | - Don Saner
- Cancer Center, The University of Arizona, Tucson, AZ, USA ; Biomedical Informatics Service Group, Arizona Health Science Center, The University of Arizona, Tucson, AZ, USA
| | - Haiquan Li
- Department of Medicine, The University of Arizona, Tucson, AZ, USA
| | - Ivo Abraham
- Department of Pharmacy Practice and Science, The University of Arizona, Tucson, AZ, USA
| | - Jerry A Krishnan
- Department of Medicine, University of Illinois at Chicago, Chicago, IL ; University of Illinois Hospital and Health Science System, University of Illinois at Chicago, Chicago, IL
| | - Yves A Lussier
- Department of Medicine, The University of Arizona, Tucson, AZ, USA ; Cancer Center, The University of Arizona, Tucson, AZ, USA ; Biomedical Informatics Service Group, Arizona Health Science Center, The University of Arizona, Tucson, AZ, USA ; Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ, USA
| |
Collapse
|
13
|
Gardeux V, Arslan AD, Achour I, Ho TT, Beck WT, Lussier YA. Concordance of deregulated mechanisms unveiled in underpowered experiments: PTBP1 knockdown case study. BMC Med Genomics 2014; 7 Suppl 1:S1. [PMID: 25079003 PMCID: PMC4101571 DOI: 10.1186/1755-8794-7-s1-s1] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Background Genome-wide transcriptome profiling generated by microarray and RNA-Seq often provides deregulated genes or pathways applicable only to larger cohort. On the other hand, individualized interpretation of transcriptomes is increasely pursued to improve diagnosis, prognosis, and patient treatment processes. Yet, robust and accurate methods based on a single paired-sample remain an unmet challenge. Methods "N-of-1-pathways" translates gene expression data profiles into mechanism-level profiles on single pairs of samples (one p-value per geneset). It relies on three principles: i) statistical universe is a single paired sample, which serves as its own control; ii) statistics can be derived from multiple gene expression measures that share common biological mechanisms assimilated to genesets; iii) semantic similarity metric takes into account inter-mechanisms' relationships to better assess commonality and differences, within and cross study-samples (e.g. patients, cell-lines, tissues, etc.), which helps the interpretation of the underpinning biology. Results In the context of underpowered experiments, N-of-1-pathways predictions perform better or comparable to those of GSEA and Differentially Expressed Genes enrichment (DEG enrichment), within-and cross-datasets. N-of-1-pathways uncovered concordant PTBP1-dependent mechanisms across datasets (Odds-Ratios≥13, p-values≤1 × 10−5), such as RNA splicing and cell cycle. In addition, it unveils tissue-specific mechanisms of alternatively transcribed PTBP1-dependent genesets. Furthermore, we demonstrate that GSEA and DEG Enrichment preclude accurate analysis on single paired samples. Conclusions N-of-1-pathways enables robust and biologically relevant mechanism-level classifiers with small cohorts and one single paired samples that surpasses conventional methods. Further, it identifies unique sample/ patient mechanisms, a requirement for precision medicine.
Collapse
|
14
|
Talwar P, Silla Y, Grover S, Gupta M, Agarwal R, Kushwaha S, Kukreti R. Genomic convergence and network analysis approach to identify candidate genes in Alzheimer's disease. BMC Genomics 2014; 15:199. [PMID: 24628925 PMCID: PMC4028079 DOI: 10.1186/1471-2164-15-199] [Citation(s) in RCA: 67] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2013] [Accepted: 02/21/2014] [Indexed: 01/28/2023] Open
Abstract
Background Alzheimer’s disease (AD) is one of the leading genetically complex and heterogeneous disorder that is influenced by both genetic and environmental factors. The underlying risk factors remain largely unclear for this heterogeneous disorder. In recent years, high throughput methodologies, such as genome-wide linkage analysis (GWL), genome-wide association (GWA) studies, and genome-wide expression profiling (GWE), have led to the identification of several candidate genes associated with AD. However, due to lack of consistency within their findings, an integrative approach is warranted. Here, we have designed a rank based gene prioritization approach involving convergent analysis of multi-dimensional data and protein-protein interaction (PPI) network modelling. Results Our approach employs integration of three different AD datasets- GWL,GWA and GWE to identify overlapping candidate genes ranked using a novel cumulative rank score (SR) based method followed by prioritization using clusters derived from PPI network. SR for each gene is calculated by addition of rank assigned to individual gene based on either p value or score in three datasets. This analysis yielded 108 plausible AD genes. Network modelling by creating PPI using proteins encoded by these genes and their direct interactors resulted in a layered network of 640 proteins. Clustering of these proteins further helped us in identifying 6 significant clusters with 7 proteins (EGFR, ACTB, CDC2, IRAK1, APOE, ABCA1 and AMPH) forming the central hub nodes. Functional annotation of 108 genes revealed their role in several biological activities such as neurogenesis, regulation of MAP kinase activity, response to calcium ion, endocytosis paralleling the AD specific attributes. Finally, 3 potential biochemical biomarkers were found from the overlap of 108 AD proteins with proteins from CSF and plasma proteome. EGFR and ACTB were found to be the two most significant AD risk genes. Conclusions With the assumption that common genetic signals obtained from different methodological platforms might serve as robust AD risk markers than candidates identified using single dimension approach, here we demonstrated an integrated genomic convergence approach for disease candidate gene prioritization from heterogeneous data sources linked to AD. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-199) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Ritushree Kukreti
- Genomics and Molecular Medicine Unit, Institute of Genomics and Integrative Biology (IGIB), Council of Scientific and Industrial Research (CSIR), Mall Road, Delhi 110 007, India.
| |
Collapse
|
15
|
Jiang X, Tse K, Wang S, Doan S, Kim H, Ohno-Machado L. Recent trends in biomedical informatics: a study based on JAMIA articles. J Am Med Inform Assoc 2013; 20:e198-205. [PMID: 24214018 PMCID: PMC3861936 DOI: 10.1136/amiajnl-2013-002429] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
In a growing interdisciplinary field like biomedical informatics, information dissemination and citation trends are changing rapidly due to many factors. To understand these factors better, we analyzed the evolution of the number of articles per major biomedical informatics topic, download/online view frequencies, and citation patterns (using Web of Science) for articles published from 2009 to 2012 in JAMIA. The number of articles published in JAMIA increased significantly from 2009 to 2012, and there were some topic differences in the last 4 years. Medical Record Systems, Algorithms, and Methods are topic categories that are growing fast in several publications. We observed a significant correlation between download frequencies and the number of citations per month since publication for a given article. Earlier free availability of articles to non-subscribers was associated with a higher number of downloads and showed a trend towards a higher number of citations. This trend will need to be verified as more data accumulate in coming years.
Collapse
Affiliation(s)
- Xiaoqian Jiang
- Division of Biomedical Informatics, Department of Medicine, University of California San Diego, La Jolla, California, USA
| | | | | | | | | | | |
Collapse
|