1
|
Thami PK, Choga WT, Dandara C, O’Brien SJ, Essex M, Gaseitsiwe S, Chimusa ER. Whole genome sequencing reveals population diversity and variation in HIV-1 specific host genes. Front Genet 2023; 14:1290624. [PMID: 38179408 PMCID: PMC10765519 DOI: 10.3389/fgene.2023.1290624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 11/20/2023] [Indexed: 01/06/2024] Open
Abstract
HIV infection continues to be a major global public health issue. The population heterogeneity in susceptibility or resistance to HIV-1 and progression upon infection is attributable to, among other factors, host genetic variation. Therefore, identifying population-specific variation and genetic modifiers of HIV infectivity can catapult the invention of effective strategies against HIV-1 in African populations. Here, we investigated whole genome sequences of 390 unrelated HIV-positive and -negative individuals from Botswana. We report 27.7 million single nucleotide variations (SNVs) in the complete genomes of Botswana nationals, of which 2.8 million were missing in public databases. Our population structure analysis revealed a largely homogenous structure in the Botswana population. Admixture analysis showed elevated components shared between the Botswana population and the Niger-Congo (65.9%), Khoe-San (32.9%), and Europeans (1.1%) ancestries in the population of Botswana. Statistical significance of the mutational burden of deleterious and loss-of-function variants per gene against a null model was estimated. The most deleterious variants were enriched in five genes: ACTRT2 (the Actin Related Protein T2), HOXD12 (homeobox D12), ABCB5 (ATP binding cassette subfamily B member 5), ATP8B4 (ATPase phospholipid transporting 8B4) and ABCC12 (ATP Binding Cassette Subfamily C Member 12). These genes are enriched in the glycolysis and gluconeogenesis (p < 2.84e-6) pathways and therefore, may contribute to the emerging field of immunometabolism in which therapy against HIV-1 infection is being evaluated. Published transcriptomic evidence supports the role of the glycolysis/gluconeogenesis pathways in the regulation of susceptibility to HIV, and that cumulative effects of genetic modifiers in glycolysis/gluconeogenesis pathways may potentially have effects on the expression and clinical variability of HIV-1. Identified genes and pathways provide novel avenues for other interventions, with the potential for informing the design of new therapeutics.
Collapse
Affiliation(s)
- Prisca K. Thami
- Division of Human Genetics, Department of Pathology, University of Cape Town, Cape Town, South Africa
| | - Wonderful T. Choga
- Division of Human Genetics, Department of Pathology, University of Cape Town, Cape Town, South Africa
- Botswana Harvard AIDS Institute Partnership, Gaborone, Botswana
| | - Collet Dandara
- Division of Human Genetics, Department of Pathology, University of Cape Town, Cape Town, South Africa
- Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Cape Town, South Africa
- UCT/SAMRC Platform for Pharmacogenomics Research and Translation (PREMED) Unit, South African Medical Research Council, Cape Town, South Africa
| | - Stephen J. O’Brien
- Laboratory of Genomics Diversity, Center for Computer Technologies, ITMO University, St. Petersburg, Russia
- Guy Harvey Oceanographic Center Halmos College of Arts and Sciences, Nova Southeastern University, Fort Lauderdale, FL, United States
| | - Myron Essex
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health AIDS Initiative, Harvard T. H. Chan School of Public Health, Boston, MA, United States
| | - Simani Gaseitsiwe
- Botswana Harvard AIDS Institute Partnership, Gaborone, Botswana
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health AIDS Initiative, Harvard T. H. Chan School of Public Health, Boston, MA, United States
| | - Emile R. Chimusa
- Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Cape Town, South Africa
- Department of Applied Sciences, Faculty of Health and Life Sciences, Northumbria University, Newcastle, United Kingdom
| |
Collapse
|
2
|
Chan AP, Choi Y, Rangan A, Zhang G, Podder A, Berens M, Sharma S, Pirrotte P, Byron S, Duggan D, Schork NJ. Interrogating the Human Diplome: Computational Methods, Emerging Applications, and Challenges. Methods Mol Biol 2023; 2590:1-30. [PMID: 36335489 DOI: 10.1007/978-1-0716-2819-5_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Human DNA sequencing protocols have revolutionized human biology, biomedical science, and clinical practice, but still have very important limitations. One limitation is that most protocols do not separate or assemble (i.e., "phase") the nucleotide content of each of the maternally and paternally derived chromosomal homologs making up the 22 autosomal pairs and the chromosomal pair making up the pseudo-autosomal region of the sex chromosomes. This has led to a dearth of studies and a consequent underappreciation of many phenomena of fundamental importance to basic and clinical genomic science. We discuss a few protocols for obtaining phase information as well as their limitations, including those that could be used in tumor phasing settings. We then describe a number of biological and clinical phenomena that require phase information. These include phenomena that require precise knowledge of the nucleotide sequence in a chromosomal segment from germline or somatic cells, such as DNA binding events, and insight into unique cis vs. trans-acting functionally impactful variant combinations-for example, variants implicated in a phenotype governed by compound heterozygosity. In addition, we also comment on the need for reliable and consensus-based diploid-context computational workflows for variant identification as well as the need for laboratory-based functional verification strategies for validating cis vs. trans effects of variant combinations. We also briefly describe available resources, example studies, as well as areas of further research, and ultimately argue that the science behind the study of human diploidy, referred to as "diplomics," which will be enabled by nucleotide-level resolution of phased genomes, is a logical next step in the analysis of human genome biology.
Collapse
Affiliation(s)
- Agnes P Chan
- The Translational Genomics Research Institute (TGen), part of the City of Hope National Medical Center, Phoenix, AZ, USA
| | - Yongwook Choi
- The Translational Genomics Research Institute (TGen), part of the City of Hope National Medical Center, Phoenix, AZ, USA
| | - Aditya Rangan
- Courant Institute of Mathematical Sciences at New York University, New York, NY, USA
| | - Guangfa Zhang
- The Translational Genomics Research Institute (TGen), part of the City of Hope National Medical Center, Phoenix, AZ, USA
| | - Avijit Podder
- The Translational Genomics Research Institute (TGen), part of the City of Hope National Medical Center, Phoenix, AZ, USA
| | - Michael Berens
- The Translational Genomics Research Institute (TGen), part of the City of Hope National Medical Center, Phoenix, AZ, USA
- The City of Hope National Medical Center, Duarte, CA, USA
| | - Sunil Sharma
- The Translational Genomics Research Institute (TGen), part of the City of Hope National Medical Center, Phoenix, AZ, USA
- The City of Hope National Medical Center, Duarte, CA, USA
| | - Patrick Pirrotte
- The Translational Genomics Research Institute (TGen), part of the City of Hope National Medical Center, Phoenix, AZ, USA
- The City of Hope National Medical Center, Duarte, CA, USA
| | - Sara Byron
- The Translational Genomics Research Institute (TGen), part of the City of Hope National Medical Center, Phoenix, AZ, USA
- The City of Hope National Medical Center, Duarte, CA, USA
| | - Dave Duggan
- The Translational Genomics Research Institute (TGen), part of the City of Hope National Medical Center, Phoenix, AZ, USA
- The City of Hope National Medical Center, Duarte, CA, USA
| | - Nicholas J Schork
- The Translational Genomics Research Institute (TGen), part of the City of Hope National Medical Center, Phoenix, AZ, USA.
- The City of Hope National Medical Center, Duarte, CA, USA.
| |
Collapse
|
3
|
Ji Y, Si Y, McMillin GA, Lyon E. Clinical pharmacogenomics testing in the era of next generation sequencing: challenges and opportunities for precision medicine. Expert Rev Mol Diagn 2018; 18:411-421. [PMID: 29634383 DOI: 10.1080/14737159.2018.1461561] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
INTRODUCTION The rapid development and dramatic decrease in cost of sequencing techniques have ushered the implementation of genomic testing in patient care. Next generation DNA sequencing (NGS) techniques have been used increasingly in clinical laboratories to scan the whole or part of the human genome in order to facilitate diagnosis and/or prognostics of genetic disease. Despite many hurdles and debates, pharmacogenomics (PGx) is believed to be an area of genomic medicine where precision medicine could have immediate impact in the near future. Areas covered: This review focuses on lessons learned through early attempts of clinically implementing PGx testing; the challenges and opportunities that PGx testing brings to precision medicine in the era of NGS. Expert commentary: Replacing targeted analysis approach with NGS for PGx testing is neither technically feasible nor necessary currently due to several technical limitations and uncertainty involved in interpreting variants of uncertain significance for PGx variants. However, reporting PGx variants out of clinical whole exome or whole genome sequencing (WES/WGS) might represent additional benefits for patients who are tested by WES/WGS.
Collapse
Affiliation(s)
- Yuan Ji
- a ARUP Laboratories and Department of Pathology , University of Utah School of Medicine , Salt Lake City , UT , USA
| | - Yue Si
- a ARUP Laboratories and Department of Pathology , University of Utah School of Medicine , Salt Lake City , UT , USA
| | - Gwendolyn A McMillin
- a ARUP Laboratories and Department of Pathology , University of Utah School of Medicine , Salt Lake City , UT , USA
| | - Elaine Lyon
- a ARUP Laboratories and Department of Pathology , University of Utah School of Medicine , Salt Lake City , UT , USA
| |
Collapse
|
4
|
Leveraging genome characteristics to improve gene discovery for putamen subcortical brain structure. Sci Rep 2017; 7:15736. [PMID: 29147026 PMCID: PMC5691156 DOI: 10.1038/s41598-017-15705-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Accepted: 10/31/2017] [Indexed: 12/21/2022] Open
Abstract
Discovering genetic variants associated with human brain structures is an on-going effort. The ENIGMA consortium conducted genome-wide association studies (GWAS) with standard multi-study analytical methodology and identified several significant single nucleotide polymorphisms (SNPs). Here we employ a novel analytical approach that incorporates functional genome annotations (e.g., exon or 5′UTR), total linkage disequilibrium (LD) scores and heterozygosity to construct enrichment scores for improved identification of relevant SNPs. The method provides increased power to detect associated SNPs by estimating stratum-specific false discovery rate (FDR), where strata are classified according to enrichment scores. Applying this approach to the GWAS summary statistics of putamen volume in the ENIGMA cohort, a total of 15 independent significant SNPs were identified (conditional FDR < 0.05). In contrast, 4 SNPs were found based on standard GWAS analysis (P < 5 × 10−8). These 11 novel loci include GATAD2B, ASCC3, DSCAML1, and HELZ, which are previously implicated in various neural related phenotypes. The current findings demonstrate the boost in power with the annotation-informed FDR method, and provide insight into the genetic architecture of the putamen.
Collapse
|
5
|
Schork NJ, Nazor K. Integrated Genomic Medicine: A Paradigm for Rare Diseases and Beyond. ADVANCES IN GENETICS 2017; 97:81-113. [PMID: 28838357 PMCID: PMC6383766 DOI: 10.1016/bs.adgen.2017.06.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Individualized medicine, or the tailoring of therapeutic interventions to a patient's unique genetic, biochemical, physiological, exposure and behavioral profile, has been enhanced, if not enabled, by modern biomedical technologies such as high-throughput DNA sequencing platforms, induced pluripotent stem cell assays, biomarker discovery protocols, imaging modalities, and wireless monitoring devices. Despite successes in the isolated use of these technologies, however, it is arguable that their combined and integrated use in focused studies of individual patients is the best way to not only tailor interventions for those patients, but also shed light on treatment strategies for patients with similar conditions. This is particularly true for individuals with rare diseases since, by definition, they will require study without recourse to other individuals, or at least without recourse to many other individuals. Such integration and focus will require new biomedical scientific paradigms and infrastructure, including the creation of databases harboring study results, the formation of dedicated multidisciplinary research teams and new training programs. We consider the motivation and potential for such integration, point out areas in need of improvement, and argue for greater emphasis on improving patient health via technological innovations, not merely improving the technologies themselves. We also argue that the paradigm described can, in theory, be extended to the study of individuals with more common diseases.
Collapse
Affiliation(s)
- Nicholas J. Schork
- The Translational Genomics Research Institute, 445 North Fifth Street, Phoenix, AZ 85004, , 858-794-4054
| | - Kristopher Nazor
- MYi Diagnostics and Discovery, 5310 Eastgate Mall, San Diego, CA 92121, , 858-458-9305
| |
Collapse
|
6
|
Wu L, Schaid DJ, Sicotte H, Wieben ED, Li H, Petersen GM. Case-only exome sequencing and complex disease susceptibility gene discovery: study design considerations. J Med Genet 2014; 52:10-6. [PMID: 25371537 DOI: 10.1136/jmedgenet-2014-102697] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Whole exome sequencing (WES) provides an unprecedented opportunity to identify the potential aetiological role of rare functional variants in human complex diseases. Large-scale collaborations have generated germline WES data on patients with a number of diseases, especially cancer, but less often on healthy controls under the same sequencing procedures. These data can be a valuable resource for identifying new disease susceptibility loci if study designs are appropriately applied. This review describes suggested strategies and technical considerations when focusing on case-only study designs that use WES data in complex disease scenarios. These include variant filtering based on frequency and functionality, gene prioritisation, interrogation of different data types and targeted sequencing validation. We propose that if case-only WES designs were applied in an appropriate manner, new susceptibility genes containing rare variants for human complex diseases can be detected.
Collapse
Affiliation(s)
- Lang Wu
- Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, USA Center for Clinical and Translational Science, Mayo Clinic, Rochester, Minnesota, USA
| | - Daniel J Schaid
- Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, USA
| | - Hugues Sicotte
- Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, USA
| | - Eric D Wieben
- Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, Minnesota, USA
| | - Hu Li
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, Minnesota, USA
| | - Gloria M Petersen
- Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, USA
| |
Collapse
|
7
|
Zablocki RW, Schork AJ, Levine RA, Andreassen OA, Dale AM, Thompson WK. Covariate-modulated local false discovery rate for genome-wide association studies. Bioinformatics 2014; 30:2098-104. [PMID: 24711653 PMCID: PMC4103587 DOI: 10.1093/bioinformatics/btu145] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2013] [Revised: 03/03/2014] [Accepted: 03/05/2014] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Genome-wide association studies (GWAS) have largely failed to identify most of the genetic basis of highly heritable diseases and complex traits. Recent work has suggested this could be because many genetic variants, each with individually small effects, compose their genetic architecture, limiting the power of GWAS, given currently obtainable sample sizes. In this scenario, Bonferroni-derived thresholds are severely underpowered to detect the vast majority of associations. Local false discovery rate (fdr) methods provide more power to detect non-null associations, but implicit assumptions about the exchangeability of single nucleotide polymorphisms (SNPs) limit their ability to discover non-null loci. METHODS We propose a novel covariate-modulated local false discovery rate (cmfdr) that incorporates prior information about gene element-based functional annotations of SNPs, so that SNPs from categories enriched for non-null associations have a lower fdr for a given value of a test statistic than SNPs in unenriched categories. This readjustment of fdr based on functional annotations is achieved empirically by fitting a covariate-modulated parametric two-group mixture model. The proposed cmfdr methodology is applied to a large Crohn's disease GWAS. RESULTS Use of cmfdr dramatically improves power, e.g. increasing the number of loci declared significant at the 0.05 fdr level by a factor of 5.4. We also demonstrate that SNPs were declared significant using cmfdr compared with usual fdr replicate in much higher numbers, while maintaining similar replication rates for a given fdr cutoff in de novo samples, using the eight Crohn's disease substudies as independent training and test datasets. Availability an implementation: https://sites.google.com/site/covmodfdr/ CONTACT : wes.stat@gmail.com SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Rong W Zablocki
- Computational Science Research Center, San Diego State University, San Diego, CA 92182, USA, Cognitive Sciences Graduate Program, University of California at San Diego, La Jolla, CA 92093, USA, Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92182, USA, Institute of Clinical Medicine, University of Oslo, Oslo, 0424, Norway, Multimodal Imaging Laboratory and Department of Psychiatry, University of California at San Diego, La Jolla, CA 92093, USA
| | - Andrew J Schork
- Computational Science Research Center, San Diego State University, San Diego, CA 92182, USA, Cognitive Sciences Graduate Program, University of California at San Diego, La Jolla, CA 92093, USA, Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92182, USA, Institute of Clinical Medicine, University of Oslo, Oslo, 0424, Norway, Multimodal Imaging Laboratory and Department of Psychiatry, University of California at San Diego, La Jolla, CA 92093, USA
| | - Richard A Levine
- Computational Science Research Center, San Diego State University, San Diego, CA 92182, USA, Cognitive Sciences Graduate Program, University of California at San Diego, La Jolla, CA 92093, USA, Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92182, USA, Institute of Clinical Medicine, University of Oslo, Oslo, 0424, Norway, Multimodal Imaging Laboratory and Department of Psychiatry, University of California at San Diego, La Jolla, CA 92093, USA
| | - Ole A Andreassen
- Computational Science Research Center, San Diego State University, San Diego, CA 92182, USA, Cognitive Sciences Graduate Program, University of California at San Diego, La Jolla, CA 92093, USA, Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92182, USA, Institute of Clinical Medicine, University of Oslo, Oslo, 0424, Norway, Multimodal Imaging Laboratory and Department of Psychiatry, University of California at San Diego, La Jolla, CA 92093, USA
| | - Anders M Dale
- Computational Science Research Center, San Diego State University, San Diego, CA 92182, USA, Cognitive Sciences Graduate Program, University of California at San Diego, La Jolla, CA 92093, USA, Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92182, USA, Institute of Clinical Medicine, University of Oslo, Oslo, 0424, Norway, Multimodal Imaging Laboratory and Department of Psychiatry, University of California at San Diego, La Jolla, CA 92093, USAComputational Science Research Center, San Diego State University, San Diego, CA 92182, USA, Cognitive Sciences Graduate Program, University of California at San Diego, La Jolla, CA 92093, USA, Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92182, USA, Institute of Clinical Medicine, University of Oslo, Oslo, 0424, Norway, Multimodal Imaging Laboratory and Department of Psychiatry, University of California at San Diego, La Jolla, CA 92093, USAComputational Science Research Center, San Diego State University, San Diego, CA 92182, USA, Cognitive Sciences Graduate Program, University of California at San Diego, La Jolla, CA 92093, USA, Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92182, USA, Institute of Clinical Medicine, University of Oslo, Oslo, 0424, Norway, Multimodal Imaging Laboratory and Department of Psychiatry, University of California at San Diego, La Jolla, CA 92093, USA
| | - Wesley K Thompson
- Computational Science Research Center, San Diego State University, San Diego, CA 92182, USA, Cognitive Sciences Graduate Program, University of California at San Diego, La Jolla, CA 92093, USA, Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92182, USA, Institute of Clinical Medicine, University of Oslo, Oslo, 0424, Norway, Multimodal Imaging Laboratory and Department of Psychiatry, University of California at San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
8
|
Torri F, Dinov ID, Zamanyan A, Hobel S, Genco A, Petrosyan P, Clark AP, Liu Z, Eggert P, Pierce J, Knowles JA, Ames J, Kesselman C, Toga AW, Potkin SG, Vawter MP, Macciardi F. Next generation sequence analysis and computational genomics using graphical pipeline workflows. Genes (Basel) 2014; 3:545-75. [PMID: 23139896 PMCID: PMC3490498 DOI: 10.3390/genes3030545] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Whole-genome and exome sequencing have already proven to be essential and powerful methods to identify genes responsible for simple Mendelian inherited disorders. These methods can be applied to complex disorders as well, and have been adopted as one of the current mainstream approaches in population genetics. These achievements have been made possible by next generation sequencing (NGS) technologies, which require substantial bioinformatics resources to analyze the dense and complex sequence data. The huge analytical burden of data from genome sequencing might be seen as a bottleneck slowing the publication of NGS papers at this time, especially in psychiatric genetics. We review the existing methods for processing NGS data, to place into context the rationale for the design of a computational resource. We describe our method, the Graphical Pipeline for Computational Genomics (GPCG), to perform the computational steps required to analyze NGS data. The GPCG implements flexible workflows for basic sequence alignment, sequence data quality control, single nucleotide polymorphism analysis, copy number variant identification, annotation, and visualization of results. These workflows cover all the analytical steps required for NGS data, from processing the raw reads to variant calling and annotation. The current version of the pipeline is freely available at http://pipeline.loni.ucla.edu. These applications of NGS analysis may gain clinical utility in the near future (e.g., identifying miRNA signatures in diseases) when the bioinformatics approach is made feasible. Taken together, the annotation tools and strategies that have been developed to retrieve information and test hypotheses about the functional role of variants present in the human genome will help to pinpoint the genetic risk factors for psychiatric disorders.
Collapse
Affiliation(s)
- Federica Torri
- Department of Psychiatry and Human Behavior, University of California, Irvine, CA 92617, USA; E-Mails: (F.T.); (S.G.P.)
- Biomedical Informatics Research Network (BIRN), Information Sciences Institute, University of Southern California, Los Angeles, CA 90292, USA; E-Mails: (I.D.D.); (J.A.); (C.K.); (A.W.T.)
| | - Ivo D. Dinov
- Biomedical Informatics Research Network (BIRN), Information Sciences Institute, University of Southern California, Los Angeles, CA 90292, USA; E-Mails: (I.D.D.); (J.A.); (C.K.); (A.W.T.)
- Laboratory of Neuro Imaging (LONI), University of California, Los Angeles, CA 90095, USA; E-Mails: (A.Z.); (S.H.); (A.G.); (P.P.); (Z.L.); (P.E.); (J.P.)
| | - Alen Zamanyan
- Laboratory of Neuro Imaging (LONI), University of California, Los Angeles, CA 90095, USA; E-Mails: (A.Z.); (S.H.); (A.G.); (P.P.); (Z.L.); (P.E.); (J.P.)
| | - Sam Hobel
- Laboratory of Neuro Imaging (LONI), University of California, Los Angeles, CA 90095, USA; E-Mails: (A.Z.); (S.H.); (A.G.); (P.P.); (Z.L.); (P.E.); (J.P.)
| | - Alex Genco
- Laboratory of Neuro Imaging (LONI), University of California, Los Angeles, CA 90095, USA; E-Mails: (A.Z.); (S.H.); (A.G.); (P.P.); (Z.L.); (P.E.); (J.P.)
| | - Petros Petrosyan
- Laboratory of Neuro Imaging (LONI), University of California, Los Angeles, CA 90095, USA; E-Mails: (A.Z.); (S.H.); (A.G.); (P.P.); (Z.L.); (P.E.); (J.P.)
| | - Andrew P. Clark
- Zilkha Neurogenetic Institute, USC Keck School of Medicine, Los Angeles, CA 90033, USA; E-Mails: (A.P.C.); (J.A.K.)
| | - Zhizhong Liu
- Laboratory of Neuro Imaging (LONI), University of California, Los Angeles, CA 90095, USA; E-Mails: (A.Z.); (S.H.); (A.G.); (P.P.); (Z.L.); (P.E.); (J.P.)
| | - Paul Eggert
- Laboratory of Neuro Imaging (LONI), University of California, Los Angeles, CA 90095, USA; E-Mails: (A.Z.); (S.H.); (A.G.); (P.P.); (Z.L.); (P.E.); (J.P.)
- Department of Computer Science, University of California, Los Angeles, CA 90095, USA
| | - Jonathan Pierce
- Laboratory of Neuro Imaging (LONI), University of California, Los Angeles, CA 90095, USA; E-Mails: (A.Z.); (S.H.); (A.G.); (P.P.); (Z.L.); (P.E.); (J.P.)
| | - James A. Knowles
- Zilkha Neurogenetic Institute, USC Keck School of Medicine, Los Angeles, CA 90033, USA; E-Mails: (A.P.C.); (J.A.K.)
| | - Joseph Ames
- Biomedical Informatics Research Network (BIRN), Information Sciences Institute, University of Southern California, Los Angeles, CA 90292, USA; E-Mails: (I.D.D.); (J.A.); (C.K.); (A.W.T.)
| | - Carl Kesselman
- Biomedical Informatics Research Network (BIRN), Information Sciences Institute, University of Southern California, Los Angeles, CA 90292, USA; E-Mails: (I.D.D.); (J.A.); (C.K.); (A.W.T.)
| | - Arthur W. Toga
- Biomedical Informatics Research Network (BIRN), Information Sciences Institute, University of Southern California, Los Angeles, CA 90292, USA; E-Mails: (I.D.D.); (J.A.); (C.K.); (A.W.T.)
- Laboratory of Neuro Imaging (LONI), University of California, Los Angeles, CA 90095, USA; E-Mails: (A.Z.); (S.H.); (A.G.); (P.P.); (Z.L.); (P.E.); (J.P.)
| | - Steven G. Potkin
- Department of Psychiatry and Human Behavior, University of California, Irvine, CA 92617, USA; E-Mails: (F.T.); (S.G.P.)
- Biomedical Informatics Research Network (BIRN), Information Sciences Institute, University of Southern California, Los Angeles, CA 90292, USA; E-Mails: (I.D.D.); (J.A.); (C.K.); (A.W.T.)
| | - Marquis P. Vawter
- Functional Genomics Laboratory, Department of Psychiatry And Human Behavior, School of Medicine, University of California, Irvine, CA 92697, USA; E-Mail:
| | - Fabio Macciardi
- Department of Psychiatry and Human Behavior, University of California, Irvine, CA 92617, USA; E-Mails: (F.T.); (S.G.P.)
- Biomedical Informatics Research Network (BIRN), Information Sciences Institute, University of Southern California, Los Angeles, CA 90292, USA; E-Mails: (I.D.D.); (J.A.); (C.K.); (A.W.T.)
- Author to whom correspondence should be addressed; E-Mail: ; Tel.: +1-949-824-4559; Fax: +1-949-824-2072
| |
Collapse
|
9
|
Riera C, Lois S, de la Cruz X. Prediction of pathological mutations in proteins: the challenge of integrating sequence conservation and structure stability principles. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2013. [DOI: 10.1002/wcms.1170] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Casandra Riera
- Laboratory of Translational Bioinformatics in Neuroscience; VHIR; Barcelona Spain
| | - Sergio Lois
- Laboratory of Translational Bioinformatics in Neuroscience; VHIR; Barcelona Spain
| | - Xavier de la Cruz
- Laboratory of Translational Bioinformatics in Neuroscience; VHIR; Barcelona Spain
- Institució Catalana per la Recerca i Estudis Avançats (ICREA); Barcelona Spain
| |
Collapse
|
10
|
Farhan SM, Hegele RA. Genetics 101 for Cardiologists: Rare Genetic Variants and Monogenic Cardiovascular Disease. Can J Cardiol 2013. [DOI: 10.1016/j.cjca.2012.10.010] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
|
11
|
Vrieze SI, Iacono WG, McGue M. Confluence of genes, environment, development, and behavior in a post Genome-Wide Association Study world. Dev Psychopathol 2012; 24:1195-214. [PMID: 23062291 PMCID: PMC3476066 DOI: 10.1017/s0954579412000648] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
This article serves to outline a research paradigm to investigate main effects and interactions of genes, environment, and development on behavior and psychiatric illness. We provide a historical context for candidate gene studies and genome-wide association studies, including benefits, limitations, and expected payoffs. Using substance use and abuse as our driving example, we then turn to the importance of etiological psychological theory in guiding genetic, environmental, and developmental research, as well as the utility of refined phenotypic measures, such as endophenotypes, in the pursuit of etiological understanding and focused tests of genetic and environmental associations. Phenotypic measurement has received considerable attention in the history of psychology and is informed by psychometrics, whereas the environment remains relatively poorly measured and is often confounded with genetic effects (i.e., gene-environment correlation). Genetically informed designs, which are no longer limited to twin and adoption studies thanks to ever-cheaper genotyping, are required to understand environmental influences. Finally, we outline the vast amount of individual difference in structural genomic variation, most of which remains to be leveraged in genetic association tests. Although the genetic data can be massive and burdensome (tens of millions of variants per person), we argue that improved understanding of genomic structure and function will provide investigators with new tools to test specific a priori hypotheses derived from etiological psychological theory, much like current candidate gene research but with less confusion and more payoff than candidate gene research has to date.
Collapse
Affiliation(s)
- Scott I Vrieze
- Department of Psychology, University of Minnesota, Minneapolis, MN 55455, USA.
| | | | | |
Collapse
|
12
|
Torkamani A, Pham P, Libiger O, Bansal V, Zhang G, Scott-Van Zeeland AA, Tewhey R, Topol EJ, Schork NJ. Clinical implications of human population differences in genome-wide rates of functional genotypes. Front Genet 2012; 3:211. [PMID: 23125845 PMCID: PMC3485509 DOI: 10.3389/fgene.2012.00211] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2012] [Accepted: 09/26/2012] [Indexed: 12/21/2022] Open
Abstract
There have been a number of recent successes in the use of whole genome sequencing and sophisticated bioinformatics techniques to identify pathogenic DNA sequence variants responsible for individual idiopathic congenital conditions. However, the success of this identification process is heavily influenced by the ancestry or genetic background of a patient with an idiopathic condition. This is so because potential pathogenic variants in a patient’s genome must be contrasted with variants in a reference set of genomes made up of other individuals’ genomes of the same ancestry as the patient. We explored the effect of ignoring the ancestries of both an individual patient and the individuals used to construct reference genomes. We pursued this exploration in two major steps. We first considered variation in the per-genome number and rates of likely functional derived (i.e., non-ancestral, based on the chimp genome) single nucleotide variants and small indels in 52 individual whole human genomes sampled from 10 different global populations. We took advantage of a suite of computational and bioinformatics techniques to predict the functional effect of over 24 million genomic variants, both coding and non-coding, across these genomes. We found that the typical human genome harbors ∼5.5–6.1 million total derived variants, of which ∼12,000 are likely to have a functional effect (∼5000 coding and ∼7000 non-coding). We also found that the rates of functional genotypes per the total number of genotypes in individual whole genomes differ dramatically between human populations. We then created tables showing how the use of comparator or reference genome panels comprised of genomes from individuals that do not have the same ancestral background as a patient can negatively impact pathogenic variant identification. Our results have important implications for clinical sequencing initiatives.
Collapse
Affiliation(s)
- Ali Torkamani
- The Scripps Translational Science La Jolla, CA, USA ; Scripps Health La Jolla, CA, USA ; Department of Molecular and Experimental Medicine, The Scripps Research Institute La Jolla, CA, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Affiliation(s)
- Monique Ohanian
- Molecular Cardiology Division, Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia
| | | | | |
Collapse
|
14
|
Intentions to receive individual results from whole-genome sequencing among participants in the ClinSeq study. Eur J Hum Genet 2012; 21:261-5. [PMID: 22892536 DOI: 10.1038/ejhg.2012.179] [Citation(s) in RCA: 144] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
Genome sequencing has been rapidly integrated into clinical research and is currently marketed to health-care practitioners and consumers alike. The volume of sequencing data generated for a single individual and the wide range of findings from whole-genome sequencing raise critical questions about the return of results and their potential value for end-users. We conducted a mixed-methods study of 311 sequential participants in the NIH ClinSeq study to assess general preferences and specific attitudes toward learning results. We tested how these variables predicted intentions to receive results within four categories of findings ranging from medically actionable to variants of unknown significance. Two hundred and ninety-four participants indicated a preference to learn their genome sequencing results. Most often, participants cited disease prevention as their reason, including intention to change their lifestyle behaviors. Participants held positive attitudes, strongly perceived social norms and strong intentions to learn results, although there were significant mean differences among four categories of findings (P<0.01). Attitudes and social norms for medically actionable and carrier results were most similar and rated the highest. Participants distinguished among the types and quality of information they may receive, despite strong intentions to learn all results presented. These intentions were motivated by confidence in their ability to use the information to prevent future disease and a belief in the value of even uninterpretable information. It behooves investigators to facilitate participants' desire to learn a range of information from genomic sequencing while promoting realistic expectations for its clinical and personal utility.
Collapse
|
15
|
Xu F, Wang Q, Zhang F, Zhu Y, Gu Q, Wu L, Yang L, Yang X. Impact of Next-Generation Sequencing (NGS) technology on cardiovascular disease research. Cardiovasc Diagn Ther 2012; 2:138-46. [PMID: 24282707 DOI: 10.3978/j.issn.2223-3652.2012.06.01] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2012] [Accepted: 06/08/2012] [Indexed: 11/14/2022]
Abstract
In recent years, hundreds of gene loci associated with multiple cardiovascular pathologies and traits have been identified through high-throughput Next-Generation Sequencing (NGS) technology. Due to the increasing efficiency and decreasing cost of NGS, rapid progresses anticipated in the field of CVD research. This review summarizes the main strategies of CV research with NGS at the level of genomics, transcriptomics, epigenetics, and proteomics.
Collapse
|
16
|
Yashin AI, Wu D, Arbeev KG, Ukraintseva SV. Polygenic effects of common single-nucleotide polymorphisms on life span: when association meets causality. Rejuvenation Res 2012; 15:381-94. [PMID: 22533364 DOI: 10.1089/rej.2011.1257] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022] Open
Abstract
Recently we have shown that the human life span is influenced jointly by many common single-nucleotide polymorphisms (SNPs), each with a small individual effect. Here we investigate further the polygenic influence on life span and discuss its possible biological mechanisms. First we identified six sets of prolongevity SNP alleles in the Framingham Heart Study 550K SNPs data, using six different statistical procedures (normal linear, Cox, and logistic regressions; generalized estimation equation; mixed model; gene frequency method). We then estimated joint effects of these SNPs on human survival. We found that alleles in each set show significant additive influence on life span. Twenty-seven SNPs comprised the overlapping set of SNPs that influenced life span, regardless of the statistical procedure. The majority of these SNPs (74%) were within genes, compared to 40% of SNPs in the original 550K set. We then performed a review of current literature on functions of genes closest to these 27 SNPs. The review showed that the respective genes are largely involved in aging, cancer, and brain disorders. We concluded that polygenic effects can explain a substantial portion of genetic influence on life span. Composition of the set of prolongevity alleles depends on the statistical procedure used for the allele selection. At the same time, there is a core set of longevity alleles that are selected with all statistical procedures. Functional relevance of respective genes to aging and major diseases supports causal relationships between the identified SNPs and life span. The fact that genes found in our and other genetic association studies of aging/longevity have similar functions indicates high chances of true positive associations for corresponding genetic variants.
Collapse
Affiliation(s)
- Anatoliy I Yashin
- Center for Population Health and Aging, Duke University, Durham, NC 27708-0408, USA.
| | | | | | | |
Collapse
|
17
|
Sebastiani P, Riva A, Montano M, Pham P, Torkamani A, Scherba E, Benson G, Milton JN, Baldwin CT, Andersen S, Schork NJ, Steinberg MH, Perls TT. Whole genome sequences of a male and female supercentenarian, ages greater than 114 years. Front Genet 2012; 2:90. [PMID: 22303384 PMCID: PMC3262222 DOI: 10.3389/fgene.2011.00090] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2011] [Accepted: 12/04/2011] [Indexed: 12/18/2022] Open
Abstract
Supercentenarians (age 110+ years old) generally delay or escape age-related diseases and disability well beyond the age of 100 and this exceptional survival is likely to be influenced by a genetic predisposition that includes both common and rare genetic variants. In this report, we describe the complete genomic sequences of male and female supercentenarians, both age >114 years old. We show that: (1) the sequence variant spectrum of these two individuals' DNA sequences is largely comparable to existing non-supercentenarian genomes; (2) the two individuals do not appear to carry most of the well-established human longevity enabling variants already reported in the literature; (3) they have a comparable number of known disease-associated variants relative to most human genomes sequenced to-date; (4) approximately 1% of the variants these individuals possess are novel and may point to new genes involved in exceptional longevity; and (5) both individuals are enriched for coding variants near longevity-associated variants that we discovered through a large genome-wide association study. These analyses suggest that there are both common and rare longevity-associated variants that may counter the effects of disease-predisposing variants and extend lifespan. The continued analysis of the genomes of these and other rare individuals who have survived to extremely old ages should provide insight into the processes that contribute to the maintenance of health during extreme aging.
Collapse
Affiliation(s)
- Paola Sebastiani
- Department of Biostatistics, Boston University School of Public Health Boston, MA, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|