1
|
Allen J, Abdiwahab E, Morris MD, Le Saux CJ, Betancur P, Ansel KM, Hernandez RD, Nystul TG. PROPEL: a scalable model for postbaccalaureate training to promote diversity in the biomedical workforce. JOURNAL OF MICROBIOLOGY & BIOLOGY EDUCATION 2024:e0012224. [PMID: 39254307 DOI: 10.1128/jmbe.00122-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Accepted: 08/18/2024] [Indexed: 09/11/2024]
Abstract
Promoting diversity in the scientific workforce is crucial for harnessing the potential of available talent and ensuring equitable access to Science, Technology, Engineering, Mathematics, and Medicine (STEM-M) careers. We have developed an innovative program called Postbaccalaureate Research Opportunity to Promote Equity in Learning (PROPEL) that provides scientific and career development training for postbaccalaureate scholars from historically excluded backgrounds in STEM-M fields with an interest in pursuing a PhD or MD/PhD degree. Our program is distinct from other postbaccalaureate programs in that scholars are hired by individual labs rather than funded centrally by the program. This funding mechanism removes the idea that central funding is necessary to encourage faculty to train diverse scholars and allows the program to scale dynamically according to the needs of the scientific community. The PROPEL program started in 2020 with six scholars and has since grown to an enrollment of over 100, making it the largest postbaccalaureate program for biomedical research in the country. Here, we describe the program structure and curriculum, our strategy for recruitment, the enrollment trends, the program demographics, metrics of scholar engagement, and outcomes for scholars who completed the program in 2023. Our experience demonstrates the strong demand from both scholars and faculty for programming of this type and describes the feasibility of implementation.
Collapse
|
2
|
Shore N, Gazi M, Pieczonka C, Heron S, Modh R, Cahn D, Belkoff LH, Berger A, Mazzarella B, Veys J, Idom C, Morris D, Jayram G, Engelman A, Bukkapatnam R, Dato P, Bevan-Thomas R, Cornell R, Wise DR, Hardwick MK, Hernandez RD, Rojahn S, Layman P, Hatchell KE, Heald B, Nussbaum RL, Nielsen SM, Esplin ED. Efficacy of National Comprehensive Cancer Network Guidelines in Identifying Pathogenic Germline Variants Among Unselected Patients with Prostate Cancer: The PROCLAIM Trial. Eur Urol Oncol 2023; 6:477-483. [PMID: 37574391 DOI: 10.1016/j.euo.2023.07.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 06/07/2023] [Accepted: 07/12/2023] [Indexed: 08/15/2023]
Abstract
BACKGROUND Prostate cancer (PCa) patients with pathogenic/likely pathogenic germline variants (PGVs) in cancer predisposition genes may be eligible for U.S. Food and Drug Administration-approved targeted therapies, clinical trials, or enhanced screening. Studies suggest that eligible patients are missing genetics-informed care due to restrictive testing criteria. OBJECTIVE To establish the prevalence of actionable PGVs among prospectively accrued, unselected PCa patients, stratified by their guideline eligibility. DESIGN, SETTING, AND PARTICIPANTS Consecutive, unselected PCa patients were enrolled at 15 sites in the USA from October 2019 to August 2021, and had multigene cancer panel testing. OUTCOME MEASUREMENTS AND STATISTICAL ANALYSIS Correlates between the prevalence of PGVs and clinician-reported demographic and clinical characteristics were examined. RESULTS AND LIMITATIONS Among 958 patients (median [quartiles] age at diagnosis 65 [60, 71] yr), 627 (65%) had low- or intermediate-risk disease (grade group 1, 2, or 3). A total of 77 PGVs in 17 genes were identified in 74 patients (7.7%, 95% confidence interval [CI] 6.2-9.6%). No significant difference was found in the prevalence of PGVs among patients who met the 2019 National Comprehensive Cancer Network Prostate criteria (8.8%, 43/486, 95% CI 6.6-12%) versus those who did not (6.6%, 31/472, 95% CI 4.6-9.2%; odds ratio 1.38, 95% CI 0.85-2.23), indicating that these criteria would miss 42% of patients (31/74, 95% CI 31-53%) with PGVs. The criteria were less effective at predicting PGVs in patients from under-represented populations. Most PGVs (81%, 60/74) were potentially clinically actionable. Limitations include the inability to stratify analyses based on individual ethnicity due to low numbers of non-White patients with PGVs. CONCLUSIONS Our results indicate that almost half of PCa patients with PGVs are missed by current testing guidelines. Comprehensive germline genetic testing should be offered to all patients with PCa. PATIENT SUMMARY One in 13 patients with prostate cancer carries an inherited variant that may be actionable for the patient's current care or prevention of future cancer, and could benefit from expanded testing criteria.
Collapse
|
3
|
Torgerson D, Guardado M, Steurer M, Chapin C, Hernandez RD, Ballard PL. The hydrocortisone-responsive urinary metabolome of premature infants. Pediatr Res 2023; 94:1317-1326. [PMID: 37138028 PMCID: PMC10589081 DOI: 10.1038/s41390-023-02610-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 03/21/2023] [Accepted: 04/01/2023] [Indexed: 05/05/2023]
Abstract
BACKGROUND Extremely premature infants are at risk for circulatory collapse or respiratory failure that are often treated with hydrocortisone (HC); however, there is no information on the metabolic consequences of this therapy. METHODS Longitudinal urine samples from infants <28 weeks gestation in the Trial of Late Surfactant were analyzed by untargeted UHPLC:MS/MS. Fourteen infants who received a tapering course of HC beginning at 3 mg/kg/day for ≥9 days were compared to 14 matched control infants. A secondary cross-sectional analysis by logistic regression used urines from 314 infants. RESULTS Of 1145 urinary metabolites detected, abundance of 219, representing all the major biochemical pathways, changed at p < 0.05 in the HC-treated group with 90% decreasing; 3 cortisol derivatives increased ~2-fold with HC therapy. Only 11% of regulated metabolites remained responsive at the lowest HC dose. Regulated metabolites included two steroids and thiamin that are associated with lung inflammation in infants. HC responsiveness was confirmed in 57% of metabolites by cross-sectional analysis. CONCLUSIONS HC treatment of premature infants influenced in a dose-dependent manner abundance of 19% of identified urinary metabolites of diverse biochemical systems, primarily reducing concentrations. These findings indicate that exposure to HC reversibly impacts the nutritional status of premature infants. IMPACT Hydrocortisone treatment of premature infants with respiratory failure or circulatory collapse alters levels of a subset of urinary metabolites representing all major biochemical pathways. This is the first description of the scope, magnitude, timing and reversibility of metabolomic changes in infants in response to hydrocortisone, and it confirms corticosteroid regulation of three biochemicals that are associated with lung inflammatory status. The findings indicate a dose-dependency of hydrocortisone for metabolomic and anti-inflammatory effects, that prolonged therapy may lower the supply of many nutrients, and that monitoring concentrations of cortisol and inflammation markers may be a useful clinical approach during corticosteroid therapy.
Collapse
|
4
|
Cunnane BT, Sinha U, Malis V, Hernandez RD, Smitaman E, Sinha S. Effect of different ankle joint positions on medial gastrocnemius muscle fiber strains during isometric plantarflexion. Sci Rep 2023; 13:14986. [PMID: 37696877 PMCID: PMC10495375 DOI: 10.1038/s41598-023-41127-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 08/22/2023] [Indexed: 09/13/2023] Open
Abstract
Muscle force production is influenced by muscle fiber and aponeurosis architecture. This prospective cohort study utilizes special MR imaging sequences to examine the structure-function in-vivo in the Medial Gastrocnemius (MG) at three-ankle angles (dorsiflexion, plantar flexion-low and high) and two sub-maximal levels of maximum voluntary contraction (25% and 50%MVC). The study was performed on 6 young male participants. Muscle fiber and aponeurosis strain, fiber strain normalized to force, fiber length and pennation angle (at rest and peak contraction) were analyzed for statistical differences between ankle positions and %MVC. A two-way repeated measures ANOVA and post hoc Bonferroni-adjusted tests were conducted for normal data. A related samples test with Friedman's 2-way ANOVA by ranks with corrections for multiple comparisons was conducted for non-normal data. The dorsiflexed ankle position generated significantly higher force with lower fiber strain than the plantarflexed positions. Sarcomere length extracted from muscle fiber length at each ankle angle was used to track the location on the Force-Length curve and showed the MG operates on the curve's ascending limb. Muscle force changes predicted from the F-L curve going from dorsi- to plantarflexion was less than that experimentally observed suggesting other determinants of force changes with ankle position.
Collapse
|
5
|
Guardado M, Steurer M, Chapin C, Hernandez RD, Ballard PL, Torgerson D. The Urinary Metabolomic Fingerprint in Extremely Preterm Infants on Total Parenteral Nutrition vs. Enteral Feeds. Metabolites 2023; 13:971. [PMID: 37755251 PMCID: PMC10537655 DOI: 10.3390/metabo13090971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 08/18/2023] [Accepted: 08/21/2023] [Indexed: 09/28/2023] Open
Abstract
Total Parenteral Nutrition (TPN), which uses intravenous administration of nutrients, minerals and vitamins, is essential for sustaining premature infants until they transition to enteral feeds, but there is limited information on metabolomic differences between infants on TPN and enteral feeds. We performed untargeted global metabolomics on urine samples collected between 23-30 days of life from 314 infants born <29 weeks gestational age from the TOLSURF and PROP cohorts. Principal component analysis across all metabolites showed a separation of infants solely on TPN compared to infants who had transitioned to enteral feeds, indicating global metabolomic differences between infants based on feeding status. Among 913 metabolites that passed quality control filters, 609 varied in abundance between infants on TPN vs. enteral feeds at p < 0.05. Of these, 88% were in the direction of higher abundance in the urine of infants on enteral feeds. In a subset of infants in a longitudinal analysis, both concurrent and delayed changes in metabolite levels were observed with the initiation of enteral feeds. These infants had higher concentrations of essential amino acids, lipids, and vitamins, which are necessary for growth and development, suggesting the nutritional benefit of an enteral feeding regimen.
Collapse
|
6
|
Laurent SA, Strauli NB, Eggers EL, Wu H, Michel B, Demuth S, Palanichamy A, Wilson MR, Sirota M, Hernandez RD, Cree BAC, Herman AE, von Büdingen HC. Effect of Ocrelizumab on B- and T-Cell Receptor Repertoire Diversity in Patients With Relapsing Multiple Sclerosis From the Randomized Phase III OPERA Trial. NEUROLOGY(R) NEUROIMMUNOLOGY & NEUROINFLAMMATION 2023; 10:e200118. [PMID: 37094998 PMCID: PMC10136682 DOI: 10.1212/nxi.0000000000200118] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 02/22/2023] [Indexed: 04/26/2023]
Abstract
BACKGROUND AND OBJECTIVES The B cell-depleting anti-CD20 antibody ocrelizumab (OCR) effectively reduces MS disease activity and slows disability progression. Given the role of B cells as antigen-presenting cells, the primary goal of this study was to evaluate the effect of OCR on the T-cell receptor repertoire diversity. METHODS To examine whether OCR substantially alters the molecular diversity of the T-cell receptor repertoire, deep immune repertoire sequencing (RepSeq) of CD4+ and CD8+ T-cell receptor β-chain variable regions was performed on longitudinal blood samples. The IgM and IgG heavy chain variable region repertoire was also analyzed to characterize the residual B-cell repertoire under OCR treatment. RESULTS Peripheral blood samples for RepSeq were obtained from 8 patients with relapsing MS enrolled in the OPERA I trial over a period of up to 39 months. Four patients each were treated with OCR or interferon β1-a during the double-blind period of OPERA I. All patients received OCR during the open-label extension. The diversity of the CD4+/CD8+ T-cell repertoires remained unaffected in OCR-treated patients. The expected OCR-associated B-cell depletion was mirrored by reduced B-cell receptor diversity in peripheral blood and a shift in immunoglobulin gene usage. Despite deep B-cell depletion, longitudinal persistence of clonally related B-cells was observed. DISCUSSION Our data illustrate that the diversity of CD4+/CD8+ T-cell receptor repertoires remained unaltered in OCR-treated patients with relapsing MS. Persistence of a highly diverse T-cell repertoire suggests that aspects of adaptive immunity remain intact despite extended anti-CD20 therapy. TRIAL REGISTRATION INFORMATION This is a substudy (BE29353) of the OPERA I (WA21092; NCT01247324) trial. Date of registration, November 23, 2010; first patient enrollment, August 31, 2011.
Collapse
|
7
|
Wainschtein P, Jain D, Zheng Z, Cupples LA, Shadyab AH, McKnight B, Shoemaker BM, Mitchell BD, Psaty BM, Kooperberg C, Liu CT, Albert CM, Roden D, Chasman DI, Darbar D, Lloyd-Jones DM, Arnett DK, Regan EA, Boerwinkle E, Rotter JI, O'Connell JR, Yanek LR, de Andrade M, Allison MA, McDonald MLN, Chung MK, Fornage M, Chami N, Smith NL, Ellinor PT, Vasan RS, Mathias RA, Loos RJF, Rich SS, Lubitz SA, Heckbert SR, Redline S, Guo X, Chen YDI, Laurie CA, Hernandez RD, McGarvey ST, Goddard ME, Laurie CC, North KE, Lange LA, Weir BS, Yengo L, Yang J, Visscher PM. Assessing the contribution of rare variants to complex trait heritability from whole-genome sequence data. Nat Genet 2022; 54:263-273. [PMID: 35256806 PMCID: PMC9119698 DOI: 10.1038/s41588-021-00997-7] [Citation(s) in RCA: 141] [Impact Index Per Article: 70.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 12/01/2021] [Indexed: 12/20/2022]
Abstract
Analyses of data from genome-wide association studies on unrelated individuals have shown that, for human traits and diseases, approximately one-third to two-thirds of heritability is captured by common SNPs. However, it is not known whether the remaining heritability is due to the imperfect tagging of causal variants by common SNPs, in particular whether the causal variants are rare, or whether it is overestimated due to bias in inference from pedigree data. Here we estimated heritability for height and body mass index (BMI) from whole-genome sequence data on 25,465 unrelated individuals of European ancestry. The estimated heritability was 0.68 (standard error 0.10) for height and 0.30 (standard error 0.10) for body mass index. Low minor allele frequency variants in low linkage disequilibrium (LD) with neighboring variants were enriched for heritability, to a greater extent for protein-altering variants, consistent with negative selection. Our results imply that rare variants, in particular those in regions of low linkage disequilibrium, are a major source of the still missing heritability of complex traits and disease.
Collapse
|
8
|
Seplyarskiy VB, Soldatov RA, Koch E, McGinty RJ, Goldmann JM, Hernandez RD, Barnes K, Correa A, Burchard EG, Ellinor PT, McGarvey ST, Mitchell BD, Vasan RS, Redline S, Silverman E, Weiss ST, Arnett DK, Blangero J, Boerwinkle E, He J, Montgomery C, Rao DC, Rotter JI, Taylor KD, Brody JA, Chen YDI, de las Fuentes L, Hwu CM, Rich SS, Manichaikul AW, Mychaleckyj JC, Palmer ND, Smith JA, Kardia SLR, Peyser PA, Bielak LF, O'Connor TD, Emery LS, Gilissen C, Wong WSW, Kharchenko PV, Sunyaev S. Population sequencing data reveal a compendium of mutational processes in the human germ line. Science 2021; 373:1030-1035. [PMID: 34385354 PMCID: PMC9217108 DOI: 10.1126/science.aba7408] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Accepted: 07/14/2021] [Indexed: 12/16/2022]
Abstract
Biological mechanisms underlying human germline mutations remain largely unknown. We statistically decompose variation in the rate and spectra of mutations along the genome using volume-regularized nonnegative matrix factorization. The analysis of a sequencing dataset (TOPMed) reveals nine processes that explain the variation in mutation properties between loci. We provide a biological interpretation for seven of these processes. We associate one process with bulky DNA lesions that are resolved asymmetrically with respect to transcription and replication. Two processes track direction of replication fork and replication timing, respectively. We identify a mutagenic effect of active demethylation primarily acting in regulatory regions and a mutagenic effect of long interspersed nuclear elements. We localize a mutagenic process specific to oocytes from population sequencing data. This process appears transcriptionally asymmetric.
Collapse
|
9
|
Claesen J, Spagnolo JB, Ramos SF, Kurita KL, Byrd AL, Aksenov AA, Melnik AV, Wong WR, Wang S, Hernandez RD, Donia MS, Dorrestein PC, Kong HH, Segre JA, Linington RG, Fischbach MA, Lemon KP. A Cutibacterium acnes antibiotic modulates human skin microbiota composition in hair follicles. Sci Transl Med 2021; 12:12/570/eaay5445. [PMID: 33208503 DOI: 10.1126/scitranslmed.aay5445] [Citation(s) in RCA: 88] [Impact Index Per Article: 29.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2019] [Revised: 07/17/2019] [Accepted: 10/30/2020] [Indexed: 12/11/2022]
Abstract
The composition of the skin microbiota varies widely among individuals when sampled at the same body site. A key question is which molecular factors determine strain-level variability within sub-ecosystems of the skin microbiota. Here, we used a genomics-guided approach to identify an antibacterial biosynthetic gene cluster in Cutibacterium acnes (formerly Propionibacterium acnes), a human skin commensal bacterium that is widely distributed across individuals and skin sites. Experimental characterization of this biosynthetic gene cluster resulted in identification of a new thiopeptide antibiotic, cutimycin. Analysis of individual human skin hair follicles revealed that cutimycin contributed to the ecology of the skin hair follicle microbiota and helped to reduce colonization of skin hair follicles by Staphylococcus species.
Collapse
|
10
|
Taliun D, Harris DN, Kessler MD, Carlson J, Szpiech ZA, Torres R, Taliun SAG, Corvelo A, Gogarten SM, Kang HM, Pitsillides AN, LeFaive J, Lee SB, Tian X, Browning BL, Das S, Emde AK, Clarke WE, Loesch DP, Shetty AC, Blackwell TW, Smith AV, Wong Q, Liu X, Conomos MP, Bobo DM, Aguet F, Albert C, Alonso A, Ardlie KG, Arking DE, Aslibekyan S, Auer PL, Barnard J, Barr RG, Barwick L, Becker LC, Beer RL, Benjamin EJ, Bielak LF, Blangero J, Boehnke M, Bowden DW, Brody JA, Burchard EG, Cade BE, Casella JF, Chalazan B, Chasman DI, Chen YDI, Cho MH, Choi SH, Chung MK, Clish CB, Correa A, Curran JE, Custer B, Darbar D, Daya M, de Andrade M, DeMeo DL, Dutcher SK, Ellinor PT, Emery LS, Eng C, Fatkin D, Fingerlin T, Forer L, Fornage M, Franceschini N, Fuchsberger C, Fullerton SM, Germer S, Gladwin MT, Gottlieb DJ, Guo X, Hall ME, He J, Heard-Costa NL, Heckbert SR, Irvin MR, Johnsen JM, Johnson AD, Kaplan R, Kardia SLR, Kelly T, Kelly S, Kenny EE, Kiel DP, Klemmer R, Konkle BA, Kooperberg C, Köttgen A, Lange LA, Lasky-Su J, Levy D, Lin X, Lin KH, Liu C, Loos RJF, Garman L, Gerszten R, Lubitz SA, Lunetta KL, Mak ACY, Manichaikul A, Manning AK, Mathias RA, McManus DD, McGarvey ST, Meigs JB, Meyers DA, Mikulla JL, Minear MA, Mitchell BD, Mohanty S, Montasser ME, Montgomery C, Morrison AC, Murabito JM, Natale A, Natarajan P, Nelson SC, North KE, O'Connell JR, Palmer ND, Pankratz N, Peloso GM, Peyser PA, Pleiness J, Post WS, Psaty BM, Rao DC, Redline S, Reiner AP, Roden D, Rotter JI, Ruczinski I, Sarnowski C, Schoenherr S, Schwartz DA, Seo JS, Seshadri S, Sheehan VA, Sheu WH, Shoemaker MB, Smith NL, Smith JA, Sotoodehnia N, Stilp AM, Tang W, Taylor KD, Telen M, Thornton TA, Tracy RP, Van Den Berg DJ, Vasan RS, Viaud-Martinez KA, Vrieze S, Weeks DE, Weir BS, Weiss ST, Weng LC, Willer CJ, Zhang Y, Zhao X, Arnett DK, Ashley-Koch AE, Barnes KC, Boerwinkle E, Gabriel S, Gibbs R, Rice KM, Rich SS, Silverman EK, Qasba P, Gan W, Papanicolaou GJ, Nickerson DA, Browning SR, Zody MC, Zöllner S, Wilson JG, Cupples LA, Laurie CC, Jaquish CE, Hernandez RD, O'Connor TD, Abecasis GR. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature 2021; 590:290-299. [PMID: 33568819 PMCID: PMC7875770 DOI: 10.1038/s41586-021-03205-y] [Citation(s) in RCA: 978] [Impact Index Per Article: 326.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Accepted: 01/07/2021] [Indexed: 02/08/2023]
Abstract
The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.
Collapse
|
11
|
Spear ML, Diaz-Papkovich A, Ziv E, Yracheta JM, Gravel S, Torgerson DG, Hernandez RD. Recent shifts in the genomic ancestry of Mexican Americans may alter the genetic architecture of biomedical traits. eLife 2020; 9:e56029. [PMID: 33372659 PMCID: PMC7771964 DOI: 10.7554/elife.56029] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Accepted: 12/13/2020] [Indexed: 11/13/2022] Open
Abstract
People in the Americas represent a diverse continuum of populations with varying degrees of admixture among African, European, and Amerindigenous ancestries. In the United States, populations with non-European ancestry remain understudied, and thus little is known about the genetic architecture of phenotypic variation in these populations. Using genotype data from the Hispanic Community Health Study/Study of Latinos, we find that Amerindigenous ancestry increased by an average of ~20% spanning 1940s-1990s in Mexican Americans. These patterns result from complex interactions between several population and cultural factors which shaped patterns of genetic variation and influenced the genetic architecture of complex traits in Mexican Americans. We show for height how polygenic risk scores based on summary statistics from a European-based genome-wide association study perform poorly in Mexican Americans. Our findings reveal temporal changes in population structure within Hispanics/Latinos that may influence biomedical traits, demonstrating a need to improve our understanding of admixed populations.
Collapse
|
12
|
Tong DMH, Hernandez RD. Population genetic simulation study of power in association testing across genetic architectures and study designs. Genet Epidemiol 2020; 44:90-103. [PMID: 31587362 PMCID: PMC6980249 DOI: 10.1002/gepi.22264] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Revised: 08/26/2019] [Accepted: 09/16/2019] [Indexed: 12/22/2022]
Abstract
While it is well established that genetics can be a major contributor to population variation of complex traits, the relative contributions of rare and common variants to phenotypic variation remains a matter of considerable debate. Here, we simulate genetic and phenotypic data across different case/control panel sampling strategies, sequencing methods, and genetic architecture models based on evolutionary forces to determine the statistical performance of rare variant association tests (RVATs) widely in use. We find that the highest statistical power of RVATs is achieved by sampling case/control individuals from the extremes of an underlying quantitative trait distribution. We also demonstrate that the use of genotyping arrays, in conjunction with imputation from a whole-genome sequenced (WGS) reference panel, recovers the vast majority (90%) of the power that could be achieved by sequencing the case/control panel using current tools. Finally, we show that for dichotomous traits, the statistical performance of RVATs decreases as rare variants become more important in the trait architecture. Our results extend previous work to show that RVATs are insufficiently powered to make generalizable conclusions about the role of rare variants in dichotomous complex traits.
Collapse
|
13
|
Szpiech ZA, Mak ACY, White MJ, Hu D, Eng C, Burchard EG, Hernandez RD. Ancestry-Dependent Enrichment of Deleterious Homozygotes in Runs of Homozygosity. Am J Hum Genet 2019; 105:747-762. [PMID: 31543216 PMCID: PMC6817522 DOI: 10.1016/j.ajhg.2019.08.011] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Accepted: 08/27/2019] [Indexed: 12/20/2022] Open
Abstract
Runs of homozygosity (ROH) are important genomic features that manifest when an individual inherits two haplotypes that are identical by descent. Their length distributions are informative about population history, and their genomic locations are useful for mapping recessive loci contributing to both Mendelian and complex disease risk. We have previously shown that ROH, and especially long ROH that are likely the result of recent parental relatedness, are enriched for homozygous deleterious coding variation in a worldwide sample of outbred individuals. However, the distribution of ROH in admixed populations and their relationship to deleterious homozygous genotypes is understudied. Here we analyze whole-genome sequencing data from 1,441 unrelated individuals from self-identified African American, Puerto Rican, and Mexican American populations. These populations are three-way admixed between European, African, and Native American ancestries and provide an opportunity to study the distribution of deleterious alleles partitioned by local ancestry and ROH. We re-capitulate previous findings that long ROH are enriched for deleterious variation genome-wide. We then partition by local ancestry and show that deleterious homozygotes arise at a higher rate when ROH overlap African ancestry segments than when they overlap European or Native American ancestry segments of the genome. These results suggest that, while ROH on any haplotype background are associated with an inflation of deleterious homozygous variation, African haplotype backgrounds may play a particularly important role in the genetic architecture of complex diseases for admixed individuals, highlighting the need for further study of these populations.
Collapse
|
14
|
Wainschtein P, Jain DP, Yengo L, Zheng Z, Cupples LA, Shadyab AH, McKnight B, Shoemaker BM, Mitchell BD, Psaty BM, Kooperberg C, Roden D, Darbar D, Arnett DK, Regan EA, Boerwinkle E, Rotter JI, Allison MA, McDonald MLN, Chung MK, Smith NL, Ellinor PT, Vasan RS, Mathias RA, Rich SS, Heckbert SR, Redline S, Guo X, Chen YDI, Liu CT, Andrade MD, Yanek LR, Albert CM, Hernandez RD, McGarvey ST, North KE, Lange LA, Weir BS, Laurie CC, Yang J, Visscher PM. Recovery of trait heritability from whole genome sequence data. ACTA ACUST UNITED AC 2019. [DOI: 10.1530/ey.16.14.15] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
15
|
Daya M, Rafaels N, Brunetti TM, Chavan S, Levin AM, Shetty A, Gignoux CR, Boorgula MP, Wojcik G, Campbell M, Vergara C, Torgerson DG, Ortega VE, Doumatey A, Johnston HR, Acevedo N, Araujo MI, Avila PC, Belbin G, Bleecker E, Bustamante C, Caraballo L, Cruz A, Dunston GM, Eng C, Faruque MU, Ferguson TS, Figueiredo C, Ford JG, Gan W, Gourraud PA, Hansel NN, Hernandez RD, Herrera-Paz EF, Jiménez S, Kenny EE, Knight-Madden J, Kumar R, Lange LA, Lange EM, Lizee A, Maul P, Maul T, Mayorga A, Meyers D, Nicolae DL, O'Connor TD, Oliveira RR, Olopade CO, Olopade O, Qin ZS, Rotimi C, Vince N, Watson H, Wilks RJ, Wilson JG, Salzberg S, Ober C, Burchard EG, Williams LK, Beaty TH, Taub MA, Ruczinski I, Mathias RA, Barnes KC. Author Correction: Association study in African-admixed populations across the Americas recapitulates asthma risk loci in non-African populations. Nat Commun 2019; 10:4082. [PMID: 31484942 PMCID: PMC6726619 DOI: 10.1038/s41467-019-12158-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
Collapse
|
16
|
Hernandez RD, Uricchio LH, Hartman K, Ye C, Dahl A, Zaitlen N. Ultrarare variants drive substantial cis heritability of human gene expression. Nat Genet 2019; 51:1349-1355. [PMID: 31477931 PMCID: PMC6730564 DOI: 10.1038/s41588-019-0487-7] [Citation(s) in RCA: 72] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2018] [Accepted: 07/08/2019] [Indexed: 11/09/2022]
Abstract
The vast majority of human mutations have minor allele frequencies under 1%, with the plurality observed only once (that is, 'singletons'). While Mendelian diseases are predominantly caused by rare alleles, their cumulative contribution to complex phenotypes is largely unknown. We develop and rigorously validate an approach to jointly estimate the contribution of all alleles, including singletons, to phenotypic variation. We apply our approach to transcriptional regulation, an intermediate between genetic variation and complex disease. Using whole-genome DNA and lymphoblastoid cell line RNA sequencing data from 360 European individuals, we conservatively estimate that singletons contribute approximately 25% of cis heritability across genes (dwarfing the contributions of other frequencies). The majority (approximately 76%) of singleton heritability derives from ultrarare variants absent from thousands of additional samples. We develop an inference procedure to demonstrate that our results are consistent with pervasive purifying selection shaping the regulatory architecture of most human genes.
Collapse
|
17
|
Gignoux CR, Torgerson DG, Pino-Yanes M, Uricchio LH, Galanter J, Roth LA, Eng C, Hu D, Nguyen EA, Huntsman S, Mathias RA, Kumar R, Rodriguez-Santana J, Thakur N, Oh SS, McGarry M, Moreno-Estrada A, Sandoval K, Winkler CA, Seibold MA, Padhukasahasram B, Conti DV, Farber HJ, Avila P, Brigino-Buenaventura E, Lenoir M, Meade K, Serebrisky D, Borrell LN, Rodriguez-Cintron W, Thyne S, Joubert BR, Romieu I, Levin AM, Sienra-Monge JJ, Del Rio-Navarro BE, Gan W, Raby BA, Weiss ST, Bleecker E, Meyers DA, Martinez FJ, Gauderman WJ, Gilliland F, London SJ, Bustamante CD, Nicolae DL, Ober C, Sen S, Barnes K, Williams LK, Hernandez RD, Burchard EG. An admixture mapping meta-analysis implicates genetic variation at 18q21 with asthma susceptibility in Latinos. J Allergy Clin Immunol 2019; 143:957-969. [PMID: 30201514 PMCID: PMC6927816 DOI: 10.1016/j.jaci.2016.08.057] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2015] [Revised: 08/20/2016] [Accepted: 08/29/2016] [Indexed: 12/13/2022]
Abstract
BACKGROUND Asthma is a common but complex disease with racial/ethnic differences in prevalence, morbidity, and response to therapies. OBJECTIVE We sought to perform an analysis of genetic ancestry to identify new loci that contribute to asthma susceptibility. METHODS We leveraged the mixed ancestry of 3902 Latinos and performed an admixture mapping meta-analysis for asthma susceptibility. We replicated associations in an independent study of 3774 Latinos, performed targeted sequencing for fine mapping, and tested for disease correlations with gene expression in the whole blood of more than 500 subjects from 3 racial/ethnic groups. RESULTS We identified a genome-wide significant admixture mapping peak at 18q21 in Latinos (P = 6.8 × 10-6), where Native American ancestry was associated with increased risk of asthma (odds ratio [OR], 1.20; 95% CI, 1.07-1.34; P = .002) and European ancestry was associated with protection (OR, 0.86; 95% CI, 0.77-0.96; P = .008). Our findings were replicated in an independent childhood asthma study in Latinos (P = 5.3 × 10-3, combined P = 2.6 × 10-7). Fine mapping of 18q21 in 1978 Latinos identified a significant association with multiple variants 5' of SMAD family member 2 (SMAD2) in Mexicans, whereas a single rare variant in the same window was the top association in Puerto Ricans. Low versus high SMAD2 blood expression was correlated with case status (13.4% lower expression; OR, 3.93; 95% CI, 2.12-7.28; P < .001). In addition, lower expression of SMAD2 was associated with more frequent exacerbations among Puerto Ricans with asthma. CONCLUSION Ancestry at 18q21 was significantly associated with asthma in Latinos and implicated multiple ancestry-informative noncoding variants upstream of SMAD2 with asthma susceptibility. Furthermore, decreased SMAD2 expression in blood was strongly associated with increased asthma risk and increased exacerbations.
Collapse
|
18
|
Torres R, Szpiech ZA, Hernandez RD. Correction: Human demographic history has amplified the effects of background selection across the genome. PLoS Genet 2019; 15:e1007898. [PMID: 30601801 PMCID: PMC6314599 DOI: 10.1371/journal.pgen.1007898] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
19
|
Spear ML, Hu D, Pino-Yanes M, Huntsman S, Eng C, Levin AM, Ortega VE, White MJ, McGarry ME, Thakur N, Galanter J, Mak ACY, Oh SS, Ampleford E, Peters SP, Davis A, Kumar R, Farber HJ, Meade K, Avila PC, Serebrisky D, Lenoir MA, Brigino-Buenaventura E, Cintron WR, Thyne SM, Rodriguez-Santana JR, Ford JG, Chapela R, Estrada AM, Sandoval K, Seibold MA, Winkler CA, Bleecker ER, Myers DA, Williams LK, Hernandez RD, Torgerson DG, Burchard EG. A genome-wide association and admixture mapping study of bronchodilator drug response in African Americans with asthma. THE PHARMACOGENOMICS JOURNAL 2018; 19:249-259. [PMID: 30206298 PMCID: PMC6414286 DOI: 10.1038/s41397-018-0042-4] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Revised: 06/08/2018] [Accepted: 06/19/2018] [Indexed: 01/15/2023]
Abstract
Short-acting β2-adrenergic receptor agonists (SABAs) are the most commonly prescribed asthma medications worldwide. Response to SABAs is measured as bronchodilator drug response (BDR), which varies among racial/ethnic groups in the U.S1, 2. However, the genetic variation that contributes to BDR is largely undefined in African Americans with asthma3. To identify genetic variants that may contribute to differences in BDR in African Americans with asthma, we performed a genome-wide association study (GWAS) of BDR in 949 African American children with asthma, genotyped with the Axiom World Array 4 (Affymetrix, Santa Clara, CA) followed by imputation using 1000 Genomes phase III genotypes. We used linear regression models adjusting for age, sex, body mass index (BMI) and genetic ancestry to test for an association between BDR and genotype at single nucleotide polymorphisms (SNPs). To increase power and distinguish between shared vs. population-specific associations with BDR in children with asthma, we performed a meta-analysis across 949 African Americans and 1,830 Latinos (Total=2,779). Lastly, we performed genome-wide admixture mapping to identify regions whereby local African or European ancestry is associated with BDR in African Americans. We identified a population-specific association with an intergenic SNP on chromosome 9q21 that was significantly associated with BDR (rs73650726, p=7.69×10−9). A trans-ethnic meta-analysis across African Americans and Latinos identified three additional SNPs within the intron of PRKG1 that were significantly associated with BDR (rs7903366, rs7070958, and rs7081864, p≤5×10−8). Our results failed to replicate in three additional populations of 416 Latinos and 1,615 African Americans. Our findings indicate that both population specific and shared genetic variation contributes to differences in BDR in minority children with asthma, and that the genetic underpinnings of BDR may differ between racial/ethnic groups.
Collapse
|
20
|
Fedewa G, Radoshitzky SR, Chī X, Dǒng L, Zeng X, Spear M, Strauli N, Ng M, Chandran K, Stenglein MD, Hernandez RD, Jahrling PB, Kuhn JH, DeRisi JL. Ebola virus, but not Marburg virus, replicates efficiently and without required adaptation in snake cells. Virus Evol 2018; 4:vey034. [PMID: 30524754 PMCID: PMC6277580 DOI: 10.1093/ve/vey034] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Ebola virus (EBOV) disease is a viral hemorrhagic fever with a high case-fatality rate in humans. This disease is caused by four members of the filoviral genus Ebolavirus, including EBOV. The natural hosts reservoirs of ebolaviruses remain to be identified. Glycoprotein 2 of reptarenaviruses, known to infect only boa constrictors and pythons, is similar in sequence and structure to ebolaviral glycoprotein 2, suggesting that EBOV may be able to infect reptilian cells. Therefore, we serially passaged EBOV and a distantly related filovirus, Marburg virus (MARV), in boa constrictor JK cells and characterized viral infection/replication and mutational frequency by confocal imaging and sequencing. We observed that EBOV efficiently infected and replicated in JK cells, but MARV did not. In contrast to most cell lines, EBOV-infected JK cells did not result in an obvious cytopathic effect. Surprisingly, genomic characterization of serial-passaged EBOV in JK cells revealed that genomic adaptation was not required for infection. Deep sequencing coverage (>10,000×) demonstrated the existence of only a single nonsynonymous variant (EBOV glycoprotein precursor pre-GP T544I) of unknown significance within the viral population that exhibited a shift in frequency of at least 10 per cent over six serial passages. In summary, we present the first reptilian cell line that replicates a filovirus at high titers, and for the first time demonstrate a filovirus genus-specific restriction to MARV in a cell line. Our data suggest the possibility that there may be differences between the natural host spectra of ebolaviruses and marburgviruses.
Collapse
|
21
|
Torres R, Szpiech ZA, Hernandez RD. Human demographic history has amplified the effects of background selection across the genome. PLoS Genet 2018; 14:e1007387. [PMID: 29912945 PMCID: PMC6056204 DOI: 10.1371/journal.pgen.1007387] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Revised: 07/23/2018] [Accepted: 04/30/2018] [Indexed: 01/22/2023] Open
Abstract
Natural populations often grow, shrink, and migrate over time. Such demographic processes can affect genome-wide levels of genetic diversity. Additionally, genetic variation in functional regions of the genome can be altered by natural selection, which drives adaptive mutations to higher frequencies or purges deleterious ones. Such selective processes affect not only the sites directly under selection but also nearby neutral variation through genetic linkage via processes referred to as genetic hitchhiking in the context of positive selection and background selection (BGS) in the context of purifying selection. While there is extensive literature examining the consequences of selection at linked sites at demographic equilibrium, less is known about how non-equilibrium demographic processes influence the effects of hitchhiking and BGS. Utilizing a global sample of human whole-genome sequences from the Thousand Genomes Project and extensive simulations, we investigate how non-equilibrium demographic processes magnify and dampen the consequences of selection at linked sites across the human genome. When binning the genome by inferred strength of BGS, we observe that, compared to Africans, non-African populations have experienced larger proportional decreases in neutral genetic diversity in strong BGS regions. We replicate these findings in admixed populations by showing that non-African ancestral components of the genome have also been affected more severely in these regions. We attribute these differences to the strong, sustained/recurrent population bottlenecks that non-Africans experienced as they migrated out of Africa and throughout the globe. Furthermore, we observe a strong correlation between FST and the inferred strength of BGS, suggesting a stronger rate of genetic drift. Forward simulations of human demographic history with a model of BGS support these observations. Our results show that non-equilibrium demography significantly alters the consequences of selection at linked sites and support the need for more work investigating the dynamic process of multiple evolutionary forces operating in concert.
Collapse
|
22
|
Mak ACY, White MJ, Eckalbar WL, Szpiech ZA, Oh SS, Pino-Yanes M, Hu D, Goddard P, Huntsman S, Galanter J, Wu AC, Himes BE, Germer S, Vogel JM, Bunting KL, Eng C, Salazar S, Keys KL, Liberto J, Nuckton TJ, Nguyen TA, Torgerson DG, Kwok PY, Levin AM, Celedón JC, Forno E, Hakonarson H, Sleiman PM, Dahlin A, Tantisira KG, Weiss ST, Serebrisky D, Brigino-Buenaventura E, Farber HJ, Meade K, Lenoir MA, Avila PC, Sen S, Thyne SM, Rodriguez-Cintron W, Winkler CA, Moreno-Estrada A, Sandoval K, Rodriguez-Santana JR, Kumar R, Williams LK, Ahituv N, Ziv E, Seibold MA, Darnell RB, Zaitlen N, Hernandez RD. Whole-Genome Sequencing of Pharmacogenetic Drug Response in Racially Diverse Children with Asthma. Am J Respir Crit Care Med 2018; 197:1552-1564. [PMID: 29509491 PMCID: PMC6006403 DOI: 10.1164/rccm.201712-2529oc] [Citation(s) in RCA: 77] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2017] [Accepted: 03/05/2018] [Indexed: 12/25/2022] Open
Abstract
RATIONALE Albuterol, a bronchodilator medication, is the first-line therapy for asthma worldwide. There are significant racial/ethnic differences in albuterol drug response. OBJECTIVES To identify genetic variants important for bronchodilator drug response (BDR) in racially diverse children. METHODS We performed the first whole-genome sequencing pharmacogenetics study from 1,441 children with asthma from the tails of the BDR distribution to identify genetic association with BDR. MEASUREMENTS AND MAIN RESULTS We identified population-specific and shared genetic variants associated with BDR, including genome-wide significant (P < 3.53 × 10-7) and suggestive (P < 7.06 × 10-6) loci near genes previously associated with lung capacity (DNAH5), immunity (NFKB1 and PLCB1), and β-adrenergic signaling (ADAMTS3 and COX18). Functional analyses of the BDR-associated SNP in NFKB1 revealed potential regulatory function in bronchial smooth muscle cells. The SNP is also an expression quantitative trait locus for a neighboring gene, SLC39A8. The lack of other asthma study populations with BDR and whole-genome sequencing data on minority children makes it impossible to perform replication of our rare variant associations. Minority underrepresentation also poses significant challenges to identify age-matched and population-matched cohorts of sufficient sample size for replication of our common variant findings. CONCLUSIONS The lack of minority data, despite a collaboration of eight universities and 13 individual laboratories, highlights the urgent need for a dedicated national effort to prioritize diversity in research. Our study expands the understanding of pharmacogenetic analyses in racially/ethnically diverse populations and advances the foundation for precision medicine in at-risk and understudied minority populations.
Collapse
|
23
|
Shringarpure SS, Mathias RA, Hernandez RD, O'Connor TD, Szpiech ZA, Torres R, De La Vega FM, Bustamante CD, Barnes KC, Taub MA. Using genotype array data to compare multi- and single-sample variant calls and improve variant call sets from deep coverage whole-genome sequencing data. Bioinformatics 2018; 33:1147-1153. [PMID: 28035032 PMCID: PMC5408850 DOI: 10.1093/bioinformatics/btw786] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2016] [Accepted: 12/07/2016] [Indexed: 12/30/2022] Open
Abstract
Motivation Variant calling from next-generation sequencing (NGS) data is susceptible to false positive calls due to sequencing, mapping and other errors. To better distinguish true from false positive calls, we present a method that uses genotype array data from the sequenced samples, rather than public data such as HapMap or dbSNP, to train an accurate classifier using Random Forests. We demonstrate our method on a set of variant calls obtained from 642 African-ancestry genomes from the Consortium on Asthma among African-ancestry Populations in the Americas (CAAPA), sequenced to high depth (30X). Results We have applied our classifier to compare call sets generated with different calling methods, including both single-sample and multi-sample callers. At a False Positive Rate of 5%, our method determines true positive rates of 97.5%, 95% and 99% on variant calls obtained using Illuminas single-sample caller CASAVA, Real Time Genomics multisample variant caller, and the GATK UnifiedGenotyper, respectively. Since NGS sequencing data may be accompanied by genotype data for the same samples, either collected concurrent to sequencing or from a previous study, our method can be trained on each dataset to provide a more accurate computational validation of site calls compared to generic methods. Moreover, our method allows for adjustment based on allele frequency (e.g. a different set of criteria to determine quality for rare versus common variants) and thereby provides insight into sequencing characteristics that indicate call quality for variants of different frequencies. Availability and Implementation Code is available on Github at: https://github.com/suyashss/variant_validation. Contacts suyashs@stanford.edu or mtaub@jhsph.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
|
24
|
Mangul S, Yang HT, Strauli N, Gruhl F, Porath HT, Hsieh K, Chen L, Daley T, Christenson S, Wesolowska-Andersen A, Spreafico R, Rios C, Eng C, Smith AD, Hernandez RD, Ophoff RA, Santana JR, Levanon EY, Woodruff PG, Burchard E, Seibold MA, Shifman S, Eskin E, Zaitlen N. ROP: dumpster diving in RNA-sequencing to find the source of 1 trillion reads across diverse adult human tissues. Genome Biol 2018; 19:36. [PMID: 29548336 PMCID: PMC5857127 DOI: 10.1186/s13059-018-1403-7] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 02/02/2018] [Indexed: 11/22/2022] Open
Abstract
High-throughput RNA-sequencing (RNA-seq) technologies provide an unprecedented opportunity to explore the individual transcriptome. Unmapped reads are a large and often overlooked output of standard RNA-seq analyses. Here, we present Read Origin Protocol (ROP), a tool for discovering the source of all reads originating from complex RNA molecules. We apply ROP to samples across 2630 individuals from 54 diverse human tissues. Our approach can account for 99.9% of 1 trillion reads of various read length. Additionally, we use ROP to investigate the functional mechanisms underlying connections between the immune system, microbiome, and disease. ROP is freely available at https://github.com/smangul1/rop/wiki.
Collapse
|
25
|
White KA, Ruiz DG, Szpiech ZA, Strauli NB, Hernandez RD, Jacobson MP, Barber DL. Cancer-associated arginine-to-histidine mutations confer a gain in pH sensing to mutant proteins. Sci Signal 2017; 10:10/495/eaam9931. [PMID: 28874603 DOI: 10.1126/scisignal.aam9931] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The intracellular pH (pHi) of most cancers is constitutively higher than that of normal cells and enhances proliferation and cell survival. We found that increased pHi enabled the tumorigenic behaviors caused by somatic arginine-to-histidine mutations, which are frequent in cancer and confer pH sensing not seen with wild-type proteins. Experimentally raising the pHi increased the activity of R776H mutant epidermal growth factor receptor (EGFR-R776H), thereby increasing proliferation and causing transformation in fibroblasts. An Arg-to-Gly mutation did not confer these effects. Molecular dynamics simulations of EGFR suggested that decreased protonation of His776 at high pH causes conformational changes in the αC helix that may stabilize the active form of the kinase. An Arg-to-His, but not Arg-to-Lys, mutation in the transcription factor p53 (p53-R273H) decreased its transcriptional activity and attenuated the DNA damage response in fibroblasts and breast cancer cells with high pHi. Lowering pHi attenuated the tumorigenic effects of both EGFR-R776H and p53-R273H. Our data suggest that some somatic mutations may confer a fitness advantage to the higher pHi of cancer cells.
Collapse
|