26
|
Hayes VM, Jaratlerdsiri W, Bornman MSR. Prostate cancer genomics and racial health disparity. Oncotarget 2018; 9:36650-36651. [PMID: 30613347 PMCID: PMC6291177 DOI: 10.18632/oncotarget.26399] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Accepted: 11/20/2018] [Indexed: 12/31/2022] Open
|
27
|
Hayes VM, Bornman MSR. Prostate Cancer in Southern Africa: Does Africa Hold Untapped Potential to Add Value to the Current Understanding of a Common Disease? J Glob Oncol 2018; 4:1-7. [PMID: 30241160 PMCID: PMC6223485 DOI: 10.1200/jgo.2016.008862] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
|
28
|
Jaratlerdsiri W, Chan EK, Gong T, Petersen DC, Kalsbeek AM, Venter PA, Stricker PD, Bornman MR, Hayes VM. Whole-Genome Sequencing Reveals Elevated Tumor Mutational Burden and Initiating Driver Mutations in African Men with Treatment-Naïve, High-Risk Prostate Cancer. Cancer Res 2018; 78:6736-6746. [DOI: 10.1158/0008-5472.can-18-0254] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2018] [Revised: 06/05/2018] [Accepted: 09/04/2018] [Indexed: 11/16/2022]
|
29
|
Nair SS, Luu PL, Qu W, Maddugoda M, Huschtscha L, Reddel R, Chenevix-Trench G, Toso M, Kench JG, Horvath LG, Hayes VM, Stricker PD, Hughes TP, White DL, Rasko JEJ, Wong JJL, Clark SJ. Guidelines for whole genome bisulphite sequencing of intact and FFPET DNA on the Illumina HiSeq X Ten. Epigenetics Chromatin 2018; 11:24. [PMID: 29807544 PMCID: PMC5971424 DOI: 10.1186/s13072-018-0194-0] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2018] [Accepted: 05/21/2018] [Indexed: 12/24/2022] Open
Abstract
Background Comprehensive genome-wide DNA methylation profiling is critical to gain insights into epigenetic reprogramming during development and disease processes. Among the different genome-wide DNA methylation technologies, whole genome bisulphite sequencing (WGBS) is considered the gold standard for assaying genome-wide DNA methylation at single base resolution. However, the high sequencing cost to achieve the optimal depth of coverage limits its application in both basic and clinical research. To achieve 15× coverage of the human methylome, using WGBS, requires approximately three lanes of 100-bp-paired-end Illumina HiSeq 2500 sequencing. It is important, therefore, for advances in sequencing technologies to be developed to enable cost-effective high-coverage sequencing. Results In this study, we provide an optimised WGBS methodology, from library preparation to sequencing and data processing, to enable 16–20× genome-wide coverage per single lane of HiSeq X Ten, HCS 3.3.76. To process and analyse the data, we developed a WGBS pipeline (METH10X) that is fast and can call SNPs. We performed WGBS on both high-quality intact DNA and degraded DNA from formalin-fixed paraffin-embedded tissue. First, we compared different library preparation methods on the HiSeq 2500 platform to identify the best method for sequencing on the HiSeq X Ten. Second, we optimised the PhiX and genome spike-ins to achieve higher quality and coverage of WGBS data on the HiSeq X Ten. Third, we performed integrated whole genome sequencing (WGS) and WGBS of the same DNA sample in a single lane of HiSeq X Ten to improve data output. Finally, we compared methylation data from the HiSeq 2500 and HiSeq X Ten and found high concordance (Pearson r > 0.9×). Conclusions Together we provide a systematic, efficient and complete approach to perform and analyse WGBS on the HiSeq X Ten. Our protocol allows for large-scale WGBS studies at reasonable processing time and cost on the HiSeq X Ten platform. Electronic supplementary material The online version of this article (10.1186/s13072-018-0194-0) contains supplementary material, which is available to authorized users.
Collapse
|
30
|
Chan EKF, Cameron DL, Petersen DC, Lyons RJ, Baldi BF, Papenfuss AT, Thomas DM, Hayes VM. Optical mapping reveals a higher level of genomic architecture of chained fusions in cancer. Genome Res 2018; 28:726-738. [PMID: 29618486 PMCID: PMC5932612 DOI: 10.1101/gr.227975.117] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 03/21/2018] [Indexed: 01/21/2023]
Abstract
Genomic rearrangements are common in cancer, with demonstrated links to disease progression and treatment response. These rearrangements can be complex, resulting in fusions of multiple chromosomal fragments and generation of derivative chromosomes. Although methods exist for detecting individual fusions, they are generally unable to reconstruct complex chained events. To overcome these limitations, we adopted a new optical mapping approach, allowing megabase-length genome maps to be reconstructed and rearranged genomes to be visualized without loss of integrity. Whole-genome mapping (Bionano Genomics) of a well-studied highly rearranged liposarcoma cell line resulted in 3338 assembled consensus genome maps, including 72 fusion maps. These fusion maps represent 112.3 Mb of highly rearranged genomic regions, illuminating the complex architecture of chained fusions, including content, order, orientation, and size. Spanning the junction of 147 chromosomal translocations, we found a total of 28 Mb of interspersed sequences that could not be aligned to the reference genome. Traversing these interspersed sequences using short-read sequencing breakpoint calls, we were able to identify and place 399 sequencing fragments within the optical mapping gaps, thus illustrating the complementary nature of optical mapping and short-read sequencing. We demonstrate that optical mapping provides a powerful new approach for capturing a higher level of complex genomic architecture, creating a scaffold for renewed interpretation of sequencing data of particular relevance to human cancer.
Collapse
|
31
|
Kalsbeek AMF, Chan EKF, Grogan J, Petersen DC, Jaratlerdsiri W, Gupta R, Lyons RJ, Haynes AM, Horvath LG, Kench JG, Stricker PD, Hayes VM. Altered mitochondrial genome content signals worse pathology and prognosis in prostate cancer. Prostate 2018; 78:25-31. [PMID: 29134670 DOI: 10.1002/pros.23440] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/01/2017] [Accepted: 09/29/2017] [Indexed: 01/24/2023]
Abstract
BACKGROUND Mitochondrial genome (mtDNA) content is depleted in many cancers. In prostate cancer, there is intra-glandular as well as inter-patient mtDNA copy number variation. In this study, we determine if mtDNA content can be used as a predictor for prostate cancer staging and outcomes. METHODS Fresh prostate cancer biopsies from 115 patients were obtained at time of surgery. All cores underwent pathological review, followed by isolation of cancer and normal tissue. DNA was extracted and qPCR performed to quantify the total amount of mtDNA as a ratio to genomic DNA. Differences in mtDNA content were compared for prostate cancer pathology features and disease outcomes. RESULTS We showed a significantly reduced mtDNA content in prostate cancer compared with normal adjacent prostate tissue (mean difference 1.73-fold, P-value <0.001). Prostate cancer with increased mtDNA content showed unfavorable pathologic characteristics including, higher disease stage (PT2 vs PT3 P-value = 0.018), extracapsular extension (P-value = 0.02) and a trend toward an increased Gleason score (P-value = 0.064). No significant association was observed between changes in mtDNA content and biochemical recurrence (median follow up of 107 months). CONCLUSIONS Contrary to other cancer types, prostate cancer tissue shows no universally depleted mtDNA content. Rather, the change in mtDNA content is highly variable, mirroring known prostate cancer genome heterogeneity. Patients with high mtDNA content have an unfavorable pathology, while a high mtDNA content in normal adjacent prostate tissue is associated with worse prognosis.
Collapse
|
32
|
Kalsbeek AMF, Chan EFK, Grogan J, Petersen DC, Jaratlerdsiri W, Gupta R, Lyons RJ, Haynes AM, Horvath LG, Kench JG, Stricker PD, Hayes VM. Mutational load of the mitochondrial genome predicts pathological features and biochemical recurrence in prostate cancer. Aging (Albany NY) 2017; 8:2702-2712. [PMID: 27705925 PMCID: PMC5191864 DOI: 10.18632/aging.101044] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2016] [Accepted: 09/20/2016] [Indexed: 12/31/2022]
Abstract
Prostate cancer management is complicated by extreme disease heterogeneity, which is further limited by availability of prognostic biomarkers. Recognition of prostate cancer as a genetic disease has prompted a focus on the nuclear genome for biomarker discovery, with little attention given to the mitochondrial genome. While it is evident that mitochondrial DNA (mtDNA) mutations are acquired during prostate tumorigenesis, no study has evaluated the prognostic value of mtDNA variation. Here we used next-generation sequencing to interrogate the mitochondrial genomes from prostate tissue biopsies and matched blood of 115 men having undergone a radical prostatectomy for which there was a mean of 107 months clinical follow-up. We identified 74 unique prostate cancer specific somatic mtDNA variants in 50 patients, providing significant expansion to the growing catalog of prostate cancer mtDNA mutations. While no single variant or variant cluster showed recurrence across multiple patients, we observe a significant positive correlation between the total burden of acquired mtDNA variation and elevated Gleason Score at diagnosis and biochemical relapse. We add to accumulating evidence that total acquired genomic burden, rather than specific mtDNA mutations, has diagnostic value. This is the first study to demonstrate the prognostic potential of mtDNA mutational burden in prostate cancer.
Collapse
|
33
|
Skoglund P, Thompson JC, Prendergast ME, Mittnik A, Sirak K, Hajdinjak M, Salie T, Rohland N, Mallick S, Peltzer A, Heinze A, Olalde I, Ferry M, Harney E, Michel M, Stewardson K, Cerezo-Román JI, Chiumia C, Crowther A, Gomani-Chindebvu E, Gidna AO, Grillo KM, Helenius IT, Hellenthal G, Helm R, Horton M, López S, Mabulla AZP, Parkington J, Shipton C, Thomas MG, Tibesasa R, Welling M, Hayes VM, Kennett DJ, Ramesar R, Meyer M, Pääbo S, Patterson N, Morris AG, Boivin N, Pinhasi R, Krause J, Reich D. Reconstructing Prehistoric African Population Structure. Cell 2017; 171:59-71.e21. [PMID: 28938123 PMCID: PMC5679310 DOI: 10.1016/j.cell.2017.08.049] [Citation(s) in RCA: 176] [Impact Index Per Article: 25.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2016] [Revised: 07/01/2017] [Accepted: 08/29/2017] [Indexed: 02/06/2023]
Abstract
We assembled genome-wide data from 16 prehistoric Africans. We show that the anciently divergent lineage that comprises the primary ancestry of the southern African San had a wider distribution in the past, contributing approximately two-thirds of the ancestry of Malawi hunter-gatherers ∼8,100-2,500 years ago and approximately one-third of the ancestry of Tanzanian hunter-gatherers ∼1,400 years ago. We document how the spread of farmers from western Africa involved complete replacement of local hunter-gatherers in some regions, and we track the spread of herders by showing that the population of a ∼3,100-year-old pastoralist from Tanzania contributed ancestry to people from northeastern to southern Africa, including a ∼1,200-year-old southern African pastoralist. The deepest diversifications of African lineages were complex, involving either repeated gene flow among geographically disparate groups or a lineage more deeply diverging than that of the San contributing more to some western African populations than to others. We finally leverage ancient genomes to document episodes of natural selection in southern African populations. PAPERCLIP.
Collapse
|
34
|
Kalsbeek AM, Chan EK, Corcoran NM, Hovens CM, Hayes VM. Mitochondrial genome variation and prostate cancer: a review of the mutational landscape and application to clinical management. Oncotarget 2017; 8:71342-71357. [PMID: 29050365 PMCID: PMC5642640 DOI: 10.18632/oncotarget.19926] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2017] [Accepted: 07/26/2017] [Indexed: 12/17/2022] Open
Abstract
Prostate cancer is a genetic disease. While next generation sequencing has allowed for the emergence of molecular taxonomy, classification is restricted to the nuclear genome. Mutations within the maternally inherited mitochondrial genome are known to impact cancer pathogenesis, as a result of disturbances in energy metabolism and apoptosis. With a higher mutation rate, limited repair and increased copy number compared to the nuclear genome, the clinical relevance of mitochondrial DNA (mtDNA) variation requires deeper exploration. Here we provide a systematic review of the landscape of prostate cancer associated mtDNA variation. While the jury is still out on the association between inherited mtDNA variation and prostate cancer risk, we collate a total of 749 uniquely reported prostate cancer associated somatic mutations. Support exists for number of somatic events, extent of heteroplasmy, and rate of recurrence of mtDNA mutations, increasing with disease aggression. While, the predicted pathogenic impact for recurrent prostate cancer associated mutations appears negligible, evidence exists for carcinogenic mutations impacting the cytochrome c oxidase complex and regulating metastasis through elevated reactive oxygen species production. Due to a lack of lethal cohort analyses, we provide additional unpublished data for metastatic disease. Discussing the advantages of mtDNA as a prostate cancer biomarker, we provide a review of current progress of including elevated mtDNA levels, of a large somatic deletion, acquired tRNAs mutations, heteroplasmy and total number of somatic events (mutational load). We confirm via meta-analysis a significant association between mtDNA mutational load and pathological staging at diagnosis or surgery (p < 0.0001).
Collapse
|
35
|
Jaratlerdsiri W, Chan EKF, Petersen DC, Yang C, Croucher PI, Bornman MSR, Sheth P, Hayes VM. Next generation mapping reveals novel large genomic rearrangements in prostate cancer. Oncotarget 2017; 8:23588-23602. [PMID: 28423598 PMCID: PMC5410329 DOI: 10.18632/oncotarget.15802] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Accepted: 02/15/2017] [Indexed: 12/27/2022] Open
Abstract
Complex genomic rearrangements are common molecular events driving prostate carcinogenesis. Clinical significance, however, has yet to be fully elucidated. Detecting the full range and subtypes of large structural variants (SVs), greater than one kilobase in length, is challenging using clinically feasible next generation sequencing (NGS) technologies. Next generation mapping (NGM) is a new technology that allows for the interrogation of megabase length DNA molecules outside the detection range of single-base resolution NGS. In this study, we sought to determine the feasibility of using the Irys (Bionano Genomics Inc.) nanochannel NGM technology to generate whole genome maps of a primary prostate tumor and matched blood from a Gleason score 7 (4 + 3), ETS-fusion negative prostate cancer patient. With an effective mapped coverage of 35X and sequence coverage of 60X, and an estimated 43% tumor purity, we identified 85 large somatic structural rearrangements and 6,172 smaller somatic variants, respectively. The vast majority of the large SVs (89%), of which 73% are insertions, were not detectable ab initio using high-coverage short-read NGS. However, guided manual inspection of single NGS reads and de novo assembled scaffolds of NGM-derived candidate regions allowed for confirmation of 94% of these large SVs, with over a third impacting genes with oncogenic potential. From this single-patient study, the first cancer study to integrate NGS and NGM data, we hypothesise that there exists a novel spectrum of large genomic rearrangements in prostate cancer, that these large genomic rearrangements are likely early events in tumorigenesis, and they have potential to enhance taxonomy.
Collapse
|
36
|
Kalsbeek AM, Chan EF, Grogan J, Petersen DC, Jaratlerdsiri W, Gupta R, Lyons RJ, Haynes AM, Horvath LG, Kench JG, Stricker PD, Hayes VM. Mitochondrial genome content is decreased in prostate cancer overall, but increased in high grade cancer. Pathology 2017. [DOI: 10.1016/j.pathol.2016.12.300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
37
|
Stuchbery R, Kurganovs NJ, McCoy PJ, Nelson CC, Hayes VM, Corcoran NM, Hovens CM. Target Acquired: Progress and Promise of Targeted Therapeutics in the Treatment of Prostate Cancer. Curr Cancer Drug Targets 2016; 15:394-405. [PMID: 25882061 DOI: 10.2174/1568009615666150416113453] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2014] [Revised: 11/30/2014] [Accepted: 11/30/2014] [Indexed: 11/22/2022]
Abstract
Cancer is fundamentally a genomic disease caused by mutations or rearrangements in the DNA or epigenetic machinery of a patient. An emerging field in cancer treatment targets key aberrations arising from the mutational landscape of an individual patient's disease rather than employing a cancer-wide cytotoxic therapy approach. In prostate cancer in particular, where there is an observed variation in response to standard treatments between patients with disease of a similar pathological stage and grade, mutationdirected treatment may grow to be a viable tool for clinicians to tailor more effective treatments. This review will describe a number of mutations across multiple forms of cancer that have been successfully antagonised by targeted therapeutics including their identification, the development of targeted compounds to combat them and the development of resistance to these therapies. This review will continue to examine these same mutations in the treatment and management of prostate cancer; the prevalence of targetable mutations in prostate cancer, recent clinical trials of targeted-agents and the potential or limitations for their use.
Collapse
|
38
|
McCrow JP, Petersen DC, Louw M, Chan EKF, Harmeyer K, Vecchiarelli S, Lyons RJ, Bornman MSR, Hayes VM. Spectrum of mitochondrial genomic variation and associated clinical presentation of prostate cancer in South African men. Prostate 2016; 76:349-58. [PMID: 26660354 PMCID: PMC4832339 DOI: 10.1002/pros.23126] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/17/2015] [Accepted: 11/05/2015] [Indexed: 01/09/2023]
Abstract
BACKGROUND Prostate cancer incidence and mortality rates are significantly increased in African-American men, but limited studies have been performed within Sub-Saharan African populations. As mitochondria control energy metabolism and apoptosis we speculate that somatic mutations within mitochondrial genomes are candidate drivers of aggressive prostate carcinogenesis. METHODS We used matched blood and prostate tissue samples from 87 South African men (77 with African ancestry) to perform deep sequencing of complete mitochondrial genomes. Clinical presentation was biased toward aggressive disease (Gleason score >7, 64%), and compared with men without prostate cancer either with or without benign prostatic hyperplasia. RESULTS We identified 144 somatic mtDNA single nucleotide variants (SNVs), of which 80 were observed in 39 men presenting with aggressive disease. Both the number and frequency of somatic mtDNA SNVs were associated with higher pathological stage. CONCLUSIONS Besides doubling the total number of somatic PCa-associated mitochondrial genome mutations identified to date, we associate mutational load with aggressive prostate cancer status in men of African ancestry.
Collapse
|
39
|
Hayes VM, Tindall EA, Bornman R. Abstract B22: Significance of ethnic-based disparities in prostate cancer risk and outcomes. Cancer Epidemiol Biomarkers Prev 2015. [DOI: 10.1158/1538-7755.disp14-b22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Abstract
Ethnic-based disparities have been significantly associated with prostate cancer incidence, age and pathological/biochemical presentation, and mortality. Risk for these parameters are lowest in Asian men relative to men of European ancestry, while the highest in men of African ancestry, in particular African-Americans. Limited studies have, however, addressed clinical presentation and contributing factors within the context of Africa. We established the Southern African Prostate Cancer Study (SAPCS), a unique resource to investigate clinical presentation and risk factors within non-migrant Africans. Enrolling over 1000 men to date we report skewing towards significantly higher prostate specific antigen (PSA) levels in cases (83.0% present with a PSA≥20mg/L; median PSA=98.8mg/L) relative to men with no detectable PCa (18.5% present with a PSA≥20mg/L; median PSA=9.1mg/L). Compared with African-Americans, non-migrant Africans presented with significantly more aggressive disease defined by Gleason score >7 (17% and 36%, respectively) and PSA≥20mg/L (17.2% and 83.2%, respectively), which is further exasperated in men from rural versus urban localities. To determine if the observed clinical presentation is driven by genetics and/or environmental factors, we investigated the power for the most significant genome-wide association study (GWAS) identified PCa risk alleles (n=46) as well as epidemiological measures to predict PCa status and/or disease outcomes within the SAPCS. While we show no evidence that previously defined PCa risk alleles predict disease status within Africa, we show associations with family history of cancer, diabetes, current sexual activity and erectile dysfunction, balding pattern, frequent aspirin usage and high PSA levels. We propose that the SAPCS provides a unique non-PSA tested resource to enhance our understanding that drives significant disparities in PCa associated with ethnic background.
Citation Format: Vanessa M. Hayes, Elizabeth A. Tindall, Riana Bornman. Significance of ethnic-based disparities in prostate cancer risk and outcomes. [abstract]. In: Proceedings of the Seventh AACR Conference on The Science of Health Disparities in Racial/Ethnic Minorities and the Medically Underserved; Nov 9-12, 2014; San Antonio, TX. Philadelphia (PA): AACR; Cancer Epidemiol Biomarkers Prev 2015;24(10 Suppl):Abstract nr B22.
Collapse
|
40
|
Chan EKF, Hardie RA, Petersen DC, Beeson K, Bornman RMS, Smith AB, Hayes VM. Revised timeline and distribution of the earliest diverged human maternal lineages in southern Africa. PLoS One 2015; 10:e0121223. [PMID: 25807545 PMCID: PMC4373779 DOI: 10.1371/journal.pone.0121223] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2014] [Accepted: 01/29/2015] [Indexed: 02/05/2023] Open
Abstract
The oldest extant human maternal lineages include mitochondrial haplogroups L0d and L0k found in the southern African click-speaking forager peoples broadly classified as Khoesan. Profiling these early mitochondrial lineages allows for better understanding of modern human evolution. In this study, we profile 77 new early-diverged complete mitochondrial genomes and sub-classify another 105 L0d/L0k individuals from southern Africa. We use this data to refine basal phylogenetic divergence, coalescence times and Khoesan prehistory. Our results confirm L0d as the earliest diverged lineage (∼172 kya, 95%CI: 149–199 kya), followed by L0k (∼159 kya, 95%CI: 136–183 kya) and a new lineage we name L0g (∼94 kya, 95%CI: 72–116 kya). We identify two new L0d1 subclades we name L0d1d and L0d1c4/L0d1e, and estimate L0d2 and L0d1 divergence at ∼93 kya (95%CI:76–112 kya). We concur the earliest emerging L0d1’2 sublineage L0d1b (∼49 kya, 95%CI:37–58 kya) is widely distributed across southern Africa. Concomitantly, we find the most recent sublineage L0d2a (∼17 kya, 95%CI:10–27 kya) to be equally common. While we agree that lineages L0d1c and L0k1a are restricted to contemporary inland Khoesan populations, our observed predominance of L0d2a and L0d1a in non-Khoesan populations suggests a once independent coastal Khoesan prehistory. The distribution of early-diverged human maternal lineages within contemporary southern Africans suggests a rich history of human existence prior to any archaeological evidence of migration into the region. For the first time, we provide a genetic-based evidence for significant modern human evolution in southern Africa at the time of the Last Glacial Maximum at between ∼21–17 kya, coinciding with the emergence of major lineages L0d1a, L0d2b, L0d2d and L0d2a.
Collapse
|
41
|
Morris AG, Heinze A, Chan EKF, Smith AB, Hayes VM. First ancient mitochondrial human genome from a prepastoralist southern African. Genome Biol Evol 2014; 6:2647-53. [PMID: 25212860 PMCID: PMC4224329 DOI: 10.1093/gbe/evu202] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
The oldest contemporary human mitochondrial lineages arose in Africa. The earliest divergent extant maternal offshoot, namely haplogroup L0d, is represented by click-speaking forager peoples of southern Africa. Broadly defined as Khoesan, contemporary Khoesan are today largely restricted to the semidesert regions of Namibia and Botswana, whereas archeological, historical, and genetic evidence promotes a once broader southerly dispersal of click-speaking peoples including southward migrating pastoralists and indigenous marine-foragers. No genetic data have been recovered from the indigenous peoples that once sustained life along the southern coastal waters of Africa prepastoral arrival. In this study we generate a complete mitochondrial genome from a 2,330-year-old male skeleton, confirmed through osteological and archeological analysis as practicing a marine-based forager existence. The ancient mtDNA represents a new L0d2c lineage (L0d2c1c) that is today, unlike its Khoe-language based sister-clades (L0d2c1a and L0d2c1b) most closely related to contemporary indigenous San-speakers (specifically Ju). Providing the first genomic evidence that prepastoral Southern African marine foragers carried the earliest diverged maternal modern human lineages, this study emphasizes the significance of Southern African archeological remains in defining early modern human origins.
Collapse
|
42
|
Tindall EA, Monare LR, Petersen DC, van Zyl S, Hardie RA, Segone AM, Venter PA, Bornman MSR, Hayes VM. Clinical presentation of prostate cancer in black South Africans. Prostate 2014; 74:880-91. [PMID: 24723425 PMCID: PMC4135056 DOI: 10.1002/pros.22806] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/16/2013] [Accepted: 03/12/2014] [Indexed: 12/11/2022]
Abstract
BACKGROUND Compared with White Americans, Black American men are at a significant increased risk of presenting with prostate cancer (PCa) and associated mortality, suggesting a link to African-ancestry. However, PCa status within Africa is largely unknown. We address the clinical presentation of PCa within Black South African men. METHODS Over 1,000 participants with or without PCa have enrolled in the Southern African Prostate Cancer Study (SAPCS). Using genome-wide profiling we establish a unique within Africa population substructure. Adjusting for age, clinical variables were assessed, compared against Black Americans and between rural and urban localities while addressing potential socio-demographic confounders. RESULTS We report a significant difference in the distribution of prostate specific antigen (PSA) levels skewed towards higher PSA levels in the PCa cases (83.0% present with a PSA ≥ 20 µg/L; median PSA = 98.8 µg/L) relative to men with no detectable PCa (18.5% present with a PSA ≥ 20 µg/L; median PSA = 9.1 µg/L). Compared with Black Americans, Black South Africans presented with significantly more aggressive disease defined by Gleason score >7 (17% and 36%, respectively) and PSA ≥ 20 µg/L (17.2% and 83.2%, respectively). We report exasperated disease aggression defined by Gleason score >7 (P = 0.0042) and poorly differentiated tumor grade (P < 0.0001) within rural versus urban localities. CONCLUSION Black South African men present with higher PSA levels and histopathological tumor grade compared with Black Americans, which is further escalated in men from rural localities. Our data suggests that lack of PSA testing may be contributing to an aggressive PCa disease phenotype within South African men.
Collapse
|
43
|
Tindall EA, Bornman MSR, van Zyl S, Segone AM, Monare LR, Venter PA, Hayes VM. Addressing the contribution of previously described genetic and epidemiological risk factors associated with increased prostate cancer risk and aggressive disease within men from South Africa. BMC Urol 2013; 13:74. [PMID: 24373635 PMCID: PMC3882498 DOI: 10.1186/1471-2490-13-74] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2013] [Accepted: 12/24/2013] [Indexed: 11/30/2022] Open
Abstract
Background Although African ancestry represents a significant risk factor for prostate cancer, few studies have investigated the significance of prostate cancer and relevance of previously defined genetic and epidemiological prostate cancer risk factors within Africa. We recently established the Southern African Prostate Cancer Study (SAPCS), a resource for epidemiological and genetic analysis of prostate cancer risk and outcomes in Black men from South Africa. Biased towards highly aggressive prostate cancer disease, this is the first reported data analysis. Methods The SAPCS is an ongoing population-based study of Black men with or without prostate cancer. Pilot analysis was performed for the first 837 participants, 522 cases and 315 controls. We investigate 46 pre-defined prostate cancer risk alleles and up to 24 epidemiological measures including demographic, lifestyle and environmental factors, for power to predict disease status and to drive on-going SAPCS recruitment, sampling procedures and research direction. Results Preliminary results suggest that no previously defined risk alleles significantly predict prostate cancer occurrence within the SAPCS. Furthermore, genetic risk profiles did not enhance the predictive power of prostate specific antigen (PSA) testing. Our study supports several lifestyle/environmental factors contributing to prostate cancer risk including a family history of cancer, diabetes, current sexual activity and erectile dysfunction, balding pattern, frequent aspirin usage and high PSA levels. Conclusions Despite a clear increased prostate cancer risk associated with an African ancestry, experimental data is lacking within Africa. This pilot study is therefore a significant contribution to the field. While genetic risk factors (largely European-defined) show no evidence for disease prediction in the SAPCS, several epidemiological factors were associated with prostate cancer status. We call for improved study power by building on the SAPCS resource, further validation of associated factors in independent African-based resources, and genome-wide approaches to define African-specific risk alleles.
Collapse
|
44
|
Saunders DN, Tindall EA, Shearer RF, Roberson J, Decker A, Wilson JA, Hayes VM. A novel SERPINA1 mutation causing serum alpha(1)-antitrypsin deficiency. PLoS One 2012; 7:e51762. [PMID: 23251618 PMCID: PMC3520848 DOI: 10.1371/journal.pone.0051762] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2012] [Accepted: 11/06/2012] [Indexed: 12/02/2022] Open
Abstract
Mutations in the SERPINA1 gene can cause deficiency in the circulating serine protease inhibitor α1-Antitrypsin (α1AT). α1AT deficiency is the major contributor to pulmonary emphysema and liver disease in persons of European ancestry, with a prevalence of 1 in 2500 in the USA. We present the discovery and characterization of a novel SERPINA1 mutant from an asymptomatic Middle Eastern male with circulating α1AT deficiency. This 49 base pair deletion mutation (T379Δ), originally mistyped by IEF, causes a frame-shift replacement of the last sixteen α1AT residues and adds an extra twenty-four residues. Functional analysis showed that the mutant protein is not secreted and prone to intracellular aggregation.
Collapse
|
45
|
Tindall EA, Severi G, Hoang HN, Southey MC, English DR, Hopper JL, Giles GG, Hayes VM. Interleukin-6 promoter variants, prostate cancer risk, and survival. Prostate 2012; 72:1701-7. [PMID: 22782910 DOI: 10.1002/pros.22557] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/16/2012] [Accepted: 06/12/2012] [Indexed: 12/20/2022]
Abstract
BACKGROUND Inflammation has been implicated in prostate cancer (PCa) pathogenesis. Promoter DNA variants responsible for differential expression of key cytokines may therefore influence susceptibility to PCa. METHODS Two interleukin-6 (IL-6) promoter variants, -174G>C and -6331T>C, were genotyped for association with PCa risk and survival using the Risk Factors for Prostate Cancer Study (RFPCS, 825 cases and 732 controls) and the Melbourne Collaborative Cohort Study (MCCS, 818 cases and 1,745 controls). Impact of genotypes on IL-6 transcriptional activity was measured using Low Density Arrays. RESULTS A significant increase in IL-6 transcriptional activity in malignant compared to benign prostate tissue supports a role for IL-6 in PCa. The -174G>C variant showed no association with PCa risk, overall survival, or IL-6 transcriptional activity. The -6331 C-allele was significantly associated with an increased risk in the RFPCS (OR = 1.29, 95% CI = 1.08-1.54), but not in the MCCS. In the MCCS however, cases presenting with a CC genotype conferred a higher risk of mortality (HR = 2.27, 95% CI = 1.34-3.85), which was maintained although reduced overall in the pooled analysis with RFPCS (HR = 1.68, 95% CI = 1.10-2.54). Furthermore, we associate the minor C-allele with a significant decrease in IL-6 transcriptional activity. CONCLUSIONS While our study refutes a role for IL-6 -174G>C, it is the first to implicate -6331T>C with PCa risk and poor survival. Our observation that -6331T>C has a significant impact on IL-6 transcriptional activity, calls for further investigations into the role of this variant as a novel PCa biomarker.
Collapse
|
46
|
Hayes VM. Treasure Caves
Cradle of Humankind
World Heritage Site Sterkfontein 1911, South Africa www.cradleofhumankind.co.za. Science 2012. [DOI: 10.1126/science.1211058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Our reviewer visits a World Heritage Site in South Africa that preserves the source of several important hominin fossils.
Collapse
|
47
|
Ratan A, Zhang Y, Hayes VM, Schuster SC, Miller W. Calling SNPs without a reference sequence. BMC Bioinformatics 2010; 11:130. [PMID: 20230626 PMCID: PMC2851604 DOI: 10.1186/1471-2105-11-130] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2009] [Accepted: 03/15/2010] [Indexed: 12/11/2022] Open
Abstract
Background The most common application for the next-generation sequencing technologies is resequencing, where short reads from the genome of an individual are aligned to a reference genome sequence for the same species. These mappings can then be used to identify genetic differences among individuals in a population, and perhaps ultimately to explain phenotypic variation. Many algorithms capable of aligning short reads to the reference, and determining differences between them have been reported. Much less has been reported on how to use these technologies to determine genetic differences among individuals of a species for which a reference sequence is not available, which drastically limits the number of species that can easily benefit from these new technologies. Results We describe a computational pipeline, called DIAL (De novo Identification of Alleles), for identifying single-base substitutions between two closely related genomes without the help of a reference genome. The method works even when the depth of coverage is insufficient for de novo assembly, and it can be extended to determine small insertions/deletions. We evaluate the software's effectiveness using published Roche/454 sequence data from the genome of Dr. James Watson (to detect heterozygous positions) and recent Illumina data from orangutan, in each case comparing our results to those from computational analysis that uses a reference genome assembly. We also illustrate the use of DIAL to identify nucleotide differences among transcriptome sequences. Conclusions DIAL can be used for identification of nucleotide differences in species for which no reference sequence is available. Our main motivation is to use this tool to survey the genetic diversity of endangered species as the identified sequence differences can be used to design genotyping arrays to assist in the species' management. The DIAL source code is freely available at http://www.bx.psu.edu/miller_lab/.
Collapse
|
48
|
Tindall EA, Hoang HN, Southey MC, English DR, Hopper JL, Giles GG, Severi G, Hayes VM. The 4q27 locus and prostate cancer risk. BMC Cancer 2010; 10:69. [PMID: 20184734 PMCID: PMC2841665 DOI: 10.1186/1471-2407-10-69] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2009] [Accepted: 02/25/2010] [Indexed: 12/13/2022] Open
Abstract
Background Chronic inflammation is considered to be implicated in the development of prostate cancer. In this study we are the first to investigate a potential association between variants in an autoimmune related region on chromosome 4q27 and prostate cancer risk. This region harbors two cytokine genes IL-2 and the recently described IL-21. Methods We genotyped six variants previously associated with autoimmune disease (namely rs13151961, rs13119723, rs17388568, rs3136534, rs6822844 and rs6840978) and one functional IL-2 promoter variant (rs2069762) for possible association with prostate cancer risk using the Australian Risk Factors for Prostate Cancer case-control Study. Results Overall, our results do not support an association between the seven variants at position 4q27 and prostate cancer risk. Per allele odds ratios (ORs) were not significantly different from 1 (all P-values = 0.06). However, we found suggestive evidence for a significant association between the presence of the rs13119723 variant (located in a protein of unknown function) and men with a family history of prostate cancer in first-degree relatives (P-value for interaction 0.02). The per allele OR associated with this variant was significantly higher than 1 (2.37; 95% C.I. = 1.01-5.57). Conclusions We suggest that genetic variation within the chromosome 4q27 locus might be associated with prostate cancer susceptibility in men with a family history of the disease. Furthermore, our study alludes to a potential role of unknown protein KIAA1109 in conferring this risk.
Collapse
|
49
|
Tindall EA, Petersen DC, Nikolaysen S, Miller W, Schuster SC, Hayes VM. Interpretation of custom designed Illumina genotype cluster plots for targeted association studies and next-generation sequence validation. BMC Res Notes 2010; 3:39. [PMID: 20175893 PMCID: PMC2848685 DOI: 10.1186/1756-0500-3-39] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2010] [Accepted: 02/22/2010] [Indexed: 11/29/2022] Open
Abstract
Background High-throughput custom designed genotyping arrays are a valuable resource for biologically focused research studies and increasingly for validation of variation predicted by next-generation sequencing (NGS) technologies. We investigate the Illumina GoldenGate chemistry using custom designed VeraCode and sentrix array matrix (SAM) assays for each of these applications, respectively. We highlight applications for interpretation of Illumina generated genotype cluster plots to maximise data inclusion and reduce genotyping errors. Findings We illustrate the dramatic effect of outliers in genotype calling and data interpretation, as well as suggest simple means to avoid genotyping errors. Furthermore we present this platform as a successful method for two-cluster rare or non-autosomal variant calling. The success of high-throughput technologies to accurately call rare variants will become an essential feature for future association studies. Finally, we highlight additional advantages of the Illumina GoldenGate chemistry in generating unusually segregated cluster plots that identify potential NGS generated sequencing error resulting from minimal coverage. Conclusions We demonstrate the importance of visually inspecting genotype cluster plots generated by the Illumina software and issue warnings regarding commonly accepted quality control parameters. In addition to suggesting applications to minimise data exclusion, we propose that the Illumina cluster plots may be helpful in identifying potential in-put sequence errors, particularly important for studies to validate NGS generated variation.
Collapse
|
50
|
Tindall EA, Hayes VM. Comprehensive sequence analysis of the human IL23A gene defines new variation content and high rate of evolutionary conservation. DNA Res 2010; 17:117-22. [PMID: 20154336 PMCID: PMC2853383 DOI: 10.1093/dnares/dsq003] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
A newly described heterodimeric cytokine, interleukin-23 (IL-23) is emerging as a key player in both the innate and the adaptive T helper (Th)17 driven immune response as well as an initiator of several autoimmune diseases. The rate-limiting element of IL-23 production is believed to be driven by expression of the unique p19 subunit encoded by IL23A. We set out to perform comprehensive DNA sequencing of this previously under-studied gene in 96 individuals from two evolutionary distinct human population groups, Southern African Bantu and European. We observed a total of 33 different DNA variants within these two groups, 22 (67%) of which are currently not reported in any available database. We further demonstrate both inter-population and intra-species sequence conservation within the coding and known regulatory regions of IL23A, supporting a critical physiological role for IL-23. We conclude that IL23A may have undergone positive selection pressure directed towards conservation, suggesting that functional genetic variants within IL23A will have a significant impact on the host immune response.
Collapse
|