1
|
Ayalew W, Xiaoyun W, Tarekegn GM, Naboulsi R, Sisay Tessema T, Van Damme R, Bongcam-Rudloff E, Chu M, Liang C, Edea Z, Enquahone S, Ping Y. Whole genome sequences of 70 indigenous Ethiopian cattle. Sci Data 2024; 11:584. [PMID: 38839789 PMCID: PMC11153504 DOI: 10.1038/s41597-024-03342-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Accepted: 05/02/2024] [Indexed: 06/07/2024] Open
Abstract
Indigenous animal genetic resources play a crucial role in preserving global genetic diversity and supporting the livelihoods of millions of people. In Ethiopia, the majority of the cattle population consists of indigenous breeds. Understanding the genetic architecture of these cattle breeds is essential for effective management and conservation efforts. In this study, we sequenced DNA samples from 70 animals from seven indigenous cattle breeds, generating about two terabytes of pair-end reads with an average coverage of 14X. The sequencing data were pre-processed and mapped to the cattle reference genome (ARS-UCD1.2) with an alignment rate of 99.2%. Finally, the variant calling process produced approximately 35 million high-quality SNPs. These data provide a deeper understanding of the genetic landscape, facilitate the identification of causal mutations, and enable the exploration of evolutionary patterns to assist cattle improvement and sustainable utilization, particularly in the face of unpredictable climate changes.
Collapse
Affiliation(s)
- Wondossen Ayalew
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Key Laboratory of Yak Breeding Engineering, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, P.R. China
- Institute of Biotechnology, Addis Ababa University, Addis Ababa P.O. Box 1176, Addis Ababa, Ethiopia
| | - Wu Xiaoyun
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Key Laboratory of Yak Breeding Engineering, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, P.R. China
| | - Getinet Mekuriaw Tarekegn
- Institute of Biotechnology, Addis Ababa University, Addis Ababa P.O. Box 1176, Addis Ababa, Ethiopia.
- Scotland's Rural College (SRUC), Roslin Institute Building, University of Edinburgh, Edinburgh, EH25 9RG, UK.
| | - Rakan Naboulsi
- Childhood Cancer Research Unit, Department of Women's and Children's Health, Karolinska Institute, Tomtebodavägen 18A, 17177, Stockholm, Sweden
| | - Tesfaye Sisay Tessema
- Institute of Biotechnology, Addis Ababa University, Addis Ababa P.O. Box 1176, Addis Ababa, Ethiopia
| | - Renaud Van Damme
- Department of Animal Biosciences, Swedish University of Agricultural Sciences, 75007, Uppsala, Sweden
| | - Erik Bongcam-Rudloff
- Department of Animal Biosciences, Swedish University of Agricultural Sciences, 75007, Uppsala, Sweden
| | - Min Chu
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Key Laboratory of Yak Breeding Engineering, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, P.R. China
| | - Chunnian Liang
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Key Laboratory of Yak Breeding Engineering, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, P.R. China
| | - Zewdu Edea
- Ethiopian Bio and Emerging Technology Institute, Addis Ababa, Ethiopia
| | - Solomon Enquahone
- Scotland's Rural College (SRUC), Roslin Institute Building, University of Edinburgh, Edinburgh, EH25 9RG, UK
| | - Yan Ping
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Key Laboratory of Yak Breeding Engineering, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, P.R. China.
| |
Collapse
|
2
|
Omar M, Alexiou M, Rekhi UR, Lehmann K, Bhardwaj A, Delyea C, Elahi S, Febbraio M. DNA methylation changes underlie the long-term association between periodontitis and atherosclerotic cardiovascular disease. Front Cardiovasc Med 2023; 10:1164499. [PMID: 37153468 PMCID: PMC10160482 DOI: 10.3389/fcvm.2023.1164499] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2023] [Accepted: 03/22/2023] [Indexed: 05/09/2023] Open
Abstract
Periodontitis, the leading cause of adult tooth loss, has been identified as an independent risk factor for cardiovascular disease (CVD). Studies suggest that periodontitis, like other CVD risk factors, shows the persistence of increased CVD risk even after mitigation. We hypothesized that periodontitis induces epigenetic changes in hematopoietic stem cells in the bone marrow (BM), and such changes persist after the clinical elimination of the disease and underlie the increased CVD risk. We used a BM transplant approach to simulate the clinical elimination of periodontitis and the persistence of the hypothesized epigenetic reprogramming. Using the low-density lipoprotein receptor knockout (LDLRo ) atherosclerosis mouse model, BM donor mice were fed a high-fat diet to induce atherosclerosis and orally inoculated with Porphyromonas gingivalis (Pg), a keystone periodontal pathogen; the second group was sham-inoculated. Naïve LDLR o mice were irradiated and transplanted with BM from one of the two donor groups. Recipients of BM from Pg-inoculated donors developed significantly more atherosclerosis, accompanied by cytokine/chemokines that suggested BM progenitor cell mobilization and were associated with atherosclerosis and/or PD. Using whole-genome bisulfite sequencing, 375 differentially methylated regions (DMRs) and global hypomethylation in recipients of BM from Pg-inoculated donors were observed. Some DMRs pointed to the involvement of enzymes with major roles in DNA methylation and demethylation. In validation assays, we found a significant increase in the activity of ten-eleven translocase-2 and a decrease in the activity of DNA methyltransferases. Plasma S-adenosylhomocysteine levels were significantly higher, and the S-adenosylmethionine to S-adenosylhomocysteine ratio was decreased, both of which have been associated with CVD. These changes may be related to increased oxidative stress as a result of Pg infection. These data suggest a novel and paradigm-shifting mechanism in the long-term association between periodontitis and atherosclerotic CVD.
Collapse
|
3
|
Camerlenghi F, Favaro S, Masoero L, Broderick T. Scaled process priors for Bayesian nonparametric estimation of the unseen genetic variation. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2022.2115918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
Affiliation(s)
- Federico Camerlenghi
- Department of Economics, Management and Statistics, University of Milano - Bicocca, Milan, Italy
| | - Stefano Favaro
- Department of Economics and Statistics, University of Torino and Collegio Carlo Alberto, Torino, Italy
| | - Lorenzo Masoero
- Department of Electrical Engineering and Computer Science, CSAIL, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Tamara Broderick
- Department of Electrical Engineering and Computer Science, CSAIL, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| |
Collapse
|
4
|
Next-Generation Sequencing Advances the Genetic Diagnosis of Cerebral Cavernous Malformation (CCM). Antioxidants (Basel) 2022; 11:antiox11071294. [PMID: 35883785 PMCID: PMC9311989 DOI: 10.3390/antiox11071294] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 06/24/2022] [Accepted: 06/26/2022] [Indexed: 02/07/2023] Open
Abstract
Cerebral Cavernous Malformation (CCM) is a cerebrovascular disease of genetic origin that predisposes to seizures, focal neurological deficits and fatal intracerebral hemorrhage. It may occur sporadically or in familial forms, segregating as an autosomal dominant condition with incomplete penetrance and highly variable expressivity. Its pathogenesis has been associated with loss-of-function mutations in three genes, namely KRIT1 (CCM1), CCM2 and PDCD10 (CCM3), which are implicated in defense mechanisms against oxidative stress and inflammation. Herein, we screened 21 Italian CCM cases using clinical exome sequencing and found six cases (~29%) with pathogenic variants in CCM genes, including a large 145−256 kb genomic deletion spanning the KRIT1 gene and flanking regions, and the KRIT1 c.1664C>T variant, which we demonstrated to activate a donor splice site in exon 16. The segregation of this cryptic splicing mutation was studied in a large Italian family (five affected and seven unaffected cases), and showed a largely heterogeneous clinical presentation, suggesting the implication of genetic modifiers. Moreover, by analyzing ad hoc gene panels, including a virtual panel of 23 cerebrovascular disease-related genes (Cerebro panel), we found two variants in NOTCH3 and PTEN genes, which could contribute to the abnormal oxidative stress and inflammatory responses to date implicated in CCM disease pathogenesis.
Collapse
|
5
|
Wickland DP, Sherman ME, Radisky DC, Mansfield AS, Asmann YW. Lower Exome Sequencing Coverage of Ancestrally African Patients in The Cancer Genome Atlas. J Natl Cancer Inst 2022; 114:1192-1199. [PMID: 35299252 PMCID: PMC9360464 DOI: 10.1093/jnci/djac054] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 12/18/2021] [Accepted: 02/25/2022] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND In the United States, cancer disproportionately impacts Black and African American individuals. Identifying genetic factors underlying cancer disparities has been an important research focus and requires data that are equitable in both quantity and quality across racial groups. It is widely recognized that DNA databases quantitatively underrepresent minorities. However, the differences in data quality between racial groups have not been well studied. METHODS We compared the qualities of germline and tumor exomes between ancestrally African and European patients in The Cancer Genome Atlas of 7 cancers with at least 50 self-reported Black patients in the context of sequencing depth, tumor purity, and qualities of germline variants and somatic mutations. RESULTS Germline and tumor exomes from ancestrally African patients were sequenced at statistically significantly lower depth in 6 out of the 7 cancers. For 3 cancers, most ancestrally European exomes were sequenced in early sample batches at higher depth, whereas ancestrally African exomes were concentrated in later batches and sequenced at much lower depth. For the other 3 cancers, the reasons of lower sequencing coverage of ancestrally African exomes remain unknown. Furthermore, even when the sequencing depths were comparable, African exomes had disproportionally higher percentages of positions with insufficient coverage, likely because of the known European bias in the human reference genome that impacted exome capture kit design. CONCLUSIONS Overall and positional lower sequencing depths of ancestrally African exomes in The Cancer Genome Atlas led to underdetection and lower quality of variants, highlighting the need to consider epidemiological factors for future genomics studies.
Collapse
Affiliation(s)
- Daniel P Wickland
- Department of Quantitative Health Sciences, Mayo Clinic, Jacksonville, FL, USA
| | - Mark E Sherman
- Department of Quantitative Health Sciences, Mayo Clinic, Jacksonville, FL, USA
| | - Derek C Radisky
- Department of Cancer Biology, Mayo Clinic, Jacksonville, FL, USA
| | | | - Yan W Asmann
- Correspondence to: Yan W. Asmann, PhD, Department of Quantitative Health Sciences, Mayo Clinic, 4500 San Pablo Rd S, Jacksonville, FL 32224, USA (e-mail: )
| |
Collapse
|
6
|
Zhao C, Teng J, Zhang X, Wang D, Zhang X, Li S, Jiang X, Li H, Ning C, Zhang Q. Towards a Cost-Effective Implementation of Genomic Prediction Based on Low Coverage Whole Genome Sequencing in Dezhou Donkey. Front Genet 2021; 12:728764. [PMID: 34804115 PMCID: PMC8595392 DOI: 10.3389/fgene.2021.728764] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 09/20/2021] [Indexed: 11/25/2022] Open
Abstract
Low-coverage whole genome sequencing is a low-cost genotyping technology. Combined with genotype imputation approaches, it is likely to become a critical component of cost-effective genomic selection programs in agricultural livestock. Here, we used the low-coverage sequence data of 617 Dezhou donkeys to investigate the performance of genotype imputation for low-coverage whole genome sequence data and genomic prediction based on the imputed genotype data. The specific aims were as follows: 1) to measure the accuracy of genotype imputation under different sequencing depths, sample sizes, minor allele frequency (MAF), and imputation pipelines and 2) to assess the accuracy of genomic prediction under different marker densities derived from the imputed sequence data, different strategies for constructing the genomic relationship matrixes, and single-vs. multi-trait models. We found that a high imputation accuracy (>0.95) can be achieved for sequence data with a sequencing depth as low as 1x and the number of sequenced individuals ≥400. For genomic prediction, the best performance was obtained by using a marker density of 410K and a G matrix constructed using expected marker dosages. Multi-trait genomic best linear unbiased prediction (GBLUP) performed better than single-trait GBLUP. Our study demonstrates that low-coverage whole genome sequencing would be a cost-effective approach for genomic prediction in Dezhou donkey.
Collapse
Affiliation(s)
- Changheng Zhao
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Jun Teng
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Xinhao Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China.,National Engineering Research Center for Gelatin-based TCM, Dong-E E-Jiao Co., Ltd., Dong'e County, China
| | - Dan Wang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Xinyi Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Shiyin Li
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Xin Jiang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Haijing Li
- National Engineering Research Center for Gelatin-based TCM, Dong-E E-Jiao Co., Ltd., Dong'e County, China
| | - Chao Ning
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| |
Collapse
|
7
|
Breton G, Johansson ACV, Sjödin P, Schlebusch CM, Jakobsson M. Comparison of sequencing data processing pipelines and application to underrepresented African human populations. BMC Bioinformatics 2021; 22:488. [PMID: 34627144 PMCID: PMC8502359 DOI: 10.1186/s12859-021-04407-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Accepted: 09/27/2021] [Indexed: 11/10/2022] Open
Abstract
Background Population genetic studies of humans make increasing use of high-throughput sequencing in order to capture diversity in an unbiased way. There is an abundance of sequencing technologies, bioinformatic tools and the available genomes are increasing in number. Studies have evaluated and compared some of these technologies and tools, such as the Genome Analysis Toolkit (GATK) and its “Best Practices” bioinformatic pipelines. However, studies often focus on a few genomes of Eurasian origin in order to detect technical issues. We instead surveyed the use of the GATK tools and established a pipeline for processing high coverage full genomes from a diverse set of populations, including Sub-Saharan African groups, in order to reveal challenges from human diversity and stratification. Results We surveyed 29 studies using high-throughput sequencing data, and compared their strategies for data pre-processing and variant calling. We found that processing of data is very variable across studies and that the GATK “Best Practices” are seldom followed strictly. We then compared three versions of a GATK pipeline, differing in the inclusion of an indel realignment step and with a modification of the base quality score recalibration step. We applied the pipelines on a diverse set of 28 individuals. We compared the pipelines in terms of count of called variants and overlap of the callsets. We found that the pipelines resulted in similar callsets, in particular after callset filtering. We also ran one of the pipelines on a larger dataset of 179 individuals. We noted that including more individuals at the joint genotyping step resulted in different counts of variants. At the individual level, we observed that the average genome coverage was correlated to the number of variants called. Conclusions We conclude that applying the GATK “Best Practices” pipeline, including their recommended reference datasets, to underrepresented populations does not lead to a decrease in the number of called variants compared to alternative pipelines. We recommend to aim for coverage of > 30X if identifying most variants is important, and to work with large sample sizes at the variant calling stage, also for underrepresented individuals and populations. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04407-x.
Collapse
Affiliation(s)
- Gwenna Breton
- Human Evolution, Department of Organismal Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18C, 752 36, Uppsala, Sweden.
| | - Anna C V Johansson
- Department of Cell and Molecular Biology, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Uppsala University, Husargatan 3, 752 37, Uppsala, Sweden
| | - Per Sjödin
- Human Evolution, Department of Organismal Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18C, 752 36, Uppsala, Sweden
| | - Carina M Schlebusch
- Human Evolution, Department of Organismal Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18C, 752 36, Uppsala, Sweden.,Palaeo-Research Institute, University of Johannesburg, P.O. Box 524, Auckland Park, 2006, South Africa.,Science for Life Laboratory, Uppsala, Sweden
| | - Mattias Jakobsson
- Human Evolution, Department of Organismal Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18C, 752 36, Uppsala, Sweden. .,Palaeo-Research Institute, University of Johannesburg, P.O. Box 524, Auckland Park, 2006, South Africa. .,Science for Life Laboratory, Uppsala, Sweden.
| |
Collapse
|
8
|
Masoero L, Camerlenghi F, Favaro S, Broderick T. More for less: predicting and maximizing genomic variant discovery via Bayesian nonparametrics. Biometrika 2021. [DOI: 10.1093/biomet/asab012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Summary
While the cost of sequencing genomes has decreased dramatically in recent years, this expense often remains nontrivial. Under a fixed budget, scientists face a natural trade-off between quantity and quality: spending resources to sequence a greater number of genomes or spending resources to sequence genomes with increased accuracy. Our goal is to find the optimal allocation of resources between quantity and quality. Optimizing resource allocation promises to reveal as many new variations in the genome as possible. We introduce a Bayesian nonparametric methodology to predict the number of new variants in a follow-up study based on a pilot study. When experimental conditions are kept constant between the pilot and follow-up, we find that our prediction is competitive with the best existing methods. Unlike current methods, though, our new method allows practitioners to change experimental conditions between the pilot and the follow-up. We demonstrate how this distinction allows our method to be used for more realistic predictions and for optimal allocation of a fixed budget between quality and quantity. We validate our method on cancer and human genomics data.
Collapse
|
9
|
Cheng S, Xu Z, Liu Y, Lin J, Jiang Y, Wang Y, Meng X, Wang A, Huang X, Wang Z, Chen G, Wu S, Jia Z, Chen Y, Qiu X, Wu J, Song B, Ji W, An Z, Xue W, Zhao L, Geng Y, Li H, Li H, Wang Y. Whole genome sequencing of 10K patients with acute ischaemic stroke or transient ischaemic attack: design, methods and baseline patient characteristics. Stroke Vasc Neurol 2020; 6:291-297. [PMID: 33443231 PMCID: PMC8258062 DOI: 10.1136/svn-2020-000664] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2020] [Revised: 10/29/2020] [Accepted: 11/17/2020] [Indexed: 12/16/2022] Open
Abstract
Background and purpose Stroke is the second leading cause of death worldwide and the leading cause of mortality and long-term disability in China, but its underlying risk genes and pathways are far from being comprehensively understood. We here describe the design and methods of whole genome sequencing (WGS) for 10 914 patients with acute ischaemic stroke or transient ischaemic attack from the Third China National Stroke Registry (CNSR-III). Methods Baseline clinical characteristics of the included patients in this study were reported. DNA was extracted from white blood cells of participants. Libraries are constructed using qualified DNA, and WGS is conducted on BGISEQ-500 platform. The average depth is intended to be greater than 30× for each subject. Afterwards, Sentieon software is applied to process the sequencing data under the Genome Analysis Toolkit best practice guidance to call genotypes of single nucleotide variants (SNVs) and insertion-deletions. For each included subject, 21 fingerprint SNVs are genotyped by MassARRAY assays to verify that DNA sample and sequencing data originate from the same individual. The copy number variations and structural variations are also called for each patient. All of the genetic variants are annotated and predicted by bioinformatics software or by reviewing public databases. Results The average age of the included 10 914 patients was 62.2±11.3 years, and 31.4% patients were women. Most of the baseline clinical characteristics of the 10 914 and the excluded patients were balanced. Conclusions The WGS data together with abundant clinical and imaging data of CNSR-III could provide opportunity to elucidate the molecular mechanisms and discover novel therapeutic targets for stroke.
Collapse
Affiliation(s)
- Si Cheng
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China.,China National Clinical Research Center for Neurological Diseases, Beijing, China.,Advanced Innovation Center for Human Brain Protection, Capital Medical University, Beijing, China
| | - Zhe Xu
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China.,China National Clinical Research Center for Neurological Diseases, Beijing, China.,Advanced Innovation Center for Human Brain Protection, Capital Medical University, Beijing, China
| | - Yang Liu
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China.,China National Clinical Research Center for Neurological Diseases, Beijing, China.,Advanced Innovation Center for Human Brain Protection, Capital Medical University, Beijing, China
| | - Jinxi Lin
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China.,China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Yong Jiang
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China.,China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Yilong Wang
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China.,China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Xia Meng
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China.,China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Anxin Wang
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China.,China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Xinying Huang
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China.,China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Zhimin Wang
- Department of Neurology, The First people's Hospital of Taizhou, Taizhou, China
| | - Guohua Chen
- Department of Neurology, Wuhan First Hospital, Wuhan, China
| | - Songdi Wu
- Department of Neurology, The First People's Hospital of Xi'an, Xi'an, China
| | - Zhengchang Jia
- Department of Neurology, The Second People's Hospital of Jinzhong, Jinzhong, China
| | - Yongming Chen
- Department of Neurology, WuYuan County People's Hospital, Bayannur, China
| | - Xuerong Qiu
- Department of Neurology, Qiqihar City Rongjian Stroke Prevention and Treatment Institute, Qiqihar, China
| | - Jun Wu
- Department of Neurology, Peking University Shenzhen Hospital, Shenzhen, China
| | - Binbin Song
- Department of Neurology, Luoyang Central Hospital, Luoyang, China
| | - Weizhong Ji
- Department of Neurology, Qinghai Provincial People's Hospital, Xining, China
| | - Zhongping An
- Department of Neurology, Tianjin Huanhu Hospital, Tianjin, China
| | - Wenjun Xue
- Department of Neurology, Pingdingshan First People's Hospital, Pingdingshan, China
| | - Lili Zhao
- Department of Neurology, Changzhi People's Hospital, Changzhi, China
| | - Yu Geng
- Department of Neurology, Zhejiang Provincial People's Hospital, Hangzhou, China
| | - Hongyan Li
- Department of Neurology, Xinjiang Uygur Autonomous Region People's Hospital, Urumqi, China
| | - Hao Li
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China.,China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Yongjun Wang
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China .,China National Clinical Research Center for Neurological Diseases, Beijing, China.,Advanced Innovation Center for Human Brain Protection, Capital Medical University, Beijing, China
| |
Collapse
|
10
|
Wang F, Huang S, Gao R, Zhou Y, Lai C, Li Z, Xian W, Qian X, Li Z, Huang Y, Tang Q, Liu P, Chen R, Liu R, Li X, Tong X, Zhou X, Bai Y, Duan G, Zhang T, Xu X, Wang J, Yang H, Liu S, He Q, Jin X, Liu L. Initial whole-genome sequencing and analysis of the host genetic contribution to COVID-19 severity and susceptibility. Cell Discov 2020; 6:83. [PMID: 33298875 PMCID: PMC7653987 DOI: 10.1038/s41421-020-00231-4] [Citation(s) in RCA: 126] [Impact Index Per Article: 31.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Accepted: 10/03/2020] [Indexed: 02/08/2023] Open
Abstract
The COVID-19 pandemic has accounted for millions of infections and hundreds of thousand deaths worldwide in a short-time period. The patients demonstrate a great diversity in clinical and laboratory manifestations and disease severity. Nonetheless, little is known about the host genetic contribution to the observed interindividual phenotypic variability. Here, we report the first host genetic study in the Chinese population by deeply sequencing and analyzing 332 COVID-19 patients categorized by varying levels of severity from the Shenzhen Third People's Hospital. Upon a total of 22.2 million genetic variants, we conducted both single-variant and gene-based association tests among five severity groups including asymptomatic, mild, moderate, severe, and critical ill patients after the correction of potential confounding factors. Pedigree analysis suggested a potential monogenic effect of loss of function variants in GOLGA3 and DPP7 for critically ill and asymptomatic disease demonstration. Genome-wide association study suggests the most significant gene locus associated with severity were located in TMEM189-UBE2V1 that involved in the IL-1 signaling pathway. The p.Val197Met missense variant that affects the stability of the TMPRSS2 protein displays a decreasing allele frequency among the severe patients compared to the mild and the general population. We identified that the HLA-A*11:01, B*51:01, and C*14:02 alleles significantly predispose the worst outcome of the patients. This initial genomic study of Chinese patients provides genetic insights into the phenotypic difference among the COVID-19 patient groups and highlighted genes and variants that may help guide targeted efforts in containing the outbreak. Limitations and advantages of the study were also reviewed to guide future international efforts on elucidating the genetic architecture of host-pathogen interaction for COVID-19 and other infectious and complex diseases.
Collapse
Affiliation(s)
- Fang Wang
- The Third People's Hospital of Shenzhen, National Clinical Research Center for Infectious Disease, The Second Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, 518112, China
| | - Shujia Huang
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
- School of Medicine, South China University of Technology, Guangzhou, Guangdong, 510006, China
| | - Rongsui Gao
- The Third People's Hospital of Shenzhen, National Clinical Research Center for Infectious Disease, The Second Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, 518112, China
| | - Yuwen Zhou
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, Guangdong, 518083, China
| | - Changxiang Lai
- The Third People's Hospital of Shenzhen, National Clinical Research Center for Infectious Disease, The Second Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, 518112, China
| | - Zhichao Li
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, Guangdong, 518083, China
| | - Wenjie Xian
- The Third People's Hospital of Shenzhen, National Clinical Research Center for Infectious Disease, The Second Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, 518112, China
| | - Xiaobo Qian
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, Guangdong, 518083, China
| | - Zhiyu Li
- The Third People's Hospital of Shenzhen, National Clinical Research Center for Infectious Disease, The Second Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, 518112, China
| | - Yushan Huang
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, Guangdong, 518083, China
| | - Qiyuan Tang
- The Third People's Hospital of Shenzhen, National Clinical Research Center for Infectious Disease, The Second Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, 518112, China
| | - Panhong Liu
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, Guangdong, 518083, China
| | - Ruikun Chen
- The Third People's Hospital of Shenzhen, National Clinical Research Center for Infectious Disease, The Second Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, 518112, China
| | - Rong Liu
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
| | - Xuan Li
- The Third People's Hospital of Shenzhen, National Clinical Research Center for Infectious Disease, The Second Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, 518112, China
| | - Xin Tong
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
| | - Xuan Zhou
- The Third People's Hospital of Shenzhen, National Clinical Research Center for Infectious Disease, The Second Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, 518112, China
| | - Yong Bai
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
| | - Gang Duan
- The Third People's Hospital of Shenzhen, National Clinical Research Center for Infectious Disease, The Second Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, 518112, China
| | - Tao Zhang
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
| | - Xun Xu
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen, Guangdong, 518120, China
| | - Jian Wang
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
- James D. Watson Institute of Genome Science, Hangzhou, Zhejiang, 310008, China
| | - Huanming Yang
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
- James D. Watson Institute of Genome Science, Hangzhou, Zhejiang, 310008, China
| | - Siyang Liu
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China.
| | - Qing He
- The Third People's Hospital of Shenzhen, National Clinical Research Center for Infectious Disease, The Second Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, 518112, China.
| | - Xin Jin
- BGI-Shenzhen, Shenzhen, Guangdong, 518083, China.
- School of Medicine, South China University of Technology, Guangzhou, Guangdong, 510006, China.
| | - Lei Liu
- The Third People's Hospital of Shenzhen, National Clinical Research Center for Infectious Disease, The Second Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, 518112, China.
| |
Collapse
|
11
|
Cao Y, Li L, Xu M, Feng Z, Sun X, Lu J, Xu Y, Du P, Wang T, Hu R, Ye Z, Shi L, Tang X, Yan L, Gao Z, Chen G, Zhang Y, Chen L, Ning G, Bi Y, Wang W. The ChinaMAP analytics of deep whole genome sequences in 10,588 individuals. Cell Res 2020; 30:717-731. [PMID: 32355288 PMCID: PMC7609296 DOI: 10.1038/s41422-020-0322-9] [Citation(s) in RCA: 137] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Accepted: 04/09/2020] [Indexed: 12/12/2022] Open
Abstract
Metabolic diseases are the most common and rapidly growing health issues worldwide. The massive population-based human genetics is crucial for the precise prevention and intervention of metabolic disorders. The China Metabolic Analytics Project (ChinaMAP) is based on cohort studies across diverse regions and ethnic groups with metabolic phenotypic data in China. Here, we describe the centralized analysis of the deep whole genome sequencing data and the genetic bases of metabolic traits in 10,588 individuals from the ChinaMAP. The frequency spectrum of variants, population structure, pathogenic variants and novel genomic characteristics were analyzed. The individual genetic evaluations of Mendelian diseases, nutrition and drug metabolism, and traits of blood glucose and BMI were integrated. Our study establishes a large-scale and deep resource for the genetics of East Asians and provides opportunities for novel genetic discoveries of metabolic characteristics and disorders.
Collapse
Affiliation(s)
- Yanan Cao
- National Clinical Research Centre for Metabolic Diseases, State Key Laboratory of Medical Genomics, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute for Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
- National Research Center for Translational Medicine, National Key Scientific Infrastructure for Translational Medicine (Shanghai), Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Lin Li
- National Clinical Research Centre for Metabolic Diseases, State Key Laboratory of Medical Genomics, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute for Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
- National Research Center for Translational Medicine, National Key Scientific Infrastructure for Translational Medicine (Shanghai), Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Min Xu
- National Clinical Research Centre for Metabolic Diseases, State Key Laboratory of Medical Genomics, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute for Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Zhimin Feng
- National Clinical Research Centre for Metabolic Diseases, State Key Laboratory of Medical Genomics, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute for Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Xiaohui Sun
- National Clinical Research Centre for Metabolic Diseases, State Key Laboratory of Medical Genomics, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute for Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Jieli Lu
- National Clinical Research Centre for Metabolic Diseases, State Key Laboratory of Medical Genomics, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute for Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Yu Xu
- National Clinical Research Centre for Metabolic Diseases, State Key Laboratory of Medical Genomics, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute for Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Peina Du
- National Clinical Research Centre for Metabolic Diseases, State Key Laboratory of Medical Genomics, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute for Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Tiange Wang
- National Clinical Research Centre for Metabolic Diseases, State Key Laboratory of Medical Genomics, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute for Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Ruying Hu
- Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, 310006, Zhejiang, China
| | - Zhen Ye
- Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, 310006, Zhejiang, China
| | - Lixin Shi
- Affiliated Hospital of Guiyang Medical College, Guiyang, 550004, Guizhou, China
| | - Xulei Tang
- The First Hospital of Lanzhou University, Lanzhou, 730000, Gansu, China
| | - Li Yan
- Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, 510120, Guangdong, China
| | - Zhengnan Gao
- Dalian Municipal Central Hospital, Dalian, 116033, Liaoning, China
| | - Gang Chen
- Fujian Provincial Hospital, Fujian Medical University, Fuzhou, 350001, Fujian, China
| | - Yinfei Zhang
- Central Hospital of Shanghai Jiading District, Shanghai, 201800, China
| | - Lulu Chen
- Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, Hubei, China
| | - Guang Ning
- National Clinical Research Centre for Metabolic Diseases, State Key Laboratory of Medical Genomics, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute for Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.
| | - Yufang Bi
- National Clinical Research Centre for Metabolic Diseases, State Key Laboratory of Medical Genomics, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute for Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.
| | - Weiqing Wang
- National Clinical Research Centre for Metabolic Diseases, State Key Laboratory of Medical Genomics, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute for Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.
| |
Collapse
|
12
|
Jiang Y, Jiang Y, Wang S, Zhang Q, Ding X. Optimal sequencing depth design for whole genome re-sequencing in pigs. BMC Bioinformatics 2019; 20:556. [PMID: 31703550 PMCID: PMC6839175 DOI: 10.1186/s12859-019-3164-z] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Accepted: 10/16/2019] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND As whole-genome sequencing is becoming a routine technique, it is important to identify a cost-effective depth of sequencing for such studies. However, the relationship between sequencing depth and biological results from the aspects of whole-genome coverage, variant discovery power and the quality of variants is unclear, especially in pigs. We sequenced the genomes of three Yorkshire boars at an approximately 20X depth on the Illumina HiSeq X Ten platform and downloaded whole-genome sequencing data for three Duroc and three Landrace pigs with an approximately 20X depth for each individual. Then, we downsampled the deep genome data by extracting twelve different proportions of 0.05, 0.1, 0.15, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8 and 0.9 paired reads from the original bam files to mimic the sequence data of the same individuals at sequencing depths of 1.09X, 2.18X, 3.26X, 4.35X, 6.53X, 8.70X, 10.88X, 13.05X, 15.22X, 17.40X, 19.57X and 21.75X to evaluate the influence of genome coverage, the variant discovery rate and genotyping accuracy as a function of sequencing depth. In addition, SNP chip data for Yorkshire pigs were used as a validation for the comparison of single-sample calling and multisample calling algorithms. RESULTS Our results indicated that 10X is an ideal practical depth for achieving plateau coverage and discovering accurate variants, which achieved greater than 99% genome coverage. The number of false-positive variants was increased dramatically at a depth of less than 4X, which covered 95% of the whole genome. In addition, the comparison of multi- and single-sample calling showed that multisample calling was more sensitive than single-sample calling, especially at lower depths. The number of variants discovered under multisample calling was 13-fold and 2-fold higher than that under single-sample calling at 1X and 22X, respectively. A large difference was observed when the depth was less than 4.38X. However, more false-positive variants were detected under multisample calling. CONCLUSIONS Our research will inform important study design decisions regarding whole-genome sequencing depth. Our results will be helpful for choosing the appropriate depth to achieve the same power for studies performed under limited budgets.
Collapse
Affiliation(s)
- Yifan Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, 100193 China
| | - Yao Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, 100193 China
| | - Sheng Wang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, 100193 China
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Taian, 271001 China
| | - Xiangdong Ding
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, 100193 China
| |
Collapse
|
13
|
Large-Scale Whole-Genome Sequencing of Three Diverse Asian Populations in Singapore. Cell 2019; 179:736-749.e15. [DOI: 10.1016/j.cell.2019.09.019] [Citation(s) in RCA: 76] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2019] [Revised: 06/24/2019] [Accepted: 09/19/2019] [Indexed: 12/19/2022]
|
14
|
|
15
|
Carlson J, Locke AE, Flickinger M, Zawistowski M, Levy S, Myers RM, Boehnke M, Kang HM, Scott LJ, Li JZ, Zöllner S. Extremely rare variants reveal patterns of germline mutation rate heterogeneity in humans. Nat Commun 2018; 9:3753. [PMID: 30218074 PMCID: PMC6138700 DOI: 10.1038/s41467-018-05936-5] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 07/30/2018] [Indexed: 12/30/2022] Open
Abstract
A detailed understanding of the genome-wide variability of single-nucleotide germline mutation rates is essential to studying human genome evolution. Here, we use ~36 million singleton variants from 3560 whole-genome sequences to infer fine-scale patterns of mutation rate heterogeneity. Mutability is jointly affected by adjacent nucleotide context and diverse genomic features of the surrounding region, including histone modifications, replication timing, and recombination rate, sometimes suggesting specific mutagenic mechanisms. Remarkably, GC content, DNase hypersensitivity, CpG islands, and H3K36 trimethylation are associated with both increased and decreased mutation rates depending on nucleotide context. We validate these estimated effects in an independent dataset of ~46,000 de novo mutations, and confirm our estimates are more accurate than previously published results based on ancestrally older variants without considering genomic features. Our results thus provide the most refined portrait to date of the factors contributing to genome-wide variability of the human germline mutation rate.
Collapse
Affiliation(s)
- Jedidiah Carlson
- Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Adam E Locke
- McDonnell Genome Institute & Department of Medicine, Washington University, St. Louis, MO, 63108, USA
| | - Matthew Flickinger
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Matthew Zawistowski
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Shawn Levy
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Michael Boehnke
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Hyun Min Kang
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Laura J Scott
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Jun Z Li
- Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA.
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, 48109, USA.
| | - Sebastian Zöllner
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, 48109, USA.
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, 48109, USA.
| |
Collapse
|
16
|
Sazonovs A, Barrett JC. Rare-Variant Studies to Complement Genome-Wide Association Studies. Annu Rev Genomics Hum Genet 2018; 19:97-112. [PMID: 29801418 DOI: 10.1146/annurev-genom-083117-021641] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Genome-wide association studies (GWASs) have revolutionized human disease genetics by discovering tens of thousands of associations between common variants and complex diseases. In parallel, huge technological advances in DNA sequencing have made it possible to measure and analyze rare variation in populations. This review considers these two stories and how they have come together. We first review the history of GWASs and sequencing. We then consider how to understand the biological mechanisms that drive signals of strong association in the absence of rare-variant studies. We describe how rare-variant studies complement these approaches and highlight both data generation and statistical challenges in their interpretation. Finally, we consider how certain special study designs, such as those for families and isolated populations, fit in this paradigm.
Collapse
Affiliation(s)
- A Sazonovs
- Wellcome Sanger Institute, Cambridge CB10 1HH, United Kingdom;
| | - J C Barrett
- Wellcome Sanger Institute, Cambridge CB10 1HH, United Kingdom;
| |
Collapse
|
17
|
Pietraszkiewicz A, van Asten F, Kwong A, Ratnapriya R, Abecasis G, Swaroop A, Chew EY. Association of Rare Predicted Loss-of-Function Variants in Cellular Pathways with Sub-Phenotypes in Age-Related Macular Degeneration. Ophthalmology 2017; 125:398-406. [PMID: 29224928 DOI: 10.1016/j.ophtha.2017.10.027] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Revised: 09/29/2017] [Accepted: 10/17/2017] [Indexed: 11/17/2022] Open
Abstract
PURPOSE To investigate the association of rare predicted loss-of-function (pLoF) variants within age-related macular degeneration (AMD) risk loci and AMD sub-phenotypes. DESIGN Case-control study. PARTICIPANTS Participants of AREDS, AREDS2, and Michigan Genomics Initiative. METHODS Whole genome sequencing data were analyzed for rare pLoF variants (frequency <0.1%) in the regions of previously identified 52 independent risk variants known to be associated with AMD. Frequency of the rare pLoF variants in cases with intermediate or advanced AMD was compared with controls. Variants were assigned to the complement, extracellular matrix (ECM), lipid, cell survival, immune system, metabolism, or unknown/other pathway. Associations of rare pLoF variant pathways with AMD sub-phenotypes were analyzed using logistic and linear regression, and Cox proportional hazards models. MAIN OUTCOME MEASURES Differences in rare pLoF variant pathway burden and association of rare pLoF variant pathways with sub-phenotypes within the population with AMD were evaluated. RESULTS Rare pLoF variants were found in 298 of 1689 cases (17.6%) and 237 of 1518 controls (15.6%) (odds ratio [OR], 1.11; 95% confidence interval [CI], 0.91-1.36; P = 0.310). An enrichment of rare pLoF variants in the complement pathway in cases versus controls (OR, 2.94; 95% CI, 1.49-5.79; P = 0.002) was observed. Within cases, associations between all rare pLoF variants and choroidal neovascularization (CNV) (OR, 1.34; 95% CI, 1.04-1.73; P = 0.023), calcified drusen (OR, 1.33; 95% CI, 1.04-1.72; P = 0.025), higher scores on the AREDS Extended AMD Severity Scale (Standardized Coefficient Beta (β)=0.346 [0.086-0.605], P = 0.009), and progression to advanced disease (hazard ratio, 1.25; 95% CI, 1.01-1.55; P = 0.042) were observed. At the pathway level, there were associations between the complement pathway and geographic atrophy (GA) (OR, 2.17; 95% CI, 1.12-4.24; P = 0.023), the complement pathway and calcified drusen (OR, 3.75; 95% CI, 1.79-7.86; P < 0.001), and the ECM pathway and more severe levels in the AREDS Extended AMD Severity Scale (β = 0.62; 95% CI, 0.04-1.20; P = 0.035). CONCLUSIONS Rare pLoF variants are associated with disease progression. Variants in the complement pathway modify the clinical course of AMD and increase the risk of developing specific sub-phenotypes.
Collapse
Affiliation(s)
- Alexandra Pietraszkiewicz
- Neurobiology, Neurodegeneration and Repair Laboratory, National Eye Institute, National Institutes of Health, Bethesda, Maryland
| | - Freekje van Asten
- Neurobiology, Neurodegeneration and Repair Laboratory, National Eye Institute, National Institutes of Health, Bethesda, Maryland; Division of Epidemiology and Clinical Applications, National Eye Institute, National Institutes of Health, Bethesda, Maryland
| | - Alan Kwong
- Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan
| | - Rinki Ratnapriya
- Neurobiology, Neurodegeneration and Repair Laboratory, National Eye Institute, National Institutes of Health, Bethesda, Maryland
| | - Goncalo Abecasis
- Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan
| | - Anand Swaroop
- Neurobiology, Neurodegeneration and Repair Laboratory, National Eye Institute, National Institutes of Health, Bethesda, Maryland
| | - Emily Y Chew
- Division of Epidemiology and Clinical Applications, National Eye Institute, National Institutes of Health, Bethesda, Maryland.
| |
Collapse
|