1
|
Sequeira JJ, Vinuthalakshmi K, Das R, van Driem G, Mustak MS. The maternal U1 haplogroup in the Koraga tribe as a correlate of their North Dravidian linguistic affinity. Front Genet 2024; 14:1303628. [PMID: 38384360 PMCID: PMC10880486 DOI: 10.3389/fgene.2023.1303628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 12/31/2023] [Indexed: 02/23/2024] Open
Abstract
Introduction: The Koraga tribe are an isolated endogamous tribal group found in the southwest coastal region of India. The Koraga language shares inherited grammatical features with North Dravidian languages. To seek a possible genetic basis for this exceptionality and understand the maternal lineage pattern, we have aimed to reconstruct the inter-population and intra-population relationships of the Koraga tribal population by using mtDNA markers for the hypervariable regions along with a partial coding region sequence analysis. Methods and Results: Amongst the 96 individuals studied, we observe 11 haplogroups, of which a few are shared and others are unique to the clans Soppu, Oṇṭi and Kuṇṭu. In addition to several deep rooted Indian-specific lineages of macrohaplogroups M and U, we observe a high frequency of the U1 lineage (∼38%), unique to the Koraga. A Bayesian analysis of the U1 clade shows that the Koraga tribe share their maternal lineage with ancestral populations of the Caucasus at the cusp of the Last Glacial Maximum. Discussion: Our study suggests that the U1 lineage found in the Indian subcontinent represents a remnant of a post-glacial dispersal. The presence of West Asian U1 when viewed along with historical linguistics leads us to hypothesise that Koraga represents a mother tongue retained by a vanquished population group that fled southward at the demise of the Indus civilisation as opposed to a father tongue, associated with a particular paternal lineage.
Collapse
Affiliation(s)
| | | | - Ranajit Das
- Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, Karnataka, India
| | - George van Driem
- Institut für Sprachwissenschaft, Universität Bern, Bern, Switzerland
| | - Mohammed S. Mustak
- Department of Applied Zoology, Mangalore University, Mangalore, Karnataka, India
| |
Collapse
|
2
|
Jones SW, Ball AL, Chadwick AE, Alfirevic A. The Role of Mitochondrial DNA Variation in Drug Response: A Systematic Review. Front Genet 2021; 12:698825. [PMID: 34484295 PMCID: PMC8416105 DOI: 10.3389/fgene.2021.698825] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Accepted: 07/14/2021] [Indexed: 01/11/2023] Open
Abstract
Background: The triad of drug efficacy, toxicity and resistance underpins the risk-benefit balance of all therapeutics. The application of pharmacogenomics has the potential to improve the risk-benefit balance of a given therapeutic via the stratification of patient populations based on DNA variants. A growth in the understanding of the particulars of the mitochondrial genome, alongside the availability of techniques for its interrogation has resulted in a growing body of literature examining the impact of mitochondrial DNA (mtDNA) variation upon drug response. Objective: To critically evaluate and summarize the available literature, across a defined period, in a systematic fashion in order to map out the current landscape of the subject area and identify how the field may continue to advance. Methods: A systematic review of the literature published between January 2009 and December 2020 was conducted using the PubMed database with the following key inclusion criteria: reference to specific mtDNA polymorphisms or haplogroups, a core objective to examine associations between mtDNA variants and drug response, and research performed using human subjects or human in vitro models. Results: Review of the literature identified 24 articles reporting an investigation of the association between mtDNA variant(s) and drug efficacy, toxicity or resistance that met the key inclusion criteria. This included 10 articles examining mtDNA variations associated with antiretroviral therapy response, 4 articles examining mtDNA variants associated with anticancer agent response and 4 articles examining mtDNA variants associated with antimicrobial agent response. The remaining articles covered a wide breadth of medications and were therefore grouped together and referred to as "other." Conclusions: Investigation of the impact of mtDNA variation upon drug response has been sporadic to-date. Collective assessment of the associations identified in the articles was inconclusive due to heterogeneous methods and outcomes, limited racial/ethnic groups, lack of replication and inadequate statistical power. There remains a high degree of idiosyncrasy in drug response and this area has the potential to explain variation in drug response in a clinical setting, therefore further research is likely to be of clinical benefit.
Collapse
Affiliation(s)
- Samantha W. Jones
- Department of Pharmacology and Therapeutics, MRC Centre for Drug Safety Science, University of Liverpool, Liverpool, United Kingdom
| | - Amy L. Ball
- Department of Pharmacology and Therapeutics, MRC Centre for Drug Safety Science, University of Liverpool, Liverpool, United Kingdom
| | - Amy E. Chadwick
- Department of Pharmacology and Therapeutics, MRC Centre for Drug Safety Science, University of Liverpool, Liverpool, United Kingdom
| | - Ana Alfirevic
- Department of Pharmacology and Therapeutics, Wolfson Centre for Personalised Medicine, University of Liverpool, Liverpool, United Kingdom
| |
Collapse
|
3
|
Kolmykov S, Vasiliev G, Osadchuk L, Kleschev M, Osadchuk A. Whole-Exome Sequencing Analysis of Human Semen Quality in Russian Multiethnic Population. Front Genet 2021; 12:662846. [PMID: 34178030 PMCID: PMC8232892 DOI: 10.3389/fgene.2021.662846] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 04/19/2021] [Indexed: 01/12/2023] Open
Abstract
The global trend toward the reduction of human spermatogenic function observed in many countries, including Russia, raised the problem of extensive screening and monitoring of male fertility and elucidation of its genetic and ethnic mechanisms. Recently, whole-exome sequencing (WES) was developed as a powerful tool for genetic analysis of complex traits. We present here the first Russian WES study for identification of new genes associated with semen quality. The experimental 3 × 2 design of the WES study was based on the analysis of 157 samples including three ethnic groups—Slavs (59), Buryats (n = 49), and Yakuts (n = 49), and two different semen quality groups—pathozoospermia (n = 95) and normospermia (n = 62). Additionally, our WES study group was negative for complete AZF microdeletions of the Y-chromosome. The normospermia group included men with normal sperm parameters in accordance with the WHO-recommended reference limit. The pathozoospermia group included men with impaired semen quality, namely, with any combined parameters of sperm concentration <15 × 106/ml, and/or progressive motility <32%, and/or normal morphology <4%. The WES was performed for all 157 samples. Subsequent calling and filtering of variants were carried out according to the GATK Best Practices recommendations. On the genotyping stage, the samples were combined into four cohorts: three sets corresponded to three ethnic groups, and the fourth set contained all the 157 whole-exome samples. Association of the obtained polymorphisms with semen quality parameters was investigated using the χ2 test. To prioritize the obtained variants associated with pathozoospermia, their effects were determined using Ensembl Variant Effect Predictor. Moreover, polymorphisms located in genes expressed in the testis were revealed based on the genomic annotation. As a result, the nine potential SNP markers rs6971091, rs557806, rs610308, rs556052, rs1289658, rs278981, rs1129172, rs12268007, and rs17228441 were selected for subsequent verification on our previously collected population sample (about 1,500 males). The selected variants located in seven genes FAM71F1, PPP1R15A, TRIM45, PRAME, RBM47, WDFY4, and FSIP2 that are expressed in the testis and play an important role in cell proliferation, meiosis, and apoptosis.
Collapse
Affiliation(s)
- Semyon Kolmykov
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia.,Department of Computational Biology, Sirius University of Science and Technology, Sochi, Russia
| | - Gennady Vasiliev
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
| | - Ludmila Osadchuk
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
| | - Maxim Kleschev
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
| | - Alexander Osadchuk
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
| |
Collapse
|
4
|
Daw Elbait G, Henschel A, Tay GK, Al Safar HS. A Population-Specific Major Allele Reference Genome From The United Arab Emirates Population. Front Genet 2021; 12:660428. [PMID: 33968136 PMCID: PMC8102833 DOI: 10.3389/fgene.2021.660428] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 03/19/2021] [Indexed: 12/30/2022] Open
Abstract
The ethnic composition of the population of a country contributes to the uniqueness of each national DNA sequencing project and, ideally, individual reference genomes are required to reduce the confounding nature of ethnic bias. This work represents a representative Whole Genome Sequencing effort of an understudied population. Specifically, high coverage consensus sequences from 120 whole genomes and 33 whole exomes were used to construct the first ever population specific major allele reference genome for the United Arab Emirates (UAE). When this was applied and compared to the archetype hg19 reference, assembly of local Emirati genomes was reduced by ∼19% (i.e., some 1 million fewer calls). In compiling the United Arab Emirates Reference Genome (UAERG), sets of annotated 23,038,090 short (novel: 1,790,171) and 137,713 structural (novel: 8,462) variants; their allele frequencies (AFs) and distribution across the genome were identified. Population-specific genetic characteristics including loss-of-function variants, admixture, and ancestral haplogroup distribution were identified and reported here. We also detect a strong correlation between F ST and admixture components in the UAE. This baseline study was conceived to establish a high-quality reference genome and a genetic variations resource to enable the development of regional population specific initiatives and thus inform the application of population studies and precision medicine in the UAE.
Collapse
Affiliation(s)
- Gihan Daw Elbait
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Andreas Henschel
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
- Department of Electrical Engineering and Computer Science, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Guan K. Tay
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
- Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
- Division of Psychiatry, Faculty of Health and Medical Sciences, The University of Western Australia, Crawley, WA, Australia
- School of Medical and Health Sciences, Edith Cowan University, Joondalup, WA, Australia
| | - Habiba S. Al Safar
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
- Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
- Department of Genetics and Molecular Biology, College of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| |
Collapse
|
5
|
Khan SY, Ali M, Lee MCW, Ma Z, Biswas P, Khan AA, Naeem MA, Riazuddin S, Riazuddin S, Ayyagari R, Hejtmancik JF, Riazuddin SA. Whole genome sequencing data of multiple individuals of Pakistani descent. Sci Data 2020; 7:350. [PMID: 33051442 PMCID: PMC7555865 DOI: 10.1038/s41597-020-00664-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Accepted: 09/02/2020] [Indexed: 11/25/2022] Open
Abstract
Here we report whole genome sequencing of four individuals (H3, H4, H5, and H6) from a family of Pakistani descent. Whole genome sequencing yielded 1084.92, 894.73, 1068.62, and 1005.77 million mapped reads corresponding to 162.73, 134.21, 160.29, and 150.86 Gb sequence data and 52.49x, 43.29x, 51.70x, and 48.66x average coverage for H3, H4, H5, and H6, respectively. We identified 3,529,659, 3,478,495, 3,407,895, and 3,426,862 variants in the genomes of H3, H4, H5, and H6, respectively, including 1,668,024 variants common in the four genomes. Further, we identified 42,422, 39,824, 28,599, and 35,206 novel variants in the genomes of H3, H4, H5, and H6, respectively. A major fraction of the variants identified in the four genomes reside within the intergenic regions of the genome. Single nucleotide polymorphism (SNP) genotype based comparative analysis with ethnic populations of 1000 Genomes database linked the ancestry of all four genomes with the South Asian populations, which was further supported by mitochondria based haplogroup analysis. In conclusion, we report whole genome sequencing of four individuals of Pakistani descent. Measurement(s) | SNV • genome | Technology Type(s) | whole genome sequencing • DNA sequencing | Factor Type(s) | individual | Sample Characteristic - Organism | Homo sapiens | Sample Characteristic - Location | Pakistan |
Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.12642761
Collapse
Affiliation(s)
- Shahid Y Khan
- The Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, 21287, USA
| | - Muhammad Ali
- The Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, 21287, USA
| | - Mei-Chong W Lee
- Department of Computer Science, San José State University, San José, CA, 95192, USA
| | - Zhiwei Ma
- Ophthalmic Genetics and Visual Function Branch, National Eye Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Pooja Biswas
- Shiley Eye Institute, University of California San Diego, La Jolla, CA, 92093, USA
| | - Asma A Khan
- National Centre of Excellence in Molecular Biology, University of the Punjab, Lahore, 53700, Pakistan
| | - Muhammad Asif Naeem
- National Centre of Excellence in Molecular Biology, University of the Punjab, Lahore, 53700, Pakistan
| | - Saima Riazuddin
- Department of Otorhinolaryngology-Head & Neck Surgery, University of Maryland School Medicine, Baltimore, MD, 21201, USA
| | - Sheikh Riazuddin
- National Centre of Excellence in Molecular Biology, University of the Punjab, Lahore, 53700, Pakistan.,Allama Iqbal Medical College, University of Health Sciences, Lahore, 54550, Pakistan.,Department of Molecular Biology, Shaheed Zulfiqar Ali Bhutto Medical University, Islamabad, 44080, Pakistan
| | - Radha Ayyagari
- Shiley Eye Institute, University of California San Diego, La Jolla, CA, 92093, USA
| | - J Fielding Hejtmancik
- Ophthalmic Genetics and Visual Function Branch, National Eye Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - S Amer Riazuddin
- The Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, 21287, USA.
| |
Collapse
|
6
|
Sivasubbu S, Scaria V. Genomics of rare genetic diseases-experiences from India. Hum Genomics 2019; 14:52. [PMID: 31554517 PMCID: PMC6760067 DOI: 10.1186/s40246-019-0215-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Accepted: 06/26/2019] [Indexed: 12/15/2022] Open
Abstract
Home to a culturally heterogeneous population, India is also a melting pot of genetic diversity. The population architecture characterized by multiple endogamous groups with specific marriage patterns, including the widely prevalent practice of consanguinity, not only makes the Indian population distinct from rest of the world but also provides a unique advantage and niche to understand genetic diseases. Centuries of genetic isolation of population groups have amplified the founder effects, contributing to high prevalence of recessive alleles, which translates into genetic diseases, including rare genetic diseases in India.Rare genetic diseases are becoming a public health concern in India because a large population size of close to a billion people would essentially translate to a huge disease burden for even the rarest of the rare diseases. Genomics-based approaches have been demonstrated to accelerate the diagnosis of rare genetic diseases and reduce the socio-economic burden. The Genomics for Understanding Rare Diseases: India Alliance Network (GUaRDIAN) stands for providing genomic solutions for rare diseases in India. The consortium aims to establish a unique collaborative framework in health care planning, implementation, and delivery in the specific area of rare genetic diseases. It is a nation-wide collaborative research initiative catering to rare diseases across multiple cohorts, with over 240 clinician/scientist collaborators across 70 major medical/research centers. Within the GUaRDIAN framework, clinicians refer rare disease patients, generate whole genome or exome datasets followed by computational analysis of the data for identifying the causal pathogenic variations. The outcomes of GUaRDIAN are being translated as community services through a suitable platform providing low-cost diagnostic assays in India. In addition to GUaRDIAN, several genomic investigations for diseased and healthy population are being undertaken in the country to solve the rare disease dilemma.In summary, rare diseases contribute to a significant disease burden in India. Genomics-based solutions can enable accelerated diagnosis and management of rare diseases. We discuss how a collaborative research initiative such as GUaRDIAN can provide a nation-wide framework to cater to the rare disease community of India.
Collapse
Affiliation(s)
| | - Sridhar Sivasubbu
- CSIR Institute of Genomics and Integrative Biology, Delhi, 110025, India.
| | - Vinod Scaria
- CSIR Institute of Genomics and Integrative Biology, Delhi, 110025, India.
| |
Collapse
|
7
|
Khan SY, Kabir F, M'Hamdi O, Jiao X, Naeem MA, Khan SN, Riazuddin S, Hejtmancik JF, Riazuddin SA. Whole genome sequencing data for two individuals of Pakistani descent. Sci Data 2018; 5:180174. [PMID: 30204152 PMCID: PMC6137601 DOI: 10.1038/sdata.2018.174] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2018] [Accepted: 07/11/2018] [Indexed: 12/30/2022] Open
Abstract
Here we report next-generation based whole genome sequencing of two individuals (H1 and H2) from a family of Pakistani descent. The genomic DNA was used to prepare paired-end libraries for whole-genome sequencing. Deep sequencing yielded 706.49 and 778.12 million mapped reads corresponding to 70.64 and 77.81 Gb sequence data and 23× and 25× average coverage for H1 and H2, respectively. Notably, a total of 448,544 and 470,683 novel variants, not present in the single nucleotide polymorphism database (dbSNP), were identified in H1 and H2, respectively. Comparative analysis identified 2,415,852 variants common in both genomes including 240,181 variants absent in the dbSNP. Principal component analysis linked the ancestry of both genomes with South Asian populations. In conclusion, we report whole genome sequences of two individuals from a family of Pakistani descent.
Collapse
Affiliation(s)
- Shahid Y Khan
- The Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Firoz Kabir
- The Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Oussama M'Hamdi
- Ophthalmic Genetics and Visual Function Branch, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Xiaodong Jiao
- Ophthalmic Genetics and Visual Function Branch, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Muhammad Asif Naeem
- National Centre of Excellence in Molecular Biology, University of the Punjab, Lahore 53700, Pakistan
| | - Shaheen N Khan
- National Centre of Excellence in Molecular Biology, University of the Punjab, Lahore 53700, Pakistan
| | - Sheikh Riazuddin
- National Centre of Excellence in Molecular Biology, University of the Punjab, Lahore 53700, Pakistan
| | - J Fielding Hejtmancik
- Ophthalmic Genetics and Visual Function Branch, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - S Amer Riazuddin
- The Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| |
Collapse
|
8
|
Malhotra S, Singh S, Sarkar S. Whole genome variant analysis in three ethnically diverse Indians. Genes Genomics 2018; 40:497-510. [PMID: 29892955 DOI: 10.1007/s13258-018-0650-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2017] [Accepted: 01/02/2018] [Indexed: 12/21/2022]
Abstract
India represents an amazing confluence of geographically, linguistically and socially disparate ethnic populations (Indian Genome Variation Consortium, J Genet 87:3-20, 2008). Understanding the genetic diversity of Indian population remains a daunting task. In this paper we present detailed analysis of genomic variations (high-depth coverage (~ 30×) using Illumina Hiseq 2000 platform) from three healthy Indian male individuals each belonging to three geographically delineated regions and linguistic phylum viz. high altitude region of Ladakh (Tibeto-Burman linguistic phylum), sub mountainous region of Kumaun (Indo-European linguistic phylum) and sea level region of Telangana (Dravidian linguistic phylum) for probing the extent of genetic diversity in our population. The sequencing analysis provided high quality data (~ 95% of the total reads aligned to the human reference genome for each sample) and very good alignment quality (> 80% of the filtered mapped reads had a quality score of 60). A total of 4.3, 3.7 and 4.3 million single nucleotide variations were identified in the genome of high altitude, sub mountainous and sea level respectively by comparing with human reference genome. Approximately 17.3, 18.2, 17.4% of the variants were unique in the three genomes. The study identified many novel variations in the three diverse genomes (132,970 in Ladakh, 112,317 in Kumaun and 128,881 in Telangana individual) and is an important resource for creating a baseline and a comprehensive catalogue of human genomic variation across the Indian as well as the Asian continent.
Collapse
Affiliation(s)
- Seema Malhotra
- Defence Institute of Physiology and Allied Sciences (DIPAS), Defence Research and Development Organization, Ministry of Defence, Government of India, Lucknow Road, Delhi, 110054, India
| | - Sayar Singh
- Defence Institute of Physiology and Allied Sciences (DIPAS), Defence Research and Development Organization, Ministry of Defence, Government of India, Lucknow Road, Delhi, 110054, India
| | - Soma Sarkar
- Defence Institute of Physiology and Allied Sciences (DIPAS), Defence Research and Development Organization, Ministry of Defence, Government of India, Lucknow Road, Delhi, 110054, India.
| |
Collapse
|
9
|
Hariprakash JM, Vellarikkal SK, Verma A, Ranawat AS, Jayarajan R, Ravi R, Kumar A, Dixit V, Sivadas A, Kashyap AK, Senthivel V, Sehgal P, Mahadevan V, Scaria V, Sivasubbu S. SAGE: a comprehensive resource of genetic variants integrating South Asian whole genomes and exomes. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:1-10. [PMID: 30184194 PMCID: PMC6146123 DOI: 10.1093/database/bay080] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/05/2018] [Accepted: 07/03/2018] [Indexed: 11/20/2022]
Abstract
South Asia is home to \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{upgreek}
\usepackage{mathrsfs}
\setlength{\oddsidemargin}{-69pt}
\begin{document}
}{}$\sim $\end{document}20% of the world population and characterized by distinct ethnic, linguistic, cultural and genetic lineages. Only limited representative samples from the region have found its place in large population-scale international genome projects. The recent availability of genome scale data from multiple populations and datasets from South Asian countries in public domain motivated us to integrate the data into a comprehensive resource. In the present study, we have integrated a total of six datasets encompassing 1213 human exomes and genomes to create a compendium of 154 814 557 genetic variants and adding a total of 69 059 255 novel variants. The variants were systematically annotated using public resources and along with the allele frequencies are available as a browsable-online resource South Asian genomes and exomes. As a proof of principle application of the data and resource for genetic epidemiology, we have analyzed the pathogenic genetic variants causing retinitis pigmentosa. Our analysis reveals the genetic landscape of the disease and suggests subset of genetic variants to be highly prevalent in South Asia.
Collapse
Affiliation(s)
- Judith Mary Hariprakash
- GN Ramachandran Knowledge Center for Genome Informatics, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Shamsudheen Karuthedath Vellarikkal
- Genomics & Molecular Medicine, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Ankit Verma
- Genomics & Molecular Medicine, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Anop Singh Ranawat
- GN Ramachandran Knowledge Center for Genome Informatics, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Rijith Jayarajan
- Genomics & Molecular Medicine, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Rowmika Ravi
- Genomics & Molecular Medicine, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Anoop Kumar
- Genomics & Molecular Medicine, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Vishal Dixit
- Genomics & Molecular Medicine, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Ambily Sivadas
- GN Ramachandran Knowledge Center for Genome Informatics, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Atul Kumar Kashyap
- Genomics & Molecular Medicine, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Vigneshwar Senthivel
- Genomics & Molecular Medicine, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Paras Sehgal
- Genomics & Molecular Medicine, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Vijayalakshmi Mahadevan
- School of Chemical & Biotechnology, Shanmugha Arts, Science, Technology and Research Academy (SASTRA) University, Thanjavur, Tamil Nadu 613402, India
| | - Vinod Scaria
- GN Ramachandran Knowledge Center for Genome Informatics, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| | - Sridhar Sivasubbu
- Genomics & Molecular Medicine, Council of Scientific and Industrial Research (CSIR) Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110025, India
| |
Collapse
|
10
|
Gilbert JR, Taylor GM, Losee JE, Mooney MP, Cooper GM. Resequencing of the Col1A1 gene of Oryctolagus cuniculus identifies splicing errors and single nucleotide polymorphisms. Genes Genomics 2017. [DOI: 10.1007/s13258-017-0521-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
11
|
Kraker J, Viswanathan SK, Knöll R, Sadayappan S. Recent Advances in the Molecular Genetics of Familial Hypertrophic Cardiomyopathy in South Asian Descendants. Front Physiol 2016; 7:499. [PMID: 27840609 PMCID: PMC5083855 DOI: 10.3389/fphys.2016.00499] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2016] [Accepted: 10/12/2016] [Indexed: 12/14/2022] Open
Abstract
The South Asian population, numbered at 1.8 billion, is estimated to comprise around 20% of the global population and 1% of the American population, and has one of the highest rates of cardiovascular disease. While South Asians show increased classical risk factors for developing heart failure, the role of population-specific genetic risk factors has not yet been examined for this group. Hypertrophic cardiomyopathy (HCM) is one of the major cardiac genetic disorders among South Asians, leading to contractile dysfunction, heart failure, and sudden cardiac death. This disease displays autosomal dominant inheritance, and it is associated with a large number of variants in both sarcomeric and non-sarcomeric proteins. The South Asians, a population with large ethnic diversity, potentially carries region-specific polymorphisms. There is high variability in disease penetrance and phenotypic expression of variants associated with HCM. Thus, extensive studies are required to decipher pathogenicity and the physiological mechanisms of these variants, as well as the contribution of modifier genes and environmental factors to disease phenotypes. Conducting genotype-phenotype correlation studies will lead to improved understanding of HCM and, consequently, improved treatment options for this high-risk population. The objective of this review is to report the history of cardiovascular disease and HCM in South Asians, present previously published pathogenic variants, and introduce current efforts to study HCM using induced pluripotent stem cell-derived cardiomyocytes, next-generation sequencing, and gene editing technologies. The authors ultimately hope that this review will stimulate further research, drive novel discoveries, and contribute to the development of personalized medicine with the aim of expanding therapeutic strategies for HCM.
Collapse
Affiliation(s)
- Jessica Kraker
- Department of Internal Medicine, Heart, Lung and Vascular Institute, Division of Cardiovascular Health and Sciences, University of Cincinnati College of Medicine Cincinnati, OH, USA
| | - Shiv Kumar Viswanathan
- Department of Internal Medicine, Heart, Lung and Vascular Institute, Division of Cardiovascular Health and Sciences, University of Cincinnati College of Medicine Cincinnati, OH, USA
| | - Ralph Knöll
- AstraZeneca R&D Mölndal, Innovative Medicines and Early Development, Cardiovascular and Metabolic Diseases iMedMölndal, Sweden; Integrated Cardio Metabolic Centre, Karolinska Institutet, Myocardial Genetics, Karolinska University Hospital in HuddingeHuddinge, Sweden
| | - Sakthivel Sadayappan
- Department of Internal Medicine, Heart, Lung and Vascular Institute, Division of Cardiovascular Health and Sciences, University of Cincinnati College of Medicine Cincinnati, OH, USA
| |
Collapse
|
12
|
Dutta UR. The history of human cytogenetics in India-A review. Gene 2016; 589:112-7. [PMID: 26850130 DOI: 10.1016/j.gene.2016.01.052] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2015] [Revised: 01/13/2016] [Accepted: 01/27/2016] [Indexed: 10/22/2022]
Abstract
It is 60years since the discovery of the correct number of chromosomes in 1956; the field of cytogenetics had evolved. The late evolution of this field with respect to other fields is primarily due to the underdevelopment of lenses and imaging techniques. With the advent of the new technologies, especially automation and evolution of advanced compound microscopes, cytogenetics drastically leaped further to greater heights. This review describes the historic events that had led to the development of human cytogenetics with a special attention about the history of cytogenetics in India, its present status, and future. Apparently, this review provides a brief account into the insights of the early laboratory establishments, funding, and the German collaborations. The details of the Indian cytogeneticists establishing their labs, promoting the field, and offering the chromosomal diagnostic services are described. The detailed study of chromosomes helps in increasing the knowledge of the chromosome structure and function. The delineation of the chromosomal rearrangements using cytogenetics and molecular cytogenetic techniques pays way in identifying the molecular mechanisms involved in the chromosomal rearrangement. Although molecular cytogenetics is greatly developing, the conventional cytogenetics still remains the gold standard in the diagnosis of various numerical chromosomal aberrations and a few structural aberrations. The history of cytogenetics and its importance even in the era of molecular cytogenetics are discussed.
Collapse
Affiliation(s)
- Usha R Dutta
- Diagnostics Division, Center for DNA Fingerprinting and Diagnostics, Tuljaguda Complex, 4-1-714, Hyderabad 500 001, Andhra-Pradesh, India.
| |
Collapse
|
13
|
Ilyas M, Kim JS, Cooper J, Shin YA, Kim HM, Cho YS, Hwang S, Kim H, Moon J, Chung O, Jun J, Rastogi A, Song S, Ko J, Manica A, Rahman Z, Husnain T, Bhak J. Whole genome sequencing of an ethnic Pathan (Pakhtun) from the north-west of Pakistan. BMC Genomics 2015; 16:172. [PMID: 25887915 PMCID: PMC4362645 DOI: 10.1186/s12864-015-1290-1] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2014] [Accepted: 01/29/2015] [Indexed: 11/10/2022] Open
Abstract
Background Pakistan covers a key geographic area in human history, being both part of the Indus River region that acted as one of the cradles of civilization and as a link between Western Eurasia and Eastern Asia. This region is inhabited by a number of distinct ethnic groups, the largest being the Punjabi, Pathan (Pakhtuns), Sindhi, and Baloch. Results We analyzed the first ethnic male Pathan genome by sequencing it to 29.7-fold coverage using the Illumina HiSeq2000 platform. A total of 3.8 million single nucleotide variations (SNVs) and 0.5 million small indels were identified by comparing with the human reference genome. Among the SNVs, 129,441 were novel, and 10,315 nonsynonymous SNVs were found in 5,344 genes. SNVs were annotated for health consequences and high risk diseases, as well as possible influences on drug efficacy. We confirmed that the Pathan genome presented here is representative of this ethnic group by comparing it to a panel of Central Asians from the HGDP-CEPH panels typed for ~650 k SNPs. The mtDNA (H2) and Y haplogroup (L1) of this individual were also typical of his geographic region of origin. Finally, we reconstruct the demographic history by PSMC, which highlights a recent increase in effective population size compatible with admixture between European and Asian lineages expected in this geographic region. Conclusions We present a whole-genome sequence and analyses of an ethnic Pathan from the north-west province of Pakistan. It is a useful resource to understand genetic variation and human migration across the whole Asian continent. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1290-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Muhammad Ilyas
- National Centre of Excellence in Molecular Biology, University of the Punjab, Lahore, Pakistan. .,Personal Genomics Institute, Genome Research Foundation, Suwon, Republic of Korea.
| | - Jong-Soo Kim
- Theragen Bio Institute, TheragenEtex, Suwon, Republic of Korea.
| | - Jesse Cooper
- Theragen Bio Institute, TheragenEtex, Suwon, Republic of Korea.
| | - Young-Ah Shin
- Theragen Bio Institute, TheragenEtex, Suwon, Republic of Korea.
| | - Hak-Min Kim
- Personal Genomics Institute, Genome Research Foundation, Suwon, Republic of Korea. .,The Genomics Institute, Biomedical Engineering Department, UNIST, Ulsan, Republic of Korea.
| | - Yun Sung Cho
- Personal Genomics Institute, Genome Research Foundation, Suwon, Republic of Korea. .,The Genomics Institute, Biomedical Engineering Department, UNIST, Ulsan, Republic of Korea.
| | - Seungwoo Hwang
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea.
| | - Hyunho Kim
- The Genomics Institute, Biomedical Engineering Department, UNIST, Ulsan, Republic of Korea.
| | - Jaewoo Moon
- Theragen Bio Institute, TheragenEtex, Suwon, Republic of Korea.
| | - Oksung Chung
- Personal Genomics Institute, Genome Research Foundation, Suwon, Republic of Korea.
| | - JeHoon Jun
- Personal Genomics Institute, Genome Research Foundation, Suwon, Republic of Korea.
| | - Achal Rastogi
- Personal Genomics Institute, Genome Research Foundation, Suwon, Republic of Korea.
| | - Sanghoon Song
- Theragen Bio Institute, TheragenEtex, Suwon, Republic of Korea.
| | - Junsu Ko
- Theragen Bio Institute, TheragenEtex, Suwon, Republic of Korea.
| | - Andrea Manica
- Department of Zoology, University of Cambridge, Downing Street, Cambridge, CB2 3EJ, UK.
| | - Ziaur Rahman
- National Centre of Excellence in Molecular Biology, University of the Punjab, Lahore, Pakistan.
| | - Tayyab Husnain
- National Centre of Excellence in Molecular Biology, University of the Punjab, Lahore, Pakistan.
| | - Jong Bhak
- Personal Genomics Institute, Genome Research Foundation, Suwon, Republic of Korea. .,Theragen Bio Institute, TheragenEtex, Suwon, Republic of Korea. .,The Genomics Institute, Biomedical Engineering Department, UNIST, Ulsan, Republic of Korea.
| |
Collapse
|
14
|
Li Z, Wu X, He B, Zhang L. Vindel: a simple pipeline for checking indel redundancy. BMC Bioinformatics 2014; 15:359. [PMID: 25407965 PMCID: PMC4245841 DOI: 10.1186/s12859-014-0359-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2014] [Accepted: 10/23/2014] [Indexed: 12/30/2022] Open
Abstract
Background With the advance of next generation sequencing (NGS) technologies, a large number of insertion and deletion (indel) variants have been identified in human populations. Despite much research into variant calling, it has been found that a non-negligible proportion of the identified indel variants might be false positives due to sequencing errors, artifacts caused by ambiguous alignments, and annotation errors. Results In this paper, we examine indel redundancy in dbSNP, one of the central databases for indel variants, and develop a standalone computational pipeline, dubbed Vindel, to detect redundant indels. The pipeline first applies indel position information to form candidate redundant groups, then performs indel mutations to the reference genome to generate corresponding indel variant substrings. Finally the indel variant substrings in the same candidate redundant groups are compared in a pairwise fashion to identify redundant indels. We applied our pipeline to check for redundancy in the human indels in dbSNP. Our pipeline identified approximately 8% redundancy in insertion type indels, 12% in deletion type indels, and overall 10% for insertions and deletions combined. These numbers are largely consistent across all human autosomes. We also investigated indel size distribution and adjacent indel distance distribution for a better understanding of the mechanisms generating indel variants. Conclusions Vindel, a simple yet effective computational pipeline, can be used to check whether a set of indels are redundant with respect to those already in the database of interest such as NCBI’s dbSNP. Of the approximately 5.9 million indels we examined, nearly 0.6 million are redundant, revealing a serious limitation in the current indel annotation. Statistics results prove the consistency of the pipeline on indel redundancy detection for all 22 chromosomes. Apart from the standalone Vindel pipeline, the indel redundancy check algorithm is also implemented in the web server http://bioinformatics.cs.vt.edu/zhanglab/indelRedundant.php. Electronic supplementary material The online version of this article (doi:10.1186/s12859-014-0359-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Zhiyi Li
- Department of Computer Science, Virginia Tech, Blacksburg, VA, 24061, USA.
| | - Xiaowei Wu
- Department of Statistics, Virginia Tech, Blacksburg, VA, 24061, USA.
| | - Bin He
- Department of Computer Science, Virginia Tech, Blacksburg, VA, 24061, USA.
| | - Liqing Zhang
- Department of Computer Science, Virginia Tech, Blacksburg, VA, 24061, USA.
| |
Collapse
|
15
|
Esplin ED, Oei L, Snyder MP. Personalized sequencing and the future of medicine: discovery, diagnosis and defeat of disease. Pharmacogenomics 2014; 15:1771-1790. [PMID: 25493570 DOI: 10.2217/pgs.14.117] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
The potential for personalized sequencing to individually optimize medical treatment in diseases such as cancer and for pharmacogenomic application is just beginning to be realized, and the utility of sequencing healthy individuals for managing health is also being explored. The data produced requires additional advancements in interpretation of variants of unknown significance to maximize clinical benefit. Nevertheless, personalized sequencing, only recently applied to clinical medicine, has already been broadly applied to the discovery and study of disease. It is poised to enable the earlier and more accurate diagnosis of disease risk and occurrence, guide prevention and individualized intervention as well as facilitate monitoring of healthy and treated patients, and play a role in the prevention and recurrence of future disease. This article documents the advancing capacity of personalized sequencing, reviews its impact on disease-oriented scientific discovery and anticipates its role in the future of medicine.
Collapse
Affiliation(s)
- Edward D Esplin
- 300 Pasteur Drive, Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | | | | |
Collapse
|
16
|
Wan Juhari WK, Md Tamrin NA, Mat Daud MHR, Isa HW, Mohd Nasir N, Maran S, Abdul Rajab NS, Ahmad Amin Noordin KB, Nik Hassan NN, Tearle R, Razali R, Merican AF, Zilfalil BA. A whole genome analyses of genetic variants in two Kelantan Malay individuals. THE HUGO JOURNAL 2014; 8:4. [PMID: 27090252 PMCID: PMC4685156 DOI: 10.1186/s11568-014-0004-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 05/04/2014] [Accepted: 09/19/2014] [Indexed: 12/29/2022]
Abstract
Background The sequencing of two members of the Royal Kelantan Malay family genomes will provide insights on the Kelantan Malay whole genome sequences. The two Kelantan Malay genomes were analyzed for the SNP markers associated with thalassemia and Helicobacter pylori infection. Helicobacter pylori infection was reported to be low prevalence in the north-east as compared to the west coast of the Peninsular Malaysia and beta-thalassemia was known to be one of the most common inherited and genetic disorder in Malaysia. Result By combining SNP information from literatures, GWAS study and NCBI ClinVar, 18 unique SNPs were selected for further analysis. From these 18 SNPs, 10 SNPs came from previous study of Helicobacter pylori infection among Malay patients, 6 SNPs were from NCBI ClinVar and 2 SNPs from GWAS studies. The analysis reveals that both Royal Kelantan Malay genomes shared all the 10 SNPs identified by Maran (Single Nucleotide Polymorphims (SNPs) genotypic profiling of Malay patients with and without Helicobacter pylori infection in Kelantan, 2011) and one SNP from GWAS study. In addition, the analysis also reveals that both Royal Kelantan Malay genomes shared 3 SNP markers; HBG1 (rs1061234), HBB (rs1609812) and BCL11A (rs766432) where all three markers were associated with beta-thalassemia. Conclusions Our findings suggest that the Royal Kelantan Malays carry the SNPs which are associated with protection to Helicobacter pylori infection. In addition they also carry SNPs which are associated with beta-thalassemia. These findings are in line with the findings by other researchers who conducted studies on thalassemia and Helicobacter pylori infection in the non-royal Malay population.
Collapse
Affiliation(s)
- Wan Khairunnisa Wan Juhari
- Department of Pediatrics, School of Medical Sciences, Universiti Sains Malaysia, 16150 Kubang Kerian, Kelantan, Malaysia
| | - Nur Aida Md Tamrin
- Faculty of Resource Science and, Technology Universiti Malaysia Sarawak, Sarawak, Malaysia
| | | | - Hatin Wan Isa
- Human Genome Center, School of Medical Sciences, Universiti Sains Malaysia, Universiti Sains Malaysia, Kelantan, Malaysia
| | - Nurfazreen Mohd Nasir
- Human Genome Center, School of Medical Sciences, Universiti Sains Malaysia, Universiti Sains Malaysia, Kelantan, Malaysia
| | - Sathiya Maran
- Human Genome Center, School of Medical Sciences, Universiti Sains Malaysia, Universiti Sains Malaysia, Kelantan, Malaysia
| | - Nur Shafawati Abdul Rajab
- Human Genome Center, School of Medical Sciences, Universiti Sains Malaysia, Universiti Sains Malaysia, Kelantan, Malaysia
| | | | | | - Rick Tearle
- Complete Genomics Inc, 2071 Stierlin Court, Mountain View, 94043, CA, USA
| | | | - Amir Feisal Merican
- Centre of Research for Computational Sciences and Informatics in Biology, Bioindustry, Environment, Agriculture and Healthcare (CRYSTAL), Kuala Lumpur, Malaysia.,Institute of Biological Science, Faculty of Science, Universiti Malaya, Kuala Lumpur, Malaysia
| | - Bin Alwi Zilfalil
- Department of Pediatrics, School of Medical Sciences, Universiti Sains Malaysia, 16150 Kubang Kerian, Kelantan, Malaysia.
| |
Collapse
|
17
|
Chambers JC, Abbott J, Zhang W, Turro E, Scott WR, Tan ST, Afzal U, Afaq S, Loh M, Lehne B, O'Reilly P, Gaulton KJ, Pearson RD, Li X, Lavery A, Vandrovcova J, Wass MN, Miller K, Sehmi J, Oozageer L, Kooner IK, Al-Hussaini A, Mills R, Grewal J, Panoulas V, Lewin AM, Northwood K, Wander GS, Geoghegan F, Li Y, Wang J, Aitman TJ, McCarthy MI, Scott J, Butcher S, Elliott P, Kooner JS. The South Asian genome. PLoS One 2014; 9:e102645. [PMID: 25115870 PMCID: PMC4130493 DOI: 10.1371/journal.pone.0102645] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2013] [Accepted: 06/21/2014] [Indexed: 12/15/2022] Open
Abstract
The genetic sequence variation of people from the Indian subcontinent who comprise one-quarter of the world's population, is not well described. We carried out whole genome sequencing of 168 South Asians, along with whole-exome sequencing of 147 South Asians to provide deeper characterisation of coding regions. We identify 12,962,155 autosomal sequence variants, including 2,946,861 new SNPs and 312,738 novel indels. This catalogue of SNPs and indels amongst South Asians provides the first comprehensive map of genetic variation in this major human population, and reveals evidence for selective pressures on genes involved in skin biology, metabolism, infection and immunity. Our results will accelerate the search for the genetic variants underlying susceptibility to disorders such as type-2 diabetes and cardiovascular disease which are highly prevalent amongst South Asians.
Collapse
Affiliation(s)
- John C. Chambers
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
- Imperial College Healthcare NHS Trust, London, United Kingdom
- MRC-HPA Centre for Environment and Health, Imperial College London, Norfolk Place, London, United Kingdom
- Ealing Hospital NHS Trust, Southall, Middlesex, United Kingdom
| | - James Abbott
- Centre for Integrative Systems Biology and Bioinformatics, Imperial College London, London, United Kingdom
| | - Weihua Zhang
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
- Ealing Hospital NHS Trust, Southall, Middlesex, United Kingdom
| | - Ernest Turro
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
- Computational Biology and Statistics, University of Cambridge, Cambridge, United Kingdom
| | - William R. Scott
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
| | - Sian-Tsung Tan
- Ealing Hospital NHS Trust, Southall, Middlesex, United Kingdom
- NHLI, Imperial College London, Hammersmith Hospital, London, United Kingdom
| | - Uzma Afzal
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
| | - Saima Afaq
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
| | - Marie Loh
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
| | - Benjamin Lehne
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
| | - Paul O'Reilly
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
| | - Kyle J. Gaulton
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Richard D. Pearson
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Xinzhong Li
- Institute of Clinical Sciences, Imperial College London, London, United Kingdom
- Royal Brompton and Harefield Hospitals NHS Trust, London, United Kingdom
| | - Anita Lavery
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
| | - Jana Vandrovcova
- MRC Clinical Sciences Centre, Imperial College London, London, United Kingdom
| | - Mark N. Wass
- Centre for Integrative Systems Biology and Bioinformatics, Imperial College London, London, United Kingdom
| | - Kathryn Miller
- Ealing Hospital NHS Trust, Southall, Middlesex, United Kingdom
| | - Joban Sehmi
- Ealing Hospital NHS Trust, Southall, Middlesex, United Kingdom
- NHLI, Imperial College London, Hammersmith Hospital, London, United Kingdom
| | | | | | - Abtehale Al-Hussaini
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
| | - Rebecca Mills
- Ealing Hospital NHS Trust, Southall, Middlesex, United Kingdom
| | - Jagvir Grewal
- Ealing Hospital NHS Trust, Southall, Middlesex, United Kingdom
| | | | - Alexandra M. Lewin
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
| | - Korrinne Northwood
- MRC Clinical Sciences Centre, Imperial College London, London, United Kingdom
| | - Gurpreet S. Wander
- Hero DMC Heart Institute, Dayanand Medical College and Hospital, Ludhiana, India
| | - Frank Geoghegan
- Ealing Hospital NHS Trust, Southall, Middlesex, United Kingdom
| | | | | | - Timothy J. Aitman
- MRC Clinical Sciences Centre, Imperial College London, London, United Kingdom
| | - Mark I. McCarthy
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Oxford Centre for Diabetes, Endocrinology & Metabolism, University of Oxford, Churchill Hospital, Oxford, United Kingdom
- Oxford NIHR Biomedical Research Centre, Churchill Hospital, Oxford, United Kingdom
| | - James Scott
- NHLI, Imperial College London, Hammersmith Hospital, London, United Kingdom
| | - Sarah Butcher
- Centre for Integrative Systems Biology and Bioinformatics, Imperial College London, London, United Kingdom
| | - Paul Elliott
- Epidemiology and Biostatistics, Imperial College London, Norfolk Place, London, United Kingdom
- Imperial College Healthcare NHS Trust, London, United Kingdom
- MRC-HPA Centre for Environment and Health, Imperial College London, Norfolk Place, London, United Kingdom
| | - Jaspal S. Kooner
- Imperial College Healthcare NHS Trust, London, United Kingdom
- Ealing Hospital NHS Trust, Southall, Middlesex, United Kingdom
- NHLI, Imperial College London, Hammersmith Hospital, London, United Kingdom
| |
Collapse
|
18
|
Sato N, Htun NC, Daimon M, Tamiya G, Kato T, Kubota I, Ueno Y, Yamashita H, Fukao A, Kayama T, Muramatsu M. Likelihood ratio-based integrated personal risk assessment of type 2 diabetes. Endocr J 2014; 61:967-88. [PMID: 25069673 DOI: 10.1507/endocrj.ej14-0271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
To facilitate personalized health care for multifactorial diseases, risks of genetic and clinical/environmental factors should be assessed together for each individual in an integrated fashion. This approach is possible with the likelihood ratio (LR)-based risk assessment system, as this system can incorporate manifold tests. We examined the usefulness of this system for assessing type 2 diabetes (T2D). Our system employed 29 genetic susceptibility variants, body mass index (BMI), and hypertension as risk factors whose LRs can be estimated from openly available T2D association data for the Japanese population. The pretest probability was set at a sex- and age-appropriate population average of diabetes prevalence. The classification performance of our LR-based risk assessment was compared to that of a non-invasive screening test for diabetes called TOPICS (with score based on age, sex, family history, smoking, BMI, and hypertension) using receiver operating characteristic analysis with a community cohort (n = 1263). The area under the receiver operating characteristic curve (AUC) for the LR-based assessment and TOPICS was 0.707 (95% CI 0.665-0.750) and 0.719 (0.675-0.762), respectively. These AUCs were much higher than that of a genetic risk score constructed using the same genetic susceptibility variants, 0.624 (0.574-0.674). The use of ethnically matched LRs is necessary for proper personal risk assessment. In conclusion, although LR-based integrated risk assessment for T2D still requires additional tests that evaluate other factors, such as risks involved in missing heritability, our results indicate the potential usability of LR-based assessment system and stress the importance of stratified epidemiological investigations in personalized medicine.
Collapse
Affiliation(s)
- Noriko Sato
- Department of Epigenetic Epidemiology/ Molecular Epidemiology, Medical Research Institute, Tokyo Medical and Dental University, Tokyo 101-0062, Japan
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Ribeiro-dos-Santos AM, de Souza JES, Almeida R, Alencar DO, Barbosa MS, Gusmão L, Silva WA, de Souza SJ, Silva A, Ribeiro-dos-Santos Â, Darnet S, Santos S. High-throughput sequencing of a South American Amerindian. PLoS One 2013; 8:e83340. [PMID: 24386182 PMCID: PMC3875439 DOI: 10.1371/journal.pone.0083340] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2013] [Accepted: 10/30/2013] [Indexed: 11/18/2022] Open
Abstract
The emergence of next-generation sequencing technologies allowed access to the vast amounts of information that are contained in the human genome. This information has contributed to the understanding of individual and population-based variability and improved the understanding of the evolutionary history of different human groups. However, the genome of a representative of the Amerindian populations had not been previously sequenced. Thus, the genome of an individual from a South American tribe was completely sequenced to further the understanding of the genetic variability of Amerindians. A total of 36.8 giga base pairs (Gbp) were sequenced and aligned with the human genome. These Gbp corresponded to 95.92% of the human genome with an estimated miscall rate of 0.0035 per sequenced bp. The data obtained from the alignment were used for SNP (single-nucleotide) and INDEL (insertion-deletion) calling, which resulted in the identification of 502,017 polymorphisms, of which 32,275 were potentially new high-confidence SNPs and 33,795 new INDELs, specific of South Native American populations. The authenticity of the sample as a member of the South Native American populations was confirmed through the analysis of the uniparental (maternal and paternal) lineages. The autosomal comparison distinguished the investigated sample from others continental populations and revealed a close relation to the Eastern Asian populations and Aboriginal Australian. Although, the findings did not discard the classical model of America settlement; it brought new insides to the understanding of the human population history. The present study indicates a remarkable genetic variability in human populations that must still be identified and contributes to the understanding of the genetic variability of South Native American populations and of the human populations history.
Collapse
Affiliation(s)
| | - Jorge Estefano Santana de Souza
- Centro Regional de Hemoterapia, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo, Brazil
- Institute of Bioinformatics and Biotechnology, São Paulo, São Paulo, Brazil
| | - Renan Almeida
- Institute of Bioinformatics and Biotechnology, São Paulo, São Paulo, Brazil
| | - Dayse O. Alencar
- Instituto de Ciências Biológicas, Universidade Federal do Pará, Belém, Pará, Brazil
| | | | - Leonor Gusmão
- Instituto de Ciências Biológicas, Universidade Federal do Pará, Belém, Pará, Brazil
- Institute of Molecular Pathology and Immunology, University of Porto, Porto, Portugal
| | - Wilson A. Silva
- Centro Regional de Hemoterapia, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo, Brazil
- Departamento de Genética, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo, Brazil
| | - Sandro J. de Souza
- Institute of Bioinformatics and Biotechnology, São Paulo, São Paulo, Brazil
- Brain Institute, Universidade Federal do Rio Grande do Norte, Natal, Rio Grande do Norte, Brazil
| | - Artur Silva
- Instituto de Ciências Biológicas, Universidade Federal do Pará, Belém, Pará, Brazil
| | | | - Sylvain Darnet
- Instituto de Ciências Biológicas, Universidade Federal do Pará, Belém, Pará, Brazil
| | - Sidney Santos
- Instituto de Ciências Biológicas, Universidade Federal do Pará, Belém, Pará, Brazil
- * E-mail: /
| |
Collapse
|
20
|
Salleh MZ, Teh LK, Lee LS, Ismet RI, Patowary A, Joshi K, Pasha A, Ahmed AZ, Janor RM, Hamzah AS, Adam A, Yusoff K, Hoh BP, Hatta FHM, Ismail MI, Scaria V, Sivasubbu S. Systematic pharmacogenomics analysis of a Malay whole genome: proof of concept for personalized medicine. PLoS One 2013; 8:e71554. [PMID: 24009664 PMCID: PMC3751891 DOI: 10.1371/journal.pone.0071554] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2012] [Accepted: 07/01/2013] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND With a higher throughput and lower cost in sequencing, second generation sequencing technology has immense potential for translation into clinical practice and in the realization of pharmacogenomics based patient care. The systematic analysis of whole genome sequences to assess patient to patient variability in pharmacokinetics and pharmacodynamics responses towards drugs would be the next step in future medicine in line with the vision of personalizing medicine. METHODS Genomic DNA obtained from a 55 years old, self-declared healthy, anonymous male of Malay descent was sequenced. The subject's mother died of lung cancer and the father had a history of schizophrenia and deceased at the age of 65 years old. A systematic, intuitive computational workflow/pipeline integrating custom algorithm in tandem with large datasets of variant annotations and gene functions for genetic variations with pharmacogenomics impact was developed. A comprehensive pathway map of drug transport, metabolism and action was used as a template to map non-synonymous variations with potential functional consequences. PRINCIPAL FINDINGS Over 3 million known variations and 100,898 novel variations in the Malay genome were identified. Further in-depth pharmacogenetics analysis revealed a total of 607 unique variants in 563 proteins, with the eventual identification of 4 drug transport genes, 2 drug metabolizing enzyme genes and 33 target genes harboring deleterious SNVs involved in pharmacological pathways, which could have a potential role in clinical settings. CONCLUSIONS The current study successfully unravels the potential of personal genome sequencing in understanding the functionally relevant variations with potential influence on drug transport, metabolism and differential therapeutic outcomes. These will be essential for realizing personalized medicine through the use of comprehensive computational pipeline for systematic data mining and analysis.
Collapse
Affiliation(s)
- Mohd Zaki Salleh
- Integrative Pharmacogenomics Institute (iPROMISE), Universiti Teknologi MARA (UiTM) Malaysia, Puncak Alam, Selangor, Malaysia
- Faculty of Pharmacy, Universiti Teknologi MARA (UiTM) Malaysia, Puncak Alam, Selangor, Malaysia
| | - Lay Kek Teh
- Integrative Pharmacogenomics Institute (iPROMISE), Universiti Teknologi MARA (UiTM) Malaysia, Puncak Alam, Selangor, Malaysia
- Faculty of Pharmacy, Universiti Teknologi MARA (UiTM) Malaysia, Puncak Alam, Selangor, Malaysia
| | - Lian Shien Lee
- Integrative Pharmacogenomics Institute (iPROMISE), Universiti Teknologi MARA (UiTM) Malaysia, Puncak Alam, Selangor, Malaysia
| | - Rose Iszati Ismet
- Integrative Pharmacogenomics Institute (iPROMISE), Universiti Teknologi MARA (UiTM) Malaysia, Puncak Alam, Selangor, Malaysia
| | - Ashok Patowary
- GN Ramachandran Knowledge Center for Genome Informatics, CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
| | - Kandarp Joshi
- Genomics and Molecular Medicine, CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
| | - Ayesha Pasha
- Genomics and Molecular Medicine, CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
| | - Azni Zain Ahmed
- Institute of Science, Universiti Teknologi MARA (UiTM) Malaysia, Shah Alam, Selangor, Malaysia
| | - Roziah Mohd Janor
- Faculty of Computer and Mathematical Science, Universiti Teknologi MARA (UiTM) Malaysia, Shah Alam, Selangor, Malaysia
| | - Ahmad Sazali Hamzah
- Institute of Science, Universiti Teknologi MARA (UiTM) Malaysia, Shah Alam, Selangor, Malaysia
| | - Aishah Adam
- Faculty of Pharmacy, Universiti Teknologi MARA (UiTM) Malaysia, Puncak Alam, Selangor, Malaysia
| | - Khalid Yusoff
- Faculty of Medicine, Universiti Teknologi MARA (UiTM) Malaysia, Sg Buloh, Selangor, Malaysia
| | - Boon Peng Hoh
- Insitute of Medical Molecular Biotechnology (IMMB), Faculty of Medicine, Universiti Teknologi MARA (UiTM) Malaysia, Sg Buloh, Selangor, Malaysia
| | | | - Mohamad Izwan Ismail
- Integrative Pharmacogenomics Institute (iPROMISE), Universiti Teknologi MARA (UiTM) Malaysia, Puncak Alam, Selangor, Malaysia
| | - Vinod Scaria
- GN Ramachandran Knowledge Center for Genome Informatics, CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
| | - Sridhar Sivasubbu
- Genomics and Molecular Medicine, CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
| |
Collapse
|
21
|
Patel CJ, Sivadas A, Tabassum R, Preeprem T, Zhao J, Arafat D, Chen R, Morgan AA, Martin GS, Brigham KL, Butte AJ, Gibson G. Whole genome sequencing in support of wellness and health maintenance. Genome Med 2013; 5:58. [PMID: 23806097 PMCID: PMC3967117 DOI: 10.1186/gm462] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2013] [Revised: 05/23/2013] [Accepted: 06/27/2013] [Indexed: 12/19/2022] Open
Abstract
Background Whole genome sequencing is poised to revolutionize personalized medicine, providing the capacity to classify individuals into risk categories for a wide range of diseases. Here we begin to explore how whole genome sequencing (WGS) might be incorporated alongside traditional clinical evaluation as a part of preventive medicine. The present study illustrates novel approaches for integrating genotypic and clinical information for assessment of generalized health risks and to assist individuals in the promotion of wellness and maintenance of good health. Methods Whole genome sequences and longitudinal clinical profiles are described for eight middle-aged Caucasian participants (four men and four women) from the Center for Health Discovery and Well Being (CHDWB) at Emory University in Atlanta. We report multivariate genotypic risk assessments derived from common variants reported by genome-wide association studies (GWAS), as well as clinical measures in the domains of immune, metabolic, cardiovascular, musculoskeletal, respiratory, and mental health. Results Polygenic risk is assessed for each participant for over 100 diseases and reported relative to baseline population prevalence. Two approaches for combining clinical and genetic profiles for the purposes of health assessment are then presented. First we propose conditioning individual disease risk assessments on observed clinical status for type 2 diabetes, coronary artery disease, hypertriglyceridemia and hypertension, and obesity. An approximate 2:1 ratio of concordance between genetic prediction and observed sub-clinical disease is observed. Subsequently, we show how more holistic combination of genetic, clinical and family history data can be achieved by visualizing risk in eight sub-classes of disease. Having identified where their profiles are broadly concordant or discordant, an individual can focus on individual clinical results or genotypes as they develop personalized health action plans in consultation with a health partner or coach. Conclusion The CHDWB will facilitate longitudinal evaluation of wellness-focused medical care based on comprehensive self-knowledge of medical risks.
Collapse
Affiliation(s)
- Chirag J Patel
- Division of Systems Medicine, Department of Pediatrics, Stanford University School of Medicine, 251 Campus Drive, Palo Alto, CA 94304, USA ; Lucille Packard Children's Hospital, 725 Welch Rd, Palo Alto, CA 94304, USA
| | - Ambily Sivadas
- School of Biology, Georgia Institute of Technology, 310 Ferst Drive, Atlanta GA 30332, USA
| | - Rubina Tabassum
- School of Biology, Georgia Institute of Technology, 310 Ferst Drive, Atlanta GA 30332, USA
| | - Thanawadee Preeprem
- School of Biology, Georgia Institute of Technology, 310 Ferst Drive, Atlanta GA 30332, USA
| | - Jing Zhao
- School of Biology, Georgia Institute of Technology, 310 Ferst Drive, Atlanta GA 30332, USA
| | - Dalia Arafat
- School of Biology, Georgia Institute of Technology, 310 Ferst Drive, Atlanta GA 30332, USA
| | - Rong Chen
- Division of Systems Medicine, Department of Pediatrics, Stanford University School of Medicine, 251 Campus Drive, Palo Alto, CA 94304, USA ; Lucille Packard Children's Hospital, 725 Welch Rd, Palo Alto, CA 94304, USA ; Personalis, Inc., 1350 Willow Rd Suite 202, Menlo Park, CA 94025, USA
| | - Alexander A Morgan
- Division of Systems Medicine, Department of Pediatrics, Stanford University School of Medicine, 251 Campus Drive, Palo Alto, CA 94304, USA ; Lucille Packard Children's Hospital, 725 Welch Rd, Palo Alto, CA 94304, USA
| | - Gregory S Martin
- Center for Health Discovery and Well Being, and School of Medicine, Emory University Midtown Hospital, 550 Peachtree St, Atlanta GA 30308, USA
| | - Kenneth L Brigham
- Center for Health Discovery and Well Being, and School of Medicine, Emory University Midtown Hospital, 550 Peachtree St, Atlanta GA 30308, USA
| | - Atul J Butte
- Division of Systems Medicine, Department of Pediatrics, Stanford University School of Medicine, 251 Campus Drive, Palo Alto, CA 94304, USA ; Lucille Packard Children's Hospital, 725 Welch Rd, Palo Alto, CA 94304, USA
| | - Greg Gibson
- School of Biology, Georgia Institute of Technology, 310 Ferst Drive, Atlanta GA 30332, USA ; Center for Health Discovery and Well Being, and School of Medicine, Emory University Midtown Hospital, 550 Peachtree St, Atlanta GA 30308, USA
| |
Collapse
|
22
|
Rajendran S, Muthupalani RS, Ramanathan A. Lack of RING finger domain (RFD) mutations of the c-cbl gene in oral squamous cell carcinomas in Chennai, India. Asian Pac J Cancer Prev 2013; 14:1073-5. [PMID: 23621189 DOI: 10.7314/apjcp.2013.14.2.1073] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In normal cells, activated epidermal growth factor receptor (EGFR) molecules are subjected to ubiquitination-mediated proteasome degradation pathway by c-Cbl, an ubiquitin ligase that checks uncontrolled proliferation. Hence expression of wild type c-Cbl molecule is essential to keep this degradation machinery in a functional state. Loss of expression or function of c-Cbl may consequently lead to sustained activation of EGFR and promote carcinogenesis, loss of function mutations in the c-Cbl gene already being reported in lung and hematopoietic cancers. However, the genetic status of c-Cbl in oral squamous cell carcinoma (OSCC) is not known. Hence in the present study we investigated the genomic DNA isolated from OSCC tissue biopsy samples for mutations in the RING finger domain coding region of c-Cbl gene, which has also been reported to be most frequently mutated in other cancers. MATERIALS AND METHODS Total genomic DNA isolated from thirty two post surgical OSCC tissue samples were amplified using primers flanking the exon 8 of c-Cbl gene that codes for the RING finger domain. The PCR amplicons were then resolved in a 1.2% agarose gel, purified and subjected to direct sequencing to screen for mutations. RESULTS The sequencing data of the thirty two OSCC samples did not identify mutations in the RING finger domain coding region of c-Cbl gene. CONCLUSIONS To the best of our knowledge, this is the first time that the genetic status of c-Cbl gene in OSCC samples has been investigated. The present data indicates that genetic alteration of RING finger domain coding region of c-Cbl gene is relatively infrequent in OSCC samples.
Collapse
Affiliation(s)
- Senthilnathan Rajendran
- Department of Oral and Maxillofacial Surgery, Meenakshiammal Dental College and Hospital, Maduravoyal, India.
| | | | | |
Collapse
|