1
|
Capo-chichi LJA, Elakhdar A, Kubo T, Nyachiro J, Juskiw P, Capettini F, Slaski JJ, Ramirez GH, Beattie AD. Genetic diversity and population structure assessment of Western Canadian barley cooperative trials. FRONTIERS IN PLANT SCIENCE 2023; 13:1006719. [PMID: 36699829 PMCID: PMC9868428 DOI: 10.3389/fpls.2022.1006719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 11/25/2022] [Indexed: 06/17/2023]
Abstract
Studying the population structure and genetic diversity of historical datasets is a proposed use for association analysis. This is particularly important when the dataset contains traits that are time-consuming or costly to measure. A set of 96 elite barley genotypes, developed from eight breeding programs of the Western Canadian Cooperative Trials were used in the current study. Genetic diversity, allelic variation, and linkage disequilibrium (LD) were investigated using 5063 high-quality SNP markers via the Illumina 9K Barley Infinium iSelect SNP assay. The distribution of SNPs markers across the barley genome ranged from 449 markers on chromosome 1H to 1111 markers on chromosome 5H. The average polymorphism information content (PIC) per locus was 0.275 and ranged from 0.094 to 0.375. Bayesian clustering in STRUCTURE and principal coordinate analysis revealed that the populations are differentiated primarily due to the different breeding program origins and ear-row type into five subpopulations. Analysis of molecular variance based on PhiPT values suggested that high values of genetic diversity were observed within populations and accounted for 90% of the total variance. Subpopulation 5 exhibited the most diversity with the highest values of the diversity indices, which represent the breeding program gene pool of AFC, AAFRD, AU, and BARI. With increasing genetic distance, the LD values, expressed as r2, declined to below the critical r2 = 0.18 after 3.91 cM, and the same pattern was observed on each chromosome. Our results identified an important pattern of genetic diversity among the Canadian barley panel that was proposed to be representative of target breeding programs and may have important implications for association mapping in the future. This highlight, that efforts to identify novel variability underlying this diversity may present practical breeding opportunities to develop new barley genotypes.
Collapse
Affiliation(s)
- Ludovic J. A. Capo-chichi
- Department of Renewable Resources, Faculty of Agriculture, Life and Environmental Sciences, University of Alberta, Edmonton, AB, Canada
| | - Ammar Elakhdar
- Institute of Genetic Resources, Faculty of Agriculture, Kyushu University, Fukuoka, Japan
- Field Crops Research Institute, Agricultural Research Center, Giza, Egypt
| | - Takahiko Kubo
- Institute of Genetic Resources, Faculty of Agriculture, Kyushu University, Fukuoka, Japan
| | - Joseph Nyachiro
- Field Crop Development Centre, Alberta Agriculture and Forestry, Lacombe, AB, Canada
| | - Patricia Juskiw
- Field Crop Development Centre, Alberta Agriculture and Forestry, Lacombe, AB, Canada
| | - Flavio Capettini
- Field Crop Development Centre, Alberta Agriculture and Forestry, Lacombe, AB, Canada
| | - Jan J. Slaski
- Ecosystems and Plant Sciences, InnoTech Alberta Inc., Vegreville, AB, Canada
| | - Guillermo Hernandez Ramirez
- Department of Renewable Resources, Faculty of Agriculture, Life and Environmental Sciences, University of Alberta, Edmonton, AB, Canada
| | - Aaron D. Beattie
- Department of Plant Sciences, College of Agriculture and Bioresources, University of Saskatchewan, Saskatoon, SK, Canada
| |
Collapse
|
2
|
Behl T, Kaur G, Fratila O, Buhas C, Judea-Pusta CT, Negrut N, Bustea C, Bungau S. Cross-talks among GBA mutations, glucocerebrosidase, and α-synuclein in GBA-associated Parkinson's disease and their targeted therapeutic approaches: a comprehensive review. Transl Neurodegener 2021; 10:4. [PMID: 33446243 PMCID: PMC7809876 DOI: 10.1186/s40035-020-00226-x] [Citation(s) in RCA: 44] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Accepted: 12/01/2020] [Indexed: 02/08/2023] Open
Abstract
Current therapies for Parkinson's disease (PD) are palliative, of which the levodopa/carbidopa therapy remains the primary choice but is unable to modulate the progression of neurodegeneration. Due to the complication of such a multifactorial disorder and significant limitations of the therapy, numerous genetic approaches have been proved effective in finding out genes and mechanisms implicated in this disease. Following the observation of a higher frequency of PD in Gaucher's disease (GD), a lysosomal storage condition, mutations of glycosylceramidase beta (GBA) encoding glucocerebrosidase (GCase) have been shown to be involved and have been explored in the context of PD. GBA mutations are the most common genetic risk factor of PD. Various studies have revealed the relationships between PD and GBA gene mutations, facilitating a better understanding of this disorder. Various hypotheses delineate that the pathological mutations of GBA minimize the enzymatic activity of GCase, which affects the proliferation and clearance of α-synuclein; this affects the lysosomal homeostasis, exacerbating the endoplasmic reticulum stress or encouraging the mitochondrial dysfunction. Identification of the pathological mechanisms underlying the GBA-associated parkinsonism (GBA + PD) advances our understanding of PD. This review based on current literature aims to elucidate various genetic and clinical characteristics correlated with GBA mutations and to identify the numerous pathological processes underlying GBA + PD. We also delineate the therapeutic strategies to interfere with the mutant GCase function for further improvement of the related α-synuclein-GCase crosstalks. Moreover, the various therapeutic approaches such as gene therapy, chaperone proteins, and histone deacetylase inhibitors for the treatment of GBA + PD are discussed.
Collapse
Affiliation(s)
- Tapan Behl
- Chitkara College of Pharmacy, Chitkara University, Rajpura, Punjab, India.
| | - Gagandeep Kaur
- Chitkara College of Pharmacy, Chitkara University, Rajpura, Punjab, India
| | - Ovidiu Fratila
- Department of Medical Disciplines, Faculty of Medicine and Pharmacy, University of Oradea, Oradea, Romania
| | - Camelia Buhas
- Department of Morphological Disciplines, Faculty of Medicine and Pharmacy, University of Oradea, Oradea, Bihor County, Romania
| | - Claudia Teodora Judea-Pusta
- Department of Morphological Disciplines, Faculty of Medicine and Pharmacy, University of Oradea, Oradea, Bihor County, Romania
| | - Nicoleta Negrut
- Department of Psycho-Neuroscience and Recovery, Faculty of Medicine and Pharmacy, University of Oradea, Oradea, Romania
| | - Cristiana Bustea
- Department of Preclinical Disciplines, Faculty of Medicine and Pharmacy, University of Oradea, Oradea, Romania
| | - Simona Bungau
- Department of Pharmacy, Faculty of Medicine and Pharmacy, University of Oradea, Oradea, Romania
| |
Collapse
|
3
|
Wei X, Huang M, Yue Q, Ma S, Li B, Mu Z, Peng C, Gao W, Liu W, Zheng J, Weng X, Sun X, Zuo Q, Bo S, Yuan X, Zhang W, Yang G, Ding Y, Wang X, Wang T, Hua P, Wang Z. Long-term urbanization impacts the eastern golden frog ( Pelophylax plancyi) in Shanghai City: Demographic history, genetic structure, and implications for amphibian conservation in intensively urbanizing environments. Evol Appl 2021; 14:117-135. [PMID: 33519960 PMCID: PMC7819575 DOI: 10.1111/eva.13156] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Revised: 10/11/2020] [Accepted: 10/20/2020] [Indexed: 11/29/2022] Open
Abstract
Understanding the mechanisms of how urbanization influences the evolution of native species is vital for urban wildlife ecology and conservation in the Anthropocene. With thousands of years of agriculture-dominated historical urbanization followed by 40 years of intensive and rapid urbanization, Shanghai provides an ideal environment to study how the two-stage urbanization process influences the evolution of indigenous wildlife, especially of anuran species. Therefore, in this study, we used mitochondrial Cyt-b gene, microsatellite (SSR), and single nucleotide polymorphism (SNP) data to evaluate the demographic history and genetic structure of the eastern golden frog (Pelophylax plancyi), by sampling 407 individuals from 15 local populations across Shanghai, China. All local populations experienced bottlenecks during historical urbanization, while the local populations in urban areas maintained comparable contemporary effective population sizes (N e) and genetic diversity with suburban and rural populations. Nevertheless, the rapid modern urbanization has already imposed significant negative effects to the integrity of populations. The 15 local populations were differentiated into eight genetic clusters, showing a spatial distribution pattern consistent with the current urbanization gradient and island-mainland geography. Although moderate gene flow still occurred from the rural peripheral cluster to urban and suburban clusters, population fragmentation was more serious in the urban and suburban populations, where higher urbanization levels within 2-km radius areas showed significant negative relationships to the N e and genetic diversity of local populations. Therefore, to protect urban wildlife with limited dispersal ability, improving conditions in fragmented habitat remnants might be most essential for local populations living in more urbanized areas. Meanwhile, we highlight the need to preserve large unfragmented rural habitats and to construct corridor networks to connect discrete urban habitat remnants for the long-term wildlife conservation in intensively urbanizing environments.
Collapse
Affiliation(s)
- Xu Wei
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Meiling Huang
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Qu Yue
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Shuo Ma
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Ben Li
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Zhiqiang Mu
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Chuan Peng
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Wenxuan Gao
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Wenli Liu
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Jiaxin Zheng
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Xiaodong Weng
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Xiaohui Sun
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Qingqiu Zuo
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Shunqi Bo
- Shanghai Landscaping & City Appearance Administrative BureauShanghai Forestry BureauShanghaiChina
| | - Xiao Yuan
- Shanghai Landscaping & City Appearance Administrative BureauShanghai Forestry BureauShanghaiChina
| | - Wei Zhang
- Natural History Research Centre of Shanghai Natural History MuseumShanghai Science and Technology MuseumShanghaiChina
| | - Gang Yang
- Natural History Research Centre of Shanghai Natural History MuseumShanghai Science and Technology MuseumShanghaiChina
| | - Youzhong Ding
- School of Life SciencesEast China Normal UniversityShanghaiChina
| | - Xiaoming Wang
- School of Life SciencesEast China Normal UniversityShanghaiChina
- Shanghai Science and Technology MuseumShanghaiChina
| | - Tianhou Wang
- School of Life SciencesEast China Normal UniversityShanghaiChina
- Institute of Eco‐ChongmingShanghaiChina
| | - Panyu Hua
- School of Ecological and Environmental SciencesEast China Normal UniversityShanghaiChina
| | - Zhenghuan Wang
- School of Life SciencesEast China Normal UniversityShanghaiChina
- Joint Translational Science and Technology Research InstituteEast China Normal UniversityShanghaiChina
- Yangtze Delta Estuarine Wetland Ecosystem Observation and Research StationMinistry of Education & Shanghai Science and Technology CommitteeShanghaiChina
| |
Collapse
|
4
|
Kader F, Ghai M, Olaniran AO. Characterization of DNA methylation-based markers for human body fluid identification in forensics: a critical review. Int J Legal Med 2019; 134:1-20. [PMID: 31713682 DOI: 10.1007/s00414-019-02181-3] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Accepted: 10/15/2019] [Indexed: 02/07/2023]
Abstract
Body fluid identification in crime scene investigations aids in reconstruction of crime scenes. Several studies have identified and reported differentially methylated sites (DMSs) and regions (DMRs) which differ between forensically relevant tissues (tDMRs) and body fluids. Diverse factors affect methylation patterns such as the environment, diets, lifestyle, disease, ethnicity, genetic variation, amongst others. Thus, it is important to analyse the stability of markers employed for forensic identification. Furthermore, even though epigenetic modifications are described as stable and heritable, epigenetic inheritance of potential markers for body fluid identification needs to be assessed in the long term. Here, we discuss the current status of reported DNA methylation-based markers and their verification studies. Such thorough investigation is crucial to develop a stable panel of DNA methylation-based markers for accurate body fluid identification.
Collapse
Affiliation(s)
- Farzeen Kader
- Discipline of Genetics, School of Life Sciences, College of Agriculture, Engineering and Science, University of KwaZulu-Natal (Westville Campus), Private Bag X54001, Durban, Republic of South Africa
| | - Meenu Ghai
- Discipline of Genetics, School of Life Sciences, College of Agriculture, Engineering and Science, University of KwaZulu-Natal (Westville Campus), Private Bag X54001, Durban, Republic of South Africa.
| | - Ademola O Olaniran
- Discipline of Microbiology, School of Life Sciences, College of Agriculture, Engineering and Science, University of KwaZulu-Natal (Westville Campus), Private Bag X54001, Durban, Republic of South Africa
| |
Collapse
|
5
|
Day GQ, Ng J, Oldt RF, Houghton PW, Smith DG, Kanthaswamy S. DNA-based Determination of Ancestry in Cynomolgus Macaques ( Macaca fascicularis). JOURNAL OF THE AMERICAN ASSOCIATION FOR LABORATORY ANIMAL SCIENCE : JAALAS 2018; 57:432-442. [PMID: 30165920 PMCID: PMC6159685 DOI: 10.30802/aalas-jaalas-17-000147] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/09/2017] [Revised: 12/29/2017] [Accepted: 02/26/2018] [Indexed: 11/05/2022]
Abstract
Interest in the genetic composition of cynomolgus macaques (Macaca fascicularis) has increased due to the rising demand for NHP models in human biomedical research. Significant genetic differences among regional populations of cynomolgus macaques can confound interpretations of research results because they do not solely reflect differences in experimental treatment effects. Therefore, the common origin of cynomolgus macaques used as research subjects should be verified by using region-specific genetic markers to minimize the influence of underlying genetic variation among animals selected as research subjects on phenotypes under study. We compared the effectiveness of 18 short tandem repeat (STR) markers with that of 83 single-nucleotide polymorphism (SNP) markers to differentiate the ancestry of cynomolgus macaques from 6 different populations (Cambodia, Sumatra, Mauritius, Singapore, and the islands of Luzon and Zamboanga in the Philippines). Genetic diversity indices such as allele numbers and expected heterozygosity based on SNP were lower and exhibited lower standard errors than those provided by STR, probably because, unlike STR, most SNP are biallelic and consequently exhibit maximal expected heterozygosity values of 0.50. However, the standard error of estimates of observed heterozygosity based on SNP was higher than that for STR, perhaps reflecting sampling errors. Only 27 SNP were required to match the resolving power of 17 STR to detect population structure, that is, 1.6 SNP:1 STR. Whereas STR only differentiated the Mauritian population from all other populations, SNP detected 4 genetically distinct groups (Cambodia, Singapore-Sumatra, Mauritius, and Zamboanga). SNP are poised to become as valuable as STR for understanding and detecting genetic structure among cynomolgus macaques. Although STR will remain an important tool for cynomolgus macaque population studies, SNP have the potential to become the mainstream marker type.
Collapse
Affiliation(s)
- George Q Day
- Molecular Anthropology Laboratory, University of California, Davis, California
| | - Jillian Ng
- Molecular Anthropology Laboratory, University of California, Davis, California
| | - Robert F Oldt
- Molecular Anthropology Laboratory, University of California, Davis, California, School of Mathematics and Natural Sciences, Arizona State University at the West Campus, Glendale, Arizona
| | | | - David Glenn Smith
- Molecular Anthropology Laboratory, California National Primate Research Center, University of California, Davis, California
| | - Sree Kanthaswamy
- California National Primate Research Center, University of California, Davis, California, School of Mathematics and Natural Sciences, Arizona State University at the West Campus, Glendale, Arizona;,
| |
Collapse
|
6
|
Abstract
PURPOSE OF REVIEW GBA mutations are the most common known genetic cause of Parkinson's disease (PD). Its biological pathway may be important in idiopathic PD, since activity of the enzyme encoded by GBA, glucocerebrosidase, is reduced even among PD patients without GBA mutations. This article describes the structure and function of GBA, reviews recent literature on the clinical phenotype of GBA PD, and suggests future directions for research, counseling, and treatment. RECENT FINDINGS Several longitudinal studies have shown that GBA PD has faster motor and cognitive progression than idiopathic PD and that this effect is dose dependent. New evidence suggests that GBA mutations may be important in multiple system atrophy. Further, new interventional studies focusing on GBA PD are described. These studies may increase the interest of PD patients and caregivers in genetic counseling. GBA mutation status may help clinicians estimate PD progression, though mechanisms underlying GBA and synucleinopathy require further understanding.
Collapse
|
7
|
The GBAP1 pseudogene acts as a ceRNA for the glucocerebrosidase gene GBA by sponging miR-22-3p. Sci Rep 2017; 7:12702. [PMID: 28983119 PMCID: PMC5629250 DOI: 10.1038/s41598-017-12973-5] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2017] [Accepted: 09/13/2017] [Indexed: 12/11/2022] Open
Abstract
Mutations in the GBA gene, encoding lysosomal glucocerebrosidase, represent the major predisposing factor for Parkinson's disease (PD), and modulation of the glucocerebrosidase activity is an emerging PD therapy. However, little is known about mechanisms regulating GBA expression. We explored the existence of a regulatory network involving GBA, its expressed pseudogene GBAP1, and microRNAs. The high level of sequence identity between GBA and GBAP1 makes the pseudogene a promising competing-endogenous RNA (ceRNA), functioning as a microRNA sponge. After selecting microRNAs potentially targeting both transcripts, we demonstrated that miR-22-3p binds to and down-regulates GBA and GBAP1, and decreases their endogenous mRNA levels up to 70%. Moreover, over-expression of GBAP1 3'-untranslated region was able to sequester miR-22-3p, thus increasing GBA mRNA and glucocerebrosidase levels. The characterization of GBAP1 splicing identified multiple out-of-frame isoforms down-regulated by the nonsense-mediated mRNA decay, suggesting that GBAP1 levels and, accordingly, its ceRNA effect, are significantly modulated by this degradation process. Using skin-derived induced pluripotent stem cells of PD patients with GBA mutations and controls, we observed a significant GBA up-regulation during dopaminergic differentiation, paralleled by down-regulation of miR-22-3p. Our results describe the first microRNA controlling GBA and suggest that the GBAP1 non-coding RNA functions as a GBA ceRNA.
Collapse
|
8
|
Christensen CL, Choy FYM. A Prospective Treatment Option for Lysosomal Storage Diseases: CRISPR/Cas9 Gene Editing Technology for Mutation Correction in Induced Pluripotent Stem Cells. Diseases 2017; 5:E6. [PMID: 28933359 PMCID: PMC5456334 DOI: 10.3390/diseases5010006] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2016] [Revised: 02/15/2017] [Accepted: 02/20/2017] [Indexed: 02/06/2023] Open
Abstract
Ease of design, relatively low cost and a multitude of gene-altering capabilities have all led to the adoption of the sophisticated and yet simple gene editing system: clustered regularly interspaced short palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9). The CRISPR/Cas9 system holds promise for the correction of deleterious mutations by taking advantage of the homology directed repair pathway and by supplying a correction template to the affected patient's cells. Currently, this technique is being applied in vitro in human-induced pluripotent stem cells (iPSCs) to correct a variety of severe genetic diseases, but has not as of yet been used in iPSCs derived from patients affected with a lysosomal storage disease (LSD). If adopted into clinical practice, corrected iPSCs derived from cells that originate from the patient themselves could be used for therapeutic amelioration of LSD symptoms without the risks associated with allogeneic stem cell transplantation. CRISPR/Cas9 editing in a patient's cells would overcome the costly, lifelong process associated with currently available treatment methods, including enzyme replacement and substrate reduction therapies. In this review, the overall utility of the CRISPR/Cas9 gene editing technique for treatment of genetic diseases, the potential for the treatment of LSDs and methods currently employed to increase the efficiency of this re-engineered biological system will be discussed.
Collapse
Affiliation(s)
- Chloe L Christensen
- Department of Biology, Centre for Biomedical Research, University of Victoria, 3800 Finnerty Rd., Victoria, BC V8P 5C2, Canada.
| | - Francis Y M Choy
- Department of Biology, Centre for Biomedical Research, University of Victoria, 3800 Finnerty Rd., Victoria, BC V8P 5C2, Canada.
| |
Collapse
|
9
|
Chin ELH, da Silva C, Hegde M. Assessment of clinical analytical sensitivity and specificity of next-generation sequencing for detection of simple and complex mutations. BMC Genet 2013; 14:6. [PMID: 23418865 PMCID: PMC3599218 DOI: 10.1186/1471-2156-14-6] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2012] [Accepted: 02/08/2013] [Indexed: 09/03/2023] Open
Abstract
Background Detecting mutations in disease genes by full gene sequence analysis is common in clinical diagnostic laboratories. Sanger dideoxy terminator sequencing allows for rapid development and implementation of sequencing assays in the clinical laboratory, but it has limited throughput, and due to cost constraints, only allows analysis of one or at most a few genes in a patient. Next-generation sequencing (NGS), on the other hand, has evolved rapidly, although to date it has mainly been used for large-scale genome sequencing projects and is beginning to be used in the clinical diagnostic testing. One advantage of NGS is that many genes can be analyzed easily at the same time, allowing for mutation detection when there are many possible causative genes for a specific phenotype. In addition, regions of a gene typically not tested for mutations, like deep intronic and promoter mutations, can also be detected. Results Here we use 20 previously characterized Sanger-sequenced positive controls in disease-causing genes to demonstrate the utility of NGS in a clinical setting using standard PCR based amplification to assess the analytical sensitivity and specificity of the technology for detecting all previously characterized changes (mutations and benign SNPs). The positive controls chosen for validation range from simple substitution mutations to complex deletion and insertion mutations occurring in autosomal dominant and recessive disorders. The NGS data was 100% concordant with the Sanger sequencing data identifying all 119 previously identified changes in the 20 samples. Conclusions We have demonstrated that NGS technology is ready to be deployed in clinical laboratories. However, NGS and associated technologies are evolving, and clinical laboratories will need to invest significantly in staff and infrastructure to build the necessary foundation for success.
Collapse
Affiliation(s)
- Ephrem L H Chin
- Department of Human Genetics, Emory University, Michael Street, Atlanta, GA, USA
| | | | | |
Collapse
|
10
|
Moleirinho A, Seixas S, Lopes AM, Bento C, Prata MJ, Amorim A. Evolutionary constraints in the β-globin cluster: the signature of purifying selection at the δ-globin (HBD) locus and its role in developmental gene regulation. Genome Biol Evol 2013; 5:559-71. [PMID: 23431002 PMCID: PMC3622298 DOI: 10.1093/gbe/evt029] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/16/2013] [Indexed: 12/15/2022] Open
Abstract
Human hemoglobins, the oxygen carriers in the blood, are composed by two α-like and two β-like globin monomers. The β-globin gene cluster located at 11p15.5 comprises one pseudogene and five genes whose expression undergoes two critical switches: the embryonic-to-fetal and fetal-to-adult transition. HBD encodes the δ-globin chain of the minor adult hemoglobin (HbA2), which is assumed to be physiologically irrelevant. Paradoxically, reduced diversity levels have been reported for this gene. In this study, we sought a detailed portrait of the genetic variation within the β-globin cluster in a large human population panel from different geographic backgrounds. We resequenced the coding and noncoding regions of the two adult β-globin genes (HBD and HBB) in European and African populations, and analyzed the data from the β-globin cluster (HBE, HBG2, HBG1, HBBP1, HBD, and HBB) in 1,092 individuals representing 14 populations sequenced as part of the 1000 Genomes Project. Additionally, we assessed the diversity levels in nonhuman primates using chimpanzee sequence data provided by the PanMap Project. Comprehensive analyses, based on classic neutrality tests, empirical and haplotype-based studies, revealed that HBD and its neighbor pseudogene HBBP1 have mainly evolved under purifying selection, suggesting that their roles are essential and nonredundant. Moreover, in the light of recent studies on the chromatin conformation of the β-globin cluster, we present evidence sustaining that the strong functional constraints underlying the decreased contemporary diversity at these two regions were not driven by protein function but instead are likely due to a regulatory role in ontogenic switches of gene expression.
Collapse
Affiliation(s)
- Ana Moleirinho
- Institute of Molecular Pathology and Immunology of University of Porto (IPATIMUP), Porto, Portugal.
| | | | | | | | | | | |
Collapse
|
11
|
Fairbanks DJ, Fairbanks AD, Ogden TH, Parker GJ, Maughan PJ. NANOGP8: evolution of a human-specific retro-oncogene. G3 (BETHESDA, MD.) 2012; 2:1447-57. [PMID: 23173096 PMCID: PMC3484675 DOI: 10.1534/g3.112.004366] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/25/2012] [Accepted: 09/18/2012] [Indexed: 12/22/2022]
Abstract
NANOGP8 is a human (Homo sapiens) retrogene, expressed predominantly in cancer cells where its protein product is tumorigenic. It arose through retrotransposition from its parent gene, NANOG, which is expressed predominantly in embryonic stem cells. Based on identification of fixed and polymorphic variants in a genetically diverse set of human NANOG and NANOGP8 sequences, we estimated the evolutionary origin of NANOGP8 at approximately 0.9 to 2.5 million years ago, more recent than previously estimated. We also discovered that NANOGP8 arose from a derived variant allele of NANOG containing a 22-nucleotide pair deletion in the 3' UTR, which has remained polymorphic in modern humans. Evidence from our experiments indicates that NANOGP8 is fixed in modern humans even though its parent allele is polymorphic. The presence of NANOGP8-specific sequences in Neanderthal reads provided definitive evidence that NANOGP8 is also present in the Neanderthal genome. Some variants between the reference sequences of NANOG and NANOGP8 utilized in cancer research to distinguish RT-PCR products are polymorphic within NANOG or NANOGP8 and thus are not universally reliable as distinguishing features. NANOGP8 was inserted in reverse orientation into the LTR region of an SVA retroelement that arose in a human-chimpanzee-gorilla common ancestor after divergence of the orangutan ancestral lineage. Transcription factor binding sites within and beyond this LTR may promote expression of NANOGP8 in cancer cells, although current evidence is inferential. The fact that NANOGP8 is a human-specific retro-oncogene may partially explain the higher genetic predisposition for cancer in humans compared with other primates.
Collapse
Affiliation(s)
- Daniel J. Fairbanks
- Department of Biology, Utah Valley University, Orem, Utah 84058
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, Utah 84602
| | | | - T. Heath Ogden
- Department of Biology, Utah Valley University, Orem, Utah 84058
| | | | - Peter J. Maughan
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, Utah 84602
| |
Collapse
|
12
|
Yang X, Xu Y, Shah T, Li H, Han Z, Li J, Yan J. Comparison of SSRs and SNPs in assessment of genetic relatedness in maize. Genetica 2011; 139:1045-54. [PMID: 21904888 DOI: 10.1007/s10709-011-9606-9] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2009] [Accepted: 08/25/2011] [Indexed: 10/17/2022]
Abstract
Advances in high-throughput SNP genotyping and genome sequencing technologies have enabled genome-wide association mapping in dissecting the genetic basis of complex quantitative traits. In this study, 82 SSRs and 884 SNPs with minor allele frequencies (MAF) over 0.20 were used to compare their ability to assess population structure, principal component analysis (PCA) and relative kinship in a maize association panel consisting of 154 inbred lines. Compared to SNPs, SSRs provided more information on genetic diversity. The expected heterozygosity (He) of SSRs and SNPs averaged 0.65 and 0.44, and the polymorphic information content of these two markers was 0.61 and 0.34 in this panel, respectively. Additionally, SSRs performed better at clustering all lines into groups using STRUCTURE and PCA approaches, and estimating relative kinship. For both marker systems, the same clusters were observed based on PCA and the first two eigenvectors accounted for similar percentage of genetic variations in this panel. The correlation coefficients of each eigenvector from SSRs and SNPs decreased sharply when the eigenvector varied from 1 to 3, but kept around 0 when the eigenvector were over 3. The kinship estimates based on SSRs and SNPs were moderately correlated (r (2) = 0.69). All these results suggest that SSR markers with moderate density are more informative than SNPs for assessing genetic relatedness in maize association mapping panels.
Collapse
Affiliation(s)
- Xiaohong Yang
- National Maize Improvement Center of China, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, No 2 Yuanmingyuan West Road, Haidian, Beijing 100193, China
| | | | | | | | | | | | | |
Collapse
|
13
|
Yang X, Xu Y, Shah T, Li H, Han Z, Li J, Yan J. Comparison of SSRs and SNPs in assessment of genetic relatedness in maize. Genetica 2011; 139:1045-1054. [PMID: 21904888 DOI: 10.1007/s10709-011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2009] [Accepted: 08/25/2011] [Indexed: 05/29/2023]
Abstract
Advances in high-throughput SNP genotyping and genome sequencing technologies have enabled genome-wide association mapping in dissecting the genetic basis of complex quantitative traits. In this study, 82 SSRs and 884 SNPs with minor allele frequencies (MAF) over 0.20 were used to compare their ability to assess population structure, principal component analysis (PCA) and relative kinship in a maize association panel consisting of 154 inbred lines. Compared to SNPs, SSRs provided more information on genetic diversity. The expected heterozygosity (He) of SSRs and SNPs averaged 0.65 and 0.44, and the polymorphic information content of these two markers was 0.61 and 0.34 in this panel, respectively. Additionally, SSRs performed better at clustering all lines into groups using STRUCTURE and PCA approaches, and estimating relative kinship. For both marker systems, the same clusters were observed based on PCA and the first two eigenvectors accounted for similar percentage of genetic variations in this panel. The correlation coefficients of each eigenvector from SSRs and SNPs decreased sharply when the eigenvector varied from 1 to 3, but kept around 0 when the eigenvector were over 3. The kinship estimates based on SSRs and SNPs were moderately correlated (r (2) = 0.69). All these results suggest that SSR markers with moderate density are more informative than SNPs for assessing genetic relatedness in maize association mapping panels.
Collapse
Affiliation(s)
- Xiaohong Yang
- National Maize Improvement Center of China, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, No 2 Yuanmingyuan West Road, Haidian, Beijing 100193, China
| | | | | | | | | | | | | |
Collapse
|
14
|
Andrés AM, Clark AG, Shimmin L, Boerwinkle E, Sing CF, Hixson JE. Understanding the accuracy of statistical haplotype inference with sequence data of known phase. Genet Epidemiol 2008; 31:659-71. [PMID: 17922479 DOI: 10.1002/gepi.20185] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
Statistical methods for haplotype inference from multi-site genotypes of unrelated individuals have important application in association studies and population genetics. Understanding the factors that affect the accuracy of this inference is important, but their assessment has been restricted by the limited availability of biological data with known phase. We created hybrid cell lines monosomic for human chromosome 19 and produced single-chromosome complete sequences of a 48 kb genomic region in 39 individuals of African American (AA) and European American (EA) origin. We employ these phase-known genotypes and coalescent simulations to assess the accuracy of statistical haplotype reconstruction by several algorithms. Accuracy of phase inference was considerably low in our biological data even for regions as short as 25-50 kb, suggesting that caution is needed when analyzing reconstructed haplotypes. Moreover, the reliability of estimated confidence in phase inference is not high enough to allow for a reliable incorporation of site-specific uncertainty information in subsequent analyses. We show that, in samples of certain mixed ancestry (AA and EA populations), the most accurate haplotypes are probably obtained when increasing sample size by considering the largest, pooled sample, despite the hypothetical problems associated with pooling across those heterogeneous samples. Strategies to improve confidence in reconstructed haplotypes, and realistic alternatives to the analysis of inferred haplotypes, are discussed.
Collapse
Affiliation(s)
- Aida M Andrés
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA.
| | | | | | | | | | | |
Collapse
|
15
|
Transcription-related mutations and GC content drive variation in nucleotide substitution rates across the genomes of Arabidopsis thaliana and Arabidopsis lyrata. BMC Evol Biol 2007; 7:66. [PMID: 17451608 PMCID: PMC1865379 DOI: 10.1186/1471-2148-7-66] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2006] [Accepted: 04/23/2007] [Indexed: 11/22/2022] Open
Abstract
Background There has been remarkably little study of nucleotide substitution rate variation among plant nuclear genes, in part because orthology is difficult to establish. Orthology is even more problematic for intergenic regions of plant nuclear genomes, because plant genomes generally harbor a wealth of repetitive DNA. In theory orthologous intergenic data is valuable for studying rate variation because nucleotide substitutions in these regions should be under little selective constraint compared to coding regions. As a result, evolutionary rates in intergenic regions may more accurately reflect genomic features, like recombination and GC content, that contribute to nucleotide substitution. Results We generated a set of 66 intergenic sequences in Arabidopsis lyrata, a close relative of Arabidopsis thaliana. The intergenic regions included transposable element (TE) remnants and regions flanking the TEs. We verified orthology of these amplified regions both by comparison of existing A. lyrata – A. thaliana genetic maps and by using molecular features. We compared substitution rates among the 66 intergenic loci, which exhibit ~5-fold rate variation, and compared intergenic rates to a set of 64 orthologous coding sequences. Our chief observations were that the average rate of nucleotide substitution is slower in intergenic regions than in synonymous sites, that rate variation in both intergenic and coding regions correlate with GC content, that GC content alone is not sufficient to explain differences in rates between intergenic and coding regions, and that rates of evolution in intergenic regions correlate negatively with gene density. Conclusion Our observations indicated that mutation rates vary among genomics regions as a function of base composition, suggesting that previous observations of "selective constraint" on non-coding regions could more accurately be attributed to a GC effect instead of selection. The negative correlation between nucleotide substitution rate and gene density provides a potential neutral explanation for a previously documented correlation between gene density and polymorphism levels within A. thaliana. Finally, we discuss potential forces that could contribute to rapid synonymous rates, and provide evidence to suggest that transcription-related mutation contributes to rate differences between intergenic and synonymous sites.
Collapse
|
16
|
Fairbanks DJ, Maughan PJ. Evolution of the NANOG pseudogene family in the human and chimpanzee genomes. BMC Evol Biol 2006; 6:12. [PMID: 16469101 PMCID: PMC1457002 DOI: 10.1186/1471-2148-6-12] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2005] [Accepted: 02/09/2006] [Indexed: 11/25/2022] Open
Abstract
Background The NANOG gene is expressed in mammalian embryonic stem cells where it maintains cellular pluripotency. An unusually large family of pseudogenes arose from it with one unprocessed and ten processed pseudogenes in the human genome. This article compares the NANOG gene and its pseudogenes in the human and chimpanzee genomes and derives an evolutionary history of this pseudogene family. Results The NANOG gene and all pseudogenes except NANOGP8 are present at their expected orthologous chromosomal positions in the chimpanzee genome when compared to the human genome, indicating that their origins predate the human-chimpanzee divergence. Analysis of flanking DNA sequences demonstrates that NANOGP8 is absent from the chimpanzee genome. Conclusion Based on the most parsimonious ordering of inferred source-gene mutations, the deduced evolutionary origins for the NANOG pseudogene family in the human and chimpanzee genomes, in order of most ancient to most recent, are NANOGP6, NANOGP5, NANOGP3, NANOGP10, NANOGP2, NANOGP9, NANOGP7, NANOGP1, and NANOGP4. All of these pseudogenes were fixed in the genome of the human-chimpanzee common ancestor. NANOGP8 is the most recent pseudogene and it originated exclusively in the human lineage after the human-chimpanzee divergence. NANOGP1 is apparently an unprocessed pseudogene. Comparison of its sequence to the functional NANOG gene's reading frame suggests that this apparent pseudogene remained functional after duplication and, therefore, was subject to selection-driven conservation of its reading frame, and that it may retain some functionality or that its loss of function may be evolutionarily recent.
Collapse
Affiliation(s)
- Daniel J Fairbanks
- Department of Plant and Animal Sciences, Brigham Young University, Provo, UT 84602, USA
| | - Peter J Maughan
- Department of Plant and Animal Sciences, Brigham Young University, Provo, UT 84602, USA
| |
Collapse
|
17
|
Soldevila M, Andrés AM, Ramírez-Soriano A, Marquès-Bonet T, Calafell F, Navarro A, Bertranpetit J. The prion protein gene in humans revisited: lessons from a worldwide resequencing study. Genome Res 2005; 16:231-9. [PMID: 16369046 PMCID: PMC1361719 DOI: 10.1101/gr.4345506] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Ample evidence has accumulated showing that different coding variants of the PRNP gene confer differential susceptibility for prion diseases. Here we evaluate the patterns of nucleotide variation in PRNP exon 2, which includes all the protein-coding sequence, by resequencing a worldwide sample of 174 humans for 2378 bp. In line with previous studies, we found two main haplotypes differentiated by nonsynonymous substitution in codon 129. Our analyses reveal the worldwide pattern of variation at the PRNP gene to be inconsistent with neutral expectations, indicating instead an excess of low-frequency variants, a footprint of the action of either positive or purifying selection. A comparison of neutrality test statistics for PRNP with other human genes indicates that the signal of positive selection on PRNP is stronger than expected from a possible confounding genome-wide background signal of population expansion. Two main conclusions arise from our analysis. First, the existence of an ancient, stable, balanced polymorphism that has been claimed in a previous study and related to cannibalism can be rejected and is shown to be due to ascertainment bias. Second, our results are consistent with a complex history of selection including mainly positive selection, even if short local periods of balancing selection (Kuru-like episodes), or even a weak purifying selection model, are consistent with our data.
Collapse
Affiliation(s)
- Marta Soldevila
- Unitat de Biologia Evolutiva, Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, 08003 Barcelona, Spain
| | | | | | | | | | | | | |
Collapse
|
18
|
Sabater-Lleal M, Soria JM, Bertranpetit J, Almasy L, Blangero J, Fontcuberta J, Calafell F. Human F7 sequence is split into three deep clades that are related to FVII plasma levels. Hum Genet 2005; 118:741-51. [PMID: 16292673 DOI: 10.1007/s00439-005-0045-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2005] [Accepted: 08/01/2005] [Indexed: 01/04/2023]
Abstract
It is widely accepted that FVII levels are strongly, consistently, and independently related to cardiovascular risk. These levels are influenced by genetic and environmental factors. Among the genetic factors, only a limited number of polymorphisms in the F7 gene have been reported, and they explain only a small proportion of the genetic variability. Recently, we have accomplished the complete dissection of the F7 quantitative trait locus responsible for all of the genetic variability observed in FVII levels. Now, we present the thorough study of the haplotype organization of F7 DNA sequence variation among individuals and the evolutionary processes that produced this variation, by sequencing 15 kb of genomic DNA sequence from the F7 locus in 40 unrelated individual (80 chromosomes) from the genetic analysis of idiopathic thrombophilia (GAIT) project as well as four non-human primate species. Our study revealed 49 polymorphisms, of which 39 SNPs were further considered. Genotyping of these DNA variations in the whole family-based GAIT sample helped resolve linkage phases, and a total of 37 distinct haplotypes were identified.Tajima's D was significantly positive in this sample, suggesting balancing selection. This parameter was a reflection of the phylogenetic structure of F7 haplotype, which was deeply split into three well-supported clades or haplogroups, suggesting that functional differences among F7 variants do not depend on a few single-site variations. Moreover, haplogroup 2 was associated with high FVII levels and haplogroup 3 with low levels. In this study, we have for the first time established a clear relation between genotypic variability structure and phenotypic variability of a particular quantitative trait involved in a complex disease.
Collapse
Affiliation(s)
- Maria Sabater-Lleal
- Unitat d'Hemostàsia i Trombosi, Hospital de la Santa Creu i Sant Pau, 08005 Barcelona, Spain
| | | | | | | | | | | | | |
Collapse
|
19
|
Aguilar A, Smith TB, Wayne RK. A comparison of variation between a MHC pseudogene and microsatellite loci of the little greenbul (Andropadus virens). BMC Evol Biol 2005; 5:47. [PMID: 16159389 PMCID: PMC1249561 DOI: 10.1186/1471-2148-5-47] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2005] [Accepted: 09/13/2005] [Indexed: 11/25/2022] Open
Abstract
Background We investigated genetic variation of a major histcompatibility complex (MHC) pseudogene (Anvi-DAB1) in the little greenbul (Andropadus virens) from four localities in Cameroon and one in Ivory Coast, West Africa. Previous microsatellite and mitochondrial DNA analyses had revealed little or no genetic differentiation among Cameroon localities but significant differentiation between localities in Cameroon and Ivory Coast. Results Levels of genetic variation, heterozygosity, and allelic diversity were high for the MHC pseudogene in Cameroon. Nucleotide diversity of the MHC pseudogene in Cameroon and Ivory Coast was comparable to levels observed in other avian species that have been studied for variation in nuclear genes. An excess of rare variants for the MHC pseudogene was found in the Cameroon population, but this excess was not statistically significant. Pairwise measures of population differentiation revealed high divergence between Cameroon and Ivory Coast for microsatellites and the MHC locus, although for the latter distance measures were much higher than the comparable microsatellite distances. Conclusion We provide the first ever comparison of variation in a putative MHC pseudogene to variation in neutral loci in a passerine bird. Our results are consistence with the action of neutral processes on the pseudogene and suggest they can provide an independent perspective on demographic history and population substructure.
Collapse
Affiliation(s)
- Andres Aguilar
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095 USA
- Center for Tropical Research, Institute of the Environment, 1609 Hershey Hall, University of California, Los Angeles, CA 90095 USA
- Southwest Fisheries Science Center & Department of Ocean Sciences, 110 Shaffer Road, University of California, Santa Cruz, California 95060 USA
| | - Thomas B Smith
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095 USA
- Center for Tropical Research, Institute of the Environment, 1609 Hershey Hall, University of California, Los Angeles, CA 90095 USA
| | - Robert K Wayne
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095 USA
- Center for Tropical Research, Institute of the Environment, 1609 Hershey Hall, University of California, Los Angeles, CA 90095 USA
| |
Collapse
|
20
|
Clarimón J, Andrés AM, Bertranpetit J, Comas D. Comparative analysis of Alu insertion sequences in the APP 5' flanking region in humans and other primates. J Mol Evol 2004; 58:722-31. [PMID: 15461429 DOI: 10.1007/s00239-004-2594-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Overexpression of the amyloid precursor protein gene (APP) may play a role in the neuropathology of Alzheimer's disease. Therefore, elucidating the mechanisms involved in APP gene regulation is of primary importance, and various cis-acting regulatory elements located in 5' distal regions are known to play a main role. Some of them lie within Alu elements, one of which (Alu1) is only found in humans and apes while the other (Alu2) has a much older history and is also found in rhesus. These Alu insertions harbor sequence motifs that may act as cis-regulatory elements, which may cause differences in APP regulation among primate species and whose functionality may be ascertained through their conservation in a comparative analysis. We have performed a comparative analysis of the region comprising the two Alu elements of the APP promoter in several primates, including humans. We have found a significant decrease in nucleotide diversity in the Alu2 element (inserted in all the species analyzed) compared to the Alu1 (inserted only in apes). This finding can be interpreted as a constriction in the Alu2 sequence variation as a consequence of a functional role of this element in the APP expression. The present results suggest a wider extension of the regulatory elements than the known short consensus regulatory sequences. Moreover, the different conservation of two highly similar and neighboring sequences suggests that, besides the importance of the sequence motifs, their position in relation to the gene suggests that they have played a role in being recruited as regulatory elements.
Collapse
Affiliation(s)
- Jordi Clarimón
- Unitat de Biologia Evolutiva, Facultat de Ciències de la Salut i de la Vida, Universitat Pompeu Fabra, C/ Doctor Aiguader 80, Barcelona, 08003, Catalonia, Spain
| | | | | | | |
Collapse
|
21
|
Hammer MF, Blackmer F, Garrigan D, Nachman MW, Wilder JA. Human population structure and its effects on sampling Y chromosome sequence variation. Genetics 2003; 164:1495-509. [PMID: 12930755 PMCID: PMC1462677 DOI: 10.1093/genetics/164.4.1495] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The excess of rare variants in global sequencing studies of the nonrecombining portion of the Y chromosome (NRY) has been interpreted as evidence for the effects of human demographic expansion. However, many NRY polymorphisms are geographically localized and the effect of different geographical sampling on patterns of NRY variation is unknown. We use two sampling designs to detect population structure and its effects on patterns of human NRY polymorphism. First, we sequence 26.5 kb of noncoding Y chromosome DNA from 92 globally distributed males representing 35 populations. We find that the number of polymorphisms with singleton variants is positively correlated with the number of populations sampled and that there is a significant negative correlation of Tajima's D (TD) and Fu and Li's D (FD) statistics with the number of pooled populations. We then sequence the same region in a total of 73 males sampled from 3 distinct populations and find that TD and FD values for the 3 pooled and individual population samples were much less negative than those in the aforementioned global sample. Coalescent simulations show that a simple splitting model of population structure, with no changes in population size, is sufficient to produce the negative values of TD seen in our pooled samples. These empirical and simulation results suggest that observed levels of NRY population structure may lead to an upward bias in the number of singleton variants in global surveys and call into question inferences of population expansion based on global sampling strategies.
Collapse
Affiliation(s)
- Michael F Hammer
- Genomic Analysis and Technology Core, Division of Biotechnology, Biosciences West Building, University of Arizona, Tucson, AZ 85721, USA.
| | | | | | | | | |
Collapse
|
22
|
Subramanian S, Kumar S. Neutral substitutions occur at a faster rate in exons than in noncoding DNA in primate genomes. Genome Res 2003; 13:838-44. [PMID: 12727904 PMCID: PMC430942 DOI: 10.1101/gr.1152803] [Citation(s) in RCA: 98] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Point mutation rates in exons (synonymous sites) and noncoding (introns and intergenic) regions are generally assumed to be the same. However, comparative sequence analyses of synonymous substitutions in exons (81 genes) and that of long intergenic fragments (141.3 kbp) of human and chimpanzee genomes reveal a 30%-60% higher mutation rate in exons than in noncoding DNA. We propose a differential CpG content hypothesis to explain this fundamental, and seemingly unintuitive, pattern. We find that the increased exonic rate is the result of the relative overabundance of synonymous sites involved in CpG dinucleotides, as the evolutionary divergence in non-CpG sites is similar in noncoding DNA and synonymous sites of exons. Expectations and predictions of our hypothesis are confirmed in comparisons involving more distantly related species, including human-orangutan, human-baboon, and human-macaque. Our results suggest an underlying mechanism for higher mutation rate in GC-rich genomic regions, predict nonlinear accumulation of mutations in pseudogenes over time, and provide a possible explanation for the observed higher diversity of single nucleotide polymorphisms (SNPs) in the synonymous sites of exons compared to the noncoding regions.
Collapse
Affiliation(s)
- Sankar Subramanian
- Center for Evolutionary Functional Genomics, Arizona Biodesign Institute, Department of Biology, Arizona State University, Tempe, Arizona 85287-1501, USA
| | | |
Collapse
|
23
|
Abstract
In Drosophila and humans, there are accumulating examples of loci with a significant excess of high-frequency-derived alleles or high levels of linkage disequilibrium, relative to a neutral model of a random-mating population of constant size. These are features expected after a recent selective sweep. Their prevalence suggests that positive directional selection may be widespread in both species. However, as I show here, these features do not persist long after the sweep ends: The high-frequency alleles drift to fixation and no longer contribute to polymorphism, while linkage disequilibrium is broken down by recombination. As a result, loci chosen without independent evidence of recent selection are not expected to exhibit either of these features, even if they have been affected by numerous sweeps in their genealogical history. How then can we explain the patterns in the data? One possibility is population structure, with unequal sampling from different subpopulations. Alternatively, positive selection may not operate as is commonly modeled. In particular, the rate of fixation of advantageous mutations may have increased in the recent past.
Collapse
Affiliation(s)
- Molly Przeworski
- Department of Statistics, University of Oxford, Oxford OX1 3TG, United Kingdom.
| |
Collapse
|