1
|
Du H, Diao C, Zhuo Y, Zheng X, Hu Z, Lu S, Jin W, Zhou L, Liu JF. Assembly of novel sequences for Chinese domestic pigs reveals new genes and regulatory variants providing new insights into their diversity. Genomics 2024; 116:110782. [PMID: 38176574 DOI: 10.1016/j.ygeno.2024.110782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 12/27/2023] [Accepted: 01/01/2024] [Indexed: 01/06/2024]
Abstract
There is an increasing understanding that a reference genome representing an individual cannot capture all the gene repertoire of a species. Here, we conduct a population-scale missing sequences detection of Chinese domestic pigs using whole-genome sequencing data from 534 individuals. We identify 132.41 Mb of sequences absent in the reference assembly, including eight novel genes. In particular, the breeds spread in Chinese high-altitude regions perform significantly different frequencies of new sequences in promoters than other breeds. Furthermore, we dissect the role of non-coding variants and identify a novel sequence inserted in the 3'UTR of the FMO3 gene, which may be associated with the intramuscular fat phenotype. This novel sequence could be a candidate marker for meat quality. Our study provides a comprehensive overview of the missing sequences in Chinese domestic pigs and indicates that this dataset is a valuable resource for understanding the diversity and biology of pigs.
Collapse
Affiliation(s)
- Heng Du
- State Key Laboratory of Animal Biotech Breeding; College of Animal Science and Technology, China Agricultural University, Beijing 100193, China.
| | - Chenguang Diao
- State Key Laboratory of Animal Biotech Breeding; College of Animal Science and Technology, China Agricultural University, Beijing 100193, China.
| | - Yue Zhuo
- State Key Laboratory of Animal Biotech Breeding; College of Animal Science and Technology, China Agricultural University, Beijing 100193, China.
| | - Xianrui Zheng
- State Key Laboratory of Animal Biotech Breeding; College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Zhengzheng Hu
- State Key Laboratory of Animal Biotech Breeding; College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Shiyu Lu
- State Key Laboratory of Animal Biotech Breeding; College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Wenjiao Jin
- State Key Laboratory of Animal Biotech Breeding; College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Lei Zhou
- State Key Laboratory of Animal Biotech Breeding; College of Animal Science and Technology, China Agricultural University, Beijing 100193, China.
| | - Jian-Feng Liu
- State Key Laboratory of Animal Biotech Breeding; College of Animal Science and Technology, China Agricultural University, Beijing 100193, China.
| |
Collapse
|
2
|
Zhao P, Gu L, Gao Y, Pan Z, Liu L, Li X, Zhou H, Yu D, Han X, Qian L, Liu GE, Fang L, Wang Z. Young SINEs in pig genomes impact gene regulation, genetic diversity, and complex traits. Commun Biol 2023; 6:894. [PMID: 37652983 PMCID: PMC10471783 DOI: 10.1038/s42003-023-05234-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 08/09/2023] [Indexed: 09/02/2023] Open
Abstract
Transposable elements (TEs) are a major source of genetic polymorphisms and play a role in chromatin architecture, gene regulatory networks, and genomic evolution. However, their functional role in pigs and contributions to complex traits are largely unknown. We created a catalog of TEs (n = 3,087,929) in pigs and found that young SINEs were predominantly silenced by histone modifications, DNA methylation, and decreased accessibility. However, some transcripts from active young SINEs showed high tissue-specificity, as confirmed by analyzing 3570 RNA-seq samples. We also detected 211,067 dimorphic SINEs in 374 individuals, including 340 population-specific ones associated with local adaptation. Mapping these dimorphic SINEs to genome-wide associations of 97 complex traits in pigs, we found 54 candidate genes (e.g., ANK2 and VRTN) that might be mediated by TEs. Our findings highlight the important roles of young SINEs and provide a supplement for genotype-to-phenotype associations and modern breeding in pigs.
Collapse
Affiliation(s)
- Pengju Zhao
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - Lihong Gu
- Institute of Animal Science & Veterinary Medicine, Hainan Academy of Agricultural Sciences, No. 14 Xingdan Road, Haikou, 571100, China
| | - Yahui Gao
- Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - Zhangyuan Pan
- Department of Animal Science, University of California, Davis, CA, 95616, USA
| | - Lei Liu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China
| | - Xingzheng Li
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China
| | - Huaijun Zhou
- Department of Animal Science, University of California, Davis, CA, 95616, USA
| | - Dongyou Yu
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - Xinyan Han
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - Lichun Qian
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - George E Liu
- Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA.
| | - Lingzhao Fang
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, 8000, Denmark.
| | - Zhengguang Wang
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China.
- College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China.
| |
Collapse
|
3
|
Blechter B, Wong JYY, Hu W, Cawthon R, Downward GS, Portengen L, Zhang Y, Ning B, Rahman ML, Ji BT, Li J, Yang K, Dean Hosgood H, Silverman DT, Huang Y, Rothman N, Vermeulen R, Lan Q. Exposure to smoky coal combustion emissions and leukocyte Alu retroelement copy number. Carcinogenesis 2023; 44:404-410. [PMID: 37119119 PMCID: PMC10414142 DOI: 10.1093/carcin/bgad027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 12/09/2022] [Accepted: 04/28/2023] [Indexed: 04/30/2023] Open
Abstract
Household air pollution (HAP) from indoor combustion of solid fuel is a global health burden that has been linked to multiple diseases including lung cancer. In Xuanwei, China, lung cancer rate for non-smoking women is among the highest in the world and largely attributed to high levels of polycyclic aromatic hydrocarbons (PAHs) that are produced from combustion of smoky (bituminous) coal. Alu retroelements, repetitive mobile DNA sequences that can somatically multiply and promote genomic instability have been associated with risk of lung cancer and diesel engine exhaust exposure. We conducted analyses for 160 non-smoking women in an exposure assessment study in Xuanwei, China with a repeat sample from 49 subjects. Quantitative PCR was used to measure Alu repeat copy number relative to albumin gene copy number (Alu/ALB ratio). Associations between clusters derived from predicted levels of 43 HAP constituents, 5-methylchrysene (5-MC), a PAH previously associated with lung cancer in Xuanwei and was selected a priori for analysis, and Alu repeats were analyzed using generalized estimating equations. A cluster of 31 PAHs reflecting current exposure was associated with increased Alu copy number (β:0.03 per standard deviation change; 95% confidence interval (CI):0.01,0.04; P-value = 2E-04). One compound within this cluster, 5-MC, was also associated with increased Alu copy number (P-value = 0.02). Our findings suggest that exposure to PAHs due to indoor smoky coal combustion may contribute to genomic instability. Additionally, our study provides further support for 5-MC as a prominent carcinogenic component of smoky coal emissions. Further studies are needed to replicate our findings.
Collapse
Affiliation(s)
- Batel Blechter
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Jason Y Y Wong
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Wei Hu
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Richard Cawthon
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - George S Downward
- Division of Environmental Epidemiology, Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Netherlands
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, Netherlands
| | - Lützen Portengen
- Division of Environmental Epidemiology, Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Netherlands
| | - Yongliang Zhang
- Division of Environmental Epidemiology, Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Netherlands
| | - Bofu Ning
- Xuanwei Center of Diseases Control, Xuanwei, Yunnan, China
| | - Mohammad L Rahman
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Bu-Tian Ji
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Jihua Li
- Quijing Center for Diseases Control and Prevention, Quijing, Yunnan, China
| | - Kaiyun Yang
- Department of Cardiothoracic Surgery, Third Affiliated Hospital of Kunming Medical University, Yunnan Cancer Hospital, Kunming, Yunnan, China
| | - H Dean Hosgood
- Division of Epidemiology, Albert Einstein College of Medicine, New York, NY, USA
| | - Debra T Silverman
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Yunchao Huang
- Department of Cardiothoracic Surgery, Third Affiliated Hospital of Kunming Medical University, Yunnan Cancer Hospital, Kunming, Yunnan, China
| | - Nathaniel Rothman
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Roel Vermeulen
- Division of Environmental Epidemiology, Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Netherlands
| | - Qing Lan
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| |
Collapse
|
4
|
Zhao P, Peng C, Fang L, Wang Z, Liu GE. Taming transposable elements in livestock and poultry: a review of their roles and applications. Genet Sel Evol 2023; 55:50. [PMID: 37479995 PMCID: PMC10362595 DOI: 10.1186/s12711-023-00821-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 06/30/2023] [Indexed: 07/23/2023] Open
Abstract
Livestock and poultry play a significant role in human nutrition by converting agricultural by-products into high-quality proteins. To meet the growing demand for safe animal protein, genetic improvement of livestock must be done sustainably while minimizing negative environmental impacts. Transposable elements (TE) are important components of livestock and poultry genomes, contributing to their genetic diversity, chromatin states, gene regulatory networks, and complex traits of economic value. However, compared to other species, research on TE in livestock and poultry is still in its early stages. In this review, we analyze 72 studies published in the past 20 years, summarize the TE composition in livestock and poultry genomes, and focus on their potential roles in functional genomics. We also discuss bioinformatic tools and strategies for integrating multi-omics data with TE, and explore future directions, feasibility, and challenges of TE research in livestock and poultry. In addition, we suggest strategies to apply TE in basic biological research and animal breeding. Our goal is to provide a new perspective on the importance of TE in livestock and poultry genomes.
Collapse
Affiliation(s)
- Pengju Zhao
- Hainan Institute of Zhejiang University, Hainan Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Zhejiang, Hangzhou, People's Republic of China
| | - Chen Peng
- Hainan Institute of Zhejiang University, Hainan Sanya, 572000, China
- College of Animal Sciences, Zhejiang University, Zhejiang, Hangzhou, People's Republic of China
| | - Lingzhao Fang
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus, Denmark.
| | - Zhengguang Wang
- Hainan Institute of Zhejiang University, Hainan Sanya, 572000, China.
- College of Animal Sciences, Zhejiang University, Zhejiang, Hangzhou, People's Republic of China.
| | - George E Liu
- Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA.
| |
Collapse
|
5
|
Shao Y, Zhou L, Li F, Zhao L, Zhang BL, Shao F, Chen JW, Chen CY, Bi X, Zhuang XL, Zhu HL, Hu J, Sun Z, Li X, Wang D, Rivas-González I, Wang S, Wang YM, Chen W, Li G, Lu HM, Liu Y, Kuderna LFK, Farh KKH, Fan PF, Yu L, Li M, Liu ZJ, Tiley GP, Yoder AD, Roos C, Hayakawa T, Marques-Bonet T, Rogers J, Stenson PD, Cooper DN, Schierup MH, Yao YG, Zhang YP, Wang W, Qi XG, Zhang G, Wu DD. Phylogenomic analyses provide insights into primate evolution. Science 2023; 380:913-924. [PMID: 37262173 DOI: 10.1126/science.abn6919] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Accepted: 01/26/2023] [Indexed: 06/03/2023]
Abstract
Comparative analysis of primate genomes within a phylogenetic context is essential for understanding the evolution of human genetic architecture and primate diversity. We present such a study of 50 primate species spanning 38 genera and 14 families, including 27 genomes first reported here, with many from previously less well represented groups, the New World monkeys and the Strepsirrhini. Our analyses reveal heterogeneous rates of genomic rearrangement and gene evolution across primate lineages. Thousands of genes under positive selection in different lineages play roles in the nervous, skeletal, and digestive systems and may have contributed to primate innovations and adaptations. Our study reveals that many key genomic innovations occurred in the Simiiformes ancestral node and may have had an impact on the adaptive radiation of the Simiiformes and human evolution.
Collapse
Affiliation(s)
- Yong Shao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| | - Long Zhou
- Center of Evolutionary & Organismal Biology, and Women's Hospital at Zhejiang University School of Medicine, Hangzhou 310058, China
| | - Fang Li
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, DK-2100 Copenhagen, Denmark
- Institute of Animal Sex and Development, ZhejiangWanli University, Ningbo 315100, China
| | - Lan Zhao
- Shaanxi Key Laboratory for Animal Conservation, College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Bao-Lin Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| | - Feng Shao
- Key Laboratory of Freshwater Fish Reproduction and Development (Ministry of Education), Southwest University School of Life Sciences, Chongqing 400715, China
| | | | - Chun-Yan Chen
- School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China
| | - Xupeng Bi
- Center of Evolutionary & Organismal Biology, and Women's Hospital at Zhejiang University School of Medicine, Hangzhou 310058, China
| | - Xiao-Lin Zhuang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- Kunming College of Life Science, University of the Chinese Academy of Sciences, Kunming 650204, China
| | | | - Jiang Hu
- Grandomics Biosciences, Beijing 102206, China
| | - Zongyi Sun
- Grandomics Biosciences, Beijing 102206, China
| | - Xin Li
- Grandomics Biosciences, Beijing 102206, China
| | - Depeng Wang
- Grandomics Biosciences, Beijing 102206, China
| | | | - Sheng Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| | - Yun-Mei Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| | - Wu Chen
- Guangzhou Zoo & Guangzhou Wildlife Research Center, Guangzhou 510070, China
| | - Gang Li
- College of Life Sciences, Shaanxi Normal University, Xi'an 710119, China
| | - Hui-Meng Lu
- School of Life Sciences, Northwestern Polytechnical University, Xi'an 710072, China
| | - Yang Liu
- College of Life Sciences, Shaanxi Normal University, Xi'an 710119, China
| | - Lukas F K Kuderna
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain
- Illumina Artificial Intelligence Laboratory, Illumina Inc, San Diego, CA 92122, USA
| | - Kyle Kai-How Farh
- Illumina Artificial Intelligence Laboratory, Illumina Inc, San Diego, CA 92122, USA
| | - Peng-Fei Fan
- School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Li Yu
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, School of Life Sciences, Yunnan University, Kunming 650091, China
| | - Ming Li
- CAS Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhi-Jin Liu
- College of Life Sciences, Capital Normal University, Beijing 100048, China
| | - George P Tiley
- Department of Biology, Duke University, Durham, NC 27708, USA
| | - Anne D Yoder
- Department of Biology, Duke University, Durham, NC 27708, USA
| | - Christian Roos
- Gene Bank of Primates and Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research, 37077 Göttingen, Germany
| | - Takashi Hayakawa
- Faculty of Environmental Earth Science, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
- Japan Monkey Centre, Inuyama, Aichi 484-0081, Japan
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010 Barcelona, Spain
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193 Cerdanyola del Vallès, Barcelona, Spain
| | - Jeffrey Rogers
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Peter D Stenson
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, UK
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, UK
| | | | - Yong-Gang Yao
- Kunming College of Life Science, University of the Chinese Academy of Sciences, Kunming 650204, China
- Key Laboratory of Animal Models and Human Disease Mechanisms of Chinese Academy of Sciences & Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650201, China
- National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China
| | - Ya-Ping Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650201, China
- National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China
| | - Wen Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650201, China
| | - Xiao-Guang Qi
- Shaanxi Key Laboratory for Animal Conservation, College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Guojie Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- Center of Evolutionary & Organismal Biology, and Women's Hospital at Zhejiang University School of Medicine, Hangzhou 310058, China
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, DK-2100 Copenhagen, Denmark
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou 311121, China
| | - Dong-Dong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650201, China
- National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China
- KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650204, China
| |
Collapse
|
6
|
Liu X, Zhang Y, Pu Y, Ma Y, Jiang L. Whole-genome identification of transposable elements reveals the equine repetitive element insertion polymorphism in Chinese horses. Anim Genet 2023; 54:144-154. [PMID: 36464985 DOI: 10.1111/age.13277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Revised: 10/29/2022] [Accepted: 11/14/2022] [Indexed: 12/12/2022]
Abstract
Transposable elements (TEs) are diverse, abundant, and complicated in genomes. They not only can drive the genome evolution process but can also act as special resources for adaptation. However, little is known about the evolutionary processes that shaped horses. In this work, 126 horse assemblages involved in most horse breeds in China were used to investigate the patterns of TE variation for the first time. By using RepeatMasker and melt software, we found that the horse-specific short interspersed repetitive elements family, equine repetitive elements (ERE1), exhibited polymorphisms in horse genomes. Phylogenetic analysis based on these ERE1 loci (minor allele frequency ≥0.05) revealed three major horse groups, namely, those in northern China, southern China, and Qinghai-Tibetan, which mirrors the result determined by SNPs to some extent. The present ERE1 family emerged ~0.26 to 1.77 Mya ago, with an activity peak at ~0.49 Mya, which matches the early stage of the horse lineage and decreases after the divergence of Equus caballus and Equus ferus przewalskii. To detect the functional ERE1(s) associated with adaptation, locus-specific branch length, genome-wide association study, and absolute allele frequency difference analyses were conducted and resulted in two common protein-coding genes annotated by candidate ERE1s. They were clustered into the vascular smooth muscle contraction (p = 0.01, EDNRA) and apelin signalling pathways (p = 0.02, NRF1). Notably, ERE1 insertion into the EDNRA gene showed a higher association with adaptation among southern China horses and other horses in 15 populations and 451 individuals (p = 4.55 e-8). Our results provide a comprehensive understanding of TE variations to analyse the phylogenetic relationships and traits relevant to adaptive evolution in horses.
Collapse
Affiliation(s)
- Xuexue Liu
- National Germplasm Centre of Domestic Animal Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China.,Centre d'Anthropobiologie et de Génomique de Toulouse, Université Paul Sabatier, Toulouse, France
| | - Yanli Zhang
- National Germplasm Centre of Domestic Animal Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China.,CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China
| | - Yabin Pu
- National Germplasm Centre of Domestic Animal Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China.,CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China
| | - Yuehui Ma
- National Germplasm Centre of Domestic Animal Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China.,CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China
| | - Lin Jiang
- National Germplasm Centre of Domestic Animal Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China.,CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China
| |
Collapse
|
7
|
Storer JM, Walker JA, Baker JN, Hossain S, Roos C, Wheeler TJ, Batzer MA. Framework of the Alu Subfamily Evolution in the Platyrrhine Three-Family Clade of Cebidae, Callithrichidae, and Aotidae. Genes (Basel) 2023; 14:249. [PMID: 36833175 PMCID: PMC9956951 DOI: 10.3390/genes14020249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 01/10/2023] [Accepted: 01/14/2023] [Indexed: 01/20/2023] Open
Abstract
The history of Alu retroposons has been choreographed by the systematic accumulation of inherited diagnostic nucleotide substitutions to form discrete subfamilies, each having a distinct nucleotide consensus sequence. The oldest subfamily, AluJ, gave rise to AluS after the split between Strepsirrhini and what would become Catarrhini and Platyrrhini. The AluS lineage gave rise to AluY in catarrhines and to AluTa in platyrrhines. Platyrrhine Alu subfamilies Ta7, Ta10, and Ta15 were assigned names based on a standardized nomenclature. However, with the subsequent intensification of whole genome sequencing (WGS), large scale analyses to characterize Alu subfamilies using the program COSEG identified entire lineages of subfamilies simultaneously. The first platyrrhine genome with WGS, the common marmoset (Callithrix jacchus; [caljac3]), resulted in Alu subfamily names sf0 to sf94 in an arbitrary order. Although easily resolved by alignment of the consensus sequences, this naming convention can become increasingly confusing as more genomes are independently analyzed. In this study, we reported Alu subfamily characterization for the platyrrhine three-family clade of Cebidae, Callithrichidae, and Aotidae. We investigated one species/genome from each recognized family of Callithrichidae and Aotidae and of both subfamilies (Cebinae and Saimiriinae) of the family Cebidae. Furthermore, we constructed a comprehensive network of Alu subfamily evolution within the three-family clade of platyrrhines to provide a working framework for future research. Alu expansion in the three-family clade has been dominated by AluTa15 and its derivatives.
Collapse
Affiliation(s)
- Jessica M. Storer
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA; (J.M.S.); (J.A.W.)
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Jerilyn A. Walker
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA; (J.M.S.); (J.A.W.)
| | - Jasmine N. Baker
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA;
| | - Shifat Hossain
- Department of Pharmacy Practice & Science, University of Arizona, Tucson, AZ 85721, USA; (S.H.); (T.J.W.)
| | - Christian Roos
- Gene Bank of Primates and Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research, 37077 Göttingen, Germany;
| | - Travis J. Wheeler
- Department of Pharmacy Practice & Science, University of Arizona, Tucson, AZ 85721, USA; (S.H.); (T.J.W.)
| | - Mark A. Batzer
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA; (J.M.S.); (J.A.W.)
| |
Collapse
|
8
|
Liu P, Cuerda-Gil D, Shahid S, Slotkin RK. The Epigenetic Control of the Transposable Element Life Cycle in Plant Genomes and Beyond. Annu Rev Genet 2022; 56:63-87. [DOI: 10.1146/annurev-genet-072920-015534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
Abstract
Within the life cycle of a living organism, another life cycle exists for the selfish genome inhabitants, which are called transposable elements (TEs). These mobile sequences invade, duplicate, amplify, and diversify within a genome, increasing the genome's size and generating new mutations. Cells act to defend their genome, but rather than permanently destroying TEs, they use chromatin-level repression and epigenetic inheritance to silence TE activity. This level of silencing is ephemeral and reversible, leading to a dynamic equilibrium between TE suppression and reactivation within a host genome. The coexistence of the TE and host genome can also lead to the domestication of the TE to serve in host genome evolution and function. In this review, we describe the life cycle of a TE, with emphasis on how epigenetic regulation is harnessed to control TEs for host genome stability and innovation.
Collapse
Affiliation(s)
- Peng Liu
- Donald Danforth Plant Science Center, St. Louis, Missouri, USA
| | - Diego Cuerda-Gil
- Donald Danforth Plant Science Center, St. Louis, Missouri, USA
- Graduate Program in the Department of Molecular Genetics, The Ohio State University, Columbus, Ohio, USA
| | - Saima Shahid
- Donald Danforth Plant Science Center, St. Louis, Missouri, USA
| | - R. Keith Slotkin
- Donald Danforth Plant Science Center, St. Louis, Missouri, USA
- Division of Biological Sciences, University of Missouri, Columbia, Missouri, USA
| |
Collapse
|
9
|
Wang J, Weatheritt R, Voineagu I. Alu-minating the Mechanisms Underlying Primate Cortex Evolution. Biol Psychiatry 2022; 92:760-771. [PMID: 35981906 DOI: 10.1016/j.biopsych.2022.04.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Revised: 04/04/2022] [Accepted: 04/28/2022] [Indexed: 11/02/2022]
Abstract
The higher-order cognitive functions observed in primates correlate with the evolutionary enhancement of cortical volume and folding, which in turn are driven by the primate-specific expansion of cellular diversity in the developing cortex. Underlying these changes is the diversification of molecular features including the creation of human and/or primate-specific genes, the activation of specific molecular pathways, and the interplay of diverse layers of gene regulation. We review and discuss evidence for connections between Alu elements and primate brain evolution, the evolutionary milestones of which are known to coincide along primate lineages. Alus are repetitive elements that contribute extensively to the acquisition of novel genes and the expansion of diverse gene regulatory layers, including enhancers, alternative splicing, RNA editing, and microRNA pathways. By reviewing the impact of Alus on molecular features linked to cortical expansions or gyrification or implications in cognitive deficits, we suggest that future research focusing on the role of Alu-derived molecular events in the context of brain development may greatly advance our understanding of higher-order cognitive functions and neurologic disorders.
Collapse
Affiliation(s)
- Juli Wang
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia.
| | - Robert Weatheritt
- St Vincent Clinical School, University of New South Wales, Sydney, Australia; Garvan Institute of Medical Research, EMBL Australia, Sydney, New South Wales, Australia
| | - Irina Voineagu
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia; Cellular Genomics Futures Institute, University of New South Wales, Sydney, Australia.
| |
Collapse
|
10
|
Billon V, Sanchez-Luque FJ, Rasmussen J, Bodea GO, Gerhardt DJ, Gerdes P, Cheetham SW, Schauer SN, Ajjikuttira P, Meyer TJ, Layman CE, Nevonen KA, Jansz N, Garcia-Perez JL, Richardson SR, Ewing AD, Carbone L, Faulkner GJ. Somatic retrotransposition in the developing rhesus macaque brain. Genome Res 2022; 32:1298-1314. [PMID: 35728967 PMCID: PMC9341517 DOI: 10.1101/gr.276451.121] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 06/14/2022] [Indexed: 12/03/2022]
Abstract
The retrotransposon LINE-1 (L1) is central to the recent evolutionary history of the human genome and continues to drive genetic diversity and germline pathogenesis. However, the spatiotemporal extent and biological significance of somatic L1 activity are poorly defined and are virtually unexplored in other primates. From a single L1 lineage active at the divergence of apes and Old World monkeys, successive L1 subfamilies have emerged in each descendant primate germline. As revealed by case studies, the presently active human L1 subfamily can also mobilize during embryonic and brain development in vivo. It is unknown whether nonhuman primate L1s can similarly generate somatic insertions in the brain. Here we applied approximately 40× single-cell whole-genome sequencing (scWGS), as well as retrotransposon capture sequencing (RC-seq), to 20 hippocampal neurons from two rhesus macaques (Macaca mulatta). In one animal, we detected and PCR-validated a somatic L1 insertion that generated target site duplications, carried a short 5′ transduction, and was present in ∼7% of hippocampal neurons but absent from cerebellum and nonbrain tissues. The corresponding donor L1 allele was exceptionally mobile in vitro and was embedded in PRDM4, a gene expressed throughout development and in neural stem cells. Nanopore long-read methylome and RNA-seq transcriptome analyses indicated young retrotransposon subfamily activation in the early embryo, followed by repression in adult tissues. These data highlight endogenous macaque L1 retrotransposition potential, provide prototypical evidence of L1-mediated somatic mosaicism in a nonhuman primate, and allude to L1 mobility in the brain over the past 30 million years of human evolution.
Collapse
|
11
|
Palmada-Flores M, Orkin JD, Haase B, Mountcastle J, Bertelsen MF, Fedrigo O, Kuderna LFK, Jarvis ED, Marques-Bonet T. A high-quality, long-read genome assembly of the endangered ring-tailed lemur (Lemur catta). Gigascience 2022; 11:6562532. [PMID: 35365833 PMCID: PMC8975718 DOI: 10.1093/gigascience/giac026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 01/14/2022] [Accepted: 02/19/2022] [Indexed: 01/31/2023] Open
Abstract
BACKGROUND The ring-tailed lemur (Lemur catta) is a charismatic strepsirrhine primate endemic to Madagascar. These lemurs are of particular interest, given their status as a flagship species and widespread publicity in the popular media. Unfortunately, a recent population decline has resulted in the census population decreasing to <2,500 individuals in the wild, and the species's classification as an endangered species by the IUCN. As is the case for most strepsirrhine primates, only a limited amount of genomic research has been conducted on L. catta, in part owing to the lack of genomic resources. RESULTS We generated a new high-quality reference genome assembly for L. catta (mLemCat1) that conforms to the standards of the Vertebrate Genomes Project. This new long-read assembly is composed of Pacific Biosciences continuous long reads (CLR data), Optical Mapping Bionano reads, Arima HiC data, and 10X linked reads. The contiguity and completeness of the assembly are extremely high, with scaffold and contig N50 values of 90.982 and 10.570 Mb, respectively. Additionally, when compared to other high-quality primate assemblies, L. catta has the lowest reported number of Alu elements, which results predominantly from a lack of AluS and AluY elements. CONCLUSIONS mLemCat1 is an excellent genomic resource not only for the ring-tailed lemur community, but also for other members of the Lemuridae family, and is the first very long read assembly for a strepsirrhine.
Collapse
Affiliation(s)
- Marc Palmada-Flores
- Department of Medicine and Life Sciences (MELIS), Institut de Biologia Evolutiva, Universitat Pompeu Fabra-CSIC, Barcelona 08003, Spain
| | - Joseph D Orkin
- Department of Medicine and Life Sciences (MELIS), Institut de Biologia Evolutiva, Universitat Pompeu Fabra-CSIC, Barcelona 08003, Spain.,Département d'anthropologie, Université de Montréal, Montréal, QC H3T 1N8, Canada
| | - Bettina Haase
- The Vertebrate Genomes Lab, The Rockefeller University, New York, NY 10065, USA
| | | | - Mads F Bertelsen
- Department of Veterinary and Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Frederiksberg C 1870, Denmark.,Center for Zoo and Wild Animal Health, Copenhagen Zoo, Frederiksber 1870, Denmark
| | - Olivier Fedrigo
- The Vertebrate Genomes Lab, The Rockefeller University, New York, NY 10065, USA
| | - Lukas F K Kuderna
- Department of Medicine and Life Sciences (MELIS), Institut de Biologia Evolutiva, Universitat Pompeu Fabra-CSIC, Barcelona 08003, Spain
| | - Erich D Jarvis
- The Vertebrate Genomes Lab, The Rockefeller University, New York, NY 10065, USA.,Center for Zoo and Wild Animal Health, Copenhagen Zoo, Frederiksber 1870, Denmark.,Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA.,Laboratory of Neurogenetics of Language, The Rockefeller University, NY 10065, USA
| | - Tomas Marques-Bonet
- Department of Medicine and Life Sciences (MELIS), Institut de Biologia Evolutiva, Universitat Pompeu Fabra-CSIC, Barcelona 08003, Spain.,Catalan Institution of Research and Advanced Studies (ICREA), Barcelona 08010, Spain.,CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelon 08028a, Spain.,Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Cerdanyola del Vallès 08193, Spain
| |
Collapse
|
12
|
Recently Integrated Alu Elements in Capuchin Monkeys: A Resource for Cebus/ Sapajus Genomics. Genes (Basel) 2022; 13:genes13040572. [PMID: 35456378 PMCID: PMC9030454 DOI: 10.3390/genes13040572] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 03/21/2022] [Accepted: 03/22/2022] [Indexed: 11/16/2022] Open
Abstract
Capuchins are platyrrhines (monkeys found in the Americas) within the Cebidae family. For most of their taxonomic history, the two main morphological types of capuchins, gracile (untufted) and robust (tufted), were assigned to a single genus, Cebus. Further, all tufted capuchins were assigned to a single species, Cebus apella, despite broad geographic ranges spanning Central and northern South America. In 2012, tufted capuchins were assigned to their genus, Sapajus, with eight currently recognized species and five Cebus species, although these numbers are still under debate. Alu retrotransposons are a class of mobile element insertion (MEI) widely used to study primate phylogenetics. However, Alu elements have rarely been used to study capuchins. Recent genome-level assemblies for capuchins (Cebus imitator; [Cebus_imitator_1.0] and Sapajus apella [GSC_monkey_1.0]) facilitated large scale ascertainment of young lineage-specific Alu insertions. Reported here are 1607 capuchin specific and 678 Sapajus specific Alu insertions along with candidate oligonucleotides for locus-specific PCR assays for many elements. PCR analyses identified 104 genus level and 51 species level Alu insertion polymorphisms. The Alu datasets reported in this study provide a valuable resource that will assist in the classification of archival samples lacking phenotypic data and for the study of capuchin phylogenetic relationships.
Collapse
|
13
|
Li Y, Jiang N, Sun Y. AnnoSINE: a short interspersed nuclear elements annotation tool for plant genomes. PLANT PHYSIOLOGY 2022; 188:955-970. [PMID: 34792587 PMCID: PMC8825457 DOI: 10.1093/plphys/kiab524] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 10/01/2021] [Indexed: 06/13/2023]
Abstract
Short interspersed nuclear elements (SINEs) are a widespread type of small transposable element (TE). With increasing evidence for their impact on gene function and genome evolution in plants, accurate genome-scale SINE annotation becomes a fundamental step for studying the regulatory roles of SINEs and their relationship with other components in the genomes. Despite the overall promising progress made in TE annotation, SINE annotation remains a major challenge. Unlike some other TEs, SINEs are short and heterogeneous, and they usually lack well-conserved sequence or structural features. Thus, current SINE annotation tools have either low sensitivity or high false discovery rates. Given the demand and challenges, we aimed to provide a more accurate and efficient SINE annotation tool for plant genomes. The pipeline starts with maximizing the pool of SINE candidates via profile hidden Markov model-based homology search and de novo SINE search using structural features. Then, it excludes the false positives by integrating all known features of SINEs and the features of other types of TEs that can often be misannotated as SINEs. As a result, the pipeline substantially improves the tradeoff between sensitivity and accuracy, with both values close to or over 90%. We tested our tool in Arabidopsis thaliana and rice (Oryza sativa), and the results show that our tool competes favorably against existing SINE annotation tools. The simplicity and effectiveness of this tool would potentially be useful for generating more accurate SINE annotations for other plant species. The pipeline is freely available at https://github.com/yangli557/AnnoSINE.
Collapse
Affiliation(s)
- Yang Li
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, Michigan 48824, USA
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China
| |
Collapse
|
14
|
Zhao P, Du H, Jiang L, Zheng X, Feng W, Diao C, Zhou L, Liu GE, Zhang H, Chamba Y, Zhang Q, Li B, Liu JF. PRE-1 Revealed Previous Unknown Introgression Events in Eurasian Boars during the Middle Pleistocene. Genome Biol Evol 2021; 12:1751-1764. [PMID: 33151306 PMCID: PMC7643367 DOI: 10.1093/gbe/evaa142] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/03/2020] [Indexed: 12/22/2022] Open
Abstract
Introgression events and population admixture occurred among Sus species across the Eurasian mainland in the Middle Pleistocene, which reflects the local adaption of different populations and contributes to evolutionary novelty. Previous findings on these population introgressions were largely based on extensive genome-wide single-nucleotide polymorphism information, ignoring structural variants (SVs) as an important alternative resource of genetic variations. Here, we profiled the genome-wide SVs and explored the formation of pattern-related SVs, indicating that PRE1-SS is a recently active subfamily that was strongly associated with introgression events in multiple Asian and European pig populations. As reflected by the three different combination haplotypes from two specific patterns and known phylogenetic relationships in Eurasian boars, we identified the Asian Northern wild pigs as having experienced introgression from European wild boars around 0.5–0.2 Ma and having received latitude-related selection. During further exploration of the influence of pattern-related SVs on gene functions, we found substantial sequence changes in 199 intron regions of 54 genes and 3 exon regions of 3 genes (HDX, TRO, and SMIM1), implying that the pattern-related SVs were highly related to positive selection and adaption of pigs. Our findings revealed novel introgression events in Eurasian wild boars, providing a timeline of population admixture and divergence across the Eurasian mainland in the Middle Pleistocene.
Collapse
Affiliation(s)
- Pengju Zhao
- National Engineering Laboratory for Animal Breeding; Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture; College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Heng Du
- National Engineering Laboratory for Animal Breeding; Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture; College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Lin Jiang
- Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China
| | - Xianrui Zheng
- National Engineering Laboratory for Animal Breeding; Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture; College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Wen Feng
- National Engineering Laboratory for Animal Breeding; Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture; College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Chenguang Diao
- National Engineering Laboratory for Animal Breeding; Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture; College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Lei Zhou
- National Engineering Laboratory for Animal Breeding; Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture; College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - George E Liu
- Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Maryland
| | - Hao Zhang
- National Engineering Laboratory for Animal Breeding; Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture; College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Yangzom Chamba
- College of Animal Science and Technology, Tibet Agriculture and Animal Husbandry College, Linzhi, Tibet, China
| | - Qin Zhang
- National Engineering Laboratory for Animal Breeding; Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture; College of Animal Science and Technology, China Agricultural University, Beijing, China.,College of Animal Science and Technology, Shandong Agricultural University, Taian, Shandong, PR China
| | - Bugao Li
- Department of Animal Sciences and Veterinary Medicine, Shanxi Agricultural University, Taigu, China
| | - Jian-Feng Liu
- National Engineering Laboratory for Animal Breeding; Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture; College of Animal Science and Technology, China Agricultural University, Beijing, China
| |
Collapse
|
15
|
Wong JYY, Cawthon R, Dai Y, Vermeulen R, Bassig BA, Hu W, Duan H, Niu Y, Downward GS, Leng S, Ji BT, Fu W, Xu J, Meliefste K, Zhou B, Yang J, Ren D, Ye M, Jia X, Meng T, Bin P, Hosgood Iii HD, Silverman DT, Rothman N, Zheng Y, Lan Q. Elevated Alu retroelement copy number among workers exposed to diesel engine exhaust. Occup Environ Med 2021; 78:823-828. [PMID: 34039759 DOI: 10.1136/oemed-2021-107462] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2021] [Revised: 05/03/2021] [Accepted: 05/07/2021] [Indexed: 11/04/2022]
Abstract
BACKGROUND Millions of workers worldwide are exposed to diesel engine exhaust (DEE), a known genotoxic carcinogen. Alu retroelements are repetitive DNA sequences that can multiply and compromise genomic stability. There is some evidence linking altered Alu repeats to cancer and elevated mortality risks. However, whether Alu repeats are influenced by environmental pollutants is unexplored. In an occupational setting with high DEE exposure levels, we investigated associations with Alu repeat copy number. METHODS A cross-sectional study of 54 male DEE-exposed workers from an engine testing facility and a comparison group of 55 male unexposed controls was conducted in China. Personal air samples were assessed for elemental carbon, a DEE surrogate, using NIOSH Method 5040. Quantitative PCR (qPCR) was used to measure Alu repeat copy number relative to albumin (Alb) single-gene copy number in leucocyte DNA. The unitless Alu/Alb ratio reflects the average quantity of Alu repeats per cell. Linear regression models adjusted for age and smoking status were used to estimate relations between DEE-exposed workers versus unexposed controls, DEE tertiles (6.1-39.0, 39.1-54.5 and 54.6-107.7 µg/m3) and Alu/Alb ratio. RESULTS DEE-exposed workers had a higher average Alu/Alb ratio than the unexposed controls (p=0.03). Further, we found a positive exposure-response relationship (p=0.02). The Alu/Alb ratio was highest among workers exposed to the top tertile of DEE versus the unexposed controls (1.12±0.08 SD vs 1.06±0.07 SD, p=0.01). CONCLUSION Our findings suggest that DEE exposure may contribute to genomic instability. Further investigations of environmental pollutants, Alu copy number and carcinogenesis are warranted.
Collapse
Affiliation(s)
- Jason Y Y Wong
- Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| | - Richard Cawthon
- Department of Human Genetics, University of Utah, Salt Lake City, Utah, USA
| | - Yufei Dai
- National Institute for Occupational Health and Poison Control, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Roel Vermeulen
- Division of Environmental Epidemiology, Institute for Risk Assessment Sciences, Utrecht University, Utrecht, The Netherlands
| | - Bryan A Bassig
- Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| | - Wei Hu
- Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| | - Huawei Duan
- National Institute for Occupational Health and Poison Control, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Yong Niu
- National Institute for Occupational Health and Poison Control, Chinese Center for Disease Control and Prevention, Beijing, China
| | - George S Downward
- Division of Environmental Epidemiology, Institute for Risk Assessment Sciences, Utrecht University, Utrecht, The Netherlands
| | - Shuguang Leng
- Department of Internal Medicine, School of Medicine, University of New Mexico, Albuquerque, New Mexico, USA
| | - Bu-Tian Ji
- Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| | - Wei Fu
- Chaoyang Center for Disease Control and Prevention, Chaoyang, Liaoning, China
| | - Jun Xu
- Hong Kong University, Hong Kong, China
| | - Kees Meliefste
- Division of Environmental Epidemiology, Institute for Risk Assessment Sciences, Utrecht University, Utrecht, The Netherlands
| | - Baosen Zhou
- China Medical University, Shenyang, Liaoning, China
| | - Jufang Yang
- Chaoyang Center for Disease Control and Prevention, Chaoyang, Liaoning, China
| | - Dianzhi Ren
- Chaoyang Center for Disease Control and Prevention, Chaoyang, Liaoning, China
| | - Meng Ye
- National Institute for Occupational Health and Poison Control, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Xiaowei Jia
- National Institute for Occupational Health and Poison Control, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Tao Meng
- National Institute for Occupational Health and Poison Control, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Ping Bin
- National Institute for Occupational Health and Poison Control, Chinese Center for Disease Control and Prevention, Beijing, China
| | - H Dean Hosgood Iii
- Division of Epidemiology, Albert Einstein College of Medicine, Yeshiva University, New York, New York, USA
| | - Debra T Silverman
- Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| | - Nathaniel Rothman
- Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| | - Yuxin Zheng
- National Institute for Occupational Health and Poison Control, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Qing Lan
- Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| |
Collapse
|
16
|
Long X, Xue H. Genetic-variant hotspots and hotspot clusters in the human genome facilitating adaptation while increasing instability. Hum Genomics 2021; 15:19. [PMID: 33741065 PMCID: PMC7976700 DOI: 10.1186/s40246-021-00318-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2020] [Accepted: 03/04/2021] [Indexed: 12/25/2022] Open
Abstract
Background Genetic variants, underlining phenotypic diversity, are known to distribute unevenly in the human genome. A comprehensive understanding of the distributions of different genetic variants is important for insights into genetic functions and disorders. Methods Herein, a sliding-window scan of regional densities of eight kinds of germline genetic variants, including single-nucleotide-polymorphisms (SNPs) and four size-classes of copy-number-variations (CNVs) in the human genome has been performed. Results The study has identified 44,379 hotspots with high genetic-variant densities, and 1135 hotspot clusters comprising more than one type of hotspots, accounting for 3.1% and 0.2% of the genome respectively. The hotspots and clusters are found to co-localize with different functional genomic features, as exemplified by the associations of hotspots of middle-size CNVs with histone-modification sites, work with balancing and positive selections to meet the need for diversity in immune proteins, and facilitate the development of sensory-perception and neuroactive ligand-receptor interaction pathways in the function-sparse late-replicating genomic sequences. Genetic variants of different lengths co-localize with retrotransposons of different ages on a “long-with-young” and “short-with-all” basis. Hotspots and clusters are highly associated with tumor suppressor genes and oncogenes (p < 10−10), and enriched with somatic tumor CNVs and the trait- and disease-associated SNPs identified by genome-wise association studies, exceeding tenfold enrichment in clusters comprising SNPs and extra-long CNVs. Conclusions In conclusion, the genetic-variant hotspots and clusters represent two-edged swords that spearhead both positive and negative genomic changes. Their strong associations with complex traits and diseases also open up a potential “Common Disease-Hotspot Variant” approach to the missing heritability problem. Supplementary Information The online version contains supplementary material available at 10.1186/s40246-021-00318-3.
Collapse
Affiliation(s)
- Xi Long
- Division of Life Science and Applied Genomics Centre, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, China.,HKUST Shenzhen Research Institute, 9 Yuexing First Road, Nanshan, Shenzhen, China
| | - Hong Xue
- Division of Life Science and Applied Genomics Centre, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, China. .,HKUST Shenzhen Research Institute, 9 Yuexing First Road, Nanshan, Shenzhen, China. .,Centre for Cancer Genomics, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, Jiangsu, China.
| |
Collapse
|
17
|
Casanova EL, Konkel MK. The Developmental Gene Hypothesis for Punctuated Equilibrium: Combined Roles of Developmental Regulatory Genes and Transposable Elements. Bioessays 2020; 42:e1900173. [PMID: 31943266 PMCID: PMC7029956 DOI: 10.1002/bies.201900173] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Revised: 11/30/2019] [Indexed: 12/13/2022]
Abstract
Theories of the genetics underlying punctuated equilibrium (PE) have been vague to date. Here the developmental gene hypothesis is proposed, which states that: 1) developmental regulatory (DevReg) genes are responsible for the orchestration of metazoan morphogenesis and their extreme conservation and mutation intolerance generates the equilibrium or stasis present throughout much of the fossil record and 2) the accumulation of regulatory elements and recombination within these same genes-often derived from transposable elements-drives punctuated bursts of morphological divergence and speciation across metazoa. This two-part hypothesis helps to explain the features that characterize PE, providing a theoretical genetic basis for the once-controversial theory. Also see the video abstract here https://youtu.be/C-fu-ks5yDs.
Collapse
Affiliation(s)
- Emily L. Casanova
- Department of Biomedical Sciences, University of South Carolina School of Medicine at Greenville, Greenville, South Carolina, USA
| | - Miriam K. Konkel
- Department of Genetics and Biochemistry, Clemson Center for Human Genetics, Biomedical Data Science and Informatics Program, Clemson University, Clemson, South Carolina, USA
| |
Collapse
|
18
|
Lee W, Choi M, Kim S, Tang W, Kim DH, Kim HS, Liang P, Han K. A comprehensive analysis of the Baboon-specific full-length LINE-1 retrotransposons. Genes Genomics 2019; 41:831-837. [PMID: 30887304 DOI: 10.1007/s13258-019-00794-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2018] [Accepted: 02/08/2019] [Indexed: 11/26/2022]
Abstract
BACKGROUND Long interspersed elements-1 (LINE-1s or L1s) and Alu elements are most successful retrotransposons that have generated genetic diversity and genomic fluidity in the primate genome. They account for ~ 27.7% of the primate genome. Interestingly, a previous study has shown that the retrotransposition rate of Alu elements is nine times higher in baboons than in humans. OBJECTIVE The expansion of Alu copies could be dependent on the activity of L1-encoded proteins. Thus, we aimed to investigate full-length baboon-specific L1s and characterize structurally and functionally intact baboon-specific L1s (ORF1p/ORF2p and ORF2p only) that could induce trans-mobilization of Alu elements in the baboon genome. RESULTS A total of 673 baboon-specific L1 candidates (> 4 kb) were identified through the comparative genomic analysis. Applying the baboon-specific correction value obtained from the experimental validation, it demonstrated that approximately 446 baboon-specific L1s (> 4 kb) were present in the baboon reference genome (papAnu2). In addition, we observed phylogenetic relationship of the baboon-specific L1s through the neighbor-joining method and they diverged from the L1PA6 consensus sequence. Finally, we identified 36 full-length baboon-specific L1s that were intact both ORF1p and ORF2p. CONCLUSION The number of baboon-specific full-length L1s is fewer than the number of human-specific full-length L1s. Therefore, there is possibility that the "L1 master gene" or "L1 source gene" is more abundant in the baboon genome, or that in trans retrotransposition activity of baboon-specific L1s is relatively stronger than in the other genomes.
Collapse
Affiliation(s)
- Wooseok Lee
- Department of Nanobiomedical Science, BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan, 31116, Republic of Korea
- DKU-Theragen institute for NGS analysis (DTiNa), Cheonan, 31116, Republic of Korea
| | - Minhoon Choi
- Department of Nanobiomedical Science, BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan, 31116, Republic of Korea
- DKU-Theragen institute for NGS analysis (DTiNa), Cheonan, 31116, Republic of Korea
| | - Songmi Kim
- Department of Nanobiomedical Science, BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan, 31116, Republic of Korea
- DKU-Theragen institute for NGS analysis (DTiNa), Cheonan, 31116, Republic of Korea
| | - Wanxiangfu Tang
- Department of Biological Sciences, Brock University, St. Catharines, ON, L2S 3A1, Canada
| | - Dong Hee Kim
- Department of Anesthesiology and Pain Management, College of Medicine, Dankook University, Cheonan, 31116, Republic of Korea
| | - Heui-Soo Kim
- Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan, 46241, Republic of Korea
| | - Ping Liang
- Department of Biological Sciences, Brock University, St. Catharines, ON, L2S 3A1, Canada
| | - Kyudong Han
- Department of Nanobiomedical Science, BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan, 31116, Republic of Korea.
- DKU-Theragen institute for NGS analysis (DTiNa), Cheonan, 31116, Republic of Korea.
| |
Collapse
|
19
|
Xu S, Feng Y, Zhao S. Proteins with Evolutionarily Hypervariable Domains are Associated with Immune Response and Better Survival of Basal-like Breast Cancer Patients. Comput Struct Biotechnol J 2019; 17:430-440. [PMID: 30996822 PMCID: PMC6451114 DOI: 10.1016/j.csbj.2019.03.008] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2018] [Revised: 02/21/2019] [Accepted: 03/16/2019] [Indexed: 10/27/2022] Open
Abstract
Maltase-glucoamylase (MGAM) and MGAM2 both belong to the glycoside hydrolase family 31. MGAM, a therapeutic target for type 2 diabetes, is α-1,4-glucosidase and expressed in the intestine to catalyze starch digestion. MGAM2, however, is largely uncharacterized. By investigating The Cancer Genome Atlas data, we found that among breast cancer subtypes, MGAM2 expression is nearly exclusive to basal-like breast cancers (BLBCs), whereas MGAM tends to express in luminal A breast cancers. Moreover, MGAM2 expression is associated with better patient survival and correlated with immune genes/signatures, unlike MGAM. Both genes have emerged in mammals, but diverged after the placental-marsupial split. In placentals, MGAM2 has likely lost its α-1,4-glucosidase activity due to mutations in key catalytic sites, and has acquired a large domain that is extracellular, threonine-rich and evolutionarily hypervariable (EHV). Guided by MGAM2 findings, our genome-wide search identified >1000 human proteins with EHV regions. These proteins are enriched in immune functions and molecules, including major histocompatibility complex proteins. Their genes are expressed higher in BLBCs and are associated with better patient survival, like MGAM2. Their EHV-coding sequences are rich in simple repeats and harbor more cancer passenger mutations. In conclusion, MGAM2 diverges from MGAM structurally and likely functionally in placentals. MGAM2 is among >1000 human proteins with EHV regions and associated with immune response. We propose that these EHV molecules may have significant implication in cancer immunotherapy and BLBC treatment.
Collapse
Affiliation(s)
- Shutan Xu
- Department of Biochemistry and Molecular Biology, Institute of Bioinformatics, University of Georgia, Athens, GA 30602-7229, USA
| | - Yuan Feng
- Department of Biochemistry and Molecular Biology, Institute of Bioinformatics, University of Georgia, Athens, GA 30602-7229, USA
| | - Shaying Zhao
- Department of Biochemistry and Molecular Biology, Institute of Bioinformatics, University of Georgia, Athens, GA 30602-7229, USA
| |
Collapse
|
20
|
Moshiri N, Mirarab S. A Two-State Model of Tree Evolution and Its Applications to Alu Retrotransposition. Syst Biol 2018; 67:475-489. [PMID: 29165679 DOI: 10.1093/sysbio/syx088] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 11/15/2017] [Indexed: 11/14/2022] Open
Abstract
Models of tree evolution have mostly focused on capturing the cladogenesis processes behind speciation. Processes that derive the evolution of genomic elements, such as repeats, are not necessarily captured by these existing models. In this article, we design a model of tree evolution that we call the dual-birth model, and we show how it can be useful in studying the evolution of short Alu repeats found in the human genome in abundance. The dual-birth model extends the traditional birth-only model to have two rates of propagation, one for active nodes that propagate often, and another for inactive nodes, that with a lower rate, activate and start propagating. Adjusting the ratio of the rates controls the expected tree balance. We present several theoretical results under the dual-birth model, introduce parameter estimation techniques, and study the properties of the model in simulations. We then use the dual-birth model to estimate the number of active Alu elements and their rates of propagation and activation in the human genome based on a large phylogenetic tree that we build from close to one million Alu sequences.
Collapse
Affiliation(s)
- Niema Moshiri
- Bioinformatics and Systems Biology Graduate Program, UC San Diego, 9500 Gilman Dr., La Jolla, CA 92093, USA
| | - Siavash Mirarab
- Department of Electrical and Computer Engineering, UC San Diego, 9500 Gilman Dr., La Jolla, CA 92093, USA
| |
Collapse
|
21
|
Buena-Atienza E, Nasser F, Kohl S, Wissinger B. A 73,128 bp de novo deletion encompassing the OPN1LW/OPN1MW gene cluster in sporadic Blue Cone Monochromacy: a case report. BMC MEDICAL GENETICS 2018; 19:107. [PMID: 29940872 PMCID: PMC6019650 DOI: 10.1186/s12881-018-0623-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/15/2018] [Accepted: 06/12/2018] [Indexed: 01/04/2023]
Abstract
Background Blue Cone Monochromacy (BCM) is a rare congenital cone dysfunction disorder with X-linked recessive mode of inheritance. BCM is caused by mutations at the OPN1LW/MW cone opsin gene cluster including deletions of the locus control region (LCR) and/or parts of the gene cluster. We aimed at investigating the clinical presentation, genetic cause and inheritance underlying a sporadic case of BCM. Case presentation We report a 24-year-old male presenting with congenital photophobia, nystagmus and colour vision abnormalities. There was no history of retinal dystrophy in the family. Clinical diagnosis of BCM was supported by genetic investigations of the patient and his family members. Molecular genetic analysis of the OPN1LW/OPN1MW gene cluster revealed a novel deletion of about 73 kb in the patient encompassing the LCR. The deletion was absent in the X-chromosomes of both the mother and transmitting grandfather. Conclusions The present report provides the clinical findings and the genetic basis underlying a sporadic BCM case which is caused by a de novo deletion within the OPN1LW/MW gene cluster originating from the mother’s germline due to Alu-repeat mediated recombination. This is the first report of a de novo deletion resulting in BCM, highlighting the importance to consider BCM and perform genetic testing for this condition in male patients with cone dysfunction also in the absence of a positive family history.
Collapse
Affiliation(s)
- Elena Buena-Atienza
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tuebingen, Elfriede-Aulhorn 7, D-72076, Tuebingen, Germany
| | - Fadi Nasser
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tuebingen, Elfriede-Aulhorn 7, D-72076, Tuebingen, Germany
| | - Susanne Kohl
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tuebingen, Elfriede-Aulhorn 7, D-72076, Tuebingen, Germany
| | - Bernd Wissinger
- Institute for Ophthalmic Research, Centre for Ophthalmology, University of Tuebingen, Elfriede-Aulhorn 7, D-72076, Tuebingen, Germany.
| |
Collapse
|
22
|
Mao H, Wang H. Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs). Genome Biol Evol 2018; 9:2048-2056. [PMID: 28903462 PMCID: PMC5585668 DOI: 10.1093/gbe/evx145] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/29/2017] [Indexed: 02/06/2023] Open
Abstract
Instances of highly conserved plant short interspersed nuclear element (SINE) families and their enrichment near genes have been well documented, but little is known about the general patterns of such conservation and enrichment and underlying mechanisms. Here, we perform a comprehensive investigation of the structure, distribution, and evolution of SINEs in the grass family by analyzing 14 grass and 5 other flowering plant genomes using comparative genomics methods. We identify 61 SINE families composed of 29,572 copies, in which 46 families are first described. We find that comparing with other grass TEs, grass SINEs show much higher level of conservation in terms of genomic retention: The origin of at least 26% families can be traced to early grass diversification and these families are among most abundant SINE families in 86% species. We find that these families show much higher level of enrichment near protein coding genes than families of relatively recent origin (51%:28%), and that 40% of all grass SINEs are near gene and the percentage is higher than other types of grass TEs. The pattern of enrichment suggests that differential removal of SINE copies in gene-poor regions plays an important role in shaping the genomic distribution of these elements. We also identify a sequence motif located at 3' SINE end which is shared in 17 families. In short, this study provides insights into structure and evolution of SINEs in the grass family.
Collapse
Affiliation(s)
- Hongliang Mao
- Department of Physics, T-Life Research Center, Fudan University, Shanghai, P.R. China
| | - Hao Wang
- Department of Physics, T-Life Research Center, Fudan University, Shanghai, P.R. China.,Department of Genetics, University of Georgia
| |
Collapse
|
23
|
Thybert D, Roller M, Navarro FCP, Fiddes I, Streeter I, Feig C, Martin-Galvez D, Kolmogorov M, Janoušek V, Akanni W, Aken B, Aldridge S, Chakrapani V, Chow W, Clarke L, Cummins C, Doran A, Dunn M, Goodstadt L, Howe K, Howell M, Josselin AA, Karn RC, Laukaitis CM, Jingtao L, Martin F, Muffato M, Nachtweide S, Quail MA, Sisu C, Stanke M, Stefflova K, Van Oosterhout C, Veyrunes F, Ward B, Yang F, Yazdanifar G, Zadissa A, Adams DJ, Brazma A, Gerstein M, Paten B, Pham S, Keane TM, Odom DT, Flicek P. Repeat associated mechanisms of genome evolution and function revealed by the Mus caroli and Mus pahari genomes. Genome Res 2018; 28:448-459. [PMID: 29563166 PMCID: PMC5880236 DOI: 10.1101/gr.234096.117] [Citation(s) in RCA: 75] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2017] [Accepted: 03/05/2018] [Indexed: 12/31/2022]
Abstract
Understanding the mechanisms driving lineage-specific evolution in both primates and rodents has been hindered by the lack of sister clades with a similar phylogenetic structure having high-quality genome assemblies. Here, we have created chromosome-level assemblies of the Mus caroli and Mus pahari genomes. Together with the Mus musculus and Rattus norvegicus genomes, this set of rodent genomes is similar in divergence times to the Hominidae (human-chimpanzee-gorilla-orangutan). By comparing the evolutionary dynamics between the Muridae and Hominidae, we identified punctate events of chromosome reshuffling that shaped the ancestral karyotype of Mus musculus and Mus caroli between 3 and 6 million yr ago, but that are absent in the Hominidae. Hominidae show between four- and sevenfold lower rates of nucleotide change and feature turnover in both neutral and functional sequences, suggesting an underlying coherence to the Muridae acceleration. Our system of matched, high-quality genome assemblies revealed how specific classes of repeats can play lineage-specific roles in related species. Recent LINE activity has remodeled protein-coding loci to a greater extent across the Muridae than the Hominidae, with functional consequences at the species level such as reproductive isolation. Furthermore, we charted a Muridae-specific retrotransposon expansion at unprecedented resolution, revealing how a single nucleotide mutation transformed a specific SINE element into an active CTCF binding site carrier specifically in Mus caroli, which resulted in thousands of novel, species-specific CTCF binding sites. Our results show that the comparison of matched phylogenetic sets of genomes will be an increasingly powerful strategy for understanding mammalian biology.
Collapse
Affiliation(s)
- David Thybert
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
- Earlham Institute, Norwich Research Park, Norwich NR4 7UH, United Kingdom
| | - Maša Roller
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Fábio C P Navarro
- Yale University Medical School, Computational Biology and Bioinformatics Program, New Haven, Connecticut 06520, USA
| | - Ian Fiddes
- Department of Biomolecular Engineering, University of California, Santa Cruz, California 95064, USA
| | - Ian Streeter
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Christine Feig
- University of Cambridge, Cancer Research UK Cambridge Institute, Robinson Way, Cambridge CB2 0RE, United Kingdom
| | - David Martin-Galvez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Mikhail Kolmogorov
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, California 92092, USA
| | - Václav Janoušek
- Department of Zoology, Faculty of Science, Charles University in Prague, 128 44 Prague, Czech Republic
| | - Wasiu Akanni
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Bronwen Aken
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Sarah Aldridge
- University of Cambridge, Cancer Research UK Cambridge Institute, Robinson Way, Cambridge CB2 0RE, United Kingdom
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Varshith Chakrapani
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - William Chow
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Laura Clarke
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Carla Cummins
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Anthony Doran
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Matthew Dunn
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Leo Goodstadt
- Wellcome Trust Centre for Human Genetics, Oxford OX3 7BN, United Kingdom
| | - Kerstin Howe
- Yale University Medical School, Computational Biology and Bioinformatics Program, New Haven, Connecticut 06520, USA
| | - Matthew Howell
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Ambre-Aurore Josselin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Robert C Karn
- Department of Medicine, College of Medicine, University of Arizona, Tuscon, Arizona 85724, USA
| | - Christina M Laukaitis
- Department of Medicine, College of Medicine, University of Arizona, Tuscon, Arizona 85724, USA
| | - Lilue Jingtao
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Fergal Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Matthieu Muffato
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Stefanie Nachtweide
- Institute of Mathematics and Computer Science, University of Greifswald, Greifswald 17487, Germany
| | - Michael A Quail
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Cristina Sisu
- Yale University Medical School, Computational Biology and Bioinformatics Program, New Haven, Connecticut 06520, USA
| | - Mario Stanke
- Institute of Mathematics and Computer Science, University of Greifswald, Greifswald 17487, Germany
| | - Klara Stefflova
- University of Cambridge, Cancer Research UK Cambridge Institute, Robinson Way, Cambridge CB2 0RE, United Kingdom
| | - Cock Van Oosterhout
- School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, United Kingdom
| | - Frederic Veyrunes
- Institut des Sciences de l'Evolution de Montpellier, Université Montpellier/CNRS, 34095 Montpellier, France
| | - Ben Ward
- Earlham Institute, Norwich Research Park, Norwich NR4 7UH, United Kingdom
| | - Fengtang Yang
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Golbahar Yazdanifar
- Department of Medicine, College of Medicine, University of Arizona, Tuscon, Arizona 85724, USA
| | - Amonida Zadissa
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - David J Adams
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Alvis Brazma
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Mark Gerstein
- Yale University Medical School, Computational Biology and Bioinformatics Program, New Haven, Connecticut 06520, USA
| | - Benedict Paten
- Department of Biomolecular Engineering, University of California, Santa Cruz, California 95064, USA
| | - Son Pham
- Bioturing Inc, San Diego, California 92121, USA
| | - Thomas M Keane
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Duncan T Odom
- University of Cambridge, Cancer Research UK Cambridge Institute, Robinson Way, Cambridge CB2 0RE, United Kingdom
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom
| |
Collapse
|
24
|
Disease onset in X-linked dystonia-parkinsonism correlates with expansion of a hexameric repeat within an SVA retrotransposon in TAF1. Proc Natl Acad Sci U S A 2017; 114:E11020-E11028. [PMID: 29229810 PMCID: PMC5754783 DOI: 10.1073/pnas.1712526114] [Citation(s) in RCA: 97] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
The genetic basis of X-Linked dystonia-parkinsonism (XDP) has been difficult to unravel, in part because all patients inherit the same haplotype of seven sequence variants, none of which has ever been identified in control individuals. This study revealed that one of the haplotype markers, a retrotransposon insertion within an intron of TAF1, has a variable number of hexameric repeats among affected individuals with an increase in repeat number strongly correlated with earlier age at disease onset. These data support a contributing role for this sequence in disease pathogenesis while further suggesting that XDP may be part of a growing list of neurodegenerative disorders associated with unstable repeat expansions. X-linked dystonia-parkinsonism (XDP) is a neurodegenerative disease associated with an antisense insertion of a SINE-VNTR-Alu (SVA)-type retrotransposon within an intron of TAF1. This unique insertion coincides with six additional noncoding sequence changes in TAF1, the gene that encodes TATA-binding protein–associated factor-1, which appear to be inherited together as an identical haplotype in all reported cases. Here we examined the sequence of this SVA in XDP patients (n = 140) and detected polymorphic variation in the length of a hexanucleotide repeat domain, (CCCTCT)n. The number of repeats in these cases ranged from 35 to 52 and showed a highly significant inverse correlation with age at disease onset. Because other SVAs exhibit intrinsic promoter activity that depends in part on the hexameric domain, we assayed the transcriptional regulatory effects of varying hexameric lengths found in the unique XDP SVA retrotransposon using luciferase reporter constructs. When inserted sense or antisense to the luciferase reading frame, the XDP variants repressed or enhanced transcription, respectively, to an extent that appeared to vary with length of the hexamer. Further in silico analysis of this SVA sequence revealed multiple motifs predicted to form G-quadruplexes, with the greatest potential detected for the hexameric repeat domain. These data directly link sequence variation within the XDP-specific SVA sequence to phenotypic variability in clinical disease manifestation and provide insight into potential mechanisms by which this intronic retroelement may induce transcriptional interference in TAF1 expression.
Collapse
|
25
|
Hoad G, Harrison K. The Design and Optimization of DNA Methylation Pyrosequencing Assays Targeting Region-Specific Repeat Elements. Methods Mol Biol 2017; 1589:17-27. [PMID: 26246355 DOI: 10.1007/7651_2015_285] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
Abstract
Epigenetic modifications, such as DNA methylation, can contribute to gene regulation and chromosomal stability. There are several methods and techniques available for methylation analysis, ranging from global methylation to gene-specific targeted regions. Bisulfite conversion enables numerous methodologies to be used for downstream applications, including pyrosequencing which measures DNA methylation at an individual CpG site level. This allows specific regions of interest to be targeted for DNA methylation analysis. Designing and optimizing pyrosequencing assays correctly is vital for the interpretation of results.Dysregulation of DNA methylation has been implicated in human diseases, with regions such as repeat elements commonly altered. Human population studies investigating these tend to use consensus sequences to target repeat elements. However, these elements have high mutational rates, particularly Alu sequences, which could lead to assay bias and masking of changes at a regional level. Therefore, it may be more beneficial to target specific repeat elements depending upon their chromosomal location, rather than analyzing overall methylation levels.
Collapse
Affiliation(s)
- Gwen Hoad
- Natural Products Group, Rowett Institute of Nutrition and Health, University of Aberdeen, Greenburn Road, Bucksburn, Aberdeen, AB21 9SB, Scotland, UK
| | - Kristina Harrison
- Natural Products Group, Rowett Institute of Nutrition and Health, University of Aberdeen, Greenburn Road, Bucksburn, Aberdeen, AB21 9SB, Scotland, UK.
| |
Collapse
|
26
|
Turner TN, Coe BP, Dickel DE, Hoekzema K, Nelson BJ, Zody MC, Kronenberg ZN, Hormozdiari F, Raja A, Pennacchio LA, Darnell RB, Eichler EE. Genomic Patterns of De Novo Mutation in Simplex Autism. Cell 2017; 171:710-722.e12. [PMID: 28965761 PMCID: PMC5679715 DOI: 10.1016/j.cell.2017.08.047] [Citation(s) in RCA: 225] [Impact Index Per Article: 32.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Revised: 08/03/2017] [Accepted: 08/25/2017] [Indexed: 12/22/2022]
Abstract
To further our understanding of the genetic etiology of autism, we generated and analyzed genome sequence data from 516 idiopathic autism families (2,064 individuals). This resource includes >59 million single-nucleotide variants (SNVs) and 9,212 private copy number variants (CNVs), of which 133,992 and 88 are de novo mutations (DNMs), respectively. We estimate a mutation rate of ∼1.5 × 10-8 SNVs per site per generation with a significantly higher mutation rate in repetitive DNA. Comparing probands and unaffected siblings, we observe several DNM trends. Probands carry more gene-disruptive CNVs and SNVs, resulting in severe missense mutations and mapping to predicted fetal brain promoters and embryonic stem cell enhancers. These differences become more pronounced for autism genes (p = 1.8 × 10-3, OR = 2.2). Patients are more likely to carry multiple coding and noncoding DNMs in different genes, which are enriched for expression in striatal neurons (p = 3 × 10-3), suggesting a path forward for genetically characterizing more complex cases of autism.
Collapse
Affiliation(s)
- Tychele N Turner
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Bradley P Coe
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Diane E Dickel
- Functional Genomics Department, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Bradley J Nelson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | | | - Zev N Kronenberg
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Fereydoun Hormozdiari
- Department of Biochemistry and Molecular Medicine, University of California, Davis, Davis, CA 95817, USA
| | - Archana Raja
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| | - Len A Pennacchio
- Functional Genomics Department, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; U.S. Department of Energy Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Robert B Darnell
- New York Genome Center, New York, NY 10013, USA; Laboratory of Molecular Neuro-Oncology, The Rockefeller University, New York, NY 10065, USA; Howard Hughes Medical Institute, The Rockefeller University, New York, NY 10065, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
27
|
Genome sequence of the basal haplorrhine primate Tarsius syrichta reveals unusual insertions. Nat Commun 2016; 7:12997. [PMID: 27708261 PMCID: PMC5059674 DOI: 10.1038/ncomms12997] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Accepted: 08/17/2016] [Indexed: 12/28/2022] Open
Abstract
Tarsiers are phylogenetically located between the most basal strepsirrhines and the most derived anthropoid primates. While they share morphological features with both groups, they also possess uncommon primate characteristics, rendering their evolutionary history somewhat obscure. To investigate the molecular basis of such attributes, we present here a new genome assembly of the Philippine tarsier (Tarsius syrichta), and provide extended analyses of the genome and detailed history of transposable element insertion events. We describe the silencing of Alu monomers on the lineage leading to anthropoids, and recognize an unexpected abundance of long terminal repeat-derived and LINE1-mobilized transposed elements (Tarsius interspersed elements; TINEs). For the first time in mammals, we identify a complete mitochondrial genome insertion within the nuclear genome, then reveal tarsier-specific, positive gene selection and posit population size changes over time. The genomic resources and analyses presented here will aid efforts to more fully understand the ancient characteristics of primate genomes. Tarsiers occupy a key node between strepsirrhines and anthropoids in the primate phylogeny. Here, Warren and colleagues present the genome of Tarsius syrichta, including a survey of transposable elements, an unusual mitochondrial insertion, and evidence for positive gene selection.
Collapse
|
28
|
Sheinman M, Ramisch A, Massip F, Arndt PF. Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences. Sci Rep 2016; 6:30851. [PMID: 27488939 PMCID: PMC4973250 DOI: 10.1038/srep30851] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2016] [Accepted: 07/08/2016] [Indexed: 12/26/2022] Open
Abstract
Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as Zipf's law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes.
Collapse
Affiliation(s)
- Michael Sheinman
- Max Planck Institute for Molecular Genetics, Berlin, Germany
- Department of Biology, Faculty of Science, Utrecht University, Utrecht, the Netherlands
| | - Anna Ramisch
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Florian Massip
- Max Planck Institute for Molecular Genetics, Berlin, Germany
- INRA, UR1404 Mathématique Informatique Appliquées du Génome á l’Environnement-F-78350 Jouy-en Josas, France
| | - Peter F. Arndt
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| |
Collapse
|
29
|
Bailey J. Monkey-based research on human disease: the implications of genetic differences. Altern Lab Anim 2016; 42:287-317. [PMID: 25413291 DOI: 10.1177/026119291404200504] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Assertions that the use of monkeys to investigate human diseases is valid scientifically are frequently based on a reported 90-93% genetic similarity between the species. Critical analyses of the relevance of monkey studies to human biology, however, indicate that this genetic similarity does not result in sufficient physiological similarity for monkeys to constitute good models for research, and that monkey data do not translate well to progress in clinical practice for humans. Salient examples include the failure of new drugs in clinical trials, the highly different infectivity and pathology of SIV/HIV, and poor extrapolation of research on Alzheimer's disease, Parkinson's disease and stroke. The major molecular differences underlying these inter-species phenotypic disparities have been revealed by comparative genomics and molecular biology - there are key differences in all aspects of gene expression and protein function, from chromosome and chromatin structure to post-translational modification. The collective effects of these differences are striking, extensive and widespread, and they show that the superficial similarity between human and monkey genetic sequences is of little benefit for biomedical research. The extrapolation of biomedical data from monkeys to humans is therefore highly unreliable, and the use of monkeys must be considered of questionable value, particularly given the breadth and potential of alternative methods of enquiry that are currently available to scientists.
Collapse
Affiliation(s)
- Jarrod Bailey
- New England Anti-Vivisection Society (NEAVS), Boston, MA, USA
| |
Collapse
|
30
|
Seibt KM, Wenke T, Muders K, Truberg B, Schmidt T. Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 86:268-285. [PMID: 26996788 DOI: 10.1111/tpj.13170] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2016] [Revised: 03/11/2016] [Accepted: 03/14/2016] [Indexed: 06/05/2023]
Abstract
Short interspersed nuclear elements (SINEs) are highly abundant non-autonomous retrotransposons that are widespread in plants. They are short in size, non-coding, show high sequence diversity, and are therefore mostly not or not correctly annotated in plant genome sequences. Hence, comparative studies on genomic SINE populations are rare. To explore the structural organization and impact of SINEs, we comparatively investigated the genome sequences of the Solanaceae species potato (Solanum tuberosum), tomato (Solanum lycopersicum), wild tomato (Solanum pennellii), and two pepper cultivars (Capsicum annuum). Based on 8.5 Gbp sequence data, we annotated 82 983 SINE copies belonging to 10 families and subfamilies on a base pair level. Solanaceae SINEs are dispersed over all chromosomes with enrichments in distal regions. Depending on the genome assemblies and gene predictions, 30% of all SINE copies are associated with genes, particularly frequent in introns and untranslated regions (UTRs). The close association with genes is family specific. More than 10% of all genes annotated in the Solanaceae species investigated contain at least one SINE insertion, and we found genes harbouring up to 16 SINE copies. We demonstrate the involvement of SINEs in gene and genome evolution including the donation of splice sites, start and stop codons and exons to genes, enlargement of introns and UTRs, generation of tandem-like duplications and transduction of adjacent sequence regions.
Collapse
Affiliation(s)
- Kathrin M Seibt
- Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Torsten Wenke
- Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | | | | | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|
31
|
Zhang Y, Peng X, Tang Y, Gan X, Wang C, Xie L, Xie X, Gan R, Wu Y. Identification of IgH gene rearrangement and immunophenotype in an animal model of Epstein-Barr virus-associated lymphomas. J Med Virol 2016; 88:1804-13. [PMID: 26991077 DOI: 10.1002/jmv.24526] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/14/2016] [Indexed: 11/10/2022]
Abstract
Epstein-Barr virus (EBV) is a human oncogenic herpesvirus associated with lymphoma and nasopharyngeal carcinoma. Because the susceptible hosts of EB virus are limited to human and cotton-top tamarins (Saguinus oedipus), there have been no appropriate animal models until the lymphoma model induced by EBV in human peripheral blood lymphocyte (hu-PBL)/SCID chimeric mice was reported. However, it is still controversial whether the EBV-associated lymphoma induced in hu-PBL/SCID mice is a monoclonal tumor. In this study, we transplanted normal human peripheral blood lymphocytes (hu-PBL) from six donors infected with EBV into SCID mice to construct hu-PBL/SCID chimeric mice. The induced tumors were found in the mediastinum or abdominal cavity of SCID mice. Microscopic observation exhibited tumor cells that were large and had a plasmablastic, centroblastic or immunoblastic-like appearance. Immunophenotyping assays showed the induced tumors were LCA-positive, CD20/CD79a-positive (markers of B cells), and CD3/CD45RO-negative (markers of T cells). A human-specific Alu sequence could be amplified by Alu-PCR. This confirmed that induced tumors were B-cell lymphomas originating from the transplanted human lymphocytes rather than mouse cells. EBER in situ hybridization detected positive signals in the nuclei of the tumor cells. Expression of EBV-encoded LMP1, EBNA-1, and EBNA-2 in the tumors was significantly positive. PCR-based capillary electrophoresis analysis of IgH gene rearrangement revealed a monoclonal peak and single amplification product in all six cases of induced tumors. This indicated that EBV can induce monoclonal proliferation of human B lymphocytes and promotes the development of lymphoma. J. Med. Virol. 88:1804-1813, 2016. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Yang Zhang
- Cancer Research Institute, College of Medicine, University of South China, Chang Sheng Xi Avenue 28, Hengyang, Hunan 421001, P.R. China
| | - Xueqin Peng
- Cancer Research Institute, College of Medicine, University of South China, Chang Sheng Xi Avenue 28, Hengyang, Hunan 421001, P.R. China
| | - Yunlian Tang
- Cancer Research Institute, College of Medicine, University of South China, Chang Sheng Xi Avenue 28, Hengyang, Hunan 421001, P.R. China
| | - Xiaoning Gan
- Cancer Research Institute, College of Medicine, University of South China, Chang Sheng Xi Avenue 28, Hengyang, Hunan 421001, P.R. China
| | - Chengkun Wang
- Cancer Research Institute, College of Medicine, University of South China, Chang Sheng Xi Avenue 28, Hengyang, Hunan 421001, P.R. China
| | - Lu Xie
- Shanghai Center for Bioinformation Technology (SCBIT), Shanghai Academy of Science and Technology, Shanghai 201203, P.R. China
| | - Xiaoli Xie
- Cancer Research Institute, College of Medicine, University of South China, Chang Sheng Xi Avenue 28, Hengyang, Hunan 421001, P.R. China
| | - Runliang Gan
- Cancer Research Institute, College of Medicine, University of South China, Chang Sheng Xi Avenue 28, Hengyang, Hunan 421001, P.R. China
| | - Yimou Wu
- Hunan Provincial Key Laboratory for Special Pathogen Prevention and Control, University of South China, Hunan 421001, P.R. China
| |
Collapse
|
32
|
Caudron-Herger M, Pankert T, Seiler J, Németh A, Voit R, Grummt I, Rippe K. Alu element-containing RNAs maintain nucleolar structure and function. EMBO J 2015; 34:2758-74. [PMID: 26464461 DOI: 10.15252/embj.201591458] [Citation(s) in RCA: 104] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2015] [Accepted: 08/31/2015] [Indexed: 01/05/2023] Open
Abstract
Non-coding RNAs play a key role in organizing the nucleus into functional subcompartments. By combining fluorescence microscopy and RNA deep-sequencing-based analysis, we found that RNA polymerase II transcripts originating from intronic Alu elements (aluRNAs) were enriched in the nucleolus. Antisense-oligo-mediated depletion of aluRNAs or drug-induced inhibition of RNA polymerase II activity disrupted nucleolar structure and impaired RNA polymerase I-dependent transcription of rRNA genes. In contrast, overexpression of a prototypic aluRNA sequence increased both nucleolus size and levels of pre-rRNA, suggesting a functional link between aluRNA, nucleolus integrity and pre-rRNA synthesis. Furthermore, we show that aluRNAs interact with nucleolin and target ectopic genomic loci to the nucleolus. Our study suggests an aluRNA-based mechanism that links RNA polymerase I and II activities and modulates nucleolar structure and rRNA production.
Collapse
Affiliation(s)
- Maïwen Caudron-Herger
- Genome Organization & Function, German Cancer Research Center (DKFZ) Bioquant Center, Heidelberg, Germany
| | - Teresa Pankert
- Genome Organization & Function, German Cancer Research Center (DKFZ) Bioquant Center, Heidelberg, Germany
| | - Jeanette Seiler
- Molecular Biology of the Cell II, German Cancer Research Center, Heidelberg, Germany
| | - Attila Németh
- Department of Biochemistry III, Biochemistry Center Regensburg University of Regensburg, Regensburg, Germany
| | - Renate Voit
- Molecular Biology of the Cell II, German Cancer Research Center, Heidelberg, Germany
| | - Ingrid Grummt
- Molecular Biology of the Cell II, German Cancer Research Center, Heidelberg, Germany
| | - Karsten Rippe
- Genome Organization & Function, German Cancer Research Center (DKFZ) Bioquant Center, Heidelberg, Germany
| |
Collapse
|
33
|
Pizzato M, McCauley SM, Neagu MR, Pertel T, Firrito C, Ziglio S, Dauphin A, Zufferey M, Berthoux L, Luban J. Lv4 Is a Capsid-Specific Antiviral Activity in Human Blood Cells That Restricts Viruses of the SIVMAC/SIVSM/HIV-2 Lineage Prior to Integration. PLoS Pathog 2015; 11:e1005050. [PMID: 26181333 PMCID: PMC4504712 DOI: 10.1371/journal.ppat.1005050] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2014] [Accepted: 06/25/2015] [Indexed: 12/24/2022] Open
Abstract
HIV-2 and SIVMAC are AIDS-causing, zoonotic lentiviruses that jumped to humans and rhesus macaques, respectively, from SIVSM-bearing sooty mangabey monkeys. Cross-species transmission events such as these sometimes necessitate virus adaptation to species-specific, host restriction factors such as TRIM5. Here, a new human restriction activity is described that blocks viruses of the SIVSM/SIVMAC/HIV-2 lineage. Human T, B, and myeloid cell lines, peripheral blood mononuclear cells and dendritic cells were 4 to >100-fold less transducible by VSV G-pseudotyped SIVMAC, HIV-2, or SIVSM than by HIV-1. In contrast, transduction of six epithelial cell lines was equivalent to that by HIV-1. Substitution of HIV-1 CA with the SIVMAC or HIV-2 CA was sufficient to reduce HIV-1 transduction to the level of the respective vectors. Among such CA chimeras there was a general trend such that CAs from epidemic HIV-2 Group A and B isolates were the most infectious on human T cells, CA from a 1° sooty mangabey isolate was the least infectious, and non-epidemic HIV-2 Group D, E, F, and G CAs were in the middle. The CA-specific decrease in infectivity was observed with either HIV-1, HIV-2, ecotropic MLV, or ALV Env pseudotypes, indicating that it was independent of the virus entry pathway. As2O3, a drug that suppresses TRIM5-mediated restriction, increased human blood cell transduction by SIVMAC but not by HIV-1. Nonetheless, elimination of TRIM5 restriction activity did not rescue SIVMAC transduction. Also, in contrast to TRIM5-mediated restriction, the SIVMAC CA-specific block occurred after completion of reverse transcription and the formation of 2-LTR circles, but before establishment of the provirus. Transduction efficiency in heterokaryons generated by fusing epithelial cells with T cells resembled that in the T cells, indicative of a dominant-acting SIVMAC restriction activity in the latter. These results suggest that the nucleus of human blood cells possesses a restriction factor specific for the CA of HIV-2/SIVMAC/SIVSM and that cross-species transmission of SIVSM to human T cells necessitated adaptation of HIV-2 to this putative restriction factor. HIV-1 and HIV-2, the two lentiviruses that cause AIDS in humans, are members of a family of such viruses that infect African primates. HIV-1 is a zoonosis that was transmitted to humans from chimpanzees. HIV-2 was transmitted to humans from sooty mangabey monkeys. In several documented cases of cross-species transmission of lentiviruses it has been shown that replication of the virus in the new host species necessitated that the virus adapt to species-specific antiviral factors in the host. Here we report that human blood cells possess an antiviral activity that exhibits specificity for viruses of the HIV-2/SIVMAC/SIVSM lineage, with restriction being greatest for SIVSM and the least for epidemic HIV-2. Here we show that this dominant-acting, antiviral activity is specific for the capsid and blocks the virus after it enters the nucleus. The evidence suggests that, in order to jump from sooty mangabey monkeys to humans, the capsid of these viruses changed in order to adapt to this antiviral activity. In keeping with the practice concerning anti-lentiviral activities we propose to call this new antiviral activity Lv4.
Collapse
Affiliation(s)
- Massimo Pizzato
- Department of Microbiology and Molecular Medicine, University of Geneva, Geneva, Switzerland
- Center for Integrative Biology, University of Trento, Trento, Italy
| | - Sean Matthew McCauley
- Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
| | - Martha R. Neagu
- Department of Microbiology and Molecular Medicine, University of Geneva, Geneva, Switzerland
| | - Thomas Pertel
- Department of Microbiology and Molecular Medicine, University of Geneva, Geneva, Switzerland
| | - Claudia Firrito
- Department of Microbiology and Molecular Medicine, University of Geneva, Geneva, Switzerland
| | - Serena Ziglio
- Center for Integrative Biology, University of Trento, Trento, Italy
| | - Ann Dauphin
- Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
| | - Madeleine Zufferey
- Department of Microbiology and Molecular Medicine, University of Geneva, Geneva, Switzerland
| | - Lionel Berthoux
- Laboratory of Retrovirology, University of Québec, Trois-Rivières, Quebec, Canada
| | - Jeremy Luban
- Department of Microbiology and Molecular Medicine, University of Geneva, Geneva, Switzerland
- Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
34
|
Lee HE, Ayarpadikannan S, Kim HS. Role of transposable elements in genomic rearrangement, evolution, gene regulation and epigenetics in primates. Genes Genet Syst 2015; 90:245-57. [DOI: 10.1266/ggs.15-00016] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Affiliation(s)
- Hee-Eun Lee
- Department of Biological Sciences, College of Natural Sciences, Pusan National University
- Genetic Engineering Institute, Pusan National University
| | - Selvam Ayarpadikannan
- Department of Biological Sciences, College of Natural Sciences, Pusan National University
| | - Heui-Soo Kim
- Department of Biological Sciences, College of Natural Sciences, Pusan National University
- Genetic Engineering Institute, Pusan National University
| |
Collapse
|
35
|
Lee HE, Eo J, Kim HS. Composition and evolutionary importance of transposable elements in humans and primates. Genes Genomics 2014. [DOI: 10.1007/s13258-014-0249-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
|
36
|
Ottolini B, Hornsby MJ, Abujaber R, MacArthur JAL, Badge RM, Schwarzacher T, Albertson DG, Bevins CL, Solnick JV, Hollox EJ. Evidence of convergent evolution in humans and macaques supports an adaptive role for copy number variation of the β-defensin-2 gene. Genome Biol Evol 2014; 6:3025-38. [PMID: 25349268 PMCID: PMC4255768 DOI: 10.1093/gbe/evu236] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
β-defensins are a family of important peptides of innate immunity, involved in host defense, immunomodulation, reproduction, and pigmentation. Genes encoding β-defensins show evidence of birth-and-death evolution, adaptation by amino acid sequence changes, and extensive copy number variation (CNV) within humans and other species. The role of CNV in the adaptation of β-defensins to new functions remains unclear, as does the adaptive role of CNV in general. Here, we fine-map CNV of a cluster of β-defensins in humans and rhesus macaques. Remarkably, we found that the structure of the CNV is different between primates, with distinct mutational origins and CNV boundaries defined by retroviral long terminal repeat elements. Although the human β-defensin CNV region is 322 kb and encompasses several genes, including β-defensins, a long noncoding RNA gene, and testes-specific zinc-finger transcription factors, the orthologous region in the rhesus macaque shows CNV of a 20-kb region, containing only a single gene, the ortholog of the human β-defensin-2 gene. Despite its independent origins, the range of gene copy numbers in the rhesus macaque is similar to humans. In addition, the rhesus macaque gene has been subject to divergent positive selection at the amino acid level following its initial duplication event between 3 and 9.5 Ma, suggesting adaptation of this gene as the macaque successfully colonized novel environments outside Africa. Therefore, the molecular phenotype of β-defensin-2 CNV has undergone convergent evolution, and this gene shows evidence of adaptation at the amino acid level in rhesus macaques.
Collapse
Affiliation(s)
| | - Michael J Hornsby
- Department of Microbiology and Immunology, University of California Davis School of Medicine
| | - Razan Abujaber
- Department of Genetics, University of Leicester, United Kingdom
| | - Jacqueline A L MacArthur
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco Present address: European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Richard M Badge
- Department of Genetics, University of Leicester, United Kingdom
| | | | - Donna G Albertson
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco Present address: Bluestone Center for Clinical Research, New York University College of Dentistry, New York, New York
| | - Charles L Bevins
- Department of Microbiology and Immunology, University of California Davis School of Medicine
| | - Jay V Solnick
- Department of Microbiology and Immunology, University of California Davis School of Medicine Department of Medicine, Center for Comparative Medicine, and the California National Primate Research Center, University of California
| | - Edward J Hollox
- Department of Genetics, University of Leicester, United Kingdom
| |
Collapse
|
37
|
Dynamic Alu methylation during normal development, aging, and tumorigenesis. BIOMED RESEARCH INTERNATIONAL 2014; 2014:784706. [PMID: 25243180 PMCID: PMC4163490 DOI: 10.1155/2014/784706] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/10/2014] [Accepted: 08/16/2014] [Indexed: 12/15/2022]
Abstract
DNA methylation primarily occurs on CpG dinucleotides and plays an important role in transcriptional regulations during tissue development and cell differentiation. Over 25% of CpG dinucleotides in the human genome reside within Alu elements, the most abundant human repeats. The methylation of Alu elements is an important mechanism to suppress Alu transcription and subsequent retrotransposition. Decades of studies revealed that Alu methylation is highly dynamic during early development and aging. Recently, many environmental factors were shown to have a great impact on Alu methylation. In addition, aberrant Alu methylation has been documented to be an early event in many tumors and Alu methylation levels have been associated with tumor aggressiveness. The assessment of the Alu methylation has become an important approach for early diagnosis and/or prognosis of cancer. This review focuses on the dynamic Alu methylation during development, aging, and tumor genesis. The cause and consequence of Alu methylation changes will be discussed.
Collapse
|
38
|
Inference of transposable element ancestry. PLoS Genet 2014; 10:e1004482. [PMID: 25121584 PMCID: PMC4133154 DOI: 10.1371/journal.pgen.1004482] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2013] [Accepted: 05/16/2014] [Indexed: 01/11/2023] Open
Abstract
Most common methods for inferring transposable element (TE) evolutionary relationships are based on dividing TEs into subfamilies using shared diagnostic nucleotides. Although originally justified based on the “master gene” model of TE evolution, computational and experimental work indicates that many of the subfamilies generated by these methods contain multiple source elements. This implies that subfamily-based methods give an incomplete picture of TE relationships. Studies on selection, functional exaptation, and predictions of horizontal transfer may all be affected. Here, we develop a Bayesian method for inferring TE ancestry that gives the probability that each sequence was replicative, its frequency of replication, and the probability that each extant TE sequence came from each possible ancestral sequence. Applying our method to 986 members of the newly-discovered LAVA family of TEs, we show that there were far more source elements in the history of LAVA expansion than subfamilies identified using the CoSeg subfamily-classification program. We also identify multiple replicative elements in the AluSc subfamily in humans. Our results strongly indicate that a reassessment of subfamily structures is necessary to obtain accurate estimates of mutation processes, phylogenetic relationships and historical times of activity. The most common entities in vertebrate genomes are transposable elements (TEs), DNA sequences that have been repeatedly copied and inserted into new locations throughout the genome. Some TEs have been replicated hundreds of thousands of times, and their ecology and evolutionary history within a genome is thus critical to understanding how genome structure evolves. It was once thought that only a few “master gene” copies could replicate, while the rest were inactive (dead on arrival), but recent computational and laboratory studies have indicated that this is not the case. However, previous methods for reconstructing TE evolutionary history were not designed to solve the problem of determining the ancestral source sequence for large numbers of elements. Here, we present a new method that is. Our method surveys all likely TE ancestors and determines the probability that each modern element arose from each of its plausible ancestors. We applied our method to the gibbon-derived LAVA TE family and to the human AluSc subfamily and inferred many more source elements than indicated by previous methods. This new method will help us better understand TE evolution, including both the impact of sequence on replication and the substitution process after replication.
Collapse
|
39
|
Criscione SW, Zhang Y, Thompson W, Sedivy JM, Neretti N. Transcriptional landscape of repetitive elements in normal and cancer human cells. BMC Genomics 2014; 15:583. [PMID: 25012247 PMCID: PMC4122776 DOI: 10.1186/1471-2164-15-583] [Citation(s) in RCA: 173] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2014] [Accepted: 07/03/2014] [Indexed: 12/11/2022] Open
Abstract
Background Repetitive elements comprise at least 55% of the human genome with more recent estimates as high as two-thirds. Most of these elements are retrotransposons, DNA sequences that can insert copies of themselves into new genomic locations by a “copy and paste” mechanism. These mobile genetic elements play important roles in shaping genomes during evolution, and have been implicated in the etiology of many human diseases. Despite their abundance and diversity, few studies investigated the regulation of endogenous retrotransposons at the genome-wide scale, primarily because of the technical difficulties of uniquely mapping high-throughput sequencing reads to repetitive DNA. Results Here we develop a new computational method called RepEnrich to study genome-wide transcriptional regulation of repetitive elements. We show that many of the Long Terminal Repeat retrotransposons in humans are transcriptionally active in a cell line-specific manner. Cancer cell lines display increased RNA Polymerase II binding to retrotransposons than cell lines derived from normal tissue. Consistent with increased transcriptional activity of retrotransposons in cancer cells we found significantly higher levels of L1 retrotransposon RNA expression in prostate tumors compared to normal-matched controls. Conclusions Our results support increased transcription of retrotransposons in transformed cells, which may explain the somatic retrotransposition events recently reported in several types of cancers. Electronic Supplementary Material Supplementary material is available for this article at 10.1186/1471-2164-15-583 and is accessible for authorized users.
Collapse
Affiliation(s)
| | | | | | | | - Nicola Neretti
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, RI 02912, USA.
| |
Collapse
|
40
|
Maumus F, Quesneville H. Ancestral repeats have shaped epigenome and genome composition for millions of years in Arabidopsis thaliana. Nat Commun 2014; 5:4104. [PMID: 24954583 PMCID: PMC4090718 DOI: 10.1038/ncomms5104] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2013] [Accepted: 05/13/2014] [Indexed: 12/19/2022] Open
Abstract
Little is known about the evolution of repeated sequences over long periods of time. Using two independent approaches, we show that the majority of the repeats found in the Arabidopsis thaliana genome are ancient and likely to derive from the retention of fragments deposited during ancestral bursts that occurred early in the Brassicaceae evolution. We determine that the majority of young repeats are found in pericentromeric domains, while older copies are frequent in the gene-rich regions. Our results further suggest that the DNA methylation of repeats through small RNA-mediated pathways can last over prolonged periods of time. We also illustrate the way repeated sequences are composted by mutations towards genomic dark matter over time, probably driven by the deamination of methylcytosines, which also have an impact on epigenomic landscapes. Overall, we show that the ancient proliferation of repeat families has long-term consequences on A. thaliana biology and genome composition. Repeated sequences are common in genomes, yet little is known about the long-term evolution of repeats in plants. Here, Maumus and Quesneville show that most of the repeated sequences in the model plant, Arabidopsis thaliana, are old and that many small RNAs correspond to old repeats.
Collapse
Affiliation(s)
- Florian Maumus
- INRA, UR1164 URGI-Research Unit in Genomics-Info, INRA de Versailles-Grignon, Route de Saint-Cyr, Versailles 78026, France
| | - Hadi Quesneville
- INRA, UR1164 URGI-Research Unit in Genomics-Info, INRA de Versailles-Grignon, Route de Saint-Cyr, Versailles 78026, France
| |
Collapse
|
41
|
Walters-Conte KB, Johnson DLE, Johnson WE, O’Brien SJ, Pecon-Slattery J. The dynamic proliferation of CanSINEs mirrors the complex evolution of Feliforms. BMC Evol Biol 2014; 14:137. [PMID: 24947429 PMCID: PMC4084570 DOI: 10.1186/1471-2148-14-137] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2014] [Accepted: 06/11/2014] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Repetitive short interspersed elements (SINEs) are retrotransposons ubiquitous in mammalian genomes and are highly informative markers to identify species and phylogenetic associations. Of these, SINEs unique to the order Carnivora (CanSINEs) yield novel insights on genome evolution in domestic dogs and cats, but less is known about their role in related carnivores. In particular, genome-wide assessment of CanSINE evolution has yet to be completed across the Feliformia (cat-like) suborder of Carnivora. Within Feliformia, the cat family Felidae is composed of 37 species and numerous subspecies organized into eight monophyletic lineages that likely arose 10 million years ago. Using the Felidae family as a reference phylogeny, along with representative taxa from other families of Feliformia, the origin, proliferation and evolution of CanSINEs within the suborder were assessed. RESULTS We identified 93 novel intergenic CanSINE loci in Feliformia. Sequence analyses separated Feliform CanSINEs into two subfamilies, each characterized by distinct RNA polymerase binding motifs and phylogenetic associations. Subfamily I CanSINEs arose early within Feliformia but are no longer under active proliferation. Subfamily II loci are more recent, exclusive to Felidae and show evidence for adaptation to extant RNA polymerase activity. Further, presence/absence distributions of CanSINE loci are largely congruent with taxonomic expectations within Feliformia and the less resolved nodes in the Felidae reference phylogeny present equally ambiguous CanSINE data. SINEs are thought to be nearly impervious to excision from the genome. However, we observed a nearly complete excision of a CanSINEs locus in puma (Puma concolor). In addition, we found that CanSINE proliferation in Felidae frequently targeted existing CanSINE loci for insertion sites, resulting in tandem arrays. CONCLUSIONS We demonstrate the existence of at least two SINE families within the Feliformia suborder, one of which is actively involved in insertional mutagenesis. We find SINEs are powerful markers of speciation and conclude that the few inconsistencies with expected patterns of speciation likely represent incomplete lineage sorting, species hybridization and SINE-mediated genome rearrangement.
Collapse
Affiliation(s)
- Kathryn B Walters-Conte
- Department of Biology, American University, 101 Hurst Hall 4440 Massachusetts Ave, Washington, DC 20016, USA
| | - Diana LE Johnson
- Department of Biological Sciences, The George Washington University, 2036 G St, Washington, DC 20009, USA
| | - Warren E Johnson
- Smithsonian Conservation Biology Institute, National Zoological Park, Front Royal, VA 22630, USA
| | - Stephen J O’Brien
- Dobzhansky Center for Genome Bioinformatics, St. Petersburg State University, 41 A, Sredniy Avenue St., Petersburg 199034, Russia
| | - Jill Pecon-Slattery
- Smithsonian Conservation Biology Institute, National Zoological Park, Front Royal, VA 22630, USA
| |
Collapse
|
42
|
McLain AT, Carman GW, Fullerton ML, Beckstrom TO, Gensler W, Meyer TJ, Faulk C, Batzer MA. Analysis of western lowland gorilla (Gorilla gorilla gorilla) specific Alu repeats. Mob DNA 2013; 4:26. [PMID: 24262036 PMCID: PMC4177385 DOI: 10.1186/1759-8753-4-26] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 10/23/2013] [Indexed: 02/07/2023] Open
Abstract
Background Research into great ape genomes has revealed widely divergent activity levels over time for Alu elements. However, the diversity of this mobile element family in the genome of the western lowland gorilla has previously been uncharacterized. Alu elements are primate-specific short interspersed elements that have been used as phylogenetic and population genetic markers for more than two decades. Alu elements are present at high copy number in the genomes of all primates surveyed thus far. The AluY subfamily and its derivatives have been recognized as the evolutionarily youngest Alu subfamily in the Old World primate lineage. Results Here we use a combination of computational and wet-bench laboratory methods to assess and catalog AluY subfamily activity level and composition in the western lowland gorilla genome (gorGor3.1). A total of 1,075 independent AluY insertions were identified and computationally divided into 10 subfamilies, with the largest number of gorilla-specific elements assigned to the canonical AluY subfamily. Conclusions The retrotransposition activity level appears to be significantly lower than that seen in the human and chimpanzee lineages, while higher than that seen in orangutan genomes, indicative of differential Alu amplification in the western lowland gorilla lineage as compared to other Homininae.
Collapse
Affiliation(s)
- Adam T McLain
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA.
| | | | | | | | | | | | | | | |
Collapse
|
43
|
Chin CS, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C, Clum A, Copeland A, Huddleston J, Eichler EE, Turner SW, Korlach J. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods 2013; 10:563-9. [PMID: 23644548 DOI: 10.1038/nmeth.2474] [Citation(s) in RCA: 3121] [Impact Index Per Article: 283.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2013] [Accepted: 04/04/2013] [Indexed: 02/08/2023]
Abstract
We present a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing. Our method uses the longest reads as seeds to recruit all other reads for construction of highly accurate preassembled reads through a directed acyclic graph-based consensus procedure, which we follow with assembly using off-the-shelf long-read assemblers. In contrast to hybrid approaches, HGAP does not require highly accurate raw reads for error correction. We demonstrate efficient genome assembly for several microorganisms using as few as three SMRT Cell zero-mode waveguide arrays of sequencing and for BACs using just one SMRT Cell. Long repeat regions can be successfully resolved with this workflow. We also describe a consensus algorithm that incorporates SMRT sequencing primary quality values to produce de novo genome sequence exceeding 99.999% accuracy.
Collapse
|
44
|
Xu H, Luo X, Qian J, Pang X, Song J, Qian G, Chen J, Chen S. FastUniq: a fast de novo duplicates removal tool for paired short reads. PLoS One 2012; 7:e52249. [PMID: 23284954 PMCID: PMC3527383 DOI: 10.1371/journal.pone.0052249] [Citation(s) in RCA: 357] [Impact Index Per Article: 29.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2012] [Accepted: 11/16/2012] [Indexed: 11/19/2022] Open
Abstract
The presence of duplicates introduced by PCR amplification is a major issue in paired short reads from next-generation sequencing platforms. These duplicates might have a serious impact on research applications, such as scaffolding in whole-genome sequencing and discovering large-scale genome variations, and are usually removed. We present FastUniq as a fast de novo tool for removal of duplicates in paired short reads. FastUniq identifies duplicates by comparing sequences between read pairs and does not require complete genome sequences as prerequisites. FastUniq is capable of simultaneously handling reads with different lengths and results in highly efficient running time, which increases linearly at an average speed of 87 million reads per 10 minutes. FastUniq is freely available at http://sourceforge.net/projects/fastuniq/.
Collapse
Affiliation(s)
- Haibin Xu
- The National Engineering Laboratory for Breeding of Endangered Medicinal Materials, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, People’s Republic of China
| | - Xiang Luo
- The National Engineering Laboratory for Breeding of Endangered Medicinal Materials, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, People’s Republic of China
| | - Jun Qian
- The National Engineering Laboratory for Breeding of Endangered Medicinal Materials, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, People’s Republic of China
| | - Xiaohui Pang
- The National Engineering Laboratory for Breeding of Endangered Medicinal Materials, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, People’s Republic of China
| | - Jingyuan Song
- The National Engineering Laboratory for Breeding of Endangered Medicinal Materials, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, People’s Republic of China
| | - Guangrui Qian
- Department of Geosciences, Stony Brook University, Stony Brook, New York, United States of America
| | - Jinhui Chen
- Key Laboratory of Forest Genetics and Biotechnology, Ministry of Education of China, Nanjing Forestry University, Nanjing, Jiangsu Province, China
- * E-mail: (JHC); (SLC)
| | - Shilin Chen
- The National Engineering Laboratory for Breeding of Endangered Medicinal Materials, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, People’s Republic of China
- * E-mail: (JHC); (SLC)
| |
Collapse
|
45
|
The lemur revolution starts now: the genomic coming of age for a non-model organism. Mol Phylogenet Evol 2012; 66:442-52. [PMID: 22982436 DOI: 10.1016/j.ympev.2012.08.024] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2012] [Revised: 08/24/2012] [Accepted: 08/27/2012] [Indexed: 12/25/2022]
Abstract
Morris Goodman was a revolutionary. Together with a mere handful of like-minded scientists, Morris established himself as a leader in the molecular phylogenetic revolution of the 1960s. The effects of this revolution are most evident in this journal, which he founded in 1992. Happily for lemur biologists, one of Morris Goodman's primary interests was in reconstructing the phylogeny of the primates, including the tooth-combed Lorisifomes of Africa and Asia, and the Lemuriformes of Madagascar (collectively referred to as the suborder Strepsirrhini). This paper traces the development of molecular phylogenetic and evolutionary genetic trends and methods over the 50-year expanse of Morris Goodman's career, particularly as they apply to our understanding of lemuriform phylogeny, biogeography, and biology. Notably, this perspective reveals that the lemuriform genome is sufficiently rich in phylogenetic signal such that the very earliest molecular phylogenetic studies - many of which were conducted by Goodman himself - have been validated by contemporary studies that have exploited advanced computational methods applied to phylogenomic scale data; studies that were beyond imagining in the earliest days of phylogeny reconstruction. Nonetheless, the frontier still beckons. New technologies for gathering and analyzing genomic data will allow investigators to build upon what can now be considered a nearly-known phylogeny of the Lemuriformes in order to ask innovative questions about the evolutionary mechanisms that generate and maintain the extraordinary breadth and depth of biological diversity within this remarkable clade of primates.
Collapse
|
46
|
McLain AT, Meyer TJ, Faulk C, Herke SW, Oldenburg JM, Bourgeois MG, Abshire CF, Roos C, Batzer MA. An alu-based phylogeny of lemurs (infraorder: Lemuriformes). PLoS One 2012; 7:e44035. [PMID: 22937148 PMCID: PMC3429421 DOI: 10.1371/journal.pone.0044035] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2012] [Accepted: 07/31/2012] [Indexed: 11/30/2022] Open
Abstract
LEMURS (INFRAORDER: Lemuriformes) are a radiation of strepsirrhine primates endemic to the island of Madagascar. As of 2012, 101 lemur species, divided among five families, have been described. Genetic and morphological evidence indicates all species are descended from a common ancestor that arrived in Madagascar ∼55-60 million years ago (mya). Phylogenetic relationships in this species-rich infraorder have been the subject of debate. Here we use Alu elements, a family of primate-specific Short INterspersed Elements (SINEs), to construct a phylogeny of infraorder Lemuriformes. Alu elements are particularly useful SINEs for the purpose of phylogeny reconstruction because they are identical by descent and confounding events between loci are easily resolved by sequencing. The genome of the grey mouse lemur (Microcebus murinus) was computationally assayed for synapomorphic Alu elements. Those that were identified as Lemuriformes-specific were analyzed against other available primate genomes for orthologous sequence in which to design primers for PCR (polymerase chain reaction) verification. A primate phylogenetic panel of 24 species, including 22 lemur species from all five families, was examined for the presence/absence of 138 Alu elements via PCR to establish relationships among species. Of these, 111 were phylogenetically informative. A phylogenetic tree was generated based on the results of this analysis. We demonstrate strong support for the monophyly of Lemuriformes to the exclusion of other primates, with Daubentoniidae, the aye-aye, as the basal lineage within the infraorder. Our results also suggest Lepilemuridae as a sister lineage to Cheirogaleidae, and Indriidae as sister to Lemuridae. Among the Cheirogaleidae, we show strong support for Microcebus and Mirza as sister genera, with Cheirogaleus the sister lineage to both. Our results also support the monophyly of the Lemuridae. Within Lemuridae we place Lemur and Hapalemur together to the exclusion of Eulemur and Varecia, with Varecia the sister lineage to the other three genera.
Collapse
Affiliation(s)
- Adam T McLain
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Rawal K, Priya A, Malik A, Bahl R, Ramaswamy R. Distribution of MGEs and their insertion sites in the Macaca mulatta genome. Mob Genet Elements 2012; 2:133-141. [PMID: 23061019 PMCID: PMC3463469 DOI: 10.4161/mge.21074] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Mobile genetic elements (MGEs) are fragments of DNA that can move around within the genome through retrotransposition. These are responsible for various important events such as gene inactivation, transduction, regulation of gene expression and genome expansion. The present work involves the identification and study of the distribution of Alu and L1 retrotransposons in the genome of Macaca mulatta, an extensively used organism in biomedical studies. We also make comparisons with MGE distributions in other primate genomes and study the physicochemical properties of the local DNA structure around the transposon insertion site using ELAN. The present work also includes computational testing of the pre-insertion loci in order to detect unique features based on DNA structure, thermodynamic considerations and protein interaction measures. Although there is significant sequence divergence between the elements of M. mulatta and H. sapiens, their genome wide distribution is very similar; comparing the distribution of L1's in all available X chromosome sequences suggests a common mechanism behind the spread of MGE's in primate genomes.
Collapse
Affiliation(s)
- Kamal Rawal
- Department of Biotechnology; Jaypee Institute of Information Technology; Noida, India
| | - Avantika Priya
- Department of Biotechnology; Jaypee Institute of Information Technology; Noida, India
| | - Aman Malik
- Department of Biotechnology; Jaypee Institute of Information Technology; Noida, India
| | - Radhika Bahl
- Department of Biotechnology; Jaypee Institute of Information Technology; Noida, India
- University of Hyderabad; Central University; Gachibowli, Hyderaba, India
| | - Ram Ramaswamy
- University of Hyderabad; Central University; Gachibowli, Hyderaba, India
| |
Collapse
|
48
|
Orangutan Alu quiescence reveals possible source element: support for ancient backseat drivers. Mob DNA 2012; 3:8. [PMID: 22541534 PMCID: PMC3357318 DOI: 10.1186/1759-8753-3-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2011] [Accepted: 04/30/2012] [Indexed: 01/25/2023] Open
Abstract
Background Sequence analysis of the orangutan genome revealed that recent proliferative activity of Alu elements has been uncharacteristically quiescent in the Pongo (orangutan) lineage, compared with all previously studied primate genomes. With relatively few young polymorphic insertions, the genomic landscape of the orangutan seemed like the ideal place to search for a driver, or source element, of Alu retrotransposition. Results Here we report the identification of a nearly pristine insertion possessing all the known putative hallmarks of a retrotranspositionally competent Alu element. It is located in an intronic sequence of the DGKB gene on chromosome 7 and is highly conserved in Hominidae (the great apes), but absent from Hylobatidae (gibbon and siamang). We provide evidence for the evolution of a lineage-specific subfamily of this shared Alu insertion in orangutans and possibly the lineage leading to humans. In the orangutan genome, this insertion contains three orangutan-specific diagnostic mutations which are characteristic of the youngest polymorphic Alu subfamily, AluYe5b5_Pongo. In the Homininae lineage (human, chimpanzee and gorilla), this insertion has acquired three different mutations which are also found in a single human-specific Alu insertion. Conclusions This seemingly stealth-like amplification, ongoing at a very low rate over millions of years of evolution, suggests that this shared insertion may represent an ancient backseat driver of Alu element expansion.
Collapse
|
49
|
Abstract
Alu elements are primate-specific repeats and comprise 11% of the human genome. They have wide-ranging influences on gene expression. Their contribution to genome evolution, gene regulation and disease is reviewed.
Collapse
|
50
|
de Andrade A, Wang M, Bonaldo MF, Xie H, Soares MB. Genetic and epigenetic variations contributed by Alu retrotransposition. BMC Genomics 2011; 12:617. [PMID: 22185517 PMCID: PMC3272032 DOI: 10.1186/1471-2164-12-617] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2011] [Accepted: 12/20/2011] [Indexed: 12/16/2022] Open
Abstract
Background De novo retrotransposition of Alu elements has been recognized as a major driver for insertion polymorphisms in human populations. In this study, we exploited Alu-anchored bisulfite PCR libraries to identify evolutionarily recent Alu element insertions, and to investigate their genetic and epigenetic variation. Results A total of 327 putatively recent Alu insertions were identified, altogether represented by 1,762 sequence reads. Nearly all such de novo retrotransposition events (316/327) were novel. Forty-seven out of forty-nine randomly selected events, corresponding to nineteen genomic loci, were sequence-verified. Alu element insertions remained hemizygous in one or more individuals in sixteen of the nineteen genomic loci. The Alu elements were found to be enriched for young Alu families with characteristic sequence features, such as the presence of a longer poly(A) tail. In addition, we documented the occurrence of a duplication of the AT-rich target site in their immediate flanking sequences, a hallmark of retrotransposition. Furthermore, we found the sequence motif (TT/AAAA) that is recognized by the ORF2P protein encoded by LINE-1 in their 5'-flanking regions, consistent with the fact that Alu retrotransposition is facilitated by LINE-1 elements. While most of these Alu elements were heavily methylated, we identified an Alu localized 1.5 kb downstream of TOMM5 that exhibited a completely unmethylated left arm. Interestingly, we observed differential methylation of its immediate 5' and 3' flanking CpG dinucleotides, in concordance with the unmethylated and methylated statuses of its internal 5' and 3' sequences, respectively. Importantly, TOMM5's CpG island and the 3 Alu repeats and 1 MIR element localized upstream of this newly inserted Alu were also found to be unmethylated. Methylation analyses of two additional genomic loci revealed no methylation differences in CpG dinucleotides flanking the Alu insertion sites in the two homologous chromosomes, irrespective of the presence or absence of the insertion. Conclusions We anticipate that the combination of methodologies utilized in this study, which included repeat-anchored bisulfite PCR sequencing and the computational analysis pipeline herein reported, will prove invaluable for the generation of genetic and epigenetic variation maps.
Collapse
Affiliation(s)
- Alexandre de Andrade
- Falk Brain Tumor Center, Cancer Biology and Epigenomics Program, Children's Memorial Research Center, Chicago, IL 60614-3394, USA
| | | | | | | | | |
Collapse
|