1
|
Haerter CAG, Blanco DR, Traldi JB, Feldberg E, Margarido VP, Lui RL. Are scattered microsatellites weak chromosomal markers? Guided mapping reveals new insights into Trachelyopterus (Siluriformes: Auchenipteridae) diversity. PLoS One 2023; 18:e0285388. [PMID: 37310952 DOI: 10.1371/journal.pone.0285388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 04/22/2023] [Indexed: 06/15/2023] Open
Abstract
The scattered distribution pattern of microsatellites is a challenging problem in fish cytogenetics. This type of array hinders the identification of useful patterns and the comparison between species, often resulting in over-limited interpretations that only label it as "scattered" or "widely distributed". However, several studies have shown that the distribution pattern of microsatellites is non-random. Thus, here we tested whether a scattered microsatellite could have distinct distribution patterns on homeologous chromosomes of closely related species. The clustered sites of 18S and 5S rDNA, U2 snRNA and H3/H4 histone genes were used as a guide to compare the (GATA)n microsatellite distribution pattern on the homeologous chromosomes of six Trachelyopterus species: T. coriaceus and Trachelyopterus aff. galeatus from the Araguaia River basin; T. striatulus, T. galeatus and T. porosus from the Amazonas River basin; and Trachelyopterus aff. coriaceus from the Paraguay River basin. Most species had similar patterns of the (GATA)n microsatellite in the histone genes and 5S rDNA carriers. However, we have found a chromosomal polymorphism of the (GATA)n sequence in the 18S rDNA carriers of Trachelyopterus galeatus, which is in Hard-Weinberg equilibrium and possibly originated through amplification events; and a chromosome polymorphism in Trachelyopterus aff. galeatus, which combined with an inversion polymorphism of the U2 snRNA in the same chromosome pair resulted in six possible cytotypes, which are in Hardy-Weinberg disequilibrium. Therefore, comparing the distribution pattern on homeologous chromosomes across the species, using gene clusters as a guide to identify it, seems to be an effective way to further the analysis of scattered microsatellites in fish cytogenetics.
Collapse
Affiliation(s)
| | | | - Josiane Baccarin Traldi
- Departamento de Genética, Instituto de Ciências Biológicas, Universidade Federal do Amazonas, Manaus, Brasil
| | | | - Vladimir Pavan Margarido
- Universidade Estadual do Oeste do Paraná, Centro de Ciências Biológicas e da Saúde, Cascavel, Paraná, Brasil
| | - Roberto Laridondo Lui
- Universidade Estadual do Oeste do Paraná, Centro de Ciências Biológicas e da Saúde, Cascavel, Paraná, Brasil
| |
Collapse
|
2
|
van Kruistum H, Nijland R, Reznick DN, Groenen MAM, Megens HJ, Pollux BJA. Parallel Genomic Changes Drive Repeated Evolution of Placentas in Live-Bearing Fish. Mol Biol Evol 2021; 38:2627-2638. [PMID: 33620468 PMCID: PMC8136483 DOI: 10.1093/molbev/msab057] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
The evolutionary origin of complex organs challenges empirical study because most organs evolved hundreds of millions of years ago. The placenta of live-bearing fish in the family Poeciliidae represents a unique opportunity to study the evolutionary origin of complex organs, because in this family a placenta evolved at least nine times independently. It is currently unknown whether this repeated evolution is accompanied by similar, repeated, genomic changes in placental species. Here, we compare whole genomes of 26 poeciliid species representing six out of nine independent origins of placentation. Evolutionary rate analysis revealed that the evolution of the placenta coincides with convergent shifts in the evolutionary rate of 78 protein-coding genes, mainly observed in transporter- and vesicle-located genes. Furthermore, differences in sequence conservation showed that placental evolution coincided with similar changes in 76 noncoding regulatory elements, occurring primarily around genes that regulate development. The unexpected high occurrence of GATA simple repeats in the regulatory elements suggests an important function for GATA repeats in developmental gene regulation. The distinction in molecular evolution observed, with protein-coding parallel changes more often found in metabolic and structural pathways, compared with regulatory change more frequently found in developmental pathways, offers a compelling model for complex trait evolution in general: changing the regulation of otherwise highly conserved developmental genes may allow for the evolution of complex traits.
Collapse
Affiliation(s)
- Henri van Kruistum
- Animal Breeding and Genomics Group, Wageningen University, Wageningen, The Netherlands.,Experimental Zoology Group, Wageningen University, Wageningen, The Netherlands
| | - Reindert Nijland
- Marine Animal Ecology Group, Wageningen University, Wageningen, The Netherlands
| | - David N Reznick
- Department of Biology, University of California, Riverside, CA, USA
| | - Martien A M Groenen
- Animal Breeding and Genomics Group, Wageningen University, Wageningen, The Netherlands
| | - Hendrik-Jan Megens
- Animal Breeding and Genomics Group, Wageningen University, Wageningen, The Netherlands.,Aquaculture and Fisheries Group, Wageningen University, Wageningen, The Netherlands
| | - Bart J A Pollux
- Experimental Zoology Group, Wageningen University, Wageningen, The Netherlands
| |
Collapse
|
3
|
Gharesouran J, Hosseinzadeh H, Ghafouri-Fard S, Taheri M, Rezazadeh M. STRs: Ancient Architectures of the Genome beyond the Sequence. J Mol Neurosci 2021; 71:2441-2455. [PMID: 34056692 DOI: 10.1007/s12031-021-01850-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 04/22/2021] [Indexed: 01/24/2023]
Abstract
Short tandem repeats (STRs) are commonly defined as short runs of repetitive nucleotides, consisting of tandemly repeating 2-6- bp motif units, which are ubiquitously distributed throughout genomes. Functional STRs are polymorphic in the population, and their variations influence gene expression, which subsequently may result in pathogenic phenotypes. To understand STR phenotypic effects and their functional roles, we describe four different mutational mechanisms including the unequal crossing-over model, gene conversion, retrotransposition mechanism and replication slippage. Due to the multi-allelic nature, small length, abundance, high variability, codominant inheritance, nearly neutral evolution, extensive genome coverage and simple assaying of STRs, these markers are widely used in various types of biological research, including population genetics studies, genome mapping, molecular epidemiology, paternity analysis and gene flow studies. In this review, we focus on the current knowledge regarding STR genomic distribution, function, mutation and applications.
Collapse
Affiliation(s)
- Jalal Gharesouran
- Molecular Genetics Division, GMG center, Tabriz, Iran.,Division of Medical Genetics, Tabriz Childrens Hospital, Tabriz University of Medical Sciences, Tabriz, Iran.,Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Hassan Hosseinzadeh
- Molecular Genetics Division, GMG center, Tabriz, Iran.,Division of Medical Genetics, Tabriz Childrens Hospital, Tabriz University of Medical Sciences, Tabriz, Iran.,Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Soudeh Ghafouri-Fard
- Department of Medical Genetics, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Mohammad Taheri
- Skull Base Research Center, Loghman Hakim Hospital, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| | - Maryam Rezazadeh
- Division of Medical Genetics, Tabriz Childrens Hospital, Tabriz University of Medical Sciences, Tabriz, Iran. .,Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran.
| |
Collapse
|
4
|
Bhattacharyya B, Mitra U, Bhattacharyya R. Tandem repeat interval pattern identifies animal taxa. Bioinformatics 2021; 37:2250-2258. [PMID: 33677492 DOI: 10.1093/bioinformatics/btab124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Revised: 12/11/2020] [Accepted: 02/22/2021] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION We discover that maximality of information content among intervals of Tandem Repeats (TRs) in animal genome segregates over taxa such that taxa identification becomes swift and accurate. Successive TRs of a motif occur at intervals over the sequence, forming a trail of TRs of the motif across the genome. We present a method, Tandem Repeat Information Mining (TRIM), that mines 4k number of TR trails of all k length motifs from a whole genome sequence and extracts the information content within intervals of the trails. TRIM vector formed from the ordered set of interval entropies becomes instrumental for genome segregation. RESULTS Reconstruction of correct phylogeny for animals from whole genome sequences proves precision of TRIM. Identification of animal taxa by TRIM vector upon feature selection is the most significant achievement. These suggest Tandem Repeat Interval Pattern (TRIP) is a taxa-specific constitutional characteristic in animal genome. AVAILABILITY Source and executable code of TRIM along with usage manual are made available at https://github.com/BB-BiG/TRIM. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Balaram Bhattacharyya
- Department of Computer and System Sciences, Visva-Bharati University, Santiniketan, 731235
| | - Uddalak Mitra
- Department of Computer and System Sciences, Visva-Bharati University, Santiniketan, 731235
| | | |
Collapse
|
5
|
Li D, Pan S, Zhang H, Fu Y, Peng Z, Zhang L, Peng S, Xu F, Huang H, Shi R, Zheng H, Peng Y, Tan Z. A comprehensive microsatellite landscape of human Y-DNA at kilobase resolution. BMC Genomics 2021; 22:76. [PMID: 33482734 PMCID: PMC7821415 DOI: 10.1186/s12864-021-07389-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 01/13/2021] [Indexed: 12/12/2022] Open
Abstract
Background Though interest in human simple sequence repeats (SSRs) is increasing, little is known about the exact distributional features of numerous SSRs in human Y-DNA at chromosomal level. Herein, totally 540 maps were established, which could clearly display SSR landscape in every bin of 1 k base pairs (Kbp) along the sequenced part of human reference Y-DNA (NC_000024.10), by our developed differential method for improving the existing method to reveal SSR distributional characteristics in large genomic sequences. Results The maps show that SSRs accumulate significantly with forming density peaks in at least 2040 bins of 1 Kbp, which involve different coding, noncoding and intergenic regions of the Y-DNA, and 10 especially high density peaks were reported to associate with biological significances, suggesting that the other hundreds of especially high density peaks might also be biologically significant and worth further analyzing. In contrast, the maps also show that SSRs are extremely sparse in at least 207 bins of 1 Kbp, including many noncoding and intergenic regions of the Y-DNA, which is inconsistent with the widely accepted view that SSRs are mostly rich in these regions, and these sparse distributions are possibly due to powerfully regional selection. Additionally, many regions harbor SSR clusters with same or similar motif in the Y-DNA. Conclusions These 540 maps may provide the important information of clearly position-related SSR distributional features along the human reference Y-DNA for better understanding the genome structures of the Y-DNA. This study may contribute to further exploring the biological significance and distribution law of the huge numbers of SSRs in human Y-DNA. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07389-5.
Collapse
Affiliation(s)
- Douyue Li
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Saichao Pan
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Hongxi Zhang
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Yongzhuo Fu
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Zhuli Peng
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Liang Zhang
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Shan Peng
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Fei Xu
- Department of Mathematics, Wilfrid Laurier University, Waterloo, Ontario, N2L 3C5, Canada
| | - Hanrou Huang
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Ruixue Shi
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Heping Zheng
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Yousong Peng
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Zhongyang Tan
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China.
| |
Collapse
|
6
|
Avvaru AK, Sharma D, Verma A, Mishra RK, Sowpati DT. MSDB: a comprehensive, annotated database of microsatellites. Nucleic Acids Res 2020; 48:D155-D159. [PMID: 31599331 PMCID: PMC6943038 DOI: 10.1093/nar/gkz886] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2019] [Revised: 09/28/2019] [Accepted: 10/01/2019] [Indexed: 11/18/2022] Open
Abstract
Microsatellites are short tandem repeats of 1–6 nucleotide motifs, studied for their utility as genome markers and in forensics. Recent evidence points to the role of microsatellites in important regulatory functions, and their length polymorphisms at coding regions are linked to various neurodegenerative disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and their evolution remains poorly understood. Though other databases of microsatellites exist, they fall short on several fronts. MSDB (MicroSatellite DataBase) is a collection of >4 billion microsatellites from 37 680 genomes presented in a user-friendly web portal for easy, interactive analysis and visualization. This is by far the most comprehensive, annotated, updated database to access and analyze microsatellite data of multiple species. The features of MSDB enable users to explore the data as tables that can be filtered and exported, and also as interactive charts to view and compare the data of multiple species simultaneously. Its modularity and architecture permit seamless updates with new data, making it a powerful tool and useful resource to researchers working on this important class of DNA elements, particularly in context of their evolution and emerging roles in genome organization and gene regulation.
Collapse
Affiliation(s)
- Akshay Kumar Avvaru
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad - 500007, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad - 201002, India
| | - Deepak Sharma
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad - 500007, India
| | - Archana Verma
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad - 500007, India
| | - Rakesh K Mishra
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad - 500007, India
| | - Divya Tej Sowpati
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad - 500007, India
| |
Collapse
|
7
|
Mourad R, Cuvier O. TAD-free analysis of architectural proteins and insulators. Nucleic Acids Res 2019; 46:e27. [PMID: 29272504 PMCID: PMC5861416 DOI: 10.1093/nar/gkx1246] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 12/05/2017] [Indexed: 11/20/2022] Open
Abstract
The three-dimensional (3D) organization of the genome is intimately related to numerous key biological functions including gene expression and DNA replication regulations. The mechanisms by which molecular drivers functionally organize the 3D genome, such as topologically associating domains (TADs), remain to be explored. Current approaches consist in assessing the enrichments or influences of proteins at TAD borders. Here, we propose a TAD-free model to directly estimate the blocking effects of architectural proteins, insulators and DNA motifs on long-range contacts, making the model intuitive and biologically meaningful. In addition, the model allows analyzing the whole Hi-C information content (2D information) instead of only focusing on TAD borders (1D information). The model outperforms multiple logistic regression at TAD borders in terms of parameter estimation accuracy and is validated by enhancer-blocking assays. In Drosophila, the results support the insulating role of simple sequence repeats and suggest that the blocking effects depend on the number of repeats. Motif analysis uncovered the roles of the transcriptional factors pannier and tramtrack in blocking long-range contacts. In human, the results suggest that the blocking effects of the well-known architectural proteins CTCF, cohesin and ZNF143 depend on the distance between loci, where each protein may participate at different scales of the 3D chromatin organization.
Collapse
Affiliation(s)
- Raphaël Mourad
- LBME, Centre de Biologie Intégrative (CBI), Université de Toulouse, CNRS, UPS, 31062 Toulouse, France
| | - Olivier Cuvier
- LBME, Centre de Biologie Intégrative (CBI), Université de Toulouse, CNRS, UPS, 31062 Toulouse, France
| |
Collapse
|
8
|
Kumar RP, Ray S, Home P, Saha B, Bhattacharya B, Wilkins HM, Chavan H, Ganguly A, Milano-Foster J, Paul A, Krishnamurthy P, Swerdlow RH, Paul S. Regulation of energy metabolism during early mammalian development: TEAD4 controls mitochondrial transcription. Development 2018; 145:dev.162644. [PMID: 30201685 DOI: 10.1242/dev.162644] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2017] [Accepted: 08/31/2018] [Indexed: 12/27/2022]
Abstract
Early mammalian development is crucially dependent on the establishment of oxidative energy metabolism within the trophectoderm (TE) lineage. Unlike the inner cell mass, TE cells enhance ATP production via mitochondrial oxidative phosphorylation (OXPHOS) and this metabolic preference is essential for blastocyst maturation. However, molecular mechanisms that regulate establishment of oxidative energy metabolism in TE cells are incompletely understood. Here, we show that conserved transcription factor TEAD4, which is essential for pre-implantation mammalian development, regulates this process by promoting mitochondrial transcription. In developing mouse TE and TE-derived trophoblast stem cells (TSCs), TEAD4 localizes to mitochondria, binds to mitochondrial DNA (mtDNA) and facilitates its transcription by recruiting mitochondrial RNA polymerase (POLRMT). Loss of TEAD4 impairs recruitment of POLRMT, resulting in reduced expression of mtDNA-encoded electron transport chain components, thereby inhibiting oxidative energy metabolism. Our studies identify a novel TEAD4-dependent molecular mechanism that regulates energy metabolism in the TE lineage to ensure mammalian development.
Collapse
Affiliation(s)
- Ram P Kumar
- Department of Pathology and Laboratory Medicine, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Soma Ray
- Department of Pathology and Laboratory Medicine, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Pratik Home
- Department of Pathology and Laboratory Medicine, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Biswarup Saha
- Department of Pathology and Laboratory Medicine, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Bhaswati Bhattacharya
- Department of Pathology and Laboratory Medicine, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Heather M Wilkins
- University of Kansas Alzheimer's Disease Center and the Departments of Neurology, Molecular and Integrative Physiology, and Biochemistry and Molecular Biology, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Hemantkumar Chavan
- Department of Pharmacology, Toxicology and Therapeutics, University of Kansas Medical Center, 3901 Rainbow Boulevard, Kansas City, KS 66160, USA
| | - Avishek Ganguly
- Department of Pathology and Laboratory Medicine, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Jessica Milano-Foster
- Department of Pathology and Laboratory Medicine, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Arindam Paul
- Department of Pathology and Laboratory Medicine, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Partha Krishnamurthy
- Department of Pharmacology, Toxicology and Therapeutics, University of Kansas Medical Center, 3901 Rainbow Boulevard, Kansas City, KS 66160, USA
| | - Russell H Swerdlow
- University of Kansas Alzheimer's Disease Center and the Departments of Neurology, Molecular and Integrative Physiology, and Biochemistry and Molecular Biology, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Soumen Paul
- Department of Pathology and Laboratory Medicine, University of Kansas Medical Center, Kansas City, KS 66160, USA .,Institute of Reproductive Health and Regenerative Medicine, University of Kansas Medical Center, Kansas City, KS 66160, USA
| |
Collapse
|
9
|
Avvaru AK, Saxena S, Sowpati DT, Mishra RK. MSDB: A Comprehensive Database of Simple Sequence Repeats. Genome Biol Evol 2018; 9:1797-1802. [PMID: 28854643 PMCID: PMC5533116 DOI: 10.1093/gbe/evx132] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/12/2017] [Indexed: 11/13/2022] Open
Abstract
Microsatellites, also known as Simple Sequence Repeats (SSRs), are short tandem repeats of 1-6 nt motifs present in all genomes, particularly eukaryotes. Besides their usefulness as genome markers, SSRs have been shown to perform important regulatory functions, and variations in their length at coding regions are linked to several disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and some may be functional. MSDB (Microsatellite Database) is a collection of >650 million SSRs from 6,893 species including Bacteria, Archaea, Fungi, Plants, and Animals. This database is by far the most exhaustive resource to access and analyze SSR data of multiple species. In addition to exploring data in a customizable tabular format, users can view and compare the data of multiple species simultaneously using our interactive plotting system. MSDB is developed using the Django framework and MySQL. It is freely available at http://tdb.ccmb.res.in/msdb.
Collapse
Affiliation(s)
| | - Saketh Saxena
- CSIR - Centre for Cellular and Molecular Biology, Hyderabad, India
| | | | | |
Collapse
|
10
|
Merrick BA, Chang JS, Phadke DP, Bostrom MA, Shah RR, Wang X, Gordon O, Wright GM. HAfTs are novel lncRNA transcripts from aflatoxin exposure. PLoS One 2018; 13:e0190992. [PMID: 29351317 PMCID: PMC5774710 DOI: 10.1371/journal.pone.0190992] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2017] [Accepted: 12/22/2017] [Indexed: 12/28/2022] Open
Abstract
The transcriptome can reveal insights into precancer biology. We recently conducted RNA-Seq analysis on liver RNA from male rats exposed to the carcinogen, aflatoxin B1 (AFB1), for 90 days prior to liver tumor onset. Among >1,000 differentially expressed transcripts, several novel, unannotated Cufflinks-assembled transcripts, or HAfTs (Hepatic Aflatoxin Transcripts) were found. We hypothesized PCR-cloning and RACE (rapid amplification of cDNA ends) could further HAfT identification. Sanger data was obtained for 6 transcripts by PCR and 16 transcripts by 5’- and 3’-RACE. BLAST alignments showed, with two exceptions, HAfT transcripts were lncRNAs, >200nt without apparent long open reading frames. Six rat HAfT transcripts were classified as ‘novel’ without RefSeq annotation. Sequence alignment and genomic synteny showed each rat lncRNA had a homologous locus in the mouse genome and over half had homologous loci in the human genome, including at least two loci (and possibly three others) that were previously unannotated. While HAfT functions are not yet clear, coregulatory roles may be possible from their adjacent orientation to known coding genes with altered expression that include 8 HAfT-gene pairs. For example, a unique rat HAfT, homologous to Pvt1, was adjacent to known genes controlling cell proliferation. Additionally, PCR and RACE Sanger sequencing showed many alternative splice variants and refinements of exon sequences compared to Cufflinks assembled transcripts and gene prediction algorithms. Presence of multiple splice variants and short tandem repeats found in some HAfTs may be consequential for secondary structure, transcriptional regulation, and function. In summary, we report novel, differentially expressed lncRNAs after exposure to the genotoxicant, AFB1, prior to neoplastic lesions. Complete cloning and sequencing of such transcripts could pave the way for a new set of sensitive and early prediction markers for chemical hepatocarcinogens.
Collapse
Affiliation(s)
- B. Alex Merrick
- Biomolecular Screening Branch, Division National Toxicology Program, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, United States of America
- * E-mail:
| | - Justin S. Chang
- Biomolecular Screening Branch, Division National Toxicology Program, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, United States of America
| | - Dhiral P. Phadke
- Sciome, LLC, Research Triangle Park, North Carolina, United States of America
| | - Meredith A. Bostrom
- Genomics Laboratory, David H. Murdock Research Institute, Kannapolis, North Carolina, United State of America
| | - Ruchir R. Shah
- Sciome, LLC, Research Triangle Park, North Carolina, United States of America
| | - Xinguo Wang
- Genomics Laboratory, David H. Murdock Research Institute, Kannapolis, North Carolina, United State of America
| | - Oksana Gordon
- Genomics Laboratory, David H. Murdock Research Institute, Kannapolis, North Carolina, United State of America
| | - Garron M. Wright
- Genomics Laboratory, David H. Murdock Research Institute, Kannapolis, North Carolina, United State of America
| |
Collapse
|
11
|
Bagshaw AT. Functional Mechanisms of Microsatellite DNA in Eukaryotic Genomes. Genome Biol Evol 2017; 9:2428-2443. [PMID: 28957459 PMCID: PMC5622345 DOI: 10.1093/gbe/evx164] [Citation(s) in RCA: 77] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/23/2017] [Indexed: 02/06/2023] Open
Abstract
Microsatellite repeat DNA is best known for its length mutability, which is implicated in several neurological diseases and cancers, and often exploited as a genetic marker. Less well-known is the body of work exploring the widespread and surprisingly diverse functional roles of microsatellites. Recently, emerging evidence includes the finding that normal microsatellite polymorphism contributes substantially to the heritability of human gene expression on a genome-wide scale, calling attention to the task of elucidating the mechanisms involved. At present, these are underexplored, but several themes have emerged. I review evidence demonstrating roles for microsatellites in modulation of transcription factor binding, spacing between promoter elements, enhancers, cytosine methylation, alternative splicing, mRNA stability, selection of transcription start and termination sites, unusual structural conformations, nucleosome positioning and modification, higher order chromatin structure, noncoding RNA, and meiotic recombination hot spots.
Collapse
|
12
|
Pita S, Panzera F, Mora P, Vela J, Cuadrado Á, Sánchez A, Palomeque T, Lorite P. Comparative repeatome analysis on Triatoma infestans Andean and Non-Andean lineages, main vector of Chagas disease. PLoS One 2017; 12:e0181635. [PMID: 28723933 PMCID: PMC5517068 DOI: 10.1371/journal.pone.0181635] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2017] [Accepted: 07/04/2017] [Indexed: 12/13/2022] Open
Abstract
Triatoma infestans is the most important Chagas disease vector in South America. Two main evolutionary lineages, named Andean and non-Andean, have been recognized by geographical distribution, phenetic and genetic characteristics. One of the main differences is the genomic size, varying over 30% in their haploid DNA content. Here we realize a genome wide analysis to compare the repetitive genome fraction (repeatome) between both lineages in order to identify the main repetitive DNA changes occurred during T. infestans differentiation process. RepeatExplorer analysis using Illumina reads showed that both lineages exhibit the same amount of non-repeat sequences, and that satellite DNA is by far the major component of repetitive DNA and the main responsible for the genome size differentiation between both lineages. We characterize 42 satellite DNA families, which are virtually all present in both lineages but with different amount in each lineage. Furthermore, chromosomal location of satellite DNA by fluorescence in situ hybridization showed that genomic variations in T. infestans are mainly due to satellite DNA families located on the heterochromatic regions. The results also show that many satDNA families are located on the euchromatic regions of the chromosomes.
Collapse
Affiliation(s)
- Sebastián Pita
- Sección Genética Evolutiva, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | - Francisco Panzera
- Sección Genética Evolutiva, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | - Pablo Mora
- Departamento de Biología Experimental, Área de Genética, Universidad de Jaén, Jaén, Spain
| | - Jesús Vela
- Departamento de Biología Experimental, Área de Genética, Universidad de Jaén, Jaén, Spain
| | - Ángeles Cuadrado
- Department of Cell Biology and Genetics, University of Alcalá de Henares, Alcalá de Henares, Madrid, Spain
| | - Antonio Sánchez
- Departamento de Biología Experimental, Área de Genética, Universidad de Jaén, Jaén, Spain
| | - Teresa Palomeque
- Departamento de Biología Experimental, Área de Genética, Universidad de Jaén, Jaén, Spain
| | - Pedro Lorite
- Departamento de Biología Experimental, Área de Genética, Universidad de Jaén, Jaén, Spain
| |
Collapse
|
13
|
Krishnan J, Athar F, Rani TS, Mishra RK. Simple sequence repeats showing 'length preference' have regulatory functions in humans. Gene 2017; 628:156-161. [PMID: 28712775 DOI: 10.1016/j.gene.2017.07.022] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2016] [Revised: 05/18/2017] [Accepted: 07/10/2017] [Indexed: 11/15/2022]
Abstract
Simple sequence repeats (SSRs), simple tandem repeats (STRs) or microsatellites are short tandem repeats of 1-6 nucleotide motifs. They are twice as abundant as the protein coding DNA in the human genome and yet little is known about their functional relevance. Analysis of genomes across various taxa show that despite the instability associated with longer stretches of repeats, few SSRs with specific longer repeat lengths are enriched in the genomes indicating a positive selection. This conserved feature of length dependent enrichment hints at not only sequence but also length dependent functionality for SSRs. In the present study, we selected 23 SSRs of the human genome that show specific repeat length dependent enrichment and analysed their cis-regulatory potential using promoter modulation, boundary and barrier assays. We find that the 23 SSR sequences, which are mostly intergenic and intronic, possess distinct cis-regulatory potential. They modulate minimal promoter activity in transient luciferase assays and are capable of functioning as enhancer-blockers and barrier elements. The results of our functional assays propose cis-gene regulatory roles for these specific length enriched SSRs and opens avenues for further investigations.
Collapse
Affiliation(s)
- Jaya Krishnan
- Stowers Institute for Medical Research, MO, United States; International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, India; CSIR-Centre for Cellular and Molecular Biology, Hyderabad, India
| | - Fathima Athar
- Stowers Institute for Medical Research, MO, United States; International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, India; CSIR-Centre for Cellular and Molecular Biology, Hyderabad, India
| | - Tirupaati Swaroopa Rani
- Stowers Institute for Medical Research, MO, United States; International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, India; CSIR-Centre for Cellular and Molecular Biology, Hyderabad, India
| | - Rakesh Kumar Mishra
- Stowers Institute for Medical Research, MO, United States; International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, India; CSIR-Centre for Cellular and Molecular Biology, Hyderabad, India.
| |
Collapse
|
14
|
Case LK, Teuscher C. Y genetic variation and phenotypic diversity in health and disease. Biol Sex Differ 2015; 6:6. [PMID: 25866616 PMCID: PMC4392626 DOI: 10.1186/s13293-015-0024-z] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/15/2014] [Accepted: 02/22/2015] [Indexed: 11/10/2022] Open
Abstract
Sexually dimorphic traits arise through the combined effects of sex hormones and sex chromosomes on sex-biased gene expression, and experimental mouse models have been instrumental in determining their relative contribution in modulating sex differences. A role for the Y chromosome (ChrY) in mediating sex differences outside of development and reproduction has historically been overlooked due to its unusual genetic composition and the predominant testes-specific expression of ChrY-encoded genes. However, ample evidence now exists supporting ChrY as a mediator of other physiological traits in males, and genetic variation in ChrY has been linked to several diseases, including heart disease, cancer, and autoimmune diseases in experimental animal models, as well as humans. The genetic and molecular mechanisms by which ChrY modulates phenotypic variation in males remain unknown but may be a function of copy number variation between homologous X-Y multicopy genes driving differential gene expression. Here, we review the literature identifying an association between ChrY polymorphism and phenotypic variation and present the current evidence depicting the mammalian ChrY as a member of the regulatory genome in males and as a factor influencing paternal parent-of-origin effects in female offspring.
Collapse
Affiliation(s)
- Laure K Case
- Department of Medicine, University of Vermont, 89 Beaumont Ave, Burlington, VT 05405 USA
| | - Cory Teuscher
- Department of Medicine, University of Vermont, 89 Beaumont Ave, Burlington, VT 05405 USA ; Department of Pathology, University of Vermont, 89 Beaumont Ave, Burlington, VT 05405 USA ; University of Vermont, Given Medical Building C317, Burlington, VT 05405 USA
| |
Collapse
|
15
|
Pathak RU, Srinivasan A, Mishra RK. Genome-wide mapping of matrix attachment regions in Drosophila melanogaster. BMC Genomics 2014; 15:1022. [PMID: 25424749 PMCID: PMC4301625 DOI: 10.1186/1471-2164-15-1022] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2014] [Accepted: 11/12/2014] [Indexed: 12/12/2022] Open
Abstract
Background Eukaryotic genome acquires functionality upon proper packaging within the nucleus. This process is facilitated by the structural framework of Nuclear Matrix, a nucleo-proteinaceous meshwork. Matrix Attachment Regions (MARs) in the genome serve as anchoring sites to this framework. Results Here we report direct sequencing of the MAR preparation from Drosophila melanogaster embryos and identify >7350 MARs. This amounts to ~2.5% of the fly genome and often coincide with AT rich non-coding regions. We find significant association of MARs with the origins of replication, transcription start sites, paused RNA Polymerase II sites and exons, but not introns, of highly expressed genes. We also identified sequence motifs and repeats that constitute MARs. Conclusion Our data reveal the contact points of genome to the nuclear architecture and provide a link between nuclear functions and genomic packaging. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-1022) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | - Rakesh K Mishra
- Centre for Cellular and Molecular Biology, Council of Scientific and Industrial Research, Uppal Road, Hyderabad 500 007, India.
| |
Collapse
|
16
|
Ramamoorthy S, Garapati HS, Mishra RK. Length and sequence dependent accumulation of simple sequence repeats in vertebrates: Potential role in genome organization and regulation. Gene 2014; 551:167-75. [DOI: 10.1016/j.gene.2014.08.052] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2014] [Revised: 08/03/2014] [Accepted: 08/25/2014] [Indexed: 10/24/2022]
|
17
|
Abstract
Sex chromosomes are the most dynamic entity in any genome having unique morphology, gene content, and evolution. They have evolved multiple times and independently throughout vertebrate evolution. One of the major genomic changes that pertain to sex chromosomes involves the amplification of common repeats. It is hypothesized that such amplification of repeats facilitates the suppression of recombination, leading to the evolution of heteromorphic sex chromosomes through genetic degradation of Y or W chromosomes. Although contrasting evidence is available, it is clear that amplification of simple repetitive sequences played a major role in the evolution of Y and W chromosomes in vertebrates. In this review, we present a brief overview of the repetitive DNA classes that accumulated during sex chromosome evolution, mainly focusing on vertebrates, and discuss their possible role and potential function in this process.
Collapse
|
18
|
Qin L, Ma Y, Liang P, Tan Z, Li S. Differential distributions of mononucleotide repeat sequences in 256 viral genomes and its potential implications. Gene 2014; 544:159-64. [PMID: 24786215 DOI: 10.1016/j.gene.2014.04.063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2014] [Revised: 04/14/2014] [Accepted: 04/26/2014] [Indexed: 11/18/2022]
Abstract
Mononucleotide repeats (MNRs) have been systematically investigated in the genomes of eukaryotic and prokaryotic organisms. However, detailed information on the distribution of MNRs in viral genomes is limited. In this study, we examined the distributions of MNRs in 256 fully sequenced virus genomes which showed extensive variations across viral genomes, and is significantly influenced by both genome size and CG content. Furthermore, the ratio of the observed to the expected number of MNRs (O/E ratio) appears to be influenced by both the host range and genome type of a particular virus. Additionally, the densities and frequencies of MNRs in genic regions are lower than in non-coding regions, suggesting that selective pressure acts on viral genomes. We also discuss the potential functional roles that these MNR loci could play in virus genomes. To our knowledge, this is the first analysis focusing on MNRs in viruses, and our study could have potential implications for a deeper understanding of virus genome stability and the co-evolution that occurs between a virus and its host.
Collapse
Affiliation(s)
- Lü Qin
- State Key Laboratory of Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China; College of Biology, State Key Laboratory for Chemo/Biosensing and Chemometrics, Hunan University, Changsha 410082, China
| | - Yuxin Ma
- State Key Laboratory of Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Pengbo Liang
- College of Agronomy and Biotechnology, China Agricultural University, Beijing 100094, China
| | - Zhongyang Tan
- State Key Laboratory of Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China; College of Biology, State Key Laboratory for Chemo/Biosensing and Chemometrics, Hunan University, Changsha 410082, China.
| | - Shifang Li
- State Key Laboratory of Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China.
| |
Collapse
|