1
|
Tseng MC, Lee YH, Yen TB, Li SM. Genome-wide characterization of microsatellites in cobia Rachycentron canadum (Linnaeus, 1766): Survey and analysis of their abundance and diversity. JOURNAL OF FISH BIOLOGY 2024; 104:44-55. [PMID: 37658731 DOI: 10.1111/jfb.15552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 08/23/2023] [Accepted: 08/31/2023] [Indexed: 09/05/2023]
Abstract
The cobia Rachycentron canadum, mainly distributed in the warm waters of tropical and subtropical regions around the world, remains a fish of considerable economic importance. Detailed diversity and the number of microsatellite sequences in the cobia genome are still unintelligible. The primary aim of this work was to identify and quantify the miscellaneous SSR sequences in the cobia genome. More than 280,000 sequences were sequenced and screened using next-generation sequencing technology and microsatellite identification. Perfect mononucleotide repeats, dinucleotide microsatellites, and trinucleotide microsatellites contain (A)10 /(T)10 , (AC)6 /(TG)6 , and (AAT)5-32 as the largest number of motifs in each type of microsatellite, respectively. The tetranucleotide and pentanucleotide microsatellites (TTM and PTM) consist of the largest number of motifs of both (ATCT)5-32 and (TCAT)5-31 in TTMs, and (CTCTC)5-9 in PTMs, whereas the hexanucleotide microsatellites are rarely observed in the cobia genome. All c. 38000 sequences of composite microsatellites are extremely diverse, including compound (11.71%), interrupted compound (71.77%), complex (0.45%), and interrupted complex (16.07%). In this study, we developed a convenient and useful recording system for writing down and categorizing diverse composite microsatellite types. This system will provide great support for exploring repeat origins, evolutionary mechanisms, and the application of polymorphic microsatellites.
Collapse
Affiliation(s)
- Mei-Chen Tseng
- Department of Aquaculture, National Pingtung University of Science and Technology, Pingtung 912, Taiwan, R.O.C
| | - Yen-Hung Lee
- Tungkang Aquaculture Research Center, Fisheries Research Institute, MOA, Pingtung 928, Taiwan, R.O.C
| | - Tsair-Bor Yen
- Department of Tropical Agriculture and International Cooperation, National Pingtung University of Science and Technology, Pingtung 912, Taiwan, R.O.C
| | - Shu-Ming Li
- Department of Aquaculture, National Pingtung University of Science and Technology, Pingtung 912, Taiwan, R.O.C
| |
Collapse
|
2
|
Shi Y, Niu Y, Zhang P, Luo H, Liu S, Zhang S, Wang J, Li Y, Liu X, Song T, Xu T, He S. Characterization of genome-wide STR variation in 6487 human genomes. Nat Commun 2023; 14:2092. [PMID: 37045857 PMCID: PMC10097659 DOI: 10.1038/s41467-023-37690-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 03/27/2023] [Indexed: 04/14/2023] Open
Abstract
Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (~31.5x, NyuWa) and 2504 samples from the 1000 Genomes Project (~33.3x, 1KGP). We found that STR mutations were affected by motif length, chromosome context and epigenetic features. We identified 3273 and 1117 pSTRs whose repeat numbers were associated with gene expression and 3'UTR alternative polyadenylation, respectively. We also implemented population analysis, investigated population differentiated signatures, and genotyped 60 known disease-causing STRs. Overall, this study further extends the scale of STR variation in humans and propels our understanding of the semantics of STRs.
Collapse
Affiliation(s)
- Yirong Shi
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yiwei Niu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Peng Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Huaxia Luo
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Shuai Liu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Sijia Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Jiajia Wang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Yanyan Li
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Xinyue Liu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Tingrui Song
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Tao Xu
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China.
- Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan, 250117, Shandong, China.
| | - Shunmin He
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China.
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
3
|
Apsley AT, Domico ER, Verbiest MA, Brogan CA, Buck ER, Burich AJ, Cardone KM, Stone WJ, Anisimova M, Vandenbergh DJ. A novel hypervariable variable number tandem repeat in the dopamine transporter gene ( SLC6A3). Life Sci Alliance 2023; 6:e202201677. [PMID: 36754567 PMCID: PMC9909461 DOI: 10.26508/lsa.202201677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Revised: 01/25/2023] [Accepted: 01/26/2023] [Indexed: 02/10/2023] Open
Abstract
The dopamine transporter gene, SLC6A3, has received substantial attention in genetic association studies of various phenotypes. Although some variable number tandem repeats (VNTRs) present in SLC6A3 have been tested in genetic association studies, results have not been consistent. VNTRs in SLC6A3 that have not been examined genetically were characterized. The Tandem Repeat Annotation Library was used to characterize the VNTRs of 64 unrelated long-read haplotype-phased SLC6A3 sequences. Sequence similarity of each repeat unit of the five VNTRs is reported, along with the correlations of SNP-SNP, SNP-VNTR, and VNTR-VNTR alleles across the gene. One of these VNTRs is a novel hyper-VNTR (hyVNTR) in intron 8 of SLC6A3, which contains a range of 3.4-133.4 repeat copies and has a consensus sequence length of 38 bp, with 82% G+C content. The 38-base repeat was predicted to form G-quadruplexes in silico and was confirmed by circular dichroism spectroscopy. In addition, this hyVNTR contains multiple putative binding sites for PRDM9, which, in combination with low levels of linkage disequilibrium around the hyVNTR, suggests it might be a recombination hotspot.
Collapse
Affiliation(s)
- Abner T Apsley
- Department of Biobehavioral Health, The Pennsylvania State University, State College, PA, USA
- The Molecular, Cellular and Integrative Biosciences Program, The Pennsylvania State University, State College, PA, USA
| | - Emma R Domico
- Department of Biobehavioral Health, The Pennsylvania State University, State College, PA, USA
| | - Max A Verbiest
- Institute of Computational Life Science, School of Life Sciences and Facility Management, Zürich University of Applied Sciences, Wädenswil, Switzerland
- Department of Molecular Life Sciences, Faculty of Science, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Carly A Brogan
- Department of Biobehavioral Health, The Pennsylvania State University, State College, PA, USA
| | - Evan R Buck
- Department of Biobehavioral Health, The Pennsylvania State University, State College, PA, USA
| | - Andrew J Burich
- Department of Information Science and Technologies - Applied Data Sciences, The Pennsylvania State University, State College, PA, USA
| | - Kathleen M Cardone
- Department of Biobehavioral Health, The Pennsylvania State University, State College, PA, USA
| | - Wesley J Stone
- Department of Biobehavioral Health, The Pennsylvania State University, State College, PA, USA
| | - Maria Anisimova
- Institute of Computational Life Science, School of Life Sciences and Facility Management, Zürich University of Applied Sciences, Wädenswil, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - David J Vandenbergh
- Department of Biobehavioral Health, The Pennsylvania State University, State College, PA, USA
- The Molecular, Cellular and Integrative Biosciences Program, The Pennsylvania State University, State College, PA, USA
- Institute of the Neurosciences, The Pennsylvania State University, State College, PA, USA
- The Bioinformatics and Genomics Program, The Pennsylvania State University, State College, PA, USA
| |
Collapse
|
4
|
Boiko SM. Identification of novel SSR markers for predicting the geographic origin of fungus Schizophyllum commune Fr. Fungal Biol 2022; 126:764-774. [PMID: 36517144 DOI: 10.1016/j.funbio.2022.09.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Revised: 09/21/2022] [Accepted: 09/29/2022] [Indexed: 01/07/2023]
Abstract
The fungus Schizophyllum commune is a cosmopolitan basidiomycete, which is popular as an edible, medical mushroom. It causes wood rot and often used as a model object in research. In this study, we analyzed thirty-two genomes of S. commune strains from the NCBI database and designed forty-seven unique SSR DNA markers. The detailed analysis revealed the enrichment of the S. commune genome for CG, GC, CTC, GAG, and TCG motifs. Principal components analysis confirmed the effectiveness of novel SSR DNA markers that preserve the initial heterogeneity of populations. The construction of a network between strains showed single one at a maximum similarity of 38%, and increasing the similarity to 55% breaks the linkage between large groups while separating two new groups containing strains of the population Ru and test cultures S. commune. The amplicons' presence was identified as a sufficient sign of relation of the culture to a specific population. Testing the novel SSR markers allowed to establish a clear delimitation of all groups by geographic location and to differentiate the H4-8 (GCA_000143185.1) strain from the USA population. This research is the basis for the further analysis of S. commune populations and improvement of approaches to determine its genetic diversity.
Collapse
Affiliation(s)
- Sergiy M Boiko
- Department of Phytoecology, Institute for Evolutionary Ecology National Academy of Sciences of Ukraine, 37 Lebedeva Str., 03143, Kyiv, Ukraine.
| |
Collapse
|
5
|
Lei Y, Zhou Y, Price M, Song Z. Genome-wide characterization of microsatellite DNA in fishes: survey and analysis of their abundance and frequency in genome-specific regions. BMC Genomics 2021; 22:421. [PMID: 34098869 PMCID: PMC8186053 DOI: 10.1186/s12864-021-07752-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 05/24/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Microsatellite repeats are ubiquitous in organism genomes and play an important role in the chromatin organization, regulation of gene activity, recombination and DNA replication. Although microsatellite distribution patterns have been studied in most phylogenetic lineages, they are unclear in fish species. RESULTS Here, we present the first systematic examination of microsatellite distribution in coding and non-coding regions of 14 fish genomes. Our study showed that the number and type of microsatellites displayed nonrandom distribution for both intragenic and intergenic regions, suggesting that they have potential roles in transcriptional or translational regulation and DNA replication slippage theories alone were insufficient to explain the distribution patterns. Our results showed that microsatellites are dominant in non-coding regions. The total number of microsatellites ranged from 78,378 to 1,012,084, and the relative density varied from 4925.76 bp/Mb to 25,401.97 bp/Mb. Overall, (A + T)-rich repeats were dominant. The dependence of repeat abundance on the length of the repeated unit (1-6 nt) showed a great similarity decrease, whereas more tri-nucleotide repeats were found in exonic regions than tetra-nucleotide repeats of most species. Moreover, the incidence of different repeated types appeared species- and genomic-specific. These results highlight potential mechanisms for maintaining microsatellite distribution, such as selective forces and mismatch repair systems. CONCLUSIONS Our data could be beneficial for the studies of genome evolution and microsatellite DNA evolutionary dynamics, and facilitate the exploration of microsatellites structural, function, composition mode and molecular markers development in these species.
Collapse
Affiliation(s)
- Yi Lei
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China
| | - Yu Zhou
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China
| | - Megan Price
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China
| | - Zhaobin Song
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China.
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China.
| |
Collapse
|
6
|
Luce L, Abelleyro MM, Carcione M, Mazzanti C, Rossetti L, Radic P, Szijan I, Menazzi S, Francipane L, Nevado J, Lapunzina P, De Brasi C, Giliberto F. Analysis of complex structural variants in the DMD gene in one family. Neuromuscul Disord 2021; 31:253-263. [PMID: 33451931 DOI: 10.1016/j.nmd.2020.11.015] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Revised: 11/26/2020] [Accepted: 11/30/2020] [Indexed: 11/24/2022]
Abstract
This work describes a family with Duchenne muscular dystrophy (DMD) with a rare case of a symptomatic pregnant woman. The main aim was to perform prenatal molecular diagnosis to provide genetic counseling. The secondary aim was to suggest the molecular mechanisms causing the complex structural variant (cxSV) identified. To accomplish this, we used a multi-technique algorithm including segregation analysis, Multiplex Ligation-dependent Probe Amplification, PCR, X-chromosome inactivation studies, microarrays, whole genome sequencing and bioinformatics. We identified a duplication of exons 38-43 in the DMD gene in all affected and obligate carrier members, proving that this was the DMD-causing mutation. We also observed a skewed X-chromosome inactivation in the symptomatic woman that explained her symptomatology. In addition, we identified a cxSV (duplication of exons 38-43 and deletion of exons 45-54) in the affected boy. The molecular characterization and bioinformatic analyses of the breakpoint junctions allowed us to identify Double Strand Breaks stimulator motifs and suggested the replication-dependent Fork Stalling and Template Switching as the most probable mechanisms leading to the duplication. In addition, the de novo deletion might have been the result of a germline inter-chromosome non-allelic recombination involving the Non-Homologous End Joining mechanism. In conclusion, the diagnostic strategy used allowed us to provide accurate molecular diagnosis and genetic counseling. In addition, the familial molecular diagnosis together with the in-depth characterization of the cxSV helped to determine the chronology of the molecular events, and propose and understand the molecular mechanisms involved in the generation of this complex rearrangement.
Collapse
Affiliation(s)
- Leonela Luce
- Facultad de Farmacia y Bioquímica, Departamento de Microbiología, Inmunología, Biotecnología y Genética, Cátedra de Genética, Laboratorio de Distrofinopatías, Universidad de Buenos Aires, Buenos Aires, Argentina; Instituto de Inmunología, Genética y Metabolismo (INIGEM), CONICET-Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Martín M Abelleyro
- CONICET-Academia Nacional de Medicina, Instituto de Medicina Experimental (IMEX), Buenos Aires, Argentina
| | - Micaela Carcione
- Facultad de Farmacia y Bioquímica, Departamento de Microbiología, Inmunología, Biotecnología y Genética, Cátedra de Genética, Laboratorio de Distrofinopatías, Universidad de Buenos Aires, Buenos Aires, Argentina; Instituto de Inmunología, Genética y Metabolismo (INIGEM), CONICET-Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Chiara Mazzanti
- Facultad de Farmacia y Bioquímica, Departamento de Microbiología, Inmunología, Biotecnología y Genética, Cátedra de Genética, Laboratorio de Distrofinopatías, Universidad de Buenos Aires, Buenos Aires, Argentina; Instituto de Inmunología, Genética y Metabolismo (INIGEM), CONICET-Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Liliana Rossetti
- CONICET-Academia Nacional de Medicina, Instituto de Medicina Experimental (IMEX), Buenos Aires, Argentina
| | - Pamela Radic
- CONICET-Academia Nacional de Medicina, Instituto de Medicina Experimental (IMEX), Buenos Aires, Argentina
| | - Irene Szijan
- Facultad de Farmacia y Bioquímica, Departamento de Microbiología, Inmunología, Biotecnología y Genética, Cátedra de Genética, Laboratorio de Distrofinopatías, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Sebastián Menazzi
- Hospital de Clínicas "José de San Martín", División de Genética, Universidad de Buenos Aires, Buenos Aires Argentina
| | - Liliana Francipane
- Hospital de Clínicas "José de San Martín", División de Genética, Universidad de Buenos Aires, Buenos Aires Argentina
| | - Julián Nevado
- Instituto de Genética Médica y Molecular (INGEMM)-IdiPAZ, Hospital Universitario La Paz, Universidad Autónoma, Madrid, Spain; Centro de Investigaciones Biomédicas en Red para Enfermedades Raras (CIBERER), Madrid, Spain; ITHACA-ERN (European Reference Network), La Paz University Hospital, Madrid, Spain
| | - Pablo Lapunzina
- Instituto de Genética Médica y Molecular (INGEMM)-IdiPAZ, Hospital Universitario La Paz, Universidad Autónoma, Madrid, Spain; Centro de Investigaciones Biomédicas en Red para Enfermedades Raras (CIBERER), Madrid, Spain; ITHACA-ERN (European Reference Network), La Paz University Hospital, Madrid, Spain
| | - Carlos De Brasi
- CONICET-Academia Nacional de Medicina, Instituto de Medicina Experimental (IMEX), Buenos Aires, Argentina
| | - Florencia Giliberto
- Facultad de Farmacia y Bioquímica, Departamento de Microbiología, Inmunología, Biotecnología y Genética, Cátedra de Genética, Laboratorio de Distrofinopatías, Universidad de Buenos Aires, Buenos Aires, Argentina; Instituto de Inmunología, Genética y Metabolismo (INIGEM), CONICET-Universidad de Buenos Aires, Buenos Aires, Argentina.
| |
Collapse
|
7
|
Balzano E, Pelliccia F, Giunta S. Genome (in)stability at tandem repeats. Semin Cell Dev Biol 2020; 113:97-112. [PMID: 33109442 DOI: 10.1016/j.semcdb.2020.10.003] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 09/26/2020] [Accepted: 10/10/2020] [Indexed: 12/12/2022]
Abstract
Repeat sequences account for over half of the human genome and represent a significant source of variation that underlies physiological and pathological states. Yet, their study has been hindered due to limitations in short-reads sequencing technology and difficulties in assembly. A important category of repetitive DNA in the human genome is comprised of tandem repeats (TRs), where repetitive units are arranged in a head-to-tail pattern. Compared to other regions of the genome, TRs carry between 10 and 10,000 fold higher mutation rate. There are several mutagenic mechanisms that can give rise to this propensity toward instability, but their precise contribution remains speculative. Given the high degree of homology between these sequences and their arrangement in tandem, once damaged, TRs have an intrinsic propensity to undergo aberrant recombination with non-allelic exchange and generate harmful rearrangements that may undermine the stability of the entire genome. The dynamic mutagenesis at TRs has been found to underlie individual polymorphism associated with neurodegenerative and neuromuscular disorders, as well as complex genetic diseases like cancer and diabetes. Here, we review our current understanding of the surveillance and repair mechanisms operating within these regions, and we describe how alterations in these protective processes can readily trigger mutational signatures found at TRs, ultimately resulting in the pathological correlation between TRs instability and human diseases. Finally, we provide a viewpoint to counter the detrimental effects that TRs pose in light of their selection and conservation, as important drivers of human evolution.
Collapse
Affiliation(s)
- Elisa Balzano
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, 00185 Roma, Italy
| | - Franca Pelliccia
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, 00185 Roma, Italy
| | - Simona Giunta
- The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA; Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, 00185 Roma, Italy.
| |
Collapse
|
8
|
Huang Y, Huang X, Zhou X, Wang J, Zhang R, Ma F, Wang K, Zhang Z, Dai X, Cao X, Zhang C, Han K, Ren Q. Immune activation by a multigene family of lectins with variable tandem repeats in oriental river prawn ( Macrobrachium nipponense). Open Biol 2020; 10:200141. [PMID: 32931720 PMCID: PMC7536079 DOI: 10.1098/rsob.200141] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Genomic regions with repeated sequences are unstable and prone to rapid DNA diversification. However, the role of tandem repeats within the coding region is not fully characterized. Here, we have identified a new hypervariable C-type lectin gene family with different numbers of tandem repeats (Rlecs; R means repeat) in oriental river prawn (Macrobrachium nipponense). Two types of repeat units (33 or 30 bp) are identified in the second exon, and the number of repeat units vary from 1 to 9. Rlecs can be classified into 15 types through phylogenetic analysis. The amino acid sequences in the same type of Rlec are highly conservative outside the repeat regions. The main differences among the Rlec types are evident in exon 5. A variable number of tandem repeats in Rlecs may be produced by slip mispairing during gene replication. Alternative splicing contributes to the multiplicity of forms in this lectin gene family, and different types of Rlecs vary in terms of tissue distribution, expression quantity and response to bacterial challenge. These variations suggest that Rlecs have functional diversity. The results of experiments on sugar binding, microbial inhibition and clearance, regulation of antimicrobial peptide gene expression and prophenoloxidase activation indicate that the function of Rlecs with the motif of YRSKDD in innate immunity is enhanced when the number of tandem repeats increases. Our results suggest that Rlecs undergo gene expansion through gene duplication and alternative splicing, which ultimately leads to functional diversity.
Collapse
Affiliation(s)
- Ying Huang
- College of Marine Science and Engineering, Nanjing Normal University, 1 Wenyuan Road, Nanjing, Jiangsu 210023, People's Republic of China.,College of Oceanography, Hohai University, 1 Xikang Road, Nanjing, Jiangsu 210098, People's Republic of China
| | - Xin Huang
- College of Marine Science and Engineering, Nanjing Normal University, 1 Wenyuan Road, Nanjing, Jiangsu 210023, People's Republic of China
| | - Xuming Zhou
- Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, People's Republic of China
| | - Jialin Wang
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan 430079, People's Republic of China
| | - Ruidong Zhang
- College of Marine Science and Engineering, Nanjing Normal University, 1 Wenyuan Road, Nanjing, Jiangsu 210023, People's Republic of China
| | - Futong Ma
- College of Marine Science and Engineering, Nanjing Normal University, 1 Wenyuan Road, Nanjing, Jiangsu 210023, People's Republic of China
| | - Kaiqiang Wang
- College of Marine Science and Engineering, Nanjing Normal University, 1 Wenyuan Road, Nanjing, Jiangsu 210023, People's Republic of China
| | - Zhuoxing Zhang
- College of Marine Science and Engineering, Nanjing Normal University, 1 Wenyuan Road, Nanjing, Jiangsu 210023, People's Republic of China
| | - Xiaoling Dai
- College of Marine Science and Engineering, Nanjing Normal University, 1 Wenyuan Road, Nanjing, Jiangsu 210023, People's Republic of China
| | - Xueying Cao
- College of Marine Science and Engineering, Nanjing Normal University, 1 Wenyuan Road, Nanjing, Jiangsu 210023, People's Republic of China
| | - Chao Zhang
- College of Marine Science and Engineering, Nanjing Normal University, 1 Wenyuan Road, Nanjing, Jiangsu 210023, People's Republic of China
| | - Keke Han
- College of Marine Science and Engineering, Nanjing Normal University, 1 Wenyuan Road, Nanjing, Jiangsu 210023, People's Republic of China
| | - Qian Ren
- College of Marine Science and Engineering, Nanjing Normal University, 1 Wenyuan Road, Nanjing, Jiangsu 210023, People's Republic of China.,Co-Innovation Center for Marine Bio-Industry Technology of Jiangsu Province, Lianyungang, Jiangsu 222005, People's Republic of China
| |
Collapse
|
9
|
McDew-White M, Li X, Nkhoma SC, Nair S, Cheeseman I, Anderson TJC. Mode and Tempo of Microsatellite Length Change in a Malaria Parasite Mutation Accumulation Experiment. Genome Biol Evol 2020; 11:1971-1985. [PMID: 31273388 PMCID: PMC6644851 DOI: 10.1093/gbe/evz140] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/29/2019] [Indexed: 12/12/2022] Open
Abstract
Malaria parasites have small extremely AT-rich genomes: microsatellite repeats (1–9 bp) comprise 11% of the genome and genetic variation in natural populations is dominated by repeat changes in microsatellites rather than point mutations. This experiment was designed to quantify microsatellite mutation patterns in Plasmodium falciparum. We established 31 parasite cultures derived from a single parasite cell and maintained these for 114–267 days with frequent reductions to a single cell, so parasites accumulated mutations during ∼13,207 cell divisions. We Illumina sequenced the genomes of both progenitor and end-point mutation accumulation (MA) parasite lines in duplicate to validate stringent calling parameters. Microsatellite calls were 99.89% (GATK), 99.99% (freeBayes), and 99.96% (HipSTR) concordant in duplicate sequence runs from independent sequence libraries, whereas introduction of microsatellite mutations into the reference genome revealed a low false negative calling rate (0.68%). We observed 98 microsatellite mutations. We highlight several conclusions: microsatellite mutation rates (3.12 × 10−7 to 2.16 × 10−8/cell division) are associated with both repeat number and repeat motif like other organisms studied. However, 41% of changes resulted from loss or gain of more than one repeat: this was particularly true for long repeat arrays. Unlike other eukaryotes, we found no insertions or deletions that were not associated with repeats or homology regions. Overall, microsatellite mutation rates are among the lowest recorded and comparable to those in another AT-rich protozoan (Dictyostelium). However, a single infection (>1011 parasites) will still contain over 2.16 × 103 to 3.12 × 104 independent mutations at any single microsatellite locus.
Collapse
Affiliation(s)
| | - Xue Li
- Texas Biomedical Research Institute, San Antonio, Texas
| | - Standwell C Nkhoma
- Texas Biomedical Research Institute, San Antonio, Texas.,Malaria Research and Reference Reagent Resource Center (MR4), BEI Resources, American Type Culture Collection, 10801 University Boulevard, Manassas, VA
| | - Shalini Nair
- Texas Biomedical Research Institute, San Antonio, Texas
| | - Ian Cheeseman
- Texas Biomedical Research Institute, San Antonio, Texas
| | | |
Collapse
|
10
|
Heissl A, Betancourt AJ, Hermann P, Povysil G, Arbeithuber B, Futschik A, Ebner T, Tiemann-Boege I. The impact of poly-A microsatellite heterologies in meiotic recombination. Life Sci Alliance 2019; 2:2/2/e201900364. [PMID: 31023833 PMCID: PMC6485458 DOI: 10.26508/lsa.201900364] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 03/27/2019] [Accepted: 03/29/2019] [Indexed: 12/12/2022] Open
Abstract
Meiosis strongly influences the transmission and evolution of heterozygous poly-A repeats as measured experimentally in a large collection of single recombination products in a human hotspot. Meiotic recombination has strong, but poorly understood effects on short tandem repeat (STR) instability. Here, we screened thousands of single recombinant products with sperm typing to characterize the role of polymorphic poly-A repeats at a human recombination hotspot in terms of hotspot activity and STR evolution. We show that the length asymmetry between heterozygous poly-A’s strongly influences the recombination outcome: a heterology of 10 A’s (9A/19A) reduces the number of crossovers and elevates the frequency of non-crossovers, complex recombination products, and long conversion tracts. Moreover, the length of the heterology also influences the STR transmission during meiotic repair with a strong and significant insertion bias for the short heterology (6A/7A) and a deletion bias for the long heterology (9A/19A). In spite of this opposing insertion-/deletion-biased gene conversion, we find that poly-A’s are enriched at human recombination hotspots that could have important consequences in hotspot activation.
Collapse
Affiliation(s)
- Angelika Heissl
- Institute of Biophysics, Johannes Kepler University, Linz, Austria
| | | | - Philipp Hermann
- Institute of Applied Statistics, Johannes Kepler University, Linz, Austria
| | - Gundula Povysil
- Institute of Bioinformatics, Johannes Kepler University, Linz, Austria
| | | | - Andreas Futschik
- Institute of Applied Statistics, Johannes Kepler University, Linz, Austria
| | - Thomas Ebner
- Department of Gynecology, Obstetrics and Gynecological Endocrinology, Kepler University Clinic, Linz, Austria
| | | |
Collapse
|
11
|
Press MO, Hall AN, Morton EA, Queitsch C. Substitutions Are Boring: Some Arguments about Parallel Mutations and High Mutation Rates. Trends Genet 2019; 35:253-264. [PMID: 30797597 PMCID: PMC6435258 DOI: 10.1016/j.tig.2019.01.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2018] [Revised: 12/20/2018] [Accepted: 01/14/2019] [Indexed: 12/31/2022]
Abstract
Extant genomes are largely shaped by global transposition, copy-number fluctuation, and rearrangement of DNA sequences rather than by substitutions of single nucleotides. Although many of these large-scale mutations have low probabilities and are unlikely to repeat, others are recurrent or predictable in their effects, leading to stereotyped genome architectures and genetic variation in both eukaryotes and prokaryotes. Such recurrent, parallel mutation modes can profoundly shape the paths taken by evolution and undermine common models of evolutionary genetics. Similar patterns are also evident at the smaller scales of individual genes or short sequences. The scale and extent of this 'non-substitution' variation has recently come into focus through the advent of new genomic technologies; however, it is still not widely considered in genotype-phenotype association studies. In this review we identify common features of these disparate mutational phenomena and comment on the importance and interpretation of these mutational patterns.
Collapse
Affiliation(s)
| | - Ashley N Hall
- Department of Genome Sciences, University of Washington, Seattle, WA 91895, USA; Department of Molecular and Cellular Biology, University of Washington, Seattle, WA 91895, USA
| | - Elizabeth A Morton
- Department of Genome Sciences, University of Washington, Seattle, WA 91895, USA
| | - Christine Queitsch
- Department of Genome Sciences, University of Washington, Seattle, WA 91895, USA.
| |
Collapse
|