1
|
Unneberg P, Larsson M, Olsson A, Wallerman O, Petri A, Bunikis I, Vinnere Pettersson O, Papetti C, Gislason A, Glenner H, Cartes JE, Blanco-Bercial L, Eriksen E, Meyer B, Wallberg A. Ecological genomics in the Northern krill uncovers loci for local adaptation across ocean basins. Nat Commun 2024; 15:6297. [PMID: 39090106 PMCID: PMC11294593 DOI: 10.1038/s41467-024-50239-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 05/15/2024] [Indexed: 08/04/2024] Open
Abstract
Krill are vital as food for many marine animals but also impacted by global warming. To learn how they and other zooplankton may adapt to a warmer world we studied local adaptation in the widespread Northern krill (Meganyctiphanes norvegica). We assemble and characterize its large genome and compare genome-scale variation among 74 specimens from the colder Atlantic Ocean and warmer Mediterranean Sea. The 19 Gb genome likely evolved through proliferation of retrotransposons, now targeted for inactivation by extensive DNA methylation, and contains many duplicated genes associated with molting and vision. Analysis of 760 million SNPs indicates extensive homogenizing gene-flow among populations. Nevertheless, we detect signatures of adaptive divergence across hundreds of genes, implicated in photoreception, circadian regulation, reproduction and thermal tolerance, indicating polygenic adaptation to light and temperature. The top gene candidate for ecological adaptation was nrf-6, a lipid transporter with a Mediterranean variant that may contribute to early spring reproduction. Such variation could become increasingly important for fitness in Atlantic stocks. Our study underscores the widespread but uneven distribution of adaptive variation, necessitating characterization of genetic variation among natural zooplankton populations to understand their adaptive potential, predict risks and support ocean conservation in the face of climate change.
Collapse
Affiliation(s)
- Per Unneberg
- Department of Cell and Molecular Biology, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Mårten Larsson
- Department of Medical Biochemistry and Microbiology, Uppsala University, Husargatan 3, 751 23, Uppsala, Sweden
- Department of Pharmaceutical Biosciences, Uppsala University, Uppsala, Sweden
| | - Anna Olsson
- Department of Medical Biochemistry and Microbiology, Uppsala University, Husargatan 3, 751 23, Uppsala, Sweden
| | - Ola Wallerman
- Department of Medical Biochemistry and Microbiology, Uppsala University, Husargatan 3, 751 23, Uppsala, Sweden
| | - Anna Petri
- Uppsala Genome Center, Department of Immunology, Genetics and Pathology, Uppsala University, National Genomics Infrastructure hosted by SciLifeLab, Uppsala, Sweden
| | - Ignas Bunikis
- Uppsala Genome Center, Department of Immunology, Genetics and Pathology, Uppsala University, National Genomics Infrastructure hosted by SciLifeLab, Uppsala, Sweden
| | - Olga Vinnere Pettersson
- Uppsala Genome Center, Department of Immunology, Genetics and Pathology, Uppsala University, National Genomics Infrastructure hosted by SciLifeLab, Uppsala, Sweden
| | | | - Astthor Gislason
- Marine and Freshwater Research Institute, Pelagic Division, Reykjavik, Iceland
| | - Henrik Glenner
- Department of Biological Sciences, University of Bergen, Bergen, Norway
- Center for Macroecology, Evolution and Climate Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Joan E Cartes
- Instituto de Ciencias del Mar (ICM-CSIC), Barcelona, Spain
| | | | | | - Bettina Meyer
- Section Polar Biological Oceanography, Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research, Bremerhaven, Germany
- Institute for Chemistry and Biology of the Marine Environment, Carlvon Ossietzky University of Oldenburg, Oldenburg, Germany
- Helmholtz Institute for Functional Marine Biodiversity (HIFMB), University of Oldenburg, Oldenburg, Germany
| | - Andreas Wallberg
- Department of Medical Biochemistry and Microbiology, Uppsala University, Husargatan 3, 751 23, Uppsala, Sweden.
| |
Collapse
|
2
|
Liu Z, Zheng J, Li H, Fang K, Wang S, He J, Zhou D, Weng S, Chi M, Gu Z, He J, Li F, Wang M. Genome assembly of redclaw crayfish (Cherax quadricarinatus) provides insights into its immune adaptation and hypoxia tolerance. BMC Genomics 2024; 25:746. [PMID: 39080519 PMCID: PMC11290268 DOI: 10.1186/s12864-024-10673-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Accepted: 07/29/2024] [Indexed: 08/02/2024] Open
Abstract
BACKGROUND The introduction of non-native species is a primary driver of biodiversity loss in freshwater ecosystems. The redclaw crayfish (Cherax quadricarinatus) is a freshwater species that exhibits tolerance to hypoxic stresses, fluctuating temperatures, high ammonia concentration. These hardy physiological characteristics make C. quadricarinatus a popular aquaculture species and a potential invasive species that can negatively impact tropical and subtropical ecosystems. Investigating the genomic basis of environmental tolerances and immune adaptation in C. quadricarinatus will facilitate the development of management strategies of this potential invasive species. RESULTS We constructed a chromosome-level genome of C. quadricarinatus by integrating Nanopore and PacBio techniques. Comparative genomic analysis suggested that transposable elements and tandem repeats drove genome size evolution in decapod crustaceans. The expansion of nine immune-related gene families contributed to the disease resistance of C. quadricarinatus. Three hypoxia-related genes (KDM3A, KDM5A, HMOX2) were identified as being subjected to positive selection in C. quadricarinatus. Additionally, in vivo analysis revealed that upregulating KDM5A was crucial for hypoxic response in C. quadricarinatus. Knockdown of KDM5A impaired hypoxia tolerance in this species. CONCLUSIONS Our results provide the genomic basis for hypoxic tolerance and immune adaptation in C. quadricarinatus, facilitating the management of this potential invasive species. Additionally, in vivo analysis in C. quadricarinatus suggests that the role of KDM5A in the hypoxic response of animals is complex.
Collapse
Affiliation(s)
- Ziwei Liu
- State Key Laboratory for Biocontrol, School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519000, China
| | - Jianbo Zheng
- Key Laboratory of Genetics and Breeding, Zhejiang Institute of Freshwater Fisheries, Huzhou, 313001, China
| | - Haoyang Li
- China-ASEAN Belt and Road Joint Laboratory On Mariculture Technology, Guangdong Provincial Key Laboratory of Aquatic Economic Animals, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519000, China
| | - Ke Fang
- State Key Laboratory for Biocontrol, School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519000, China
| | - Sheng Wang
- China-ASEAN Belt and Road Joint Laboratory On Mariculture Technology, Guangdong Provincial Key Laboratory of Aquatic Economic Animals, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Jian He
- State Key Laboratory for Biocontrol, School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519000, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519000, China
| | - Dandan Zhou
- State Key Laboratory for Biocontrol, School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519000, China
| | - Shaoping Weng
- China-ASEAN Belt and Road Joint Laboratory On Mariculture Technology, Guangdong Provincial Key Laboratory of Aquatic Economic Animals, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519000, China
| | - Meili Chi
- Key Laboratory of Genetics and Breeding, Zhejiang Institute of Freshwater Fisheries, Huzhou, 313001, China
| | - Zhimin Gu
- Key Laboratory of Genetics and Breeding, Zhejiang Institute of Freshwater Fisheries, Huzhou, 313001, China
- Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021, China
| | - Jianguo He
- State Key Laboratory for Biocontrol, School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519000, China.
- China-ASEAN Belt and Road Joint Laboratory On Mariculture Technology, Guangdong Provincial Key Laboratory of Aquatic Economic Animals, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China.
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519000, China.
| | - Fei Li
- Key Laboratory of Genetics and Breeding, Zhejiang Institute of Freshwater Fisheries, Huzhou, 313001, China.
| | - Muhua Wang
- State Key Laboratory for Biocontrol, School of Marine Sciences, Sun Yat-Sen University, Zhuhai, 519000, China.
- China-ASEAN Belt and Road Joint Laboratory On Mariculture Technology, Guangdong Provincial Key Laboratory of Aquatic Economic Animals, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China.
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519000, China.
| |
Collapse
|
3
|
Dantas CWD, da Costa Neto SR, Alves SIA, da Costa Pinheiro K, De Los Santos EFF, Ramos RTJ. SATIN: a micro and mini satellite mining tool of total genome and coding regions with analysis of perfect repeats polymorphism in coding regions. BMC Bioinformatics 2024; 25:217. [PMID: 38890569 PMCID: PMC11186120 DOI: 10.1186/s12859-024-05842-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 06/12/2024] [Indexed: 06/20/2024] Open
Abstract
BACKGROUND Tandem repeats are specific sequences in genomic DNA repeated in tandem that are present in all organisms. Among the subcategories of TRs we have Satellite repeats, that is divided into macrosatellites, minisatellites, and microsatellites, being the last two of specific interest because they can identify polymorphisms between organisms due to their instability. Currently, most mining tools focus on Simple Sequence Repeats (SSR) mining, and only a few can identify SSRs in the coding regions. RESULTS We developed a microsatellite mining software called SATIN (Micro and Mini SATellite IdentificatioN tool) based on a new sliding window algorithm written in C and Python. It represents a new approach to SSR mining by addressing the limitations of existing tools, particularly in coding region SSR mining. SATIN is available at https://github.com/labgm/SATIN.git . It was shown to be the second fastest for perfect and compound SSR mining. It can identify SSRs from coding regions plus SSRs with motif sizes bigger than 6. Besides the SSR mining, SATIN can also analyze SSRs polymorphism on coding-regions from pre-determined groups, and identify SSRs differentially abundant among them on a per-gene basis. To validate, we analyzed SSRs from two groups of Escherichia coli (K12 and O157) and compared the results with 5 known SSRs from coding regions. SATIN identified all 5 SSRs from 237 genes with at least one SSR on it. CONCLUSIONS The SATIN is a novel microsatellite search software that utilizes an innovative sliding window technique based on a numerical list for repeat region search to identify perfect, and composite SSRs while generating comprehensible and analyzable outputs. It is a tool capable of using files in fasta or GenBank format as input for microsatellite mining, also being able to identify SSRs present in coding regions for GenBank files. In conclusion, we expect SATIN to help identify potential SSRs to be used as genetic markers.
Collapse
Affiliation(s)
| | | | - Sandy Ingrid Aguiar Alves
- Simulation and Computational Biology Laboratory, High Performance Computing Center, Federal University of Pará, Belém, Brazil
| | - Kenny da Costa Pinheiro
- Simulation and Computational Biology Laboratory, High Performance Computing Center, Federal University of Pará, Belém, Brazil
| | | | - Rommel Thiago Jucá Ramos
- Simulation and Computational Biology Laboratory, High Performance Computing Center, Federal University of Pará, Belém, Brazil.
| |
Collapse
|
4
|
Yurchenko A, Pšenička T, Mora P, Ortega JAM, Baca AS, Rovatsos M. Cytogenetic Analysis of Satellitome of Madagascar Leaf-Tailed Geckos. Genes (Basel) 2024; 15:429. [PMID: 38674364 PMCID: PMC11049218 DOI: 10.3390/genes15040429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 03/21/2024] [Accepted: 03/25/2024] [Indexed: 04/28/2024] Open
Abstract
Satellite DNA (satDNA) consists of sequences of DNA that form tandem repetitions across the genome, and it is notorious for its diversity and fast evolutionary rate. Despite its importance, satDNA has been only sporadically studied in reptile lineages. Here, we sequenced genomic DNA and PCR-amplified microdissected W chromosomes on the Illumina platform in order to characterize the monomers of satDNA from the Henkel's leaf-tailed gecko U. henkeli and to compare their topology by in situ hybridization in the karyotypes of the closely related Günther's flat-tail gecko U. guentheri and gold dust day gecko P. laticauda. We identified seventeen different satDNAs; twelve of them seem to accumulate in centromeres, telomeres and/or the W chromosome. Notably, centromeric and telomeric regions seem to share similar types of satDNAs, and we found two that seem to accumulate at both edges of all chromosomes in all three species. We speculate that the long-term stability of all-acrocentric karyotypes in geckos might be explained from the presence of specific satDNAs at the centromeric regions that are strong meiotic drivers, a hypothesis that should be further tested.
Collapse
Affiliation(s)
- Alona Yurchenko
- Department of Ecology, Faculty of Science, Charles University, 128 44 Prague, Czech Republic; (A.Y.); (T.P.)
| | - Tomáš Pšenička
- Department of Ecology, Faculty of Science, Charles University, 128 44 Prague, Czech Republic; (A.Y.); (T.P.)
| | - Pablo Mora
- Department of Experimental Biology, Faculty of Experimental Sciences, University of Jaén, Campus Las Lagunillas s/n, E-23071 Jaen, Spain; (P.M.); (J.A.M.O.); (A.S.B.)
| | - Juan Alberto Marchal Ortega
- Department of Experimental Biology, Faculty of Experimental Sciences, University of Jaén, Campus Las Lagunillas s/n, E-23071 Jaen, Spain; (P.M.); (J.A.M.O.); (A.S.B.)
| | - Antonio Sánchez Baca
- Department of Experimental Biology, Faculty of Experimental Sciences, University of Jaén, Campus Las Lagunillas s/n, E-23071 Jaen, Spain; (P.M.); (J.A.M.O.); (A.S.B.)
| | - Michail Rovatsos
- Department of Ecology, Faculty of Science, Charles University, 128 44 Prague, Czech Republic; (A.Y.); (T.P.)
| |
Collapse
|
5
|
Mokhtar MM, Alsamman AM, El Allali A. MegaSSR: a web server for large scale microsatellite identification, classification, and marker development. FRONTIERS IN PLANT SCIENCE 2023; 14:1219055. [PMID: 38162302 PMCID: PMC10757629 DOI: 10.3389/fpls.2023.1219055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Accepted: 08/18/2023] [Indexed: 01/03/2024]
Abstract
Next-generation sequencing technologies have opened new avenues for using genomic data to study and develop molecular markers and improve genetic resources. Simple Sequence Repeats (SSRs) as genetic markers are increasingly used in molecular diversity and molecular breeding programs that require bioinformatics pipelines to analyze the large amounts of data. Therefore, there is an ongoing need for online tools that provide computational resources with minimal effort and maximum efficiency, including automated development of SSR markers. These tools should be flexible, customizable, and able to handle the ever-increasing amount of genomic data. Here we introduce MegaSSR (https://bioinformatics.um6p.ma/MegaSSR), a web server and a standalone pipeline that enables the design of SSR markers in any target genome. MegaSSR allows users to design targeted PCR-based primers for their selected SSR repeats and includes multiple tools that initiate computational pipelines for SSR mining, classification, comparisons, PCR primer design, in silico PCR validation, and statistical visualization. MegaSSR results can be accessed, searched, downloaded, and visualized with user-friendly web-based tools. These tools provide graphs and tables showing various aspects of SSR markers and corresponding PCR primers. MegaSSR will accelerate ongoing research in plant species and assist breeding programs in their efforts to improve current genomic resources.
Collapse
Affiliation(s)
- Morad M. Mokhtar
- Bioinformatics Laboratory, College of Computing, Mohammed VI Polytechnic University, Benguerir, Morocco
- Agricultural Genetic Engineering Research Institute, Agricultural Research Center, Giza, Egypt
| | - Alsamman M. Alsamman
- Bioinformatics Laboratory, College of Computing, Mohammed VI Polytechnic University, Benguerir, Morocco
- Agricultural Genetic Engineering Research Institute, Agricultural Research Center, Giza, Egypt
- Biotechnology Department, International Center for Agricultural Research in the Dry Areas (ICARDA), Giza, Egypt
| | - Achraf El Allali
- Bioinformatics Laboratory, College of Computing, Mohammed VI Polytechnic University, Benguerir, Morocco
| |
Collapse
|
6
|
Alhawatema M. DendroSSR: SSRs and sequence alignment as tools for building phylogeny trees. BRAZ J BIOL 2023; 83:e275386. [PMID: 37851777 DOI: 10.1590/1519-6984.275386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Accepted: 08/06/2023] [Indexed: 10/20/2023] Open
Abstract
This study introduces a new method to construct phylogenetic trees by combining both of the Simple Sequence Repeats (SSRs) and sequence alignments. The purpose of this work is to present the DendroSSR program and show it via a case study involving diverse Aspergillus species. To show how the DendroSSR program works to resolve complicated species relationships in phylogenetic trees, we employed the Aspergillus species as an example of a research case. The DendroSSR employs a technique containing multiple phases beginning with, detecting SSRs, computing SSRs similarities, sequences alignment, building a distance matrix based on SSRs similarity and sequences alignments, and then hierarchical clustering, and presenting the findings in a dendrogram. Sometimes sequence alignments alone may not give adequate information to generate a phylogenetic tree to resolve complicated species relationships. Therefore, establishing a distance matrix that is formed of addition of SSRs similarity across sequences to the traditional sequence alignment helps the process substantially and resolves the connections of complex species on phylogenetic trees. Additionally, it may be hard to distinguish complex relationships across species when studying conserved sequences, which could lead to an incomplete representation of their evolutionary relationships. These limitations are addressed by DendroSSR, which offers a technique to produce phylogenetic trees by incorporating SSRs similarity across species into the approach of generating phylogenetic trees. As it is known, SSRs are extensively scattered across the genomes of species and exhibit a great variation. Therefore, SSRs may support the knowledge gathered from sequence alignments by providing more information on genetic variation and even evolutionary relationships. The use of DendroSSR analysis might be considered for creating phylogenetic trees as a complementary or secondary strategy among the species under examination in circumstances where traditional phylogenetic analysis fails to clarify the species complex phylogenetic relationships.
Collapse
Affiliation(s)
- M Alhawatema
- Tafila Technical University, Faculty of Science, Department of Applied Biological Science, Tafila, Jordan
| |
Collapse
|
7
|
Westcott PMK, Muyas F, Hauck H, Smith OC, Sacks NJ, Ely ZA, Jaeger AM, Rideout WM, Zhang D, Bhutkar A, Beytagh MC, Canner DA, Jaramillo GC, Bronson RT, Naranjo S, Jin A, Patten JJ, Cruz AM, Shanahan SL, Cortes-Ciriano I, Jacks T. Mismatch repair deficiency is not sufficient to elicit tumor immunogenicity. Nat Genet 2023; 55:1686-1695. [PMID: 37709863 PMCID: PMC10562252 DOI: 10.1038/s41588-023-01499-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Accepted: 08/07/2023] [Indexed: 09/16/2023]
Abstract
DNA mismatch repair deficiency (MMRd) is associated with a high tumor mutational burden (TMB) and sensitivity to immune checkpoint blockade (ICB) therapy. Nevertheless, most MMRd tumors do not durably respond to ICB and critical questions remain about immunosurveillance and TMB in these tumors. In the present study, we developed autochthonous mouse models of MMRd lung and colon cancer. Surprisingly, these models did not display increased T cell infiltration or ICB response, which we showed to be the result of substantial intratumor heterogeneity of mutations. Furthermore, we found that immunosurveillance shapes the clonal architecture but not the overall burden of neoantigens, and T cell responses against subclonal neoantigens are blunted. Finally, we showed that clonal, but not subclonal, neoantigen burden predicts ICB response in clinical trials of MMRd gastric and colorectal cancer. These results provide important context for understanding immune evasion in cancers with a high TMB and have major implications for therapies aimed at increasing TMB.
Collapse
Affiliation(s)
- Peter M K Westcott
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| | - Francesc Muyas
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK
| | - Haley Hauck
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Olivia C Smith
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Nathan J Sacks
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Zackery A Ely
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Alex M Jaeger
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - William M Rideout
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Daniel Zhang
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Arjun Bhutkar
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Mary C Beytagh
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - David A Canner
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Grissel C Jaramillo
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | | | - Santiago Naranjo
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Abbey Jin
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - J J Patten
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Amanda M Cruz
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Sean-Luc Shanahan
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Isidro Cortes-Ciriano
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK.
| | - Tyler Jacks
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Rodent Histopathology Core, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
8
|
Rasoarahona R, Wattanadilokchatkun P, Panthum T, Jaisamut K, Lisachov A, Thong T, Singchat W, Ahmad SF, Han K, Kraichak E, Muangmai N, Koga A, Duengkae P, Antunes A, Srikulnath K. MicrosatNavigator: exploring nonrandom distribution and lineage-specificity of microsatellite repeat motifs on vertebrate sex chromosomes across 186 whole genomes. Chromosome Res 2023; 31:29. [PMID: 37775555 DOI: 10.1007/s10577-023-09738-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 08/11/2023] [Accepted: 09/05/2023] [Indexed: 10/01/2023]
Abstract
Microsatellites are short tandem DNA repeats, ubiquitous in genomes. They are believed to be under selection pressure, considering their high distribution and abundance beyond chance or random accumulation. However, limited analysis of microsatellites in single taxonomic groups makes it challenging to understand their evolutionary significance across taxonomic boundaries. Despite abundant genomic information, microsatellites have been studied in limited contexts and within a few species, warranting an unbiased examination of their genome-wide distribution in distinct versus closely related-clades. Large-scale comparisons have revealed relevant trends, especially in vertebrates. Here, "MicrosatNavigator", a new tool that allows quick and reliable investigation of perfect microsatellites in DNA sequences, was developed. This tool can identify microsatellites across the entire genome sequences. Using this tool, microsatellite repeat motifs were identified in the genome sequences of 186 vertebrates. A significant positive correlation was noted between the abundance, density, length, and GC bias of microsatellites and specific lineages. The (AC)n motif is the most prevalent in vertebrate genomes, showing distinct patterns in closely related species. Longer microsatellites were observed on sex chromosomes in birds and mammals but not on autosomes. Microsatellites on sex chromosomes of non-fish vertebrates have the lowest GC content, whereas high-GC microsatellites (≥ 50 M% GC) are preferred in bony and cartilaginous fishes. Thus, similar selective forces and mutational processes may constrain GC-rich microsatellites to different clades. These findings should facilitate investigations into the roles of microsatellites in sex chromosome differentiation and provide candidate microsatellites for functional analysis across the vertebrate evolutionary spectrum.
Collapse
Affiliation(s)
- Ryan Rasoarahona
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
- Sciences for Industry, Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Pish Wattanadilokchatkun
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Thitipong Panthum
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Kitipong Jaisamut
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Artem Lisachov
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Thanyapat Thong
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Worapong Singchat
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Syed Farhan Ahmad
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Kyudong Han
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
- Department of Microbiology, College of Science & Technology, Dankook University, Cheonan, 31116, Republic of Korea
- Center for Bio-Medical Engineering Core Facility, Dankook University, Cheonan, 31116, Republic of Korea
| | - Ekaphan Kraichak
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
- Department of Botany, Faculty of Science, Kasetsart University, Bangkok, 10900, Thailand
| | - Narongrit Muangmai
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
- Department of Fishery Biology, Faculty of Fisheries, Kasetsart University, Chatuchak, Bangkok, 10900, Thailand
| | - Akihiko Koga
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Prateep Duengkae
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Agostinho Antunes
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros Do Porto de Leixes, Av. General Norton de Matos, S/N, 4450-208, Porto, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, S/N, 4169-007, Porto, Portugal
| | - Kornsorn Srikulnath
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand.
- Sciences for Industry, Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand.
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand.
- Center for Advanced Studies in Tropical Natural Resources, National Research University-Kasetsart University, Kasetsart University, (CASTNAR, NRU-KU, Thailand), Bangkok, 10900, Thailand.
- Center of Excellence on Agricultural Biotechnology (AG-BIO/PERDO-CHE), Bangkok, 10900, Thailand.
| |
Collapse
|
9
|
Lu J, Lu L, Yao F, Fang M, Ma X, Meng J, Shao K. Dicarboxylic Amino Acid Permease 7219 Regulates Fruiting Body Type of Auricularia heimuer. J Fungi (Basel) 2023; 9:876. [PMID: 37754984 PMCID: PMC10532715 DOI: 10.3390/jof9090876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 08/20/2023] [Accepted: 08/22/2023] [Indexed: 09/28/2023] Open
Abstract
Auricularia heimuer is a widely cultivated jelly mushroom. The fruiting bodies are categorized into cluster and chrysanthemum types. With changing consumer demands and the need to reduce bio-waste, the demand for clustered fruiting bodies is increasing. Therefore, gene mining for fruiting body types is a matter of urgency. We determined that the A. heimuer locus for fruiting body type was located at one end of the genetic linkage map. The locus was localized between the markers D23860 and D389 by increasing the density of the genetic linkage map. BlastN alignment showed that the marker SCL-18 was also located between D23860 and D389, and a total of 25 coding genes were annotated within this interval. Through parental transcriptome analysis and qRT-PCR verification, the locus g7219 was identified as the gene controlling the fruiting body type. A single-nucleotide substitution in the TATA box of g7219 was detected between the parents. By PCR amplification of the promoter region of g7219, the TATA-box sequences of the cluster- and chrysanthemum-type strains were found to be CATAAAA and TATAAAA, respectively. This study provides a foundation for the breeding of fruiting body types and strain improvement of A. heimuer.
Collapse
Affiliation(s)
- Jia Lu
- Engineering Research Center of Ministry of Education of China for Food and Medicine, Jilin Agricultural University, Changchun 130118, China; (J.L.); (X.M.); (K.S.)
- Guizhou Key Laboratory of Edible Fungi Breeding, Guizhou Academy of Agricultural Sciences, Guiyang 550006, China
| | - Lixin Lu
- College of Horticulture, Jilin Agricultural University, Changchun 130118, China; (L.L.); (M.F.); (J.M.)
| | - Fangjie Yao
- Engineering Research Center of Ministry of Education of China for Food and Medicine, Jilin Agricultural University, Changchun 130118, China; (J.L.); (X.M.); (K.S.)
- College of Horticulture, Jilin Agricultural University, Changchun 130118, China; (L.L.); (M.F.); (J.M.)
| | - Ming Fang
- College of Horticulture, Jilin Agricultural University, Changchun 130118, China; (L.L.); (M.F.); (J.M.)
| | - Xiaoxu Ma
- Engineering Research Center of Ministry of Education of China for Food and Medicine, Jilin Agricultural University, Changchun 130118, China; (J.L.); (X.M.); (K.S.)
| | - Jingjing Meng
- College of Horticulture, Jilin Agricultural University, Changchun 130118, China; (L.L.); (M.F.); (J.M.)
| | - Kaisheng Shao
- Engineering Research Center of Ministry of Education of China for Food and Medicine, Jilin Agricultural University, Changchun 130118, China; (J.L.); (X.M.); (K.S.)
| |
Collapse
|
10
|
Alves SIA, Ferreira VBC, Dantas CWD, da Silva ALDC, Ramos RTJ. EasySSR: a user-friendly web application with full command-line features for large-scale batch microsatellite mining and samples comparison. Front Genet 2023; 14:1228552. [PMID: 37693309 PMCID: PMC10483286 DOI: 10.3389/fgene.2023.1228552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 07/28/2023] [Indexed: 09/12/2023] Open
Abstract
Microsatellites, also known as SSRs or STRs, are polymorphic DNA regions with tandem repetitions of a nucleotide motif of size 1-6 base pairs with a broad range of applications in many fields, such as comparative genomics, molecular biology, and forensics. However, the majority of researchers do not have computational training and struggle while running command-line tools or very limited web tools for their SSR research, spending a considerable amount of time learning how to execute the software and conducting the post-processing data tabulation in other tools or manually-time that could be used directly in data analysis. We present EasySSR, a user-friendly web tool with command-line full functionality, designed for practical use in batch identifying and comparing SSRs in sequences, draft, or complete genomes, not requiring previous bioinformatic skills to run. EasySSR requires only a FASTA and an optional GENBANK file of one or more genomes to identify and compare STRs. The tool can automatically analyze and compare SSRs in whole genomes, convert GenBank to PTT files, identify perfect and imperfect SSRs and coding and non-coding regions, compare their frequencies, abundancy, motifs, flanking sequences, and iterations, producing many outputs ready for download such as PTT files, interactive charts, and Excel tables, giving the user the data ready for further analysis in minutes. EasySSR was implemented as a web application, which can be executed from any browser and is available for free at https://computationalbiology.ufpa.br/easyssr/. Tutorials, usage notes, and download links to the source code can be found at https://github.com/engbiopct/EasySSR.
Collapse
Affiliation(s)
- Sandy Ingrid Aguiar Alves
- Laboratory of Biological Engineering, Biological Science Institute, Park of Science and Technology, Federal University of Pará, Belém, Brazil
| | - Victor Benedito Costa Ferreira
- Laboratory of Biological Engineering, Biological Science Institute, Park of Science and Technology, Federal University of Pará, Belém, Brazil
| | | | - Artur Luiz da Costa da Silva
- Laboratory of Biological Engineering, Biological Science Institute, Park of Science and Technology, Federal University of Pará, Belém, Brazil
| | - Rommel Thiago Jucá Ramos
- Laboratory of Biological Engineering, Biological Science Institute, Park of Science and Technology, Federal University of Pará, Belém, Brazil
| |
Collapse
|
11
|
Chaudhary S, Singh RK, Kumar P. Genome-wide identification, characterization and primer designing of simple sequence repeats across Leguminosae family. 3 Biotech 2023; 13:286. [PMID: 37520343 PMCID: PMC10382446 DOI: 10.1007/s13205-023-03706-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 07/16/2023] [Indexed: 08/01/2023] Open
Abstract
Legumes are important clade of commercially important family Leguminosae that mainly include medicinal, flowering and edible plants. Although the genomic sequence of legumes is accessible, only the limited number of effective simple sequence repeat markers has been identified by prior research. Additional polymorphic simple sequence repeats marker discovery will aid in the genetics and breeding of legumes. In this study, 13 complete genome sequences were screened for the identification of chromosome-wise simple sequence repeats (SSRs) and 1,866,861 SSRs were identified. Based on the study, it was observed that the number of SSRs in non-coding region was more as compared to coding region and frequency of mononucleotides was highest followed by di-nucleotides while penta- and hexa-nucleotide repeats were least frequent one. The identified genome-wide SSRs and newly developed SSR markers, primers and their mapping will provide a powerful means for genetic researches across Leguminosae plants, including genetic diversity and evolutionary origin analysis, fingerprinting, QTL mapping and marker-assisted selection for breeding as well as comparative genomic analysis studies.
Collapse
Affiliation(s)
- Sakshi Chaudhary
- Dr. A. P. J. Abdul Kalam Technical University, Lucknow, India
- International Centre for Genetic Engineering and Biotechnology, New Delhi, 110067 India
| | - Ravi Kant Singh
- Amity Institute of Biotechnology, Amity University, Noida, UP 201313 India
| | - Pradeep Kumar
- Department of Botany, University of Lucknow, Lucknow, UP 226007 India
| |
Collapse
|
12
|
Zuo B, Nneji LM, Sun YB. Comparative genomics reveals insights into anuran genome size evolution. BMC Genomics 2023; 24:379. [PMID: 37415107 DOI: 10.1186/s12864-023-09499-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 06/30/2023] [Indexed: 07/08/2023] Open
Abstract
BACKGROUND Amphibians, particularly anurans, display an enormous variation in genome size. Due to the unavailability of whole genome datasets in the past, the genomic elements and evolutionary causes of anuran genome size variation are poorly understood. To address this, we analyzed whole-genome sequences of 14 anuran species ranging in size from 1.1 to 6.8 Gb. By annotating multiple genomic elements, we investigated the genomic correlates of anuran genome size variation and further examined whether the genome size relates to habitat types. RESULTS Our results showed that intron expansions or contraction and Transposable Elements (TEs) diversity do not contribute significantly to genome size variation. However, the recent accumulation of transposable elements (TEs) and the lack of deletion of ancient TEs primarily accounted for the evolution of anuran genome sizes. Our study showed that the abundance and density of simple repeat sequences positively correlate with genome size. Ancestral state reconstruction revealed that genome size exhibits a taxon-specific pattern of evolution, with families Bufonidae and Pipidae experiencing extreme genome expansion and contraction events, respectively. Our result showed no relationship between genome size and habitat types, although large genome-sized species are predominantly found in humid habitats. CONCLUSIONS Overall, our study identified the genomic element and their evolutionary dynamics accounting for anuran genome size variation, thus paving a path to a greater understanding of the size evolution of the genome in amphibians.
Collapse
Affiliation(s)
- Bin Zuo
- Ministry of Education Key Laboratory for Transboundary Ecosecurity of Southwest China, Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology, Institute of Biodiversity, School of Ecology and Environmental Science, Yunnan University, Kunming, Yunnan, 650504, China
| | - Lotanna Micah Nneji
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, 08544, USA
| | - Yan-Bo Sun
- Ministry of Education Key Laboratory for Transboundary Ecosecurity of Southwest China, Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology, Institute of Biodiversity, School of Ecology and Environmental Science, Yunnan University, Kunming, Yunnan, 650504, China.
- Laboratory for Conservation and Utilization of Bio-resources, Yunnan University, Kunming, 650091, China.
| |
Collapse
|
13
|
Behboudi R, Nouri-Baygi M, Naghibzadeh M. RPTRF: A rapid perfect tandem repeat finder tool for DNA sequences. Biosystems 2023; 226:104869. [PMID: 36858110 DOI: 10.1016/j.biosystems.2023.104869] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 01/23/2023] [Accepted: 02/23/2023] [Indexed: 03/02/2023]
Abstract
The sequencing of eukaryotic genomes has shown that tandem repeats are abundant in their sequences. In addition to affecting some cellular processes, tandem repeats in the genome may be associated with specific diseases and have been the key to resolving criminal cases. Any tool developed for detecting tandem repeats must be accurate, fast, and useable in thousands of laboratories worldwide, including those with not very advanced computing capabilities. The proposed method, the Rapid Perfect Tandem Repeat Finder (RPTRF), minimizes the need for excess character comparison processing by indexing the input file and significantly helps to accelerate and prepare the output without artifacts by using an interval tree in the filtering section. The experiments demonstrated that the RPTRF is very fast in discovering all perfect tandem repeats of all categories of any genomic sequences. Although the detection of imperfect TRs is not the focus of the RPTRF, comparisons show that it even outperforms some other tools (in five selected gold standards) designed explicitly for this purpose. The implemented tool and how to use it are available on GitHub.
Collapse
Affiliation(s)
- Reza Behboudi
- Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Mostafa Nouri-Baygi
- Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran.
| | - Mahmoud Naghibzadeh
- Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
| |
Collapse
|
14
|
Liu K, Xie N. Pipeline for developing polymorphic microsatellites in species without reference genomes. 3 Biotech 2022; 12:248. [PMID: 36039078 PMCID: PMC9418399 DOI: 10.1007/s13205-022-03313-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Accepted: 08/16/2022] [Indexed: 11/01/2022] Open
Abstract
Microsatellites, also known as simple sequence repeats (SSRs), are the preferred type of marker for many genetic applications. In conjunction with the ongoing development of next-generation sequencing, several bioinformatic tools have been developed for identifying SSRs from genomic or transcriptomic sequences. Although these tools are handy for generating polymorphic SSRs, their application almost always depends on an existing reference genome or self-assembly of the reference genome. With this in mind, we propose a pipeline for developing polymorphic SSRs that may be applied to species without reference genomes. Using a species without a reference genome (black Amur bream; Megalobrama terminalis Richardson, 1846) as a model, our pipeline was able to effectively discover polymorphic SSRs. Under different R parameters of a reference-free single nucleotide polymorphisms (SNPs) caller (ebwt2InDel), a total of 258, 208, 102, and 11 polymorphic SSRs were mined. To quantify the accuracy of the polymorphic SSRs detected using our pipeline, we analyzed 25 SSRs with PCR experiments. All primers were successfully amplified, and most SSRs (23 SSRs, 92%) were polymorphic. From the 36 individual black Amur bream, we acquired an average of 3.36 alleles per locus, ranging from one to 11. This demonstrates the effectiveness of our pipeline in identifying polymorphic SSRs and designing primers for SSR genotyping. Ultimately, our pipeline can effectively mine polymorphic SSRs for species without reference genomes, complementing SSR mining approaches based on reference genomes and helping to resolve biological issues that accompany these methods. Supplementary Information The online version contains supplementary material available at 10.1007/s13205-022-03313-0.
Collapse
Affiliation(s)
- Kai Liu
- Institute of Fishery Science, Hangzhou Academy of Agricultural Sciences, Hangzhou, Zhejiang China
| | - Nan Xie
- Institute of Fishery Science, Hangzhou Academy of Agricultural Sciences, Hangzhou, Zhejiang China
| |
Collapse
|
15
|
Jia S, Chen W, Zhao G, Wang S, Xu Z, Li Q. Characterization of the complete chloroplast genome of the medicinal herb Veronica polita Fr. 1819 (Lamiales: Plantaginaceae). Mitochondrial DNA B Resour 2022; 7:1078-1080. [PMID: 35801139 PMCID: PMC9255118 DOI: 10.1080/23802359.2022.2086074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
Veronica polita Fr. 1819 (synonym: Veronica didyma Ten. 1981) is a species of annual herb with high medicinal value. It is originally from Southwest Asia, but has been naturalized widely in many regions of the world. In this study, the complete chloroplast genome of V. polita was determined to be 150,191 bp long with a typical quadripartite structure, comprising two inverted repeat regions (IRa and IRb, 25,465 bp each), a large single-copy (LSC) region (81,847 bp) and a small single-copy (SSC) region (17,414 bp). It encodes a panel of 114 genes (including 79 protein-coding, 31 tRNA, and four rRNA genes) with 18 of them being completely or partially duplicated and 19 of them possessing one or two introns. Phylogenetic analysis supported the tribal-level taxonomy of the family Plantaginaceae, and revealed that V. polita was most closely related to the congener Veronica persica Poir. 1808.
Collapse
Affiliation(s)
- Shouning Jia
- Qinghai Provincial Hospital of Traditional Chinese Medicine, Xining, People's Republic of China
| | - Wenjuan Chen
- Qinghai Provincial Hospital of Traditional Chinese Medicine, Xining, People's Republic of China
| | - Guofu Zhao
- Qinghai Provincial Hospital of Traditional Chinese Medicine, Xining, People's Republic of China
| | - Shuangxi Wang
- Qinghai Provincial Hospital of Traditional Chinese Medicine, Xining, People's Republic of China
| | - Zhiwei Xu
- Qinghai Provincial Hospital of Traditional Chinese Medicine, Xining, People's Republic of China
| | - Qien Li
- Tibetan Medicine Research Center, Tibetan Medical College, Qinghai University, Xining, People's Republic of China
| |
Collapse
|
16
|
Genome-Wide Survey and Development of the First Microsatellite Markers Database ( AnCorDB) in Anemone coronaria L. Int J Mol Sci 2022; 23:ijms23063126. [PMID: 35328546 PMCID: PMC8949970 DOI: 10.3390/ijms23063126] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2022] [Revised: 03/08/2022] [Accepted: 03/10/2022] [Indexed: 12/31/2022] Open
Abstract
Anemone coronaria L. (2n = 2x = 16) is a perennial, allogamous, highly heterozygous plant marketed as a cut flower or in gardens. Due to its large genome size, limited efforts have been made in order to develop species-specific molecular markers. We obtained the first draft genome of the species by Illumina sequencing an androgenetic haploid plant of the commercial line “MISTRAL® Magenta”. The genome assembly was obtained by applying the MEGAHIT pipeline and consisted of 2 × 106 scaffolds. The SciRoKo SSR (Simple Sequence Repeats)-search module identified 401.822 perfect and 188.987 imperfect microsatellites motifs. Following, we developed a user-friendly “Anemone coronaria Microsatellite DataBase” (AnCorDB), which incorporates the Primer3 script, making it possible to design couples of primers for downstream application of the identified SSR markers. Eight genotypes belonging to eight cultivars were used to validate 62 SSRs and a subset of markers was applied for fingerprinting each cultivar, as well as to assess their intra-cultivar variability. The newly developed microsatellite markers will find application in Breeding Rights disputes, developing genetic maps, marker assisted breeding (MAS) strategies, as well as phylogenetic studies.
Collapse
|
17
|
Different Roles of Introgression on the Demographic Change in Two Snakebark Maples, Acer caudatifolium and A. morrisonense, with Contrasted Postglacial Expansion Routes. PLANTS 2022; 11:plants11050644. [PMID: 35270114 PMCID: PMC8912722 DOI: 10.3390/plants11050644] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 02/22/2022] [Accepted: 02/23/2022] [Indexed: 11/16/2022]
Abstract
Hybridization frequently occurs in plant species. With repeated backcross, the introgression may influence evolutionary trajectories through the entry of foreign genes. However, the genetic admixture via hybridization events is often confused with the ancestral polymorphism, especially in closely related species that have experienced similar evolutionary events. In Taiwan, two independent-originated endemic snakebark maples have contrasted postglacial range expansion routes: northward and upward expansion in Acer caudatifolium and downward expansion in A. morrisonense. The range expansion causes the current parapatric distribution, increasing the possibility of introgression. This study elucidates how their genetic variation reflects introgression and historical demography. With 17 EST-SSR markers among the intensely sampled 657 individuals, we confirmed that the genetic admixture between species mainly was attributed to recent introgression instead of common ancestral polymorphism. The secondary contact scenario inferred by approximate Bayesian computation suggested that A. morrisonense received more genetic variations from A. caudatifolium. Introgression occurred in colonized Taiwan around the early Last Glacial Period. Furthermore, the demography of A. caudatifolium was more severely affected by introgression than A. morrisonense, especially in the wavefront populations with high altitude range expansion, implying an altitude-related adaptive introgression. In contrast, A. morrisonense exhibited ubiquitous introgression independent of postglacial expansion, suggesting that introgression in A. morrisonense was neutral. In terms of different genetic consequences, introgression had different demographic impacts on species with different altitude expansion directions even under the same climate-change conditions within an island.
Collapse
|
18
|
Tien NQD, Ma X, Man LQ, Chi DTK, Huy NX, Nhut DT, Rombauts S, Ut T, Loc NH. De novo whole-genome assembly and discovery of genes involved in triterpenoid saponin biosynthesis of Vietnamese ginseng ( Panax vietnamensis Ha et Grushv.). PHYSIOLOGY AND MOLECULAR BIOLOGY OF PLANTS : AN INTERNATIONAL JOURNAL OF FUNCTIONAL PLANT BIOLOGY 2021; 27:2215-2229. [PMID: 34744362 PMCID: PMC8526660 DOI: 10.1007/s12298-021-01076-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Revised: 09/13/2021] [Accepted: 09/17/2021] [Indexed: 05/04/2023]
Abstract
UNLABELLED Vietnamese ginseng (Panax vietnamensis Ha et Grushv.), also known as Ngoc Linh ginseng, is a high-value herb in Vietnam. Vietnamese ginseng has been proven to be effective in enhancing the immune system, human memory, anti-stress, anti-inflammatory, anti-cancer, and prevent aging. The present study reports the first draft whole-genome of Vietnamese ginseng and the identification of potential genes involved in the triterpenoid metabolic pathway. De novo whole-genome assembly was performed successfully from a data of approximately 139 Gbps of 394,802,120 high quality reads to generate 9815 scaffolds with an N50 value of 572,722 bp from the leaf of Vietnamese ginseng. The assembled genome of Vietnamese ginseng is 3,001,967,204 bp long containing 79,374 gene models. Among them, there are 55,012 genes (69.30%) were annotated by various public molecular biology databases. The potential genes involved in triterpenoid saponin biosynthesis in Vietnamese ginseng and their metabolic pathway were also predicted." Three genes encoding squalene monooxygenase isozymes in Vietnamese ginseng were cloned, sequenced and characterized. Moreover, expression levels of several key genes involved in terpenoid biosynthesis in different parts of Vietnamese ginseng were also analyzed. The SSR markers were detected by various programs from both of assembly full dataset of Vietnamese ginseng genome and predicted genes. The present work provided important data of the draft whole-genome of Vietnamese ginseng for further studies to understand the role of genes involved in ginsenoside biosynthesis and their metabolic pathway at the molecular level of this rare medicinal species. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s12298-021-01076-1.
Collapse
Affiliation(s)
- Nguyen Quang Duc Tien
- Bioactive Compound Institute, University of Sciences, Hue University, Hue, 530000 Vietnam
- Department of Biology, Bioactive Compound Institute, University of Sciences, Hue University, Hue, 530000 Vietnam
| | - Xiao Ma
- VIB-UGent Center for Plant Systems Biology, Ghent University, 9000 Ghent, Belgium
| | - Le Quang Man
- Bioactive Compound Institute, University of Sciences, Hue University, Hue, 530000 Vietnam
| | - Duong Thi Kim Chi
- Bioactive Compound Institute, University of Sciences, Hue University, Hue, 530000 Vietnam
| | | | - Duong-Tan Nhut
- Tay Nguyen Institute of Scientific Research, Vietnam Academy of Science and Technology, Dalat, 670000 Vietnam
| | - Stephane Rombauts
- VIB-UGent Center for Plant Systems Biology, Ghent University, 9000 Ghent, Belgium
| | - Tran Ut
- Ngoc Linh Ginseng and Medicinal Materials Development Center, Quang Nam Quang Ngai, 51000 Vietnam
| | - Nguyen Hoang Loc
- Bioactive Compound Institute, University of Sciences, Hue University, Hue, 530000 Vietnam
- Department of Biology, Bioactive Compound Institute, University of Sciences, Hue University, Hue, 530000 Vietnam
| |
Collapse
|
19
|
Barchi L, Rabanus‐Wallace MT, Prohens J, Toppino L, Padmarasu S, Portis E, Rotino GL, Stein N, Lanteri S, Giuliano G. Improved genome assembly and pan-genome provide key insights into eggplant domestication and breeding. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 107:579-596. [PMID: 33964091 PMCID: PMC8453987 DOI: 10.1111/tpj.15313] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 04/30/2021] [Accepted: 05/04/2021] [Indexed: 05/20/2023]
Abstract
Eggplant (Solanum melongena L.) is an important horticultural crop and one of the most widely grown vegetables from the Solanaceae family. It was domesticated from a wild, prickly progenitor carrying small, round, non-anthocyanic fruits. We obtained a novel, highly contiguous genome assembly of the eggplant '67/3' reference line, by Hi-C retrofitting of a previously released short read- and optical mapping-based assembly. The sizes of the 12 chromosomes and the fraction of anchored genes in the improved assembly were comparable to those of a chromosome-level assembly. We resequenced 23 accessions of S. melongena representative of the worldwide phenotypic, geographic, and genetic diversity of the species, and one each from the closely related species Solanum insanum and Solanum incanum. The eggplant pan-genome contained approximately 51.5 additional megabases and 816 additional genes compared with the reference genome, while the pan-plastome showed little genetic variation. We identified 53 selective sweeps related to fruit color, prickliness, and fruit shape in the nuclear genome, highlighting selection leading to the emergence of present-day S. melongena cultivars from its wild ancestors. Candidate genes underlying the selective sweeps included a MYBL1 repressor and CHALCONE ISOMERASE (for fruit color), homologs of Arabidopsis GLABRA1 and GLABROUS INFLORESCENCE STEMS2 (for prickliness), and orthologs of tomato FW2.2, OVATE, LOCULE NUMBER/WUSCHEL, SUPPRESSOR OF OVATE, and CELL SIZE REGULATOR (for fruit size/shape), further suggesting that selection for the latter trait relied on a common set of orthologous genes in tomato and eggplant.
Collapse
Affiliation(s)
- Lorenzo Barchi
- DISAFA – Plant geneticsUniversity of TurinGrugliasco (TO)10095Italy
| | | | - Jaime Prohens
- COMAVUniversitat Politècnica de ValènciaCamino de Vera 14Valencia46022Spain
| | - Laura Toppino
- CREA Research Centre for Genomics and BioinformaticsVia Paullese 28Montanaso LombardoLO26836Italy
| | - Sudharsan Padmarasu
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)Corrensstr. 3Seeland06466Germany
| | - Ezio Portis
- DISAFA – Plant geneticsUniversity of TurinGrugliasco (TO)10095Italy
| | - Giuseppe Leonardo Rotino
- CREA Research Centre for Genomics and BioinformaticsVia Paullese 28Montanaso LombardoLO26836Italy
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)Corrensstr. 3Seeland06466Germany
- Department of Crop SciencesCenter for Integrated Breeding Research (CiBreed)Georg‐August‐UniversityVon Siebold Str. 8Göttingen37075Germany
| | - Sergio Lanteri
- DISAFA – Plant geneticsUniversity of TurinGrugliasco (TO)10095Italy
| | | |
Collapse
|
20
|
Dai A, Wang Y, Greenberg A, Liufu Z, Tang T. Rapid Evolution of Autosomal Binding Sites of the Dosage Compensation Complex in Drosophila melanogaster and Its Association With Transcription Divergence. Front Genet 2021; 12:675027. [PMID: 34194473 PMCID: PMC8238462 DOI: 10.3389/fgene.2021.675027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 05/19/2021] [Indexed: 11/25/2022] Open
Abstract
How pleiotropy influences evolution of protein sequence remains unclear. The male-specific lethal (MSL) complex in Drosophila mediates dosage compensation by 2-fold upregulation of the X chromosome in males. Nevertheless, several MSL proteins also bind autosomes and likely perform functions not related to dosage compensation. Here, we study the evolution of MOF, MSL1, and MSL2 biding sites in Drosophila melanogaster and its close relative Drosophila simulans. We found pervasive expansion of the MSL binding sites in D. melanogaster, particularly on autosomes. The majority of these newly-bound regions are unlikely to function in dosage compensation and associated with an increase in expression divergence between D. melanogaster and D. simulans. While dosage-compensation related sites show clear signatures of adaptive evolution, these signatures are even more marked among autosomal regions. Our study points to an intriguing avenue of investigation of pleiotropy as a mechanism promoting rapid protein sequence evolution.
Collapse
Affiliation(s)
- Aimei Dai
- State Key Laboratory of Biocontrol and Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Yushuai Wang
- State Key Laboratory of Biocontrol and Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | | | - Zhongqi Liufu
- State Key Laboratory of Biocontrol and Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Tian Tang
- State Key Laboratory of Biocontrol and Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
21
|
Lei Y, Zhou Y, Price M, Song Z. Genome-wide characterization of microsatellite DNA in fishes: survey and analysis of their abundance and frequency in genome-specific regions. BMC Genomics 2021; 22:421. [PMID: 34098869 PMCID: PMC8186053 DOI: 10.1186/s12864-021-07752-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 05/24/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Microsatellite repeats are ubiquitous in organism genomes and play an important role in the chromatin organization, regulation of gene activity, recombination and DNA replication. Although microsatellite distribution patterns have been studied in most phylogenetic lineages, they are unclear in fish species. RESULTS Here, we present the first systematic examination of microsatellite distribution in coding and non-coding regions of 14 fish genomes. Our study showed that the number and type of microsatellites displayed nonrandom distribution for both intragenic and intergenic regions, suggesting that they have potential roles in transcriptional or translational regulation and DNA replication slippage theories alone were insufficient to explain the distribution patterns. Our results showed that microsatellites are dominant in non-coding regions. The total number of microsatellites ranged from 78,378 to 1,012,084, and the relative density varied from 4925.76 bp/Mb to 25,401.97 bp/Mb. Overall, (A + T)-rich repeats were dominant. The dependence of repeat abundance on the length of the repeated unit (1-6 nt) showed a great similarity decrease, whereas more tri-nucleotide repeats were found in exonic regions than tetra-nucleotide repeats of most species. Moreover, the incidence of different repeated types appeared species- and genomic-specific. These results highlight potential mechanisms for maintaining microsatellite distribution, such as selective forces and mismatch repair systems. CONCLUSIONS Our data could be beneficial for the studies of genome evolution and microsatellite DNA evolutionary dynamics, and facilitate the exploration of microsatellites structural, function, composition mode and molecular markers development in these species.
Collapse
Affiliation(s)
- Yi Lei
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China
| | - Yu Zhou
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China
| | - Megan Price
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China
| | - Zhaobin Song
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China.
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, People's Republic of China.
| |
Collapse
|
22
|
Microsatellites as Agents of Adaptive Change: An RNA-Seq-Based Comparative Study of Transcriptomes from Five Helianthus Species. Symmetry (Basel) 2021. [DOI: 10.3390/sym13060933] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Mutations that provide environment-dependent selective advantages drive adaptive divergence among species. Many phenotypic differences among related species are more likely to result from gene expression divergence rather than from non-synonymous mutations. In this regard, cis-regulatory mutations play an important part in generating functionally significant variation. Some proposed mechanisms that explore the role of cis-regulatory mutations in gene expression divergence involve microsatellites. Microsatellites exhibit high mutation rates achieved through symmetric or asymmetric mutation processes and are abundant in both coding and non-coding regions in positions that could influence gene function and products. Here we tested the hypothesis that microsatellites contribute to gene expression divergence among species with 50 individuals from five closely related Helianthus species using an RNA-seq approach. Differential expression analyses of the transcriptomes revealed that genes containing microsatellites in non-coding regions (UTRs and introns) are more likely to be differentially expressed among species when compared to genes with microsatellites in the coding regions and transcripts lacking microsatellites. We detected a greater proportion of shared microsatellites in 5′UTRs and coding regions compared to 3′UTRs and non-coding transcripts among Helianthus spp. Furthermore, allele frequency differences measured by pairwise FST at single nucleotide polymorphisms (SNPs), indicate greater genetic divergence in transcripts containing microsatellites compared to those lacking microsatellites. A gene ontology (GO) analysis revealed that microsatellite-containing differentially expressed genes are significantly enriched for GO terms associated with regulation of transcription and transcription factor activity. Collectively, our study provides compelling evidence to support the role of microsatellites in gene expression divergence.
Collapse
|
23
|
Yuan J, Zhang X, Li F, Xiang J. Genome Sequencing and Assembly Strategies and a Comparative Analysis of the Genomic Characteristics in Penaeid Shrimp Species. Front Genet 2021; 12:658619. [PMID: 34012463 PMCID: PMC8126689 DOI: 10.3389/fgene.2021.658619] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 03/17/2021] [Indexed: 01/27/2023] Open
Abstract
Penaeid shrimp (family Penaeidae) represents one of the most economically and ecologically important groups of crustaceans. However, their genome sequencing and assembly have encountered extreme difficulties during the last 20 years. In this study, based on our previous genomic data, we investigated the genomic characteristics of four penaeid shrimp species and identified potential factors that result in their poor genome assembly, including heterozygosity, polyploidization, and repeats. Genome sequencing and comparison of somatic cells (diploid) of the four shrimp species and a single sperm cell (haploid) of Litopenaeus vannamei identified a common bimodal distribution of K-mer depths, suggesting either high heterozygosity or abundant homo-duplicated sequences present in their genomes. However, penaeids have not undergone whole-genome duplication as indicated by a series of approaches. Besides, the remarkable expansion of simple sequence repeats was another outstanding character of penaeid genomes, which also made the genome assembly highly fragmented. Due to this situation, we tried to assemble the genome of penaeid shrimp using various genome sequencing and assembly strategies and compared the quality. Therefore, this study provides new insights about the genomic characteristics of penaeid shrimps while improving their genome assemblies.
Collapse
Affiliation(s)
- Jianbo Yuan
- CAS Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, China
| | - Xiaojun Zhang
- CAS Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, China
| | - Fuhua Li
- CAS Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, China
| | - Jianhai Xiang
- CAS Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, China
| |
Collapse
|
24
|
Simple sequence repeats drive genome plasticity and promote adaptive evolution in penaeid shrimp. Commun Biol 2021; 4:186. [PMID: 33574498 PMCID: PMC7878876 DOI: 10.1038/s42003-021-01716-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Accepted: 01/18/2021] [Indexed: 12/13/2022] Open
Abstract
Simple sequence repeats (SSRs) are rare (approximately 1%) in most genomes and are generally considered to have no function. However, penaeid shrimp genomes have a high proportion of SSRs (>23%), raising the question of whether these SSRs play important functional and evolutionary roles in these SSR-rich species. Here, we show that SSRs drive genome plasticity and adaptive evolution in two penaeid shrimp species, Fenneropenaeus chinensis and Litopenaeus vannamei. Assembly and comparison of genomes of these two shrimp species at the chromosome-level revealed that transposable elements serve as carriers for SSR expansion, which is still occurring. The remarkable genome plasticity identified herein might have been shaped by significant SSR expansions. SSRs were also found to regulate gene expression by multi-omics analyses, and be responsible for driving adaptive evolution, such as the variable osmoregulatory capacities of these shrimp under low-salinity stress. These data provide strong evidence that SSRs are an important driver of the adaptive evolution in penaeid shrimp.
Collapse
|
25
|
Mokhtar MM, Atia MAM. SSRome: an integrated database and pipelines for exploring microsatellites in all organisms. Nucleic Acids Res 2020; 47:D244-D252. [PMID: 30365025 PMCID: PMC6323889 DOI: 10.1093/nar/gky998] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Accepted: 10/14/2018] [Indexed: 11/23/2022] Open
Abstract
Over the past decade, many databases focusing on microsatellite mining on a genomic scale were released online with at least one of the following major deficiencies: (i) lacking the classification of microsatellites as genic or non-genic, (ii) not comparing microsatellite motifs at both genic and non-genic levels in order to identify unique motifs for each class or (iii) missing SSR marker development. In this study, we have developed ‘SSRome’ as a web-based, user-friendly, comprehensive and dynamic database with pipelines for exploring microsatellites in 6533 organisms. In the SSRome database, 158 million microsatellite motifs are identified across all taxa, in addition to all the mitochondrial and chloroplast genomes and expressed sequence tags available from NCBI. Moreover, 45.1 million microsatellite markers were developed and classified as genic or non-genic. All the stored motif and marker datasets can be downloaded freely. In addition, SSRome provides three user-friendly tools to identify, classify and compare motifs on either a genome- or transcriptome-wide scale. With the implementation of PHP, HTML and JavaScript, users can upload their data for analysis via a user-friendly GUI. SSRome represents a powerful database and mega-tool that will assist researchers in developing and dissecting microsatellite markers on a high-throughput scale.
Collapse
Affiliation(s)
- Morad M Mokhtar
- Molecular Genetics and Genome Mapping Laboratory, Genome Mapping Department, Agricultural Genetic Engineering Research Institute (AGERI), ARC, Giza, 12619, Egypt
| | - Mohamed A M Atia
- Molecular Genetics and Genome Mapping Laboratory, Genome Mapping Department, Agricultural Genetic Engineering Research Institute (AGERI), ARC, Giza, 12619, Egypt
| |
Collapse
|
26
|
Ranathunge C, Wheeler GL, Chimahusky ME, Perkins AD, Pramod S, Welch ME. Transcribed microsatellite allele lengths are often correlated with gene expression in natural sunflower populations. Mol Ecol 2020; 29:1704-1716. [PMID: 32285554 DOI: 10.1111/mec.15440] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Revised: 03/15/2020] [Accepted: 04/02/2020] [Indexed: 12/23/2022]
Abstract
Microsatellites are common in genomes of most eukaryotic species. Due to their high mutability, an adaptive role for microsatellites has been considered. However, little is known concerning the contribution of microsatellites towards phenotypic variation. We used populations of the common sunflower (Helianthus annuus) at two latitudes to quantify the effect of microsatellite allele length on phenotype at the level of gene expression. We conducted a common garden experiment with seed collected from sunflower populations in Kansas and Oklahoma followed by an RNA-Seq experiment on 95 individuals. The effect of microsatellite allele length on gene expression was assessed across 3,325 microsatellites that could be consistently scored. Our study revealed 479 microsatellites at which allele length significantly correlates with gene expression (eSTRs). When irregular allele sizes not conforming to the motif length were removed, the number of eSTRs rose to 2,379. The percentage of variation in gene expression explained by eSTRs ranged from 1%-86% when controlling for population and allele-by-population interaction effects at the 479 eSTRs. Of these eSTRs, 70.4% are in untranslated regions (UTRs). A gene ontology (GO) analysis revealed that eSTRs are significantly enriched for GO terms associated with cis- and trans-regulatory processes. Our findings suggest that a substantial number of transcribed microsatellites can influence gene expression.
Collapse
Affiliation(s)
- Chathurani Ranathunge
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| | - Gregory L Wheeler
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| | - Melody E Chimahusky
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| | - Andy D Perkins
- Department of Computer Science and Engineering, Mississippi State University, Starkville, MS, USA
| | - Sreepriya Pramod
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| | - Mark E Welch
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| |
Collapse
|
27
|
Nagel JH, Cruywagen EM, Machua J, Wingfield MJ, Slippers B. Highly transferable microsatellite markers for the genera Lasiodiplodia and Neofusicoccum. FUNGAL ECOL 2020. [DOI: 10.1016/j.funeco.2019.100903] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
28
|
Dragon’s Blood from Dracaena cambodiana in China: Applied History and Induction Techniques toward Formation Mechanism. FORESTS 2020. [DOI: 10.3390/f11040372] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Dragon’s blood that is extracted from Dracaena plants has been widely used as traditional medicine in various ancient cultures. The application of dragon’s blood has a cherished history in China, even though the original plants were not discovered for some period. Dracaena cochinchinensis and Dracaena cambodiana were successively discovered in southern China during the 1970s–1980s. In the last half of the century, Chinese scientists have extensively investigated the production of dragon’s blood from these two Dracaena species, whereas these results have not been previously systematically summarized, as in the present paper. Herein, we present the applied history in ancient China and artificially induced technologies for dragon’s blood development based on these two Dracaena species, in particular, using tissue cultures seedlings and tender plants of D. cambodiana. Big data research, including transcriptomic and genomic studies, has suggested that dragon’s blood might be a defense substance that is secreted by Dracaena plants in response to (a)biotic stimuli. This review represents an effort to highlight the progress and achievements from applied history as well as induction techniques that are used for the formation of dragon’s blood that have taken place in China. Such knowledge might aid in the global conservation of wild Dracaena species and contribute to understanding dragon blood formation mechanisms, eventually assisting in the efficient utilization of limited Dracaena plant resources for the sustainable production of dragon’s blood.
Collapse
|
29
|
Genovese LM, Mosca MM, Pellegrini M, Geraci F. Dot2dot: accurate whole-genome tandem repeats discovery. Bioinformatics 2019; 35:914-922. [PMID: 30165507 PMCID: PMC6419916 DOI: 10.1093/bioinformatics/bty747] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2018] [Revised: 08/03/2018] [Accepted: 08/24/2018] [Indexed: 01/18/2023] Open
Abstract
MOTIVATION Large-scale sequencing projects have confirmed the hypothesis that eukaryotic DNA is rich in repetitions whose functional role needs to be elucidated. In particular, tandem repeats (TRs) (i.e. short, almost identical sequences that lie adjacent to each other) have been associated to many cellular processes and, indeed, are also involved in several genetic disorders. The need of comprehensive lists of TRs for association studies and the absence of a computational model able to capture their variability have revived research on discovery algorithms. RESULTS Building upon the idea that sequence similarities can be easily displayed using graphical methods, we formalized the structure that TRs induce in dot-plot matrices where a sequence is compared with itself. Leveraging on the observation that a compact representation of these matrices can be built and searched in linear time, we developed Dot2dot: an accurate algorithm fast enough to be suitable for whole-genome discovery of TRs. Experiments on five manually curated collections of TRs have shown that Dot2dot is more accurate than other established methods, and completes the analysis of the biggest known reference genome in about one day on a standard PC. AVAILABILITY AND IMPLEMENTATION Source code and datasets are freely available upon paper acceptance at the URL: https://github.com/Gege7177/Dot2dot. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Marco M Mosca
- Department of Computer Science, University of Liverpool, Liverpool, UK
| | - Marco Pellegrini
- Institute for Informatics and Telematics, CNR, Pisa, Italy.,Laboratory of Integrative Systems Medicine (LISM), Institute of Informatics and Telematics and Institute of Clinical Physiology, Pisa, Italy
| | - Filippo Geraci
- Institute for Informatics and Telematics, CNR, Pisa, Italy
| |
Collapse
|
30
|
Tumi L, Zhang Y, Wang Z, Suni ML, Burgess KS, Ge XJ. Microsatellite markers for the endangered Puya raimondii in Peru. APPLICATIONS IN PLANT SCIENCES 2019; 7:e11308. [PMID: 31890354 PMCID: PMC6923705 DOI: 10.1002/aps3.11308] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/29/2019] [Accepted: 11/01/2019] [Indexed: 06/10/2023]
Abstract
PREMISE Microsatellite primers were developed for Puya raimondii (Bromeliaceae), an endangered species distributed in the Andean Mountains of Bolivia and Peru. METHODS AND RESULTS Genome skimming of P. raimondii, P. macrura, and P. hutchisonii resulted in the selection of 46 pairs of cross-species microsatellite markers. Of these, 12 microsatellite primer pairs produced clear and polymorphic bands in P. raimondii. These primer sets were then used for the detection of potential polymorphisms in 84 P. raimondii individuals collected from four populations in Peru. The number of alleles per locus ranged from one to six, and the observed and expected levels of heterozygosity ranged from 0.000 to 0.8929 and from 0.000 to 0.7662, respectively. CONCLUSIONS The microsatellite markers developed in this study will be useful for future population genetic analyses and breeding system studies in P. raimondii.
Collapse
Affiliation(s)
- Liscely Tumi
- Laboratorio de Fisiología VegetalFacultad de Ciencias BiológicasUniversidad Nacional Mayor de San MarcosLimaPeru
| | - Yu‐Qu Zhang
- South China Botanical GardenChinese Academy of SciencesGuangzhou510650People's Republic of China
| | - Zheng‐Feng Wang
- South China Botanical GardenChinese Academy of SciencesGuangzhou510650People's Republic of China
| | - Mery L. Suni
- Laboratorio de Fisiología VegetalFacultad de Ciencias BiológicasUniversidad Nacional Mayor de San MarcosLimaPeru
| | - Kevin S. Burgess
- Department of BiologyColumbus State UniversityColumbusGeorgia31907‐5645USA
| | - Xue-jun Ge
- South China Botanical GardenChinese Academy of SciencesGuangzhou510650People's Republic of China
| |
Collapse
|
31
|
Wang P, Yao FJ, Lu LX, Fang M, Zhang YM, Khan AA, Kong XH, Yu J, Jiang WZ, Kitamoto Y, Honda Y. Map-based cloning of genes encoding key enzymes for pigment synthesis in Auricularia cornea. Fungal Biol 2019; 123:843-853. [PMID: 31627860 DOI: 10.1016/j.funbio.2019.09.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Revised: 06/10/2019] [Accepted: 09/03/2019] [Indexed: 10/26/2022]
Abstract
Color is an important quality attribute of fungi, and a useful marker for classification, genetic, and molecular research. However, there is much debate over which enzymes play key regulatory roles in pigment synthesis pathways among different fungi and even within the same species. Auricularia cornea is the most widely cultivated mushroom in the genus Auricularia; 1.834 million tons of this mushroom were produced in 2016 in China. Thus, systematic studies on its color inheritance and the genes encoding key enzymes for pigment synthesis have high scientific and economic value. In this study, the white strain ACW001 and the purple strain ACP004 of A. cornea were used as dikaryotic parents. Selfing populations of ACW001 and ACP004 were constructed with their monokaryotic strains. The fruiting body color of the two populations was consistent with that of their parents, confirming that the two parents were color homozygotes. All strains in the hybrid population of the two parents produced purple fruiting bodies. A robust hybrid strain (ACW001-33×ACP004-33) was selected from the hybrid population, and 87 monokaryotic strains of ACW001-33×ACP004-33 were obtained as a mapping population. Finally, a testcross population was constructed by crossing the mapping population with the test strain ACW001-9. The color genotype of each monokaryotic strain in the mapping population was identified by a fruiting test. The genomes of the two monokaryotic strains ACW001-33 and ACP004-33 were sequenced, and then simple sequence repeat (SSR) and sequence-related amplified polymorphism (SRAP) molecular marker primers were developed. Then, 88 pairs of primers that could distinguish the genotypes of the mapping population were used to construct a genetic linkage map. The genetic linkage map consisted of 12 linkage groups (LGs) spanning 1315.2 cM. The color control locus was preliminarily located at 24.5 cM of the 11th LG. Fine-mapping primers were designed based on sequence differences between ACW001-33 and ACP004-33 in the primary location region. Four color control candidate genes were located in an 8.2-kb region of ACW001-33_contig733 and a 9.2-kb region of ACP004-33_contig802. Homologous alignment and prediction of conserved domain analyses indicated that two of the color control candidate genes encoded proteins with unknown function, and the other two, ACP004_g11815 and ACP004_g11816, encoded glutamyl aminotransferases. These two genes were consecutively arranged on ACP004-33_contig802, and were likely to encode key enzymes in the γ-glutamine-4-hydroxy-benzoate (GHB) pigment synthesis pathway. Primers were designed from the flanking sequences of the two genes and used to analyze the testcross population. Products were amplified only from the 30 testcross strains with purple fruiting bodies, confirming the accuracy of the localization results. We discuss the deficiencies and advantages of map-based cloning in fungi vs. plants, and summarize the steps and requirements of the map-based cloning method for fungi. This study has provided novel ideas and methods for locating functional genes in fungi.
Collapse
Affiliation(s)
- Peng Wang
- Engineering Research Center of Chinese Ministry of Education for Edible and Medicinal Fungi, Jilin Agricultural University, Changchun, 130118, China
| | - Fang-Jie Yao
- Engineering Research Center of Chinese Ministry of Education for Edible and Medicinal Fungi, Jilin Agricultural University, Changchun, 130118, China; College of Horticulture, Jilin Agricultural University, Changchun, 130118, China.
| | - Li-Xin Lu
- College of Horticulture, Jilin Agricultural University, Changchun, 130118, China
| | - Ming Fang
- College of Horticulture, Jilin Agricultural University, Changchun, 130118, China
| | - You-Min Zhang
- College of Horticulture, Jilin Agricultural University, Changchun, 130118, China.
| | - Asif Ali Khan
- Engineering Research Center of Chinese Ministry of Education for Edible and Medicinal Fungi, Jilin Agricultural University, Changchun, 130118, China
| | - Xiang-Hui Kong
- Engineering Research Center of Chinese Ministry of Education for Edible and Medicinal Fungi, Jilin Agricultural University, Changchun, 130118, China
| | - Jing Yu
- Engineering Research Center of Chinese Ministry of Education for Edible and Medicinal Fungi, Jilin Agricultural University, Changchun, 130118, China
| | - Wan-Zhu Jiang
- Engineering Research Center of Chinese Ministry of Education for Edible and Medicinal Fungi, Jilin Agricultural University, Changchun, 130118, China
| | - Yutaka Kitamoto
- Engineering Research Center of Chinese Ministry of Education for Edible and Medicinal Fungi, Jilin Agricultural University, Changchun, 130118, China
| | - Yoichi Honda
- Graduate School of Agriculture, Kyoto University, Kyoto, 6068502, Japan
| |
Collapse
|
32
|
Acquadro A, Torello Marinoni D, Sartor C, Dini F, Macchio M, Botta R. Transcriptome characterization and expression profiling in chestnut cultivars resistant or susceptible to the gall wasp Dryocosmus kuriphilus. Mol Genet Genomics 2019; 295:107-120. [PMID: 31506717 DOI: 10.1007/s00438-019-01607-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Accepted: 08/27/2019] [Indexed: 02/07/2023]
Abstract
The oriental gall wasp Dryocosmus kuriphilus represents a limiting pest for the European Chestnut (Castanea sativa, Fagaceae) as it creates severe yield losses. The European Chestnut is a deciduous tree, having major social, economic and environmental importance in Southern Europe, covering an area of 2.53 million hectares, including 75,000 ha devoted to fruit production. Cultivars show different susceptibility and very few are resistant to gall wasp. To deeply investigate the plant response and understand which factors can lead the plant to develop or not the gall, the study of transcriptome is basic (fundamental). To date, little transcriptomic information are available for C. sativa species. Hence, we present a de novo assembly of the chestnut transcriptome of the resistant Euro-Japanese hybrid 'Bouche de Bétizac' (BB) and the susceptible cultivar 'Madonna' (M), collecting RNA from buds at different stages of budburst. The two transcriptomes were assembled into 34,081 (BB) and 30,605 (M) unigenes, respectively. The former was used as a reference sequence for further characterization analyses, highlighting the presence of 1444 putative resistance gene analogs (RGAs) and about 1135 unigenes, as putative MiRNA targets. A global quantitative transcriptome profiling comparing the resistant and the susceptible cultivars, in the presence or not of the gall wasp, revealed some GO enrichments as "response to stimulus" (GO:0050896), and "developmental processes" (e.g., post-embryonic development, GO:0009791). Many up-regulated genes appeared to be transcription factors (e.g., RAV1, AP2/ERF, WRKY33) or protein regulators (e.g., RAPTOR1B) and storage proteins (e.g., LEA D29) involved in "post-embryonic development". Our analysis was able to provide a large amount of information, including 7k simple sequence repeat (SSR) and 335k single-nucleotide polymorphism (SNP)/INDEL markers, and generated the first reference unigene catalog for the European Chestnut. The transcriptome data for C. sativa will contribute to understand the genetic basis of the resistance to gall wasp and will provide useful information for next molecular genetic studies of this species and its relatives.
Collapse
Affiliation(s)
- Alberto Acquadro
- DISAFA, Dipartimento di Scienze Agrarie, Forestali e Alimentari, Università degli Studi di Torino, Largo Paolo Braccini 2, 10095, Grugliasco, Italy
| | - Daniela Torello Marinoni
- DISAFA, Dipartimento di Scienze Agrarie, Forestali e Alimentari, Università degli Studi di Torino, Largo Paolo Braccini 2, 10095, Grugliasco, Italy.
| | - Chiara Sartor
- DISAFA, Dipartimento di Scienze Agrarie, Forestali e Alimentari, Università degli Studi di Torino, Largo Paolo Braccini 2, 10095, Grugliasco, Italy
| | - Francesca Dini
- DISAFA, Dipartimento di Scienze Agrarie, Forestali e Alimentari, Università degli Studi di Torino, Largo Paolo Braccini 2, 10095, Grugliasco, Italy
| | - Matteo Macchio
- DISAFA, Dipartimento di Scienze Agrarie, Forestali e Alimentari, Università degli Studi di Torino, Largo Paolo Braccini 2, 10095, Grugliasco, Italy
| | - Roberto Botta
- DISAFA, Dipartimento di Scienze Agrarie, Forestali e Alimentari, Università degli Studi di Torino, Largo Paolo Braccini 2, 10095, Grugliasco, Italy
| |
Collapse
|
33
|
Characterisation of polymorphic microsatellite loci in three copper butterfly species (Lycaena spp.). Mol Biol Rep 2019; 46:6585-6591. [PMID: 31482433 DOI: 10.1007/s11033-019-05056-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Accepted: 08/29/2019] [Indexed: 10/26/2022]
Abstract
The genus Lycaena is widely used for studying life history evolution, local adaptation, stress biology, and behavior. Furthermore, several species are currently declining and thus of conservation concern. In order to provide the molecular basis for population genetics and conservation biology, we report the development of 36 microsatellite markers for Lycaena spp. Loci were screened in 21 individuals each per species using individuals from Greifswald, north-eastern Germany (L. tityrus) or the Italian and Austrian Alps (L. hippothoe and L. virgaureae). Ten, sixteen, and ten polymorphic microsatellite loci are characterised in L. tityrus, L. hippothoe, and L. virgaureae, respectively. Allele numbers per locus ranged from three to 20 and expected heterozygosity from 0.37 to 0.94. Nineteen out of the 36 loci were successfully cross-amplified in at least one other taxon, resulting in a total of 13 loci for L. tityrus tityrus, 14 for L. tityrus subalpinus, 20 for L. hippothoe, and 18 for L. virgaureae. These markers will be useful for addressing population genetic issues in L. tityrus, L. hippothoe, and L. virgaureae, and potentially other Copper butterflies.
Collapse
|
34
|
IDSSR: An Efficient Pipeline for Identifying Polymorphic Microsatellites from a Single Genome Sequence. Int J Mol Sci 2019; 20:ijms20143497. [PMID: 31315288 PMCID: PMC6678329 DOI: 10.3390/ijms20143497] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Revised: 06/25/2019] [Accepted: 07/15/2019] [Indexed: 12/02/2022] Open
Abstract
Simple sequence repeats (SSRs) are known as microsatellites, and consist of tandem 1–6-base motifs. They have become one of the most popular molecular markers, and are widely used in molecular ecology, conservation biology, molecular breeding, and many other fields. Previously reported methods identify monomorphic and polymorphic SSRs and determine the polymorphic SSRs via experimental validation, which is potentially time-consuming and costly. Herein, we present a new strategy named insertion/deletion (INDEL) SSR (IDSSR) to identify polymorphic SSRs by integrating SSRs with nucleotide insertions/deletions (INDEL) solely based on a single genome sequence and the sequenced pair-end reads. These INDEL indexes and polymorphic SSRs were identified, as well as the number of repeats, repeat motifs, chromosome location, annealing temperature, and primer sequences, enabling future experimental approaches to determine the correctness and polymorphism. Experimental validation with the giant panda demonstrated that our method has high reliability and stability. The efficient SSR pipeline would help researchers obtain high-quality genetic markers for plants and animals of interest, save labor, and reduce costly marker-screening experiments. IDSSR is freely available at https://github.com/Allsummerking/IDSSR.
Collapse
|
35
|
Shamanskiy VA, Timonina VN, Popadin KY, Gunbin KV. ImtRDB: a database and software for mitochondrial imperfect interspersed repeats annotation. BMC Genomics 2019; 20:295. [PMID: 31284879 PMCID: PMC6614062 DOI: 10.1186/s12864-019-5536-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Mitochondria is a powerhouse of all eukaryotic cells that have its own circular DNA (mtDNA) encoding various RNAs and proteins. Somatic perturbations of mtDNA are accumulating with age thus it is of great importance to uncover the main sources of mtDNA instability. Recent analyses demonstrated that somatic mtDNA deletions depend on imperfect repeats of various nature between distant mtDNA segments. However, till now there are no comprehensive databases annotating all types of imperfect repeats in numerous species with sequenced complete mitochondrial genome as well as there are no algorithms capable to call all types of imperfect repeats in circular mtDNA. RESULTS We implemented naïve algorithm of pattern recognition by analogy to standard dot-plot construction procedures allowing us to find both perfect and imperfect repeats of four main types: direct, inverted, mirror and complementary. Our algorithm is adapted to specific characteristics of mtDNA such as circularity and an excess of short repeats - it calls imperfect repeats starting from the length of 10 b.p. We constructed interactive web available database ImtRDB depositing perfect and imperfect repeats positions in mtDNAs of more than 3500 Vertebrate species. Additional tools, such as visualization of repeats within a genome, comparison of repeat densities among different genomes and a possibility to download all results make this database useful for many biologists. Our first analyses of the database demonstrated that mtDNA imperfect repeats (i) are usually short; (ii) associated with unfolded DNA structures; (iii) four types of repeats positively correlate with each other forming two equivalent pairs: direct and mirror versus inverted and complementary, with identical nucleotide content and similar distribution between species; (iv) abundance of repeats is negatively associated with GC content; (v) dinucleotides GC versus CG are overrepresented on light chain of mtDNA covered by repeats. CONCLUSIONS ImtRDB is available at http://bioinfodbs.kantiana.ru/ImtRDB/ . It is accompanied by the software calling all types of interspersed repeats with different level of degeneracy in circular DNA. This database and software can become a very useful tool in various areas of mitochondrial and chloroplast DNA research.
Collapse
Affiliation(s)
- Viktor A Shamanskiy
- Center for Mitochondrial Functional Genomics, School of Life Science, Immanuel Kant Baltic Federal University, Kaliningrad, Russia
| | - Valeria N Timonina
- Center for Mitochondrial Functional Genomics, School of Life Science, Immanuel Kant Baltic Federal University, Kaliningrad, Russia
| | - Konstantin Yu Popadin
- Center for Mitochondrial Functional Genomics, School of Life Science, Immanuel Kant Baltic Federal University, Kaliningrad, Russia.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Konstantin V Gunbin
- Center for Mitochondrial Functional Genomics, School of Life Science, Immanuel Kant Baltic Federal University, Kaliningrad, Russia. .,Center of Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia.
| |
Collapse
|
36
|
Kolesnikova AI, Putintseva YA, Simonov EP, Biriukov VV, Oreshkova NV, Pavlov IN, Sharov VV, Kuzmin DA, Anderson JB, Krutovsky KV. Mobile genetic elements explain size variation in the mitochondrial genomes of four closely-related Armillaria species. BMC Genomics 2019; 20:351. [PMID: 31068137 PMCID: PMC6506933 DOI: 10.1186/s12864-019-5732-z] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2018] [Accepted: 04/24/2019] [Indexed: 12/03/2022] Open
Abstract
Background Species in the genus Armillaria (fungi, basidiomycota) are well-known as saprophytes and pathogens on plants. Many of them cause white-rot root disease in diverse woody plants worldwide. Mitochondrial genomes (mitogenomes) are widely used in evolutionary and population studies, but despite the importance and wide distribution of Armillaria, the complete mitogenomes have not previously been reported for this genus. Meanwhile, the well-supported phylogeny of Armillaria species provides an excellent framework in which to study variation in mitogenomes and how they have evolved over time. Results Here we completely sequenced, assembled, and annotated the circular mitogenomes of four species: A. borealis, A. gallica, A. sinapina, and A. solidipes (116,443, 98,896, 103,563, and 122,167 bp, respectively). The variation in mitogenome size can be explained by variable numbers of mobile genetic elements, introns, and plasmid-related sequences. Most Armillaria introns contained open reading frames (ORFs) that are related to homing endonucleases of the LAGLIDADG and GIY-YIG families. Insertions of mobile elements were also evident as fragments of plasmid-related sequences in Armillaria mitogenomes. We also found several truncated gene duplications in all four mitogenomes. Conclusions Our study showed that fungal mitogenomes have a high degree of variation in size, gene content, and genomic organization even among closely related species of Armillara. We suggest that mobile genetic elements invading introns and intergenic sequences in the Armillaria mitogenomes have played a significant role in shaping their genome structure. The mitogenome changes we describe here are consistent with widely accepted phylogenetic relationships among the four species. Electronic supplementary material The online version of this article (10.1186/s12864-019-5732-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Anna I Kolesnikova
- Laboratory of Forest Genomics, Genome Research and Education Center, Institute of Fundamental Biology and Biotechnology, Siberian Federal University, Krasnoyarsk, 660036, Russia.,Laboratory of Genomic Research and Biotechnology, Federal Research Center "Krasnoyarsk Science Center of the Siberian Branch of the Russian Academy of Sciences", Krasnoyarsk, 660036, Russia
| | - Yuliya A Putintseva
- Laboratory of Forest Genomics, Genome Research and Education Center, Institute of Fundamental Biology and Biotechnology, Siberian Federal University, Krasnoyarsk, 660036, Russia
| | - Evgeniy P Simonov
- Laboratory of Genomic Research and Biotechnology, Federal Research Center "Krasnoyarsk Science Center of the Siberian Branch of the Russian Academy of Sciences", Krasnoyarsk, 660036, Russia.,Institute of Animal Systematics and Ecology, Siberian Branch of Russian Academy of Sciences, 630091, Novosibirsk, Russia
| | - Vladislav V Biriukov
- Laboratory of Forest Genomics, Genome Research and Education Center, Institute of Fundamental Biology and Biotechnology, Siberian Federal University, Krasnoyarsk, 660036, Russia.,Laboratory of Genomic Research and Biotechnology, Federal Research Center "Krasnoyarsk Science Center of the Siberian Branch of the Russian Academy of Sciences", Krasnoyarsk, 660036, Russia
| | - Natalya V Oreshkova
- Laboratory of Forest Genomics, Genome Research and Education Center, Institute of Fundamental Biology and Biotechnology, Siberian Federal University, Krasnoyarsk, 660036, Russia.,Laboratory of Genomic Research and Biotechnology, Federal Research Center "Krasnoyarsk Science Center of the Siberian Branch of the Russian Academy of Sciences", Krasnoyarsk, 660036, Russia.,Laboratory of Forest Genetics and Selection, V. N. Sukachev Institute of Forest, Siberian Branch of Russian Academy of Sciences, Krasnoyarsk, 660036, Russia
| | - Igor N Pavlov
- Laboratory of Reforestation, Mycology and Plant Pathology, V. N. Sukachev Institute of Forest, Siberian Branch of Russian Academy of Sciences, Krasnoyarsk, 660036, Russia
| | - Vadim V Sharov
- Laboratory of Forest Genomics, Genome Research and Education Center, Institute of Fundamental Biology and Biotechnology, Siberian Federal University, Krasnoyarsk, 660036, Russia.,Laboratory of Genomic Research and Biotechnology, Federal Research Center "Krasnoyarsk Science Center of the Siberian Branch of the Russian Academy of Sciences", Krasnoyarsk, 660036, Russia.,Department of High Performance Computing, Institute of Space and Information Technologies, Siberian Federal University, Krasnoyarsk, 660074, Russia
| | - Dmitry A Kuzmin
- Laboratory of Forest Genomics, Genome Research and Education Center, Institute of Fundamental Biology and Biotechnology, Siberian Federal University, Krasnoyarsk, 660036, Russia.,Department of High Performance Computing, Institute of Space and Information Technologies, Siberian Federal University, Krasnoyarsk, 660074, Russia
| | - James B Anderson
- Department of Biology, University of Toronto, Mississauga, ON, l5L 1C6, Canada
| | - Konstantin V Krutovsky
- Laboratory of Forest Genomics, Genome Research and Education Center, Institute of Fundamental Biology and Biotechnology, Siberian Federal University, Krasnoyarsk, 660036, Russia. .,Department of Forest Genetics and Forest Tree Breeding, Georg-August University of Göttingen, 37077, Göttingen, Germany. .,Laboratory of Population Genetics, N. I. Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, 119333, Russia. .,Department of Ecosystem Science and Management, Texas A&M University, College Station, TX, 77843-2138, USA.
| |
Collapse
|
37
|
Mathema VB, Dondorp AM, Imwong M. OSTRFPD: Multifunctional Tool for Genome-Wide Short Tandem Repeat Analysis for DNA, Transcripts, and Amino Acid Sequences with Integrated Primer Designer. Evol Bioinform Online 2019; 15:1176934319843130. [PMID: 31040636 PMCID: PMC6482647 DOI: 10.1177/1176934319843130] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 03/15/2019] [Indexed: 01/18/2023] Open
Abstract
Microsatellite mining is a common outcome of the in silico approach to genomic studies. The resulting short tandemly repeated DNA could be used as molecular markers for studying polymorphism, genotyping and forensics. The omni short tandem repeat finder and primer designer (OSTRFPD) is among the few versatile, platform-independent open-source tools written in Python that enables researchers to identify and analyse genome-wide short tandem repeats in both nucleic acids and protein sequences. OSTRFPD is designed to run either in a user-friendly fully featured graphical interface or in a command line interface mode for advanced users. OSTRFPD can detect both perfect and imperfect repeats of low complexity with customisable scores. Moreover, the software has built-in architecture to simultaneously filter selection of flanking regions in DNA and generate microsatellite-targeted primers implementing the Primer3 platform. The software has built-in motif-sequence generator engines and an additional option to use the dictionary mode for custom motif searches. The software generates search results including general statistics containing motif categorisation, repeat frequencies, densities, coverage, guanine–cytosine (GC) content, and simple text-based imperfect alignment visualisation. Thus, OSTRFPD presents users with a quick single-step solution package to assist development of microsatellite markers and categorise tandemly repeated amino acids in proteome databases. Practical implementation of OSTRFPD was demonstrated using publicly available whole-genome sequences of selected Plasmodium species. OSTRFPD is freely available and open-sourced for improvement and user-specific adaptation.
Collapse
Affiliation(s)
- Vivek Bhakta Mathema
- Department of Molecular Tropical Medicine and Genetics, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
| | - Arjen M Dondorp
- Mahidol-Oxford Tropical Medicine Research unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
- Centre for Tropical Medicine, Churchill Hospital, Oxford, UK
| | - Mallika Imwong
- Department of Molecular Tropical Medicine and Genetics, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
- Mallika Imwong, Department of Molecular Tropical Medicine and Genetics, Faculty of Tropical Medicine, Mahidol University, Bangkok 10400, Thailand.
| |
Collapse
|
38
|
Ding X, Mei W, Huang S, Wang H, Zhu J, Hu W, Ding Z, Tie W, Peng S, Dai H. Genome survey sequencing for the characterization of genetic background of Dracaena cambodiana and its defense response during dragon's blood formation. PLoS One 2018; 13:e0209258. [PMID: 30550595 PMCID: PMC6294377 DOI: 10.1371/journal.pone.0209258] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Accepted: 12/03/2018] [Indexed: 11/26/2022] Open
Abstract
Dragon's blood collected from the genus Dracaena is used as a renowned traditional medicine in various cultures worldwide. However, the genetics of the genus Dracaena and the formation mechanism of dragon's blood remain poorly understood. Here, we generate the first draft genome reference assembly of an elite Chinese Dracaena species, Dracaena cambodiana, from next-generation sequencing data with 89.46× coverage. The reads were assembled into 2,640,704 contigs with an N50 length of 1.87 kb, and a 1.05 Gb assembly was finally assembled with 2,379,659 scaffolds. Furthermore, 97.75% of the 267,243 simple sequence repeats identified from these scaffolds were mononucleotide, dinucleotide, and trinucleotide repeats. Among all 53,700 predicted genes, 158 genes involved in cell wall and plant hormone synthesis and reactive oxygen species scavenging showed altered regulation during the formation of dragon's blood. This study provides a genomic characterization of D. cambodiana and improves understanding of the molecular mechanism of dragon's blood formation. This report represents the first genome-wide characterization of a Dracaena species in the Asparagaceae.
Collapse
Affiliation(s)
- Xupo Ding
- Key Laboratory of Biology and Genetic Resources of Tropical Crops of Ministry of Agriculture and Rural Affairs, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
- Hainan Key Laboratory for Research and Development of Natural Products from Li folk Medicine, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
| | - Wenli Mei
- Key Laboratory of Biology and Genetic Resources of Tropical Crops of Ministry of Agriculture and Rural Affairs, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
- Hainan Key Laboratory for Research and Development of Natural Products from Li folk Medicine, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
| | - Shengzhuo Huang
- Key Laboratory of Biology and Genetic Resources of Tropical Crops of Ministry of Agriculture and Rural Affairs, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
- Hainan Key Laboratory for Research and Development of Natural Products from Li folk Medicine, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
| | - Hui Wang
- Key Laboratory of Biology and Genetic Resources of Tropical Crops of Ministry of Agriculture and Rural Affairs, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
- Hainan Key Laboratory for Research and Development of Natural Products from Li folk Medicine, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
| | - Jiahong Zhu
- Key Laboratory of Biology and Genetic Resources of Tropical Crops of Ministry of Agriculture and Rural Affairs, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
| | - Wei Hu
- Key Laboratory of Biology and Genetic Resources of Tropical Crops of Ministry of Agriculture and Rural Affairs, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
| | - Zehong Ding
- Key Laboratory of Biology and Genetic Resources of Tropical Crops of Ministry of Agriculture and Rural Affairs, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
| | - Weiwei Tie
- Key Laboratory of Biology and Genetic Resources of Tropical Crops of Ministry of Agriculture and Rural Affairs, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
| | - Shiqing Peng
- Key Laboratory of Biology and Genetic Resources of Tropical Crops of Ministry of Agriculture and Rural Affairs, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
| | - Haofu Dai
- Key Laboratory of Biology and Genetic Resources of Tropical Crops of Ministry of Agriculture and Rural Affairs, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
- Hainan Key Laboratory for Research and Development of Natural Products from Li folk Medicine, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, People’s Republic of China
| |
Collapse
|
39
|
Gong J, Xiao M, Wang H, Kudinha T, Wang Y, Zhao F, Wu W, He L, Xu YC, Zhang J. Genetic Differentiation, Diversity, and Drug Susceptibility of Candida krusei. Front Microbiol 2018; 9:2717. [PMID: 30524386 PMCID: PMC6256198 DOI: 10.3389/fmicb.2018.02717] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2018] [Accepted: 10/24/2018] [Indexed: 11/23/2022] Open
Abstract
Candida krusei is a notable pathogenic fungus that causes invasive candidiasis, mainly due to its natural resistance to fluconazole. However, to date, there is limited research on the genetic population features of C. krusei. We developed a set of microsatellite markers for this organism, with a cumulative discriminatory power of 1,000. Using these microsatellite loci, 48 independent C. krusei strains of clearly known the sources, were analyzed. Furthermore, susceptibility to 9 antifungal agents was determined for each strain, by the Clinical and Laboratory Standards Institute broth microdilution method. Population structure analyses revealed that C. krusei could be separated into two clusters. The cluster with the higher genetic diversity had wider MIC ranges for six antifungal agents. Furthermore, the highest MIC values of the six antifungal agents belonged to the cluster with higher genetic diversity. The higher genetic diversity cluster might have a better adaptive capacity when C. krusei is under selection pressure from antifungal agents, and thus is more likely to develop drug resistance.
Collapse
Affiliation(s)
- Jie Gong
- State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Meng Xiao
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China.,Beijing Key Laboratory for Mechanisms Research and Precision Diagnosis of Invasive Fungal Diseases, Beijing, China
| | - He Wang
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China.,Beijing Key Laboratory for Mechanisms Research and Precision Diagnosis of Invasive Fungal Diseases, Beijing, China
| | - Timothy Kudinha
- School of Biomedical Science, Charles Sturt University, Orange, NSW, Australia.,Central West Pathology Laboratory, Orange, NSW, Australia
| | - Yu Wang
- Key Laboratory of Wildlife Biotechnology, Conservation and Utilization of Zhejiang Province, Zhejiang Normal University, Jinhua, China
| | - Fei Zhao
- State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Weiwei Wu
- Department of Dermatology, Hainan Provincial Center for Skin Disease and STI Control, Haikou, China
| | - Lihua He
- State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Ying-Chun Xu
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China.,Beijing Key Laboratory for Mechanisms Research and Precision Diagnosis of Invasive Fungal Diseases, Beijing, China
| | - Jianzhong Zhang
- State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
| |
Collapse
|
40
|
Du L, Zhang C, Liu Q, Zhang X, Yue B, Hancock J. Krait: an ultrafast tool for genome-wide survey of microsatellites and primer design. Bioinformatics 2018; 34:681-683. [PMID: 29048524 DOI: 10.1093/bioinformatics/btx665] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2017] [Accepted: 10/17/2017] [Indexed: 12/16/2022] Open
Abstract
Summary Microsatellites are found to be related with various diseases and widely used in population genetics as genetic markers. However, it remains a challenge to identify microsatellite from large genome and screen microsatellites for primer design from a huge result dataset. Here, we present Krait, a robust and flexible tool for fast investigation of microsatellites in DNA sequences. Krait is designed to identify all types of perfect or imperfect microsatellites on a whole genomic sequence, and is also applicable to identification of compound microsatellites. Primer3 was seamlessly integrated into Krait so that users can design primer for microsatellite amplification in an efficient way. Additionally, Krait can export microsatellite results in FASTA or GFF3 format for further analysis and generate statistical report as well as plotting. Availability and implementation Krait is freely available at https://github.com/lmdu/krait under GPL2 License, implemented in C and Python, and supported on Windows, Linux and Mac operating systems. Contact chizhang_swmu@126.com. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lianming Du
- Institute for Advanced Study, Chengdu University, Chengdu, Sichuan 610106, China
| | - Chi Zhang
- Rehabilitation Medicine Department, The Affiliated Hospital of Southwest Medical University, Luzhou, Sichuan 646099, China
| | - Qin Liu
- College of Life Sciences and Food Engineering, Yibin University, Yibin, Sichuan 644000, China
| | - Xiuyue Zhang
- Key Laboratory of Bio-Resources and Eco-Environment, Ministry of Education, College of Life Science, Sichuan University, Chengdu, Sichuan 610065, China
| | - Bisong Yue
- Key Laboratory of Bio-Resources and Eco-Environment, Ministry of Education, College of Life Science, Sichuan University, Chengdu, Sichuan 610065, China
| | | |
Collapse
|
41
|
Durigan M, Cardoso-Silva CB, Ciampi-Guillardi M, Toledo-Silva G, Mori GM, Franco RMB, Souza AP. Molecular genotyping, diversity studies and high-resolution molecular markers unveiled by microsatellites in Giardia duodenalis. PLoS Negl Trop Dis 2018; 12:e0006928. [PMID: 30500829 PMCID: PMC6291164 DOI: 10.1371/journal.pntd.0006928] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Revised: 12/12/2018] [Accepted: 10/16/2018] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Giardia duodenalis (synonyms G. lamblia and G. intestinalis) is an enteric protozoan parasite of a wide range of mammalian hosts, including humans and various domestic and wild animals. There is considerable genetic variability in G. duodenalis and isolates of this parasite have been divided into eight genetic assemblages. Microsatellites markers can be used to discriminate isolates with a high level of sensitivity. This study was conducted to identify and characterize genomic microsatellites (simple sequence repeats-SSRs), sequences of one- to six-nucleotide motifs repeated in tandem, present in the available genomes of G. duodenalis and to develop new markers that can serve as a tool for detection and for characterizing the genetic diversity of this parasite. METHODOLOGY/ PRINCIPAL FINDINGS For each genetic assemblage, polymorphism levels for the microsatellite markers were evaluated. After performing the analysis using the MISA and SciRoKo software, 1,853 simple sequence repeats (SSRs) were identified. In all the genomes, trinucleotide repeats were the most common class followed by tetranucleotide. Many of the SSR loci are assemblage-specific, and 36 SSR loci shared among all the genomes were identified. Together with hypothetical proteins, variant-specific surface proteins represented nearly half of the annotated SSR loci. The results regarding the most common repeat among the SSRs led us to infer that positive selection occurred to avoid frameshift mutations. Additionally, based on inter- and intra-genetic assemblages polymorphism analyses, we unveiled previously undetected genetic variation, indicating that the microsatellite markers we developed are useful molecular tools for epidemiological inferences based on population genetics patterns and processes. CONCLUSIONS There is increasing demand for the development of new molecular markers and for the characterization of pathogens at a higher resolution level. In this study, we present 60 G. duodenalis microsatellites markers that exhibited high polymerase chain reaction (PCR) amplification efficiency among the different genetic assemblages. Twenty of these markers presented nucleotide sequence polymorphisms and may be used as a genotyping tool. The monomorphic markers can be used for the detection of the parasite at the species and genetic assemblage level. These polymorphic markers revealed a genetic diversity that was previously undetectable, thus they can be considered valuable molecular tools for high resolution markers in future studies investigating Giardia and may also be used for epidemiological inferences based on populations genetics patterns and processes.
Collapse
Affiliation(s)
- Maurício Durigan
- Centro de Biologia Molecular e Engenharia Genética (CBMEG), Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Claudio Benício Cardoso-Silva
- Centro de Biologia Molecular e Engenharia Genética (CBMEG), Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Maísa Ciampi-Guillardi
- Centro de Biologia Molecular e Engenharia Genética (CBMEG), Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
- Departamento de Fitopatologia–ESALQ–Universidade de São Paulo, Piracicaba, São Paulo, Brazil
| | - Guilherme Toledo-Silva
- Centro de Biologia Molecular e Engenharia Genética (CBMEG), Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Gustavo M. Mori
- Instituto de Biociências, Campus do Litoral Paulista, Universidade Estadual Paulista (Unesp), São Vicente, Sao Paulo, Brazil
| | - Regina M. B. Franco
- Departamento de Biologia Animal, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Anete P. Souza
- Centro de Biologia Molecular e Engenharia Genética (CBMEG), Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
| |
Collapse
|
42
|
Zhu WC, Sun JT, Dai J, Huang JR, Chen L, Hong XY. New microsatellites revealed strong gene flow among populations of a new outbreak pest, Athetis lepigone (Möschler). BULLETIN OF ENTOMOLOGICAL RESEARCH 2018; 108:636-644. [PMID: 29173200 DOI: 10.1017/s000748531700116x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Athetis lepigone (Möschler) (Lepidoptera: Noctuidae) is a new outbreak pest in China. Consequently, it is unclear whether the emergence and spread of the outbreak of this pest are triggered by rapid in situ population size increases in each outbreak area, or by immigrants from a potential source area in China. In order to explore the outbreak process of this pest through a population genetics approach, we developed ten novel polymorphic expressed sequence tags (EST)-derived microsatellites. These new microsatellites had moderately high levels of polymorphism in the tested population. The number of alleles per locus ranged from 3 to 19, with an average of 8.6, and the expected heterozygosity ranged from 0.269 to 0.783. A preliminary population genetic analysis using these new microsatellites revealed a lack of population genetic structure in natural populations of A. lepigone. The estimates of recent migration rate revealed strong gene flow among populations. In conclusion, our study developed the first set of EST-microsatellite markers and shed a new light on the population genetic structure of this pest in China.
Collapse
Affiliation(s)
- W-C Zhu
- Department of Entomology,Nanjing Agricultural University,Nanjing, Jiangsu 210095,China
| | - J-T Sun
- Department of Entomology,Nanjing Agricultural University,Nanjing, Jiangsu 210095,China
| | - J Dai
- Department of Entomology,Nanjing Agricultural University,Nanjing, Jiangsu 210095,China
| | - J-R Huang
- Institute of Plant Protection, Henan Academy of Agricultural Sciences,Zhengzhou, Henan 450002,China
| | - L Chen
- Department of Entomology,Nanjing Agricultural University,Nanjing, Jiangsu 210095,China
| | - X-Y Hong
- Department of Entomology,Nanjing Agricultural University,Nanjing, Jiangsu 210095,China
| |
Collapse
|
43
|
Qi WH, Jiang XM, Yan CC, Zhang WQ, Xiao GS, Yue BS, Zhou CQ. Distribution patterns and variation analysis of simple sequence repeats in different genomic regions of bovid genomes. Sci Rep 2018; 8:14407. [PMID: 30258087 PMCID: PMC6158176 DOI: 10.1038/s41598-018-32286-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Accepted: 09/04/2018] [Indexed: 01/23/2023] Open
Abstract
As the first examination of distribution, guanine-cytosine (GC) pattern, and variation analysis of microsatellites (SSRs) in different genomic regions of six bovid species, SSRs displayed nonrandomly distribution in different regions. SSR abundances are much higher in the introns, transposable elements (TEs), and intergenic regions compared to the 3′-untranslated regions (3′UTRs), 5′UTRs and coding regions. Trinucleotide perfect SSRs (P-SSRs) were the most frequent in the coding regions, whereas, mononucleotide P-SSRs were the most in the introns, 3′UTRs, TEs, and intergenic regions. Trifold P-SSRs had more GC-contents in the 5′UTRs and coding regions than that in the introns, 3′UTRs, TEs, and intergenic regions, whereas mononucleotide P-SSRs had the least GC-contents in all genomic regions. The repeat copy numbers (RCN) of the same mono- to hexanucleotide P-SSRs showed significantly different distributions in different regions (P < 0.01). Except for the coding regions, mononucleotide P-SSRs had the most RCNs, followed by the pattern: di- > tri- > tetra- > penta- > hexanucleotide P-SSRs in the same regions. The analysis of coefficient of variability (CV) of SSRs showed that the CV variations of RCN of the same mono- to hexanucleotide SSRs were relative higher in the intronic and intergenic regions, followed by the CV variation of RCN in the TEs, and the relative lower was in the 5′UTRs, 3′UTRs, and coding regions. Wide SSR analysis of different genomic regions has helped to reveal biological significances of their distributions.
Collapse
Affiliation(s)
- Wen-Hua Qi
- College of Biology and Food Engineering, Chongqing Three Gorges University, Chongqing, 404100, P. R. China
| | - Xue-Mei Jiang
- College of Environmental and Chemistry Engineering, Chongqing Three Gorges University, Chongqing, 404100, P. R. China
| | - Chao-Chao Yan
- Key Laboratory of Bio-resources and Eco-environment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, 610064, P. R. China
| | - Wan-Qing Zhang
- College of Life Sciences, Sichuan Agricultural University, Ya'an, Sichuan Province, 625014, P. R. China
| | - Guo-Sheng Xiao
- College of Biology and Food Engineering, Chongqing Three Gorges University, Chongqing, 404100, P. R. China
| | - Bi-Song Yue
- Key Laboratory of Bio-resources and Eco-environment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, 610064, P. R. China
| | - Cai-Quan Zhou
- Key Laboratory of Southwest China Wildlife Resources Conservation (Ministry of Education), China West Normal University, Nanchong, 637009, P. R. China.
| |
Collapse
|
44
|
Biswas MK, Natarajan S, Biswas D, Nath UK, Park JI, Nou IS. LSAT: Liliaceae Simple Sequences Analysis Tool, a web server. Bioinformation 2018; 14:181-182. [PMID: 29983488 PMCID: PMC6016758 DOI: 10.6026/97320630014181] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2018] [Revised: 04/21/2018] [Accepted: 04/28/2018] [Indexed: 01/01/2023] Open
Abstract
LSAT is a web-based microsatellite SSR marker designer tool specific for the Liliaceae family. It is developed using HTML, CSS, PHP, Perl and Java scripts. It works without extra add-ons on standard browsers. LSAT provides SSR primer designing service using the web interface. It helps in SSR mining and primer design. LSAT is user friendly with customizable search parameters producing visual output having download options. The current version of LSAT is backed by two data sets, namely, lily EST (Expressed Sequence Tag) from NCBI and lily nr (non redundant) with 4,099 and 216,768 unigenes, respectively. LSAT will be updated regularly upon availability of additional data (either EST and/or transcriptome) on Liliaceae. AVAILABILITY LSAT is available for free at http://210.110.86.160/Lsat/Lsat.html.
Collapse
Affiliation(s)
- Manosh Kumar Biswas
- Department of Horticulture, Sunchon National University, Suncheon, Jeonnam, 57922 South Korea
| | - Sathishkumar Natarajan
- Department of Horticulture, Sunchon National University, Suncheon, Jeonnam, 57922 South Korea
| | - Dhiman Biswas
- Department of Computer Science and Engineering, Maulana Abul Kalam Azad University of Technology, WB, India
| | - Ujjal Kumar Nath
- Department of Horticulture, Sunchon National University, Suncheon, Jeonnam, 57922 South Korea
| | - Jong-In Park
- Department of Horticulture, Sunchon National University, Suncheon, Jeonnam, 57922 South Korea
| | - Ill-Sup Nou
- Department of Horticulture, Sunchon National University, Suncheon, Jeonnam, 57922 South Korea
| |
Collapse
|
45
|
Beier S, Thiel T, Münch T, Scholz U, Mascher M. MISA-web: a web server for microsatellite prediction. Bioinformatics 2018; 33:2583-2585. [PMID: 28398459 PMCID: PMC5870701 DOI: 10.1093/bioinformatics/btx198] [Citation(s) in RCA: 1032] [Impact Index Per Article: 172.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2016] [Accepted: 04/06/2017] [Indexed: 12/27/2022] Open
Abstract
Motivation Microsatellites are a widely-used marker system in plant genetics and forensics. The development of reliable microsatellite markers from resequencing data is challenging. Results We extended MISA, a computational tool assisting the development of microsatellite markers, and reimplemented it as a web-based application. We improved compound microsatellite detection and added the possibility to display and export MISA results in GFF3 format for downstream analysis. Availability and Implementation MISA-web can be accessed under http://misaweb.ipk-gatersleben.de/. The website provides tutorials, usage note as well as download links to the source code. Contact scholz@ipk-gatersleben.de.
Collapse
Affiliation(s)
- Sebastian Beier
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstr. 3, 06466 Seeland, Germany
| | - Thomas Thiel
- KWS Saat SE, Grimsehlstr. 31, 37555 Einbeck, Germany
| | - Thomas Münch
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstr. 3, 06466 Seeland, Germany
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstr. 3, 06466 Seeland, Germany
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstr. 3, 06466 Seeland, Germany.,German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Germany
| |
Collapse
|
46
|
Zeng Z, Sun H, Vainio EJ, Raffaello T, Kovalchuk A, Morin E, Duplessis S, Asiegbu FO. Intraspecific comparative genomics of isolates of the Norway spruce pathogen (Heterobasidion parviporum) and identification of its potential virulence factors. BMC Genomics 2018; 19:220. [PMID: 29580224 PMCID: PMC5870257 DOI: 10.1186/s12864-018-4610-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Accepted: 03/20/2018] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Heterobasidion parviporum is an economically most important fungal forest pathogen in northern Europe, causing root and butt rot disease of Norway spruce (Picea abies (L.) Karst.). The mechanisms underlying the pathogenesis and virulence of this species remain elusive. No reference genome to facilitate functional analysis is available for this species. RESULTS To better understand the virulence factor at both phenotypic and genomic level, we characterized 15 H. parviporum isolates originating from different locations across Finland for virulence, vegetative growth, sporulation and saprotrophic wood decay. Wood decay capability and latitude of fungal origins exerted interactive effects on their virulence and appeared important for H. parviporum virulence. We sequenced the most virulent isolate, the first full genome sequences of H. parviporum as a reference genome, and re-sequenced the remaining 14 H. parviporum isolates. Genome-wide alignments and intrinsic polymorphism analysis showed that these isolates exhibited overall high genomic similarity with an average of at least 96% nucleotide identity when compared to the reference, yet had remarkable intra-specific level of polymorphism with a bias for CpG to TpG mutations. Reads mapping coverage analysis enabled the classification of all predicted genes into five groups and uncovered two genomic regions exclusively present in the reference with putative contribution to its higher virulence. Genes enriched for copy number variations (deletions and duplications) and nucleotide polymorphism were involved in oxidation-reduction processes and encoding domains relevant to transcription factors. Some secreted protein coding genes based on the genome-wide selection pressure, or the presence of variants were proposed as potential virulence candidates. CONCLUSION Our study reported on the first reference genome sequence for this Norway spruce pathogen (H. parviporum). Comparative genomics analysis gave insight into the overall genomic variation among this fungal species and also facilitated the identification of several secreted protein coding genes as putative virulence factors for the further functional analysis. We also analyzed and identified phenotypic traits potentially linked to its virulence.
Collapse
Affiliation(s)
- Zhen Zeng
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
| | - Hui Sun
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
- Collaborative Innovation Center of Sustainable Forestry in Southern China, College of Forestry, Nanjing Forestry University, Nanjing, China
| | - Eeva J. Vainio
- Natural Resources Institute Finland (Luke), Helsinki, Finland
| | - Tommaso Raffaello
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
| | - Andriy Kovalchuk
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
| | - Emmanuelle Morin
- INRA UMR 1136 Interactions Arbres Micro-organismes, INRA Centre Grand Est Nancy, Champenoux, France
| | - Sébastien Duplessis
- INRA UMR 1136 Interactions Arbres Micro-organismes, INRA Centre Grand Est Nancy, Champenoux, France
- UMR 1136 Interactions Arbres/Microorganismes, Faculté des Sciences et Technologies, Université de Lorraine, Vandoeuvre-lès-Nancy, France
| | - Fred O. Asiegbu
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
| |
Collapse
|
47
|
Fan H, Guo W. A genome-wide investigation of microsatellite mismatches and the association with body mass among bird species. PeerJ 2018; 6:e4495. [PMID: 29576965 PMCID: PMC5857172 DOI: 10.7717/peerj.4495] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Accepted: 02/21/2018] [Indexed: 01/26/2023] Open
Abstract
Mutation rate is usually found to covary with many life history traits of animals such as body mass, which has been readily explained by the higher number of mutation opportunities per unit time. Although the precise reason for the pattern is not yet clear, to determine the universality of this pattern, we tested whether life history traits impact another form of genetic mutation, the motif mismatches in microsatellites. Employing published genome sequences from 65 avian species, we explored the motif mismatches patterns of microsatellites in birds on a genomic level and assessed the relationship between motif mismatches and body mass in a phylogenetic context. We found that small-bodied species have a higher average mismatches and we suggested that higher heterozygosity in imperfect microsatellites lead to the increase of motif mismatches. Our results obtained from this study imply that a negative body mass trend in mutation rate may be a general pattern of avian molecular evolution.
Collapse
Affiliation(s)
- Haiying Fan
- Department of Ecology, College of Life Sciences, Wuhan University, Wuhan, China
| | - Weibin Guo
- Department of Ecology, College of Life Sciences, Wuhan University, Wuhan, China
| |
Collapse
|
48
|
Ranathunge C, Wheeler GL, Chimahusky ME, Kennedy MM, Morrison JI, Baldwin BS, Perkins AD, Welch ME. Transcriptome profiles of sunflower reveal the potential role of microsatellites in gene expression divergence. Mol Ecol 2018; 27:1188-1199. [PMID: 29419922 DOI: 10.1111/mec.14522] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2017] [Revised: 01/18/2018] [Accepted: 01/29/2018] [Indexed: 12/17/2022]
Abstract
The mechanisms by which natural populations generate adaptive genetic variation are not well understood. Some studies propose that microsatellites can function as drivers of adaptive variation. Here, we tested a potentially adaptive role for transcribed microsatellites with natural populations of the common sunflower (Helianthus annuus L.) by assessing the enrichment of microsatellites in genes that show expression divergence across latitudes. Seeds collected from six populations at two distinct latitudes in Kansas and Oklahoma were planted and grown in a common garden. Morphological measurements from the common garden demonstrated that phenotypic variation among populations is largely explained by underlying genetic variation. An RNA-Seq experiment was conducted with 96 of the individuals grown in the common garden and differentially expressed (DE) transcripts between the two latitudes were identified. A total number of 825 DE transcripts were identified. DE transcripts and nondifferentially expressed (NDE) transcripts were then scanned for microsatellites. The abundance of different motif lengths and types in both groups were estimated. Our results indicate that DE transcripts are significantly enriched with mononucleotide repeats and significantly depauperate in trinucleotide repeats. Further, the standardized mononucleotide repeat motif A and dinucleotide repeat motif AG were significantly enriched within DE transcripts while motif types, C, AT, ACC and AAC in DE transcripts, are significantly differentiated in microsatellite tract length between the two latitudes. The tract length differentiation at specific microsatellite motif types across latitudes and their enrichment within DE transcripts indicate a potential functional role for transcribed microsatellites in gene expression divergence in sunflower.
Collapse
Affiliation(s)
- Chathurani Ranathunge
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| | - Gregory L Wheeler
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA.,Department of Evolution, Ecology, and Organismal Biology, The Ohio State University, Columbus, OH, USA
| | - Melody E Chimahusky
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| | - Meaghan M Kennedy
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Jesse I Morrison
- Department of Plant and Soil Sciences, Mississippi State University, Starkville, MS, USA
| | - Brian S Baldwin
- Department of Plant and Soil Sciences, Mississippi State University, Starkville, MS, USA
| | - Andy D Perkins
- Department of Computer Science and Engineering, Mississippi State University, Starkville, MS, USA
| | - Mark E Welch
- Department of Biological Sciences, Mississippi State University, Starkville, MS, USA
| |
Collapse
|
49
|
Shirasawa K, Isuzugawa K, Ikenaga M, Saito Y, Yamamoto T, Hirakawa H, Isobe S. The genome sequence of sweet cherry (Prunus avium) for use in genomics-assisted breeding. DNA Res 2017; 24:499-508. [PMID: 28541388 PMCID: PMC5737369 DOI: 10.1093/dnares/dsx020] [Citation(s) in RCA: 124] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2017] [Accepted: 04/25/2017] [Indexed: 12/13/2022] Open
Abstract
We determined the genome sequence of sweet cherry (Prunus avium) using next-generation sequencing technology. The total length of the assembled sequences was 272.4 Mb, consisting of 10,148 scaffold sequences with an N50 length of 219.6 kb. The sequences covered 77.8% of the 352.9 Mb sweet cherry genome, as estimated by k-mer analysis, and included >96.0% of the core eukaryotic genes. We predicted 43,349 complete and partial protein-encoding genes. A high-density consensus map with 2,382 loci was constructed using double-digest restriction site–associated DNA sequencing. Comparing the genetic maps of sweet cherry and peach revealed high synteny between the two genomes; thus the scaffolds were integrated into pseudomolecules using map- and synteny-based strategies. Whole-genome resequencing of six modern cultivars found 1,016,866 SNPs and 162,402 insertions/deletions, out of which 0.7% were deleterious. The sequence variants, as well as simple sequence repeats, can be used as DNA markers. The genomic information helps us to identify agronomically important genes and will accelerate genetic studies and breeding programs for sweet cherries. Further information on the genomic sequences and DNA markers is available in DBcherry (http://cherry.kazusa.or.jp (8 May 2017, date last accessed)).
Collapse
Affiliation(s)
- Kenta Shirasawa
- Kazusa DNA Research Institute, Kisarazu, Chiba 292-0818, Japan
| | - Kanji Isuzugawa
- Horticultural Experiment Station, Yamagata Integrated Agricultural Research Center, Sagae, Yamagata 991-0043, Japan
| | - Mitsunobu Ikenaga
- Central Agricultural Experiment Station, Agricultural Research Department, Hokkaido Research Organization, Naganuma, Hokkaido 069-1395, Japan
| | - Yutaro Saito
- Horticultural Experiment Station, Yamagata Integrated Agricultural Research Center, Sagae, Yamagata 991-0043, Japan
| | - Toshiya Yamamoto
- Institute of Fruit Tree and Tea Science, National Agriculture and Food Research Organization, Tsukuba, Ibaraki 305-8605, Japan
| | - Hideki Hirakawa
- Kazusa DNA Research Institute, Kisarazu, Chiba 292-0818, Japan
| | - Sachiko Isobe
- Kazusa DNA Research Institute, Kisarazu, Chiba 292-0818, Japan
| |
Collapse
|
50
|
Ding S, Wang S, He K, Jiang M, Li F. Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects. BMC Genomics 2017; 18:848. [PMID: 29110701 PMCID: PMC5674736 DOI: 10.1186/s12864-017-4234-0] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2017] [Accepted: 10/23/2017] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Simple sequence repeats (SSR), also called microsatellites, have been widely used as genetic markers, and have been extensively studied in some model insects. At present, the genomes of more than 100 insect species are available. However, the features of SSRs in most insect genomes remain largely unknown. RESULTS We identified 15.01 million SSRs across 136 insect genomes. The number of identified SSRs was positively associated with genome size in insects, but the frequency and density per megabase of genomes were not. Most insect SSRs (56.2-93.1%) were perfect (no mismatch). Imperfect (at least one mismatch) SSRs (average length 22-73 bp) were longer than perfect SSRs (16-30 bp). The most abundant insect SSRs were the di- and trinucleotide types, which accounted for 27.2% and 22.0% of all SSRs, respectively. On average, 59.1%, 36.8%, and 3.7% of insect SSRs were located in intergenic, intronic, and exonic regions, respectively. The percentages of various types of SSRs were similar among insects from the same family. However, they were dissimilar among insects from different families within orders. We carried out a phylogenetic analysis using the SSR frequencies. Species from the same family were generally clustered together in the evolutionary tree. However, insects from the same order but not in the same family did not cluster together. These results indicated that although SSRs undergo rapid expansions and contractions in different populations of the same species, the general genomic features of insect SSRs remain conserved at the family level. CONCLUSION Millions of insect SSRs were identified and their genome features were analyzed. Most insect SSRs were perfect and were located in intergenic regions. We presented evidence that the variance of insect SSRs accumulated after the differentiation of insect families.
Collapse
Affiliation(s)
- Simin Ding
- Ministry of Agriculture Key Lab of Molecular Biology of Crop Pathogens and Insects, Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058 China
| | - Shuping Wang
- Technical Centre for Animal Plant and Food Inspection and Quarantine, Shanghai Entry-exit Inspection and Quarantine Bureau, Shanghai, 200135 China
| | - Kang He
- Ministry of Agriculture Key Lab of Molecular Biology of Crop Pathogens and Insects, Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058 China
| | - Mingxing Jiang
- Ministry of Agriculture Key Lab of Molecular Biology of Crop Pathogens and Insects, Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058 China
| | - Fei Li
- Ministry of Agriculture Key Lab of Molecular Biology of Crop Pathogens and Insects, Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058 China
| |
Collapse
|