1
|
Nakamura-García AK, Espinal-Enríquez J. Pseudogenes in Cancer: State of the Art. Cancers (Basel) 2023; 15:4024. [PMID: 37627052 PMCID: PMC10452131 DOI: 10.3390/cancers15164024] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 07/24/2023] [Accepted: 07/26/2023] [Indexed: 08/27/2023] Open
Abstract
Pseudogenes are duplicates of protein-coding genes that have accumulated multiple detrimental alterations, rendering them unable to produce the protein they encode. Initially disregarded as "junk DNA" due to their perceived lack of functionality, research on their biological roles has been hindered by this assumption. Nevertheless, recent focus has shifted towards these molecules due to their abnormal expression in cancer phenotypes. In this review, our objective is to provide a thorough overview of the current understanding of pseudogene formation, the mechanisms governing their expression, and the roles they may play in promoting tumorigenesis.
Collapse
|
2
|
Chechetkin VR, Lobzin VV. Ribonucleocapsid assembly/packaging signals in the genomes of the coronaviruses SARS-CoV and SARS-CoV-2: detection, comparison and implications for therapeutic targeting. J Biomol Struct Dyn 2022; 40:508-522. [PMID: 32901577 PMCID: PMC7544952 DOI: 10.1080/07391102.2020.1815581] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2020] [Accepted: 08/21/2020] [Indexed: 12/24/2022]
Abstract
The genomic ssRNA of coronaviruses is packaged within a helical nucleocapsid. Due to transitional symmetry of a helix, weakly specific cooperative interaction between ssRNA and nucleocapsid proteins leads to the natural selection of specific quasi-periodic assembly/packaging signals in the related genomic sequence. Such signals coordinated with the nucleocapsid helical structure were detected and reconstructed in the genomes of the coronaviruses SARS-CoV and SARS-CoV-2. The main period of the signals for both viruses was about 54 nt, that implies 6.75 nt per N protein. The complete coverage of the ssRNA genome of length about 30,000 nt by the nucleocapsid would need 4.4 × 103 N proteins, that makes them the most abundant among the structural proteins. The repertoires of motifs for SARS-CoV and SARS-CoV-2 were divergent but nearly coincided for different isolates of SARS-CoV-2. We obtained the distributions of assembly/packaging signals over the genomes with nonoverlapping windows of width 432 nt. Finally, using the spectral entropy, we compared the load from point mutations and indels during virus age for SARS-CoV and SARS-CoV-2. We found the higher mutational load on SARS-CoV. In this sense, SARS-CoV-2 can be treated as a 'newborn' virus. These observations may be helpful in practical medical applications and are of basic interest. Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Vladimir R. Chechetkin
- Engelhardt Institute of Molecular Biology of
Russian Academy of Sciences, Moscow,
Russia
| | | |
Collapse
|
3
|
Balakirev ES, Anisimova M, Pavlyuchkov VA, Ayala FJ. DNA polymorphism and selection at the bindin locus in three Strongylocentrotus sp. (Echinoidea). BMC Genet 2016; 17:66. [PMID: 27176219 PMCID: PMC4866015 DOI: 10.1186/s12863-016-0374-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2015] [Accepted: 05/02/2016] [Indexed: 02/04/2023] Open
Abstract
BACKGROUND The sperm gene bindin encodes a gamete recognition protein, which plays an important role in conspecific fertilization and reproductive isolation of sea urchins. Molecular evolution of the gene has been extensively investigated with the attention focused on the protein coding regions. Intron evolution has been investigated to a much lesser extent. We have studied nucleotide variability in the complete bindin locus, including two exons and one intron, in the sea urchin Strongylocentrotus intermedius represented by two morphological forms. We have also analyzed all available bindin sequences for two other sea urchin species, S. pallidus and S. droebachiensis. RESULTS The results show that the bindin sequences from the two forms of S. intermedius are intermingled with no evidence of genetic divergence; however, the forms exhibit slightly different patterns in bindin variability. The level of the bindin nucleotide diversity is close for S. intermedius and S. droebachiensis, but noticeably higher for S. pallidus. The distribution of variability is non-uniform along the gene; however there are striking similarities among the species, indicating similar evolutionary trends in this gene engaged in reproductive function. The patterns of nucleotide variability and divergence are radically different in the bindin coding and intron regions. Positive selection is detected in the bindin coding region. The neutrality tests as well as the maximum likelihood approaches suggest the action of diversifying selection in the bindin intron. CONCLUSIONS Significant deviation from neutrality has been detected in the bindin coding region and suggested in the intron, indicating the possible functional importance of the bindin intron variability. To clarify the question concerning possible involvement of diversifying selection in the bindin intron evolution more data combining population genetic and functional approaches are necessary.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- A. V. Zhirmunsky Institute of Marine Biology, Far Eastern Branch of the Russian Academy of Science, Vladivostok, 690041, Russia.
- Department of Ecology and Evolutionary Biology, University of California, 321 Steinhaus Hall, Irvine, CA, 92697-2525, USA.
- Far Eastern Federal University, Vladivostok, 690950, Russia.
| | - Maria Anisimova
- Institute of Applied Simulation, School of Life Sciences and Facility Management, Zürich University of Applied Sciences, Wädenswil, 8820, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, 1015, Switzerland
| | | | - Francisco J Ayala
- Department of Ecology and Evolutionary Biology, University of California, 321 Steinhaus Hall, Irvine, CA, 92697-2525, USA
| |
Collapse
|
4
|
Mahmudi O, Sennblad B, Arvestad L, Nowick K, Lagergren J. Gene-pseudogene evolution: a probabilistic approach. BMC Genomics 2015; 16 Suppl 10:S12. [PMID: 26449131 PMCID: PMC4602177 DOI: 10.1186/1471-2164-16-s10-s12] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Over the last decade, methods have been developed for the reconstruction of gene trees that take into account the species tree. Many of these methods have been based on the probabilistic duplication-loss model, which describes how a gene-tree evolves over a species-tree with respect to duplication and losses, as well as extension of this model, e.g., the DLRS (Duplication, Loss, Rate and Sequence evolution) model that also includes sequence evolution under relaxed molecular clock. A disjoint, almost as recent, and very important line of research has been focused on non protein-coding, but yet, functional DNA. For instance, DNA sequences being pseudogenes in the sense that they are not translated, may still be transcribed and the thereby produced RNA may be functional. We extend the DLRS model by including pseudogenization events and devise an MCMC framework for analyzing extended gene families consisting of genes and pseudogenes with respect to this model, i.e., reconstructing gene-trees and identifying pseudogenization events in the reconstructed gene-trees. By applying the MCMC framework to biologically realistic synthetic data, we show that gene-trees as well as pseudogenization points can be inferred well. We also apply our MCMC framework to extended gene families belonging to the Olfactory Receptor and Zinc Finger superfamilies. The analysis indicate that both these super families contains very old pseudogenes, perhaps so old that it is reasonable to suspect that some are functional. In our analysis, the sub families of the Olfactory Receptors contains only lineage specific pseudogenes, while the sub families of the Zinc Fingers contains pseudogene lineages common to several species.
Collapse
|
5
|
Chen Z, Gao K, Su X, Rao P, An X. Genome-Wide Identification of the Invertase Gene Family in Populus. PLoS One 2015; 10:e0138540. [PMID: 26393355 PMCID: PMC4579127 DOI: 10.1371/journal.pone.0138540] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2015] [Accepted: 08/31/2015] [Indexed: 11/18/2022] Open
Abstract
Invertase plays a crucial role in carbohydrate partitioning and plant development as it catalyses the irreversible hydrolysis of sucrose into glucose and fructose. The invertase family in plants is composed of two sub-families: acid invertases, which are targeted to the cell wall and vacuole; and neutral/alkaline invertases, which function in the cytosol. In this study, 5 cell wall invertase genes (PtCWINV1-5), 3 vacuolar invertase genes (PtVINV1-3) and 16 neutral/alkaline invertase genes (PtNINV1-16) were identified in the Populus genome and found to be distributed on 14 chromosomes. A comprehensive analysis of poplar invertase genes was performed, including structures, chromosome location, phylogeny, evolutionary pattern and expression profiles. Phylogenetic analysis indicated that the two sub-families were both divided into two clades. Segmental duplication is contributed to neutral/alkaline sub-family expansion. Furthermore, the Populus invertase genes displayed differential expression in roots, stems, leaves, leaf buds and in response to salt/cold stress and pathogen infection. In addition, the analysis of enzyme activity and sugar content revealed that invertase genes play key roles in the sucrose metabolism of various tissues and organs in poplar. This work lays the foundation for future functional analysis of the invertase genes in Populus and other woody perennials.
Collapse
Affiliation(s)
- Zhong Chen
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants of the Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China
| | - Kai Gao
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants of the Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China
| | - Xiaoxing Su
- Beijing Berry Genomics Company Limited, Beijing, China
| | - Pian Rao
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants of the Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China
| | - Xinmin An
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants of the Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China
| |
Collapse
|
6
|
Guo Z, Jiang W, Lages N, Borcherds W, Wang D. Relationship between gene duplicability and diversifiability in the topology of biochemical networks. BMC Genomics 2014; 15:577. [PMID: 25005725 PMCID: PMC4129122 DOI: 10.1186/1471-2164-15-577] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2014] [Accepted: 06/26/2014] [Indexed: 01/21/2023] Open
Abstract
Background Selective gene duplicability, the extensive expansion of a small number of gene families, is universal. Quantitatively, the number of genes (P(K)) with K duplicates in a genome decreases precipitously as K increases, and often follows a power law (P(k)∝k-α). Functional diversification, either neo- or sub-functionalization, is a major evolution route for duplicate genes. Results Using three lines of genomic datasets, we studied the relationship between gene duplicability and diversifiability in the topology of biochemical networks. First, we explored scenario where two pathways in the biochemical networks antagonize each other. Synthetic knockout of respective genes for the two pathways rescues the phenotypic defects of each individual knockout. We identified duplicate gene pairs with sufficient divergences that represent this antagonism relationship in the yeast S. cerevisiae. Such pairs overwhelmingly belong to large gene families, thus tend to have high duplicability. Second, we used distances between proteins of duplicate genes in the protein interaction network as a metric of their diversification. The higher a gene’s duplicate count, the further the proteins of this gene and its duplicates drift away from one another in the networks, which is especially true for genetically antagonizing duplicate genes. Third, we computed a sequence-homology-based clustering coefficient to quantify sequence diversifiability among duplicate genes – the lower the coefficient, the more the sequences have diverged. Duplicate count (K) of a gene is negatively correlated to the clustering coefficient of its duplicates, suggesting that gene duplicability is related to the extent of sequence divergence within the duplicate gene family. Conclusion Thus, a positive correlation exists between gene diversifiability and duplicability in the context of biochemical networks – an improvement of our understanding of gene duplicability.
Collapse
Affiliation(s)
| | | | | | | | - Degeng Wang
- Greehey Children's Cancer Research Institute, University of Texas Health Science Center at San Antonio, 8403 Floyd Curl Drive, San Antonio, TX 78229-3900, USA.
| |
Collapse
|
7
|
Molecular and functional characterization of cDNAs putatively encoding carboxylesterases from the migratory locust, Locusta migratoria. PLoS One 2014; 9:e94809. [PMID: 24722667 PMCID: PMC3983256 DOI: 10.1371/journal.pone.0094809] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2013] [Accepted: 03/20/2014] [Indexed: 11/20/2022] Open
Abstract
Carboxylesterases (CarEs) belong to a superfamily of metabolic enzymes encoded by a number of genes and are widely distributed in microbes, plants and animals including insects. These enzymes play important roles in detoxification of insecticides and other xenobiotics, degradation of pheromones, regulation of neurodevelopment, and control of animal development. In this study, we characterized a total of 39 full-length cDNAs putatively encoding different CarEs from the migratory locust, Locusta migratoria, one of the most severe insect pests in many regions of the world, and evaluated the role of four CarE genes in insecticide detoxification. Our phylogenetic analysis grouped the 39 CarEs into five different clades including 20 CarEs in clade A, 3 in D, 13 in E, 1 in F and 2 in I. Four CarE genes (LmCesA3, LmCesA20, LmCesD1, LmCesE1), representing three different clades (A, D and E), were selected for further analyses. The transcripts of the four genes were detectable in all the developmental stages and tissues examined. LmCesA3 and LmCesE1 were mainly expressed in the fat bodies and Malpighian tubules, whereas LmCesA20 and LmCesD1 were predominately expressed in the muscles and hemolymph, respectively. The injection of double-stranded RNA (dsRNA) synthesized from each of the four CarE genes followed by the bioassay with each of four insecticides (chlorpyrifos, malathion, carbaryl and deltamethrin) increased the nymphal mortalities by 37.2 and 28.4% in response to malathion after LmCesA20 and LmCesE1 were silenced, respectively. Thus, we proposed that both LmCesA20 and LmCesE1 played an important role in detoxification of malathion in the locust. These results are expected to help researchers reveal the characteristics of diverse CarEs and assess the risk of insecticide resistance conferred by CarEs in the locust and other insect species.
Collapse
|
8
|
Lopes VF, Cabral H, Machado LPB, Mateus RP. Purification and characterization of a specific late-larval esterase from two species of the Drosophila repleta group: contributions to understand its evolution. Zool Stud 2014. [DOI: 10.1186/1810-522x-53-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Abstract
Background
After duplication, one copy of an original gene can become redundant and decay toward a pseudogene status or functionally diverge. Here, we performed the purification and biochemical characterization of EST-4 (a late larval β-esterase) from two Drosophila repleta group species, Drosophila mulleri and Drosophila arizonae, in order to establish comparative parameters between these enzymes in these species and to contribute to better understand their evolution.
Results
In D. mulleri, EST-4 had an optimal activity in temperatures ranging from 40° to 45°C and at pH 7.5, maintaining stability in alkaline pH (8.0 to 10.0). It was classified as serine esterase as its activity was inhibited by PMSF. No ion negatively modulated EST-4 activity, and iron had the most positive modulating effect. In D. arizonae, it showed similar optimum temperature (40°C), pH (8.0), and was also classified as a serine esterase, but the enzymatic stability was maintained in an acidic pH (5.5 to 6.5). Fe+2 had the opposite effect found in D. mulleri, that is, negative modulation. Al+3 almost totally inhibited the EST-4 activity, and Na+ and Cu+2 had a positive modulation effect. Kinetic studies, using ρ-nitrophenyl acetate as substrate, showed that EST-4 from D. mulleri had higher affinity, while in D. arizonae, it showed higher V
max and catalytic efficiency in optimal reaction conditions.
Conclusions
EST-4 from D. mulleri and D. arizonae are very closely related and still maintain several similar features; however, they show some degree of differentiation. Considering that EST-4 from D. mulleri has more conspicuous gel mobility difference among all EST-4 studied so far and a lower catalytic efficiency was observed here, we proposed that after duplication, this new copy of the original gene became redundant and started to decay toward a pseudogene status in this species, which probably is not occurring in D. arizonae.
Collapse
|
9
|
Balakirev ES, Chechetkin VR, Lobzin VV, Ayala FJ. Computational methods of identification of pseudogenes based on functionality: entropy and GC content. Methods Mol Biol 2014; 1167:41-62. [PMID: 24823770 DOI: 10.1007/978-1-4939-0835-6_4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Spectral entropy and GC content analyses reveal comprehensive structural features of DNA sequences. To illustrate the significance of these features, we analyze the β-esterase gene cluster, including the Est-6 gene and the ψEst-6 putative pseudogene, in seven species of the Drosophila melanogaster subgroup. The spectral entropies show distinctly lower structural ordering for ψEst-6 than for Est-6 in all species studied. However, entropy accumulation is not a completely random process for either gene and it shows to be nucleotide dependent. Furthermore, GC content in synonymous positions is uniformly higher in Est-6 than in ψEst-6, in agreement with the reduced GC content generally observed in pseudogenes and nonfunctional sequences. The observed differences in entropy and GC content reflect an evolutionary shift associated with the process of pseudogenization and subsequent functional divergence of ψEst-6 and Est-6 after the duplication event. The data obtained show the relevance and significance of entropy and GC content analyses for pseudogene identification and for the comparative study of gene-pseudogene evolution.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, USA,
| | | | | | | |
Collapse
|
10
|
Balakirev ES, Romanov NS, Mikheev PB, Ayala FJ. Mitochondrial DNA variation and introgression in Siberian taimen Hucho taimen. PLoS One 2013; 8:e71147. [PMID: 23951096 PMCID: PMC3741329 DOI: 10.1371/journal.pone.0071147] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2013] [Accepted: 06/22/2013] [Indexed: 11/18/2022] Open
Abstract
Siberian taimen Hucho taimen is the largest representative of the family Salmonidae inhabiting rivers of northern Eurasia. The species is under intensive aquaculture activity. To monitor natural taimen populations we have sequenced a portion (8,141 bp) of the mitochondrial (mt) genome in 28 specimens of H. taimen from six localities in the Amur River basin. Nucleotide variability is low (π = 0.0010), but structured in two divergent haplotype groups. A comparison of the data with the GenBank H. taimen mt genome (HQ897271) reveals significant differences between them in spite of the fact that the fish specimens come from neighboring geographical areas. The distribution of divergence is non-uniform with two highly pronounced divergent regions centered on two genes, ND3 and ND6. To clarify the pattern of divergence we sequenced the corresponding portion of the mt genome of lenok Brachymystax tumensis and analyzed the GenBank complete mt genomes of related species. We have found that the first and second divergent regions are identical between the GenBank H. taimen and two lenok subspecies, B. lenok and B. lenok tsinlingensis, respectively. Consequently, both divergent regions represent introgressed mtDNA resulting from intergeneric hybridization between the two lenok subspecies and H. taimen. Introgression is, however, not detected in our specimens. This plus the precise identity of the introgressed fragments between the donor and the recipient GenBank sequence suggests that the introgression is local and very recent, probably due to artificial manipulations involving taimen – lenok intergeneric hybridization. Human-mediated hybridization may become a major threat to aquatic biodiversity. Consequently we suggest that due attention needs to be given to this threat by means of responsible breeding program management, so as to prevent a potential spread of hybrid fishes that could jeopardize the resilience of locally adapted gene pools of the native H. taimen populations.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California, USA.
| | | | | | | |
Collapse
|
11
|
Kravatskaya GI, Chechetkin VR, Kravatsky YV, Tumanyan VG. Structural attributes of nucleotide sequences in promoter regions of supercoiling-sensitive genes: how to relate microarray expression data with genomic sequences. Genomics 2012; 101:1-11. [PMID: 23085385 DOI: 10.1016/j.ygeno.2012.10.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2012] [Revised: 09/10/2012] [Accepted: 10/11/2012] [Indexed: 11/18/2022]
Abstract
The level of supercoiling in the chromosome can affect gene expression. To clarify the basis of supercoiling sensitivity, we analyzed the structural features of nucleotide sequences in the vicinity of promoters for the genes with expression enhanced and decreased in response to loss of chromosomal supercoiling in Escherichia coli. Fourier analysis of promoter sequences for supercoiling-sensitive genes reveals the tendency in selection of sequences with helical periodicities close to 10nt for relaxation-induced genes and to 11nt for relaxation-repressed genes. The helical periodicities in the subsets of promoters recognized by RNA polymerase with different sigma factors were also studied. A special procedure was developed for the study of correlations between the intensities of periodicities in promoter sequences and the expression levels of corresponding genes. Significant correlations of expression with the AT content and with AT periodicities about 10, 11, and 50nt indicate their role in regulation of supercoiling-sensitive genes.
Collapse
Affiliation(s)
- Galina I Kravatskaya
- Engelhardt Institute of Molecular Biology of Russian Academy of Sciences, Russia.
| | | | | | | |
Collapse
|
12
|
Balakirev ES, Krupnova TN, Ayala FJ. DNA variation in the phenotypically-diverse brown alga Saccharina japonica. BMC PLANT BIOLOGY 2012; 12:108. [PMID: 22784095 PMCID: PMC3490969 DOI: 10.1186/1471-2229-12-108] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Accepted: 06/21/2012] [Indexed: 05/17/2023]
Abstract
BACKGROUND Saccharina japonica (Areschoug) Lane, Mayes, Druehl et Saunders is an economically important and highly morphologically variable brown alga inhabiting the northwest Pacific marine waters. On the basis of nuclear (ITS), plastid (rbcLS) and mitochondrial (COI) DNA sequence data, we have analyzed the genetic composition of typical Saccharina japonica (TYP) and its two common morphological varieties, known as the "longipes" (LON) and "shallow-water" (SHA) forms seeking to clarify their taxonomical status and to evaluate the possibility of cryptic species within S. japonica. RESULTS The data show that the TYP and LON forms are very similar genetically in spite of drastic differences in morphology, life history traits, and ecological preferences. Both, however, are genetically quite different from the SHA form. The two Saccharina lineages are distinguished by 109 fixed single nucleotide differences as well as by seven fixed length polymorphisms (based on a 4,286 bp concatenated dataset that includes three gene regions). The GenBank database reveals a close affinity of the TYP and LON forms to S. japonica and the SHA form to S. cichorioides. The three gene markers used in the present work have different sensitivity for the algal species identification. COI gene was the most discriminant gene marker. However, we have detected instances of interspecific COI recombination reflecting putative historical hybridization events between distantly related algal lineages. The recombinant sequences show highly contrasted level of divergence in the 5'- and 3'- regions of the gene, leading to significantly different tree topologies depending on the gene segment (5'- or 3'-) used for tree reconstruction. Consequently, the 5'-COI "barcoding" region (~ 650 bp) can be misleading for identification purposes, at least in the case of algal species that might have experienced historical hybridization events. CONCLUSION Taking into account the potential roles of phenotypic plasticity in evolution, we conclude that the TYP and LON forms represent examples of algae phenotypic diversification that enables successful adaptation to contrasting shallow- and deep-water marine environments, while the SHA form is very similar to S. cichorioides and should be considered a different species. Practical applications for algal management and conservation are briefly considered.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California, 321 Steinhaus Hall, Irvine, CA, 92697-2525, USA
- A. V. Zhirmunsky Institute of Marine Biology, Far Eastern Branch of the Russian Academy of Science, Vladivostok, 690059, Russia
| | - Tatiana N Krupnova
- Pacific Research Fisheries Centre (TINRO-Centre), Vladivostok, 690600, Russia
| | - Francisco J Ayala
- Department of Ecology and Evolutionary Biology, University of California, 321 Steinhaus Hall, Irvine, CA, 92697-2525, USA
| |
Collapse
|
13
|
Balakirev ES, Krupnova TN, Ayala FJ. Symbiotic associations in the phenotypically-diverse brown alga Saccharina japonica. PLoS One 2012; 7:e39587. [PMID: 22745792 PMCID: PMC3379999 DOI: 10.1371/journal.pone.0039587] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2012] [Accepted: 05/23/2012] [Indexed: 02/08/2023] Open
Abstract
The brown alga Saccharina japonica (Areschoug) Lane, Mayes, Druehl et Saunders is a highly polymorphic representative of the family Laminariaceae, inhabiting the northwest Pacific region. We have obtained 16S rRNA sequence data in symbiont microorganisms of the typical form (TYP) of S. japonica and its common morphological varieties, known as "longipes" (LON) and "shallow-water" (SHA), which show contrasting bathymetric distribution and sharp morphological, life history traits, and ecological differences. Phylogenetic analysis of the 16S rRNA sequences shows that the microbial communities are significantly different in the three forms studied and consist of mosaic sets of common and form-specific bacterial lineages. The divergence in bacterial composition is substantial between the TYP and LON forms in spite of their high genetic similarity. The symbiont distribution in the S. japonica forms and in three other laminarialean species is not related to the depth or locality of the algae settlements. Combined with our previous results on symbiont associations in sea urchins and taking into account the highly specific character of bacteria-algae associations, we propose that the TYP and LON forms may represent incipient species passing through initial steps of reproductive isolation. We suggest that phenotype differences between genetically similar forms may be caused by host-symbiont interactions that may be a general feature of evolution in algae and other eukaryote organisms. Bacterial symbionts could serve as sensitive markers to distinguish genetically similar algae forms and also as possible growth-promoting inductors to increase algae productivity.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California, United States of America.
| | | | | |
Collapse
|
14
|
Birner-Gruenberger R, Bickmeyer I, Lange J, Hehlert P, Hermetter A, Kollroser M, Rechberger GN, Kühnlein RP. Functional fat body proteomics and gene targeting reveal in vivo functions of Drosophila melanogaster α-Esterase-7. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2012; 42:220-229. [PMID: 22198472 DOI: 10.1016/j.ibmb.2011.12.004] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2011] [Revised: 12/07/2011] [Accepted: 12/08/2011] [Indexed: 05/31/2023]
Abstract
Carboxylesterases constitute a large enzyme family in insects, which is involved in diverse functions such as xenobiotic detoxification, lipid metabolism and reproduction. Phylogenetically, many insect carboxylesterases are represented by multienzyme clades, which are encoded by evolutionarily ancient gene clusters such as the α-Esterase cluster. Much in contrast to the vital importance attributed to carboxylesterases in general, the in vivo function of individual α-Esterase genes is largely unknown. This study employs a functional proteomics approach to identify esterolytic enzymes of the vinegar fly Drosophila melanogaster fat body. One of the fat body carboxylesterases, α-Esterase-7, was selected for mutational analysis by gene targeting to generate a deletion mutant fly. Phenotypic characterization of α-Esterase-7 null mutants and transgenic flies, which overexpress a chimeric α-Esterase-7:EGFP gene, reveals important functions of α-Esterase-7 in insecticide tolerance, lipid metabolism and lifespan control. The presented first deletion mutant of any α-Esterase in the model insect D. melanogaster generated by gene targeting not only provides experimental evidence for the endogenous functions of this gene family. It also offers an entry point for in vivo structure-function analyses of α-Esterase-7, which is of central importance for naturally occurring insecticide resistance in wild populations of various dipteran insect species.
Collapse
Affiliation(s)
- Ruth Birner-Gruenberger
- Research Group Functional Proteomics, Institute of Pathology, and Proteomics Core Facility, Center of Medical Research, Medical University of Graz, Stiftingtalstrasse 24, 8010 Graz, Austria
| | | | | | | | | | | | | | | |
Collapse
|
15
|
Pink RC, Wicks K, Caley DP, Punch EK, Jacobs L, Carter DRF. Pseudogenes: pseudo-functional or key regulators in health and disease? RNA (NEW YORK, N.Y.) 2011; 17:792-8. [PMID: 21398401 PMCID: PMC3078729 DOI: 10.1261/rna.2658311] [Citation(s) in RCA: 293] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Pseudogenes have long been labeled as "junk" DNA, failed copies of genes that arise during the evolution of genomes. However, recent results are challenging this moniker; indeed, some pseudogenes appear to harbor the potential to regulate their protein-coding cousins. Far from being silent relics, many pseudogenes are transcribed into RNA, some exhibiting a tissue-specific pattern of activation. Pseudogene transcripts can be processed into short interfering RNAs that regulate coding genes through the RNAi pathway. In another remarkable discovery, it has been shown that pseudogenes are capable of regulating tumor suppressors and oncogenes by acting as microRNA decoys. The finding that pseudogenes are often deregulated during cancer progression warrants further investigation into the true extent of pseudogene function. In this review, we describe the ways in which pseudogenes exert their effect on coding genes and explore the role of pseudogenes in the increasingly complex web of noncoding RNA that contributes to normal cellular regulation.
Collapse
Affiliation(s)
- Ryan Charles Pink
- School of Life Sciences, Oxford Brookes University, Headington, Oxford, United Kingdom
| | | | | | | | | | | |
Collapse
|
16
|
Koziol U, Iriarte A, Castillo E, Soto J, Bello G, Cajarville A, Roche L, Marín M. Characterization of a putative hsp70 pseudogene transcribed in protoscoleces and adult worms of Echinococcus granulosus. Gene 2009; 443:1-11. [PMID: 19460422 DOI: 10.1016/j.gene.2009.05.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2009] [Revised: 05/06/2009] [Accepted: 05/06/2009] [Indexed: 11/25/2022]
Abstract
Searching for hsp70 genes in Echinococcus granulosus, a divergent cytoplasmic hsp70-like sequence (EgpsiHsp70) was isolated, possessing a small truncation in the region coding for the C-terminal glycine-rich linker and EEVD-Ct motif. Southern Blot analyses of E. granulosus, and in silico analyses of E. multilocularis indicate that this truncated sequence is repeated several times in both genomes, in some cases containing clear cut features of pseudogenization. Phylogenetic analyses and comparison of surrounding regions indicate that all these copies originated by successive genomic duplications of one originally truncated copy. These copies are diverging at an increased rate compared to functional cytoplasmic hsp70 genes, and ratios of non-synonymous over synonymous substitutions rates (dN/dS) point to a relaxation of sequence constraint, suggesting that these sequences are pseudogenes. Interestingly, RT-PCR demonstrates that EgpsiHsp70 is transcribed in protoscoleces and adult individuals of E. granulosus. We suggest that this sequence does not code for a functional polypeptide, although some features are unexpected for a sequence evolving under a strictly neutral mode. Transcription could either be vestigial or have a specific, non-coding function.
Collapse
Affiliation(s)
- Uriel Koziol
- Sección Bioquímica-Biología Molecular, Facultad de Ciencias, Iguá 4225, 11400, Montevideo, Uruguay.
| | | | | | | | | | | | | | | |
Collapse
|
17
|
DNA variation and symbiotic associations in phenotypically diverse sea urchin Strongylocentrotus intermedius. Proc Natl Acad Sci U S A 2008; 105:16218-23. [PMID: 18852450 DOI: 10.1073/pnas.0807860105] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Strongylocentrotus intermedius (A. Agassiz, 1863) is an economically important sea urchin inhabiting the northwest Pacific region of Asia. The northern Primorye (Sea of Japan) populations of S. intermedius consist of two sympatric morphological forms, "usual" (U) and "gray" (G). The two forms are significantly different in morphology and preferred bathymetric distribution, the G form prevailing in deeper-water settlements. We have analyzed the genetic composition of the S. intermedius forms using the nucleotide sequences of the mitochondrial gene encoding the cytochrome c oxidase subunit I and the nuclear gene encoding bindin to evaluate the possibility of cryptic species within S. intermedius. We have examined the presence of symbiont microorganisms by means of 16S rRNA sequences. The nucleotide sequence divergence between the morphological forms is low: 0.74% and 0.70% for cytochrome c oxidase subunit I and nuclear gene encoding bindin, respectively, which is significantly below average intrageneric sequence divergence among Strongylocentrotus species. We thus have found no genetic evidence of cryptic species within S. intermedius. Phylogenetic analysis shows that the bacteria symbionts of S. intermedius belong to the phylum Bacteroidetes, but the U and G forms predominantly harbor highly divergent bacterial lineages belonging to two different taxonomic classes, Flavobacteria and Sphingobacteria. We propose that the U and G forms of S. intermedius represent distinct ecomorphological adaptations to contrasting shallow- and deep-water marine environments and might be considered incipient species. We also propose that the symbiotic bacteria likely play an important role in the evolution of morphological divergence of S. intermedius.
Collapse
|
18
|
Bocock PN, Morse AM, Dervinis C, Davis JM. Evolution and diversity of invertase genes in Populus trichocarpa. PLANTA 2008; 227:565-76. [PMID: 17938954 DOI: 10.1007/s00425-007-0639-3] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2007] [Accepted: 09/27/2007] [Indexed: 05/21/2023]
Abstract
Invertase (EC 3.2.1.26) plays a key role in carbon utilization as it catalyzes the irreversible hydrolysis of sucrose into glucose and fructose. The invertase family in plants is composed of two sub-families thought to have distinct evolutionary origins and can be distinguished by their pH optima for activity: acid invertases and neutral/alkaline invertases. The acid invertases apparently originated in eubacteria and are targeted to the cell wall and vacuole, while neutral/alkaline invertases apparently originated in cyanobacteria and function in the cytosol. The recently sequenced genome of Populus trichocharpa (Torr. and Gray) allowed us to identify the genes encoding invertase in this woody perennial. Here we describe the identification of eight acid invertase genes; three of which belong to the vacuolar targeted group (PtVIN1-3), and five of which belong to the cell wall targeted group (PtCIN1-5). Similarly, we report the identification of 16 neutral/alkaline invertase genes (PtNIN1-16). Expression analyses using whole genome microarrays and RT-PCR reveal evidence for expression of all invertase family members. An examination of the micro-syntenic regions surrounding the poplar invertase genes reveals extensive colinearity with Arabidopsis invertases. We also find evidence for expression of a novel intronless vacuolar invertase (PtVIN1), which apparently arose from a processed PtVIN2 transcript that re-inserted into the genome. To our knowledge, this is the first intronless invertase found in plants. This work increases the understanding of the role this family plays in carbon allocation and partitioning in forest trees as well as its evolutionary development.
Collapse
Affiliation(s)
- Philip N Bocock
- Plant Molecular and Cellular Biology Program, University of Florida, PO Box 110690, Gainesville, FL 32611, USA
| | | | | | | |
Collapse
|
19
|
Chechetkin VR, Lobzin VV. Anticodons, frameshifts, and hidden periodicities in tRNA sequences. J Biomol Struct Dyn 2006; 24:189-202. [PMID: 16928142 DOI: 10.1080/07391102.2006.10507112] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Fourier analysis of the short-range periodicities for the complete set of sequences coding for tRNA genes in genome of Bacillus subtilis proves that periodicities with periods p = 2, 3, 4, and 6 sites are the inherent properties of tRNAs. The related periodicities should be understood in a broad statistical sense and their identifying needs the elaborate statistical methods. To improve the statistics, the analysis of significant periodicities was performed for the binary R-Y, S-W, and K-M sequences. Generally, such short-range periodicities are produced via biased positioning of particular nucleotides rather than via the tandem multiplication and subsequent modifications of repeats, though the latter mechanism may also be realized. Quasi-coherently piercing long segments of tRNA, the short-range periodicities create the effective long-range structural coupling between the acceptor stem and the anticodon loop and may participate in the mechanisms of molecular recognition. The periodicities with p = 2 and 4 provide the natural ground for the translation with spontaneous or programmed frameshifting and are present in tRNAs decoding the most frameshift-prone codons. The observation of short-range periodicities suggests that the mechanisms of amino-acylation of tRNAs and codon-anticodon pairing are not independent. Their study may also provide the important information related to the origin and evolution of the genetic code.
Collapse
Affiliation(s)
- V R Chechetkin
- Troitsk Institute of Innovation and Thermonuclear Investigations (TRINITI), Theoretical Department of Division for Perspective Investigations, 142190 Troitsk, Moscow Region, Russia.
| | | |
Collapse
|
20
|
Coppin CW, Odgers WA, Oakeshott JG. Latitudinal clines for nucleotide polymorphisms in the Esterase 6 gene of Drosophila melanogaster. Genetica 2006; 129:259-71. [PMID: 16955332 DOI: 10.1007/s10709-006-0006-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2005] [Accepted: 03/31/2006] [Indexed: 11/27/2022]
Abstract
Previous studies have found non-neutral patterns of nucleotide polymorphism in the promoter and coding regions of Est6 in D. melanogaster. Coding region polymorphism peaks around two closely linked replacement differences associated with the EST6-F/EST6-S allozyme polymorphism. The promoter contains two common, highly diverged haplotype groups, P1 and P7, that differentially affect Est6 expression. Allozyme studies have also revealed latitudinal clines in EST6-F and EST6-S frequencies that recur across continents. Here we analyse nucleotide polymorphisms across the promoter and the region of peak coding sequence polymorphism in 10 Australian populations along a 25 degrees latitudinal gradient in order to examine the basis for the allozyme clines. As with the earlier studies, we find an excess of intermediate to high frequency variants in both the P1/P7 region and around the two EST6-F/EST6-S replacements in some populations. The two EST6-F/EST6-S replacement polymorphisms show latitudinal clines whereas the P1 and P7 groups of promoter haplotypes do not. However the strongest clines are for three co-segregating silent site polymorphisms in a 4 bp stretch at the 3' end of the sequenced region. Monte Carlo simulations show that the clines for those three sites can explain all others in the data but none of the others can explain those three. Thus the allozyme clines may not reflect selection on either the P1/P7 polymorphism or the two replacements previously associated with the EST6-F/EST-S difference.
Collapse
|
21
|
Morrell PL, Toleno DM, Lundy KE, Clegg MT. Estimating the contribution of mutation, recombination and gene conversion in the generation of haplotypic diversity. Genetics 2006; 173:1705-23. [PMID: 16624913 PMCID: PMC1526701 DOI: 10.1534/genetics.105.054502] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2005] [Accepted: 04/11/2006] [Indexed: 11/18/2022] Open
Abstract
Recombination occurs through both homologous crossing over and homologous gene conversion during meiosis. The contribution of recombination relative to mutation is expected to be dramatically reduced in inbreeding organisms. We report coalescent-based estimates of the recombination parameter (rho) relative to estimates of the mutation parameter (theta) for 18 genes from the highly self-fertilizing grass, wild barley, Hordeum vulgare ssp. spontaneum. Estimates of rho/theta are much greater than expected, with a mean rho/theta approximately 1.5, similar to estimates from outcrossing species. We also estimate rho with and without the contribution of gene conversion. Genotyping errors can mimic the effect of gene conversion, upwardly biasing estimates of the role of conversion. Thus we report a novel method for identifying genotyping errors in nucleotide sequence data sets. We show that there is evidence for gene conversion in many large nucleotide sequence data sets including our data that have been purged of all detectable sequencing errors and in data sets from Drosophila melanogaster, D. simulans, and Zea mays. In total, 13 of 27 loci show evidence of gene conversion. For these loci, gene conversion is estimated to contribute an average of twice as much as crossing over to total recombination.
Collapse
Affiliation(s)
- Peter L Morrell
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697, USA
| | | | | | | |
Collapse
|
22
|
Balakirev ES, Anisimova M, Ayala FJ. Positive and negative selection in the beta-esterase gene cluster of the Drosophila melanogaster subgroup. J Mol Evol 2006; 62:496-510. [PMID: 16547641 DOI: 10.1007/s00239-005-0140-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2005] [Accepted: 12/20/2005] [Indexed: 11/25/2022]
Abstract
We examine the pattern of molecular evolution of the beta-esterase gene cluster, including the Est-6 and psiEst-6 genes, in eight species of the Drosophila melanogaster subgroup. Using maximum likelihood estimates of nonsynonymous/synonymous rate ratios, we show that the majority of Est-6 sites evolves under strong (48% of sites) or moderate (50% of sites) negative selection and a minority of sites (1.5%) is under significant positive selection. Est-6 sites likely to be under positive selection are associated with increased intraspecific variability. One positively selected site is responsible for the EST-6 F/S allozyme polymorphism; the same site is responsible for the EST-6 functional divergence between species of the melanogaster subgroup. For psiEst-6 83.7% sites evolve under negative selection, 16% sites evolve neutrally, and 0.3% sites are under positive selection. The positively selected sites of psiEst-6 are located at the beginning and at the end of the gene, where there is reduced divergence between D. melanogaster and D. simulans; these regions of psiEst-6 could be involved in regulation or some other function. Branch-site-specific analysis shows that the evolution of the melanogaster subgroup underwent episodic positive selection. Collating the present data with previous results for the beta-esterase genes, we propose that positive and negative selection are involved in a complex relationship that may be typical of the divergence of duplicate genes as one or both duplicates evolve a new function.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California, 321 Steinhaus Hall, Irvine, CA 92697-2525, USA.
| | | | | |
Collapse
|
23
|
Brosius J. Echoes from the past--are we still in an RNP world? Cytogenet Genome Res 2005; 110:8-24. [PMID: 16093654 DOI: 10.1159/000084934] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2004] [Accepted: 05/04/2004] [Indexed: 11/19/2022] Open
Abstract
Availability of the human genome sequence and those of other species is unmeasured in their value for a comprehensive understanding of the architecture, function and evolution of genomes and cells. Various mechanisms keep genomes in flux and generate intra- and interspecies variation. The conversion of RNA modules into DNA and their more or less random integration into chromosomes (retroposition) is in many lineages including our own the most pervasive and perhaps the most enigmatic. The proclivity of such events in extant multicellular eukaryotes, even in more recent evolutionary times, gives the impression that the transition period from the RNP (ribonucleoprotein) world to the emergence of modern cells, where DNA became the predominant carrier of genetic information, has lasted billions of years and is an endlessly drawn-out process rather than the punctuated event one might expect. Apart from the impact of such RNA-mediated processes as retroposition, the role of RNA in a wide variety of cellular functions has only recently become more widely appreciated.
Collapse
Affiliation(s)
- J Brosius
- Institute of Experimental Pathology, ZMBE, University of Munster, Munster, Germany.
| |
Collapse
|
24
|
Balakirev ES, Chechetkin VR, Lobzin VV, Ayala FJ. Entropy and GC Content in the beta-esterase gene cluster of the Drosophila melanogaster subgroup. Mol Biol Evol 2005; 22:2063-72. [PMID: 15972847 DOI: 10.1093/molbev/msi197] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
We perform spectral entropy and GC content analyses in the beta-esterase gene cluster, including the Est-6 gene and the psiEst-6 putative pseudogene, in seven species of the Drosophila melanogaster species subgroup. psiEst-6 combines features of functional and nonfunctional genes. The spectral entropies show distinctly lower structural ordering for psiEst-6 than for Est-6 in all species studied. Our observations agree with previous results for D. melanogaster and provide additional support to our hypothesis that after the duplication event Est-6 retained the esterase-coding function and its role during copulation, while psiEst-6 lost that function but now operates in conjunction with Est-6 as an intergene. Entropy accumulation is not a completely random process for either gene. Structural entropy is nucleotide dependent. The relative normalized deviations for structural entropy are higher for G than for C nucleotides. The entropy values are similar for Est-6 and psiEst-6 in the case of A and T but are lower for Est-6 in the case of G and C. The GC content in synonymous positions is uniformly higher in Est-6 than in psiEst-6, which agrees with the reduced GC content generally observed in pseudogenes and nonfunctional sequences. The observed differences in entropy and GC content reflect an evolutionary shift associated with the process of pseudogenization and subsequent functional divergence of psiEst-6 and Est-6 after the duplication event.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, USA
| | | | | | | |
Collapse
|
25
|
Balakirev ES, Ayala FJ. Nucleotide variation in the tinman and bagpipe homeobox genes of Drosophila melanogaster. Genetics 2005; 166:1845-56. [PMID: 15126403 PMCID: PMC1470812 DOI: 10.1534/genetics.166.4.1845] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The tinman (tin) and bagpipe (bap) genes are members of the NK homeobox gene family of Drosophila, so that tin occupies a higher position than bap in the regulatory hierarchy. Little is known about the level and pattern of genetic polymorphism in homeobox genes. We have analyzed nucleotide polymorphism in 27 strains of Drosophila melanogaster and one each of D. simulans and D. sechellia, within two closely linked regions encompassing a partial sequence of tin and the complete sequence of bap. The two genes exhibit different levels and patterns of nucleotide diversity. Two sets of sharply divergent sequence types are detected for tin. The haplotype structure of bap is more complex: about half of the sequences are identical (or virtually so), while the rest are fairly heterogeneous. The level of silent nucleotide variability is 0.0063 for tin but significantly higher, 0.0141, for bap, a level of polymorphism comparable to the most polymorphic structural genes of D. melanogaster. Recombination rate and gene conversion are also higher for bap than for tin. There is strong linkage disequilibrium, with the highest values in the introns of both genes and exon II of bap. The patterns of polymorphism in tin and bap are not compatible with an equilibrium model of selective neutrality. We suggest that negative selection and demographic history are the major factors shaping the pattern of nucleotide polymorphism in the tin and bap genes; moreover, there are clear indications of positive selection in the bap gene.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA
| | | |
Collapse
|
26
|
Balakirev ES, Ayala FJ. Nucleotide Variation in the tinman and bagpipe Homeobox Genes of Drosophila melanogaster. Genetics 2004. [DOI: 10.1093/genetics/166.4.1845] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Abstract
The tinman (tin) and bagpipe (bap) genes are members of the NK homeobox gene family of Drosophila, so that tin occupies a higher position than bap in the regulatory hierarchy. Little is known about the level and pattern of genetic polymorphism in homeobox genes. We have analyzed nucleotide polymorphism in 27 strains of Drosophila melanogaster and one each of D. simulans and D. sechellia, within two closely linked regions encompassing a partial sequence of tin and the complete sequence of bap. The two genes exhibit different levels and patterns of nucleotide diversity. Two sets of sharply divergent sequence types are detected for tin. The haplotype structure of bap is more complex: about half of the sequences are identical (or virtually so), while the rest are fairly heterogeneous. The level of silent nucleotide variability is 0.0063 for tin but significantly higher, 0.0141, for bap, a level of polymorphism comparable to the most polymorphic structural genes of D. melanogaster. Recombination rate and gene conversion are also higher for bap than for tin. There is strong linkage disequilibrium, with the highest values in the introns of both genes and exon II of bap. The patterns of polymorphism in tin and bap are not compatible with an equilibrium model of selective neutrality. We suggest that negative selection and demographic history are the major factors shaping the pattern of nucleotide polymorphism in the tin and bap genes; moreover, there are clear indications of positive selection in the bap gene.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525
- Institute of Marine Biology, Vladivostok 690041, Russia
- Academy of Ecology, Marine Biology, and Biotechnology, Far Eastern State University, Vladivostok 690600, Russia
| | - Francisco J Ayala
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525
| |
Collapse
|
27
|
Baudry E, Viginier B, Veuille M. Non-African populations of Drosophila melanogaster have a unique origin. Mol Biol Evol 2004; 21:1482-91. [PMID: 15014160 DOI: 10.1093/molbev/msh089] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Drosophila melanogaster is widely used as a model in DNA variation studies. Patterns of polymorphism have, however, been affected by the history of this species, which is thought to have recently spread out of Africa to the rest of the world. We analyzed DNA sequence variation in 11 populations, including four continental African and seven non-African samples (including Madagascar), at four independent X-linked loci. Variation patterns at all four loci followed neutral expectations in all African populations, but departed from it in all non-African ones due to a marked haplotype dimorphism at three out of four loci. We also found that all non-African populations show the same major haplotypes, though in various frequencies. A parsimonious explanation for these observations is that all non-African populations are derived from a single ancestral population having undergone a substantial reduction of polymorphism, probably through a bottleneck. Less likely alternatives involve either selection at all four loci simultaneously (including balancing selection at three of them), or admixture between two divergent populations. Small but significant structure was observed among African populations, and there were indications of differentiation across Eurasia for non-African ones. Since population history may result in non-equilibrium variation patterns, our study confirms that the search for footprints of selection in the D. melanogaster genome must include a sufficient understanding of its history.
Collapse
Affiliation(s)
- Emmanuelle Baudry
- Ecole Pratique des Hautes Etudes, Laboratoire d'Ecologie, Université Pierre et Marie Curie, Paris, France.
| | | | | |
Collapse
|
28
|
Abstract
Pseudogenes have been defined as nonfunctional sequences of genomic DNA originally derived from functional genes. It is therefore assumed that all pseudogene mutations are selectively neutral and have equal probability to become fixed in the population. Rather, pseudogenes that have been suitably investigated often exhibit functional roles, such as gene expression, gene regulation, generation of genetic (antibody, antigenic, and other) diversity. Pseudogenes are involved in gene conversion or recombination with functional genes. Pseudogenes exhibit evolutionary conservation of gene sequence, reduced nucleotide variability, excess synonymous over nonsynonymous nucleotide polymorphism, and other features that are expected in genes or DNA sequences that have functional roles. We first review the Drosophila literature and then extend the discussion to the various functional features identified in the pseudogenes of other organisms. A pseudogene that has arisen by duplication or retroposition may, at first, not be subject to natural selection if the source gene remains functional. Mutant alleles that incorporate new functions may, nevertheless, be favored by natural selection and will have enhanced probability of becoming fixed in the population. We agree with the proposal that pseudogenes be considered as potogenes, i.e., DNA sequences with a potentiality for becoming new genes.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | |
Collapse
|
29
|
Balakirev ES, Ayala FJ. Molecular population genetics of theβ-esterase gene cluster ofDrosophila melanogaster. J Genet 2003; 82:115-31. [PMID: 15133190 DOI: 10.1007/bf02715813] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
We have investigated nucleotide polymorphism at the beta-esterase gene cluster including the Est-6 gene and psiEst-6 putative pseudogene in four samples of Drosophila melanogaster derived from natural populations of southern Africa (Zimbabwe), Europe (Spain), North America (USA: California), and South America (Venezuela). A complex haplotype structure is revealed in both Est-6 and psiEst-6. Total nucleotide diversity is twice in psiEst-6 as in Est-6; diversity is higher in the African sample than in the non-African ones. Strong linkage disequilibrium occurs within the beta-esterase gene cluster in non-African samples, but not in the African one. Intragenic gene conversion events are detected within Est-6 and, to a much greater extent, within psiEst-6; intergenic gene conversion events are rare. Tests of neutrality with recombination are significant for the beta-esterase gene cluster in the non-African samples but not significant in the African one. We suggest that the demographic history (bottleneck and admixture of genetically differentiated populations) is the major factor shaping the pattern of nucleotide polymorphism in the beta-esterase gene cluster. However there are some 'footprints' of directional and balancing selection shaping specific distribution of nucleotide polymorphism within the cluster. Intergenic epistatic selection between Est-6 and psiEst-6 may play an important role in the evolution of the beta-esterase gene cluster preserving the putative pseudogene from degenerative destruction and reflecting possible functional interaction between the functional gene and the putative pseudogene. Est-6 and psiEst-6 may represent an indivisible intergenic complex ('intergene') in which each single component (Est-6 or psiEst-6) cannot separately carry out the full functional role.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697-2525, USA
| | | |
Collapse
|
30
|
Balakirev ES, Ayala FJ. Nucleotide Variation of theEst-6Gene Region in Natural Populations ofDrosophila melanogaster. Genetics 2003; 165:1901-14. [PMID: 14704175 PMCID: PMC1462900 DOI: 10.1093/genetics/165.4.1901] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
AbstractWe have investigated nucleotide polymorphism in the Est-6 gene region in four samples of Drosophila melanogaster derived from natural populations of East Africa (Zimbabwe), Europe (Spain), North America (California), and South America (Venezuela). There are two divergent sequence types in the North and South American samples, which are not perfectly (North America) or not at all (South America) associated with the Est-6 allozyme variation. Less pronounced or no sequence dimorphism occurs in the European and African samples, respectively. The level of nucleotide diversity is highest in the African sample, lower (and similar to each other) in the samples from Europe and North America, and lowest in the sample from South America. The extent of linkage disequilibrium is low in Africa (1.23% significant associations), but much higher in non-African populations (22.59, 21.45, and 37.68% in Europe, North America, and South America, respectively). Tests of neutrality with recombination are significant in non-African samples but not significant in the African sample. We propose that demographic history (bottleneck and admixture of genetically different populations) is the major factor shaping the nucleotide patterns in the Est-6 gene region. However, positive selection modifies the pattern: balanced selection creates elevated levels of nucleotide variation around functionally important (target) polymorphic sites (RsaI–/RsaI+ in the promoter region and F/S in the coding region) in both African and non-African samples; and directional selection, acting during the geographic expansion phase of D. melanogaster, creates an excess of very similar sequences (RsaI– and S allelic lineages, in the promoter and coding regions, respectively) in the non-African samples.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA
| | | |
Collapse
|