1
|
Kılıç S, Sánchez-Osuna M, Collado-Padilla A, Barbé J, Erill I. Flexible comparative genomics of prokaryotic transcriptional regulatory networks. BMC Genomics 2020; 21:466. [PMID: 33327941 PMCID: PMC7739468 DOI: 10.1186/s12864-020-06838-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Accepted: 06/16/2020] [Indexed: 11/25/2022] Open
Abstract
Background Comparative genomics methods enable the reconstruction of bacterial regulatory networks using available experimental data. In spite of their potential for accelerating research into the composition and evolution of bacterial regulons, few comparative genomics suites have been developed for the automated analysis of these regulatory systems. Available solutions typically rely on precomputed databases for operon and ortholog predictions, limiting the scope of analyses to processed complete genomes, and several key issues such as the transfer of experimental information or the integration of regulatory information in a probabilistic setting remain largely unaddressed. Results Here we introduce CGB, a flexible platform for comparative genomics of prokaryotic regulons. CGB has few external dependencies and enables fully customized analyses of newly available genome data. The platform automates the merging of experimental information and uses a gene-centered, Bayesian framework to generate and integrate easily interpretable results. We demonstrate its flexibility and power by analyzing the evolution of type III secretion system regulation in pathogenic Proteobacteria and by characterizing the SOS regulon of a new bacterial phylum, the Balneolaeota. Conclusions Our results demonstrate the applicability of the CGB pipeline in multiple settings. CGB’s ability to automatically integrate experimental information from multiple sources and use complete and draft genomic data, coupled with its non-reliance on precomputed databases and its easily interpretable display of gene-centered posterior probabilities of regulation provide users with an unprecedented level of flexibility in launching comparative genomics analyses of prokaryotic transcriptional regulatory networks. The analyses of type III secretion and SOS response regulatory networks illustrate instances of convergent and divergent evolution of these regulatory systems, showcasing the power of formal ancestral state reconstruction at inferring the evolutionary history of regulatory networks.
Collapse
Affiliation(s)
- Sefa Kılıç
- University of Maryland Baltimore County, Baltimore, MD, 21250, USA
| | | | | | - Jordi Barbé
- Universitat Autònoma de Barcelona, 08193, Bellaterra, Spain
| | - Ivan Erill
- University of Maryland Baltimore County, Baltimore, MD, 21250, USA.
| |
Collapse
|
2
|
Sánchez-Osuna M, Barbé J, Erill I. Comparative genomics of the DNA damage-inducible network in the Patescibacteria. Environ Microbiol 2017; 19:3465-3474. [DOI: 10.1111/1462-2920.13826] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2017] [Accepted: 06/09/2017] [Indexed: 11/28/2022]
Affiliation(s)
- Miquel Sánchez-Osuna
- Departament de Genètica i de Microbiologia; Universitat Autònoma de Barcelona; Spain
| | - Jordi Barbé
- Departament de Genètica i de Microbiologia; Universitat Autònoma de Barcelona; Spain
| | - Ivan Erill
- Department of Biological Sciences; University of Maryland Baltimore County; Baltimore Maryland USA
| |
Collapse
|
3
|
Erill I, Campoy S, Kılıç S, Barbé J. The Verrucomicrobia LexA-Binding Motif: Insights into the Evolutionary Dynamics of the SOS Response. Front Mol Biosci 2016; 3:33. [PMID: 27489856 PMCID: PMC4951493 DOI: 10.3389/fmolb.2016.00033] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2016] [Accepted: 07/04/2016] [Indexed: 12/20/2022] Open
Abstract
The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division, and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls.
Collapse
Affiliation(s)
- Ivan Erill
- Erill Lab, Department of Biological Sciences, University of Maryland Baltimore County Baltimore, MD, USA
| | - Susana Campoy
- Unitat de Microbiologia, Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona Barcelona, Spain
| | - Sefa Kılıç
- Erill Lab, Department of Biological Sciences, University of Maryland Baltimore County Baltimore, MD, USA
| | - Jordi Barbé
- Unitat de Microbiologia, Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona Barcelona, Spain
| |
Collapse
|
4
|
An SOS Regulon under Control of a Noncanonical LexA-Binding Motif in the Betaproteobacteria. J Bacteriol 2015; 197:2622-30. [PMID: 25986903 DOI: 10.1128/jb.00035-15] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2015] [Accepted: 05/09/2015] [Indexed: 11/20/2022] Open
Abstract
UNLABELLED The SOS response is a transcriptional regulatory network governed by the LexA repressor that activates in response to DNA damage. In the Betaproteobacteria, LexA is known to target a palindromic sequence with the consensus sequence CTGT-N8-ACAG. We report the characterization of a LexA regulon in the iron-oxidizing betaproteobacterium Sideroxydans lithotrophicus. In silico and in vitro analyses show that LexA targets six genes by recognizing a binding motif with the consensus sequence GAACGaaCGTTC, which is strongly reminiscent of the Bacillus subtilis LexA-binding motif. We confirm that the closely related Gallionella capsiferriformans shares the same LexA-binding motif, and in silico analyses indicate that this motif is also conserved in the Nitrosomonadales and the Methylophilales. Phylogenetic analysis of LexA and the alpha subunit of DNA polymerase III (DnaE) reveal that the organisms harboring this noncanonical LexA form a compact taxonomic cluster within the Betaproteobacteria. However, their lexA gene is unrelated to the standard Betaproteobacteria lexA, and there is evidence of its spread through lateral gene transfer. In contrast to other reported cases of noncanonical LexA-binding motifs, the regulon of S. lithotrophicus is comparable in size and function to that of many other Betaproteobacteria, suggesting that a convergent SOS regulon has reevolved under the control of a new LexA protein. Analysis of the DNA-binding domain of S. lithotrophicus LexA reveals little sequence similarity with that of other LexA proteins targeting similar binding motifs, suggesting that network structure may limit site evolution or that structural constrains make the B. subtilis-type motif an optimal interface for multiple LexA sequences. IMPORTANCE Understanding the evolution of transcriptional systems enables us to address important questions in microbiology, such as the emergence and transfer potential of different regulatory systems to regulate virulence or mediate responses to stress. The results reported here constitute the first characterization of a noncanonical LexA protein regulating a standard SOS regulon. This is significant because it illustrates how a complex transcriptional program can be put under the control of a novel transcriptional regulator. Our results also reveal a substantial degree of plasticity in the LexA recognition domain, raising intriguing questions about the space of protein-DNA interfaces and the specific evolutionary constrains faced by these elements.
Collapse
|
5
|
Meng L, Alter T, Aho T, Huehn S. Gene expression profiles of Vibrio parahaemolyticus in viable but non-culturable state. FEMS Microbiol Ecol 2015; 91:fiv035. [DOI: 10.1093/femsec/fiv035] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/25/2015] [Indexed: 11/13/2022] Open
|
6
|
Baharoglu Z, Mazel D. SOS, the formidable strategy of bacteria against aggressions. FEMS Microbiol Rev 2014; 38:1126-45. [PMID: 24923554 DOI: 10.1111/1574-6976.12077] [Citation(s) in RCA: 272] [Impact Index Per Article: 24.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Revised: 05/01/2014] [Accepted: 05/07/2014] [Indexed: 12/22/2022] Open
Abstract
The presence of an abnormal amount of single-stranded DNA in the bacterial cell constitutes a genotoxic alarm signal that induces the SOS response, a broad regulatory network found in most bacterial species to address DNA damage. The aim of this review was to point out that beyond being a repair process, SOS induction leads to a very strong but transient response to genotoxic stress, during which bacteria can rearrange and mutate their genome, induce several phenotypic changes through differential regulation of genes, and sometimes acquire characteristics that potentiate bacterial survival and adaptation to changing environments. We review here the causes and consequences of SOS induction, but also how this response can be modulated under various circumstances and how it is connected to the network of other important stress responses. In the first section, we review articles describing the induction of the SOS response at the molecular level. The second section discusses consequences of this induction in terms of DNA repair, changes in the genome and gene expression, and sharing of genomic information, with their effects on the bacteria's life and evolution. The third section is about the fine tuning of this response to fit with the bacteria's 'needs'. Finally, we discuss recent findings linking the SOS response to other stress responses. Under these perspectives, SOS can be perceived as a powerful bacterial strategy against aggressions.
Collapse
Affiliation(s)
- Zeynep Baharoglu
- Institut Pasteur, Département Génomes et Génétique, Unité Plasticité du Génome Bactérien, Paris, France; CNRS, UMR3525, Paris, France
| | | |
Collapse
|
7
|
Fonseca LS, da Silva JB, Milanez JS, Monteiro-Vitorello CB, Momo L, de Morais ZM, Vasconcellos SA, Marques MV, Ho PL, da Costa RMA. Leptospira interrogans serovar copenhageni harbors two lexA genes involved in SOS response. PLoS One 2013; 8:e76419. [PMID: 24098496 PMCID: PMC3789691 DOI: 10.1371/journal.pone.0076419] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2012] [Accepted: 08/28/2013] [Indexed: 11/24/2022] Open
Abstract
Bacteria activate a regulatory network in response to the challenges imposed by DNA damage to genetic material, known as the SOS response. This system is regulated by the RecA recombinase and by the transcriptional repressor lexA. Leptospira interrogans is a pathogen capable of surviving in the environment for weeks, being exposed to a great variety of stress agents and yet retaining its ability to infect the host. This study aims to investigate the behavior of L. interrogans serovar Copenhageni after the stress induced by DNA damage. We show that L. interrogans serovar Copenhageni genome contains two genes encoding putative LexA proteins (lexA1 and lexA2) one of them being potentially acquired by lateral gene transfer. Both genes are induced after DNA damage, but the steady state levels of both LexA proteins drop, probably due to auto-proteolytic activity triggered in this condition. In addition, seven other genes were up-regulated following UV-C irradiation, recA, recN, dinP, and four genes encoding hypothetical proteins. This set of genes is potentially regulated by LexA1, as it showed binding to their promoter regions. All these regions contain degenerated sequences in relation to the previously described SOS box, TTTGN 5CAAA. On the other hand, LexA2 was able to bind to the palindrome TTGTAN10TACAA, found in its own promoter region, but not in the others. Therefore, the L. interrogans serovar Copenhageni SOS regulon may be even more complex, as a result of LexA1 and LexA2 binding to divergent motifs. New possibilities for DNA damage response in Leptospira are expected, with potential influence in other biological responses such as virulence.
Collapse
Affiliation(s)
- Luciane S Fonseca
- Centro de Biotecnologia, Instituto Butantan, São Paulo, Brazil ; Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, Brazil
| | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Sanchez-Alberola N, Campoy S, Barbé J, Erill I. Analysis of the SOS response of Vibrio and other bacteria with multiple chromosomes. BMC Genomics 2012; 13:58. [PMID: 22305460 PMCID: PMC3323433 DOI: 10.1186/1471-2164-13-58] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2011] [Accepted: 02/03/2012] [Indexed: 12/18/2022] Open
Abstract
Background The SOS response is a well-known regulatory network present in most bacteria and aimed at addressing DNA damage. It has also been linked extensively to stress-induced mutagenesis, virulence and the emergence and dissemination of antibiotic resistance determinants. Recently, the SOS response has been shown to regulate the activity of integrases in the chromosomal superintegrons of the Vibrionaceae, which encompasses a wide range of pathogenic species harboring multiple chromosomes. Here we combine in silico and in vitro techniques to perform a comparative genomics analysis of the SOS regulon in the Vibrionaceae, and we extend the methodology to map this transcriptional network in other bacterial species harboring multiple chromosomes. Results Our analysis provides the first comprehensive description of the SOS response in a family (Vibrionaceae) that includes major human pathogens. It also identifies several previously unreported members of the SOS transcriptional network, including two proteins of unknown function. The analysis of the SOS response in other bacterial species with multiple chromosomes uncovers additional regulon members and reveals that there is a conserved core of SOS genes, and that specialized additions to this basic network take place in different phylogenetic groups. Our results also indicate that across all groups the main elements of the SOS response are always found in the large chromosome, whereas specialized additions are found in the smaller chromosomes and plasmids. Conclusions Our findings confirm that the SOS response of the Vibrionaceae is strongly linked with pathogenicity and dissemination of antibiotic resistance, and suggest that the characterization of the newly identified members of this regulon could provide key insights into the pathogenesis of Vibrio. The persistent location of key SOS genes in the large chromosome across several bacterial groups confirms that the SOS response plays an essential role in these organisms and sheds light into the mechanisms of evolution of global transcriptional networks involved in adaptability and rapid response to environmental changes, suggesting that small chromosomes may act as evolutionary test beds for the rewiring of transcriptional networks.
Collapse
Affiliation(s)
- Neus Sanchez-Alberola
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
| | | | | | | |
Collapse
|
9
|
RecA-independent DNA damage induction of Mycobacterium tuberculosis ruvC despite an appropriately located SOS box. J Bacteriol 2009; 192:599-603. [PMID: 19915023 DOI: 10.1128/jb.01066-09] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Mycobacterium tuberculosis ruvC was induced by DNA damage in a DeltarecA strain despite having an appropriately positioned SOS box to which LexA binds in vitro. An inducible transcript start mapped within the SOS box, and transcriptional fusions identified the promoter. Disruption of the SOS box did not prevent induction, indicating that an alternative mechanism plays a significant role in the control of ruvC expression.
Collapse
|
10
|
Three genomes from the phylum Acidobacteria provide insight into the lifestyles of these microorganisms in soils. Appl Environ Microbiol 2009; 75:2046-56. [PMID: 19201974 DOI: 10.1128/aem.02294-08] [Citation(s) in RCA: 481] [Impact Index Per Article: 30.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The complete genomes of three strains from the phylum Acidobacteria were compared. Phylogenetic analysis placed them as a unique phylum. They share genomic traits with members of the Proteobacteria, the Cyanobacteria, and the Fungi. The three strains appear to be versatile heterotrophs. Genomic and culture traits indicate the use of carbon sources that span simple sugars to more complex substrates such as hemicellulose, cellulose, and chitin. The genomes encode low-specificity major facilitator superfamily transporters and high-affinity ABC transporters for sugars, suggesting that they are best suited to low-nutrient conditions. They appear capable of nitrate and nitrite reduction but not N(2) fixation or denitrification. The genomes contained numerous genes that encode siderophore receptors, but no evidence of siderophore production was found, suggesting that they may obtain iron via interaction with other microorganisms. The presence of cellulose synthesis genes and a large class of novel high-molecular-weight excreted proteins suggests potential traits for desiccation resistance, biofilm formation, and/or contribution to soil structure. Polyketide synthase and macrolide glycosylation genes suggest the production of novel antimicrobial compounds. Genes that encode a variety of novel proteins were also identified. The abundance of acidobacteria in soils worldwide and the breadth of potential carbon use by the sequenced strains suggest significant and previously unrecognized contributions to the terrestrial carbon cycle. Combining our genomic evidence with available culture traits, we postulate that cells of these isolates are long-lived, divide slowly, exhibit slow metabolic rates under low-nutrient conditions, and are well equipped to tolerate fluctuations in soil hydration.
Collapse
|
11
|
Riaz K, Elmerich C, Moreira D, Raffoux A, Dessaux Y, Faure D. A metagenomic analysis of soil bacteria extends the diversity of quorum-quenching lactonases. Environ Microbiol 2008; 10:560-70. [PMID: 18201196 DOI: 10.1111/j.1462-2920.2007.01475.x] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
A metagenomic library of 10,121 clones, generated from bacteria inhabiting a pasture soil from France, was screened for the presence of fosmids conferring either N-acylhomoserine lactone (NAHL) synthesis or NAHL degradation ability upon their Escherichia coli host. No clone producing NAHLs was identified whereas one, containing a 31 972 bp insert in fosmid p2H8, allowed NAHL degradation. This led to the cloning and identification of a gene, qlcA, encoding an NAHL-lactonase activity, as judged by lactone-ring closure and HPLC/MS analyses of NAHL degradation products. The qlcA gene efficiently quenched quorum-sensing regulated pathogenic functions when expressed in Pectobacterium carotovorum. The QlcA peptide belongs to the family of zinc-dependent metallohydrolases and appears to be distantly related to other NAHL-lactonases discovered in Agrobacterium, Bacillus, Photorhabdus and Rhizobium. In-silico analysis of the metagenomic insert revealed the occurrence of 20 orf, with a constant GC% and codon usage, suggesting a unique bacterial origin. Nine out of these 20 orf were homologous to genes encoding biosynthesis of arginine; they were clustered with an unusual succession argFJADBCRGH. The fosmid p2H8 is able to complement the argA, argB and argC mutants in E. coli. Phylogenetic analysis showed that 9 orf out of 20 were related to sequences from members of the Acidobacteria, supporting the hypothesis that the analysed insert might be originated from an organism related to this phylum.
Collapse
Affiliation(s)
- Kashif Riaz
- Institut des Sciences du Végétal, Centre National de la Recherche Scientifique, UPR2355, Gif-sur-Yvette, France
| | | | | | | | | | | |
Collapse
|
12
|
Abstract
The SOS regulon is a paradigm of bacterial responses to DNA damage. A wide variety of bacterial species possess homologs of lexA and recA, the central players in the regulation of the SOS circuit. Nevertheless, the genes actually regulated by the SOS have been determined only experimentally in a few bacterial species. In this work, we describe 37 genes regulated in a LexA-dependent manner in the alphaproteobacterium Caulobacter crescentus. In agreement with previous results, we have found that the direct repeat GTTCN7GTTC is the SOS operator of C. crescentus, which was confirmed by site-directed mutagenesis studies of the imuA promoter. Several potential promoter regions containing the SOS operator were identified in the genome, and the expression of the corresponding genes was analyzed for both the wild type and the lexA strain, demonstrating that the vast majority of these genes are indeed SOS regulated. Interestingly, many of these genes encode proteins with unknown functions, revealing the potential of this approach for the discovery of novel genes involved in cellular responses to DNA damage in prokaryotes, and illustrating the diversity of SOS-regulated genes among different bacterial species.
Collapse
|
13
|
Erill I, Campoy S, Barbé J. Aeons of distress: an evolutionary perspective on the bacterial SOS response. FEMS Microbiol Rev 2007; 31:637-56. [PMID: 17883408 DOI: 10.1111/j.1574-6976.2007.00082.x] [Citation(s) in RCA: 249] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
The SOS response of bacteria is a global regulatory network targeted at addressing DNA damage. Governed by the products of the lexA and recA genes, it co-ordinates a comprehensive response against DNA lesions and its description in Escherichia coli has stood for years as a textbook paradigm of stress-response systems in bacteria. In this paper we review the current state of research on the SOS response outside E. coli. By retracing research on the identification of multiple diverging LexA-binding motifs across the Bacteria Domain, we show how this work has led to the description of a minimum regulon core, but also of a heterogeneous collection of SOS regulatory networks that challenges many tenets of the E. coli model. We also review recent attempts at reconstructing the evolutionary history of the SOS network that have cast new light on the SOS response. Exploiting the newly gained knowledge on LexA-binding motifs and the tight association of LexA with a recently described mutagenesis cassette, these works put forward likely evolutionary scenarios for the SOS response, and we discuss their relevance on the ultimate nature of this stress-response system and the evolutionary pressures driving its evolution.
Collapse
Affiliation(s)
- Ivan Erill
- Biomedical Applications Group, Centro Nacional de Microelectrónica, Barcelona, Spain
| | | | | |
Collapse
|
14
|
Enikeeva FN, Kotelnikova EA, Gelfand MS, Makeev VJ. A model of evolution with constant selective pressure for regulatory DNA sites. BMC Evol Biol 2007; 7:125. [PMID: 17662135 PMCID: PMC1978210 DOI: 10.1186/1471-2148-7-125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2007] [Accepted: 07/27/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Molecular evolution is usually described assuming a neutral or weakly non-neutral substitution model. Recently, new data have become available on evolution of sequence regions under a selective pressure, e.g. transcription factor binding sites. To reconstruct the evolutionary history of such sequences, one needs evolutionary models that take into account a substantial constant selective pressure. RESULTS We present a simple evolutionary model with a single preferred (consensus) nucleotide and the neutral substitution model adopted for all other nucleotides. This evolutionary model has a rate matrix in which all substitutions that do not involve the consensus nucleotide occur with the same rate. The model has two time scales for achieving a stationary distribution; in the general case only one of the two rate parameters can be evaluated from the stationary distribution. In the middle-time zone, a counterintuitive behavior was observed for some parameter values, with a probability of conservation for a non-consensus nucleotide greater than that for the consensus nucleotide. Such an effect can be observed only in the case of weak preference for the consensus nucleotide, when the probability to observe the consensus nucleotide in the stationary distribution is less than 1/2. If the substitution rate is represented as a product of mutation and fixation, only the fixation can be calculated from the stationary distribution. The exhibited conservation of non-consensus nucleotides does not take place if the elements of mutation matrix are identical, and can be related to the reduced mutation rate between the non-consensus nucleotides. This bias can have no effect on the stationary distribution of nucleotide frequencies calculated over the ensemble of multiple alignments, e.g. transcription factor binding sites upstream of different sets of co-regulated orthologous genes. CONCLUSION The derived model can be used as a null model when analyzing the evolution of orthologous transcription factor binding sites. In particular, our findings show that a nucleotide preferred at some position of a multiple alignment of binding sites for some transcription factor in the same genome is not necessarily the most conserved nucleotide in an alignment of orthologous sites from different species. However, this effect can take place only in the case of a mutation matrix whose elements are not identical.
Collapse
Affiliation(s)
- Farida N Enikeeva
- Institute for Information Transmission Problems (the Kharkevich Institute) of RAS, Bolshoi Karetny pereulok, 19, GSP-4, Moscow, 127994, Russia
| | - Ekaterina A Kotelnikova
- State Research Institute of Genetics and Selection of Industrial Microorganisms, 1st Dorozhnyj proezd, 1, Moscow, 113535, Russia
- Ariadne Genomics Inc. 9700 Great Seneca Highway, Suite 113, Rockville, MD 20850, USA
| | - Mikhail S Gelfand
- Institute for Information Transmission Problems (the Kharkevich Institute) of RAS, Bolshoi Karetny pereulok, 19, GSP-4, Moscow, 127994, Russia
- Faculty of Bioengineering and Bioinformatics, Moscow State University, Vorobyevy Gory 1-73, Moscow, 119992, Russia
| | - Vsevolod J Makeev
- State Research Institute of Genetics and Selection of Industrial Microorganisms, 1st Dorozhnyj proezd, 1, Moscow, 113535, Russia
- Engelgardt Institute of Molecular Biology of RAS, Vavilova 32, Moscow, 119991, Russia
| |
Collapse
|