1
|
Wassenaar TM, Harville T, Chastain J, Wanchai V, Ussery DW. DNA structural features and variability of complete MHC locus sequences. FRONTIERS IN BIOINFORMATICS 2024; 4:1392613. [PMID: 39022183 PMCID: PMC11251971 DOI: 10.3389/fbinf.2024.1392613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Accepted: 06/07/2024] [Indexed: 07/20/2024] Open
Abstract
The major histocompatibility (MHC) locus, also known as the Human Leukocyte Antigen (HLA) genes, is located on the short arm of chromosome 6, and contains three regions (Class I, Class II and Class III). This 5 Mbp locus is one of the most variable regions of the human genome, yet it also encodes a set of highly conserved and important proteins related to immunological response. Genetic variations in this region are responsible for more diseases than in the entire rest of the human genome. However, information on local structural features of the DNA is largely ignored. With recent advances in long-read sequencing technology, it is now becoming possible to sequence the entire 5 Mbp MHC locus, producing complete diploid haplotypes of the whole region. Here, we describe structural maps based on the complete sequences from six different homozygous HLA cell lines. We find long-range structural variability in the different sequences for DNA stacking energy, position preference and curvature, variation in repeats, as well as more local changes in regions forming open chromatin structures, likely to influence gene expression levels. These structural maps can be useful in visualizing large scale structural variation across HLA types, in particular when this can be complemented with epigenetic signals.
Collapse
Affiliation(s)
| | - Terry Harville
- Department of Pathology and Laboratory Services, and Department of Internal Medicine, Division of Hematology, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | - Jonathan Chastain
- Department of Pediatrics, The University of Arkansas for Medical Sciences University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | - Visanu Wanchai
- Myeloma Center, Winthrop P. Rockefeller Institute, Department of Internal Medicine, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | - David W. Ussery
- Department of BioMedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| |
Collapse
|
2
|
Zaritsky A, Rabinovitch A, Liu C, Woldringh CL. Does the eclipse limit bacterial nucleoid complexity and cell width? Synth Syst Biotechnol 2017; 2:267-275. [PMID: 29552651 PMCID: PMC5851910 DOI: 10.1016/j.synbio.2017.11.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Revised: 11/07/2017] [Accepted: 11/07/2017] [Indexed: 12/19/2022] Open
Abstract
Cell size of bacteria M is related to 3 temporal parameters: chromosome replication time C, period from replication-termination to subsequent division D, and doubling time τ. Steady-state, bacillary cells grow exponentially by extending length L only, but their constant width W is larger at shorter τ's or longer C's, in proportion to the number of chromosome replication positions n (= C/τ), at least in Escherichia coli and Salmonella typhimurium. Extending C by thymine limitation of fast-growing thyA mutants result in continuous increase of M, associated with rising W, up to a limit before branching. A set of such puzzling observations is qualitatively consistent with the view that the actual cell mass (or volume) at the time of replication-initiation Mi (or Vi), usually relatively constant in growth at varying τ's, rises with time under thymine limitation of fast-growing, thymine-requiring E. coli strains. The hypothesis will be tested that presumes existence of a minimal distance lmin between successive moving replisomes, translated into the time needed for a replisome to reach lmin before a new replication-initiation at oriC is allowed, termed Eclipse E. Preliminary analysis of currently available data is inconsistent with a constant E under all conditions, hence other explanations and ways to test them are proposed in an attempt to elucidate these and other results. The complex hypothesis takes into account much of what is currently known about Bacterial Physiology: the relationships between cell dimensions, growth and cycle parameters, particularly nucleoid structure, replication and position, and the mode of peptidoglycan biosynthesis. Further experiments are mentioned that are necessary to test the discussed ideas and hypotheses.
Collapse
Affiliation(s)
- Arieh Zaritsky
- Faculty of Natural Sciences, Ben-Gurion University of the Negev, POB 653, Be'er-Sheva, 84105, Israel
| | - Avinoam Rabinovitch
- Faculty of Natural Sciences, Ben-Gurion University of the Negev, POB 653, Be'er-Sheva, 84105, Israel
| | - Chenli Liu
- Center for Synthetic Biology Engineering Research, Shenzhen Institutes of Advanced Technology (SIAT), Chinese Academy of Sciences, Shenzhen, PR China
| | - Conrad L Woldringh
- Bacterial Cell Biology, SILS, Boelelaan 1108, Amsterdam, The Netherlands
| |
Collapse
|
3
|
A genome-wide identification of genes undergoing recombination and positive selection in Neisseria. BIOMED RESEARCH INTERNATIONAL 2014; 2014:815672. [PMID: 25180194 PMCID: PMC4142384 DOI: 10.1155/2014/815672] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/02/2014] [Revised: 07/18/2014] [Accepted: 07/18/2014] [Indexed: 01/01/2023]
Abstract
Currently, there is particular interest in the molecular mechanisms of adaptive evolution in bacteria. Neisseria is a genus of gram negative bacteria, and there has recently been considerable focus on its two human pathogenic species N. meningitidis and N. gonorrhoeae. Until now, no genome-wide studies have attempted to scan for the genes related to adaptive evolution. For this reason, we selected 18 Neisseria genomes (14 N. meningitidis, 3 N. gonorrhoeae and 1 commensal N. lactamics) to conduct a comparative genome analysis to obtain a comprehensive understanding of the roles of natural selection and homologous recombination throughout the history of adaptive evolution. Among the 1012 core orthologous genes, we identified 635 genes with recombination signals and 10 genes that showed significant evidence of positive selection. Further functional analyses revealed that no functional bias was found in the recombined genes. Positively selected genes are prone to DNA processing and iron uptake, which are essential for the fundamental life cycle. Overall, the results indicate that both recombination and positive selection play crucial roles in the adaptive evolution of Neisseria genomes. The positively selected genes and the corresponding amino acid sites provide us with valuable targets for further research into the detailed mechanisms of adaptive evolution in Neisseria.
Collapse
|
4
|
Ying BW, Tsuru S, Seno S, Matsuda H, Yomo T. Gene expression scaled by distance to the genome replication site. MOLECULAR BIOSYSTEMS 2013; 10:375-9. [PMID: 24336896 DOI: 10.1039/c3mb70254e] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
A simple mode of gene expression scaled by the distance from the chromosomal location of the gene to the genome replication site oriC was determined. The common formula representing the effect of genomic position on expression capacity not only supports the multifork replication model but also provides a base correlation for theoretical simulation and synthetic constructs.
Collapse
Affiliation(s)
- Bei-Wen Ying
- Graduate School of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki 305-8572, Japan
| | | | | | | | | |
Collapse
|
5
|
Mrázek J, Chaudhari T, Basu A. PerPlot & PerScan: tools for analysis of DNA curvature-related periodicity in genomic nucleotide sequences. MICROBIAL INFORMATICS AND EXPERIMENTATION 2011; 1:13. [PMID: 22587738 PMCID: PMC3372288 DOI: 10.1186/2042-5783-1-13] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/01/2011] [Accepted: 11/28/2011] [Indexed: 04/12/2023]
Abstract
Background Periodic spacing of short adenine or thymine runs phased with DNA helical period of ~10.5 bp is associated with intrinsic DNA curvature and deformability, which play important roles in DNA-protein interactions and in the organization of chromosomes in both eukaryotes and prokaryotes. Local differences in DNA sequence periodicity have been linked to differences in gene expression in some organisms. Despite the significance of these periodic patterns, there are virtually no publicly accessible tools for their analysis. Results We present novel tools suitable for assessments of DNA curvature-related sequence periodicity in nucleotide sequences at the genome scale. Utility of the present software is demonstrated on a comparison of sequence periodicities in the genomes of Haemophilus influenzae, Methanocaldococcus jannaschii, Saccharomyces cerevisiae, and Arabidopsis thaliana. The software can be accessed through a web interface and the programs are also available for download. Conclusions The present software is suitable for comparing DNA curvature-related sequence periodicity among different genomes as well as for analysis of intrachromosomal heterogeneity of the sequence periodicity. It provides a quick and convenient way to detect anomalous regions of chromosomes that could have unusual structural and functional properties and/or distinct evolutionary history.
Collapse
Affiliation(s)
- Jan Mrázek
- Department of Microbiology and Institute of Bioinformatics, University of Georgia, Athens, GA 30602-2605, USA.
| | | | | |
Collapse
|
6
|
Complete genome sequence of the marine fish pathogen Vibrio anguillarum harboring the pJM1 virulence plasmid and genomic comparison with other virulent strains of V. anguillarum and V. ordalii. Infect Immun 2011; 79:2889-900. [PMID: 21576332 DOI: 10.1128/iai.05138-11] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
We dissected the complete genome sequence of the O1 serotype strain Vibrio anguillarum 775(pJM1) and determined the draft genomic sequences of plasmidless strains of serotype O1 (strain 96F) and O2β (strain RV22) and V. ordalii. All strains harbor two chromosomes, but 775 also harbors the virulence plasmid pJM1, which carries the anguibactin-producing and cognate transport genes, one of the main virulence factors of V. anguillarum. Genomic analysis identified eight genomic islands in chromosome 1 of V. anguillarum 775(pJM1) and two in chromosome 2. Some of them carried potential virulence genes for the biosynthesis of O antigens, hemolysins, and exonucleases as well as others for sugar transport and metabolism. The majority of genes for essential cell functions and pathogenicity are located on chromosome 1. In contrast, chromosome 2 contains a larger fraction (59%) of hypothetical genes than does chromosome 1 (42%). Chromosome 2 also harbors a superintegron, as well as host "addiction" genes that are typically found on plasmids. Unique distinctive properties include homologues of type III secretion system genes in 96F, homologues of V. cholerae zot and ace toxin genes in RV22, and the biofilm formation syp genes in V. ordalii. Mobile genetic elements, some of them possibly originated in the pJM1 plasmid, were very abundant in 775, resulting in the silencing of specific genes, with only few insertions in the 96F and RV22 chromosomes.
Collapse
|
7
|
Söllner J, Heinzel A, Summer G, Fechete R, Stipkovits L, Szathmary S, Mayer B. Concept and application of a computational vaccinology workflow. Immunome Res 2010; 6 Suppl 2:S7. [PMID: 21067549 PMCID: PMC2981879 DOI: 10.1186/1745-7580-6-s2-s7] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The last years have seen a renaissance of the vaccine area, driven by clinical needs in infectious diseases but also chronic diseases such as cancer and autoimmune disorders. Equally important are technological improvements involving nano-scale delivery platforms as well as third generation adjuvants. In parallel immunoinformatics routines have reached essential maturity for supporting central aspects in vaccinology going beyond prediction of antigenic determinants. On this basis computational vaccinology has emerged as a discipline aimed at ab-initio rational vaccine design.Here we present a computational workflow for implementing computational vaccinology covering aspects from vaccine target identification to functional characterization and epitope selection supported by a Systems Biology assessment of central aspects in host-pathogen interaction. We exemplify the procedures for Epstein Barr Virus (EBV), a clinically relevant pathogen causing chronic infection and suspected of triggering malignancies and autoimmune disorders. RESULTS We introduce pBone/pView as a computational workflow supporting design and execution of immunoinformatics workflow modules, additionally involving aspects of results visualization, knowledge sharing and re-use. Specific elements of the workflow involve identification of vaccine targets in the realm of a Systems Biology assessment of host-pathogen interaction for identifying functionally relevant targets, as well as various methodologies for delineating B- and T-cell epitopes with particular emphasis on broad coverage of viral isolates as well as MHC alleles.Applying the workflow on EBV specifically proposes sequences from the viral proteins LMP2, EBNA2 and BALF4 as vaccine targets holding specific B- and T-cell epitopes promising broad strain and allele coverage. CONCLUSION Based on advancements in the experimental assessment of genomes, transcriptomes and proteomes for both, pathogen and (human) host, the fundaments for rational design of vaccines have been laid out. In parallel, immunoinformatics modules have been designed and successfully applied for supporting specific aspects in vaccine design. Joining these advancements, further complemented by novel vaccine formulation and delivery aspects, have paved the way for implementing computational vaccinology for rational vaccine design tackling presently unmet vaccine challenges.
Collapse
Affiliation(s)
- Johannes Söllner
- emergentec biodevelopment GmbH, Rathausstrasse 5/3, 1010 Vienna, Austria
| | - Andreas Heinzel
- emergentec biodevelopment GmbH, Rathausstrasse 5/3, 1010 Vienna, Austria
- University of Applied Sciences, Softwarepark 11, 4232 Hagenberg, Austria
| | - Georg Summer
- University of Applied Sciences, Softwarepark 11, 4232 Hagenberg, Austria
| | - Raul Fechete
- emergentec biodevelopment GmbH, Rathausstrasse 5/3, 1010 Vienna, Austria
| | | | - Susan Szathmary
- Galenbio Kft., Erdőszél köz 21, 1037 Budapest, Hungary and GalenBio, Inc., 5922 Farnsworth Ct, Carlsbad, CA 92008, USA
| | - Bernd Mayer
- emergentec biodevelopment GmbH, Rathausstrasse 5/3, 1010 Vienna, Austria
- Institute for Theoretical Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
| |
Collapse
|
8
|
Comparative analysis of sequence periodicity among prokaryotic genomes points to differences in nucleoid structure and a relationship to gene expression. J Bacteriol 2010; 192:3763-72. [PMID: 20494989 DOI: 10.1128/jb.00149-10] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Regular spacing of short runs of A or T nucleotides in DNA sequences with a period close to the helical period of the DNA double helix has been associated with intrinsic DNA bending and nucleosome positioning in eukaryotes. Analogous periodic signals were also observed in prokaryotic genomes. While the exact role of this periodicity in prokaryotes is not known, it has been proposed to facilitate the DNA packaging in the prokaryotic nucleoid and/or to promote negative or positive supercoiling. We developed a methodology for assessments of intragenomic heterogeneity of these periodic patterns and applied it in analysis of 1,025 prokaryotic chromosomes. This technique allows more detailed analysis of sequence periodicity than previous methods where sequence periodicity was assessed in an integral form across the whole chromosome. We found that most genomes have the periodic signal confined to several chromosomal segments while most of the chromosome lacks a strong sequence periodicity. Moreover, there are significant differences among different prokaryotes in both the intensity and persistency of sequence periodicity related to DNA curvature. We proffer that the prokaryotic nucleoid consists of relatively rigid sections stabilized by short intrinsically bent DNA segments and characterized by locally strong periodic patterns alternating with regions featuring a weak periodic signal, which presumably permits higher structural flexibility. This model applies to most bacteria and archaea. In genomes with an exceptionally persistent periodic signal, highly expressed genes tend to concentrate in aperiodic sections, suggesting that structural heterogeneity of the nucleoid is related to local differences in transcriptional activity.
Collapse
|
9
|
Gao J, Chen LL. Theoretical methods for identifying important functional genes in bacterial genomes. Res Microbiol 2009; 161:1-8. [PMID: 19900539 DOI: 10.1016/j.resmic.2009.10.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2009] [Revised: 10/05/2009] [Accepted: 10/21/2009] [Indexed: 12/30/2022]
Abstract
Some functional genes, such as essential genes, highly expressed genes and horizontally transferred genes, play important roles in the survival and pathogenicity of bacteria. This review attempts to summarize current computational methods in identifying the above functional genes from bacterial genomes, which is of significant importance in exploring the bacterial genomes.
Collapse
Affiliation(s)
- Junxiang Gao
- School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, PR China
| | | |
Collapse
|
10
|
Orsi RH, Sun Q, Wiedmann M. Genome-wide analyses reveal lineage specific contributions of positive selection and recombination to the evolution of Listeria monocytogenes. BMC Evol Biol 2008; 8:233. [PMID: 18700032 PMCID: PMC2532693 DOI: 10.1186/1471-2148-8-233] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2007] [Accepted: 08/12/2008] [Indexed: 12/30/2022] Open
Abstract
Background The genus Listeria includes two closely related pathogenic and non-pathogenic species, L. monocytogenes and L. innocua. L. monocytogenes is an opportunistic human foodborne and animal pathogen that includes two common lineages. While lineage I is more commonly found among human listeriosis cases, lineage II appears to be overrepresented among isolates from foods and environmental sources. This study used the genome sequences for one L. innocua strain and four L. monocytogenes strains representing lineages I and II, to characterize the contributions of positive selection and recombination to the evolution of the L. innocua/L. monocytogenes core genome. Results Among the 2267 genes in the L. monocytogenes/L. innocua core genome, 1097 genes showed evidence for recombination and 36 genes showed evidence for positive selection. Positive selection was strongly associated with recombination. Specifically, 29 of the 36 genes under positive selection also showed evidence for recombination. Recombination was more common among isolates in lineage II than lineage I; this trend was confirmed by sequencing five genes in a larger isolate set. Positive selection was more abundant in the ancestral branch of lineage II (20 genes) as compared to the ancestral branch of lineage I (9 genes). Additional genes under positive selection were identified in the branch separating the two species; for this branch, genes in the role category "Cell wall and membrane biogenesis" were significantly more likely to have evidence for positive selection. Positive selection of three genes was confirmed in a larger isolate set, which also revealed occurrence of multiple premature stop codons in one positively selected gene involved in flagellar motility (flaR). Conclusion While recombination and positive selection both contribute to evolution of L. monocytogenes, the relative contributions of these evolutionary forces seem to differ by L. monocytogenes lineages and appear to be more important in the evolution of lineage II, which seems to be found in a broader range of environments, as compared to the apparently more host adapted lineage I. Diversification of cell wall and membrane biogenesis and motility-related genes may play a particularly important role in the evolution of L. monocytogenes.
Collapse
Affiliation(s)
- Renato H Orsi
- Department of Food Science, Cornell University, Ithaca, NY, USA.
| | | | | |
Collapse
|
11
|
Reva ON, Hallin PF, Willenbrock H, Sicheritz-Ponten T, Tümmler B, Ussery DW. Global features of the Alcanivorax borkumensis SK2 genome. Environ Microbiol 2007; 10:614-25. [PMID: 18081853 DOI: 10.1111/j.1462-2920.2007.01483.x] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
The global feature of the completely sequenced Alcanivorax borkumensis SK2 type strain chromosome is its symmetry and homogeneity. The origin and terminus of replication are located opposite to each other in the chromosome and are discerned with high signal to noise ratios by maximal oligonucleotide usage biases on the leading and lagging strand. Genomic DNA structure is rather uniform throughout the chromosome with respect to intrinsic curvature, position preference or base stacking energy. The orthologs and paralogs of A. borkumensis genes with the highest sequence homology were found in most cases among gamma-Proteobacteria, with Acinetobacter and P. aeruginosa as closest relatives. A. borkumensis shares a similar oligonucleotide usage and promoter structure with the Pseudomonadales. A comparatively low number of only 18 genome islands with atypical oligonucleotide usage was detected in the A. borkumensis chromosome. The gene clusters that confer the assimilation of aliphatic hydrocarbons, are localized in two genome islands which were probably acquired from an ancestor of the Yersinia lineage, whereas the alk genes of Pseudomonas putida still exhibit the typical Alcanivorax oligonucleotide signature indicating a complex evolution of this major hydrocarbonoclastic trait.
Collapse
Affiliation(s)
- Oleg N Reva
- Klinische Forschergruppe, OE6711, Medizinische Hochschule Hannover, Carl-Neuberg-Strasse 1, D-30625 Hannover, Germany
| | | | | | | | | | | |
Collapse
|
12
|
Prediction of highly expressed genes in microbes based on chromatin accessibility. BMC Mol Biol 2007; 8:11. [PMID: 17295928 PMCID: PMC1805505 DOI: 10.1186/1471-2199-8-11] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2006] [Accepted: 02/13/2007] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND It is well known that gene expression is dependent on chromatin structure in eukaryotes and it is likely that chromatin can play a role in bacterial gene expression as well. Here, we use a nucleosomal position preference measure of anisotropic DNA flexibility to predict highly expressed genes in microbial genomes. We compare these predictions with those based on codon adaptation index (CAI) values, and also with experimental data for 6 different microbial genomes, with a particular interest in experimental data from Escherichia coli. Moreover, position preference is examined further in 328 sequenced microbial genomes. RESULTS We find that absolute gene expression levels are correlated with the position preference in many microbial genomes. It is postulated that in these regions, the DNA may be more accessible to the transcriptional machinery. Moreover, ribosomal proteins and ribosomal RNA are encoded by DNA having significantly lower position preference values than other genes in fast-replicating microbes. CONCLUSION This insight into DNA structure-dependent gene expression in microbes may be exploited for predicting the expression of non-translated genes such as non-coding RNAs that may not be predicted by any of the conventional codon usage bias approaches.
Collapse
|