1
|
Sánchez IE, Galpern EA, Ferreiro DU. Solvent constraints for biopolymer folding and evolution in extraterrestrial environments. Proc Natl Acad Sci U S A 2024; 121:e2318905121. [PMID: 38739787 PMCID: PMC11127021 DOI: 10.1073/pnas.2318905121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2023] [Accepted: 04/16/2024] [Indexed: 05/16/2024] Open
Abstract
We propose that spontaneous folding and molecular evolution of biopolymers are two universal aspects that must concur for life to happen. These aspects are fundamentally related to the chemical composition of biopolymers and crucially depend on the solvent in which they are embedded. We show that molecular information theory and energy landscape theory allow us to explore the limits that solvents impose on biopolymer existence. We consider 54 solvents, including water, alcohols, hydrocarbons, halogenated solvents, aromatic solvents, and low molecular weight substances made up of elements abundant in the universe, which may potentially take part in alternative biochemistries. We find that along with water, there are many solvents for which the liquid regime is compatible with biopolymer folding and evolution. We present a ranking of the solvents in terms of biopolymer compatibility. Many of these solvents have been found in molecular clouds or may be expected to occur in extrasolar planets.
Collapse
Affiliation(s)
- Ignacio E. Sánchez
- Laboratorio de Fisiología de Proteínas, Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos AiresCP1428, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales, Buenos AiresCP1428, Argentina
| | - Ezequiel A. Galpern
- Laboratorio de Fisiología de Proteínas, Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos AiresCP1428, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales, Buenos AiresCP1428, Argentina
| | - Diego U. Ferreiro
- Laboratorio de Fisiología de Proteínas, Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos AiresCP1428, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales, Buenos AiresCP1428, Argentina
| |
Collapse
|
2
|
Schneider TD. Generalizing the isothermal efficiency by using Gaussian distributions. PLoS One 2023; 18:e0279758. [PMID: 36626367 PMCID: PMC9831307 DOI: 10.1371/journal.pone.0279758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 11/28/2022] [Indexed: 01/11/2023] Open
Abstract
Unlike the Carnot heat engine efficiency published in 1824, an isothermal efficiency derived from thermodynamics and information theory can be applied to biological systems. The original approach by Pierce and Cutler in 1959 to derive the isothermal efficiency equation came from Shannon's channel capacity of 1949 and from Felker's 1952 determination of the minimum energy dissipation needed to gain a bit. In 1991 and 2010 Schneider showed how the isothermal efficiency equation can be applied to molecular machines and that this can be used to explain why several molecular machines are 70% efficient. Surprisingly, some macroscopic biological systems, such as whole ecosystems, are also 70% efficient but it is hard to see how this could be explained by a thermodynamic and molecular theory. The thesis of this paper is that the isothermal efficiency can be derived without using thermodynamics by starting from a set of independent Gaussian distributions. This novel derivation generalizes the isothermal efficiency equation for use at all levels of biology, from molecules to ecosystems.
Collapse
Affiliation(s)
- Thomas D. Schneider
- National Institutes of Health, National Cancer Institute, Center for Cancer Research, RNA Biology Laboratory, Frederick, MD, United States of America
- * E-mail:
| |
Collapse
|
3
|
Sánchez IE, Galpern EA, Garibaldi MM, Ferreiro DU. Molecular Information Theory Meets Protein Folding. J Phys Chem B 2022; 126:8655-8668. [PMID: 36282961 DOI: 10.1021/acs.jpcb.2c04532] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
We propose an application of molecular information theory to analyze the folding of single domain proteins. We analyze results from various areas of protein science, such as sequence-based potentials, reduced amino acid alphabets, backbone configurational entropy, secondary structure content, residue burial layers, and mutational studies of protein stability changes. We found that the average information contained in the sequences of evolved proteins is very close to the average information needed to specify a fold ∼2.2 ± 0.3 bits/(site·operation). The effective alphabet size in evolved proteins equals the effective number of conformations of a residue in the compact unfolded state at around 5. We calculated an energy-to-information conversion efficiency upon folding of around 50%, lower than the theoretical limit of 70%, but much higher than human-built macroscopic machines. We propose a simple mapping between molecular information theory and energy landscape theory and explore the connections between sequence evolution, configurational entropy, and the energetics of protein folding.
Collapse
Affiliation(s)
- Ignacio E Sánchez
- Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos AiresCP1428, Argentina
| | - Ezequiel A Galpern
- Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos AiresCP1428, Argentina
| | - Martín M Garibaldi
- Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos AiresCP1428, Argentina
| | - Diego U Ferreiro
- Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos AiresCP1428, Argentina
| |
Collapse
|
4
|
Borsley S, Leigh DA, Roberts BMW. A Doubly Kinetically-Gated Information Ratchet Autonomously Driven by Carbodiimide Hydration. J Am Chem Soc 2021; 143:4414-4420. [DOI: 10.1021/jacs.1c01172] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Affiliation(s)
- Stefan Borsley
- Department of Chemistry, University of Manchester, Oxford Road, Manchester M13 9PL, United Kingdom
| | - David A. Leigh
- Department of Chemistry, University of Manchester, Oxford Road, Manchester M13 9PL, United Kingdom
| | - Benjamin M. W. Roberts
- Department of Chemistry, University of Manchester, Oxford Road, Manchester M13 9PL, United Kingdom
| |
Collapse
|
5
|
Bin Y, Zhang W, Tang W, Dai R, Li M, Zhu Q, Xia J. Prediction of Neuropeptides from Sequence Information Using Ensemble Classifier and Hybrid Features. J Proteome Res 2020; 19:3732-3740. [DOI: 10.1021/acs.jproteome.0c00276] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Affiliation(s)
- Yannan Bin
- Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui 230601, China
- School of Computer Science and Technology, Anhui University, Hefei, Anhui 230601, China
| | - Wei Zhang
- Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui 230601, China
| | - Wending Tang
- Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui 230601, China
| | - Ruyu Dai
- Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui 230601, China
| | - Menglu Li
- School of Computer Science and Technology, Anhui University, Hefei, Anhui 230601, China
| | - Qizhi Zhu
- Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui 230601, China
| | - Junfeng Xia
- Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui 230601, China
- School of Computer Science and Technology, Anhui University, Hefei, Anhui 230601, China
| |
Collapse
|
6
|
Brash DE. Rethinking Causation for Data-intensive Biology: Constraints, Cancellations, and Quantized Organisms: Causality in complex organisms is sculpted by constraints rather than instigators, with outcomes perhaps better described by quantized patterns than rectilinear pathways. Bioessays 2020; 42:e1900135. [PMID: 32484248 PMCID: PMC7518294 DOI: 10.1002/bies.201900135] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 03/19/2020] [Indexed: 12/22/2022]
Abstract
Complex organisms thwart the simple rectilinear causality paradigm of "necessary and sufficient," with its experimental strategy of "knock down and overexpress." This Essay organizes the eccentricities of biology into four categories that call for new mathematical approaches; recaps for the biologist the philosopher's recent refinements to the causation concept and the mathematician's computational tools that handle some but not all of the biological eccentricities; and describes overlooked insights that make causal properties of physical hierarchies such as emergence and downward causation straightforward. Reviewing and extrapolating from similar situations in physics, it is suggested that new mathematical tools for causation analysis incorporating feedback, signal cancellation, nonlinear dependencies, physical hierarchies, and fixed constraints rather than instigative changes will reveal unconventional biological behaviors. These include "eigenisms," organisms that are limited to quantized states; trajectories that steer a system such as an evolving species toward optimal states; and medical control via distributed "sheets" rather than single control points.
Collapse
Affiliation(s)
- Douglas E Brash
- Department of Therapeutic Radiology, Yale School of Medicine, New Haven, CT, 06520-8040, USA
| |
Collapse
|
7
|
Fontana J, Dong C, Kiattisewee C, Chavali VP, Tickman BI, Carothers JM, Zalatan JG. Effective CRISPRa-mediated control of gene expression in bacteria must overcome strict target site requirements. Nat Commun 2020; 11:1618. [PMID: 32238808 PMCID: PMC7113249 DOI: 10.1038/s41467-020-15454-y] [Citation(s) in RCA: 56] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 03/04/2020] [Indexed: 11/09/2022] Open
Abstract
In bacterial systems, CRISPR-Cas transcriptional activation (CRISPRa) has the potential to dramatically expand our ability to regulate gene expression, but we lack predictive rules for designing effective gRNA target sites. Here, we identify multiple features of bacterial promoters that impose stringent requirements on CRISPRa target sites. Notably, we observe narrow, 2-4 base windows of effective sites with a periodicity corresponding to one helical turn of DNA, spanning ~40 bases and centered ~80 bases upstream of the TSS. However, we also identify two features suggesting the potential for broad scope: CRISPRa is effective at a broad range of σ70-family promoters, and an expanded PAM dCas9 allows the activation of promoters that cannot be activated by S. pyogenes dCas9. These results provide a roadmap for future engineering efforts to further expand and generalize the scope of bacterial CRISPRa.
Collapse
Affiliation(s)
- Jason Fontana
- Molecular Engineering & Sciences Institute, University of Washington, Seattle, 98195, WA, USA
| | - Chen Dong
- Department of Chemistry, University of Washington, Seattle, 98195, WA, USA
| | - Cholpisit Kiattisewee
- Molecular Engineering & Sciences Institute, University of Washington, Seattle, 98195, WA, USA
| | - Venkata P Chavali
- Department of Chemical Engineering, University of Washington, Seattle, 98195, WA, USA
| | - Benjamin I Tickman
- Molecular Engineering & Sciences Institute, University of Washington, Seattle, 98195, WA, USA
| | - James M Carothers
- Molecular Engineering & Sciences Institute, University of Washington, Seattle, 98195, WA, USA.
- Department of Chemical Engineering, University of Washington, Seattle, 98195, WA, USA.
- Center for Synthetic Biology, University of Washington, Seattle, 98195, WA, USA.
| | - Jesse G Zalatan
- Department of Chemistry, University of Washington, Seattle, 98195, WA, USA.
- Department of Chemical Engineering, University of Washington, Seattle, 98195, WA, USA.
- Center for Synthetic Biology, University of Washington, Seattle, 98195, WA, USA.
| |
Collapse
|
8
|
Schneider TD, Jejjala V. Restriction enzymes use a 24 dimensional coding space to recognize 6 base long DNA sequences. PLoS One 2019; 14:e0222419. [PMID: 31671158 PMCID: PMC6822723 DOI: 10.1371/journal.pone.0222419] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2019] [Accepted: 08/29/2019] [Indexed: 11/19/2022] Open
Abstract
Restriction enzymes recognize and bind to specific sequences on invading bacteriophage DNA. Like a key in a lock, these proteins require many contacts to specify the correct DNA sequence. Using information theory we develop an equation that defines the number of independent contacts, which is the dimensionality of the binding. We show that EcoRI, which binds to the sequence GAATTC, functions in 24 dimensions. Information theory represents messages as spheres in high dimensional spaces. Better sphere packing leads to better communications systems. The densest known packing of hyperspheres occurs on the Leech lattice in 24 dimensions. We suggest that the single protein EcoRI molecule employs a Leech lattice in its operation. Optimizing density of sphere packing explains why 6 base restriction enzymes are so common.
Collapse
Affiliation(s)
- Thomas D. Schneider
- National Institutes of Health, National Cancer Institute, Center for Cancer Research, RNA Biology Laboratory, Frederick, Maryland, United States of America
- * E-mail:
| | - Vishnu Jejjala
- Mandelstam Institute for Theoretical Physics, School of Physics, NITheP, and CoE-MaSS, University of the Witwatersrand, Johannesburg, South Africa
- David Rittenhouse Laboratory, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| |
Collapse
|
9
|
Zavyalova E, Kopylov A. Energy Transfer as A Driving Force in Nucleic Acid⁻Protein Interactions. Molecules 2019; 24:molecules24071443. [PMID: 30979095 PMCID: PMC6480146 DOI: 10.3390/molecules24071443] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2019] [Revised: 04/10/2019] [Accepted: 04/11/2019] [Indexed: 12/19/2022] Open
Abstract
Many nucleic acid–protein structures have been resolved, though quantitative structure-activity relationship remains unclear in many cases. Thrombin complexes with G-quadruplex aptamers are striking examples of a lack of any correlation between affinity, interface organization, and other common parameters. Here, we tested the hypothesis that affinity of the aptamer–protein complex is determined with the capacity of the interface to dissipate energy of binding. Description and detailed analysis of 63 nucleic acid–protein structures discriminated peculiarities of high-affinity nucleic acid–protein complexes. The size of the amino acid sidechain in the interface was demonstrated to be the most significant parameter that correlates with affinity of aptamers. This observation could be explained in terms of need of efficient energy transfer from interacting residues. Application of energy dissipation theory provided an illustrative tool for estimation of efficiency of aptamer–protein complexes. These results are of great importance for a design of efficient aptamers.
Collapse
Affiliation(s)
| | - Alexey Kopylov
- Chemistry Department, Lomonosov Moscow State University, 119991 Moscow, Russia.
| |
Collapse
|
10
|
Aptekmann AA, Nadra AD. Core promoter information content correlates with optimal growth temperature. Sci Rep 2018; 8:1313. [PMID: 29358630 PMCID: PMC5777992 DOI: 10.1038/s41598-018-19495-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2017] [Accepted: 01/03/2018] [Indexed: 01/27/2023] Open
Abstract
The subtle mechanisms by which protein-DNA interactions remain functional across a wide range of temperatures are largely unknown. In this work, we manually curated available information relating fully sequenced archaeal genomes with organism growth temperatures. We built a motif that represents the core promoter of each species and calculated its information content. We then studied the relation between optimal growth temperature (OGT) and information content (IC) in the promoter region.We found a positive correlation between G + C content and OGT in tRNA regions and not in overall genome. Furthermore, we found that there is a positive correlation between information content and optimal growth temperatures in Archaea. This can't be explained by an increased C+G composition nor by other obvious mechanisms. These findings suggest that increased information content could produce a positive fitness in organisms living at high temperatures. We suggest that molecular information theory may need to be adapted for hyperthermophiles.
Collapse
Affiliation(s)
- Ariel A Aptekmann
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Biológica, Laboratorio de Bioquímica Estructural, Buenos Aires, Argentina
- CONICET- Universidad de Buenos Aires, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN). Intendente Güiraldes 2160, C1428EGA, Buenos Aires, Argentina
| | - Alejandro D Nadra
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Biológica, Laboratorio de Bioquímica Estructural, Buenos Aires, Argentina.
- CONICET- Universidad de Buenos Aires, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN). Intendente Güiraldes 2160, C1428EGA, Buenos Aires, Argentina.
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Fisiología, Biología Molecular y Celular, Laboratorio de Bioquímica Estructural, Buenos Aires, Argentina.
| |
Collapse
|
11
|
Abstract
Sorting of single-wall carbon nanotubes by their electronic and atomic structures in liquid phases is reviewed in this chapter. We first introduce the sorting problem, and then provide an overview of several sorting methodologies, following roughly the chronological order of their development over the past 15 years or so. Major methods discussed include ion-exchange chromatography, density-gradient ultracentrifugation, selective extraction in organic solvents, gel chromatography, and aqueous two-phase extraction. A main focus of the review is on the common mechanisms underlining all sorting processes. We propose that differences in solvation among different nanotube species are the ultimate driving force of sorting, and we corroborate this proposal by presenting analysis on how the differences are realized in electronic-structure-based sorting and atomic-structure-based sorting. In the end, we offer some suggestions on future directions that may grow out of carbon nanotube sorting. In particular, the prospect of expanding the function of DNA/carbon nanotube hybrid to control inter-particle interactions both inside and outside the nanotube is discussed.
Collapse
Affiliation(s)
- Ming Zheng
- Materials Science and Engineering Division, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, MD, 20899, USA.
| |
Collapse
|
12
|
|
13
|
He G, Tolic A, Bashkin JK, Poon GMK. Heterogeneous dynamics in DNA site discrimination by the structurally homologous DNA-binding domains of ETS-family transcription factors. Nucleic Acids Res 2015; 43:4322-31. [PMID: 25824951 PMCID: PMC4417174 DOI: 10.1093/nar/gkv267] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Accepted: 03/05/2015] [Indexed: 01/09/2023] Open
Abstract
The ETS family of transcription factors exemplifies current uncertainty in how eukaryotic genetic regulators with overlapping DNA sequence preferences achieve target site specificity. PU.1 and Ets-1 represent archetypes for studying site discrimination by ETS proteins because their DNA-binding domains are the most divergent in sequence, yet they share remarkably superimposable DNA-bound structures. To gain insight into the contrasting thermodynamics and kinetics of DNA recognition by these two proteins, we investigated the structure and dynamics of site discrimination by their DNA-binding domains. Electrophoretic mobilities of complexes formed by the two homologs with circularly permuted binding sites showed significant dynamic differences only for DNA complexes of PU.1. Free solution measurements by dynamic light scattering showed PU.1 to be more dynamic than Ets-1; moreover, dynamic changes are strongly coupled to site discrimination by PU.1, but not Ets-1. Interrogation of the protein/DNA interface by DNA footprinting showed similar accessibility to dimethyl sulfate for PU.1/DNA and Ets-1/DNA complexes, indicating that the dynamics of PU.1/DNA complexes reside primarily outside that interface. An information-based analysis of the two homologs’ binding motifs suggests a role for dynamic coupling in PU.1's ability to enforce a more stringent sequence preference than Ets-1 and its proximal sequence homologs.
Collapse
Affiliation(s)
- Gaofei He
- Department of Chemistry and Biochemistry and Center for Nanoscience, University of Missouri - St. Louis, St. Louis, MO 63121, USA
| | - Ana Tolic
- College of Pharmacy, Washington State University, Spokane, WA 99210, USA
| | - James K Bashkin
- Department of Chemistry and Biochemistry and Center for Nanoscience, University of Missouri - St. Louis, St. Louis, MO 63121, USA
| | - Gregory M K Poon
- College of Pharmacy, Washington State University, Spokane, WA 99210, USA
| |
Collapse
|
14
|
Carrasco-Ruiz A, Tiburcio J. Electrostatic Kinetic Barriers in the Threading/Dethreading Motion of a Rotaxane-like Complex. Org Lett 2015; 17:1858-61. [PMID: 25822091 DOI: 10.1021/acs.orglett.5b00530] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Anayeli Carrasco-Ruiz
- Departamento
de Química, Centro de Investigación y de Estudios Avanzados (Cinvestav), México D. F., México
| | - Jorge Tiburcio
- Departamento
de Química, Centro de Investigación y de Estudios Avanzados (Cinvestav), México D. F., México
| |
Collapse
|
15
|
Bandyopadhyay A, Biswas S, Maity AK, Banik SK. Analysis of DevR regulated genes in Mycobacterium tuberculosis. SYSTEMS AND SYNTHETIC BIOLOGY 2014; 8:3-20. [PMID: 24592287 DOI: 10.1007/s11693-014-9133-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2013] [Revised: 01/29/2014] [Accepted: 01/30/2014] [Indexed: 11/25/2022]
Abstract
The DevRS two component system of Mycobacterium tuberculosis is responsible for its dormancy in host and becomes operative under hypoxic condition. It is experimentally known that phosphorylated DevR controls the expression of several downstream genes in a complex manner. In the present work we propose a theoretical model to show role of binding sites in DevR mediated gene expression. Individual and collective role of binding sites in regulating DevR mediated gene expression has been shown via modeling. Objective of the present work is twofold. First, to describe qualitatively the temporal dynamics of wild type genes and their known mutants. Based on these results we propose that DevR controlled gene expression follows a specific pattern which is efficient in describing other DevR mediated gene expression. Second, to analyze behavior of the system from information theoretical point of view. Using the tools of information theory we have calculated molecular efficiency of the system and have shown that it is close to the maximum limit of isothermal efficiency.
Collapse
Affiliation(s)
- Arnab Bandyopadhyay
- Department of Chemistry, Bose Institute, 93/1 A P C Road, Kolkata, 700009 India
| | - Soumi Biswas
- Department of Chemistry, Bose Institute, 93/1 A P C Road, Kolkata, 700009 India
| | - Alok Kumar Maity
- Department of Chemistry, University of Calcutta, 92 A P C Road, Kolkata, 700009 India
| | - Suman K Banik
- Department of Chemistry, Bose Institute, 93/1 A P C Road, Kolkata, 700009 India
| |
Collapse
|
16
|
Ofer D, Linial M. NeuroPID: a predictor for identifying neuropeptide precursors from metazoan proteomes. ACTA ACUST UNITED AC 2013; 30:931-40. [PMID: 24336809 DOI: 10.1093/bioinformatics/btt725] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
MOTIVATION The evolution of multicellular organisms is associated with increasing variability of molecules governing behavioral and physiological states. This is often achieved by neuropeptides (NPs) that are produced in neurons from a longer protein, named neuropeptide precursor (NPP). The maturation of NPs occurs through a sequence of proteolytic cleavages. The difficulty in identifying NPPs is a consequence of their diversity and the lack of applicable sequence similarity among the short functionally related NPs. RESULTS Herein, we describe Neuropeptide Precursor Identifier (NeuroPID), a machine learning scheme that predicts metazoan NPPs. NeuroPID was trained on hundreds of identified NPPs from the UniProtKB database. Some 600 features were extracted from the primary sequences and processed using support vector machines (SVM) and ensemble decision tree classifiers. These features combined biophysical, chemical and informational-statistical properties of NPs and NPPs. Other features were guided by the defining characteristics of the dibasic cleavage sites motif. NeuroPID reached 89-94% accuracy and 90-93% precision in cross-validation blind tests against known NPPs (with an emphasis on Chordata and Arthropoda). NeuroPID also identified NPP-like proteins from extensively studied model organisms as well as from poorly annotated proteomes. We then focused on the most significant sets of features that contribute to the success of the classifiers. We propose that NPPs are attractive targets for investigating and modulating behavior, metabolism and homeostasis and that a rich repertoire of NPs remains to be identified. AVAILABILITY NeuroPID source code is freely available at http://www.protonet.cs.huji.ac.il/neuropid
Collapse
Affiliation(s)
- Dan Ofer
- Department of Biological Chemistry, Institute of Life Sciences, The Edmond J. Safra Campus, The Hebrew University of Jerusalem, Givat Ram 91904, Israel
| | | |
Collapse
|
17
|
Agayan RR, Tucker R, Nitta T, Ruhnow F, Walter WJ, Diez S, Hess H. Optimization of isopolar microtubule arrays. LANGMUIR : THE ACS JOURNAL OF SURFACES AND COLLOIDS 2013; 29:2265-2272. [PMID: 23330965 DOI: 10.1021/la303792v] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Isopolar arrays of aligned cytoskeletal filaments are components in a number of designs of hybrid nanodevices incorporating biomolecular motors. For example, a combination of filament arrays and motor arrays can form an actuator or a molecular engine resembling an artificial muscle. Here, isopolar arrays of microtubules are fabricated by flow alignment, and their quality is characterized by their degree of alignment. We find, in agreement with our analytical models, that the degree of alignment is ultimately limited by thermal forces, while the kinetics of the alignment process are influenced by the flow strength, the microtubule stiffness, the gliding velocity, and the tip length. Strong flows remove microtubules from the surface and reduce the filament density, suggesting that there is an optimal flow strength for the fabrication of ordered arrays.
Collapse
Affiliation(s)
- Rodney R Agayan
- Department of Biomedical Engineering, Columbia University, New York, New York 10027, United States
| | | | | | | | | | | | | |
Collapse
|
18
|
Agarwala EK, Chiel HJ, Thomas PJ. Pursuit of food versus pursuit of information in a Markovian perception-action loop model of foraging. J Theor Biol 2012; 304:235-72. [PMID: 22381540 DOI: 10.1016/j.jtbi.2012.02.016] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2010] [Revised: 12/21/2011] [Accepted: 02/13/2012] [Indexed: 12/30/2022]
Abstract
Efficient coding, redundancy reduction, and other information theoretic optimization principles have successfully explained the organization of many biological phenomena, from the physiology of sensory receptive fields to the variability of certain DNA sequence ensembles. Here we examine the hypothesis that behavioral strategies that are optimal for survival must necessarily involve efficient information processing, and ask whether there can be circumstances in which deliberately sacrificing some information can lead to higher utility? To this end, we present an analytically tractable model for a particular instance of a perception-action loop: a creature searching for a randomly moving food source confined to a 1D ring world. The model incorporates the statistical structure of the creature's world, the effects of the creature's actions on that structure, and the creature's strategic decision process. The underlying model takes the form of a Markov process on an infinite dimensional state space. To analyze it we construct an exact coarse graining that reduces the model to a Markov process on a finite number of "information states". This mathematical technique allows us to make quantitative comparisons between the performance of an information-theoretically optimal strategy with other candidate search strategies on a food gathering task. We find that 1. Information optimal search does not necessarily optimize utility (expected food gain). 2. The rank ordering of search strategies by information performance does not predict their ordering by expected food obtained. 3. The relative advantage of different strategies depends on the statistical structure of the environment, in particular the variability of motion of the source. We conclude that there is no simple relationship between information and utility. Even in the absence of information processing costs or bandwidth constraints, behavioral optimality does not imply information efficiency, nor is there a simple tradeoff between the two objectives of gaining information about a food source versus obtaining the food itself. For a wide range of values of the food source's movement parameter, the strategy of collecting the most information possible about the unknown source location carries an ineliminable structural cost, leading to a situation in which a foraging creature could actually choose to be less well-informed while simultaneously being, on average, better fed.
Collapse
Affiliation(s)
- Edward K Agarwala
- Department of Mathematics, Case Western Reserve University, Cleveland, Ohio 44106, USA
| | | | | |
Collapse
|
19
|
Mayorga LS, López MJ, Becker WM. Molecular thermodynamics for cell biology as taught with boxes. CBE LIFE SCIENCES EDUCATION 2012; 11:31-38. [PMID: 22383615 PMCID: PMC3337714 DOI: 10.1187/cbe.11-07-0053] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2011] [Revised: 11/30/2011] [Accepted: 12/13/2011] [Indexed: 05/31/2023]
Abstract
Thermodynamic principles are basic to an understanding of the complex fluxes of energy and information required to keep cells alive. These microscopic machines are nonequilibrium systems at the micron scale that are maintained in pseudo-steady-state conditions by very sophisticated processes. Therefore, several nonstandard concepts need to be taught to rationalize why these very ordered systems proliferate actively all over our planet in seeming contradiction to the second law of thermodynamics. We propose a model consisting of boxes with different shapes that contain small balls that are in constant motion due to a stream of air blowing from below. This is a simple macroscopic system that can be easily visualized by students and that can be understood as mimicking the behavior of a set of molecules exchanging energy. With such boxes, the basic concepts of entropy, enthalpy, and free energy can be taught while reinforcing a molecular understanding of the concepts and stressing the stochastic nature of the thermodynamic laws. In addition, time-related concepts, such as reaction rates and activation energy, can be readily visualized. Moreover, the boxes provide an intuitive way to introduce the role in cellular organization of "information" and Maxwell's demons operating under nonequilibrium conditions.
Collapse
Affiliation(s)
- Luis S Mayorga
- IHEM (Consejo Nacional de Investigaciones Científicas y Técnicas/Universidad Nacional de Cuyo), Area of Biological Chemistry, Mendoza, Argentina.
| | | | | |
Collapse
|
20
|
Affiliation(s)
- Henry Hess
- Department of Biomedical Engineering, Columbia University, New York, NY 10027;
| |
Collapse
|
21
|
Abstract
In this Perspective, we propose that communication theory--a field of mathematics concerned with the problems of signal transmission, reception and processing--provides a new quantitative lens for investigating multicellular biology, ancient and modern. What underpins the cohesive organisation and collective behaviour of multicellular ecosystems such as microbial colonies and communities (microbiomes) and multicellular organisms such as plants and animals, whether built of simple tissue layers (sponges) or of complex differentiated cells arranged in tissues and organs (members of the 35 or so phyla of the subkingdom Metazoa)? How do mammalian tissues and organs develop, maintain their architecture, become subverted in disease, and decline with age? How did single-celled organisms coalesce to produce many-celled forms that evolved and diversified into the varied multicellular organisms in existence today? Some answers can be found in the blueprints or recipes encoded in (epi)genomes, yet others lie in the generic physical properties of biological matter such as the ability of cell aggregates to attain a certain complexity in size, shape, and pattern. We suggest that Lasswell's maxim "Who says what to whom in what channel with what effect" provides a foundation for understanding not only the emergence and evolution of multicellularity, but also the assembly and sculpting of multicellular ecosystems and many-celled structures, whether of natural or human-engineered origin. We explore how the abstraction of communication theory as an organising principle for multicellular biology could be realised. We highlight the inherent ability of communication theory to be blind to molecular and/or genetic mechanisms. We describe selected applications that analyse the physics of communication and use energy efficiency as a central tenet. Whilst communication theory has and could contribute to understanding a myriad of problems in biology, investigations of multicellular biology could, in turn, lead to advances in communication theory, especially in the still immature field of network information theory.
Collapse
Affiliation(s)
- I S Mian
- Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
| | | |
Collapse
|
22
|
Abstract
The idea that we could build molecular communications systems can be advanced by investigating how actual molecules from living organisms function. Information theory provides tools for such an investigation. This review describes how we can compute the average information in the DNA binding sites of any genetic control protein and how this can be extended to analyze its individual sites. A formula equivalent to Claude Shannon's channel capacity can be applied to molecular systems and used to compute the efficiency of protein binding. This efficiency is often 70% and a brief explanation for that is given. The results imply that biological systems have evolved to function at channel capacity, which means that we should be able to build molecular communications that are just as robust as our macroscopic ones.
Collapse
Affiliation(s)
- Thomas D. Schneider
- National Institutes of Health, National Cancer Institute at Frederick, P.O. Box B, Frederick, MD 21702-1201, United States
| |
Collapse
|