1
Poyatos JF. Design principles of multi-map variation in biological systems. Phys Biol 2024; 21:043001. [PMID: 38949447 DOI: 10.1088/1478-3975/ad5d6c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/19/2024] [Accepted: 07/01/2024] [Indexed: 07/02/2024]
Abstract
Complexity in biology is often described using a multi-map hierarchical architecture, where the genotype, representing the encoded information, is mapped to the functional level, known as the phenotype, which is then connected to a latent phenotype we refer to as fitness. This underlying architecture governs the processes driving evolution. Furthermore, natural selection, along with other neutral forces, can, in turn, modify these maps. At each level, variation is observed. Here, I propose the need to establish principles that can aid in understanding the transformation of variation within this multi-map architecture. Specifically, I will introduce three, related to the presence of modulators, constraints, and the modular channeling of variation. By comprehending these design principles in various biological systems, we can gain better insights into the mechanisms underlying these maps and how they ultimately contribute to evolutionary dynamics.
Affiliation(s)
- Juan F Poyatos
- Logic of Genomic Systems Lab (CNB-CSIC), Madrid 28049, Spain
2
Rafelski SM, Theriot JA. Establishing a conceptual framework for holistic cell states and state transitions. Cell 2024; 187:2633-2651. [PMID: 38788687 DOI: 10.1016/j.cell.2024.04.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/13/2024] [Revised: 04/10/2024] [Accepted: 04/24/2024] [Indexed: 05/26/2024]
Abstract
Cell states were traditionally defined by how they looked, where they were located, and what functions they performed. In this post-genomic era, the field is largely focused on a molecular view of cell state. Moving forward, we anticipate that the observables used to define cell states will evolve again as single-cell imaging and analytics are advancing at a breakneck pace via the collection of large-scale, systematic cell image datasets and the application of quantitative image-based data science methods. This is, therefore, a key moment in the arc of cell biological research to develop approaches that integrate the spatiotemporal observables of the physical structure and organization of the cell with molecular observables toward the concept of a holistic cell state. In this perspective, we propose a conceptual framework for holistic cell states and state transitions that is data-driven, practical, and useful to enable integrative analyses and modeling across many data types.
Affiliation(s)
- Susanne M Rafelski
- Allen Institute for Cell Science, 615 Westlake Avenue N, Seattle, WA 98125, USA.
- Julie A Theriot
- Department of Biology and Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.
3
McBride JM, Polev K, Abdirasulov A, Reinharz V, Grzybowski BA, Tlusty T. AlphaFold2 Can Predict Single-Mutation Effects. Phys Rev Lett 2023; 131:218401. [PMID: 38072605 DOI: 10.1103/physrevlett.131.218401] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 01/03/2023] [Accepted: 09/26/2023] [Indexed: 12/18/2023]
Abstract
AlphaFold2 (AF) is a promising tool, but is it accurate enough to predict single mutation effects? Here, we report that the localized structural deformation between protein pairs differing by only 1-3 mutations, as measured by the effective strain, is correlated across 3901 experimental and AF-predicted structures. Furthermore, analysis of ∼11 000 proteins shows that the local structural change correlates with various phenotypic changes. These findings suggest that AF can predict the range and magnitude of single-mutation effects on average, and we propose a method to improve precision of AF predictions and to indicate when predictions are unreliable.
Affiliation(s)
- John M McBride
- Center for Soft and Living Matter, Institute for Basic Science, Ulsan 44919, South Korea
- Konstantin Polev
- Center for Soft and Living Matter, Institute for Basic Science, Ulsan 44919, South Korea
- Department of Biomedical Engineering, Ulsan National Institute of Science and Technology, Ulsan 44919, South Korea
- Amirbek Abdirasulov
- Department of Computer Science and Engineering, Ulsan National Institute of Science and Technology, Ulsan 44919, South Korea
- Bartosz A Grzybowski
- Center for Soft and Living Matter, Institute for Basic Science, Ulsan 44919, South Korea
- Departments of Physics and Chemistry, Ulsan National Institute of Science and Technology, Ulsan 44919, South Korea
- Tsvi Tlusty
- Center for Soft and Living Matter, Institute for Basic Science, Ulsan 44919, South Korea
- Departments of Physics and Chemistry, Ulsan National Institute of Science and Technology, Ulsan 44919, South Korea
4
Mani S, Tlusty T. Gene birth in a model of non-genic adaptation. BMC Biol 2023; 21:257. [PMID: 37957718 PMCID: PMC10644530 DOI: 10.1186/s12915-023-01745-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/18/2022] [Accepted: 10/24/2023] [Indexed: 11/15/2023] [Open Access]
Abstract
BACKGROUND: Over evolutionary timescales, genomic loci can switch between functional and non-functional states through processes such as pseudogenization and de novo gene birth. Particularly, de novo gene birth is a widespread process, and many examples continue to be discovered across diverse evolutionary lineages. However, the general mechanisms that lead to functionalization are poorly understood, and estimated rates of de novo gene birth remain contentious. Here, we address this problem within a model that takes into account mutations and structural variation, allowing us to estimate the likelihood of emergence of new functions at non-functional loci. RESULTS: Assuming biologically reasonable mutation rates and mutational effects, we find that functionalization of non-genic loci requires the realization of strict conditions. This is in line with the observation that most de novo genes are localized to the vicinity of established genes. Our model also provides an explanation for the empirical observation that emerging proto-genes are often lost despite showing signs of adaptation. CONCLUSIONS: Our work elucidates the properties of non-genic loci that make them fertile for adaptation, and our results offer mechanistic insights into the process of de novo gene birth.
Affiliation(s)
- Somya Mani
- Center for Soft and Living Matter, Institute for Basic Science, Ulsan 44919, Republic of Korea.
- Tsvi Tlusty
- Center for Soft and Living Matter, Institute for Basic Science, Ulsan 44919, Republic of Korea
- Departments of Physics and Chemistry, Ulsan National Institute of Science and Technology (UNIST), Ulsan 44919, Republic of Korea
5
Jordan DJ, Miska EA. Canalisation and plasticity on the developmental manifold of Caenorhabditis elegans. Mol Syst Biol 2023; 19:e11835. [PMID: 37850520 PMCID: PMC10632735 DOI: 10.15252/msb.202311835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/21/2023] [Revised: 09/26/2023] [Accepted: 10/05/2023] [Indexed: 10/19/2023] [Open Access]
Abstract
How do the same mechanisms that faithfully regenerate complex developmental programmes in spite of environmental and genetic perturbations also allow responsiveness to environmental signals, adaptation and genetic evolution? Using the nematode Caenorhabditis elegans as a model, we explore the phenotypic space of growth and development in various genetic and environmental contexts. Our data are growth curves and developmental parameters obtained by automated microscopy. Using these, we show that among the traits that make up the developmental space, correlations within a particular context are predictive of correlations among different contexts. Furthermore, we find that the developmental variability of this animal can be captured on a relatively low-dimensional phenotypic manifold and that on this manifold, genetic and environmental contributions to plasticity can be deconvolved independently. Our perspective offers a new way of understanding the relationship between robustness and flexibility in complex systems, suggesting that projection and concentration of dimension can naturally align these forces as complementary rather than competing.
Affiliation(s)
- David J Jordan
- Department of Biochemistry, University of Cambridge, Cambridge, UK
- Eric A Miska
- Department of Biochemistry, University of Cambridge, Cambridge, UK
6
Tang QY, Ren W, Wang J, Kaneko K. The Statistical Trends of Protein Evolution: A Lesson from AlphaFold Database. Mol Biol Evol 2022; 39:6701686. [PMID: 36108094 PMCID: PMC9550990 DOI: 10.1093/molbev/msac197] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Indexed: 12/15/2022] [Open Access]
Abstract
The recent development of artificial intelligence provides us with new and powerful tools for studying the mysterious relationship between organism evolution and protein evolution. In this work, based on the AlphaFold Protein Structure Database (AlphaFold DB), we perform comparative analyses of the proteins of different organisms. The statistics of AlphaFold-predicted structures show that, for organisms with higher complexity, their constituent proteins will have larger radii of gyration, higher coil fractions, and slower vibrations, statistically. By conducting normal mode analysis and scaling analyses, we demonstrate that higher organismal complexity correlates with lower fractal dimensions in both the structure and dynamics of the constituent proteins, suggesting that higher functional specialization is associated with higher organismal complexity. We also uncover the topology and sequence bases of these correlations. As the organismal complexity increases, the residue contact networks of the constituent proteins will be more assortative, and these proteins will have a higher degree of hydrophilic-hydrophobic segregation in the sequences. Furthermore, by comparing the statistical structural proximity across the proteomes with the phylogenetic tree of homologous proteins, we show that, statistical structural proximity across the proteomes may indirectly reflect the phylogenetic proximity, indicating a statistical trend of protein evolution in parallel with organism evolution. This study provides new insights into how the diversity in the functionality of proteins increases and how the dimensionality of the manifold of protein dynamics reduces during evolution, contributing to the understanding of the origin and evolution of lives.
Affiliation(s)
- Weitong Ren
- Theoretical Molecular Science Laboratory, RIKEN Cluster for Pioneering Research, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan
- Jun Wang
- School of Physics, National Laboratory of Solid State Microstructure, and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, People’s Republic of China
7
Gopalakrishnappa C, Gowda K, Prabhakara KH, Kuehn S. An ensemble approach to the structure-function problem in microbial communities. iScience 2022; 25:103761. [PMID: 35141504 PMCID: PMC8810406 DOI: 10.1016/j.isci.2022.103761] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Indexed: 11/30/2022] [Open Access]
Abstract
The metabolic activity of microbial communities plays a primary role in the flow of essential nutrients throughout the biosphere. Molecular genetics has revealed the metabolic pathways that model organisms utilize to generate energy and biomass, but we understand little about how the metabolism of diverse, natural communities emerges from the collective action of its constituents. We propose that quantifying and mapping metabolic fluxes to sequencing measurements of genomic, taxonomic, or transcriptional variation across an ensemble of diverse communities, either in the laboratory or in the wild, can reveal low-dimensional descriptions of community structure that can explain or predict their emergent metabolic activity. We survey the types of communities for which this approach might be best suited, review the analytical techniques available for quantifying metabolite fluxes in communities, and discuss what types of data analysis approaches might be lucrative for learning the structure-function mapping in communities from these data.
Affiliation(s)
- Karna Gowda
- Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637, USA
- Center for the Physics of Evolving Systems, University of Chicago, Chicago, IL 60637, USA
- Kaumudi H. Prabhakara
- Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637, USA
- Center for the Physics of Evolving Systems, University of Chicago, Chicago, IL 60637, USA
- Seppe Kuehn
- Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637, USA
- Center for the Physics of Evolving Systems, University of Chicago, Chicago, IL 60637, USA
8
A topological look into the evolution of developmental programs. Biophys J 2021; 120:4193-4201. [PMID: 34480926 PMCID: PMC8516677 DOI: 10.1016/j.bpj.2021.08.044] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 07/04/2021] [Revised: 07/13/2021] [Accepted: 08/30/2021] [Indexed: 01/06/2023] [Open Access]
Abstract
Rapid advance of experimental techniques provides an unprecedented in-depth view into complex developmental processes. Still, little is known on how the complexity of multicellular organisms evolved by elaborating developmental programs and inventing new cell types. A hurdle to understanding developmental evolution is the difficulty of even describing the intertwined network of spatiotemporal processes underlying the development of complex multicellular organisms. Nonetheless, an overview of developmental trajectories can be obtained from cell type lineage maps. Here, we propose that these lineage maps can also reveal how developmental programs evolve: the modes of evolving new cell types in an organism should be visible in its developmental trajectories and therefore in the geometry of its cell type lineage map. This idea is demonstrated using a parsimonious generative model of developmental programs, which allows us to reliably survey the universe of all possible programs and examine their topological features. We find that, contrary to belief, tree-like lineage maps are rare, and lineage maps of complex multicellular organisms are likely to be directed acyclic graphs in which multiple developmental routes can converge on the same cell type. Although cell type evolution prescribes what developmental programs come into existence, natural selection prunes those programs that produce low-functioning organisms. Our model indicates that additionally, lineage map topologies are correlated with such a functional property: the ability of organisms to regenerate.
9
Leahy BD, Racowsky C, Needleman D. Inferring simple but precise quantitative models of human oocyte and early embryo development. J R Soc Interface 2021; 18:20210475. [PMID: 34493094 PMCID: PMC8424348 DOI: 10.1098/rsif.2021.0475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/08/2021] [Accepted: 08/16/2021] [Indexed: 11/12/2022] [Open Access]
Abstract
Macroscopic, phenomenological models are useful as concise framings of our understandings in fields from statistical physics to finance to biology. Constructing a phenomenological model for development would provide a framework for understanding the complicated, regulatory nature of oogenesis and embryogenesis. Here, we use a data-driven approach to infer quantitative, precise models of human oocyte maturation and pre-implantation embryo development, by analysing clinical in-vitro fertilization (IVF) data on 7399 IVF cycles resulting in 57 827 embryos. Surprisingly, we find that both oocyte maturation and early embryo development are quantitatively described by simple models with minimal interactions. This simplicity suggests that oogenesis and embryogenesis are composed of modular processes that are relatively siloed from one another. In particular, our analysis provides strong evidence that (i) pre-antral follicles produce anti-Müllerian hormone independently of effects from other follicles, (ii) oocytes mature to metaphase-II independently of the woman's age, her BMI and other factors, (iii) early embryo development is memoryless for the variables assessed here, in that the probability of an embryo transitioning from its current developmental stage to the next is independent of its previous stage. Our results both provide insight into the fundamentals of oogenesis and embryogenesis and have implications for the clinical IVF.
Affiliation(s)
- Brian D. Leahy
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- SEAS, Harvard University, Cambridge, MA, USA
- Catherine Racowsky
- Brigham Women’s Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
- Daniel Needleman
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- SEAS, Harvard University, Cambridge, MA, USA
- Center for Computational Biology, Flatiron Institute, New York, NY, USA