1
|
RNA World Modeling: A Comparison of Two Complementary Approaches. ENTROPY 2022; 24:e24040536. [PMID: 35455198 PMCID: PMC9027272 DOI: 10.3390/e24040536] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 04/08/2022] [Accepted: 04/09/2022] [Indexed: 12/02/2022]
Abstract
Simple Summary Despite years of dedicated research, scientists are still not sure what the first ”living” cell would have looked like. One of the most well-known hypotheses is the RNA world hypothesis, which assumes that, in the beginning, life relied on RNA molecules instead of DNA as information carriers and primitive enzymes. The population of such RNAs is made up of self-replicating molecules (replicases) that could make copies of themselves and parasite molecules that could only be copied by replicases. In this study, we further investigated the interplay between these hypothetical prebiotic RNA species, since it plays a crucial role in generating diversity and complexity in prebiotic molecular evolution. We compared two approaches that are commonly used to investigate such simple prebiotic systems, representing different modeling and observation scales—namely, microscopic and macroscopic. In both cases, we were able to obtain consistent results. Abstract The origin of life remains one of the major scientific questions in modern biology. Among many hypotheses aiming to explain how life on Earth started, RNA world is probably the most extensively studied. It assumes that, in the very beginning, RNA molecules served as both enzymes and as genetic information carriers. However, even if this is true, there are many questions that still need to be answered—for example, whether the population of such molecules could achieve stability and retain genetic information for many generations, which is necessary in order for evolution to start. In this paper, we try to answer this question based on the parasite–replicase model (RP model), which divides RNA molecules into enzymes (RNA replicases) capable of catalyzing replication and parasites that do not possess replicase activity but can be replicated by RNA replicases. We describe the aforementioned system using partial differential equations and, based on the analysis of the simulation, surmise general rules governing its evolution. We also compare this approach with one where the RP system is modeled and implemented using a multi-agent modeling technique. We show that approaching the description and analysis of the RP system from different perspectives (microscopic represented by MAS and macroscopic depicted by PDE) provides consistent results. Therefore, applying MAS does not lead to erroneous results and allows us to study more complex situations where many cases are concerned, which would not be possible through the PDE model.
Collapse
|
2
|
Kaden M, Bohnsack KS, Weber M, Kudła M, Gutowska K, Blazewicz J, Villmann T. Learning vector quantization as an interpretable classifier for the detection of SARS-CoV-2 types based on their RNA sequences. Neural Comput Appl 2021; 34:67-78. [PMID: 33935376 PMCID: PMC8076884 DOI: 10.1007/s00521-021-06018-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 04/07/2021] [Indexed: 02/06/2023]
Abstract
We present an approach to discriminate SARS-CoV-2 virus types based on their RNA sequence descriptions avoiding a sequence alignment. For that purpose, sequences are preprocessed by feature extraction and the resulting feature vectors are analyzed by prototype-based classification to remain interpretable. In particular, we propose to use variants of learning vector quantization (LVQ) based on dissimilarity measures for RNA sequence data. The respective matrix LVQ provides additional knowledge about the classification decisions like discriminant feature correlations and, additionally, can be equipped with easy to realize reject options for uncertain data. Those options provide self-controlled evidence, i.e., the model refuses to make a classification decision if the model evidence for the presented data is not sufficient. This model is first trained using a GISAID dataset with given virus types detected according to the molecular differences in coronavirus populations by phylogenetic tree clustering. In a second step, we apply the trained model to another but unlabeled SARS-CoV-2 virus dataset. For these data, we can either assign a virus type to the sequences or reject atypical samples. Those rejected sequences allow to speculate about new virus types with respect to nucleotide base mutations in the viral sequences. Moreover, this rejection analysis improves model robustness. Last but not least, the presented approach has lower computational complexity compared to methods based on (multiple) sequence alignment. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s00521-021-06018-2.
Collapse
Affiliation(s)
- Marika Kaden
- University of Applied Sciences Mittweida, Technikumplatz 17, 09648 Mittweida, Germany
- Saxon Institute for Computational Intelligence and Machine Learning, Technikumplatz 17, 09648 Mittweida, Germany
| | - Katrin Sophie Bohnsack
- University of Applied Sciences Mittweida, Technikumplatz 17, 09648 Mittweida, Germany
- Saxon Institute for Computational Intelligence and Machine Learning, Technikumplatz 17, 09648 Mittweida, Germany
| | - Mirko Weber
- University of Applied Sciences Mittweida, Technikumplatz 17, 09648 Mittweida, Germany
- Saxon Institute for Computational Intelligence and Machine Learning, Technikumplatz 17, 09648 Mittweida, Germany
| | - Mateusz Kudła
- University of Applied Sciences Mittweida, Technikumplatz 17, 09648 Mittweida, Germany
- Institute of Computing Science, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
| | - Kaja Gutowska
- Institute of Computing Science, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
- European Centre for Bioinformatics and Genomics, Piotrowo 2, 60-965 Poznan, Poland
| | - Jacek Blazewicz
- Institute of Computing Science, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
- European Centre for Bioinformatics and Genomics, Piotrowo 2, 60-965 Poznan, Poland
| | - Thomas Villmann
- University of Applied Sciences Mittweida, Technikumplatz 17, 09648 Mittweida, Germany
- Saxon Institute for Computational Intelligence and Machine Learning, Technikumplatz 17, 09648 Mittweida, Germany
| |
Collapse
|
3
|
Synak J, Rybarczyk A, Blazewicz J. Multi-agent approach to sequence structure simulation in the RNA World hypothesis. PLoS One 2020; 15:e0238253. [PMID: 32857812 PMCID: PMC7455006 DOI: 10.1371/journal.pone.0238253] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Accepted: 08/12/2020] [Indexed: 12/03/2022] Open
Abstract
The origins of life on Earth have been the subject of inquiry since the early days of philosophical thought and are still intensively investigated by the researchers around the world. One of the theories explaining the life emergence, that gained the most attention recently is the RNA World hypothesis, which assumes that life on Earth was sparked by replicating RNA chains. Since wet lab analysis is time-consuming, many mathematical and computational approaches have been proposed that try to explain the origins of life. Recently proposed one, based on the work by Takeuchi and Hogeweg, addresses the problem of interplay between RNA replicases and RNA parasitic species, which is crucial for understanding the first steps of prebiotic evolution. In this paper, the aforementioned model has been extended and modified by introducing RNA sequence (structure) information and mutation rate close to real one. It allowed to observe the simple evolution mechanisms, which could have led to the more complicated systems and eventually, to the formation of the first cells. The main goal of this study was to determine the conditions that allowed the spontaneous emergence and evolution of the prebiotic replicases equipped with simple functional domains within a large population. Here we show that polymerase ribozymes could have appeared randomly and then quickly started to copy themselves in order for the system to reach equilibrium. It has been shown that evolutionary selection works even in the simplest systems.
Collapse
Affiliation(s)
- Jaroslaw Synak
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
- European Center for Bioinformatics and Genomics, Poznan, Poland
| | - Agnieszka Rybarczyk
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
- European Center for Bioinformatics and Genomics, Poznan, Poland
- * E-mail: (JB); (AR)
| | - Jacek Blazewicz
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
- European Center for Bioinformatics and Genomics, Poznan, Poland
- * E-mail: (JB); (AR)
| |
Collapse
|
4
|
Prebiotic Soup Components Trapped in Montmorillonite Nanoclay Form New Molecules: Car-Parrinello Ab Initio Simulations. Life (Basel) 2019; 9:life9020046. [PMID: 31167366 PMCID: PMC6617125 DOI: 10.3390/life9020046] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Revised: 05/30/2019] [Accepted: 05/30/2019] [Indexed: 01/08/2023] Open
Abstract
The catalytic effects of complex minerals or meteorites are often mentioned as important factors for the origins of life. To assess the possible role of nanoconfinement within a catalyst consisting of montmorillonite (MMT) and the impact of local electric field on the formation efficiency of the simple hypothetical precursors of nucleic acid bases or amino acids, we performed ab initio Car–Parrinello molecular dynamics simulations. We prepared four condensed-phase systems corresponding to previously suggested prototypes of a primordial soup. We monitored possible chemical reactions occurring within gas-like bulk and MMT-confined four simulation boxes on a 20-ps time scale at 1 atm and 300 K, 400 K, and 600 K. Elevated temperatures did not affect the reactivity of the elementary components of the gas-like boxes considerably; however, the presence of the MMT nanoclay substantially increased the formation probability of new molecules. Approximately 20 different new compounds were found in boxes containing carbon monoxide or formaldehyde molecules. This observation and an analysis of the atom–atom radial distribution functions indicated that the presence of Ca2+ ions at the surface of the internal MMT cavities may be an important factor in the initial steps of the formation of complex molecules at the early stages of the Earth’s history.
Collapse
|
5
|
Wasik S, Szostak N, Kudla M, Wachowiak M, Krawiec K, Blazewicz J. Detecting life signatures with RNA sequence similarity measures. J Theor Biol 2018; 463:110-120. [PMID: 30562502 DOI: 10.1016/j.jtbi.2018.12.018] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Revised: 10/25/2018] [Accepted: 12/14/2018] [Indexed: 12/20/2022]
Abstract
The RNA World is currently the most plausible hypothesis for explaining the origins of life on Earth. The supporting body of evidence is growing and it comes from multiple areas, including astrobiology, chemistry, biology, mathematics, and, in particular, from computer simulations. Such methods frequently assume the existence of a hypothetical species on Earth, around three billion years ago, with a base sequence probably dissimilar from any in known genomes. However, it is often hard to verify whether or not a hypothetical sequence has the characteristics of biological sequences, and is thus likely to be functional. The primary objective of the presented research was to verify the possibility of building a computational 'life probe' for determining whether a given genetic sequence is biological, and assessing the sensitivity of such probes to the signatures of life present in known biological sequences. We have proposed decision algorithms based on the normalized compression distance (NCD) and Levenshtein distance (LD). We have validated the proposed method in the context of the RNA World hypothesis using short genetic sequences shorter than the error threshold value (i.e., 100 nucleotides). We have demonstrated that both measures can be successfully used to construct life probes that are significantly better than a random decision procedure, while varying from each other when it comes to detailed characteristics. We also observed that fragments of sequences related to replication have better discriminatory power than sequences having other molecular functions. In a broader context, this shows that the signatures of life in short RNA samples can be effectively detected using relatively simple means.
Collapse
Affiliation(s)
- Szymon Wasik
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland; Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland; European Centre for Bioinformatics and Genomics, Poznan, Poland.
| | - Natalia Szostak
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland; Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland; European Centre for Bioinformatics and Genomics, Poznan, Poland
| | - Mateusz Kudla
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Michal Wachowiak
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Krzysztof Krawiec
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Jacek Blazewicz
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland; Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland; European Centre for Bioinformatics and Genomics, Poznan, Poland
| |
Collapse
|
6
|
Abstract
Abstract
Crowdsourcing is a very effective technique for outsourcing work to a vast network usually comprising anonymous people. In this study, we review the application of crowdsourcing to modeling systems originating from systems biology. We consider a variety of verified approaches, including well-known projects such as EyeWire, FoldIt, and DREAM Challenges, as well as novel projects conducted at the European Center for Bioinformatics and Genomics. The latter projects utilized crowdsourced serious games to design models of dynamic biological systems, and it was demonstrated that these models could be used successfully to involve players without domain knowledge. We conclude the review of these systems by providing 10 guidelines to facilitate the efficient use of crowdsourcing.
Collapse
|
7
|
Mietchen D, Wodak S, Wasik S, Szostak N, Dessimoz C. Submit a Topic Page to PLOS Computational Biology and Wikipedia. PLoS Comput Biol 2018; 14:e1006137. [PMID: 29851950 PMCID: PMC5978877 DOI: 10.1371/journal.pcbi.1006137] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Affiliation(s)
- Daniel Mietchen
- Data Science Institute, University of Virginia, Charlottesville, Virginia, United States of America
- * E-mail:
| | - Shoshana Wodak
- Vlaams Instituut voor Biotechnologie-Vrije Universiteit Brussel Centre for Structural Biology, Brussels, Belgium
| | - Szymon Wasik
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
- European Centre for Bioinformatics and Genomics, Poznan University of Technology, Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Natalia Szostak
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
- European Centre for Bioinformatics and Genomics, Poznan University of Technology, Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Christophe Dessimoz
- University College London, London, United Kingdom
- University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
8
|
Szilágyi A, Zachar I, Scheuring I, Kun Á, Könnyű B, Czárán T. Ecology and Evolution in the RNA World Dynamics and Stability of Prebiotic Replicator Systems. Life (Basel) 2017; 7:E48. [PMID: 29186916 PMCID: PMC5745561 DOI: 10.3390/life7040048] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2017] [Revised: 11/09/2017] [Accepted: 11/13/2017] [Indexed: 11/16/2022] Open
Abstract
As of today, the most credible scientific paradigm pertaining to the origin of life on Earth is undoubtedly the RNA World scenario. It is built on the assumption that catalytically active replicators (most probably RNA-like macromolecules) may have been responsible for booting up life almost four billion years ago. The many different incarnations of nucleotide sequence (string) replicator models proposed recently are all attempts to explain on this basis how the genetic information transfer and the functional diversity of prebiotic replicator systems may have emerged, persisted and evolved into the first living cell. We have postulated three necessary conditions for an RNA World model system to be a dynamically feasible representation of prebiotic chemical evolution: (1) it must maintain and transfer a sufficient diversity of information reliably and indefinitely, (2) it must be ecologically stable and (3) it must be evolutionarily stable. In this review, we discuss the best-known prebiotic scenarios and the corresponding models of string-replicator dynamics and assess them against these criteria. We suggest that the most popular of prebiotic replicator systems, the hypercycle, is probably the worst performer in almost all of these respects, whereas a few other model concepts (parabolic replicator, open chaotic flows, stochastic corrector, metabolically coupled replicator system) are promising candidates for development into coherent models that may become experimentally accessible in the future.
Collapse
Affiliation(s)
- András Szilágyi
- Evolutionary Systems Research Group, MTA, Centre for Ecological Research, Hungarian Academy of Sciences, Klebelsberg Kuno u. 3, 8237 Tihany, Hungary.
- Center for the Conceptual Foundations of Science, Parmenides Foundation, Kirchplatz 1, 82049 Pullach/Munich, Germany.
- MTA-ELTE Theoretical Biology and Evolutionary Ecology Research Group, Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös Loránd University, Pázmány Péter sétány. 1/c, 1117 Budapest, Hungary.
| | - István Zachar
- Evolutionary Systems Research Group, MTA, Centre for Ecological Research, Hungarian Academy of Sciences, Klebelsberg Kuno u. 3, 8237 Tihany, Hungary.
- Center for the Conceptual Foundations of Science, Parmenides Foundation, Kirchplatz 1, 82049 Pullach/Munich, Germany.
- MTA-ELTE Theoretical Biology and Evolutionary Ecology Research Group, Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös Loránd University, Pázmány Péter sétány. 1/c, 1117 Budapest, Hungary.
| | - István Scheuring
- Evolutionary Systems Research Group, MTA, Centre for Ecological Research, Hungarian Academy of Sciences, Klebelsberg Kuno u. 3, 8237 Tihany, Hungary.
- MTA-ELTE Theoretical Biology and Evolutionary Ecology Research Group, Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös Loránd University, Pázmány Péter sétány. 1/c, 1117 Budapest, Hungary.
| | - Ádám Kun
- Evolutionary Systems Research Group, MTA, Centre for Ecological Research, Hungarian Academy of Sciences, Klebelsberg Kuno u. 3, 8237 Tihany, Hungary.
- Center for the Conceptual Foundations of Science, Parmenides Foundation, Kirchplatz 1, 82049 Pullach/Munich, Germany.
- MTA-ELTE Theoretical Biology and Evolutionary Ecology Research Group, Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös Loránd University, Pázmány Péter sétány. 1/c, 1117 Budapest, Hungary.
| | - Balázs Könnyű
- Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös Loránd University, Pázmány Péter sétány. 1/c, 1117 Budapest, Hungary.
| | - Tamás Czárán
- Evolutionary Systems Research Group, MTA, Centre for Ecological Research, Hungarian Academy of Sciences, Klebelsberg Kuno u. 3, 8237 Tihany, Hungary.
- MTA-ELTE Theoretical Biology and Evolutionary Ecology Research Group, Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös Loránd University, Pázmány Péter sétány. 1/c, 1117 Budapest, Hungary.
- Biocomplexity Group, Niels Bohr Institute, Copenhagen University, Blegdamsvej 17, 2100 Copenhagen, Denmark.
| |
Collapse
|