1
|
Banerjee K, Das B. Elucidating the link between binding statistics and Shannon information in biological networks. J Chem Phys 2024; 161:125102. [PMID: 39319659 DOI: 10.1063/5.0226904] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Accepted: 09/10/2024] [Indexed: 09/26/2024] Open
Abstract
The response of a biological network to ligand binding is of crucial importance for regulatory control in various cellular biophysical processes that is achieved with information transmission through the different ligand-bound states of such networks. In this work, we address a vital issue regarding the link between the information content of such network states and the experimentally measurable binding statistics. Several fundamental networks of cooperative ligand binding, with the bound states being adjacent in time only and in both space and time, are considered for this purpose using the chemical master equation approach. To express the binding characteristics in the language of information, a quantity denoted as differential information index is employed based on the Shannon information. The index, determined for the whole network, follows a linear relationship with (logarithmic) ligand concentration with a slope equal to the size of the system. On the other hand, the variation of Shannon information associated with the individual network states and the logarithmic sensitivity of its slope are shown to have generic forms related to the average binding number and variance, respectively, the latter yielding the Hill slope, the phenomenological measure of cooperativity. Furthermore, the variation of Shannon information entropy, the average of Shannon information, is also shown to be related to the average binding.
Collapse
Affiliation(s)
- Kinshuk Banerjee
- Department of Chemistry, Acharya Jagadish Chandra Bose College, 1/1B A. J. C. Bose Road, Kolkata 700 020, India
| | - Biswajit Das
- School of Artificial Intelligence (AI), Amrita Vishwa Vidyapeetham (Amrita University), Amritanagar, Ettimadai, Coimbatore, Tamil Nadu 641112, India
| |
Collapse
|
2
|
Sato T. Application of a novel numerical simulation to biochemical reaction systems. Front Cell Dev Biol 2024; 12:1351974. [PMID: 39310225 PMCID: PMC11412882 DOI: 10.3389/fcell.2024.1351974] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 08/09/2024] [Indexed: 09/25/2024] Open
Abstract
Recent advancements in omics and single-cell analysis highlight the necessity of numerical methods for managing the complexity of biological data. This paper introduces a simulation program for biochemical reaction systems based on the natural number simulation (NNS) method. This novel approach ensures the equitable treatment of all molecular entities, such as DNA, proteins, H2O, and hydrogen ions (H+), in biological systems. Central to NNS is its use of stoichiometric formulas, simplifying the modeling process and facilitating efficient and accurate simulations of diverse biochemical reactions. The advantage of this method is its ability to manage all molecules uniformly, ensuring a balanced representation in simulations. Detailed in Python, NNS is adept at simulating various reactions, ranging from water ionization to Michaelis-Menten kinetics and complex gene-based systems, making it an effective tool for scientific and engineering research.
Collapse
Affiliation(s)
- Takashi Sato
- Digital Engineering Team, Production Tech. Lab, Research and Development Center, Zeon Corporation, Tokyo, Kanagawa, Japan
| |
Collapse
|
3
|
Santoni D. An entropy-based study on the mutational landscape of SARS-CoV-2 in USA: Comparing different variants and revealing co-mutational behavior of proteins. Gene 2024; 922:148556. [PMID: 38754568 DOI: 10.1016/j.gene.2024.148556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 05/08/2024] [Accepted: 05/09/2024] [Indexed: 05/18/2024]
Abstract
COVID-19 emergency has pushed the international scientific community to use every resource to combat the spread of the virus, to understand its biology and predict its possible evolution in terms of new variants. Since the first SARS-CoV-2 virus nucleotide and amino acid sequences were made available, information theory was used to study how viral information content was changing over time and then trace the evolution of its mutational landscape. In this work we analyzed SARS-CoV-2 sequences collected mainly in the USA in a period from March 2020 until December 2022 and computed mutation profiles of viral proteins over time through an entropy-based approach using Shannon Entropy and Hellinger distance. This representation allows an at-a-glance view of the mutational landscape of viral proteins over time and can provide new insights on the evolution of the virus from different points of view. Non-structural proteins typically showed flat mutation profiles, characterized by a very low Average mutation Entropy, while accessory and structural proteins showed mostly non uniform and high mutation profiles, often coupled with the predominance of variants. Interestingly NSP2 protein, whose function is currently still debated, falls in the same branch of NSP14 and NSP10 in the phylogenetic tree of mutations constructed through correlations of mutation profiles, suggesting a co-evolution of those proteins and a possible functional link with each other. To the best of our knowledge this is the first study based on a massive amount of data (n = 107,939,973) that analyzes from an entropy point of view the mutational landscape of SARS-CoV-2 over time and depicts a mutational temporal profile of each protein of the virus.
Collapse
Affiliation(s)
- Daniele Santoni
- Institute for System Analysis and Computer Science "Antonio Ruberti", National Research Council of Italy, Via dei Taurini 19, Rome 00185, Italy.
| |
Collapse
|
4
|
Bajić D. Information Theory, Living Systems, and Communication Engineering. ENTROPY (BASEL, SWITZERLAND) 2024; 26:430. [PMID: 38785679 PMCID: PMC11120474 DOI: 10.3390/e26050430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Revised: 05/08/2024] [Accepted: 05/17/2024] [Indexed: 05/25/2024]
Abstract
Mainstream research on information theory within the field of living systems involves the application of analytical tools to understand a broad range of life processes. This paper is dedicated to an opposite problem: it explores the information theory and communication engineering methods that have counterparts in the data transmission process by way of DNA structures and neural fibers. Considering the requirements of modern multimedia, transmission methods chosen by nature may be different, suboptimal, or even far from optimal. However, nature is known for rational resource usage, so its methods have a significant advantage: they are proven to be sustainable. Perhaps understanding the engineering aspects of methods of nature can inspire a design of alternative green, stable, and low-cost transmission.
Collapse
Affiliation(s)
- Dragana Bajić
- Department of Communications and Signal Processing, Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovica 6, 21000 Novi Sad, Serbia
| |
Collapse
|
5
|
Lam WYS, Kwong E, Chan HWT, Zheng YP. Using Sequence Analyses to Quantitatively Measure Oropharyngeal Swallowing Temporality in Point-of-Care Ultrasound Examinations: A Pilot Study. J Clin Med 2024; 13:2288. [PMID: 38673561 PMCID: PMC11051012 DOI: 10.3390/jcm13082288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 04/02/2024] [Accepted: 04/12/2024] [Indexed: 04/28/2024] Open
Abstract
(1) Background: Swallowing is a complex process that comprises well-timed control of oropharyngeal and laryngeal structures to achieve airway protection and swallowing efficiency. To understand its temporality, previous research adopted adherence measures and revealed obligatory pairs in healthy swallows and the effect of aging and bolus type on the variability of event timing and order. This study aimed to (i) propose a systemic conceptualization of swallowing physiology, (ii) apply sequence analyses, a set of information-theoretic and bioinformatic methods, to quantify and characterize swallowing temporality, and (iii) investigate the effect of aging and dysphagia on the quantified variables using sequence analyses measures. (2) Method: Forty-three participants (17 young adults, 15 older adults, and 11 dysphagic adults) underwent B-mode ultrasound swallowing examinations at the mid-sagittal plane of the submental region. The onset, maximum, and offset states of hyoid bone displacement, geniohyoid muscle contraction, and tongue base retraction were identified and sorted to form sequences which were analyzed using an inventory of sequence analytic techniques; namely, overlap coefficients, Shannon entropy, and longest common subsequence algorithms. (3) Results: The concurrency of movement sequence was found to be significantly impacted by aging and dysphagia. Swallowing sequence variability was also found to be reduced with age and the presence of dysphagia (H(2) = 52.253, p < 0.001, η2 = 0.260). Four obligatory sequences were identified, and high adherence was also indicated in two previously reported pairs. These results provided preliminary support for the validity of sequence analyses for quantifying swallowing sequence temporality. (4) Conclusions: A systemic conceptualization of human deglutition permits a multi-level quantitative analysis of swallowing physiology. Sequence analyses are a set of promising quantitative measurement techniques for point-of-care ultrasound (POCUS) swallowing examinations and outcome measures for swallowing rehabilitation and evaluation of associated physiological conditions, such as sarcopenia. Findings in the current study revealed physiological differences among healthy young, healthy older, and dysphagic adults. They also helped lay the groundwork for future AI-assisted dysphagia assessment and outcome measures using POCUSs. Arguably, the proposed conceptualization and analyses are also modality-independent measures that can potentially be generalized for other instrumental swallowing assessment modalities.
Collapse
Affiliation(s)
- Wilson Yiu Shun Lam
- Department of Chinese and Bilingual Studies, The Hong Kong Polytechnic University, Hong Kong SAR, China (H.W.T.C.)
- Research Institute for Smart Ageing, The Hong Kong Polytechnic University, Hong Kong SAR, China;
| | - Elaine Kwong
- Department of Chinese and Bilingual Studies, The Hong Kong Polytechnic University, Hong Kong SAR, China (H.W.T.C.)
- Research Institute for Smart Ageing, The Hong Kong Polytechnic University, Hong Kong SAR, China;
| | - Huberta Wai Tung Chan
- Department of Chinese and Bilingual Studies, The Hong Kong Polytechnic University, Hong Kong SAR, China (H.W.T.C.)
| | - Yong-Ping Zheng
- Research Institute for Smart Ageing, The Hong Kong Polytechnic University, Hong Kong SAR, China;
- Department of Biomedical Engineering, The Hong Kong Polytechnic University, Hong Kong SAR, China
| |
Collapse
|
6
|
Jeuken GS, Käll L. Pathway analysis through mutual information. Bioinformatics 2024; 40:btad776. [PMID: 38195928 PMCID: PMC10783954 DOI: 10.1093/bioinformatics/btad776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 12/09/2023] [Accepted: 01/08/2024] [Indexed: 01/11/2024] Open
Abstract
MOTIVATION In pathway analysis, we aim to establish a connection between the activity of a particular biological pathway and a difference in phenotype. There are many available methods to perform pathway analysis, many of them rely on an upstream differential expression analysis, and many model the relations between the abundances of the analytes in a pathway as linear relationships. RESULTS Here, we propose a new method for pathway analysis, MIPath, that relies on information theoretical principles and, therefore, does not model the association between pathway activity and phenotype, resulting in relatively few assumptions. For this, we construct a graph of the data points for each pathway using a nearest-neighbor approach and score the association between the structure of this graph and the phenotype of these same samples using Mutual Information while adjusting for the effects of random chance in each score. The initial nearest neighbor approach evades individual gene-level comparisons, hence making the method scalable and less vulnerable to missing values. These properties make our method particularly useful for single-cell data. We benchmarked our method on several single-cell datasets, comparing it to established and new methods, and found that it produces robust, reproducible, and meaningful scores. AVAILABILITY AND IMPLEMENTATION Source code is available at https://github.com/statisticalbiotechnology/mipath, or through Python Package Index as "mipathway."
Collapse
Affiliation(s)
- Gustavo S Jeuken
- Science for Life Laboratory, KTH – Royal Institute of Technology, Stockholm 171 65, Sweden
- Computer Science Department, Vrije Universiteit Amsterdam, Amsterdam 1081 HV, The Netherlands
| | - Lukas Käll
- Science for Life Laboratory, KTH – Royal Institute of Technology, Stockholm 171 65, Sweden
| |
Collapse
|
7
|
Formentin M, Chignola R, Favretti M. Optimal entropic properties of SARS-CoV-2 RNA sequences. ROYAL SOCIETY OPEN SCIENCE 2024; 11:231369. [PMID: 38298394 PMCID: PMC10827432 DOI: 10.1098/rsos.231369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 01/02/2024] [Indexed: 02/02/2024]
Abstract
The reaction of the scientific community against the COVID-19 pandemic has generated a huge (approx. 106 entries) dataset of genome sequences collected worldwide and spanning a relatively short time window. These unprecedented conditions together with the certain identification of the reference viral genome sequence allow for an original statistical study of mutations in the virus genome. In this paper, we compute the Shannon entropy of every sequence in the dataset as well as the relative entropy and the mutual information between the reference sequence and the mutated ones. These functions, originally developed in information theory, measure the information content of a sequence and allows us to study the random character of mutation mechanism in terms of its entropy and information gain or loss. We show that this approach allows us to set in new format known features of the SARS-CoV-2 mutation mechanism like the CT bias, but also to discover new optimal entropic properties of the mutation process in the sense that the virus mutation mechanism track closely theoretically computable lower bounds for the entropy decrease and the information transfer.
Collapse
Affiliation(s)
- Marco Formentin
- Department of Mathematics Tullio Levi-Civita, University of Padova, via Trieste 63 35131 Padova, Italy
| | - Roberto Chignola
- Department of Biotechnology, University of Verona, Strada le Grazie 15-CV1, 37134 Verona, Italy
| | - Marco Favretti
- Department of Mathematics Tullio Levi-Civita, University of Padova, via Trieste 63 35131 Padova, Italy
| |
Collapse
|
8
|
Sarkar BK, Bhattacharya M, Agoramoorthy G, Dhama K, Chakraborty C. Entropy-Driven, Integrative Bioinformatics Approaches Reveal the Recent Transmission of the Monkeypox Virus from Nigeria to Multiple Non-African Countries. Mol Biotechnol 2023:10.1007/s12033-023-00889-7. [PMID: 37798393 DOI: 10.1007/s12033-023-00889-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Accepted: 09/06/2023] [Indexed: 10/07/2023]
Abstract
Monkeypox virus (mpox) has currently affected multiple countries around the globe. This study aims to analyze how the virus spread globally. The study uses entropy-driven bioinformatics in five directions to analyze the 60 full-length complete genomes of mpox. We analyzed the topological entropy distribution of the genomes, principal component analysis (PCA), the dissimilarity matrix, entropy-driven phylogenetics, and genome clustering. The topological entropy distribution showed genome positional entropy. We found five clusters of the mpox genomes through the two PCA, while the three PCA elucidated the clustering events in 3D space. The clustering of genomes was further confirmed through the dissimilarity matrix and phylogenetic analysis which showed the bigger size of Cluster 1 and size similarity between Clusters 2 and 4 as well as Clusters 3 and 5. It corroborated with the phylogenetics of the genomes, where Cluster 1 showed clear segregation from the other four clusters. Finally, the study concluded that the spreading of the mpox is likely to have originated from African countries to the rest of the non-African countries. Overall, the spreading and distribution of the mpox will shed light on its evolution and pathogenicity of the mpox and help to adopt preventive measures to stop the spreading of the virus.
Collapse
Affiliation(s)
- Bimal Kumar Sarkar
- Department of Physics, Adamas University, Kolkata, West Bengal, 700126, India
| | - Manojit Bhattacharya
- Department of Zoology, Fakir Mohan University, Vyasa Vihar, Balasore, 756020, Odisha, India
| | | | - Kuldeep Dhama
- Division of Pathology, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly, Uttar Pradesh, 243122, India.
| | - Chiranjib Chakraborty
- Department of Biotechnology, School of Life Science and Biotechnology, Adamas University, Kolkata, West Bengal, 700126, India.
| |
Collapse
|
9
|
Orlov YL, Orlova NG. Bioinformatics tools for the sequence complexity estimates. Biophys Rev 2023; 15:1367-1378. [PMID: 37974990 PMCID: PMC10643780 DOI: 10.1007/s12551-023-01140-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Accepted: 09/01/2023] [Indexed: 11/19/2023] Open
Abstract
We review current methods and bioinformatics tools for the text complexity estimates (information and entropy measures). The search DNA regions with extreme statistical characteristics such as low complexity regions are important for biophysical models of chromosome function and gene transcription regulation in genome scale. We discuss the complexity profiling for segmentation and delineation of genome sequences, search for genome repeats and transposable elements, and applications to next-generation sequencing reads. We review the complexity methods and new applications fields: analysis of mutation hotspots loci, analysis of short sequencing reads with quality control, and alignment-free genome comparisons. The algorithms implementing various numerical measures of text complexity estimates including combinatorial and linguistic measures have been developed before genome sequencing era. The series of tools to estimate sequence complexity use compression approaches, mainly by modification of Lempel-Ziv compression. Most of the tools are available online providing large-scale service for whole genome analysis. Novel machine learning applications for classification of complete genome sequences also include sequence compression and complexity algorithms. We present comparison of the complexity methods on the different sequence sets, the applications for gene transcription regulatory regions analysis. Furthermore, we discuss approaches and application of sequence complexity for proteins. The complexity measures for amino acid sequences could be calculated by the same entropy and compression-based algorithms. But the functional and evolutionary roles of low complexity regions in protein have specific features differing from DNA. The tools for protein sequence complexity aimed for protein structural constraints. It was shown that low complexity regions in protein sequences are conservative in evolution and have important biological and structural functions. Finally, we summarize recent findings in large scale genome complexity comparison and applications for coronavirus genome analysis.
Collapse
Affiliation(s)
- Yuriy L. Orlov
- The Digital Health Institute, I.M. Sechenov First Moscow State Medical University of the Russian Ministry of Health (Sechenov University), Moscow, 119991 Russia
- Institute of Cytology and Genetics SB RAS, 630090 Novosibirsk, Russia
- Agrarian and Technological Institute, Peoples’ Friendship University of Russia, 117198 Moscow, Russia
| | - Nina G. Orlova
- Department of Mathematics, Financial University under the Government of the Russian Federation, Moscow, 125167 Russia
| |
Collapse
|
10
|
Morales-Rubio R, Bernal-Ramírez J, Rubio-Infante N, Luévano-Martínez LA, Ríos A, Escalante BA, García-Rivas G, Rodríguez González J. Cellular shortening and calcium dynamics are improved by noisy stimulus in a model of cardiomyopathy. Sci Rep 2023; 13:14898. [PMID: 37689752 PMCID: PMC10492796 DOI: 10.1038/s41598-023-41611-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 08/29/2023] [Indexed: 09/11/2023] Open
Abstract
Noise is present in cell biology. The capability of cells to respond to noisy environment have become essential. This study aimed to investigate whether noise can enhance the contractile response and Ca2+ handling in cardiomyocytes from a cardiomyopathy model. Experiments were conducted in an experimental setup with Gaussian white noise, frequency, and amplitude control to stimulate myocytes. Cell shortening, maximal shortening velocity, time to peak shortening, and time to half relaxation variables were recorded to cell shortening. Ca2+ transient amplitude and raise rate variables were registered to measure Ca2+ transients. Our results for cell shortening, Ca2+ transient amplitude, and raise rate suggest that cell response improve when myocytes are noise stimulated. Also, cell shortening, maximal shortening velocity, Ca2+ transient amplitude, and raise improves in control cells. Altogether, these findings suggest novel characteristics in how cells improve their response in a noisy environment.
Collapse
Affiliation(s)
- Russell Morales-Rubio
- Centro de Investigación y de Estudios Avanzados del I.P.N-Unidad Monterrey, Vía del Conocimiento 201, Parque de Investigación e Innovación Tecnológica, 66600, Apodaca, NL, México
| | - Judith Bernal-Ramírez
- Tecnologico de Monterrey, Escuela de Medicina y Ciencias de la Salud, Cátedra de Cardiología, Hospital Zambrano Hellion, TecSalud, San Pedro Garza García, México
- The Institute for Obesity Research, Tecnologico de Monterrey, Monterrey, Mexico
| | - Nestor Rubio-Infante
- Tecnologico de Monterrey, Escuela de Medicina y Ciencias de la Salud, Cátedra de Cardiología, Hospital Zambrano Hellion, TecSalud, San Pedro Garza García, México
- The Institute for Obesity Research, Tecnologico de Monterrey, Monterrey, Mexico
| | - Luis A Luévano-Martínez
- Tecnologico de Monterrey, Escuela de Medicina y Ciencias de la Salud, Cátedra de Cardiología, Hospital Zambrano Hellion, TecSalud, San Pedro Garza García, México
- The Institute for Obesity Research, Tecnologico de Monterrey, Monterrey, Mexico
| | - Amelia Ríos
- Centro de Investigación y de Estudios Avanzados del I.P.N-Unidad Monterrey, Vía del Conocimiento 201, Parque de Investigación e Innovación Tecnológica, 66600, Apodaca, NL, México
| | - Bruno A Escalante
- Centro de Investigación y de Estudios Avanzados del I.P.N-Unidad Monterrey, Vía del Conocimiento 201, Parque de Investigación e Innovación Tecnológica, 66600, Apodaca, NL, México
| | - Gerardo García-Rivas
- Tecnologico de Monterrey, Escuela de Medicina y Ciencias de la Salud, Cátedra de Cardiología, Hospital Zambrano Hellion, TecSalud, San Pedro Garza García, México
- The Institute for Obesity Research, Tecnologico de Monterrey, Monterrey, Mexico
| | - Jesús Rodríguez González
- Centro de Investigación y de Estudios Avanzados del I.P.N-Unidad Monterrey, Vía del Conocimiento 201, Parque de Investigación e Innovación Tecnológica, 66600, Apodaca, NL, México.
| |
Collapse
|
11
|
Lainscsek X, Taher L. Predicting chromosomal compartments directly from the nucleotide sequence with DNA-DDA. Brief Bioinform 2023; 24:bbad198. [PMID: 37264486 PMCID: PMC10359093 DOI: 10.1093/bib/bbad198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 04/18/2023] [Accepted: 05/08/2023] [Indexed: 06/03/2023] Open
Abstract
Three-dimensional (3D) genome architecture is characterized by multi-scale patterns and plays an essential role in gene regulation. Chromatin conformation capturing experiments have revealed many properties underlying 3D genome architecture, such as the compartmentalization of chromatin based on transcriptional states. However, they are complex, costly and time consuming, and therefore only a limited number of cell types have been examined using these techniques. Increasing effort is being directed towards deriving computational methods that can predict chromatin conformation and associated structures. Here we present DNA-delay differential analysis (DDA), a purely sequence-based method based on chaos theory to predict genome-wide A and B compartments. We show that DNA-DDA models derived from a 20 Mb sequence are sufficient to predict genome wide compartmentalization at the scale of 100 kb in four different cell types. Although this is a proof-of-concept study, our method shows promise in elucidating the mechanisms responsible for genome folding as well as modeling the impact of genetic variation on 3D genome architecture and the processes regulated thereby.
Collapse
Affiliation(s)
- Xenia Lainscsek
- Institute of Biomedical Informatics, Graz University of Technology, Austria
| | - Leila Taher
- Institute of Biomedical Informatics, Graz University of Technology, Austria
| |
Collapse
|
12
|
Zhang W, Xiang X, Zhao B, Huang J, Yang L, Zeng Y. Identifying Cancer Driver Pathways Based on the Mouth Brooding Fish Algorithm. ENTROPY (BASEL, SWITZERLAND) 2023; 25:841. [PMID: 37372185 DOI: 10.3390/e25060841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 05/05/2023] [Accepted: 05/23/2023] [Indexed: 06/29/2023]
Abstract
Identifying the driver genes of cancer progression is of great significance in improving our understanding of the causes of cancer and promoting the development of personalized treatment. In this paper, we identify the driver genes at the pathway level via an existing intelligent optimization algorithm, named the Mouth Brooding Fish (MBF) algorithm. Many methods based on the maximum weight submatrix model to identify driver pathways attach equal importance to coverage and exclusivity and assign them equal weight, but those methods ignore the impact of mutational heterogeneity. Here, we use principal component analysis (PCA) to incorporate covariate data to reduce the complexity of the algorithm and construct a maximum weight submatrix model considering different weights of coverage and exclusivity. Using this strategy, the unfavorable effect of mutational heterogeneity is overcome to some extent. Data involving lung adenocarcinoma and glioblastoma multiforme were tested with this method and the results compared with the MDPFinder, Dendrix, and Mutex methods. When the driver pathway size was 10, the recognition accuracy of the MBF method reached 80% in both datasets, and the weight values of the submatrix were 1.7 and 1.89, respectively, which are better than those of the compared methods. At the same time, in the signal pathway enrichment analysis, the important role of the driver genes identified by our MBF method in the cancer signaling pathway is revealed, and the validity of these driver genes is demonstrated from the perspective of their biological effects.
Collapse
Affiliation(s)
- Wei Zhang
- College of Computer Science and Engineering, Changsha University, Changsha 410022, China
- Hunan Province Key Laboratory of Industrial Internet Technology and Security, Changsha University, Changsha 410022, China
| | - Xiaowen Xiang
- College of Computer Science and Engineering, Changsha University, Changsha 410022, China
| | - Bihai Zhao
- College of Computer Science and Engineering, Changsha University, Changsha 410022, China
- Hunan Province Key Laboratory of Industrial Internet Technology and Security, Changsha University, Changsha 410022, China
| | - Jianlin Huang
- College of Computer Science and Engineering, Changsha University, Changsha 410022, China
| | - Lan Yang
- College of Computer Science and Engineering, Changsha University, Changsha 410022, China
| | - Yifu Zeng
- College of Computer Science and Engineering, Changsha University, Changsha 410022, China
- Hunan Province Key Laboratory of Industrial Internet Technology and Security, Changsha University, Changsha 410022, China
| |
Collapse
|
13
|
Yin F, Zhang H, Qi A, Zhu Z, Yang L, Wen G, Xie W. An exploratory study of CT radiomics using differential network feature selection for WHO/ISUP grading and progression-free survival prediction of clear cell renal cell carcinoma. Front Oncol 2022; 12:979613. [DOI: 10.3389/fonc.2022.979613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 10/11/2022] [Indexed: 11/13/2022] Open
Abstract
ObjectivesTo explore the feasibility of predicting the World Health Organization/International Society of Urological Pathology (WHO/ISUP) grade and progression-free survival (PFS) of clear cell renal cell cancer (ccRCC) using the radiomics features (RFs) based on the differential network feature selection (FS) method using the maximum-entropy probability model (MEPM).Methods175 ccRCC patients were divided into a training set (125) and a test set (50). The non-contrast phase (NCP), cortico-medullary phase, nephrographic phase, excretory phase phases, and all-phase WHO/ISUP grade prediction models were constructed based on a new differential network FS method using the MEPM. The diagnostic performance of the best phase model was compared with the other state-of-the-art machine learning models and the clinical models. The RFs of the best phase model were used for survival analysis and visualized using risk scores and nomograms. The performance of the above models was tested in both cross-validated and independent validation and checked by the Hosmer-Lemeshow test.ResultsThe NCP RFs model was the best phase model, with an AUC of 0.89 in the test set, and performed superior to other machine learning models and the clinical models (all p <0.05). Kaplan-Meier survival analysis, univariate and multivariate cox regression results, and risk score analyses showed the NCP RFs could predict PFS well (almost all p < 0.05). The nomogram model incorporated the best two RFs and showed good discrimination, a C-index of 0.71 and 0.69 in the training and test set, and good calibration.ConclusionThe NCP CT-based RFs selected by differential network FS could predict the WHO/ISUP grade and PFS of RCC.
Collapse
|
14
|
Das P, Shen T, McCord RP. Characterizing the variation in chromosome structure ensembles in the context of the nuclear microenvironment. PLoS Comput Biol 2022; 18:e1010392. [PMID: 35969616 PMCID: PMC9410561 DOI: 10.1371/journal.pcbi.1010392] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Revised: 08/25/2022] [Accepted: 07/15/2022] [Indexed: 11/23/2022] Open
Abstract
Inside the nucleus, chromosomes are subjected to direct physical interaction between different components, active forces, and thermal noise, leading to the formation of an ensemble of three-dimensional structures. However, it is still not well understood to what extent and how the structural ensemble varies from one chromosome region or cell-type to another. We designed a statistical analysis technique and applied it to single-cell chromosome imaging data to reveal the heterogeneity of individual chromosome structures. By analyzing the resulting structural landscape, we find that the largest dynamic variation is the overall radius of gyration of the chromatin region, followed by domain reorganization within the region. By comparing different human cell-lines and experimental perturbation data using this statistical analysis technique and a network-based similarity quantification approach, we identify both cell-type and condition-specific features of the structural landscapes. We identify a relationship between epigenetic state and the properties of chromosome structure fluctuation and validate this relationship through polymer simulations. Overall, our study suggests that the types of variation in a chromosome structure ensemble are cell-type as well as region-specific and can be attributed to constraints placed on the structure by factors such as variation in epigenetic state. Recent work has revealed principles of how chromosomes are folded and structured inside the human nucleus. It is now even possible to microscopically trace the path of chromosomes in 3D in individual cells. With this data, we can start to examine how much variation exists in chromosome structure and what biological factors may restrict or enhance this variation. Are chromosomes stuck in just a few possible positions or do they move around more freely, sampling many configurations? Here, we use a mathematical approach to compare chromosome structure variation in different cell types, at different locations along the genome, and when key structural proteins are removed. Through these comparisons and dynamic simulations of chromosome behavior, we identify factors that may constrain or promote variation in chromosome structure.
Collapse
Affiliation(s)
- Priyojit Das
- UT-ORNL Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, Tennessee, United States of America
| | - Tongye Shen
- UT-ORNL Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, Tennessee, United States of America
- Biochemistry & Cellular and Molecular Biology, University of Tennessee, Knoxville, Tennessee, United States of America
| | - Rachel Patton McCord
- UT-ORNL Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, Tennessee, United States of America
- Biochemistry & Cellular and Molecular Biology, University of Tennessee, Knoxville, Tennessee, United States of America
- * E-mail:
| |
Collapse
|
15
|
Progress in and Opportunities for Applying Information Theory to Computational Biology and Bioinformatics. ENTROPY 2022; 24:e24070925. [PMID: 35885148 PMCID: PMC9323281 DOI: 10.3390/e24070925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 06/27/2022] [Accepted: 06/30/2022] [Indexed: 11/25/2022]
|
16
|
Zandavi SM, Koch FC, Vijayan A, Zanini F, Mora F, Ortega D, Vafaee F. Disentangling single-cell omics representation with a power spectral density-based feature extraction. Nucleic Acids Res 2022; 50:5482-5492. [PMID: 35639509 PMCID: PMC9178020 DOI: 10.1093/nar/gkac436] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Revised: 04/26/2022] [Accepted: 05/10/2022] [Indexed: 12/13/2022] Open
Abstract
Emerging single-cell technologies provide high-resolution measurements of distinct cellular modalities opening new avenues for generating detailed cellular atlases of many and diverse tissues. The high dimensionality, sparsity, and inaccuracy of single cell sequencing measurements, however, can obscure discriminatory information, mask cellular subtype variations and complicate downstream analyses which can limit our understanding of cell function and tissue heterogeneity. Here, we present a novel pre-processing method (scPSD) inspired by power spectral density analysis that enhances the accuracy for cell subtype separation from large-scale single-cell omics data. We comprehensively benchmarked our method on a wide range of single-cell RNA-sequencing datasets and showed that scPSD pre-processing, while being fast and scalable, significantly reduces data complexity, enhances cell-type separation, and enables rare cell identification. Additionally, we applied scPSD to transcriptomics and chromatin accessibility cell atlases and demonstrated its capacity to discriminate over 100 cell types across the whole organism and across different modalities of single-cell omics data.
Collapse
Affiliation(s)
- Seid Miad Zandavi
- School of Biotechnology and Biomolecular Sciences, University of New South Wales (UNSW Sydney), Australia
- Programs in Metabolism and Medical & Population Genetics, Broad Institute, Cambridge, MA, USA
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Forrest C Koch
- School of Biotechnology and Biomolecular Sciences, University of New South Wales (UNSW Sydney), Australia
| | - Abhishek Vijayan
- School of Biotechnology and Biomolecular Sciences, University of New South Wales (UNSW Sydney), Australia
| | - Fabio Zanini
- Prince of Wales Clinical School, UNSW Sydney, Australia
- Cellular Genomics Future Institute, UNSW Sydney, Australia
| | - Fatima Valdes Mora
- Children's Cancer Institute, Lowy Cancer Research Centre, UNSW Sydney, Australia
- School of Women's and Children's Health, Faculty of Medicine, UNSW, Sydney, Australia
| | - David Gallego Ortega
- School of Biomedical Engineering, University of Technology Sydney (UTS), Australia
| | - Fatemeh Vafaee
- School of Biotechnology and Biomolecular Sciences, University of New South Wales (UNSW Sydney), Australia
- Cellular Genomics Future Institute, UNSW Sydney, Australia
- UNSW Data Science Hub (uDASH), UNSW Sydney, Australia
| |
Collapse
|
17
|
Yee J, Park T, Park M. Identification of the associations between genes and quantitative traits using entropy-based kernel density estimation. Genomics Inform 2022; 20:e17. [PMID: 35794697 PMCID: PMC9299569 DOI: 10.5808/gi.22033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 06/15/2022] [Indexed: 11/20/2022] Open
Abstract
Genetic associations have been quantified using a number of statistical measures. Entropy-based mutual information may be one of the more direct ways of estimating the association, in the sense that it does not depend on the parametrization. For this purpose, both the entropy and conditional entropy of the phenotype distribution should be obtained. Quantitative traits, however, do not usually allow an exact evaluation of entropy. The estimation of entropy needs a probability density function, which can be approximated by kernel density estimation. We have investigated the proper sequence of procedures for combining the kernel density estimation and entropy estimation with a probability density function in order to calculate mutual information. Genotypes and their interactions were constructed to set the conditions for conditional entropy. Extensive simulation data created using three types of generating functions were analyzed using two different kernels as well as two types of multifactor dimensionality reduction and another probability density approximation method called m-spacing. The statistical power in terms of correct detection rates was compared. Using kernels was found to be most useful when the trait distributions were more complex than simple normal or gamma distributions. A full-scale genomic dataset was explored to identify associations using the 2-h oral glucose tolerance test results and γ-glutamyl transpeptidase levels as phenotypes. Clearly distinguishable single-nucleotide polymorphisms (SNPs) and interacting SNP pairs associated with these phenotypes were found and listed with empirical p-values.
Collapse
Affiliation(s)
- Jaeyong Yee
- Department of Physiology and Biophysics, Eulji University, Daejeon 34824, Korea
| | - Taesung Park
- Department of Statistics, Seoul National University, Seoul 08826, Korea
| | - Mira Park
- Department of Preventive Medicine, Eulji University, Daejeon 34824, Korea
| |
Collapse
|
18
|
Posada-Reyes AB, Balderas-Martínez YI, Ávila-Ríos S, Vinuesa P, Fonseca-Coronado S. An Epistatic Network Describes oppA and glgB as Relevant Genes for Mycobacterium tuberculosis. Front Mol Biosci 2022; 9:856212. [PMID: 35712352 PMCID: PMC9194097 DOI: 10.3389/fmolb.2022.856212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2022] [Accepted: 03/11/2022] [Indexed: 11/18/2022] Open
Abstract
Mycobacterium tuberculosis is an acid-fast bacterium that causes tuberculosis worldwide. The role of epistatic interactions among different loci of the M. tuberculosis genome under selective pressure may be crucial for understanding the disease and the molecular basis of antibiotic resistance acquisition. Here, we analyzed polymorphic loci interactions by applying a model-free method for epistasis detection, SpydrPick, on a pan–genome-wide alignment created from a set of 254 complete reference genomes. By means of the analysis of an epistatic network created with the detected epistatic interactions, we found that glgB (α-1,4-glucan branching enzyme) and oppA (oligopeptide-binding protein) are putative targets of co-selection in M. tuberculosis as they were associated in the network with M. tuberculosis genes related to virulence, pathogenesis, transport system modulators of the immune response, and antibiotic resistance. In addition, our work unveiled potential pharmacological applications for genotypic antibiotic resistance inherent to the mutations of glgB and oppA as they epistatically interact with fprA and embC, two genes recently included as antibiotic-resistant genes in the catalog of the World Health Organization. Our findings showed that this approach allows the identification of relevant epistatic interactions that may lead to a better understanding of M. tuberculosis by deciphering the complex interactions of molecules involved in its metabolism, virulence, and pathogenesis and that may be applied to different bacterial populations.
Collapse
Affiliation(s)
- Ali-Berenice Posada-Reyes
- Posgrado en Ciencias Biológicas, UNAM, Mexico, Mexico
- Facultad de Estudios Superiores Cuautitlán, UNAM, Estado de Mexico, Mexico
- *Correspondence: Ali-Berenice Posada-Reyes, ; Salvador Fonseca-Coronado,
| | | | - Santiago Ávila-Ríos
- Instituto Nacional de Enfermedades Respiratorias “Ismael Cosio Villegas”, Ciudad de Mexico, Mexico
| | - Pablo Vinuesa
- Centro de Ciencias Genómicas, UNAM, Cuernavaca, Mexico
| | - Salvador Fonseca-Coronado
- Facultad de Estudios Superiores Cuautitlán, UNAM, Estado de Mexico, Mexico
- *Correspondence: Ali-Berenice Posada-Reyes, ; Salvador Fonseca-Coronado,
| |
Collapse
|
19
|
Cerulo L, Pagnotta SM. massiveGST: A Mann-Whitney-Wilcoxon Gene-Set Test Tool That Gives Meaning to Gene-Set Enrichment Analysis. ENTROPY 2022; 24:e24050739. [PMID: 35626622 PMCID: PMC9140214 DOI: 10.3390/e24050739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 05/16/2022] [Accepted: 05/19/2022] [Indexed: 01/27/2023]
Abstract
Gene-set enrichment analysis is the key methodology for obtaining biological information from transcriptomic space’s statistical result. Since its introduction, Gene-set Enrichment analysis methods have obtained more reliable results and a wider range of application. Great attention has been devoted to global tests, in contrast to competitive methods that have been largely ignored, although they appear more flexible because they are independent from the source of gene-profiles. We analyzed the properties of the Mann–Whitney–Wilcoxon test, a competitive method, and adapted its interpretation in the context of enrichment analysis by introducing a Normalized Enrichment Score that summarize two interpretations: a probability estimate and a location index. Two implementations are presented and compared with relevant literature methods: an R package and an online web tool. Both allow for obtaining tabular and graphical results with attention to reproducible research.
Collapse
Affiliation(s)
- Luigi Cerulo
- Department of Science and Technology, Università degli Studi del Sannio, 82100 Benevento, Italy;
- Bioinformatics Lab, Biogem, Molecular Biology and Genetics Research Institute, 83031 Ariano Irpino, Italy
| | - Stefano Maria Pagnotta
- Department of Science and Technology, Università degli Studi del Sannio, 82100 Benevento, Italy;
- Correspondence:
| |
Collapse
|
20
|
Casagrande A, Fabris F, Girometti R. Fifty years of Shannon information theory in assessing the accuracy and agreement of diagnostic tests. Med Biol Eng Comput 2022; 60:941-955. [PMID: 35195818 PMCID: PMC8863911 DOI: 10.1007/s11517-021-02494-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Accepted: 12/17/2021] [Indexed: 11/28/2022]
Abstract
Since 1948, Shannon theoretic methods for modeling information have found a wide range of applications in several areas where information plays a key role, which goes well beyond the original scopes for which they have been conceived, namely data compression and error correction over a noisy channel. Among other uses, these methods have been applied in the broad field of medical diagnostics since the 1970s, to quantify diagnostic information, to evaluate diagnostic test performance, but also to be used as technical tools in image processing and registration. This review illustrates the main contributions in assessing the accuracy of diagnostic tests and the agreement between raters, focusing on diagnostic test performance measurements and paired agreement evaluation. This work also presents a recent unified, coherent, and hopefully, final information-theoretical approach to deal with the flows of information involved among the patient, the diagnostic test performed to appraise the state of disease, and the raters who are checking the test results. The approach is assessed by considering two case studies: the first one is related to evaluating extra-prostatic cancers; the second concerns the quality of rapid tests for COVID-19 detection.
Collapse
Affiliation(s)
- Alberto Casagrande
- Dipartimento di Matematica e Geoscienze, Università degli Studi di Trieste, Trieste, Italy
| | - Francesco Fabris
- Dipartimento di Matematica e Geoscienze, Università degli Studi di Trieste, Trieste, Italy
| | - Rossano Girometti
- Istituto di Radiologia, Dipartimento di Area Medica, Università degli Studi di Udine, Ospedale S. Maria della Misericordia, Udine, Italy
| |
Collapse
|
21
|
Bohnsack KS, Kaden M, Abel J, Saralajew S, Villmann T. The Resolved Mutual Information Function as a Structural Fingerprint of Biomolecular Sequences for Interpretable Machine Learning Classifiers. ENTROPY (BASEL, SWITZERLAND) 2021; 23:1357. [PMID: 34682081 PMCID: PMC8534762 DOI: 10.3390/e23101357] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/19/2021] [Revised: 10/11/2021] [Accepted: 10/14/2021] [Indexed: 11/16/2022]
Abstract
In the present article we propose the application of variants of the mutual information function as characteristic fingerprints of biomolecular sequences for classification analysis. In particular, we consider the resolved mutual information functions based on Shannon-, Rényi-, and Tsallis-entropy. In combination with interpretable machine learning classifier models based on generalized learning vector quantization, a powerful methodology for sequence classification is achieved which allows substantial knowledge extraction in addition to the high classification ability due to the model-inherent robustness. Any potential (slightly) inferior performance of the used classifier is compensated by the additional knowledge provided by interpretable models. This knowledge may assist the user in the analysis and understanding of the used data and considered task. After theoretical justification of the concepts, we demonstrate the approach for various example data sets covering different areas in biomolecular sequence analysis.
Collapse
Affiliation(s)
- Katrin Sophie Bohnsack
- Saxon Institute for Computational Intelligence and Machine Learning, University of Applied Sciences Mittweida, 09648 Mittweida, Germany; (M.K.); (J.A.)
| | - Marika Kaden
- Saxon Institute for Computational Intelligence and Machine Learning, University of Applied Sciences Mittweida, 09648 Mittweida, Germany; (M.K.); (J.A.)
| | - Julia Abel
- Saxon Institute for Computational Intelligence and Machine Learning, University of Applied Sciences Mittweida, 09648 Mittweida, Germany; (M.K.); (J.A.)
| | - Sascha Saralajew
- Bosch Center for Artificial Intelligence, 71272 Renningen, Germany;
| | - Thomas Villmann
- Saxon Institute for Computational Intelligence and Machine Learning, University of Applied Sciences Mittweida, 09648 Mittweida, Germany; (M.K.); (J.A.)
| |
Collapse
|
22
|
Ricci L, Perinelli A, Castelluzzo M. Estimating the variance of Shannon entropy. Phys Rev E 2021; 104:024220. [PMID: 34525589 DOI: 10.1103/physreve.104.024220] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Accepted: 08/10/2021] [Indexed: 11/07/2022]
Abstract
The statistical analysis of data stemming from dynamical systems, including, but not limited to, time series, routinely relies on the estimation of information theoretical quantities, most notably Shannon entropy. To this purpose, possibly the most widespread tool is provided by the so-called plug-in estimator, whose statistical properties in terms of bias and variance were investigated since the first decade after the publication of Shannon's seminal works. In the case of an underlying multinomial distribution, while the bias can be evaluated by knowing support and data set size, variance is far more elusive. The aim of the present work is to investigate, in the multinomial case, the statistical properties of an estimator of a parameter that describes the variance of the plug-in estimator of Shannon entropy. We then exactly determine the probability distributions that maximize that parameter. The results presented here allow one to set upper limits to the uncertainty of entropy assessments under the hypothesis of memoryless underlying stochastic processes.
Collapse
Affiliation(s)
- Leonardo Ricci
- Department of Physics, University of Trento, 38123 Trento, Italy.,CIMeC, Center for Mind/Brain Sciences, University of Trento, 38068 Rovereto, Italy
| | - Alessio Perinelli
- CIMeC, Center for Mind/Brain Sciences, University of Trento, 38068 Rovereto, Italy
| | | |
Collapse
|
23
|
An Information-theoretic approach to dimensionality reduction in data science. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS 2021. [DOI: 10.1007/s41060-021-00272-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
24
|
Makarieva AM, Nefiodov AV, Li BL. Life's Energy and Information: Contrasting Evolution of Volume- versus Surface-Specific Rates of Energy Consumption. ENTROPY (BASEL, SWITZERLAND) 2020; 22:E1025. [PMID: 33286794 PMCID: PMC7597118 DOI: 10.3390/e22091025] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Revised: 08/12/2020] [Accepted: 08/12/2020] [Indexed: 12/02/2022]
Abstract
As humanity struggles to find a path to resilience amidst global change vagaries, understanding organizing principles of living systems as the pillar for human existence is rapidly growing in importance. However, finding quantitative definitions for order, complexity, information and functionality of living systems remains a challenge. Here, we review and develop insights into this problem from the concept of the biotic regulation of the environment developed by Victor Gorshkov (1935-2019). Life's extraordinary persistence-despite being a strongly non-equilibrium process-requires a quantum-classical duality: the program of life is written in molecules and thus can be copied without information loss, while life's interaction with its non-equilibrium environment is performed by macroscopic classical objects (living individuals) that age. Life's key energetic parameter, the volume-specific rate of energy consumption, is maintained within universal limits by most life forms. Contrary to previous suggestions, it cannot serve as a proxy for "evolutionary progress". In contrast, ecosystem-level surface-specific energy consumption declines with growing animal body size in stable ecosystems. High consumption by big animals is associated with instability. We suggest that the evolutionary increase in body size may represent a spontaneous loss of information about environmental regulation, a manifestation of life's algorithm ageing as a whole.
Collapse
Affiliation(s)
- Anastassia M. Makarieva
- Theoretical Physics Division, Petersburg Nuclear Physics Institute, Gatchina 188300, Russia
- USDA-China MOST Joint Research Center for AgroEcology and Sustainability, University of California, Riverside, CA 92521-0124, USA
| | - Andrei V. Nefiodov
- Theoretical Physics Division, Petersburg Nuclear Physics Institute, Gatchina 188300, Russia
| | - Bai-Lian Li
- USDA-China MOST Joint Research Center for AgroEcology and Sustainability, University of California, Riverside, CA 92521-0124, USA
| |
Collapse
|