26
|
Chen RM. Whether County Lockdown Could Deter the Contagion of COVID-19 in the USA. Risk Manag Healthc Policy 2021; 14:2665-2673. [PMID: 34194248 PMCID: PMC8236672 DOI: 10.2147/rmhp.s314750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 05/23/2021] [Indexed: 11/26/2022] Open
Abstract
Aim Whether to lock down a country or not during COVID-19 pandemic becomes a vital issue, since it affects people’s daily life. The objective of this research is to design a measurement that could be utilised to predict the efficacy of a lockdown decision. Methods One would expect that the effectiveness of lockdown lies in the assumption that the virus spreads from one area to another area in a rippling way. If the virus spreads in a radiating way, then lockdown should be an effective countermeasure to contain the pandemic. On the other hand, if it spreads indiscernibly or randomly, then a lockdown decision would have lesser or little effect on the containment. We mainly combine graphs and metric to compute correlation matrices, which would measure whether the virus spreads in a rippling way. The metric used is to measure the boundary (or county) distances between counties. We take 3073 counties and equivalents in the USA and explore the property of contagion with respect to distance. The distance between any two counties is measured by the number of neighbours (or counties) between them. Then, we study the relation between contagion and distances. The relation between distance (complexity of neighbouring) and confirmed cases (contagion) is further explored. Results Then, we study the relation between contagion and distances. The relation between distance (complexity of neighbouring) and confirmed cases (contagion) could be explored. Our research shows county lockdown in the USA plays no important role in containing the spread of coronavirus for the time being. Conclusion Rippling effect in the USA regarding COVID-19 is not significant. This indicates other robust approaches or policies should be taken into consideration, rather than a simple lockdown policy.
Collapse
|
27
|
Dabrowski-Tumanski P, Rubach P, Niemyska W, Gren BA, Sulkowska JI. Topoly: Python package to analyze topology of polymers. Brief Bioinform 2021; 22:bbaa196. [PMID: 32935829 PMCID: PMC8138882 DOI: 10.1093/bib/bbaa196] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Revised: 07/15/2020] [Accepted: 07/29/2020] [Indexed: 12/27/2022] Open
Abstract
The increasing role of topology in (bio)physical properties of matter creates a need for an efficient method of detecting the topology of a (bio)polymer. However, the existing tools allow one to classify only the simplest knots and cannot be used in automated sample analysis. To answer this need, we created the Topoly Python package. This package enables the distinguishing of knots, slipknots, links and spatial graphs through the calculation of different topological polynomial invariants. It also enables one to create the minimal spanning surface on a given loop, e.g. to detect a lasso motif or to generate random closed polymers. It is capable of reading various file formats, including PDB. The extensive documentation along with test cases and the simplicity of the Python programming language make it a very simple to use yet powerful tool, suitable even for inexperienced users. Topoly can be obtained from https://topoly.cent.uw.edu.pl.
Collapse
|
28
|
Son H, Pham VT, Jang Y, Kim SE. Damage Localization and Severity Assessment of a Cable-Stayed Bridge Using a Message Passing Neural Network. SENSORS 2021; 21:s21093118. [PMID: 33946232 PMCID: PMC8125630 DOI: 10.3390/s21093118] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/21/2021] [Revised: 04/18/2021] [Accepted: 04/26/2021] [Indexed: 11/27/2022]
Abstract
Cable-stayed bridges are damaged by multiple factors such as natural disasters, weather, and vehicle load. In particular, if the stayed cable, which is an essential and weak component of the cable-stayed bridge, is damaged, it may adversely affect the adjacent cables and worsen the bridge structure condition. Therefore, we must accurately determine the condition of the cable with a technology-based evaluation strategy. In this paper, we propose a deep learning model that allows us to locate the damaged cable and estimate its cross-sectional area. To obtain the data required for the deep learning training, we use the tension data of the reduced area cable, which are simulated in the Practical Advanced Analysis Program (PAAP), a robust structural analysis program. We represent the sensor data of the damaged cable-stayed bridge as a graph composed of vertices and edges using tension and spatial information of the sensors. We apply the sensor geometry by mapping the tension data to the graph vertices and the connection relationship between sensors to the graph edges. We employ a Graph Neural Network (GNN) to use the graph representation of the sensor data directly. GNN, which has been actively studied recently, can treat graph-structured data with the most advanced performance. We train the GNN framework, the Message Passing Neural Network (MPNN), to perform two tasks to identify damaged cables and estimate the cable areas. We adopt a multi-task learning method for more efficient optimization. We show that the proposed technique achieves high performance with the cable-stayed bridge data generated from PAAP.
Collapse
|
29
|
Ji X, Ferreira T, Friedman B, Liu R, Liechty H, Bas E, Chandrashekar J, Kleinfeld D. Brain microvasculature has a common topology with local differences in geometry that match metabolic load. Neuron 2021; 109:1168-1187.e13. [PMID: 33657412 PMCID: PMC8525211 DOI: 10.1016/j.neuron.2021.02.006] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 12/09/2020] [Accepted: 02/03/2021] [Indexed: 01/03/2023]
Abstract
The microvasculature underlies the supply networks that support neuronal activity within heterogeneous brain regions. What are common versus heterogeneous aspects of the connectivity, density, and orientation of capillary networks? To address this, we imaged, reconstructed, and analyzed the microvasculature connectome in whole adult mice brains with sub-micrometer resolution. Graph analysis revealed common network topology across the brain that leads to a shared structural robustness against the rarefaction of vessels. Geometrical analysis, based on anatomically accurate reconstructions, uncovered a scaling law that links length density, i.e., the length of vessel per volume, with tissue-to-vessel distances. We then derive a formula that connects regional differences in metabolism to differences in length density and, further, predicts a common value of maximum tissue oxygen tension across the brain. Last, the orientation of capillaries is weakly anisotropic with the exception of a few strongly anisotropic regions; this variation can impact the interpretation of fMRI data.
Collapse
|
30
|
Ma Y, Li Q, Hu N, Li L. SeBio Graph: Semi-supervised Deep Learning for the Graph via Sustainable Knowledge Transfer. Front Neurorobot 2021; 15:665055. [PMID: 33867966 PMCID: PMC8047129 DOI: 10.3389/fnbot.2021.665055] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Accepted: 03/09/2021] [Indexed: 11/17/2022] Open
Abstract
Semi-supervised deep learning for the biomedical graph and advanced manufacturing graph is rapidly becoming an important topic in both academia and industry. Many existing types of research focus on semi-supervised link prediction and node classification, as well as the application of these methods in sustainable development and advanced manufacturing. To date, most manufacturing graph neural networks are mainly evaluated on social and information networks, which improve the quality of network representation y integrating neighbor node descriptions. However, previous methods have not yet been comprehensively studied on biomedical networks. Traditional techniques fail to achieve satisfying results, especially when labeled nodes are deficient in number. In this paper, a new semi-supervised deep learning method for the biomedical graph via sustainable knowledge transfer called SeBioGraph is proposed. In SeBioGraph, both node embedding and graph-specific prototype embedding are utilized as transferable metric space characterized. By incorporating prior knowledge learned from auxiliary graphs, SeBioGraph further promotes the performance of the target graph. Experimental results on the two-class node classification tasks and three-class link prediction tasks demonstrate that the SeBioGraph realizes state-of-the-art results. Finally, the method is thoroughly evaluated.
Collapse
|
31
|
Shin DY, Hussain S, Afzal F, Park C, Afzal D, Farahani MR. Closed Formulas for Some New Degree Based Topological Descriptors Using M-polynomial and Boron Triangular Nanotube. Front Chem 2021; 8:613873. [PMID: 33614594 PMCID: PMC7886695 DOI: 10.3389/fchem.2020.613873] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Accepted: 12/09/2020] [Indexed: 11/18/2022] Open
Abstract
In this article, we provide new formulas to compute the reduced reciprocal randić index, Arithmetic geometric1 index, SK index, SK 1 index, SK 2 index, edge version of the first zagreb index, sum connectivity index, general sum connectivity index, and the forgotten index using the M-polynomial and finding these topological indices for a boron triangular nanotube. We also elaborate the results with graphical representations.
Collapse
|
32
|
Mahmoud H, Chulahwat A. Assessing wildland-urban interface fire risk. ROYAL SOCIETY OPEN SCIENCE 2020; 7:201183. [PMID: 32968539 PMCID: PMC7481699 DOI: 10.1098/rsos.201183] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/02/2020] [Accepted: 07/27/2020] [Indexed: 06/11/2023]
Abstract
Recent wildfire events, in the United States (USA) and around the world, have resulted in thousands of homes destroyed and many lives lost, leaving communities and policy makers, once again, with the question as to how to manage wildfire risk. This is particularly important given the prevalent trend of increased fire frequency and intensity. Current approaches to managing wildfires focus on fire suppression and managing fuel build-up in wildlands. However, reliance on these strategies alone has clearly proven inadequate. As such, focus should be shifted towards minimizing potential losses to communities. Achieving this goal, however, requires detailed understanding of the factors that contribute to community vulnerability and the interplay between probability of ignition, vulnerability and calculated risk. In this study, we evaluate wildfire risk for four different communities across the USA for the duration of May to September to communicate a different perspective of risk assessment. We show, for the first time, that community risk is closely related to wind speed and direction, pattern of surrounding wildland vegetation, and buildings layout. The importance of the findings lies in the need for exploring unique viable solutions to reduce risk for every community independently as opposed to embracing a generalized approach as is currently the case.
Collapse
|
33
|
Turner SL, Karahalios A, Forbes AB, Taljaard M, Grimshaw JM, Korevaar E, Cheng AC, Bero L, McKenzie JE. Creating effective interrupted time series graphs: Review and recommendations. Res Synth Methods 2020; 12:106-117. [PMID: 32657532 PMCID: PMC7818488 DOI: 10.1002/jrsm.1435] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Revised: 05/18/2020] [Accepted: 07/09/2020] [Indexed: 11/30/2022]
Abstract
Introduction Interrupted Time Series (ITS) studies may be used to assess the impact of an interruption, such as an intervention or exposure. The data from such studies are particularly amenable to visual display and, when clearly depicted, can readily show the short‐ and long‐term impact of an interruption. Further, well‐constructed graphs allow data to be extracted using digitizing software, which can facilitate their inclusion in systematic reviews and meta‐analyses. Aim We provide recommendations for graphing ITS data, examine the properties of plots presented in ITS studies, and provide examples employing our recommendations. Methods and results Graphing recommendations from seminal data visualization resources were adapted for use with ITS studies. The adapted recommendations cover plotting of data points, trend lines, interruptions, additional lines and general graph components. We assessed whether 217 graphs from recently published (2013‐2017) ITS studies met our recommendations and found that 130 graphs (60%) had clearly distinct data points, 100 (46%) had trend lines, and 161 (74%) had a clearly defined interruption. Accurate data extraction (requiring distinct points that align with axis tick marks and labels that allow the points to be interpreted) was possible in only 72 (33%) graphs. Conclusion We found that many ITS graphs did not meet our recommendations and could be improved with simple changes. Our proposed recommendations aim to achieve greater standardization and improvement in the display of ITS data, and facilitate re‐use of the data in systematic reviews and meta‐analyses. Application of data visualization recommendations can improve quality of interrupted time series graphs. Well‐designed graphs accurately depict time series data, any impact of the interruption, and the results of the analysis. Well‐designed graphs facilitate data extraction for use in systematic reviews and reproducibility. An assessment of graphs included in interrupted time series studies (published between 2013 and 2017) found that graphs often do not meet core graphing recommendations.
Collapse
|
34
|
Zhang W, Wang Y. DEEP MULTIMODAL BRAIN NETWORK LEARNING FOR JOINT ANALYSIS OF STRUCTURAL MORPHOMETRY AND FUNCTIONAL CONNECTIVITY. PROCEEDINGS. IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING 2020; 2020. [PMID: 34012504 DOI: 10.1109/isbi45749.2020.9098624] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Learning from the multimodal brain imaging data attracts a large amount of attention in medical image analysis due to the proliferation of multimodal data collection. It is widely accepted that multimodal data can provide complementary information than mining from a single modality. However, unifying the image-based knowledge from the multimodal data is very challenging due to different image signals, resolution, data structure, etc.. In this study, we design a supervised deep model to jointly analyze brain morphometry and functional connectivity on the cortical surface and we name it deep multimodal brain network learning (DMBNL). Two graph-based kernels, i.e., geometry-aware surface kernel (GSK) and topology-aware network kernel (TNK), are proposed for processing the cortical surface morphometry and brain functional network. The vertex features on the cortical surface from GSK is pooled and feed into TNK as its initial regional features. In the end, the graph-level feature is computed for each individual and thus can be applied for classification tasks. We test our model on a large autism imaging dataset. The experimental results prove the effectiveness of our model.
Collapse
|
35
|
Sverchkov Y, Ho YH, Gasch A, Craven M. Context-Specific Nested Effects Models. J Comput Biol 2020; 27:403-417. [PMID: 32053004 DOI: 10.1089/cmb.2019.0459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Advances in systems biology have made clear the importance of network models for capturing knowledge about complex relationships in gene regulation, metabolism, and cellular signaling. A common approach to uncovering biological networks involves performing perturbations on elements of the network, such as gene knockdown experiments, and measuring how the perturbation affects some reporter of the process under study. In this article, we develop context-specific nested effects models (CSNEMs), an approach to inferring such networks that generalizes nested effects models (NEMs). The main contribution of this work is that CSNEMs explicitly model the participation of a gene in multiple contexts, meaning that a gene can appear in multiple places in the network. Biologically, the representation of regulators in multiple contexts may indicate that these regulators have distinct roles in different cellular compartments or cell cycle phases. We present an evaluation of the method on simulated data as well as on data from a study of the sodium chloride stress response in Saccharomyces cerevisiae.
Collapse
|
36
|
Information Management in Healthcare and Environment: Towards an Automatic System for Fake News Detection. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:ijerph17031066. [PMID: 32046238 PMCID: PMC7037767 DOI: 10.3390/ijerph17031066] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Revised: 01/31/2020] [Accepted: 02/04/2020] [Indexed: 12/04/2022]
Abstract
Comments and information appearing on the internet and on different social media sway opinion concerning potential remedies for diagnosing and curing diseases. In many cases, this has an impact on citizens’ health and affects medical professionals, who find themselves having to defend their diagnoses as well as the treatments they propose against ill-informed patients. The propagation of these opinions follows the same pattern as the dissemination of fake news about other important topics, such as the environment, via social media networks, which we use as a testing ground for checking our procedure. In this article, we present an algorithm to analyse the behaviour of users of Twitter, the most important social network with respect to this issue, as well as a dynamic knowledge graph construction method based on information gathered from Twitter and other open data sources such as web pages. To show our methodology, we present a concrete example of how the associated graph structure of the tweets related to World Environment Day 2019 is used to develop a heuristic analysis of the validity of the information. The proposed analytical scheme is based on the interaction between the computer tool—a database implemented with Neo4j—and the analyst, who must ask the right questions to the tool, allowing to follow the line of any doubtful data. We also show how this method can be used. We also present some methodological guidelines on how our system could allow, in the future, an automation of the procedures for the construction of an autonomous algorithm for the detection of false news on the internet related to health.
Collapse
|
37
|
Ambriola Oku AY, Zimeo Morais GA, Arantes Bueno AP, Fujita A, Sato JR. Potential Confounders in the Analysis of Brazilian Adolescent's Health: A Combination of Machine Learning and Graph Theory. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2019; 17:ijerph17010090. [PMID: 31877700 PMCID: PMC6981403 DOI: 10.3390/ijerph17010090] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Revised: 12/09/2019] [Accepted: 12/16/2019] [Indexed: 12/20/2022]
Abstract
The prevalence of health problems during childhood and adolescence is high in developing countries such as Brazil. Social inequality, violence, and malnutrition have strong impact on youth health. To better understand these issues we propose to combine machine-learning methods and graph analysis to build predictive networks applied to the Brazilian National Student Health Survey (PenSE 2015) data, a large dataset that consists of questionnaires filled by the students. By using a combination of gradient boosting machines and centrality hub metric, it was possible to identify potential confounders to be considered when conducting association analyses among variables. The variables were ranked according to their hub centrality to predict the other variables from a directed weighted-graph perspective. The top five ranked confounder variables were “gender”, “oral health care”, “intended education level”, and two variables associated with nutrition habits—“eat while watching TV” and “never eat fast-food”. In conclusion, although causal effects cannot be inferred from the data, we believe that the proposed approach might be a useful tool to obtain novel insights on the association between variables and to identify general factors related to health conditions.
Collapse
|
38
|
Ramos PIP, Arge LWP, Lima NCB, Fukutani KF, de Queiroz ATL. Leveraging User-Friendly Network Approaches to Extract Knowledge From High-Throughput Omics Datasets. Front Genet 2019; 10:1120. [PMID: 31798629 PMCID: PMC6863976 DOI: 10.3389/fgene.2019.01120] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 10/16/2019] [Indexed: 11/13/2022] Open
Abstract
Recent technological advances for the acquisition of multi-omics data have allowed an unprecedented understanding of the complex intricacies of biological systems. In parallel, a myriad of computational analysis techniques and bioinformatics tools have been developed, with many efforts directed towards the creation and interpretation of networks from this data. In this review, we begin by examining key network concepts and terminology. Then, computational tools that allow for their construction and analysis from high-throughput omics datasets are presented. We focus on the study of functional relationships such as co-expression, protein-protein interactions, and regulatory interactions that are particularly amenable to modeling using the framework of networks. We envisage that many potential users of these analytical strategies may not be completely literate in programming languages and code adaptation, and for this reason, emphasis is given to tools' user-friendliness, including plugins for the widely adopted Cytoscape software, an open-source, cross-platform tool for network analysis, visualization, and data integration.
Collapse
|
39
|
Damseh R, Pouliot P, Gagnon L, Sakadzic S, Boas D, Cheriet F, Lesage F. Automatic Graph-Based Modeling of Brain Microvessels Captured With Two-Photon Microscopy. IEEE J Biomed Health Inform 2019; 23:2551-2562. [PMID: 30507542 PMCID: PMC6546554 DOI: 10.1109/jbhi.2018.2884678] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Graph models of cerebral vasculature derived from two-photon microscopy have shown to be relevant to study brain microphysiology. Automatic graphing of these microvessels remain problematic due to the vascular network complexity and two-photon sensitivity limitations with depth. In this paper, we propose a fully automatic processing pipeline to address this issue. The modeling scheme consists of a fully-convolution neural network to segment microvessels, a three-dimensional surface model generator, and a geometry contraction algorithm to produce graphical models with a single connected component. Based on a quantitative assessment using NetMets metrics, at a tolerance of 60 μm, false negative and false positive geometric error 19 rates are 3.8% and 4.2%, respectively, whereas false nega- 20 tive and false positive topological error rates are 6.1% and 4.5%, respectively. Our qualitative evaluation confirms the efficiency of our scheme in generating useful and accurate graphical models.
Collapse
|
40
|
Li C, Liu H, Hu Q, Que J, Yao J. A Novel Computational Model for Predicting microRNA-Disease Associations Based on Heterogeneous Graph Convolutional Networks. Cells 2019; 8:cells8090977. [PMID: 31455028 PMCID: PMC6769654 DOI: 10.3390/cells8090977] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 08/22/2019] [Accepted: 08/23/2019] [Indexed: 01/13/2023] Open
Abstract
Identifying the interactions between disease and microRNA (miRNA) can accelerate drugs development, individualized diagnosis, and treatment for various human diseases. However, experimental methods are time-consuming and costly. So computational approaches to predict latent miRNA-disease interactions are eliciting increased attention. But most previous studies have mainly focused on designing complicated similarity-based methods to predict latent interactions between miRNAs and diseases. In this study, we propose a novel computational model, termed heterogeneous graph convolutional network for miRNA-disease associations (HGCNMDA), which is based on known human protein-protein interaction (PPI) and integrates four biological networks: miRNA-disease, miRNA-gene, disease-gene, and PPI network. HGCNMDA achieved reliable performance using leave-one-out cross-validation (LOOCV). HGCNMDA is then compared to three state-of-the-art algorithms based on five-fold cross-validation. HGCNMDA achieves an AUC of 0.9626 and an average precision of 0.9660, respectively, which is ahead of other competitive algorithms. We further analyze the top-10 unknown interactions between miRNA and disease. In summary, HGCNMDA is a useful computational model for predicting miRNA-disease interactions.
Collapse
|
41
|
Foffano D, Rossi L, Torsello A. You Can't See Me: Anonymizing Graphs Using the Szemerédi Regularity Lemma. Front Big Data 2019; 2:7. [PMID: 33693330 PMCID: PMC7931930 DOI: 10.3389/fdata.2019.00007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Accepted: 05/13/2019] [Indexed: 11/23/2022] Open
Abstract
Complex networks gathered from our online interactions provide a rich source of information that can be used to try to model and predict our behavior. While this has very tangible benefits that we have all grown accustomed to, there is a concrete privacy risk in sharing potentially sensitive data about ourselves and the people we interact with, especially when this data is publicly available online and unprotected from malicious attacks. k-anonymity is a technique aimed at reducing this risk by obfuscating the topological information of a graph that can be used to infer the nodes' identity. In this paper we propose a novel algorithm to enforce k-anonymity based on a well-known result in extremal graph theory, the Szemerédi regularity lemma. Given a graph, we start by computing a regular partition of its nodes. The Szemerédi regularity lemma ensures that such a partition exists and that the edges between the sets of nodes behave almost randomly. With this partition, we anonymize the graph by randomizing the edges within each set, obtaining a graph that is structurally similar to the original one yet the nodes within each set are structurally indistinguishable. We test the proposed approach on real-world networks extracted from Facebook. Our experimental results show that the proposed approach is able to anonymize a graph while retaining most of its structural information.
Collapse
|
42
|
Li C, Zhao J, Wang C, Yao Y. Protein Sequence Comparison and DNA-binding Protein Identification with Generalized PseAAC and Graphical Representation. Comb Chem High Throughput Screen 2019; 21:100-110. [PMID: 29380690 PMCID: PMC5930480 DOI: 10.2174/1386207321666180130100838] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Revised: 01/24/2018] [Accepted: 01/26/2018] [Indexed: 11/22/2022]
Abstract
AIM AND OBJECTIVE The rapid increase in the amount of protein sequence data available leads to an urgent need for novel computational algorithms to analyze and compare these sequences. This study is undertaken to develop an efficient computational approach for timely encoding protein sequences and extracting the hidden information. METHODS Based on two physicochemical properties of amino acids, a protein primary sequence was converted into a three-letter sequence, and then a graph without loops and multiple edges and its geometric line adjacency matrix were obtained. A generalized PseAAC (pseudo amino acid composition) model was thus constructed to characterize a protein sequence numerically. RESULTS By using the proposed mathematical descriptor of a protein sequence, similarity comparisons among β-globin proteins of 17 species and 72 spike proteins of coronaviruses were made, respectively. The resulting clusters agreed well with the established taxonomic groups. In addition, a generalized PseAAC based SVM (support vector machine) model was developed to identify DNA-binding proteins. Experiment results showed that our method performed better than DNAbinder, DNA-Prot, iDNA-Prot and enDNA-Prot by 3.29-10.44% in terms of ACC, 0.056-0.206 in terms of MCC, and 1.45-15.76% in terms of F1M. When the benchmark dataset was expanded with negative samples, the presented approach outperformed the four previous methods with improvement in the range of 2.49-19.12% in terms of ACC, 0.05-0.32 in terms of MCC, and 3.82- 33.85% in terms of F1M. CONCLUSION These results suggested that the generalized PseAAC model was very efficient for comparison and analysis of protein sequences, and very competitive in identifying DNA-binding proteins.
Collapse
|
43
|
Ponisio LC, de Valpine P, M'Gonigle LK, Kremen C. Proximity of restored hedgerows interacts with local floral diversity and species' traits to shape long-term pollinator metacommunity dynamics. Ecol Lett 2019; 22:1048-1060. [PMID: 30938483 DOI: 10.1111/ele.13257] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2018] [Revised: 09/28/2018] [Accepted: 02/22/2019] [Indexed: 01/09/2023]
Abstract
Disconnected habitat fragments are poor at supporting population and community persistence; restoration ecologists, therefore, advocate for the establishment of habitat networks across landscapes. Few empirical studies, however, have considered how networks of restored habitat patches affect metacommunity dynamics. Here, using a 10-year study on restored hedgerows and unrestored field margins within an intensive agricultural landscape, we integrate occupancy modelling with network theory to examine the interaction between local and landscape characteristics, habitat selection and dispersal in shaping pollinator metacommunity dynamics. We show that surrounding hedgerows and remnant habitat patches interact with the local floral diversity, bee diet breadth and bee body size to influence site occupancy, via colonisation and persistence dynamics. Florally diverse sites and generalist, small-bodied species are most important for maintaining metacommunity connectivity. By providing the first in-depth assessment of how a network of restored habitat influences long-term population dynamics, we confirm the conservation benefit of hedgerows for pollinator populations and demonstrate the importance of restoring and maintaining habitat networks within an inhospitable matrix.
Collapse
|
44
|
Abstract
Spectral graph clustering—clustering the vertices of a graph based on their spectral embedding—is of significant current interest, finding applications throughout the sciences. But as with clustering in general, what a particular methodology identifies as “clusters” is defined (explicitly, or, more often, implicitly) by the clustering algorithm itself. We provide a clear and concise demonstration of a “two-truths” phenomenon for spectral graph clustering in which the first step—spectral embedding—is either Laplacian spectral embedding, wherein one decomposes the normalized Laplacian of the adjacency matrix, or adjacency spectral embedding given by a decomposition of the adjacency matrix itself. The two resulting clustering methods identify fundamentally different (true and meaningful) structure. Clustering is concerned with coherently grouping observations without any explicit concept of true groupings. Spectral graph clustering—clustering the vertices of a graph based on their spectral embedding—is commonly approached via K-means (or, more generally, Gaussian mixture model) clustering composed with either Laplacian spectral embedding (LSE) or adjacency spectral embedding (ASE). Recent theoretical results provide deeper understanding of the problem and solutions and lead us to a “two-truths” LSE vs. ASE spectral graph clustering phenomenon convincingly illustrated here via a diffusion MRI connectome dataset: The different embedding methods yield different clustering results, with LSE capturing left hemisphere/right hemisphere affinity structure and ASE capturing gray matter/white matter core–periphery structure.
Collapse
|
45
|
Solo V, Poline JB, Lindquist MA, Simpson SL, Bowman FD, Chung MK, Cassidy B. Connectivity in fMRI: Blind Spots and Breakthroughs. IEEE TRANSACTIONS ON MEDICAL IMAGING 2018; 37:1537-1550. [PMID: 29969406 PMCID: PMC6291757 DOI: 10.1109/tmi.2018.2831261] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
In recent years, driven by scientific and clinical concerns, there has been an increased interest in the analysis of functional brain networks. The goal of these analyses is to better understand how brain regions interact, how this depends upon experimental conditions and behavioral measures and how anomalies (disease) can be recognized. In this paper, we provide, first, a brief review of some of the main existing methods of functional brain network analysis. But rather than compare them, as a traditional review would do, instead, we draw attention to their significant limitations and blind spots. Then, second, relevant experts, sketch a number of emerging methods, which can break through these limitations. In particular we discuss five such methods. The first two, stochastic block models and exponential random graph models, provide an inferential basis for network analysis lacking in the exploratory graph analysis methods. The other three addresses: network comparison via persistent homology, time-varying connectivity that distinguishes sample fluctuations from neural fluctuations, and network system identification that draws inferential strength from temporal autocorrelation.
Collapse
|
46
|
Lachmann M, Libby E. Epigenetic inheritance systems contribute to the evolution of a germline. Philos Trans R Soc Lond B Biol Sci 2017; 371:rstb.2015.0445. [PMID: 27431523 PMCID: PMC4958939 DOI: 10.1098/rstb.2015.0445] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/29/2016] [Indexed: 12/26/2022] Open
Abstract
Differentiation within multicellular organisms is controlled by epigenetic markers transmitted across cell division. The process of differentiation will modify these epigenetic markers so that information that one cell type possesses can be lost in the transition to another. Many of the systems that encode these markers also exist in unicellular organisms but do not control differentiation. Thus, during the evolution of multicellularity, epigenetic inheritance systems were probably exapted for their current use in differentiation. We show that the simultaneous use of an information carrier for differentiation and transmission across generations can lead to the evolution of cell types that do not directly contribute to the progeny of the organism and ergo a germ-soma distinction. This shows that an intrinsic instability during a transition from unicellularity to multicellularity may contribute to widespread evolution of a germline and its maintenance, a phenomenon also relevant to the evolution of eusociality. The difference in epigenetic information contents between different cell lines in a multicellular organism is also relevant for the full-success cloning of higher animals, as well as for the maintenance of single germlines over evolutionary timescales.This article is part of the themed issue 'The major synthetic evolutionary transitions'.
Collapse
|
47
|
He J, Liu YM, Tian JK, Liu XH. Some new sharp bounds for the spectral radius of a nonnegative matrix and its application. JOURNAL OF INEQUALITIES AND APPLICATIONS 2017; 2017:260. [PMID: 29104398 PMCID: PMC5648768 DOI: 10.1186/s13660-017-1536-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Accepted: 10/10/2017] [Indexed: 06/07/2023]
Abstract
In this paper, we give some new sharp upper and lower bounds for the spectral radius of a nonnegative irreducible matrix. Using these bounds, we obtain some new and improved bounds for the signless Laplacian spectral radius of a graph or a digraph.
Collapse
|
48
|
Abstract
Univariate analysis has the purpose to describe a single variable distribution in one sample. It is the first important step of every clinical trial. In this short review, we focus on this analysis, the methods that authors should use to report this type of data, information that they should not miss and mistakes that they must avoid.
Collapse
|
49
|
Geng S, Liu X, Biswal BB, Niu H. Effect of Resting-State fNIRS Scanning Duration on Functional Brain Connectivity and Graph Theory Metrics of Brain Network. Front Neurosci 2017; 11:392. [PMID: 28775676 PMCID: PMC5517460 DOI: 10.3389/fnins.2017.00392] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2016] [Accepted: 06/22/2017] [Indexed: 12/18/2022] Open
Abstract
As an emerging brain imaging technique, functional near infrared spectroscopy (fNIRS) has attracted widespread attention for advancing resting-state functional connectivity (FC) and graph theoretical analyses of brain networks. However, it remains largely unknown how the duration of the fNIRS signal scanning is related to stable and reproducible functional brain network features. To answer this question, we collected resting-state fNIRS signals (10-min duration, two runs) from 18 participants and then truncated the hemodynamic time series into 30-s time bins that ranged from 1 to 10 min. Measures of nodal efficiency, nodal betweenness, network local efficiency, global efficiency, and clustering coefficient were computed for each subject at each fNIRS signal acquisition duration. Analyses of the stability and between-run reproducibility were performed to identify optimal time length for each measure. We found that the FC, nodal efficiency and nodal betweenness stabilized and were reproducible after 1 min of fNIRS signal acquisition, whereas network clustering coefficient, local and global efficiencies stabilized after 1 min and were reproducible after 5 min of fNIRS signal acquisition for only local and global efficiencies. These quantitative results provide direct evidence regarding the choice of the resting-state fNIRS scanning duration for functional brain connectivity and topological metric stability of brain network connectivity.
Collapse
|
50
|
Biswas A, Ranjan D, Zubair M, Zeil S, Nasr KA, He J. An Effective Computational Method Incorporating Multiple Secondary Structure Predictions in Topology Determination for Cryo-EM Images. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:578-586. [PMID: 27008671 PMCID: PMC5071113 DOI: 10.1109/tcbb.2016.2543721] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
A key idea in de novo modeling of a medium-resolution density image obtained from cryo-electron microscopy is to compute the optimal mapping between the secondary structure traces observed in the density image and those predicted on the protein sequence. When secondary structures are not determined precisely, either from the image or from the amino acid sequence of the protein, the computational problem becomes more complex. We present an efficient method that addresses the secondary structure placement problem in presence of multiple secondary structure predictions and computes the optimal mapping. We tested the method using 12 simulated images from α-proteins and two Cryo-EM images of α-β proteins. We observed that the rank of the true topologies is consistently improved by using multiple secondary structure predictions instead of a single prediction. The results show that the algorithm is robust and works well even when errors/misses in the predicted secondary structures are present in the image or the sequence. The results also show that the algorithm is efficient and is able to handle proteins with as many as 33 helices.
Collapse
|