1
|
Barry T, Roeder K, Katsevich E. Exponential family measurement error models for single-cell CRISPR screens. Biostatistics 2024:kxae010. [PMID: 38649751 DOI: 10.1093/biostatistics/kxae010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 01/10/2024] [Accepted: 03/11/2024] [Indexed: 04/25/2024] Open
Abstract
CRISPR genome engineering and single-cell RNA sequencing have accelerated biological discovery. Single-cell CRISPR screens unite these two technologies, linking genetic perturbations in individual cells to changes in gene expression and illuminating regulatory networks underlying diseases. Despite their promise, single-cell CRISPR screens present considerable statistical challenges. We demonstrate through theoretical and real data analyses that a standard method for estimation and inference in single-cell CRISPR screens-"thresholded regression"-exhibits attenuation bias and a bias-variance tradeoff as a function of an intrinsic, challenging-to-select tuning parameter. To overcome these difficulties, we introduce GLM-EIV ("GLM-based errors-in-variables"), a new method for single-cell CRISPR screen analysis. GLM-EIV extends the classical errors-in-variables model to responses and noisy predictors that are exponential family-distributed and potentially impacted by the same set of confounding variables. We develop a computational infrastructure to deploy GLM-EIV across hundreds of processors on clouds (e.g. Microsoft Azure) and high-performance clusters. Leveraging this infrastructure, we apply GLM-EIV to analyze two recent, large-scale, single-cell CRISPR screen datasets, yielding several new insights.
Collapse
Affiliation(s)
- Timothy Barry
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Building 2 435, 655 Huntington Ave, Boston, MA 02115, United States
| | - Kathryn Roeder
- Department of Statistics and Data Science, Carnegie Mellon University, Baker Hall 228B, 4909 Frew St, Pittsburgh, PA 15213, United States
| | - Eugene Katsevich
- Department of Statistics and Data Science, University of Pennsylvania, Academic Research Building 311, 265 South 37th Street Philadelphia, PA 19104, United States
| |
Collapse
|
2
|
Yu I, Mori T, Matsuoka D, Surblys D, Sugita Y. SPANA: Spatial decomposition analysis for cellular-scale molecular dynamics simulations. J Comput Chem 2024; 45:498-505. [PMID: 37966727 DOI: 10.1002/jcc.27260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 10/31/2023] [Accepted: 11/02/2023] [Indexed: 11/16/2023]
Abstract
The rapid increase in computational power with the latest supercomputers has enabled atomistic molecular dynamics (MDs) simulations of biomolecules in biological membrane, cytoplasm, and other cellular environments. These environments often contain a million or more atoms to be simulated simultaneously. Therefore, their trajectory analyses involve heavy computations that can become a bottleneck in the computational studies. Spatial decomposition analysis (SPANA) is a set of analysis tools in the Generalized-Ensemble Simulation System (GENESIS) software package that can carry out MD trajectory analyses of large-scale biological simulations using multiple CPU cores in parallel. SPANA applies the spatial decomposition of a large biological system to distribute structural and dynamical analyses into individual CPU cores, which reduces the computational time and the memory size, significantly. SPANA opens new possibilities for detailed atomistic analyses of biomacromolecules as well as solvent water molecules, ions, and metabolites in MD simulation trajectories of very large biological systems containing more than millions of atoms in cellular environments.
Collapse
Affiliation(s)
- Isseki Yu
- Theoretical Molecular Science Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama, Japan
- Department of Bioinformatics, Maebashi Institute of Technology, Maebashi, Gunma, Japan
| | - Takaharu Mori
- Theoretical Molecular Science Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama, Japan
| | - Daisuke Matsuoka
- Theoretical Molecular Science Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama, Japan
| | - Donatas Surblys
- Theoretical Molecular Science Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama, Japan
| | - Yuji Sugita
- Theoretical Molecular Science Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama, Japan
- Laboratory for Biomolecular Function Simulation, RIKEN Center for Biosystems Dynamics Research, Kobe, Hyogo, Japan
- Computational Biophysics Research Team, RIKEN Center for Computational Science, Kobe, Hyogo, Japan
| |
Collapse
|
3
|
Olbrich M, Bartels L, Wohlers I. Sequencing technologies and hardware-accelerated parallel computing transform computational genomics research. Front Bioinform 2024; 4:1384497. [PMID: 38567256 PMCID: PMC10985184 DOI: 10.3389/fbinf.2024.1384497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 03/07/2024] [Indexed: 04/04/2024] Open
Affiliation(s)
- Michael Olbrich
- Center for Biotechnology, Khalifa University for Science and Technology, Abu Dhabi, United Arab Emirates
| | - Lennart Bartels
- Biomolecular Data Science in Pneumology, Research Center Borstel, Borstel, Germany
| | - Inken Wohlers
- Biomolecular Data Science in Pneumology, Research Center Borstel, Borstel, Germany
- University of Lübeck, Lübeck, Germany
| |
Collapse
|
4
|
López-Ales E, Menchón-Lara RM, Simmross-Wattenberg F, Rodríguez-Cayetano M, Martín-Fernández M, Alberola-López C. Multi-Device Parallel MRI Reconstruction: Efficient Partitioning for Undersampled 5D Cardiac CINE. Sensors (Basel) 2024; 24:1313. [PMID: 38400470 PMCID: PMC10891760 DOI: 10.3390/s24041313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 02/04/2024] [Accepted: 02/16/2024] [Indexed: 02/25/2024]
Abstract
Cardiac CINE, a form of dynamic cardiac MRI, is indispensable in the diagnosis and treatment of heart conditions, offering detailed visualization essential for the early detection of cardiac diseases. As the demand for higher-resolution images increases, so does the volume of data requiring processing, presenting significant computational challenges that can impede the efficiency of diagnostic imaging. Our research presents an approach that takes advantage of the computational power of multiple Graphics Processing Units (GPUs) to address these challenges. GPUs are devices capable of performing large volumes of computations in a short period, and have significantly improved the cardiac MRI reconstruction process, allowing images to be produced faster. The innovation of our work resides in utilizing a multi-device system capable of processing the substantial data volumes demanded by high-resolution, five-dimensional cardiac MRI. This system surpasses the memory capacity limitations of single GPUs by partitioning large datasets into smaller, manageable segments for parallel processing, thereby preserving image integrity and accelerating reconstruction times. Utilizing OpenCL technology, our system offers adaptability and cross-platform functionality, ensuring wider applicability. The proposed multi-device approach offers an advancement in medical imaging, accelerating the reconstruction process and facilitating faster and more effective cardiac health assessment.
Collapse
Affiliation(s)
- Emilio López-Ales
- Laboratorio de Procesado de Imagen, Universidad de Valladolid, Campus Miguel Delibes sn., 47011 Valladolid, Spain; (R.-M.M.-L.); (F.S.-W.); (M.R.-C.); (M.M.-F.)
| | | | | | | | | | - Carlos Alberola-López
- Laboratorio de Procesado de Imagen, Universidad de Valladolid, Campus Miguel Delibes sn., 47011 Valladolid, Spain; (R.-M.M.-L.); (F.S.-W.); (M.R.-C.); (M.M.-F.)
| |
Collapse
|
5
|
Kopal I, Labaj I, Vršková J, Harničárová M, Valíček J, Tozan H. Intelligent Modelling of the Real Dynamic Viscosity of Rubber Blends Using Parallel Computing. Polymers (Basel) 2023; 15:3636. [PMID: 37688262 PMCID: PMC10490080 DOI: 10.3390/polym15173636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 08/27/2023] [Accepted: 08/30/2023] [Indexed: 09/10/2023] Open
Abstract
Modelling the flow properties of rubber blends makes it possible to predict their rheological behaviour during the processing and production of rubber-based products. As the nonlinear nature of such complex processes complicates the creation of exact analytical models, it is appropriate to use artificial intelligence tools in this modelling. The present study was implemented to develop a highly efficient artificial neural network model, optimised using a novel training algorithm with fast parallel computing to predict the results of rheological tests of rubber blends performed under different conditions. A series of 120 real dynamic viscosity-time curves, acquired by a rubber process analyser for styrene-butadiene rubber blends with varying carbon black contents vulcanised at different temperatures, were analysed using a Generalised Regression Neural Network. The model was optimised by limiting the fitting error of the training dataset to a pre-specified value of less than 1%. All repeated calculations were made via parallel computing with multiple computer cores, which significantly reduces the total computation time. An excellent agreement between the predicted and measured generalisation data was found, with an error of less than 4.7%, confirming the high generalisation performance of the newly developed model.
Collapse
Affiliation(s)
- Ivan Kopal
- Department of Numerical Methods and Computational Modeling, Faculty of Industrial Technologies in Púchov, Alexander Dubček University of Trenčín, Ivana Krasku 491/30, 020 01 Púchov, Slovakia; (I.K.); (I.L.); (J.V.)
| | - Ivan Labaj
- Department of Numerical Methods and Computational Modeling, Faculty of Industrial Technologies in Púchov, Alexander Dubček University of Trenčín, Ivana Krasku 491/30, 020 01 Púchov, Slovakia; (I.K.); (I.L.); (J.V.)
| | - Juliána Vršková
- Department of Numerical Methods and Computational Modeling, Faculty of Industrial Technologies in Púchov, Alexander Dubček University of Trenčín, Ivana Krasku 491/30, 020 01 Púchov, Slovakia; (I.K.); (I.L.); (J.V.)
| | - Marta Harničárová
- Department of Electrical Engineering, Automation and Informatics, Faculty of Engineering, Slovak University of Agriculture in Nitra, Tr. A. Hlinku 2, 949 76 Nitra, Slovakia;
- Department of Mechanical Engineering, Faculty of Technology, Institute of Technology and Business in České Budějovice, Okružní 10, 370 01 České Budějovice, Czech Republic
| | - Jan Valíček
- Department of Electrical Engineering, Automation and Informatics, Faculty of Engineering, Slovak University of Agriculture in Nitra, Tr. A. Hlinku 2, 949 76 Nitra, Slovakia;
- Department of Mechanical Engineering, Faculty of Technology, Institute of Technology and Business in České Budějovice, Okružní 10, 370 01 České Budějovice, Czech Republic
| | - Hakan Tozan
- College of Engineering and Technology, American University of the Middle East, Egaila 54200, Kuwait;
| |
Collapse
|
6
|
Jung J, Kobayashi C, Sugita Y. Acceleration of generalized replica exchange with solute tempering simulations of large biological systems on massively parallel supercomputer. J Comput Chem 2023. [PMID: 37141320 DOI: 10.1002/jcc.27124] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 04/10/2023] [Accepted: 04/14/2023] [Indexed: 05/06/2023]
Abstract
Generalized replica exchange with solute tempering (gREST) is one of the enhanced sampling algorithms for proteins or other systems with rugged energy landscapes. Unlike the replica-exchange molecular dynamics (REMD) method, solvent temperatures are the same in all replicas, while solute temperatures are different and are exchanged frequently between replicas for exploring various solute structures. Here, we apply the gREST scheme to large biological systems containing over one million atoms using a large number of processors in a supercomputer. First, communication time on a multi-dimensional torus network is reduced by matching each replica to MPI processors optimally. This is applicable not only to gREST but also to other multi-copy algorithms. Second, energy evaluations, which are necessary for the multistate bennet acceptance ratio (MBAR) method for free energy estimations, are performed on-the-fly during the gREST simulations. Using these two advanced schemes, we observed 57.72 ns/day performance in 128-replica gREST calculations with 1.5 million atoms system using 16,384 nodes in Fugaku. These schemes implemented in the latest version of GENESIS software could open new possibilities to answer unresolved questions on large biomolecular complex systems with slow conformational dynamics.
Collapse
Affiliation(s)
- Jaewoon Jung
- Computational Biophysics Research Team, RIKEN Center for Computational Science, Kobe, Japan
- Theoretical Molecular Science Laboratory, RIKEN Cluster for Pioneering Research, Saitama, Japan
| | - Chigusa Kobayashi
- Computational Biophysics Research Team, RIKEN Center for Computational Science, Kobe, Japan
| | - Yuji Sugita
- Computational Biophysics Research Team, RIKEN Center for Computational Science, Kobe, Japan
- Theoretical Molecular Science Laboratory, RIKEN Cluster for Pioneering Research, Saitama, Japan
- Laboratory for Biomolecular Function Simulation, RIKEN Center for Biosystems Dynamics Research, Kobe, Japan
| |
Collapse
|
7
|
Zhang Y, Sun H, Lian X, Tang J, Zhu F. ANPELA: Significantly Enhanced Quantification Tool for Cytometry-Based Single-Cell Proteomics. Adv Sci (Weinh) 2023; 10:e2207061. [PMID: 36950745 DOI: 10.1002/advs.202207061] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 02/13/2023] [Indexed: 05/27/2023]
Abstract
ANPELA is widely used for quantifying traditional bulk proteomic data. Recently, there is a clear shift from bulk proteomics to the single-cell ones (SCP), for which powerful cytometry techniques demonstrate the fantastic capacity of capturing cellular heterogeneity that is completely overlooked by traditional bulk profiling. However, the in-depth and high-quality quantification of SCP data is still challenging and severely affected by the large numbers of quantification workflows and extreme performance dependence on the studied datasets. In other words, the proper selection of well-performing workflow(s) for any studied dataset is elusory, and it is urgently needed to have a significantly enhanced and accelerated tool to address this issue. However, no such tool is developed yet. Herein, ANPELA is therefore updated to its 2.0 version (https://idrblab.org/anpela/), which is unique in providing the most comprehensive set of quantification alternatives (>1000 workflows) among all existing tools, enabling systematic performance evaluation from multiple perspectives based on machine learning, and identifying the optimal workflow(s) using overall performance ranking together with the parallel computation. Extensive validation on different benchmark datasets and representative application scenarios suggest the great application potential of ANPELA in current SCP research for gaining more accurate and reliable biological insights.
Collapse
Affiliation(s)
- Ying Zhang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China
| | - Huaicheng Sun
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China
| | - Xichen Lian
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China
| | - Jing Tang
- Department of Bioinformatics, Chongqing Medical University, Chongqing, 400016, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China
| |
Collapse
|
8
|
AL-Jumaili AHA, Muniyandi RC, Hasan MK, Paw JKS, Singh MJ. Big Data Analytics Using Cloud Computing Based Frameworks for Power Management Systems: Status, Constraints, and Future Recommendations. Sensors (Basel) 2023; 23:2952. [PMID: 36991663 PMCID: PMC10051254 DOI: 10.3390/s23062952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Revised: 02/03/2023] [Accepted: 02/10/2023] [Indexed: 06/19/2023]
Abstract
Traditional parallel computing for power management systems has prime challenges such as execution time, computational complexity, and efficiency like process time and delays in power system condition monitoring, particularly consumer power consumption, weather data, and power generation for detecting and predicting data mining in the centralized parallel processing and diagnosis. Due to these constraints, data management has become a critical research consideration and bottleneck. To cope with these constraints, cloud computing-based methodologies have been introduced for managing data efficiently in power management systems. This paper reviews the concept of cloud computing architecture that can meet the multi-level real-time requirements to improve monitoring and performance which is designed for different application scenarios for power system monitoring. Then, cloud computing solutions are discussed under the background of big data, and emerging parallel programming models such as Hadoop, Spark, and Storm are briefly described to analyze the advancement, constraints, and innovations. The key performance metrics of cloud computing applications such as core data sampling, modeling, and analyzing the competitiveness of big data was modeled by applying related hypotheses. Finally, it introduces a new design concept with cloud computing and eventually some recommendations focusing on cloud computing infrastructure, and methods for managing real-time big data in the power management system that solve the data mining challenges.
Collapse
Affiliation(s)
- Ahmed Hadi Ali AL-Jumaili
- Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
- Computer Centre Department, University of Fallujah, Anbar 00964, Iraq
| | - Ravie Chandren Muniyandi
- Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
| | - Mohammad Kamrul Hasan
- Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
| | - Johnny Koh Siaw Paw
- Department of Electronic & Communication Engineering, Universiti Tenaga Nasional, Km 7, Jalan Ikram-Uniten, Kajang 43009, Selangor, Malaysia
| | - Mandeep Jit Singh
- Department of Electrical, Electronic and System Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
| |
Collapse
|
9
|
Wang Y, Fu T, Wu C, Fan J, Song H, Xiao D, Lin Y, Liu F, Yang J. Adaptive tetrahedral interpolation for reconstruction of uneven freehand 3D ultrasound. Phys Med Biol 2023; 68. [PMID: 36731138 DOI: 10.1088/1361-6560/acb88c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 02/02/2023] [Indexed: 02/04/2023]
Abstract
Objective.Freehand 3D ultrasound volume reconstruction has received considerable attention in medical research because it can freely perform spatial imaging at a low cost. However, the uneven distribution of the original ultrasound images in space reduces the reconstruction effect of the traditional method.Approach.An adaptive tetrahedral interpolation algorithm is proposed to reconstruct 3D ultrasound volume data. The algorithm adaptively divides the unevenly distributed images into numerous tetrahedrons and interpolates the voxel value in each tetrahedron to reconstruct 3D ultrasound volume data.Main results.Extensive experiments on simulated and clinical data confirm that the proposed method can achieve more accurate reconstruction than six benchmark methods. Specifically, the averaged interpolation error at the gray level can be reduced by 0.22-0.82, and the peak signal-to-noise ratio and the mean structure similarity can be improved by 0.32-1.83 dB and 0.01-0.05, respectively.Significance.With the parallel implementation of the algorithm, one 3D ultrasound volume data with size 279 × 279 × 276 can be reconstructed from 100 slices 2D ultrasound images with size 200 × 200 at 1.04 s. Such a quick and accurate approach has practical value in medical research.
Collapse
Affiliation(s)
- Yifan Wang
- Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, People's Republic of China
| | - Tianyu Fu
- School of Medical Technology, Beijing Institute of Technology, Beijing 100081, People's Republic of China
| | - Chan Wu
- Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, People's Republic of China
| | - Jingfan Fan
- Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, People's Republic of China
| | - Hong Song
- School of Software, Beijing Institute of Technology, Beijing 100081, People's Republic of China
| | - Deqiang Xiao
- Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, People's Republic of China
| | - Yucong Lin
- School of Medical Technology, Beijing Institute of Technology, Beijing 100081, People's Republic of China
| | - Fangyi Liu
- Department of Interventional Ultrasound, Chinese PLA General Hospital, Beijing 100853, People's Republic of China
| | - Jian Yang
- Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, People's Republic of China
| |
Collapse
|
10
|
Sebastian S, Roy S, Kalita J. A generic parallel framework for inferring large-scale gene regulatory networks from expression profiles: application to Alzheimer's disease network. Brief Bioinform 2023; 24:6868522. [PMID: 36534961 DOI: 10.1093/bib/bbac482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Revised: 09/14/2022] [Accepted: 10/11/2022] [Indexed: 12/23/2022] Open
Abstract
The inference of large-scale gene regulatory networks is essential for understanding comprehensive interactions among genes. Most existing methods are limited to reconstructing networks with a few hundred nodes. Therefore, parallel computing paradigms must be leveraged to construct large networks. We propose a generic parallel framework that enables any existing method, without re-engineering, to infer large networks in parallel, guaranteeing quality output. The framework is tested on 15 inference methods (not limited to) employing in silico benchmarks and real-world large expression matrices, followed by qualitative and speedup assessment. The framework does not compromise the quality of the base serial inference method. We rank the candidate methods and use the top-performing method to infer an Alzheimer's Disease (AD) affected network from large expression profiles of a triple transgenic mouse model consisting of 45,101 genes. The resultant network is further explored to obtain hub genes that emerge functionally related to the disease. We partition the network into 41 modules and conduct pathway enrichment analysis, revealing that a good number of participating genes are collectively responsible for several brain disorders, including AD. Finally, we extract the interactions of a few known AD genes and observe that they are periphery genes connected to the network's hub genes. Availability: The R implementation of the framework is downloadable from https://github.com/Netralab/GenericParallelFramework.
Collapse
Affiliation(s)
- Softya Sebastian
- Network Reconstruction and Analysis (NetRA) Lab, Department of Computer Applications, Sikkim University, 6th Mile, Gangtok, 737102, Sikkim, India
| | - Swarup Roy
- Network Reconstruction and Analysis (NetRA) Lab, Department of Computer Applications, Sikkim University, 6th Mile, Gangtok, 737102, Sikkim, India
| | - Jugal Kalita
- Department of Computer Science, University of Colorado at Colorado Springs, CO, 80918 USA
| |
Collapse
|
11
|
Kauth K, Stadtmann T, Sobhani V, Gemmeke T. neuroAIx-Framework: design of future neuroscience simulation systems exhibiting execution of the cortical microcircuit model 20× faster than biological real-time. Front Comput Neurosci 2023; 17:1144143. [PMID: 37152299 PMCID: PMC10156974 DOI: 10.3389/fncom.2023.1144143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 03/30/2023] [Indexed: 05/09/2023] Open
Abstract
Introduction Research in the field of computational neuroscience relies on highly capable simulation platforms. With real-time capabilities surpassed for established models like the cortical microcircuit, it is time to conceive next-generation systems: neuroscience simulators providing significant acceleration, even for larger networks with natural density, biologically plausible multi-compartment models and the modeling of long-term and structural plasticity. Methods Stressing the need for agility to adapt to new concepts or findings in the domain of neuroscience, we have developed the neuroAIx-Framework consisting of an empirical modeling tool, a virtual prototype, and a cluster of FPGA boards. This framework is designed to support and accelerate the continuous development of such platforms driven by new insights in neuroscience. Results Based on design space explorations using this framework, we devised and realized an FPGA cluster consisting of 35 NetFPGA SUME boards. Discussion This system functions as an evaluation platform for our framework. At the same time, it resulted in a fully deterministic neuroscience simulation system surpassing the state of the art in both performance and energy efficiency. It is capable of simulating the microcircuit with 20× acceleration compared to biological real-time and achieves an energy efficiency of 48nJ per synaptic event.
Collapse
|
12
|
Tran NY, Hieu HT, Bao PT. A proposed scenario to improve the Ncut algorithm in segmentation. Front Big Data 2023; 6:1134946. [PMID: 36936997 PMCID: PMC10020342 DOI: 10.3389/fdata.2023.1134946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Accepted: 02/13/2023] [Indexed: 03/06/2023] Open
Abstract
In image segmentation, there are many methods to accomplish the result of segmenting an image into k clusters. However, the number of clusters k is always defined before running the process. It is defined by some observation or knowledge based on the application. In this paper, we propose a new scenario in order to define the value k clusters automatically using histogram information. This scenario is applied to Ncut algorithm and speeds up the running time by using CUDA language to parallel computing in GPU. The Ncut is improved in four steps: determination of number of clusters in segmentation, computing the similarity matrix W, computing the similarity matrix's eigenvalues, and grouping on the Fuzzy C-Means (FCM) clustering algorithm. Some experimental results are shown to prove that our scenario is 20 times faster than the Ncut algorithm while keeping the same accuracy.
Collapse
Affiliation(s)
- Nhu Y. Tran
- Faculty of Information Technology, Industrial University of Ho Chi Minh City, Ho Chi Minh City, Vietnam
- Information Technology Faculty, Ho Chi Minh City University of Food Industry, Ho Chi Minh City, Vietnam
| | - Huynh Trung Hieu
- Faculty of Information Technology, Industrial University of Ho Chi Minh City, Ho Chi Minh City, Vietnam
| | - Pham The Bao
- Information Science Faculty, Sai Gon University, Ho Chi Minh City, Vietnam
- *Correspondence: Pham The Bao
| |
Collapse
|
13
|
Abdusalomov AB, Safarov F, Rakhimov M, Turaev B, Whangbo TK. Improved Feature Parameter Extraction from Speech Signals Using Machine Learning Algorithm. Sensors (Basel) 2022; 22:8122. [PMID: 36365819 PMCID: PMC9654697 DOI: 10.3390/s22218122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 10/14/2022] [Accepted: 10/20/2022] [Indexed: 06/16/2023]
Abstract
Speech recognition refers to the capability of software or hardware to receive a speech signal, identify the speaker's features in the speech signal, and recognize the speaker thereafter. In general, the speech recognition process involves three main steps: acoustic processing, feature extraction, and classification/recognition. The purpose of feature extraction is to illustrate a speech signal using a predetermined number of signal components. This is because all information in the acoustic signal is excessively cumbersome to handle, and some information is irrelevant in the identification task. This study proposes a machine learning-based approach that performs feature parameter extraction from speech signals to improve the performance of speech recognition applications in real-time smart city environments. Moreover, the principle of mapping a block of main memory to the cache is used efficiently to reduce computing time. The block size of cache memory is a parameter that strongly affects the cache performance. In particular, the implementation of such processes in real-time systems requires a high computation speed. Processing speed plays an important role in speech recognition in real-time systems. It requires the use of modern technologies and fast algorithms that increase the acceleration in extracting the feature parameters from speech signals. Problems with overclocking during the digital processing of speech signals have yet to be completely resolved. The experimental results demonstrate that the proposed method successfully extracts the signal features and achieves seamless classification performance compared to other conventional speech recognition algorithms.
Collapse
Affiliation(s)
| | - Furkat Safarov
- Department of Computer Engineering, Gachon University, Sujeong-gu, Seongnam-si 461-701, Gyeonggi-do, Korea
| | - Mekhriddin Rakhimov
- Department of Artificial Intelligence, Tashkent University of Information Technologies Named after Muhammad Al-Khwarizmi, Tashkent 100200, Uzbekistan
| | - Boburkhon Turaev
- Department of Artificial Intelligence, Tashkent University of Information Technologies Named after Muhammad Al-Khwarizmi, Tashkent 100200, Uzbekistan
| | - Taeg Keun Whangbo
- Department of Computer Engineering, Gachon University, Sujeong-gu, Seongnam-si 461-701, Gyeonggi-do, Korea
| |
Collapse
|
14
|
Cheng C, Feng X, Li X, Wu M. Robust analysis of cancer heterogeneity for high-dimensional data. Stat Med 2022; 41:5448-5462. [PMID: 36117143 DOI: 10.1002/sim.9578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 06/04/2022] [Accepted: 09/05/2022] [Indexed: 11/06/2022]
Abstract
Cancer heterogeneity plays an important role in the understanding of tumor etiology, progression, and response to treatment. To accommodate heterogeneity, cancer subgroup analysis has been extensively conducted. However, most of the existing studies share the limitation that they cannot accommodate heavy-tailed or contaminated outcomes and also high dimensional covariates, both of which are not uncommon in biomedical research. In this study, we propose a robust subgroup identification approach based on M-estimators together with concave and pairwise fusion penalties, which advances from existing studies by effectively accommodating high-dimensional data containing some outliers. The penalties are applied on both latent heterogeneity factors and covariates, where the estimation is expected to achieve subgroup identification and variable selection simultaneously, with the number of subgroups being apriori unknown. We innovatively develop an algorithm based on parallel computing strategy, with a significant advantage of capable of processing large-scale data. The convergence property of the proposed algorithm, oracle property of the penalized M-estimators, and selection consistency of the proposed BIC criterion are carefully established. Simulation and analysis of TCGA breast cancer data demonstrate that the proposed approach is promising to efficiently identify underlying subgroups in high-dimensional data.
Collapse
Affiliation(s)
- Chao Cheng
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China
| | - Xingdong Feng
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China
| | - Xiaoguang Li
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China
| | - Mengyun Wu
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China
| |
Collapse
|
15
|
Makarov VL, Bakhtizin AR, Sushko ED, Sushko GB. Creation of a Supercomputer Simulation of a Society with Different Types of Active Agents and Its Approbation. Her Russ Acad Sci 2022; 92:268-275. [PMID: 36035028 PMCID: PMC9395919 DOI: 10.1134/s1019331622030182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Revised: 06/30/2021] [Accepted: 09/15/2021] [Indexed: 06/15/2023]
Abstract
This article continues a series of works devoted to the creation of large agent-based models, built as an artificial society, and the development of software for their implementation-the MÖBIUS design system for scalable agent-based models. The basic core of the system is a demographic model that simulates the natural movement of the population. A new stage in the development of the work discussed in this article was the creation on the basis of this core of an agent-based model of Russia, which includes families as agents of a new type, hierarchically connected with human agents. In addition, objects of a new type were introduced into the model-projects that provide for the creation in an artificial environment of analogues of complex control actions aimed at stimulating fertility. Developed on the basis of simulating the reaction of individual families to the introduced regional support measures, the model makes it possible to track their impact on key demographic indicators. The agent-based model of Russia was tested on data for a long retrospective period using the example of the launch of maternal capital programs and showed good agreement with official statistics.
Collapse
Affiliation(s)
- V. L. Makarov
- Central Economics and Mathematics Institute, Russian Academy of Sciences, Moscow, Russia
| | - A. R. Bakhtizin
- Central Economics and Mathematics Institute, Russian Academy of Sciences, Moscow, Russia
| | - E. D. Sushko
- Central Economics and Mathematics Institute, Russian Academy of Sciences, Moscow, Russia
| | - G. B. Sushko
- Central Economics and Mathematics Institute, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
16
|
Trensch G, Morrison A. A System-on-Chip Based Hybrid Neuromorphic Compute Node Architecture for Reproducible Hyper-Real-Time Simulations of Spiking Neural Networks. Front Neuroinform 2022; 16:884033. [PMID: 35846779 PMCID: PMC9277345 DOI: 10.3389/fninf.2022.884033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Accepted: 05/23/2022] [Indexed: 11/23/2022] Open
Abstract
Despite the great strides neuroscience has made in recent decades, the underlying principles of brain function remain largely unknown. Advancing the field strongly depends on the ability to study large-scale neural networks and perform complex simulations. In this context, simulations in hyper-real-time are of high interest, as they would enable both comprehensive parameter scans and the study of slow processes, such as learning and long-term memory. Not even the fastest supercomputer available today is able to meet the challenge of accurate and reproducible simulation with hyper-real acceleration. The development of novel neuromorphic computer architectures holds out promise, but the high costs and long development cycles for application-specific hardware solutions makes it difficult to keep pace with the rapid developments in neuroscience. However, advances in System-on-Chip (SoC) device technology and tools are now providing interesting new design possibilities for application-specific implementations. Here, we present a novel hybrid software-hardware architecture approach for a neuromorphic compute node intended to work in a multi-node cluster configuration. The node design builds on the Xilinx Zynq-7000 SoC device architecture that combines a powerful programmable logic gate array (FPGA) and a dual-core ARM Cortex-A9 processor extension on a single chip. Our proposed architecture makes use of both and takes advantage of their tight coupling. We show that available SoC device technology can be used to build smaller neuromorphic computing clusters that enable hyper-real-time simulation of networks consisting of tens of thousands of neurons, and are thus capable of meeting the high demands for modeling and simulation in neuroscience.
Collapse
Affiliation(s)
- Guido Trensch
- Simulation and Data Laboratory Neuroscience, Jülich Supercomputing Centre, Institute for Advanced Simulation, Jülich Research Centre, Jülich, Germany.,Department of Computer Science 3-Software Engineering, RWTH Aachen University, Aachen, Germany
| | - Abigail Morrison
- Simulation and Data Laboratory Neuroscience, Jülich Supercomputing Centre, Institute for Advanced Simulation, Jülich Research Centre, Jülich, Germany.,Department of Computer Science 3-Software Engineering, RWTH Aachen University, Aachen, Germany.,Institute of Neuroscience and Medicine (INM-6), Institute for Advanced Simulation (IAS-6), JARA-Institute Brain Structure-Function Relationship (JBI-1/INM-10), Research Centre Jülich, Jülich, Germany
| |
Collapse
|
17
|
Ahmad K, Rizzi A, Capelli R, Mandelli D, Lyu W, Carloni P. Enhanced-Sampling Simulations for the Estimation of Ligand Binding Kinetics: Current Status and Perspective. Front Mol Biosci 2022; 9:899805. [PMID: 35755817 PMCID: PMC9216551 DOI: 10.3389/fmolb.2022.899805] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Accepted: 05/09/2022] [Indexed: 12/12/2022] Open
Abstract
The dissociation rate (k off) associated with ligand unbinding events from proteins is a parameter of fundamental importance in drug design. Here we review recent major advancements in molecular simulation methodologies for the prediction of k off. Next, we discuss the impact of the potential energy function models on the accuracy of calculated k off values. Finally, we provide a perspective from high-performance computing and machine learning which might help improve such predictions.
Collapse
Affiliation(s)
- Katya Ahmad
- Computational Biomedicine (IAS-5/INM-9), Forschungszentrum Jülich, Jülich, Germany
| | - Andrea Rizzi
- Computational Biomedicine (IAS-5/INM-9), Forschungszentrum Jülich, Jülich, Germany
- Atomistic Simulations, Istituto Italiano di Tecnologia, Genova, Italy
| | - Riccardo Capelli
- Department of Applied Science and Technology (DISAT), Politecnico di Torino, Torino, Italy
| | - Davide Mandelli
- Computational Biomedicine (IAS-5/INM-9), Forschungszentrum Jülich, Jülich, Germany
| | - Wenping Lyu
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, China
- School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, China
| | - Paolo Carloni
- Computational Biomedicine (IAS-5/INM-9), Forschungszentrum Jülich, Jülich, Germany
- Molecular Neuroscience and Neuroimaging (INM-11), Forschungszentrum Jülich, Jülich, Germany
| |
Collapse
|
18
|
Pham M, Li H, Yuan Y, Mou C, Ramachandran K, Xu Z, Tu Y. Dynamic Memory Management in Massively Parallel Systems: A Case on GPUs. ICS 2022; 2022:24. [PMID: 35943281 PMCID: PMC9357265 DOI: 10.1145/3524059.3532387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Due to the high level of parallelism, there are unique challenges in developing system software on massively parallel hardware such as GPUs. One such challenge is designing a dynamic memory allocator whose task is to allocate memory chunks to requesting threads at runtime. State-of-the-art GPU memory allocators maintain a global data structure holding metadata to facilitate allocation/deallocation. However, the centralized data structure can easily become a bottleneck in a massively parallel system. In this paper, we present a novel approach for designing dynamic memory allocation without a centralized data structure. The core idea is to let threads follow a random search procedure to locate free pages. Then we further extend to more advanced designs and algorithms that can achieve an order of magnitude improvement over the basic idea. We present mathematical proofs to demonstrate that (1) the basic random search design achieves asymptotically lower latency than the traditional queue-based design and (2) the advanced designs achieve significant improvement over the basic idea. Extensive experiments show consistency to our mathematical models and demonstrate that our solutions can achieve up to two orders of magnitude improvement in latency over the best-known existing solutions.
Collapse
Affiliation(s)
- Minh Pham
- University of South Florida, Tampa, FL, USA
| | - Hao Li
- University of South Florida, Tampa, FL, USA
| | - Yongke Yuan
- Beijing University of Technology, Beijing, China
| | | | | | - Zichen Xu
- Jiaxing Neofelis Scientific, Inc., Jiaxing, Zhejiang, China
| | - Yicheng Tu
- University of South Florida, Tampa, FL, USA
| |
Collapse
|
19
|
Liu Z, Lin Y, Hoover J, Beene D, Charley PH, Singer N. Individual level spatial-temporal modelling of exposure potential of livestock in the Cove Wash watershed, Arizona. Ann GIS 2022; 29:87-107. [PMID: 37090684 PMCID: PMC10117392 DOI: 10.1080/19475683.2022.2075935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Accepted: 05/02/2022] [Indexed: 05/03/2023]
Abstract
Personal exposure studies suffer from uncertainty issues, largely stemming from individual behavior uncertainties. Built on spatial-temporal exposure analysis and methods, this study proposed a novel approach to spatial-temporal modeling that incorporated behavior classifications taking into account uncertainties, to estimate individual livestock exposure potential. The new approach was applied in a community-based research project with a Tribal community in the southwest United States. The community project examined the geospatial and temporal grazing patterns of domesticated livestock in a watershed containing 52 abandoned uranium mines (AUMs). Thus, the study aimed to 1) classify Global Positioning System (GPS) data from livestock into three behavior subgroups - grazing, traveling or resting; 2) calculate the daily cumulative exposure potential for livestock; 3) assess the performance of the computational method with and without behavior classifications. Using Lotek Litetrack GPS collars, we collected data at a 20-minute-interval for 2 flocks of sheep and goats during the spring and summer of 2019. Analysis and modeling of GPS data demonstrated no significant difference in individual cumulative exposure potential within each flock when animal behaviors with probability/uncertainties were considered. However, when daily cumulative exposure potential was calculated without consideration of animal behavior or probability/uncertainties, significant differences among animals within a herd were observed, which does not match animal grazing behaviors reported by livestock owners. These results suggest that the proposed method of including behavior subgroups with probability/uncertainties more closely resembled the observed grazing behaviors reported by livestock owners. Results from the research may be used for future intervention and policy-making on remediation efforts in communities where grazing livestock may encounter environmental contaminants. This research also demonstrates a novel robust geographic information system (GIS)-based framework to estimate cumulative exposure potential to environmental contaminants and provides critical information to address community questions on livestock exposure to AUMs.
Collapse
Affiliation(s)
- Zhuoming Liu
- Department of Geography and Environmental Studies, University of New Mexico, Albuquerque, NM, USA
| | - Yan Lin
- Department of Geography and Environmental Studies, University of New Mexico, Albuquerque, NM, USA
| | - Joseph Hoover
- Department of Social Sciences and Cultural Studies, Montana State University Billings, Bozeman, MT, USA
| | - Daniel Beene
- Department of Geography and Environmental Studies, University of New Mexico, Albuquerque, NM, USA
- Community Environmental Health Program, College of Pharmacy, University of New Mexico, Albuquerque, NM, USA
| | - Perry H. Charley
- Dine Environmental Consultant, Beclabito Chapter, Navajo Nation, NM, USA
| | | |
Collapse
|
20
|
Lim HGM, Hsiao SH, Fann YC, Lee YCG. Robust Mutation Profiling of SARS-CoV-2 Variants from Multiple Raw Illumina Sequencing Data with Cloud Workflow. Genes (Basel) 2022. [PMID: 35456492 DOI: 10.3390/genes1304068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/07/2023] Open
Abstract
Several variants of the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) are emerging all over the world. Variant surveillance from genome sequencing has become crucial to determine if mutations in these variants are rendering the virus more infectious, potent, or resistant to existing vaccines and therapeutics. Meanwhile, analyzing many raw sequencing data repeatedly with currently available code-based bioinformatics tools is tremendously challenging to be implemented in this unprecedented pandemic time due to the fact of limited experts and computational resources. Therefore, in order to hasten variant surveillance efforts, we developed an installation-free cloud workflow for robust mutation profiling of SARS-CoV-2 variants from multiple Illumina sequencing data. Herein, 55 raw sequencing data representing four early SARS-CoV-2 variants of concern (Alpha, Beta, Gamma, and Delta) from an open-access database were used to test our workflow performance. As a result, our workflow could automatically identify mutated sites of the variants along with reliable annotation of the protein-coding genes at cost-effective and timely manner for all by harnessing parallel cloud computing in one execution under resource-limitation settings. In addition, our workflow can also generate a consensus genome sequence which can be shared with others in public data repositories to support global variant surveillance efforts.
Collapse
Affiliation(s)
- Hendrick Gao-Min Lim
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei 11031, Taiwan
| | - Shih-Hsin Hsiao
- Division of Pulmonary Medicine, Department of Internal Medicine, School of Medicine, College of Medicine, Taipei Medical University, Taipei 11031, Taiwan
- Division of Pulmonary Medicine, Department of Internal Medicine, Taipei Medical University Hospital, Taipei 11031, Taiwan
| | - Yang C Fann
- IT and Bioinformatics Program, Division of Intramural, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA
| | - Yuan-Chii Gladys Lee
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei 11031, Taiwan
| |
Collapse
|
21
|
Wu W, Yang Y, Kang J, He K. Improving large-scale estimation and inference for profiling health care providers. Stat Med 2022; 41:2840-2853. [PMID: 35318706 PMCID: PMC9314652 DOI: 10.1002/sim.9387] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 02/04/2022] [Accepted: 02/21/2022] [Indexed: 01/25/2023]
Abstract
Provider profiling has been recognized as a useful tool in monitoring health care quality, facilitating inter-provider care coordination, and improving medical cost-effectiveness. Existing methods often use generalized linear models with fixed provider effects, especially when profiling dialysis facilities. As the number of providers under evaluation escalates, the computational burden becomes formidable even for specially designed workstations. To address this challenge, we introduce a serial blockwise inversion Newton algorithm exploiting the block structure of the information matrix. A shared-memory divide-and-conquer algorithm is proposed to further boost computational efficiency. In addition to the computational challenge, the current literature lacks an appropriate inferential approach to detecting providers with outlying performance especially when small providers with extreme outcomes are present. In this context, traditional score and Wald tests relying on large-sample distributions of the test statistics lead to inaccurate approximations of the small-sample properties. In light of the inferential issue, we develop an exact test of provider effects using exact finite-sample distributions, with the Poisson-binomial distribution as a special case when the outcome is binary. Simulation analyses demonstrate improved estimation and inference over existing methods. The proposed methods are applied to profiling dialysis facilities based on emergency department encounters using a dialysis patient database from the Centers for Medicare & Medicaid Services.
Collapse
Affiliation(s)
- Wenbo Wu
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan.,Kidney Epidemiology and Cost Center, University of Michigan, Ann Arbor, Michigan
| | - Yuan Yang
- Parexel International, Newton, Massachusetts
| | - Jian Kang
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan.,Kidney Epidemiology and Cost Center, University of Michigan, Ann Arbor, Michigan
| | - Kevin He
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan.,Kidney Epidemiology and Cost Center, University of Michigan, Ann Arbor, Michigan
| |
Collapse
|
22
|
Paik H, Cho Y, Cho SB, Kwon OK. MPI-GWAS: a supercomputing-aided permutation approach for genomewide association studies. Genomics Inform 2022; 20:e14. [PMID: 35399013 PMCID: PMC9001997 DOI: 10.5808/gi.22001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Accepted: 02/10/2022] [Indexed: 11/21/2022] Open
Abstract
Permutation testing is a robust and popular approach for significance testing in genomic research that has the advantage of reducing inflated type 1 error rates; however, its computational cost is notorious in genome-wide association studies (GWAS). Here, we developed a supercomputing-aided approach to accelerate the permutation testing for GWAS, based on the message-passing interface (MPI) on parallel computing architecture. Our application, called MPI-GWAS, conducts MPI-based permutation testing using a parallel computing approach with our supercomputing system, Nurion (8,305 compute nodes, and 563,740 central processing units [CPUs]). For 107 permutations of one locus in MPI-GWAS, it was calculated in 600 s using 2,720 CPU cores. For 107 permutations of ~30,000–50,000 loci in over 7,000 subjects, the total elapsed time was ~4 days in the Nurion supercomputer. Thus, MPI-GWAS enables us to feasibly compute the permutation-based GWAS within a reason-able time by harnessing the power of parallel computing resources.
Collapse
Affiliation(s)
- Hyojung Paik
- Division of Supercomputing, Center for supercomputing application and research, Korea Institute of Science and Technology Information (KISTI), Daejeon 34141, Korea.,Department of Data and HPC Science, University of Science and Technology (UST), Daejeon, 34141, Korea
| | - Yongseong Cho
- Division of Supercomputing, Center for supercomputing application and research, Korea Institute of Science and Technology Information (KISTI), Daejeon 34141, Korea
| | - Seong Beom Cho
- Department of Bio-Medical Informatics, Gachon University College of Medicine, Incheon 21565, Korea
| | - Oh-Kyoung Kwon
- Division of Supercomputing, Center for supercomputing application and research, Korea Institute of Science and Technology Information (KISTI), Daejeon 34141, Korea
| |
Collapse
|
23
|
Pronold J, Jordan J, Wylie BJN, Kitayama I, Diesmann M, Kunkel S. Routing Brain Traffic Through the Von Neumann Bottleneck: Parallel Sorting and Refactoring. Front Neuroinform 2022; 15:785068. [PMID: 35300490 PMCID: PMC8921864 DOI: 10.3389/fninf.2021.785068] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Accepted: 12/24/2021] [Indexed: 11/26/2022] Open
Abstract
Generic simulation code for spiking neuronal networks spends the major part of the time in the phase where spikes have arrived at a compute node and need to be delivered to their target neurons. These spikes were emitted over the last interval between communication steps by source neurons distributed across many compute nodes and are inherently irregular and unsorted with respect to their targets. For finding those targets, the spikes need to be dispatched to a three-dimensional data structure with decisions on target thread and synapse type to be made on the way. With growing network size, a compute node receives spikes from an increasing number of different source neurons until in the limit each synapse on the compute node has a unique source. Here, we show analytically how this sparsity emerges over the practically relevant range of network sizes from a hundred thousand to a billion neurons. By profiling a production code we investigate opportunities for algorithmic changes to avoid indirections and branching. Every thread hosts an equal share of the neurons on a compute node. In the original algorithm, all threads search through all spikes to pick out the relevant ones. With increasing network size, the fraction of hits remains invariant but the absolute number of rejections grows. Our new alternative algorithm equally divides the spikes among the threads and immediately sorts them in parallel according to target thread and synapse type. After this, every thread completes delivery solely of the section of spikes for its own neurons. Independent of the number of threads, all spikes are looked at only two times. The new algorithm halves the number of instructions in spike delivery which leads to a reduction of simulation time of up to 40 %. Thus, spike delivery is a fully parallelizable process with a single synchronization point and thereby well suited for many-core systems. Our analysis indicates that further progress requires a reduction of the latency that the instructions experience in accessing memory. The study provides the foundation for the exploration of methods of latency hiding like software pipelining and software-induced prefetching.
Collapse
Affiliation(s)
- Jari Pronold
- Institute of Neuroscience and Medicine (INM-6) and Institute for Advanced Simulation (IAS-6) and JARA-Institute Brain Structure-Function Relationships (INM-10), Jülich Research Centre, Jülich, Germany
- RWTH Aachen University, Aachen, Germany
| | - Jakob Jordan
- Department of Physiology, University of Bern, Bern, Switzerland
| | - Brian J. N. Wylie
- Jülich Supercomputing Centre, Jülich Research Centre, Jülich, Germany
| | | | - Markus Diesmann
- Institute of Neuroscience and Medicine (INM-6) and Institute for Advanced Simulation (IAS-6) and JARA-Institute Brain Structure-Function Relationships (INM-10), Jülich Research Centre, Jülich, Germany
- Department of Physics, Faculty 1, RWTH Aachen University, Aachen, Germany
- Department of Psychiatry, Psychotherapy and Psychosomatics, Medical Faculty, RWTH Aachen University, Aachen, Germany
| | - Susanne Kunkel
- Faculty of Science and Technology, Norwegian University of Life Sciences, Ås, Norway
| |
Collapse
|
24
|
Oh S, Lee JH, Seo S, Choo H, Lee D, Cho JI, Park JH. Electrolyte-Gated Vertical Synapse Array based on Van Der Waals Heterostructure for Parallel Computing. Adv Sci (Weinh) 2022; 9:e2103808. [PMID: 34957687 PMCID: PMC8867203 DOI: 10.1002/advs.202103808] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 11/11/2021] [Indexed: 06/01/2023]
Abstract
Recently, three-terminal synaptic devices, which separate read and write terminals, have attracted significant attention because they enable nondestructive read-out and parallel-access for updating synaptic weights. However, owing to their structural features, it is difficult to address the relatively high device density compared with two-terminal synaptic devices. In this study, a vertical synaptic device featuring remotely controllable weight updates via e-field-dependent movement of mobile ions in the ion-gel layer is developed. This synaptic device successfully demonstrates all essential synaptic characteristics, such as excitatory/inhibitory postsynaptic current (E/IPSC), paired-pulse facilitation (PPF), and long-term potentiation/depression (LTP/D) by electrical measurements, and exhibits competitive LTP/D characteristics with a dynamic range (Gmax /Gmin ) of 31.3, and asymmetry (AS) of 8.56. The stability of the LTP/D characteristics is also verified through repeated measurements over 50 cycles; the relative standard deviations (RSDs) of Gmax /Gmin and AS are calculated as 1.65% and 0.25%, respectively. These excellent synaptic properties enable a recognition rate of ≈99% in the training and inference tasks for acoustic and emotional information patterns. This study is expected to be an important foundation for the realization of future parallel computing networks for energy-efficient and high-speed data processing.
Collapse
Affiliation(s)
- Seyong Oh
- Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, 16419, Korea
| | - Ju-Hee Lee
- Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, 16419, Korea
| | - Seunghwan Seo
- Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, 16419, Korea
| | - Hyongsuk Choo
- Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, 16419, Korea
| | - Dongyoung Lee
- Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, 16419, Korea
| | - Jeong-Ick Cho
- Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, 16419, Korea
| | - Jin-Hong Park
- Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, 16419, Korea
- Sungkyunkwan Advanced Institute of Nanotechnology (SAINT), Sungkyunkwan University, Suwon, 16417, Korea
| |
Collapse
|
25
|
Lai X, Taskén HA, Mo T, Funke SW, Frigessi A, Rognes ME, Köhn-Luque A. A scalable solver for a stochastic, hybrid cellular automaton model of personalized breast cancer therapy. Int J Numer Method Biomed Eng 2022; 38:e3542. [PMID: 34716985 DOI: 10.1002/cnm.3542] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Accepted: 10/24/2021] [Indexed: 06/13/2023]
Abstract
Mathematical modeling and simulation is a promising approach to personalized cancer medicine. Yet, the complexity, heterogeneity and multi-scale nature of cancer pose significant computational challenges. Coupling discrete cell-based models with continuous models using hybrid cellular automata (CA) is a powerful approach for mimicking biological complexity and describing the dynamical exchange of information across different scales. However, when clinically relevant cancer portions are taken into account, such models become computationally very expensive. While efficient parallelization techniques for continuous models exist, their coupling with discrete models, particularly CA, necessitates more elaborate solutions. Building upon FEniCS, a popular and powerful scientific computing platform for solving partial differential equations, we developed parallel algorithms to link stochastic CA with differential equations (https://bitbucket.org/HTasken/cansim). The algorithms minimize the communication between processes that share CA neighborhood values while also allowing for reproducibility during stochastic updates. We demonstrated the potential of our solution on a complex hybrid cellular automaton model of breast cancer treated with combination chemotherapy. On a single-core processor, we obtained nearly linear scaling with an increasing problem size, whereas weak parallel scaling showed moderate growth in solving time relative to increase in problem size. Finally, we applied the algorithm to a problem that is 500 times larger than previous work, allowing us to run personalized therapy simulations based on heterogeneous cell density and tumor perfusion conditions estimated from magnetic resonance imaging data on an unprecedented scale.
Collapse
Affiliation(s)
- Xiaoran Lai
- Oslo Centre for Biostatistics and Epidemiology, Faculty of Medicine, University of Oslo, Oslo, Norway
| | - Håkon A Taskén
- Oslo Centre for Biostatistics and Epidemiology, Faculty of Medicine, University of Oslo, Oslo, Norway
| | - Torgeir Mo
- Institute for Cancer Research, Oslo University Hospital, Oslo, Norway
| | | | - Arnoldo Frigessi
- Oslo Centre for Biostatistics and Epidemiology, Faculty of Medicine, University of Oslo, Oslo, Norway
- Oslo Centre for Biostatistics and Epidemiology, Oslo University Hospital, Oslo, Norway
| | | | - Alvaro Köhn-Luque
- Oslo Centre for Biostatistics and Epidemiology, Faculty of Medicine, University of Oslo, Oslo, Norway
| |
Collapse
|
26
|
Yan W, Ansari S, Lamson A, Glaser MA, Blackwell R, Betterton MD, Shelley M. Toward the cellular-scale simulation of motor-driven cytoskeletal assemblies. eLife 2022; 11:74160. [PMID: 35617115 PMCID: PMC9135453 DOI: 10.7554/elife.74160] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 04/24/2022] [Indexed: 11/17/2022] Open
Abstract
The cytoskeleton - a collection of polymeric filaments, molecular motors, and crosslinkers - is a foundational example of active matter, and in the cell assembles into organelles that guide basic biological functions. Simulation of cytoskeletal assemblies is an important tool for modeling cellular processes and understanding their surprising material properties. Here, we present aLENS (a Living Ensemble Simulator), a novel computational framework designed to surmount the limits of conventional simulation methods. We model molecular motors with crosslinking kinetics that adhere to a thermodynamic energy landscape, and integrate the system dynamics while efficiently and stably enforcing hard-body repulsion between filaments. Molecular potentials are entirely avoided in imposing steric constraints. Utilizing parallel computing, we simulate tens to hundreds of thousands of cytoskeletal filaments and crosslinking motors, recapitulating emergent phenomena such as bundle formation and buckling. This simulation framework can help elucidate how motor type, thermal fluctuations, internal stresses, and confinement determine the evolution of cytoskeletal active matter.
Collapse
Affiliation(s)
- Wen Yan
- Center for Computational Biology, Flatiron InstituteNew YorkUnited States
| | - Saad Ansari
- Department of Physics, University of Colorado BoulderBoulderUnited States
| | - Adam Lamson
- Center for Computational Biology, Flatiron InstituteNew YorkUnited States,Department of Physics, University of Colorado BoulderBoulderUnited States
| | - Matthew A Glaser
- Department of Physics, University of Colorado BoulderBoulderUnited States
| | - Robert Blackwell
- Center for Computational Biology, Flatiron InstituteNew YorkUnited States
| | - Meredith D Betterton
- Center for Computational Biology, Flatiron InstituteNew YorkUnited States,Department of Physics, University of Colorado BoulderBoulderUnited States,Department of Molecular, Cellular, and Developmental Biology, University of Colorado BoulderBoulderUnited States
| | - Michael Shelley
- Center for Computational Biology, Flatiron InstituteNew YorkUnited States,Courant Institute, New York UniversityNew YorkUnited States
| |
Collapse
|
27
|
Zhang S, Qiang Y. Fast parallel implementation for total variation constrained algebraic reconstruction technique. J Xray Sci Technol 2022; 30:737-750. [PMID: 35527622 DOI: 10.3233/xst-221163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
In computed tomography (CT), the total variation (TV) constrained algebraic reconstruction technique (ART) can obtain better reconstruction quality when the projection data are sparse and noisy. However, the ART-TV algorithm remains time-consuming since it requires large numbers of iterations, especially for the reconstruction of high-resolution images. In this work, we propose a fast algorithm to calculate the system matrix for line intersection model and apply this algorithm to perform the forward-projection and back-projection operations of the ART. Then, we utilize the parallel computing techniques of multithreading and graphics processing units (GPU) to accelerate the ART iteration and the TV minimization, respectively. Numerical experiments show that our proposed parallel implementation approach is very efficient and accurate. For the reconstruction of a 2048 × 2048 image from 180 projection views of 2048 detector bins, it takes about 2.2 seconds to perform one iteration of the ART-TV algorithm using our proposed approach on a ten-core platform. Experimental results demonstrate that our new approach achieves a speedup of 23 times over the conventional single-threaded CPU implementation that using the Siddon algorithm.
Collapse
Affiliation(s)
- Shunli Zhang
- School of Information Science and Technology, Northwest University, Xi'an, China
| | - Yu Qiang
- School of Information Science and Technology, Northwest University, Xi'an, China
| |
Collapse
|
28
|
Guerrero-Araya E, Muñoz M, Rodríguez C, Paredes-Sabja D. FastMLST: A Multi-core Tool for Multilocus Sequence Typing of Draft Genome Assemblies. Bioinform Biol Insights 2021; 15:11779322211059238. [PMID: 34866905 PMCID: PMC8637782 DOI: 10.1177/11779322211059238] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 10/19/2021] [Indexed: 11/21/2022] Open
Abstract
Multilocus Sequence Typing (MLST) is a precise microbial typing approach at the
intra-species level for epidemiologic and evolutionary purposes. It operates by
assigning a sequence type (ST) identifier to each specimen, based on a
combination of alleles of multiple housekeeping genes included in a defined
scheme. The use of MLST has multiplied due to the availability of large numbers
of genomic sequences and epidemiologic data in public repositories. However,
data processing speed has become problematic due to the massive size of modern
datasets. Here, we present FastMLST, a tool that is designed to perform PubMLST
searches using BLASTn and a divide-and-conquer approach that processes each
genome assembly in parallel. The output offered by FastMLST includes a table
with the ST, allelic profile, and clonal complex or clade (when available),
detected for a query, as well as a multi-FASTA file or a series of FASTA files
with the concatenated or single allele sequences detected, respectively.
FastMLST was validated with 91 different species, with a wide range of
guanine-cytosine content (%GC), genome sizes, and fragmentation levels, and a
speed test was performed on 3 datasets with varying genome sizes. Compared with
other tools such as mlst, CGE/MLST, MLSTar, and PubMLST, FastMLST takes
advantage of multiple processors to simultaneously type up to 28 000 genomes in
less than 10 minutes, reducing processing times by at least 3-fold with 100%
concordance to PubMLST, if contaminated genomes are excluded from the analysis.
The source code, installation instructions, and documentation of FastMLST are
available at https://github.com/EnzoAndree/FastMLST
Collapse
Affiliation(s)
- Enzo Guerrero-Araya
- Microbiota-Host Interactions and Clostridia Research Group, Facultad de Ciencias de la Vida, Universidad Andrés Bello, Santiago, Chile.,ANID, Millennium Science Initiative Program, Millennium Nucleus in the Biology of the Intestinal Microbiota, Santiago, Chile
| | - Marina Muñoz
- ANID, Millennium Science Initiative Program, Millennium Nucleus in the Biology of the Intestinal Microbiota, Santiago, Chile.,Centro de Investigaciones en Microbiología y Biotecnología-UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - César Rodríguez
- Facultad de Microbiología and Centro de Investigación en Enfermedades Tropicales (CIET), Universidad de Costa Rica, San José, Costa Rica
| | - Daniel Paredes-Sabja
- ANID, Millennium Science Initiative Program, Millennium Nucleus in the Biology of the Intestinal Microbiota, Santiago, Chile.,Department of Biology, Texas A&M University, College Station, TX, USA
| |
Collapse
|
29
|
Iserte S, Carratalà P, Arnau R, Martínez-Cuenca R, Barreda P, Basiero L, Climent J, Chiva S. Modeling of wastewater treatment processes with hydrosludge. Water Environ Res 2021; 93:3049-3063. [PMID: 34755418 DOI: 10.1002/wer.1656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Revised: 10/18/2021] [Accepted: 10/22/2021] [Indexed: 06/13/2023]
Abstract
The pressure for Water Resource Recovery Facilities (WRRF) operators to efficiently treat wastewater is greater than ever because of the water crisis, produced by the climate change effects and more restrictive regulations. Technicians and researchers need to evaluate WRRF performance to ensure maximum efficiency. For this purpose, numerical techniques, such as CFD, have been widely applied to the wastewater sector to model biological reactors and secondary settling tanks with high spatial and temporal accuracy. However, limitations such as complexity and learning curve prevent extending CFD usage among wastewater modeling experts. This paper presents HydroSludge, a framework that provides a series of tools that simplify the implementation of the processes and workflows in a WRRF. This work leverages HydroSludge to preprocess existing data, aid the meshing process, and perform CFD simulations. Its intuitive interface proves itself as an effective tool to increase the efficiency of wastewater treatment. PRACTITIONER POINTS: This paper introduces a software platform specifically oriented to WRRF, named HydroSludge, which provides easy access to the most widespread and leading CFD simulation software, OpenFOAM. Hydrosludge is intended to be used by WRRF operators, bringing a more wizard-like, automatic, and intuitive usage. Meshing assistance, submersible mixers, biological models, and distributed parallel computing are the most remarkable features included in HydroSludge. With the provided study cases, HydroSludge has proven to be a crucial tool for operators, managers, and researchers in WRRF.
Collapse
Affiliation(s)
- Sergio Iserte
- Department of Mechanical and Engineering Construction, Universitat Jaume I (UJI), Castelló, Spain
| | - Pablo Carratalà
- Department of Mechanical and Engineering Construction, Universitat Jaume I (UJI), Castelló, Spain
| | - Rosario Arnau
- Department of Mechanical and Engineering Construction, Universitat Jaume I (UJI), Castelló, Spain
| | - Raúl Martínez-Cuenca
- Department of Mechanical and Engineering Construction, Universitat Jaume I (UJI), Castelló, Spain
| | - Paloma Barreda
- Department of Mechanical and Engineering Construction, Universitat Jaume I (UJI), Castelló, Spain
| | - Luís Basiero
- Sociedad Fomento Agrícola Castellonense (FACSA), Castelló, Spain
| | - Javier Climent
- Department of Mechanical and Engineering Construction, Universitat Jaume I (UJI), Castelló, Spain
- Sociedad Fomento Agrícola Castellonense (FACSA), Castelló, Spain
| | - Sergio Chiva
- Department of Mechanical and Engineering Construction, Universitat Jaume I (UJI), Castelló, Spain
| |
Collapse
|
30
|
Cury LFM, Maso Talou GD, Younes-Ibrahim M, Blanco PJ. Parallel generation of extensive vascular networks with application to an archetypal human kidney model. R Soc Open Sci 2021; 8:210973. [PMID: 34966553 PMCID: PMC8633801 DOI: 10.1098/rsos.210973] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Accepted: 10/28/2021] [Indexed: 05/25/2023]
Abstract
Given the relevance of the inextricable coupling between microcirculation and physiology, and the relation to organ function and disease progression, the construction of synthetic vascular networks for mathematical modelling and computer simulation is becoming an increasingly broad field of research. Building vascular networks that mimic in vivo morphometry is feasible through algorithms such as constrained constructive optimization (CCO) and variations. Nevertheless, these methods are limited by the maximum number of vessels to be generated due to the whole network update required at each vessel addition. In this work, we propose a CCO-based approach endowed with a domain decomposition strategy to concurrently create vascular networks. The performance of this approach is evaluated by analysing the agreement with the sequentially generated networks and studying the scalability when building vascular networks up to 200 000 vascular segments. Finally, we apply our method to vascularize a highly complex geometry corresponding to the cortex of a prototypical human kidney. The technique presented in this work enables the automatic generation of extensive vascular networks, removing the limitation from previous works. Thus, we can extend vascular networks (e.g. obtained from medical images) to pre-arteriolar level, yielding patient-specific whole-organ vascular models with an unprecedented level of detail.
Collapse
Affiliation(s)
- L. F. M. Cury
- National Laboratory for Scientific Computing, LNCC/MCTI, Petrópolis, Brazil
- National Institute of Science and Technology in Medicine Assisted by Scientific Computing, INCT-MACC, Petrópolis, Brazil
| | - G. D. Maso Talou
- Auckland Bioengineering Institute, The University of Auckland, Auckland, New Zealand
| | - M. Younes-Ibrahim
- Faculty of Medical Sciences, Rio de Janeiro State University, UERJ, Rio de Janeiro, Brazil
- National Institute of Science and Technology in Medicine Assisted by Scientific Computing, INCT-MACC, Petrópolis, Brazil
| | - P. J. Blanco
- National Laboratory for Scientific Computing, LNCC/MCTI, Petrópolis, Brazil
- National Institute of Science and Technology in Medicine Assisted by Scientific Computing, INCT-MACC, Petrópolis, Brazil
| |
Collapse
|
31
|
Yang X, Wang W, Ma JL, Qiu YL, Lu K, Cao DS, Wu CK. BioNet: a large-scale and heterogeneous biological network model for interaction prediction with graph convolution. Brief Bioinform 2021; 23:6440126. [PMID: 34849567 PMCID: PMC8690188 DOI: 10.1093/bib/bbab491] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Revised: 10/24/2021] [Accepted: 10/25/2021] [Indexed: 01/09/2023] Open
Abstract
Motivation Understanding chemical–gene interactions (CGIs) is crucial for screening drugs. Wet experiments are usually costly and laborious, which limits relevant studies to a small scale. On the contrary, computational studies enable efficient in-silico exploration. For the CGI prediction problem, a common method is to perform systematic analyses on a heterogeneous network involving various biomedical entities. Recently, graph neural networks become popular in the field of relation prediction. However, the inherent heterogeneous complexity of biological interaction networks and the massive amount of data pose enormous challenges. This paper aims to develop a data-driven model that is capable of learning latent information from the interaction network and making correct predictions. Results We developed BioNet, a deep biological networkmodel with a graph encoder–decoder architecture. The graph encoder utilizes graph convolution to learn latent information embedded in complex interactions among chemicals, genes, diseases and biological pathways. The learning process is featured by two consecutive steps. Then, embedded information learnt by the encoder is then employed to make multi-type interaction predictions between chemicals and genes with a tensor decomposition decoder based on the RESCAL algorithm. BioNet includes 79 325 entities as nodes, and 34 005 501 relations as edges. To train such a massive deep graph model, BioNet introduces a parallel training algorithm utilizing multiple Graphics Processing Unit (GPUs). The evaluation experiments indicated that BioNet exhibits outstanding prediction performance with a best area under Receiver Operating Characteristic (ROC) curve of 0.952, which significantly surpasses state-of-theart methods. For further validation, top predicted CGIs of cancer and COVID-19 by BioNet were verified by external curated data and published literature.
Collapse
Affiliation(s)
- Xi Yang
- College of Computer, National University of Defense Technology, China
| | - Wei Wang
- National Supercomputer Center in Tianjin, China
| | - Jing-Lun Ma
- College of Computer, National University of Defense Technology, China
| | - Yan-Long Qiu
- College of Computer, National University of Defense Technology, China
| | - Kai Lu
- College of Computer, National University of Defense Technology, China
| | - Dong-Sheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, China
| | - Cheng-Kun Wu
- Institute for Quantum Information & State Key Laboratory of High Performance Computing, College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China
| |
Collapse
|
32
|
Shuai M, He D, Chen X. Optimizing weighted gene co-expression network analysis with a multi-threaded calculation of the topological overlap matrix. Stat Appl Genet Mol Biol 2021; 20:145-153. [PMID: 34757703 DOI: 10.1515/sagmb-2021-0025] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 10/08/2021] [Indexed: 02/06/2023]
Abstract
Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a network, next calculates the topological overlap matrix (TOM), and then identifies the modules (sub-networks), where each module is assumed to be associated with a certain biological function. The most time-consuming step of WGCNA is to calculate TOM from the Adjacency Matrix in a single thread. In this paper, the single-threaded algorithm of the TOM has been changed into a multi-threaded algorithm (the parameters are the default values of WGCNA). In the multi-threaded algorithm, Rcpp was used to make R call a C++ function, and then C++ used OpenMP to start multiple threads to calculate TOM from the Adjacency Matrix. On shared-memory MultiProcessor systems, the calculation time decreases as the number of CPU cores increases. The algorithm of this paper can promote the application of WGCNA on large data sets, and help other research fields to identify sub-networks in undirected scale-free hierarchical weighted networks. The source codes and usage are available at https://github.com/do-somethings-haha/multi-threaded_calculate_unsigned_TOM_from_unsigned_or_signed_Adjacency_Matrix_of_WGCNA.
Collapse
Affiliation(s)
- Min Shuai
- State Key Laboratory of Characteristic Chinese Medicine Resources in Southwest China, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, People's Republic of China.,Pharmacy College, Chengdu University of Traditional Chinese Medicine, Ministry of Education Key Laboratory of Standardization of Chinese Herbal Medicine, Key Laboratory of Systematic Research, Development and Utilization of Chinese Medicine Resources in Sichuan Province - Key Laboratory Breeding Base of Co-founded by Sichuan Province and MOST, Chengdu 611137, China
| | - Dongmei He
- State Key Laboratory of Characteristic Chinese Medicine Resources in Southwest China, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, People's Republic of China.,Pharmacy College, Chengdu University of Traditional Chinese Medicine, Ministry of Education Key Laboratory of Standardization of Chinese Herbal Medicine, Key Laboratory of Systematic Research, Development and Utilization of Chinese Medicine Resources in Sichuan Province - Key Laboratory Breeding Base of Co-founded by Sichuan Province and MOST, Chengdu 611137, China.,Center for Post-doctoral research, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, State Key Laboratory Breeding Base of Dao-di Herbs, Beijing 100700, China
| | - Xin Chen
- State Key Laboratory of Characteristic Chinese Medicine Resources in Southwest China, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, People's Republic of China.,Pharmacy College, Chengdu University of Traditional Chinese Medicine, Ministry of Education Key Laboratory of Standardization of Chinese Herbal Medicine, Key Laboratory of Systematic Research, Development and Utilization of Chinese Medicine Resources in Sichuan Province - Key Laboratory Breeding Base of Co-founded by Sichuan Province and MOST, Chengdu 611137, China
| |
Collapse
|
33
|
Ko S, Li GX, Choi H, Won JH. Computationally scalable regression modeling for ultrahigh-dimensional omics data with ParProx. Brief Bioinform 2021; 22:bbab256. [PMID: 34254998 PMCID: PMC8575036 DOI: 10.1093/bib/bbab256] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Revised: 06/15/2021] [Accepted: 06/17/2021] [Indexed: 12/20/2022] Open
Abstract
Statistical analysis of ultrahigh-dimensional omics scale data has long depended on univariate hypothesis testing. With growing data features and samples, the obvious next step is to establish multivariable association analysis as a routine method to describe genotype-phenotype association. Here we present ParProx, a state-of-the-art implementation to optimize overlapping and non-overlapping group lasso regression models for time-to-event and classification analysis, with selection of variables grouped by biological priors. ParProx enables multivariable model fitting for ultrahigh-dimensional data within an architecture for parallel or distributed computing via latent variable group representation. It thereby aims to produce interpretable regression models consistent with known biological relationships among independent variables, a property often explored post hoc, not during model estimation. Simulation studies clearly demonstrate the scalability of ParProx with graphics processing units in comparison to existing implementations. We illustrate the tool using three different omics data sets featuring moderate to large numbers of variables, where we use genomic regions and biological pathways as variable groups, rendering the selected independent variables directly interpretable with respect to those groups. ParProx is applicable to a wide range of studies using ultrahigh-dimensional omics data, from genome-wide association analysis to multi-omics studies where model estimation is computationally intractable with existing implementation.
Collapse
Affiliation(s)
- Seyoon Ko
- Department of Statistics, Seoul National University, Republic of Korea
| | - Ginny X Li
- Department of Medicine, National University of Singapore, Singapore
| | - Hyungwon Choi
- Department of Medicine, National University of Singapore, Singapore
| | - Joong-Ho Won
- Department of Statistics, Seoul National University, Republic of Korea
| |
Collapse
|
34
|
Lu H, Wei Z, Wang C, Guo J, Zhou Y, Wang Z, Liu H. Redesigning Vina@QNLM for Ultra-Large-Scale Molecular Docking and Screening on a Sunway Supercomputer. Front Chem 2021; 9:750325. [PMID: 34778205 PMCID: PMC8581564 DOI: 10.3389/fchem.2021.750325] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 09/14/2021] [Indexed: 11/28/2022] Open
Abstract
Ultra-large-scale molecular docking can improve the accuracy of lead compounds in drug discovery. In this study, we developed a molecular docking piece of software, Vina@QNLM, which can use more than 4,80,000 parallel processes to search for potential lead compounds from hundreds of millions of compounds. We proposed a task scheduling mechanism for large-scale parallelism based on Vinardo and Sunway supercomputer architecture. Then, we readopted the core docking algorithm to incorporate the full advantage of the heterogeneous multicore processor architecture in intensive computing. We successfully expanded it to 10, 465, 065 cores (1,61,001 management process elements and 0, 465, 065 computing process elements), with a strong scalability of 55.92%. To the best of our knowledge, this is the first time that 10 million cores are used for molecular docking on Sunway. The introduction of the heterogeneous multicore processor architecture achieved the best speedup, which is 11x more than that of the management process element of Sunway. The performance of Vina@QNLM was comprehensively evaluated using the CASF-2013 and CASF-2016 protein-ligand benchmarks, and the screening power was the highest out of the 27 pieces of software tested in the CASF-2013 benchmark. In some existing applications, we used Vina@QNLM to dock more than 10 million molecules to nine rigid proteins related to SARS-CoV-2 within 8.5 h on 10 million cores. We also developed a platform for the general public to use the software.
Collapse
Affiliation(s)
- Hao Lu
- College of Computer Science and Technology, Ocean University of China, Qingdao, China
- Pilot National Laboratory for Marine Science and Technology, Qingdao, China
| | - Zhiqiang Wei
- College of Computer Science and Technology, Ocean University of China, Qingdao, China
- Pilot National Laboratory for Marine Science and Technology, Qingdao, China
| | - Cunji Wang
- College of Computer Science and Technology, Ocean University of China, Qingdao, China
| | - Jingjing Guo
- College of Computer Science and Technology, Ocean University of China, Qingdao, China
| | - Yuandong Zhou
- College of Computer Science and Technology, Ocean University of China, Qingdao, China
| | - Zhuoya Wang
- Pilot National Laboratory for Marine Science and Technology, Qingdao, China
| | - Hao Liu
- College of Computer Science and Technology, Ocean University of China, Qingdao, China
- Pilot National Laboratory for Marine Science and Technology, Qingdao, China
| |
Collapse
|
35
|
Thomine O, Alizon S, Boennec C, Barthelemy M, Sofonea M. Emerging dynamics from high-resolution spatial numerical epidemics. eLife 2021; 10:71417. [PMID: 34652271 PMCID: PMC8568339 DOI: 10.7554/elife.71417] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 10/07/2021] [Indexed: 11/23/2022] Open
Abstract
Simulating nationwide realistic individual movements with a detailed geographical structure can help optimise public health policies. However, existing tools have limited resolution or can only account for a limited number of agents. We introduce Epidemap, a new framework that can capture the daily movement of more than 60 million people in a country at a building-level resolution in a realistic and computationally efficient way. By applying it to the case of an infectious disease spreading in France, we uncover hitherto neglected effects, such as the emergence of two distinct peaks in the daily number of cases or the importance of local density in the timing of arrival of the epidemic. Finally, we show that the importance of super-spreading events strongly varies over time.
Collapse
Affiliation(s)
- Olivier Thomine
- LIS UMR 7020 CNRS, Aix Marseille University, Marseille, France
| | - Samuel Alizon
- MIVEGEC, Université de Montpellier, CNRS, IRD, Montpellier, France
| | - Corentin Boennec
- MIVEGEC, Université de Montpellier, CNRS, IRD, Montpellier, France
| | | | - Mircea Sofonea
- MIVEGEC, Université de Montpellier, CNRS, IRD, Montpellier, France
| |
Collapse
|
36
|
Roberge V, Tarbouchi M. Parallel Algorithm on GPU for Wireless Sensor Data Acquisition Using a Team of Unmanned Aerial Vehicles. Sensors (Basel) 2021; 21:6851. [PMID: 34696064 DOI: 10.3390/s21206851] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 09/30/2021] [Accepted: 10/12/2021] [Indexed: 11/16/2022]
Abstract
This paper proposes a framework for the wireless sensor data acquisition using a team of Unmanned Aerial Vehicles (UAVs). Scattered over a terrain, the sensors detect information about their surroundings and can transmit this information wirelessly over a short range. With no access to a terrestrial or satellite communication network to relay the information to, UAVs are used to visit the sensors and collect the data. The proposed framework uses an iterative k-means algorithm to group the sensors into clusters and to identify Download Points (DPs) where the UAVs hover to download the data. A Single-Source–Shortest-Path algorithm (SSSP) is used to compute optimal paths between every pair of DPs with a constraint to reduce the number of turns. A genetic algorithm supplemented with a 2-opt local search heuristic is used to solve the multi-travelling salesperson problem and to find optimized tours for each UAVs. Finally, a collision avoidance strategy is implemented to guarantee collision-free trajectories. Concerned with the overall runtime of the framework, the SSSP algorithm is implemented in parallel on a graphics processing unit. The proposed framework is tested in simulation using three UAVs and realistic 3D maps with up to 100 sensors and runs in just 20.7 s, a 33.3× speed-up compared to a sequential execution on CPU. The results show that the proposed method is efficient at calculating optimized trajectories for the UAVs for data acquisition from wireless sensors. The results also show the significant advantage of the parallel implementation on GPU.
Collapse
|
37
|
Goicovich I, Olivares P, Román C, Vázquez A, Poupon C, Mangin JF, Guevara P, Hernández C. Fiber Clustering Acceleration With a Modified Kmeans++ Algorithm Using Data Parallelism. Front Neuroinform 2021; 15:727859. [PMID: 34539370 PMCID: PMC8445177 DOI: 10.3389/fninf.2021.727859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Accepted: 08/10/2021] [Indexed: 11/13/2022] Open
Abstract
Fiber clustering methods are typically used in brain research to study the organization of white matter bundles from large diffusion MRI tractography datasets. These methods enable exploratory bundle inspection using visualization and other methods that require identifying brain white matter structures in individuals or a population. Some applications, such as real-time visualization and inter-subject clustering, need fast and high-quality intra-subject clustering algorithms. This work proposes a parallel algorithm using a General Purpose Graphics Processing Unit (GPGPU) for fiber clustering based on the FFClust algorithm. The proposed GPGPU implementation exploits data parallelism using both multicore and GPU fine-grained parallelism present in commodity architectures, including current laptops and desktop computers. Our approach implements all FFClust steps in parallel, improving execution times in all of them. In addition, our parallel approach includes a parallel Kmeans++ algorithm implementation and defines a new variant of Kmeans++ to reduce the impact of choosing outliers as initial centroids. The results show that our approach provides clustering quality results very similar to FFClust, and it requires an execution time of 3.5 s for processing about a million fibers, achieving a speedup of 11.5 times compared to FFClust.
Collapse
Affiliation(s)
- Isaac Goicovich
- Department of Electrical Engineering, Universidad de Concepción, Concepción, Chile
| | - Paulo Olivares
- Department of Computer Science, Universidad de Concepción, Concepción, Chile
| | - Claudio Román
- Department of Electrical Engineering, Universidad de Concepción, Concepción, Chile
| | - Andrea Vázquez
- Department of Computer Science, Universidad de Concepción, Concepción, Chile
| | - Cyril Poupon
- Université Paris-Saclay, CEA, CNRS, Neurospin, Baobab, Gif-sur-Yvette, France
| | | | - Pamela Guevara
- Department of Electrical Engineering, Universidad de Concepción, Concepción, Chile
| | - Cecilia Hernández
- Department of Computer Science, Universidad de Concepción, Concepción, Chile.,Center for Biotechnology and Bioengineering, Santiago, Chile
| |
Collapse
|
38
|
Lebedev I, Lovskaya D, Mochalova M, Mitrofanov I, Menshutina N. Cellular Automata Modeling of Three-Dimensional Chitosan-Based Aerogels Fiberous Structures with Bezier Curves. Polymers (Basel) 2021; 13:polym13152511. [PMID: 34372113 PMCID: PMC8348900 DOI: 10.3390/polym13152511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Revised: 07/27/2021] [Accepted: 07/28/2021] [Indexed: 11/16/2022] Open
Abstract
In this work, a cellular automata approach was investigated for modeling three-dimensional fibrous nanoporous aerogel structures. A model for the generation of fibrous structures using the Bezier curves is proposed. Experimental chitosan-based aerogel particles were obtained for which analytical studies of the structural characteristics were carried out. The data obtained were used to generate digital copies of chitosan-based aerogel structures and to assess the accuracy of the developed model. The obtained digital copies of chitosan-based aerogel structures will be used to create digital copies of aerogel structures with embedded active pharmaceutical ingredients (APIs) and further predict the release of APIs from these structures.
Collapse
|
39
|
Abstract
Pedigree relationships between every pair of individuals forms the elements of the additive genetic relationship matrix (A). Calculation of A−1 does not require forming and inverting A, and it is faster and easier than the calculation of A. Although A−1 is used in best linear unbiased prediction of genetic merit, A is used in population studies and post-evaluation procedures, such as breeding programs and controlling the rate of inbreeding. Three pedigrees with 20,000 animals (20K) and different (1, 2, 4) litter sizes, and a pedigree with 180,000 animals (180K) and litter size 2 were simulated. Aiming to reduce the computation time for calculating A, new methods [Array-Tabular method, (T−1)−1 instead of T in Thompson's method, iterative updating of D in Thompson's method, and iteration by generation] were developed and compared with some existing methods. The methods were coded in the R programming language to demonstrate the algorithms, aiming for minimizing the computational time. Among 20K, computational time decreased with increasing litter size for most of the methods. Methods deriving A from A−1 were relatively slow. The other methods were either using only pedigree information or both the pedigree and inbreeding coefficients. Calculating inbreeding coefficients was extremely fast (<0.2 s for 180K). Parallel computing (15 cores) was adopted for methods that were based on solving A−1 for columns of A, as those methods allowed implicit parallelism. Optimizing the code for one of the earliest methods enabled A to be built in 13 s (faster than the 31 s for calculating A−1) for 20K and 17 min 3 s for 180K. Memory is a bottleneck for large pedigrees but attempts to reduce the memory usage increased the computational time. To reduce disk space usage, memory usage, and computational time, relationship coefficients of old animals in the pedigree can be archived and relationship coefficients for parents of the next generation can be saved in an external file for successive updates to the pedigree and the A matrix.
Collapse
Affiliation(s)
| | - Dorian Garrick
- AL Rae Centre for Genetics and Breeding, School of Agriculture, Massey University, Palmerston North, New Zealand
| | - Bevin Harris
- Livestock Improvement Corporation, Hamilton, New Zealand
| |
Collapse
|
40
|
Dumont AP, Fang Q, Patil CA. A computationally efficient Monte-Carlo model for biomedical Raman spectroscopy. J Biophotonics 2021; 14:e202000377. [PMID: 33733621 PMCID: PMC10069992 DOI: 10.1002/jbio.202000377] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 02/22/2021] [Accepted: 02/23/2021] [Indexed: 05/29/2023]
Abstract
Monte Carlo (MC) modeling is a valuable tool to gain fundamental understanding of light-tissue interactions, provide guidance and assessment to optical instrument designs, and help analyze experimental data. It has been a major challenge to efficiently extend MC towards modeling of bulk-tissue Raman spectroscopy (RS) due to the wide spectral range, relatively sharp spectral features, and presence of background autofluorescence. Here, we report a computationally efficient MC approach for RS by adapting the massively-parallel Monte Carlo eXtreme (MCX) simulator. Simulation efficiency is achieved through "isoweight," a novel approach that combines the statistical generation of Raman scattered and Fluorescence emission with a lookup-table-based technique well-suited for parallelization. The MC model uses a graphics processor to produce dense Raman and fluorescence spectra over a range of 800 - 2000 cm-1 with an approximately 100× increase in speed over prior RS Monte Carlo methods. The simulated RS signals are compared against experimentally collected spectra from gelatin phantoms, showing a strong correlation.
Collapse
Affiliation(s)
- Alexander P. Dumont
- Department of Bioengineering, Temple University, Philadelphia, Pennsylvania, USA
| | - Qianqian Fang
- Department of Bioengineering, Northeastern University, Boston, Massachusetts, USA
| | - Chetan A. Patil
- Department of Bioengineering, Temple University, Philadelphia, Pennsylvania, USA
| |
Collapse
|
41
|
Lin Z, Chen R, Gao B, Qin S, Wu B, Liu J, Cai XC. A highly parallel simulation of patient-specific hepatic flows. Int J Numer Method Biomed Eng 2021; 37:e3451. [PMID: 33609008 DOI: 10.1002/cnm.3451] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Revised: 01/29/2021] [Accepted: 02/13/2021] [Indexed: 06/12/2023]
Abstract
Computational hemodynamics is being developed as an alternative approach for assisting clinical diagnosis and treatment planning for liver diseases. The technology is non-invasive, but the computational time could be high when the full geometry of the blood vessels is taken into account. Existing approaches use either one-dimensional model of the artery or simplified three-dimensional tubular geometry in order to reduce the computational time, but the accuracy is sometime compromised, for example, when simulating blood flows in arteries with plaque. In this work, we study a highly parallel method for the transient incompressible Navier-Stokes equations for the simulation of the blood flows in the full three-dimensional patient-specific hepatic artery, portal vein and hepatic vein. As applications, we also simulate the flow in a patient with hepatectomy and calculate the S (PPG). One of the advantages of simulating blood flows in all hepatic vessels is that it provides a direct estimate of the PPG, which is a gold standard value to assess the portal hypertension. Moreover, the robustness and scalability of the algorithm are also investigated. A 83% parallel efficiency is achieved for solving a problem with 7 million elements on a supercomputer with more than 1000 processor cores.
Collapse
Affiliation(s)
- Zeng Lin
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Shenzhen Key Laboratory for Exascale Engineering and Scientific Computing, Shenzhen, China
| | - Rongliang Chen
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Shenzhen Key Laboratory for Exascale Engineering and Scientific Computing, Shenzhen, China
| | - Beibei Gao
- Department of Cardiology, Affiliated Hangzhou First People's Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Shanlin Qin
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Bokai Wu
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Jia Liu
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Shenzhen Key Laboratory for Exascale Engineering and Scientific Computing, Shenzhen, China
| | - Xiao-Chuan Cai
- Department of Mathematics, University of Macau, Macau, China
| |
Collapse
|
42
|
He W, Yang D, Peng H, Liang S, Lin Y. An Efficient Ensemble Binarized Deep Neural Network on Chip with Perception-Control Integrated. Sensors (Basel) 2021; 21:s21103407. [PMID: 34068351 PMCID: PMC8153352 DOI: 10.3390/s21103407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Revised: 05/03/2021] [Accepted: 05/10/2021] [Indexed: 11/18/2022]
Abstract
Lightweight UAVs equipped with deep learning models have become a trend, which can be deployed for automatic navigation in a wide range of civilian and military missions. However, real-time applications usually need to process a large amount of image data, which leads to a very large computational complexity and storage consumption, and restricts its deployment on resource-constrained embedded edge devices. To reduce the computing requirements and storage occupancy of the neural network model, we proposed the ensemble binarized DroNet (EBDN) model, which implemented the reconstructed DroNet with the binarized and ensemble learning method, so that the model size of DroNet was effectively compressed, and ensemble learning method was used to overcome the defect of the poor performance of the low-precision network. Compared to the original DroNet, EBDN saves more than 7 times of memory footprint with similar model accuracy. Meanwhile, we also proposed a novel and high-efficiency hardware architecture to realize the EBDN on the chip (EBDNoC) system, which perfectly realizes the mapping of an algorithm model to hardware architecture. Compared to other solutions, the proposed architecture achieves about 10.21 GOP/s/kLUTs resource efficiency and 208.1 GOP/s/W energy efficiency, while also providing a good trade-off between model performance and resource utilization.
Collapse
|
43
|
Huang W, Zhou J, Zhang D. On-the-Fly Fusion of Remotely-Sensed Big Data Using an Elastic Computing Paradigm with a Containerized Spark Engine on Kubernetes. Sensors (Basel) 2021; 21:s21092971. [PMID: 33922709 PMCID: PMC8122984 DOI: 10.3390/s21092971] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Revised: 04/16/2021] [Accepted: 04/21/2021] [Indexed: 11/16/2022]
Abstract
Remotely-sensed satellite image fusion is indispensable for the generation of long-term gap-free Earth observation data. While cloud computing (CC) provides the big picture for RS big data (RSBD), the fundamental question of the efficient fusion of RSBD on CC platforms has not yet been settled. To this end, we propose a lightweight cloud-native framework for the elastic processing of RSBD in this study. With the scaling mechanisms provided by both the Infrastructure as a Service (IaaS) and Platform as a Services (PaaS) of CC, the Spark-on-Kubernetes operator model running in the framework can enhance the efficiency of Spark-based algorithms without considering bottlenecks such as task latency caused by an unbalanced workload, and can ease the burden to tune the performance parameters for their parallel algorithms. Internally, we propose a task scheduling mechanism (TSM) to dynamically change the Spark executor pods' affinities to the computing hosts. The TSM learns the workload of a computing host. Learning from the ratio between the number of completed and failed tasks on a computing host, the TSM dispatches Spark executor pods to newer and less-overwhelmed computing hosts. In order to illustrate the advantage, we implement a parallel enhanced spatial and temporal adaptive reflectance fusion model (PESTARFM) to enable the efficient fusion of big RS images with a Spark aggregation function. We construct an OpenStack cloud computing environment to test the usability of the framework. According to the experiments, TSM can improve the performance of the PESTARFM using only PaaS scaling to about 11.7%. When using both the IaaS and PaaS scaling, the maximum performance gain with the TSM can be even greater than 13.6%. The fusion of such big Sentinel and PlanetScope images requires less than 4 min in the experimental environment.
Collapse
|
44
|
Abstract
More than half of the Top 10 supercomputing sites worldwide use GPU accelerators and they are becoming ubiquitous in workstations and edge computing devices. GeNN is a C++ library for generating efficient spiking neural network simulation code for GPUs. However, until now, the full flexibility of GeNN could only be harnessed by writing model descriptions and simulation code in C++. Here we present PyGeNN, a Python package which exposes all of GeNN's functionality to Python with minimal overhead. This provides an alternative, arguably more user-friendly, way of using GeNN and allows modelers to use GeNN within the growing Python-based machine learning and computational neuroscience ecosystems. In addition, we demonstrate that, in both Python and C++ GeNN simulations, the overheads of recording spiking data can strongly affect runtimes and show how a new spike recording system can reduce these overheads by up to 10×. Using the new recording system, we demonstrate that by using PyGeNN on a modern GPU, we can simulate a full-scale model of a cortical column faster even than real-time neuromorphic systems. Finally, we show that long simulations of a smaller model with complex stimuli and a custom three-factor learning rule defined in PyGeNN can be simulated almost two orders of magnitude faster than real-time.
Collapse
Affiliation(s)
- James C. Knight
- Centre for Computational Neuroscience and Robotics, School of Engineering and Informatics, University of Sussex, Brighton, United Kingdom
| | - Anton Komissarov
- Bernstein Center for Computational Neuroscience Berlin, Berlin, Germany
- Department of Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany
| | - Thomas Nowotny
- Centre for Computational Neuroscience and Robotics, School of Engineering and Informatics, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
45
|
Gangopadhyay A, Winberg S, Naidoo KJ. Anisotropic numerical potentials for coarse-grained modeling from high-speed multidimensional lookup table and interpolation algorithms. J Comput Chem 2021; 42:666-675. [PMID: 33547644 DOI: 10.1002/jcc.26487] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 01/15/2021] [Accepted: 01/18/2021] [Indexed: 11/12/2022]
Abstract
A high-speed numerical potential delivering computational performance comparable with complex coarse-grained analytic potentials makes available models that have greater degrees of physical and chemical accuracy. This opens the possibility of increased accuracy in classical molecular dynamics simulations of anisotropic systems. In this work, we report the development of a high-speed lookup table (LUT) of four-dimensional gridded data, that uses cubic B-spline interpolations to derive off grid values and their associated partial derivatives that are located between the known grid data points. The accuracy of the coarse-grained numerical potential using a LUT from uniaxial Gay-Berne (GB) potential produced array of values is within a 3% and a 5% margin of error respectively for the interpolation of the uniaxial GB potential and its partial derivatives. The numerical potential model and partial derivatives speedup is made competitive with the analytical potential by exploiting graphics processing units on board functionality. The capability of the numerical potential is demonstrated by comparing minimizations of a box of 500 naphthalene molecules. The minimizations using a full atomistic (NAMD/CHARMM force field), a biaxial GB and a numerical potential from a LUT using data from the CHARMM pair potential was done. The numerical potential model is significantly more accurate in its approximation of the atomistic local minimum configuration than is the biaxial GB analytical potential function. This demonstrates that using a numerical potential founded on a direct lookup of the atomistic potential landscape significantly improves coarse grain (CG) modeling of complex molecules, possibly paving the way for accurate anisotropic system CG modeling.
Collapse
Affiliation(s)
- Ananya Gangopadhyay
- Scientific Computing Research Unit and Department of Chemistry, University of Cape Town, Cape Town, South Africa
| | - Simon Winberg
- Scientific Computing Research Unit and Department of Electrical Engineering, University of Cape Town, Cape Town, South Africa
| | - Kevin J Naidoo
- Scientific Computing Research Unit and Department of Chemistry, University of Cape Town, Cape Town, South Africa
| |
Collapse
|
46
|
Zou Y, Zhu Y, Li Y, Wu FX, Wang J. Parallel computing for genome sequence processing. Brief Bioinform 2021; 22:6210355. [PMID: 33822883 DOI: 10.1093/bib/bbab070] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Revised: 01/26/2021] [Accepted: 02/10/2021] [Indexed: 01/08/2023] Open
Abstract
The rapid increase of genome data brought by gene sequencing technologies poses a massive challenge to data processing. To solve the problems caused by enormous data and complex computing requirements, researchers have proposed many methods and tools which can be divided into three types: big data storage, efficient algorithm design and parallel computing. The purpose of this review is to investigate popular parallel programming technologies for genome sequence processing. Three common parallel computing models are introduced according to their hardware architectures, and each of which is classified into two or three types and is further analyzed with their features. Then, the parallel computing for genome sequence processing is discussed with four common applications: genome sequence alignment, single nucleotide polymorphism calling, genome sequence preprocessing, and pattern detection and searching. For each kind of application, its background is firstly introduced, and then a list of tools or algorithms are summarized in the aspects of principle, hardware platform and computing efficiency. The programming model of each hardware and application provides a reference for researchers to choose high-performance computing tools. Finally, we discuss the limitations and future trends of parallel computing technologies.
Collapse
Affiliation(s)
- You Zou
- Hunan Provincial Key Lab of Bioinformatics, School of Computer Science and Engineering at Central South University, Changsha, China
| | - Yuejie Zhu
- Hunan Provincial Key Lab of Bioinformatics, School of Computer Science and Engineering at Central South University, Changsha, China
| | - Yaohang Li
- computer science at Old Dominion University, USA
| | - Fang-Xiang Wu
- College of Engineering and the Department of Computer Science at the University of Saskatchewan, Saskatoon, Canada
| | - Jianxin Wang
- School of Computer Science and Engineering at Central South University, Changsha, Hunan, China
| |
Collapse
|
47
|
Abstract
Due to the inefficiency of multiple binary images encryption, a parallel binary image encryption framework based on the typical variants of spiking neural networks, spiking neural P (SNP) systems is proposed in this paper. More specifically, the two basic units in the proposed image cryptosystem, the permutation unit and the diffusion unit, are designed through SNP systems with multiple channels and polarizations (SNP-MCP systems), and SNP systems with astrocyte-like control (SNP-ALC systems), respectively. Different from the serial computing of the traditional image permutation/diffusion unit, SNP-MCP-based permutation/SNP-ALC-based diffusion unit can realize parallel computing through the parallel use of rules inside the neurons. Theoretical analysis results confirm the high efficiency of the binary image proposed cryptosystem. Security analysis experiments demonstrate the security of the proposed cryptosystem.
Collapse
Affiliation(s)
- Mingzhe Liu
- State Key Laboratory of Geohazard Prevention and Geoenvironment Protection, Chengdu University of Technology, Chengdu 610051, P. R. China
| | - Feixiang Zhao
- State Key Laboratory of Geohazard Prevention and Geoenvironment Protection, Chengdu University of Technology, Chengdu 610051, P. R. China
| | - Xin Jiang
- State Key Laboratory of Geohazard Prevention and Geoenvironment Protection, Chengdu University of Technology, Chengdu 610051, P. R. China
| | - Hong Zhang
- State Key Laboratory of Geohazard Prevention and Geoenvironment Protection, Chengdu University of Technology, Chengdu 610051, P. R. China
| | - Helen Zhou
- School of Engineering, Manukau Institute of Technology, Auckland 1150, New Zealand
| |
Collapse
|
48
|
Xi NM, Li JJ. Benchmarking Computational Doublet-Detection Methods for Single-Cell RNA Sequencing Data. Cell Syst 2021; 12:176-194.e6. [PMID: 33338399 PMCID: PMC7897250 DOI: 10.1016/j.cels.2020.11.008] [Citation(s) in RCA: 63] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2020] [Revised: 10/06/2020] [Accepted: 11/19/2020] [Indexed: 12/29/2022]
Abstract
In single-cell RNA sequencing (scRNA-seq), doublets form when two cells are encapsulated into one reaction volume. The existence of doublets, which appear to be-but are not-real cells, is a key confounder in scRNA-seq data analysis. Computational methods have been developed to detect doublets in scRNA-seq data; however, the scRNA-seq field lacks a comprehensive benchmarking of these methods, making it difficult for researchers to choose an appropriate method for specific analyses. We conducted a systematic benchmark study of nine cutting-edge computational doublet-detection methods. Our study included 16 real datasets, which contained experimentally annotated doublets, and 112 realistic synthetic datasets. We compared doublet-detection methods regarding detection accuracy under various experimental settings, impacts on downstream analyses, and computational efficiencies. Our results show that existing methods exhibited diverse performance and distinct advantages in different aspects. Overall, the DoubletFinder method has the best detection accuracy, and the cxds method has the highest computational efficiency. A record of this paper's transparent peer review process is included in the Supplemental Information.
Collapse
Affiliation(s)
- Nan Miles Xi
- Department of Statistics, University of California, Los Angeles, CA 90095-1554, USA
| | - Jingyi Jessica Li
- Department of Statistics, University of California, Los Angeles, CA 90095-1554, USA; Department of Human Genetics, University of California, Los Angeles, CA 90095-7088, USA; Department of Computational Medicine, University of California, Los Angeles, CA 90095-1766, USA.
| |
Collapse
|
49
|
Zhang J, Ye Z, Zheng K. A Parallel Computing Approach to Spatial Neighboring Analysis of Large Amounts of Terrain Data Using Spark. Sensors (Basel) 2021; 21:E365. [PMID: 33430375 DOI: 10.3390/s21020365] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Revised: 01/04/2021] [Accepted: 01/05/2021] [Indexed: 12/29/2022]
Abstract
Spatial neighboring analysis is an indispensable part of geo-raster spatial analysis. In the big data era, high-resolution raster data offer us abundant and valuable information, and also bring enormous computational challenges to the existing focal statistics algorithms. Simply employing the in-memory computing framework Spark to serve such applications might incur performance issues due to its lack of native support for spatial data. In this article, we present a Spark-based parallel computing approach for the focal algorithms of neighboring analysis. This approach implements efficient manipulation of large amounts of terrain data through three steps: (1) partitioning a raster digital elevation model (DEM) file into multiple square tile files by adopting a tile-based multifile storing strategy suitable for the Hadoop Distributed File System (HDFS), (2) performing the quintessential slope algorithm on these tile files using a dynamic calculation window (DCW) computing strategy, and (3) writing back and merging the calculation results into a whole raster file. Experiments with the digital elevation data of Australia show that the proposed computing approach can effectively improve the parallel performance of focal statistics algorithms. The results also show that the approach has almost the same calculation accuracy as that of ArcGIS. The proposed approach also exhibits good scalability when the number of Spark executors in clusters is increased.
Collapse
|
50
|
Williams-Young DB, de Jong WA, van Dam HJJ, Yang C. On the Efficient Evaluation of the Exchange Correlation Potential on Graphics Processing Unit Clusters. Front Chem 2020; 8:581058. [PMID: 33363105 PMCID: PMC7758429 DOI: 10.3389/fchem.2020.581058] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Accepted: 09/14/2020] [Indexed: 11/20/2022] Open
Abstract
The predominance of Kohn–Sham density functional theory (KS-DFT) for the theoretical treatment of large experimentally relevant systems in molecular chemistry and materials science relies primarily on the existence of efficient software implementations which are capable of leveraging the latest advances in modern high-performance computing (HPC). With recent trends in HPC leading toward increasing reliance on heterogeneous accelerator-based architectures such as graphics processing units (GPU), existing code bases must embrace these architectural advances to maintain the high levels of performance that have come to be expected for these methods. In this work, we purpose a three-level parallelism scheme for the distributed numerical integration of the exchange-correlation (XC) potential in the Gaussian basis set discretization of the Kohn–Sham equations on large computing clusters consisting of multiple GPUs per compute node. In addition, we purpose and demonstrate the efficacy of the use of batched kernels, including batched level-3 BLAS operations, in achieving high levels of performance on the GPU. We demonstrate the performance and scalability of the implementation of the purposed method in the NWChemEx software package by comparing to the existing scalable CPU XC integration in NWChem.
Collapse
Affiliation(s)
- David B Williams-Young
- Lawrence Berkeley National Laboratory, Computational Research Division, Berkeley, CA, United States
| | - Wibe A de Jong
- Lawrence Berkeley National Laboratory, Computational Research Division, Berkeley, CA, United States
| | - Hubertus J J van Dam
- Brookhaven National Laboratory, Computational Science Initiative, Upton, NY, United States
| | - Chao Yang
- Lawrence Berkeley National Laboratory, Computational Research Division, Berkeley, CA, United States
| |
Collapse
|