1
|
Shi M, Myronakis M, Jacobson M, Ferguson D, Williams C, Lehmann M, Baturin P, Huber P, Fueglistaller R, Lozano IV, Harris T, Morf D, Berbeco RI. GPU-accelerated Monte Carlo simulation of MV-CBCT. Phys Med Biol 2020; 65:235042. [PMID: 33263311 DOI: 10.1088/1361-6560/abaeba] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Monte Carlo simulation (MCS) is one of the most accurate computation methods for dose calculation and image formation in radiation therapy. However, the high computational complexity and long execution time of MCS limits its broad use. In this paper, we present a novel strategy to accelerate MCS using a graphic processing unit (GPU), and we demonstrate the application in mega-voltage (MV) cone-beam computed tomography (CBCT) simulation. A new framework that generates a series of MV projections from a single simulation run is designed specifically for MV-CBCT acquisition. A Geant4-based GPU code for photon simulation is incorporated into the framework for the simulation of photon transport through a phantom volume. The FastEPID method, which accelerates the simulation of MV images, is modified and integrated into the framework. The proposed GPU-based simulation strategy was tested for its accuracy and efficiency in a Catphan 604 phantom and an anthropomorphic pelvis phantom with beam energies at 2.5 MV, 6 MV, and 6 MV FFF. In all cases, the proposed GPU-based simulation demonstrated great simulation accuracy and excellent agreement with measurement and CPU-based simulation in terms of reconstructed image qualities. The MV-CBCT simulation was accelerated by factors of roughly 900-2300 using an NVIDIA Tesla V100 GPU card against a 2.5 GHz AMD Opteron™ Processor 6380.
Collapse
Affiliation(s)
- Mengying Shi
- Medical Physics Program, Department of Physics and Applied Physics, University of Massachusetts Lowell, Lowell, MA, United States of America. Brigham and Women's Hospital, Dana Farber Cancer Institute and Harvard Medical School, Boston, MA, United States of America
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
2
|
Macdonald CM, Arridge S, Powell S. Efficient inversion strategies for estimating optical properties with Monte Carlo radiative transport models. JOURNAL OF BIOMEDICAL OPTICS 2020; 25:JBO-200101R. [PMID: 32798354 PMCID: PMC7426481 DOI: 10.1117/1.jbo.25.8.085002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 07/23/2020] [Indexed: 06/11/2023]
Abstract
SIGNIFICANCE Indirect imaging problems in biomedical optics generally require repeated evaluation of forward models of radiative transport, for which Monte Carlo is accurate yet computationally costly. We develop an approach to reduce this bottleneck, which has significant implications for quantitative tomographic imaging in a variety of medical and industrial applications. AIM Our aim is to enable computationally efficient image reconstruction in (hybrid) diffuse optical modalities using stochastic forward models. APPROACH Using Monte Carlo, we compute a fully stochastic gradient of an objective function for a given imaging problem. Leveraging techniques from the machine learning community, we then adaptively control the accuracy of this gradient throughout the iterative inversion scheme to substantially reduce computational resources at each step. RESULTS For example problems of quantitative photoacoustic tomography and ultrasound-modulated optical tomography, we demonstrate that solutions are attainable using a total computational expense that is comparable to (or less than) that which is required for a single high-accuracy forward run of the same Monte Carlo model. CONCLUSIONS This approach demonstrates significant computational savings when approaching the full nonlinear inverse problem of optical property estimation using stochastic methods.
Collapse
Affiliation(s)
- Callum M. Macdonald
- University College London, Department of Medical Physics and Biomedical Engineering, London, United Kingdom
| | - Simon Arridge
- University College London, Department of Computer Science, London, United Kingdom
| | - Samuel Powell
- University of Nottingham, Faculty of Engineering, Nottingham, United Kingdom
| |
Collapse
|
3
|
Young-Schultz T, Brown S, Lilge L, Betz V. FullMonteCUDA: a fast, flexible, and accurate GPU-accelerated Monte Carlo simulator for light propagation in turbid media. BIOMEDICAL OPTICS EXPRESS 2019; 10:4711-4726. [PMID: 31565520 PMCID: PMC6757465 DOI: 10.1364/boe.10.004711] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2019] [Revised: 08/12/2019] [Accepted: 08/14/2019] [Indexed: 05/07/2023]
Abstract
Optimizing light delivery for photodynamic therapy, quantifying tissue optical properties or reconstructing 3D distributions of sources in bioluminescence imaging and absorbers in diffuse optical imaging all involve solving an inverse problem. This can require thousands of forward light propagation simulations to determine the parameters to optimize treatment, image tissue or quantify tissue optical properties, which is time-consuming and computationally expensive. Addressing this problem requires a light propagation simulator that produces results quickly given modelling parameters. In previous work, we developed FullMonteSW: currently the fastest, tetrahedral-mesh, Monte Carlo light propagation simulator written in software. Additional software optimizations showed diminishing performance improvements, so we investigated hardware acceleration methods. This work focuses on FullMonteCUDA: a GPU-accelerated version of FullMonteSW which targets NVIDIA GPUs. FullMonteCUDA has been validated across several benchmark models and, through various GPU-specific optimizations, achieves a 288-936x speedup over the single-threaded, non-vectorized version of FullMonteSW and a 4-13x speedup over the highly optimized, hand-vectorized and multi-threaded version. The increase in performance allows inverse problems to be solved more efficiently and effectively.
Collapse
Affiliation(s)
- Tanner Young-Schultz
- University of Toronto, Department of Electrical & Computer Engineering, Toronto, ON, Canada
| | - Stephen Brown
- University of Toronto, Department of Electrical & Computer Engineering, Toronto, ON, Canada
| | - Lothar Lilge
- Princess Margaret Cancer Centre, Toronto, ON, Canada
- University of Toronto, Department of Medical Biophysics, Toronto, ON, Canada
| | - Vaughn Betz
- University of Toronto, Department of Electrical & Computer Engineering, Toronto, ON, Canada
| |
Collapse
|
4
|
Photodynamic therapy of deep tissue abscess cavities: Retrospective image‐based feasibility study using Monte Carlo simulation. Med Phys 2019; 46:3259-3267. [DOI: 10.1002/mp.13557] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Revised: 04/11/2019] [Accepted: 04/17/2019] [Indexed: 01/11/2023] Open
|
5
|
Pratx G, Xing L. Monte Carlo simulation of photon migration in a cloud computing environment with MapReduce. JOURNAL OF BIOMEDICAL OPTICS 2011; 16:125003. [PMID: 22191916 PMCID: PMC3273307 DOI: 10.1117/1.3656964] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2011] [Revised: 09/29/2011] [Accepted: 10/10/2011] [Indexed: 05/31/2023]
Abstract
Monte Carlo simulation is considered the most reliable method for modeling photon migration in heterogeneous media. However, its widespread use is hindered by the high computational cost. The purpose of this work is to report on our implementation of a simple MapReduce method for performing fault-tolerant Monte Carlo computations in a massively-parallel cloud computing environment. We ported the MC321 Monte Carlo package to Hadoop, an open-source MapReduce framework. In this implementation, Map tasks compute photon histories in parallel while a Reduce task scores photon absorption. The distributed implementation was evaluated on a commercial compute cloud. The simulation time was found to be linearly dependent on the number of photons and inversely proportional to the number of nodes. For a cluster size of 240 nodes, the simulation of 100 billion photon histories took 22 min, a 1258 × speed-up compared to the single-threaded Monte Carlo program. The overall computational throughput was 85,178 photon histories per node per second, with a latency of 100 s. The distributed simulation produced the same output as the original implementation and was resilient to hardware failure: the correctness of the simulation was unaffected by the shutdown of 50% of the nodes.
Collapse
Affiliation(s)
- Guillem Pratx
- Stanford University School of Medicine, Department of Radiation Oncology, 875 Blake Wilbur Drive, Stanford, California 94305, USA.
| | | |
Collapse
|
6
|
Jiang C, Zhang H, Wang J, Wang Y, He H, Liu R, Zhou F, Deng J, Li P, Luo Q. Dedicated hardware processor and corresponding system-on-chip design for real-time laser speckle imaging. JOURNAL OF BIOMEDICAL OPTICS 2011; 16:116008. [PMID: 22112113 DOI: 10.1117/1.3651772] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Laser speckle imaging (LSI) is a noninvasive and full-field optical imaging technique which produces two-dimensional blood flow maps of tissues from the raw laser speckle images captured by a CCD camera without scanning. We present a hardware-friendly algorithm for the real-time processing of laser speckle imaging. The algorithm is developed and optimized specifically for LSI processing in the field programmable gate array (FPGA). Based on this algorithm, we designed a dedicated hardware processor for real-time LSI in FPGA. The pipeline processing scheme and parallel computing architecture are introduced into the design of this LSI hardware processor. When the LSI hardware processor is implemented in the FPGA running at the maximum frequency of 130 MHz, up to 85 raw images with the resolution of 640×480 pixels can be processed per second. Meanwhile, we also present a system on chip (SOC) solution for LSI processing by integrating the CCD controller, memory controller, LSI hardware processor, and LCD display controller into a single FPGA chip. This SOC solution also can be used to produce an application specific integrated circuit for LSI processing.
Collapse
Affiliation(s)
- Chao Jiang
- Huazhong University of Science and Technology, Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics, Wuhan, China
| | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Alerstam E, Lo WCY, Han TD, Rose J, Andersson-Engels S, Lilge L. Next-generation acceleration and code optimization for light transport in turbid media using GPUs. BIOMEDICAL OPTICS EXPRESS 2010; 1:658-75. [PMID: 21258498 PMCID: PMC3018007 DOI: 10.1364/boe.1.000658] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/09/2010] [Revised: 08/10/2010] [Accepted: 08/11/2010] [Indexed: 05/19/2023]
Abstract
A highly optimized Monte Carlo (MC) code package for simulating light transport is developed on the latest graphics processing unit (GPU) built for general-purpose computing from NVIDIA - the Fermi GPU. In biomedical optics, the MC method is the gold standard approach for simulating light transport in biological tissue, both due to its accuracy and its flexibility in modelling realistic, heterogeneous tissue geometry in 3-D. However, the widespread use of MC simulations in inverse problems, such as treatment planning for PDT, is limited by their long computation time. Despite its parallel nature, optimizing MC code on the GPU has been shown to be a challenge, particularly when the sharing of simulation result matrices among many parallel threads demands the frequent use of atomic instructions to access the slow GPU global memory. This paper proposes an optimization scheme that utilizes the fast shared memory to resolve the performance bottleneck caused by atomic access, and discusses numerous other optimization techniques needed to harness the full potential of the GPU. Using these techniques, a widely accepted MC code package in biophotonics, called MCML, was successfully accelerated on a Fermi GPU by approximately 600x compared to a state-of-the-art Intel Core i7 CPU. A skin model consisting of 7 layers was used as the standard simulation geometry. To demonstrate the possibility of GPU cluster computing, the same GPU code was executed on four GPUs, showing a linear improvement in performance with an increasing number of GPUs. The GPU-based MCML code package, named GPU-MCML, is compatible with a wide range of graphics cards and is released as an open-source software in two versions: an optimized version tuned for high performance and a simplified version for beginners (http://code.google.com/p/gpumcml).
Collapse
Affiliation(s)
- Erik Alerstam
- Department of Physics, Lund University, Sweden
- These authors contributed equally to this work and
should be considered co-first authors
- http://code.google.com/p/gpumcml
| | - William Chun Yip Lo
- Department of Medical Biophysics, University of
Toronto, Canada
- These authors contributed equally to this work and
should be considered co-first authors
| | - Tianyi David Han
- The Edward S. Rogers Sr. Department of Electrical
and Computer Engineering, University of Toronto, Canada
| | - Jonathan Rose
- The Edward S. Rogers Sr. Department of Electrical
and Computer Engineering, University of Toronto, Canada
| | | | - Lothar Lilge
- Department of Medical Biophysics, University of
Toronto, Canada
- Ontario Cancer Institute, University Health Network,
Canada
| |
Collapse
|
8
|
Svanberg K, Bendsoe N, Axelsson J, Andersson-Engels S, Svanberg S. Photodynamic therapy: superficial and interstitial illumination. JOURNAL OF BIOMEDICAL OPTICS 2010; 15:041502. [PMID: 20799780 DOI: 10.1117/1.3466579] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Photodynamic therapy (PDT) is reviewed using the treatment of skin tumors as an example of superficial lesions and prostate cancer as an example of deep-lying lesions requiring interstitial intervention. These two applications are among the most commonly studied in oncological PDT, and illustrate well the different challenges facing the two modalities of PDT-superficial and interstitial. They thus serve as good examples to illustrate the entire field of PDT in oncology. PDT is discussed based on the Lund University group's over 20 yr of experience in the field. In particular, the interplay between optical diagnostics and dosimetry and the delivery of the therapeutic light dose are highlighted. An interactive multiple-fiber interstitial procedure to deliver the required therapeutic dose based on the assessment of light fluence rate and sensitizer concentration and oxygen level throughout the tumor is presented.
Collapse
|
9
|
Carbone N, Di Rocco H, Iriarte DI, Pomarico JA. Solution of the direct problem in turbid media with inclusions using Monte Carlo simulations implemented in graphics processing units: new criterion for processing transmittance data. JOURNAL OF BIOMEDICAL OPTICS 2010; 15:035002. [PMID: 20615002 DOI: 10.1117/1.3442750] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
The study of light propagation in diffusive media requires solving the radiative transfer equation, or eventually, the diffusion approximation. Except for some cases involving simple geometries, the problem with immersed inclusions has not been solved. Also, Monte Carlo (MC) calculations have become a gold standard for simulating photon migration in turbid media, although they have the drawback large processing times. The purpose of this work is two-fold: first, we introduce a new processing criterion to retrieve information about the location and shape of absorbing inclusions based on normalization to the background intensity, when no inhomogeneities are present. Second, we demonstrate the feasibility of including inhomogeneities in MC simulations implemented in graphics processing units, achieving large acceleration factors ( approximately 10(3)), thus providing an important tool for iteratively solving the forward problem to retrieve the optical properties of the inclusion. Results using a cw source are compared with MC outcomes showing very good agreement.
Collapse
Affiliation(s)
- Nicolas Carbone
- Universidad Nacional del Centro, Department of Physics, Pinto 399, Tandil, Buenos Aires 7000 Argentina
| | | | | | | |
Collapse
|
10
|
Abstract
Optical imaging has been widely applied in preclinical and clinical applications. Fifteen years ago, an efficient Monte Carlo program 'MCML' was developed for use with multi-layered turbid media and has gained popularity in the field of biophotonics. Currently, there is an increasingly pressing need for simulating tools more powerful than MCML in order to study light propagation phenomena in complex inhomogeneous objects, such as the mouse. Here we report a tetrahedron-based inhomogeneous Monte Carlo optical simulator (TIM-OS) to address this issue. By modeling an object as a tetrahedron-based inhomogeneous finite-element mesh, TIM-OS can determine the photon-triangle interaction recursively and rapidly. In numerical simulation, we have demonstrated the correctness and efficiency of TIM-OS.
Collapse
Affiliation(s)
- H Shen
- School of Biomedical Engineering and Sciences, Virginia Tech, Blacksburg, VA, USA.
| | | |
Collapse
|
11
|
Fang Q, Boas DA. Monte Carlo simulation of photon migration in 3D turbid media accelerated by graphics processing units. OPTICS EXPRESS 2009; 17:20178-90. [PMID: 19997242 PMCID: PMC2863034 DOI: 10.1364/oe.17.020178] [Citation(s) in RCA: 431] [Impact Index Per Article: 28.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
We report a parallel Monte Carlo algorithm accelerated by graphics processing units (GPU) for modeling time-resolved photon migration in arbitrary 3D turbid media. By taking advantage of the massively parallel threads and low-memory latency, this algorithm allows many photons to be simulated simultaneously in a GPU. To further improve the computational efficiency, we explored two parallel random number generators (RNG), including a floating-point-only RNG based on a chaotic lattice. An efficient scheme for boundary reflection was implemented, along with the functions for time-resolved imaging. For a homogeneous semi-infinite medium, good agreement was observed between the simulation output and the analytical solution from the diffusion theory. The code was implemented with CUDA programming language, and benchmarked under various parameters, such as thread number, selection of RNG and memory access pattern. With a low-cost graphics card, this algorithm has demonstrated an acceleration ratio above 300 when using 1792 parallel threads over conventional CPU computation. The acceleration ratio drops to 75 when using atomic operations. These results render the GPU-based Monte Carlo simulation a practical solution for data analysis in a wide range of diffuse optical imaging applications, such as human brain or small-animal imaging.
Collapse
Affiliation(s)
- Qianqian Fang
- Martinos Center for Biomedical Imaging, Massachusetts General Hospital, 149 13th St, Charlestown, Massachusetts, 02129, USA.
| | | |
Collapse
|