Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Teodoro G, Pan T, Kurc T, Kong J, Cooper L, Saltz J. Efficient Irregular Wavefront Propagation Algorithms on Hybrid CPU-GPU Machines. Parallel Comput 2013;39:189-211. [PMID: 23908562 PMCID: PMC3727669 DOI: 10.1016/j.parco.2013.03.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]

For:	Teodoro G, Pan T, Kurc T, Kong J, Cooper L, Saltz J. Efficient Irregular Wavefront Propagation Algorithms on Hybrid CPU-GPU Machines. Parallel Comput 2013;39:189-211. [PMID: 23908562 PMCID: PMC3727669 DOI: 10.1016/j.parco.2013.03.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]

Number

Cited by Other Article(s)

Jabłoński B, Puig Sitjes A, Makowski D, Jakubowski M, Gao Y, Fischer S, Winter A. Implementation and performance evaluation of the real-time algorithms for Wendelstein 7-X divertor protection system for OP2.1. FUSION ENGINEERING AND DESIGN 2023. [DOI: 10.1016/j.fusengdes.2023.113524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]

Barreiros W, Moreira J, Kurc T, Kong J, Melo AC, Saltz JH, Teodoro G. Optimizing parameter sensitivity analysis of large-scale microscopy image analysis workflows with multilevel computation reuse. CONCURRENCY AND COMPUTATION : PRACTICE & EXPERIENCE 2020;32:e5403. [PMID: 32669980 PMCID: PMC7363336 DOI: 10.1002/cpe.5403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Accepted: 05/18/2019] [Indexed: 06/11/2023]

Gomes J, de Melo ACMA, Kong J, Kurc T, Saltz JH, Teodoro G. Cooperative and out-of-core execution of the irregular wavefront propagation pattern on hybrid machines with Intel^Ⓡ Xeon Phi™. CONCURRENCY AND COMPUTATION : PRACTICE & EXPERIENCE 2018;30:e4425. [PMID: 30344454 PMCID: PMC6195363 DOI: 10.1002/cpe.4425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

Teodoro G, Kurc T, Andrade G, Kong J, Ferreira R, Saltz J. Application Performance Analysis and Efficient Execution on Systems with multi-core CPUs, GPUs and MICs: A Case Study with Microscopy Image Analysis. THE INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS 2017;31:32-51. [PMID: 28239253 PMCID: PMC5319667 DOI: 10.1177/1094342015594519] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Kurc T, Qi X, Wang D, Wang F, Teodoro G, Cooper L, Nalisnik M, Yang L, Saltz J, Foran DJ. Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies. BMC Bioinformatics 2015;16:399. [PMID: 26627175 PMCID: PMC4667532 DOI: 10.1186/s12859-015-0831-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Accepted: 11/16/2015] [Indexed: 11/16/2022] Open

Gomes JM, Teodoro G, de Melo A, Kong J, Kurc T, Saltz JH. Efficient irregular wavefront propagation algorithms on Intel^® Xeon Phi^™. PROCEEDINGS. SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING 2015;2015:25-32. [PMID: 27298591 DOI: 10.1109/sbac-pad.2015.13] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Teodoro G, Pan T, Kurc T, Kong J, Cooper L, Klasky S, Saltz J. Region Templates: Data Representation and Management for High-Throughput Image Analysis. PARALLEL COMPUTING 2014;40:589-610. [PMID: 26139953 PMCID: PMC4484879 DOI: 10.1016/j.parco.2014.09.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]

Andrade G, Ferreira R, Teodoro G, Rocha L, Saltz JH, Kurc T. Efficient Execution of Microscopy Image Analysis on CPU, GPU, and MIC Equipped Cluster Systems. PROCEEDINGS. SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING 2014;2014:89-96. [PMID: 26640423 PMCID: PMC4670037 DOI: 10.1109/sbac-pad.2014.15] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Teodoro G, Kurc T, Kong J, Cooper L, Saltz J. Comparative Performance Analysis of Intel Xeon Phi, GPU, and CPU: A Case Study from Microscopy Image Analysis. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS : A PUBLICATION OF THE IEEE COMPUTER SOCIETY 2014;2014:1063-1072. [PMID: 25419088 PMCID: PMC4240026 DOI: 10.1109/ipdps.2014.111] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Kong J, Wang F, Teodoro G, Cooper L, Moreno CS, Kurc T, Pan T, Saltz J, Brat D. High-Performance Computational Analysis of Glioblastoma Pathology Images with Database Support Identifies Molecular and Survival Correlates. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2013:229-236. [PMID: 25098236 DOI: 10.1109/bibm.2013.6732495] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Teodoro G, Pan T, Kurc TM, Kong J, Cooper LAD, Podhorszki N, Klasky S, Saltz JH. High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster Platforms. PROCEEDINGS. IPDPS (CONFERENCE) 2013;2013:103-114. [PMID: 25419546 PMCID: PMC4240318 DOI: 10.1109/ipdps.2013.11] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]

Abstract

Analysis of large pathology image datasets offers significant opportunities for the investigation of disease morphology, but the resource requirements of analysis pipelines limit the scale of such studies. Motivated by a brain cancer study, we propose and evaluate a parallel image analysis application pipeline for high throughput computation of large datasets of high resolution pathology tissue images on distributed CPU-GPU platforms. To achieve efficient execution on these hybrid systems, we have built runtime support that allows us to express the cancer image analysis application as a hierarchical data processing pipeline. The application is implemented as a coarse-grain pipeline of stages, where each stage may be further partitioned into another pipeline of fine-grain operations. The fine-grain operations are efficiently managed and scheduled for computation on CPUs and GPUs using performance aware scheduling techniques along with several optimizations, including architecture aware process placement, data locality conscious task assignment, data prefetching, and asynchronous data copy. These optimizations are employed to maximize the utilization of the aggregate computing power of CPUs and GPUs and minimize data copy overheads. Our experimental evaluation shows that the cooperative use of CPUs and GPUs achieves significant improvements on top of GPU-only versions (up to 1.6×) and that the execution of the application as a set of fine-grain operations provides more opportunities for runtime optimizations and attains better performance than coarser-grain, monolithic implementations used in other works. An implementation of the cancer image analysis pipeline using the runtime support was able to process an image dataset consisting of 36,848 4Kx4K-pixel image tiles (about 1.8TB uncompressed) in less than 4 minutes (150 tiles/second) on 100 nodes of a state-of-the-art hybrid cluster system.

Collapse