Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Isard M, Budiu M, Yu Y, Birrell A, Fetterly D. Dryad. ACTA ACUST UNITED AC 2007. [DOI: 10.1145/1272998.1273005] [Citation(s) in RCA: 213] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]

Number

Cited by Other Article(s)

Czumaj A, Davies-Peck P, Parter M. Component stability in low-space massively parallel computation. DISTRIBUTED COMPUTING 2024;37:35-64. [PMID: 38370529 PMCID: PMC10873458 DOI: 10.1007/s00446-024-00461-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 01/11/2024] [Indexed: 02/20/2024]

Abstract

In this paper, we study the power and limitations of component-stable algorithms in the low-space model of massively parallel computation (MPC). Recently Ghaffari, Kuhn and Uitto (FOCS 2019) introduced the class of component-stable low-space MPC algorithms, which are, informally, those algorithms for which the outputs reported by the nodes in different connected components are required to be independent. This very natural notion was introduced to capture most (if not all) of the known efficient MPC algorithms to date, and it was the first general class of MPC algorithms for which one can show non-trivial conditional lower bounds. In this paper we enhance the framework of component-stable algorithms and investigate its effect on the complexity of randomized and deterministic low-space MPC. Our key contributions include: 1. We revise and formalize the lifting approach of Ghaffari, Kuhn and Uitto. This requires a very delicate amendment of the notion of component stability, which allows us to fill in gaps in the earlier arguments. 2. We also extend the framework to obtain conditional lower bounds for deterministic algorithms and fine-grained lower bounds that depend on the maximum degree Δ . 3. We demonstrate a collection of natural graph problems for which deterministic component-unstable algorithms break the conditional lower bound obtained for component-stable algorithms. This implies that, in the context of deterministic algorithms, component-stable algorithms are conditionally weaker than the component-unstable ones. 4. We also show that the restriction to component-stable algorithms has an impact in the randomized setting. We present a natural problem which can be solved in O(1) rounds by a component-unstable MPC algorithm, but requires Ω ( log log ∗ n ) rounds for any component-stable algorithm, conditioned on the connectivity conjecture. Altogether our results imply that component-stability might limit the computational power of the low-space MPC model, at least in certain contexts, paving the way for improved upper bounds that escape the conditional lower bound setting of Ghaffari, Kuhn, and Uitto.

Collapse

A repository for the publication and sharing of heterogeneous materials data. Sci Data 2022;9:787. [PMID: 36575234 PMCID: PMC9794830 DOI: 10.1038/s41597-022-01897-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 12/14/2022] [Indexed: 12/28/2022] Open

Dall'Alba G, Casa PL, Abreu FPD, Notari DL, de Avila E Silva S. A Survey of Biological Data in a Big Data Perspective. BIG DATA 2022;10:279-297. [PMID: 35394342 DOI: 10.1089/big.2020.0383] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Áika: A Distributed Edge System for AI Inference. BIG DATA AND COGNITIVE COMPUTING 2022. [DOI: 10.3390/bdcc6020068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]

Nanongkai D, Scquizzato M. Equivalence classes and conditional hardness in massively parallel computations. DISTRIBUTED COMPUTING 2022;35:165-183. [PMID: 35300185 PMCID: PMC8907129 DOI: 10.1007/s00446-021-00418-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Accepted: 12/17/2021] [Indexed: 06/14/2023]

Abstract

The Massively Parallel Computation (MPC) model serves as a common abstraction of many modern large-scale data processing frameworks, and has been receiving increasingly more attention over the past few years, especially in the context of classical graph problems. So far, the only way to argue lower bounds for this model is to condition on conjectures about the hardness of some specific problems, such as graph connectivity on promise graphs that are either one cycle or two cycles, usually called the one cycle versus two cycles problem. This is unlike the traditional arguments based on conjectures about complexity classes (e.g., P ≠ NP ), which are often more robust in the sense that refuting them would lead to groundbreaking algorithms for a whole bunch of problems. In this paper we present connections between problems and classes of problems that allow the latter type of arguments. These connections concern the class of problems solvable in a sublogarithmic amount of rounds in the MPC model, denoted by MPC ( o ( log N ) ) , and the standard space complexity classes L and NL , and suggest conjectures that are robust in the sense that refuting them would lead to many surprisingly fast new algorithms in the MPC model. We also obtain new conditional lower bounds, and prove new reductions and equivalences between problems in the MPC model. Specifically, our main results are as follows.Lower bounds conditioned on the one cycle versus two cycles conjecture can be instead argued under the L ⊈ MPC ( o ( log N ) ) conjecture: these two assumptions are equivalent, and refuting either of them would lead to o ( log N ) -round MPC algorithms for a large number of challenging problems, including list ranking, minimum cut, and planarity testing. In fact, we show that these problems and many others require asymptotically the same number of rounds as the seemingly much easier problem of distinguishing between a graph being one cycle or two cycles.Many lower bounds previously argued under the one cycle versus two cycles conjecture can be argued under an even more robust (thus harder to refute) conjecture, namely NL ⊈ MPC ( o ( log N ) ) . Refuting this conjecture would lead to o ( log N ) -round MPC algorithms for an even larger set of problems, including all-pairs shortest paths, betweenness centrality, and all aforementioned ones. Lower bounds under this conjecture hold for problems such as perfect matching and network flow.

Collapse

Nagy RC, Balch JK, Bissell EK, Cattau ME, Glenn NF, Halpern BS, Ilangakoon N, Johnson B, Joseph MB, Marconi S, O’Riordan C, Sanovia J, Swetnam TL, Travis WR, Wasser LA, Woolner E, Zarnetske P, Abdulrahim M, Adler J, Barnes G, Bartowitz KJ, Blake RE, Bombaci SP, Brun J, Buchanan JD, Chadwick KD, Chapman MS, Chong SS, Chung YA, Corman JR, Couret J, Crispo E, Doak TG, Donnelly A, Duffy KA, Dunning KH, Duran SM, Edmonds JW, Fairbanks DE, Felton AJ, Florian CR, Gann D, Gebhardt M, Gill NS, Gram WK, Guo JS, Harvey BJ, Hayes KR, Helmus MR, Hensley RT, Hondula KL, Huang T, Hundertmark WJ, Iglesias V, Jacinthe P, Jansen LS, Jarzyna MA, Johnson TM, Jones KD, Jones MA, Just MG, Kaddoura YO, Kagawa‐Vivani AK, Kaushik A, Keller AB, King KBS, Kitzes J, Koontz MJ, Kouba PV, Kwan W, LaMontagne JM, LaRue EA, Li D, Li B, Lin Y, Liptzin D, Long WA, Mahood AL, Malloy SS, Malone SL, McGlinchy JM, Meier CL, Melbourne BA, Mietkiewicz N, Morisette JT, Moustapha M, Muscarella C, Musinsky J, Muthukrishnan R, Naithani K, Neely M, Norman K, Parker SM, Perez Rocha M, Petri L, Ramey CA, Record S, Rossi MW, SanClements M, Scholl VM, Schweiger AK, Seyednasrollah B, Sihi D, Smith KR, Sokol ER, Spaulding SA, Spiers AI, St. Denis LA, Staccone AP, Stack Whitney K, Stanitski DM, Stricker E, Surasinghe TD, Thomsen SK, Vasek PM, Xiaolu L, Yang D, Yu R, Yule KM, Zhu K. Harnessing the NEON data revolution to advance open environmental science with a diverse and data‐capable community. Ecosphere 2021. [DOI: 10.1002/ecs2.3833] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open

Affiliation(s)

R. Chelsea Nagy Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA
Jennifer K. Balch Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA Department of Geography University of Colorado Boulder Boulder Colorado USA
Erin K. Bissell Biology Department Metropolitan State University of Denver Denver Colorado USA
Megan E. Cattau Human‐Environment Systems Boise State University Boise Idaho USA
Nancy F. Glenn Human‐Environment Systems Boise State University Boise Idaho USA University of New South Wales Sydney Sydney New South Wales Australia
Benjamin S. Halpern National Center for Ecological Analysis and Synthesis (NCEAS) Santa Barbara California USA University of California Santa Barbara Santa Barbara California USA
Nayani Ilangakoon Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA
Brian Johnson Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA
Maxwell B. Joseph Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA
Sergio Marconi School of Natural Resources & Environment University of Florida Gainesville Florida USA
Catherine O’Riordan Ecological Society of America Washington D.C. USA
James Sanovia Department of Math, Science, and Technology Oglala Lakota College Kyle South Dakota USA
Tyson L. Swetnam BIO5 Institute University of Arizona Tucson Arizona USA
William R. Travis Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA Department of Geography University of Colorado Boulder Boulder Colorado USA
Leah A. Wasser Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA Department of Geography University of Colorado Boulder Boulder Colorado USA
Elizabeth Woolner Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA
Phoebe Zarnetske Department of Integrative Biology Michigan State University East Lansing Michigan USA
Mujahid Abdulrahim Department of Civil and Mechanical Engineering University of Missouri Kansas City Kansas City Missouri USA
John Adler Department of Geography University of Colorado Boulder Boulder Colorado USA CIRES University of Colorado Boulder Boulder Colorado USA
Grenville Barnes Department of Forest, Fisheries and Geomatics Sciences University of Florida Gainesville Florida USA
Kristina J. Bartowitz Department of Forest, Rangeland, and Fire Sciences University of Idaho Moscow Idaho USA
Rachael E. Blake National Socio‐Environmental Synthesis Center University of Maryland Annapolis Maryland USA
Sara P. Bombaci Department of Fish, Wildlife, and Conservation Biology Colorado State University Fort Collins Colorado USA
Julien Brun National Center for Ecological Analysis and Synthesis (NCEAS) Santa Barbara California USA University of California Santa Barbara Santa Barbara California USA
Jacob D. Buchanan Department of Biological Sciences Bowling Green State University Bowling Green Ohio USA
K. Dana Chadwick Department of Geological Sciences University of Texas Austin Austin Texas USA Department of Integrative Biology University of Texas Austin Austin Texas USA
Melissa S. Chapman Department of Environmental Science, Policy, and Management University of California Berkeley Berkeley California USA
Steven S. Chong National Center for Ecological Analysis and Synthesis (NCEAS) Santa Barbara California USA University of California Santa Barbara Santa Barbara California USA University of California Berkeley Library University of California Berkeley Berkeley California USA
Y. Anny Chung Departments of Plant Biology and Plant Pathology University of Georgia Athens Georgia USA
Jessica R. Corman School of Natural Resources University of Nebraska Lincoln Lincoln Nebraska USA
Jannelle Couret Department of Biological Sciences University of Rhode Island Kingston Rhode Island USA
Erika Crispo Department of Biology Pace University New York City New York USA
Thomas G. Doak Department of Biology Indiana University Bloomington Indiana USA
Alison Donnelly Department of Geography University of Wisconsin‐Milwaukee Milwaukee Wisconsin USA
Katharyn A. Duffy School of Informatics, Computing & Cyber Systems Northern Arizona University Flagstaff Arizona USA
Kelly H. Dunning School of Forestry and Wildlife Auburn University Auburn Alabama USA
Sandra M. Duran Department of Ecology and Evolutionary Biology University of Arizona Tucson Arizona USA
Jennifer W. Edmonds Department of Physical and Life Sciences Nevada State College Henderson Nevada USA
Dawson E. Fairbanks Department of Environmental Science University of Arizona Tucson Arizona USA
Andrew J. Felton Department of Wildland Resources Utah State University Logan Utah USA
Christopher R. Florian Battelle National Ecological Observatory Network Boulder Colorado USA
Daniel Gann Department of Biological Sciences Florida International University Miami Florida USA
Martha Gebhardt School of Natural Resources and the Environment University of Arizona Tucson Arizona USA
Nathan S. Gill Department of Natural Resources Management Texas Tech University Lubbock Texas USA
Wendy K. Gram University Corporation for Atmospheric Research Boulder Colorado USA
Jessica S. Guo College of Agriculture and Life Sciences University of Arizona Tucson Arizona USA
Brian J. Harvey School of Environmental and Forest Sciences University of Washington Seattle Washington USA
Katherine R. Hayes Department of Integrative and Systems Biology University of Colorado Denver Denver Colorado USA
Matthew R. Helmus Department of Biology Temple University Philadelphia Pennsylvania USA
Robert T. Hensley Battelle National Ecological Observatory Network Boulder Colorado USA
Kelly L. Hondula National Socio‐Environmental Synthesis Center University of Maryland Annapolis Maryland USA
Tao Huang Human‐Environment Systems Boise State University Boise Idaho USA Cary Institute of Ecosystem Services Millbrook New York USA
Wiley J. Hundertmark Department of Earth and Environment Boston University Boston Massachusetts USA
Virginia Iglesias Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA
Pierre‐Andre Jacinthe Department of Earth Sciences Indiana University Purdue University Indianapolis Indiana USA
Lara S. Jansen Department of Environmental Science & Management Portland State University Portland Oregon USA
Marta A. Jarzyna Department of Evolution, Ecology, and Organismal Biology The Ohio State University Columbus Ohio USA Translational Data Analytics Institute The Ohio State University Columbus Ohio USA
Tiona M. Johnson Atlanta Georgia USA
Katherine D. Jones Battelle National Ecological Observatory Network Boulder Colorado USA
Megan A. Jones Boulder Colorado USA
Michael G. Just US Army ERDC CERL Champaign Illinois USA
Youssef O. Kaddoura Department of Forest, Fisheries and Geomatics Sciences University of Florida Gainesville Florida USA
Aurora K. Kagawa‐Vivani Department of Geography and Environment University of Hawaiʻi at Mānoa Honolulu Hawaii USA
Aleya Kaushik National Oceanic and Atmospheric Administration Boulder Colorado USA
Adrienne B. Keller Department of Ecology, Evolution, and Behavior University of Minnesota Twin Cities St. Paul Minnesota USA
Katelyn B. S. King Department of Fisheries and Wildlife Michigan State University East Lansing Michigan USA
Justin Kitzes Department of Biological Sciences University of Pittsburgh Pittsburgh Pennsylvania USA
Michael J. Koontz Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA
Paige V. Kouba Department of Plant Sciences University of California Davis Davis California USA
Wai‐Yin Kwan CALeDNA University of California Los Angeles Los Angeles California USA
Jalene M. LaMontagne Department of Biological Sciences DePaul University Chicago Illinois USA
Elizabeth A. LaRue Department of Forestry and Natural Resources Purdue University West Lafayette Indiana USA
Daijiang Li Department of Biological Sciences Louisiana State University Baton Rouge Louisiana USA Center for Computation & Technology Louisiana State University Baton Rouge Louisiana USA
Bonan Li Department of Biological & Ecological Engineering Oregon State University Corvallis Oregon USA
Yang Lin Soil and Water Sciences Department University of Florida Gainesville Florida USA
Daniel Liptzin Soil Health Institute Morrisville North Carolina USA
William Alex Long Science and Technology Innovation Program Woodrow Wilson International Center for Scholars Washington D.C. USA
Adam L. Mahood Department of Geography University of Colorado Boulder Boulder Colorado USA
Samuel S. Malloy Battelle Center for Science, Engineering and Public Policy in the John Glenn College of Public Affairs Ohio State University Columbus Ohio USA
Sparkle L. Malone Department of Biological Sciences Florida International University Miami Florida USA
Joseph M. McGlinchy Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA
Courtney L. Meier Battelle National Ecological Observatory Network Boulder Colorado USA
Brett A. Melbourne Department of Ecology and Evolutionary Biology University of Colorado Boulder Boulder Colorado USA
Nathan Mietkiewicz CoreLogic Irvine CA USA
Jeffery T. Morisette U.S. Department of Agriculture Forest Service Rocky Mountain Research Station Fort Collins Colorado USA
Moussa Moustapha Department of Biological Science University of Ngaoundere Ngaoundere Adamawa Cameroon
Chance Muscarella Department of Environmental Science University of Arizona Tucson Arizona USA
John Musinsky Battelle National Ecological Observatory Network Boulder Colorado USA
Ranjan Muthukrishnan Environmental Resilience Institute Indiana University Bloomington Illinois USA
Kusum Naithani Department of Biological Sciences University of Arkansas‐Fayetteville Fayetteville Arkansas USA
Merrie Neely GEO AquaWatch Clearwater Florida USA Global Science and Technology, Inc Greenbelt Maryland USA
Kari Norman Department of Environmental Science, Policy, and Management University of California Berkeley Berkeley California USA
Stephanie M. Parker Battelle National Ecological Observatory Network Boulder Colorado USA
Mariana Perez Rocha Department of Biology University of Oklahoma Norman Oklahoma USA
Laís Petri School for Environment and Sustainability University of Michigan East Lansing Michigan USA
Colette A. Ramey Biology Department Metropolitan State University of Denver Denver Colorado USA
Sydne Record Department of Biology Bryn Mawr College Bryn Mawr Pennsylvania USA
Matthew W. Rossi Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA
Michael SanClements Battelle National Ecological Observatory Network Boulder Colorado USA
Victoria M. Scholl Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA Department of Geography University of Colorado Boulder Boulder Colorado USA
Anna K. Schweiger Remote Sensing Laboratories Department of Geography University of Zurich Zurich Switzerland
Bijan Seyednasrollah School of Informatics, Computing & Cyber Systems Northern Arizona University Flagstaff Arizona USA
Debjani Sihi Department of Environmental Sciences Emory University Atlanta Georgia USA
Kathleen R. Smith Biology Department Metropolitan State University of Denver Denver Colorado USA
Eric R. Sokol Battelle National Ecological Observatory Network Boulder Colorado USA INSTAAR University of Colorado Boulder Boulder Colorado USA
Sarah A. Spaulding INSTAAR University of Colorado Boulder Boulder Colorado USA
Anna I. Spiers Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA Department of Ecology and Evolutionary Biology University of Colorado Boulder Boulder Colorado USA
Lise A. St. Denis Earth Lab, CIRES University of Colorado Boulder Boulder Colorado USA
Anika P. Staccone Department of Ecology, Evolution, & Environmental Biology Columbia University New York New York USA
Kaitlin Stack Whitney Department of Science, Technology, and Society Rochester Institute of Technology Henrietta New York USA
Diane M. Stanitski National Oceanic and Atmospheric Administration Boulder Colorado USA
Eva Stricker Department of Biology University of New Mexico Albuquerque New Mexico USA
Thilina D. Surasinghe Department of Biological Sciences Bridgewater State University Bridgewater Massachusetts USA
Sarah K. Thomsen Department of Integrative Biology Oregon State University Corvallis Oregon USA
Patrisse M. Vasek Department of Math, Science, and Technology Oglala Lakota College Kyle South Dakota USA
Li Xiaolu Department of Earth and Atmospheric Sciences Cornell University Ithaca New York USA
Di Yang Wyoming GIS Center University of Wyoming Laramie Wyoming USA
Rong Yu Department of Geography University of Wisconsin‐Milwaukee Milwaukee Wisconsin USA
Kelsey M. Yule Biodiversity Knowledge Integration Center Arizona State University Tempe Arizona USA
Kai Zhu Department of Environmental Studies University of California, Santa Cruz Santa Cruz California USA

Collapse

Xu R, Li W, Li K, Zhou X, Qi H. Scheduling Mix-Coflows in Datacenter Networks. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT 2021. [DOI: 10.1109/tnsm.2020.3027498] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Feng H, Deng Y, Qin X, Min G. Criso: An Incremental Scalable and Cost-Effective Network Architecture for Data Centers. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT 2021. [DOI: 10.1109/tnsm.2020.3036875] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Development of an Innovative ICT Infrastructure for an Eco-Cost System with Life Cycle Assessment. SUSTAINABILITY 2021. [DOI: 10.3390/su13063118] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Zhang J, Jiang Y, Liu Y. Variable Expanding Structure for Data Center Interconnection Networks. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS 2021. [DOI: 10.20965/jaciii.2021.p0013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Advances in MapReduce Big Data Processing: Platform, Tools, and Algorithms. STUDIES IN BIG DATA 2021. [DOI: 10.1007/978-981-33-6400-4_6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

AlJame M, Ahmad I. DNA short read alignment on apache spark. APPLIED COMPUTING AND INFORMATICS 2020. [DOI: 10.1016/j.aci.2019.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

Zhang Z, Deng Y, Min G, Xie J, Yang LT, Zhou Y. HSDC: A Highly Scalable Data Center Network Architecture for Greater Incremental Scalability. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2019;30:1105-1119. [DOI: 10.1109/tpds.2018.2874659] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/04/2023]

An efficient cost-based algorithm for scheduling workflow tasks in cloud computing systems. Neural Comput Appl 2018. [DOI: 10.1007/s00521-018-3610-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]

Sebei H, Hadj Taieb MA, Ben Aouicha M. Review of social media analytics process and Big Data pipeline. SOCIAL NETWORK ANALYSIS AND MINING 2018. [DOI: 10.1007/s13278-018-0507-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Zhang Z, Deng Y, Min G, Xie J, Huang S. ExCCC-DCN: A Highly Scalable, Cost-Effective and Energy-Efficient Data Center Structure. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2017;28:1046-1060. [DOI: 10.1109/tpds.2016.2609428] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/04/2023]

Distributed and scalable sequential pattern mining through stream processing. Knowl Inf Syst 2017. [DOI: 10.1007/s10115-017-1037-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Ghaleb AM, Khalifa T, Ayoubi S, Shaban KB, Assi C. Surviving Multiple Failures in Multicast Virtual Networks With Virtual Machines Migration. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT 2016. [DOI: 10.1109/tnsm.2016.2616283] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Huang M, Wu D, Yu CH, Fang Z, Interlandi M, Condie T, Cong J. Programming and Runtime Support to Blaze FPGA Accelerator Deployment at Datacenter Scale. PROCEEDINGS OF THE ... ACM SYMPOSIUM ON CLOUD COMPUTING [ELECTRONIC RESOURCE] : SOCC ... ... SOCC (CONFERENCE) 2016;2016:456-469. [PMID: 28317049 DOI: 10.1145/2987550.2987569] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

CAT: A Cost-Aware Translator for SQL-query workflow to MapReduce jobflow. DATA KNOWL ENG 2016. [DOI: 10.1016/j.datak.2015.12.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Lee A, Whiteley N. Forest resampling for distributed sequential Monte Carlo. Stat Anal Data Min 2015. [DOI: 10.1002/sam.11280] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Mohammed EA, Far BH, Naugler C. Applications of the MapReduce programming framework to clinical big data analysis: current landscape and future trends. BioData Min 2014;7:22. [PMID: 25383096 PMCID: PMC4224309 DOI: 10.1186/1756-0381-7-22] [Citation(s) in RCA: 75] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2014] [Accepted: 10/18/2014] [Indexed: 12/23/2022] Open

Abstract

The emergence of massive datasets in a clinical setting presents both challenges and opportunities in data storage and analysis. This so called "big data" challenges traditional analytic tools and will increasingly require novel solutions adapted from other fields. Advances in information and communication technology present the most viable solutions to big data analysis in terms of efficiency and scalability. It is vital those big data solutions are multithreaded and that data access approaches be precisely tailored to large volumes of semi-structured/unstructured data. THE MAPREDUCE PROGRAMMING FRAMEWORK USES TWO TASKS COMMON IN FUNCTIONAL PROGRAMMING: Map and Reduce. MapReduce is a new parallel processing framework and Hadoop is its open-source implementation on a single computing node or on clusters. Compared with existing parallel processing paradigms (e.g. grid computing and graphical processing unit (GPU)), MapReduce and Hadoop have two advantages: 1) fault-tolerant storage resulting in reliable data processing by replicating the computing tasks, and cloning the data chunks on different computing nodes across the computing cluster; 2) high-throughput data processing via a batch processing framework and the Hadoop distributed file system (HDFS). Data are stored in the HDFS and made available to the slave nodes for computation. In this paper, we review the existing applications of the MapReduce programming framework and its implementation platform Hadoop in clinical big data and related medical health informatics fields. The usage of MapReduce and Hadoop on a distributed system represents a significant advance in clinical big data processing and utilization, and opens up new opportunities in the emerging era of big data analytics. The objective of this paper is to summarize the state-of-the-art efforts in clinical big data analytics and highlight what might be needed to enhance the outcomes of clinical big data analytics tools. This paper is concluded by summarizing the potential usage of the MapReduce programming framework and Hadoop platform to process huge volumes of clinical data in medical health informatics related fields.

Collapse

Beyond Batch Processing: Towards Real-Time and Streaming Big Data. COMPUTERS 2014. [DOI: 10.3390/computers3040117] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Zhang G, Li C, Zhang Y, Xing C. A Semantic++ MapReduce Parallel Programming Model. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING 2014. [DOI: 10.1142/s1793351x14400091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Philip Chen C, Zhang CY. Data-intensive applications, challenges, techniques and technologies: A survey on Big Data. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2014.01.015] [Citation(s) in RCA: 1722] [Impact Index Per Article: 172.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]

Freeman J, Vladimirov N, Kawashima T, Mu Y, Sofroniew NJ, Bennett DV, Rosen J, Yang CT, Looger LL, Ahrens MB. Mapping brain activity at scale with cluster computing. Nat Methods 2014;11:941-50. [DOI: 10.1038/nmeth.3041] [Citation(s) in RCA: 205] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2014] [Accepted: 06/23/2014] [Indexed: 12/18/2022]

Risk intelligence: making profit from uncertainty in data processing system. ScientificWorldJournal 2014;2014:398235. [PMID: 24883392 PMCID: PMC4030500 DOI: 10.1155/2014/398235] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2014] [Accepted: 03/19/2014] [Indexed: 11/17/2022] Open

Liang F, Feng C, Lu X, Xu Z. Performance Benefits of DataMPI: A Case Study with BigDataBench. BIG DATA BENCHMARKS, PERFORMANCE OPTIMIZATION, AND EMERGING HARDWARE 2014. [DOI: 10.1007/978-3-319-13021-7_9] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]

Ravindra P, Anyanwu K. Nesting Strategies for Enabling Nimble MapReduce Dataflows for Large RDF Data. INT J SEMANT WEB INF 2014. [DOI: 10.4018/ijswis.2014010101] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Ding L, Wang G, Xin J, Wang X, Huang S, Zhang R. ComMapReduce: An improvement of MapReduce with lightweight communication mechanisms. DATA KNOWL ENG 2013. [DOI: 10.1016/j.datak.2013.04.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

Miklošík A, Hvizdová E. Knowledge base cloud - a new approach to knowledge management systems architecture. ACTA UNIVERSITATIS AGRICULTURAE ET SILVICULTURAE MENDELIANAE BRUNENSIS 2013. [DOI: 10.11118/actaun201260040267] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open

Qiu J, Ekanayake J, Gunarathne T, Choi JY, Bae SH, Ruan Y, Ekanayake S, Wu S, Beason S, Fox G, Rho M, Tang H. Data Intensive Computing for Bioinformatics. Bioinformatics 2013. [DOI: 10.4018/978-1-4666-3604-0.ch016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open

Xie J, Tian Y, Yin S, Zhang J, Ruan X, Qin X. Adaptive Preshuffling in Hadoop Clusters. ACTA ACUST UNITED AC 2013. [DOI: 10.1016/j.procs.2013.05.422] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

ETLMR: A Highly Scalable Dimensional ETL Framework Based on MapReduce. LECTURE NOTES IN COMPUTER SCIENCE 2013. [DOI: 10.1007/978-3-642-37574-3_1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]

Epstein J, Black AP, Peyton-Jones S. Towards Haskell in the cloud. ACTA ACUST UNITED AC 2012. [DOI: 10.1145/2096148.2034690] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]

Aksanli B, Venkatesh J, Zhang L, Rosing T. Utilizing green energy prediction to schedule mixed batch and service jobs in data centers. ACTA ACUST UNITED AC 2012. [DOI: 10.1145/2094091.2094105] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]

Lin J, Dyer C. Data-Intensive Text Processing with MapReduce. ACTA ACUST UNITED AC 2010. [DOI: 10.2200/s00274ed1v01y201006hlt007] [Citation(s) in RCA: 190] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]

Caulfield AM, Grupp LM, Swanson S. Gordon. ACTA ACUST UNITED AC 2009. [DOI: 10.1145/1508284.1508270] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Hari P, Ko K, Koukoumidis E, Kremer U, Martonosi M, Ottoni D, Peh LS, Zhang P. SARANA: language, compiler and run-time system support for spatially aware and resource-aware mobile computing. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2008;366:3699-3708. [PMID: 18672455 DOI: 10.1098/rsta.2008.0127] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]