51
|
Voukelatou V, Miliou I, Giannotti F, Pappalardo L. Understanding peace through the world news. EPJ DATA SCIENCE 2022; 11:2. [PMID: 35079561 PMCID: PMC8777429 DOI: 10.1140/epjds/s13688-022-00315-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 12/26/2021] [Indexed: 05/06/2023]
Abstract
Peace is a principal dimension of well-being and is the way out of inequity and violence. Thus, its measurement has drawn the attention of researchers, policymakers, and peacekeepers. During the last years, novel digital data streams have drastically changed the research in this field. The current study exploits information extracted from a new digital database called Global Data on Events, Location, and Tone (GDELT) to capture peace through the Global Peace Index (GPI). Applying predictive machine learning models, we demonstrate that news media attention from GDELT can be used as a proxy for measuring GPI at a monthly level. Additionally, we use explainable AI techniques to obtain the most important variables that drive the predictions. This analysis highlights each country's profile and provides explanations for the predictions, and particularly for the errors and the events that drive these errors. We believe that digital data exploited by researchers, policymakers, and peacekeepers, with data science tools as powerful as machine learning, could contribute to maximizing the societal benefits and minimizing the risks to peace. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1140/epjds/s13688-022-00315-z.
Collapse
Affiliation(s)
- Vasiliki Voukelatou
- Scuola Normale Superiore, Pisa, Italy
- Institute of Information Science and Technologies, National Research Council (ISTI-CNR), Pisa, Italy
| | - Ioanna Miliou
- Department of Computer & Systems Sciences, Stockholm University, Stockholm, Sweden
| | - Fosca Giannotti
- Scuola Normale Superiore, Pisa, Italy
- Institute of Information Science and Technologies, National Research Council (ISTI-CNR), Pisa, Italy
| | - Luca Pappalardo
- Institute of Information Science and Technologies, National Research Council (ISTI-CNR), Pisa, Italy
| |
Collapse
|
52
|
Lai S, Sorichetta A, Steele J, Ruktanonchai CW, Cunningham AD, Rogers G, Koper P, Woods D, Bondarenko M, Ruktanonchai NW, Shi W, Tatem AJ. Global holiday datasets for understanding seasonal human mobility and population dynamics. Sci Data 2022; 9:17. [PMID: 35058466 PMCID: PMC8776767 DOI: 10.1038/s41597-022-01120-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Accepted: 12/10/2021] [Indexed: 11/17/2022] Open
Abstract
Public and school holidays have important impacts on population mobility and dynamics across multiple spatial and temporal scales, subsequently affecting the transmission dynamics of infectious diseases and many socioeconomic activities. However, worldwide data on public and school holidays for understanding their changes across regions and years have not been assembled into a single, open-source and multitemporal dataset. To address this gap, an open access archive of data on public and school holidays in 2010-2019 across the globe at daily, weekly, and monthly timescales was constructed. Airline passenger volumes across 90 countries from 2010 to 2018 were also assembled to illustrate the usage of the holiday data for understanding the changing spatiotemporal patterns of population movements.
Collapse
Affiliation(s)
- Shengjie Lai
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton, SO17 1BJ, UK.
| | - Alessandro Sorichetta
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton, SO17 1BJ, UK
| | - Jessica Steele
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton, SO17 1BJ, UK
| | - Corrine W Ruktanonchai
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton, SO17 1BJ, UK
- Population Health Sciences, Virginia Tech, Blacksburg, VA, 24061, USA
| | - Alexander D Cunningham
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton, SO17 1BJ, UK
| | - Grant Rogers
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton, SO17 1BJ, UK
| | - Patrycja Koper
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton, SO17 1BJ, UK
| | - Dorothea Woods
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton, SO17 1BJ, UK
| | - Maksym Bondarenko
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton, SO17 1BJ, UK
| | - Nick W Ruktanonchai
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton, SO17 1BJ, UK
- Population Health Sciences, Virginia Tech, Blacksburg, VA, 24061, USA
| | - Weifeng Shi
- School of Public Health, Shandong First Medical University & Shandong Academy of Medical Sciences, Taian, 271000, China
| | - Andrew J Tatem
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton, SO17 1BJ, UK
| |
Collapse
|
53
|
Microestimates of wealth for all low- and middle-income countries. Proc Natl Acad Sci U S A 2022; 119:2113658119. [PMID: 35017299 PMCID: PMC8784134 DOI: 10.1073/pnas.2113658119] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/14/2021] [Indexed: 11/18/2022] Open
Abstract
Many critical policy decisions rely on data about the geographic distribution of wealth and poverty, yet only half of all countries have access to adequate data on poverty. This paper creates a complete and publicly available set of microestimates of the distribution of relative poverty and wealth across all 135 low- and middle-income countries. We provide extensive evidence of the accuracy and validity of the estimates and also provide confidence intervals for each microestimate to facilitate responsible downstream use. These methods and maps provide a set of tools to study economic development and growth, guide interventions, monitor and evaluate policies, and track the elimination of poverty worldwide. Many critical policy decisions, from strategic investments to the allocation of humanitarian aid, rely on data about the geographic distribution of wealth and poverty. Yet many poverty maps are out of date or exist only at very coarse levels of granularity. Here we develop microestimates of the relative wealth and poverty of the populated surface of all 135 low- and middle-income countries (LMICs) at 2.4 km resolution. The estimates are built by applying machine-learning algorithms to vast and heterogeneous data from satellites, mobile phone networks, and topographic maps, as well as aggregated and deidentified connectivity data from Facebook. We train and calibrate the estimates using nationally representative household survey data from 56 LMICs and then validate their accuracy using four independent sources of household survey data from 18 countries. We also provide confidence intervals for each microestimate to facilitate responsible downstream use. These estimates are provided free for public use in the hope that they enable targeted policy response to the COVID-19 pandemic, provide the foundation for insights into the causes and consequences of economic development and growth, and promote responsible policymaking in support of sustainable development.
Collapse
|
54
|
Kashyap R. Has demography witnessed a data revolution? Promises and pitfalls of a changing data ecosystem. Population Studies 2021; 75:47-75. [PMID: 34902280 DOI: 10.1080/00324728.2021.1969031] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Over the past 25 years, technological improvements that have made the collection, transmission, storage, and analysis of data significantly easier and more cost efficient have ushered in what has been described as the 'big data' era or the 'data revolution'. In the social sciences context, the data revolution has often been characterized in terms of increased volume and variety of data, and much excitement has focused on the growing opportunity to repurpose data that are the by-products of the digitalization of social life for research. However, many features of the data revolution are not new for demographers, who have long used large-scale population data and been accustomed to repurposing imperfect data not originally collected for research. Nevertheless, I argue that demography, too, has been affected by the data revolution, and the data ecosystem for demographic research has been significantly enriched. These developments have occurred across two dimensions. The first involves the augmented granularity, variety, and opportunities for linkage that have bolstered the capabilities of 'old' big population data sources, such as censuses, administrative data, and surveys. The second involves the growing interest in and use of 'new' big data sources, such as 'digital traces' generated through internet and mobile phone use, and related to this, the emergence of 'digital demography'. These developments have enabled new opportunities and offer much promise moving forward, but they also raise important ethical, technical, and conceptual challenges for the field.
Collapse
|
55
|
Tan S, Lai S, Fang F, Cao Z, Sai B, Song B, Dai B, Guo S, Liu C, Cai M, Wang T, Wang M, Li J, Chen S, Qin S, Floyd JR, Cao Z, Tan J, Sun X, Zhou T, Zhang W, Tatem AJ, Holme P, Chen X, Lu X. Mobility in China, 2020: a tale of four phases. Natl Sci Rev 2021; 8:nwab148. [PMID: 34876997 PMCID: PMC8645011 DOI: 10.1093/nsr/nwab148] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 08/05/2021] [Accepted: 08/10/2021] [Indexed: 02/05/2023] Open
Abstract
2020 was an unprecedented year, with rapid and drastic changes in human mobility due to the COVID-19 pandemic. To understand the variation in commuting patterns among the Chinese population across stable and unstable periods, we used nationwide mobility data from 318 million mobile phone users in China to examine the extreme fluctuations of population movements in 2020, ranging from the Lunar New Year travel season (chunyun), to the exceptional calm of COVID-19 lockdown, and then to the recovery period. We observed that cross-city movements, which increased substantially in chunyun and then dropped sharply during the lockdown, are primarily dependent on travel distance and the socio-economic development of cities. Following the Lunar New Year holiday, national mobility remained low until mid-February, and COVID-19 interventions delayed more than 72.89 million people returning to large cities. Mobility network analysis revealed clusters of highly connected cities, conforming to the social-economic division of urban agglomerations in China. While the mass migration back to large cities was delayed, smaller cities connected more densely to form new clusters. During the recovery period after travel restrictions were lifted, the netflows of over 55% city pairs reversed in direction compared to before the lockdown. These findings offer the most comprehensive picture of Chinese mobility at fine resolution across various scenarios in China and are of critical importance for decision making regarding future public-health-emergency response, transportation planning and regional economic development, among others.
Collapse
Affiliation(s)
- Suoyi Tan
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| | - Shengjie Lai
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton SO17 1BJ, UK
| | - Fan Fang
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| | - Ziqiang Cao
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| | - Bin Sai
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| | - Bing Song
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| | - Bitao Dai
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| | - Shuhui Guo
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| | - Chuchu Liu
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| | - Mengsi Cai
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| | - Tong Wang
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| | - Mengning Wang
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| | - Jiaxu Li
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| | - Saran Chen
- School of Mathematics and Big Data, Foshan University, Foshan 510000, China
| | - Shuo Qin
- State Key Laboratory on Blind Signal Processing, Chengdu 610041, China
| | - Jessica R Floyd
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton SO17 1BJ, UK
| | - Zhidong Cao
- State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
| | - Jing Tan
- Chinese Evidence-Based Medicine Center, National Clinical Research Center for Geriatrics, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Xin Sun
- Chinese Evidence-Based Medicine Center, National Clinical Research Center for Geriatrics, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Tao Zhou
- Big Data Research Center, University of Electronic Science and Technology of China, Chengdu 611713, China
| | - Wei Zhang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu 610047, China
| | - Andrew J Tatem
- WorldPop, School of Geography and Environmental Science, University of Southampton, Southampton SO17 1BJ, UK
| | - Petter Holme
- Tokyo Tech World Hub Research Initiative, Institute of Innovative Research, Tokyo Institute of Technology, Tokyo 226-8503, Japan
| | - Xiaohong Chen
- School of Business, Central South University, Changsha 410083, China
| | - Xin Lu
- College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
| |
Collapse
|
56
|
Studying segregation in Estonia using call data records. SOCIAL NETWORK ANALYSIS AND MINING 2021. [DOI: 10.1007/s13278-021-00817-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
57
|
Hadam S. Pendler Mobil: Die Verwendung von Mobilfunkdaten zur Unterstützung der amtlichen Pendlerstatistik. ASTA WIRTSCHAFTS- UND SOZIALSTATISTISCHES ARCHIV 2021. [PMCID: PMC8588940 DOI: 10.1007/s11943-021-00294-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Die Verfügbarkeit von kleinräumigen und aktuellen Pendlerverflechtungen sind für politische wie auch kommunale Entscheidungsfindungen von hoher Bedeutung. Aus dem Pendlerverhalten lassen sich Rückschlüsse auf Arbeitsmarktregionen und die Verteilung der Wohnbevölkerung ziehen, was unter anderem zu einer laufenden Verbesserung der Verkehrsinfrastruktur beiträgt. Die dafür notwendigen Daten veröffentlicht die amtliche Pendlerrechnung. Jedoch weist sie Verbesserungspotenzial im Hinblick auf die zeitliche und räumliche Darstellung der Pendlerverflechtungen von Erwerbstätigen sowie eine fachliche Erweiterung hinsichtlich der Bildungspendler auf. Dieser Artikel beschreibt die mit dem Projekt Pendler Mobil geprüften Erweiterungsmöglichkeiten der amtlichen Pendlerrechnung auf Basis von Quelle-Ziel-Matrizen aus Mobilfunkdaten. Mobilfunkdaten stellen aufgrund ihrer zeitlichen Aktualität und räumlich feinen Auflösung eine robuste Datengrundlage zur flexiblen Abbildung von potenziellen und regelmäßigen Pendlerbewegungen dar. Die potenzielle Leistungsfähigkeit der Mobilfunkdaten ermöglicht damit eine externe Validierung bestehender Pendlerrechnungen oder Pendlerstatistiken sowie eine beiderseitige Ergänzung zur Ermittlung und Darstellungen weiterer Formen des Pendelns der Erwerbsbevölkerung. Am Fallbeispiel des Bundeslandes Nordrhein-Westfalen werden im Folgenden Gemeinsamkeiten und Unterschiede der übereinstimmenden Pendlerverflechtungen auf Basis von Mobilfunkdaten und der amtlichen Pendlerrechnung erörtert. Dabei gehen wir auf die Herausforderungen der Aufbereitung und Definition geeigneter Mobilfunkdaten durch den Datenanbieter sowie weitere Einflüsse auf die Mobilfunkdaten, wie bspw. durch die zurückgelegte Distanz oder die Verweilzeiten mobiler Aktivitäten, ein. Besonders die Unterschätzung der mobilen Pendlerströme im Vergleich zur amtlichen Pendlerrechnung legt nahe, Modifizierungsansätze der Mobilfunkdaten zu diskutieren. Im Ergebnis können die vorliegenden Mobilfunkdaten potenziell die amtliche Pendlerrechnung durch kleinräumige Pendlerbewegungen in Städten in Form einer erweiterten Zielorts-Bestimmung unterstützen und die Identifizierung von stark frequentierten Arbeitsorten in Städten ermöglichen.
Collapse
|
58
|
Simini F, Barlacchi G, Luca M, Pappalardo L. A Deep Gravity model for mobility flows generation. Nat Commun 2021; 12:6576. [PMID: 34772925 PMCID: PMC8589995 DOI: 10.1038/s41467-021-26752-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Accepted: 10/21/2021] [Indexed: 11/28/2022] Open
Abstract
The movements of individuals within and among cities influence critical aspects of our society, such as well-being, the spreading of epidemics, and the quality of the environment. When information about mobility flows is not available for a particular region of interest, we must rely on mathematical models to generate them. In this work, we propose Deep Gravity, an effective model to generate flow probabilities that exploits many features (e.g., land use, road network, transport, food, health facilities) extracted from voluntary geographic data, and uses deep neural networks to discover non-linear relationships between those features and mobility flows. Our experiments, conducted on mobility flows in England, Italy, and New York State, show that Deep Gravity achieves a significant increase in performance, especially in densely populated regions of interest, with respect to the classic gravity model and models that do not use deep neural networks or geographic data. Deep Gravity has good generalization capability, generating realistic flows also for geographic areas for which there is no data availability for training. Finally, we show how flows generated by Deep Gravity may be explained in terms of the geographic features and highlight crucial differences among the three considered countries interpreting the model's prediction with explainable AI techniques.
Collapse
Affiliation(s)
- Filippo Simini
- University of Bristol, Department of Engineering Mathematics, Bristol, UK
- The Alan Turing Institute, London, UK
- Argonne Leadership Computing Facility, Argonne National Laboratory Lemont, Lemont, IL, USA
| | | | - Massimilano Luca
- Fondazione Bruno Kessler, Trento, Italy
- Free University of Bolzano, Bolzano, Italy
| | - Luca Pappalardo
- Institute of Information Science and Technologies (ISTI), National Research Council (CNR), Pisa, Italy.
| |
Collapse
|
59
|
Charles-Edwards E, Corcoran J, Loginova J, Panczak R, White G, Whitehead A. A data fusion approach to the estimation of temporary populations: An application to Australia. PLoS One 2021; 16:e0259377. [PMID: 34762671 PMCID: PMC8584718 DOI: 10.1371/journal.pone.0259377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 10/18/2021] [Indexed: 11/23/2022] Open
Abstract
This study establishes a new method for estimating the monthly Average Population Present (APP) in Australian regions. Conventional population statistics, which enumerate people where they usually live, ignore the significant spatial mobility driving short term shifts in population numbers. Estimates of the temporary or ambient population of a region have several important applications including the provision of goods and services, emergency preparedness and serve as more appropriate denominators for a range of social statistics. This paper develops a flexible modelling framework to generate APP estimates from an integrated suite of conventional and novel data sources. The resultant APP estimates reveal the considerable seasonality in small area populations across Australia’s regions alongside the contribution of domestic and international visitors as well as absent residents to the observed monthly variations. The modelling framework developed in the paper is conceived in a manner such that it can be adapted and re-deployed both for use with alternative data sources as well as other situational contexts for the estimation of temporary populations.
Collapse
Affiliation(s)
- Elin Charles-Edwards
- Queensland Centre for Population Research, the University of Queensland, St Lucia, QLD, Australia
| | - Jonathan Corcoran
- Queensland Centre for Population Research, the University of Queensland, St Lucia, QLD, Australia
| | - Julia Loginova
- Queensland Centre for Population Research, the University of Queensland, St Lucia, QLD, Australia
- * E-mail:
| | - Radoslaw Panczak
- Queensland Centre for Population Research, the University of Queensland, St Lucia, QLD, Australia
| | - Gentry White
- Centre for Data Science, Queensland University of Technology, Brisbane, QLD, Australia
| | - Alexander Whitehead
- Queensland Centre for Population Research, the University of Queensland, St Lucia, QLD, Australia
| |
Collapse
|
60
|
Quantifying and Characterizing Urban Leisure Activities by Merging Multiple Sensing Big Data: A Case Study of Nanjing, China. LAND 2021. [DOI: 10.3390/land10111214] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Studying the spatiotemporal pattern of urban leisure activities helps us to understand the development and utilization of urban public space, people’s quality of life, and the happiness index. It has outstanding value for improving rational resource allocation, stimulating urban vitality, and promoting sustainable urban development. This study aims at discovering the spatiotemporal distribution patterns and people’s behavioral preferences of urban leisure activities using quantitative technology merging ubiquitous sensing big data. On the basis of modeling individual activity traces using mobile signaling data (MSD), we developed a space-time constrained dasymetric interpolation method to refine the urban leisure activity spatiotemporal distribution. We conducted an empirical study in Nanjing, China. The results indicate that leisure plays an essential role in daily human life, both on weekdays and weekends. Significant differences exist in spatiotemporal and type selection in urban leisure. The weekend afternoon is the breakout period of leisure, and entertainment is the most popular leisure activity. Furthermore, the correlation between leisure resource allocation and leisure activity participation was argued. Our findings confirm that data-driven approaches would be a promising method for analyzing human behavior patterns; therefore, assisting in land planning decisions and promoting social justice and sustainability.
Collapse
|
61
|
Yu H, Shao XT, Liu SY, Pei W, Kong XP, Wang Z, Wang DG. Estimating dynamic population served by wastewater treatment plants using location-based services data. ENVIRONMENTAL GEOCHEMISTRY AND HEALTH 2021; 43:4627-4635. [PMID: 33928448 DOI: 10.1007/s10653-021-00954-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Accepted: 04/20/2021] [Indexed: 06/12/2023]
Abstract
Wastewater-based epidemiology is a useful approach to estimate population-level exposure to a wide range of substances (e.g., drugs, chemicals, biological agents) by wastewater analysis. An important uncertainty in population normalized loads generated is related to the size and variability of the actual population served by wastewater treatment plants (WWTPs). Here, we built a population model using location-based services (LBS) data to estimate dynamic consumption of illicit drugs. First, the LBS data from Tencent Location Big Data and resident population were used to train a linear population model for estimating population (r2 = 0.92). Then, the spatiotemporal accuracy of the population model was validated. In terms of temporal accuracy, we compared the model-based population with the time-aligned ammonia nitrogen (NH4-N) population within the WWTP of SEG, showing a mean squared error of < 10%. In terms of spatial accuracy, we estimated the model-based population of 42 WWTPs in Dalian and compared it with the NH4-N and design population, indicating good consistency overall (5% less than NH4-N and 4% less than design). Furthermore, methamphetamine consumption and prevalence based on the model were calculated with an average of 111 mg/day/1000 inhabitants and 0.24%, respectively, and dynamically displayed on a visualization system for real-time monitoring. Our study provided a dynamic and accurate population for estimating the population-level use of illicit drugs, much improving the temporal and spatial trend analysis of drug use. Furthermore, accurate information on drug use could be used to assess population health risks in a community.
Collapse
Affiliation(s)
- Han Yu
- College of Environmental Science and Engineering, Dalian Maritime University, No. 1 Linghai Road, Dalian, 116026, China
| | - Xue-Ting Shao
- College of Environmental Science and Engineering, Dalian Maritime University, No. 1 Linghai Road, Dalian, 116026, China
| | - Si-Yu Liu
- College of Environmental Science and Engineering, Dalian Maritime University, No. 1 Linghai Road, Dalian, 116026, China
| | - Wei Pei
- College of Environmental Science and Engineering, Dalian Maritime University, No. 1 Linghai Road, Dalian, 116026, China
| | - Xiang-Peng Kong
- College of Environmental Science and Engineering, Dalian Maritime University, No. 1 Linghai Road, Dalian, 116026, China
| | - Zhuang Wang
- Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, Jiangsu Key Laboratory of Atmospheric Environment Monitoring and Pollution Control, School of Environmental Science and Engineering, Nanjing University of Information Science and Technology, No. 219 Ningliu Road, Nanjing, 210044, China
| | - De-Gao Wang
- College of Environmental Science and Engineering, Dalian Maritime University, No. 1 Linghai Road, Dalian, 116026, China.
| |
Collapse
|
62
|
Salat H, Schläpfer M, Smoreda Z, Rubrichi S. Analysing the impact of electrification on rural attractiveness in Senegal with mobile phone data. ROYAL SOCIETY OPEN SCIENCE 2021; 8:201898. [PMID: 34754490 PMCID: PMC8493192 DOI: 10.1098/rsos.201898] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Accepted: 09/01/2021] [Indexed: 06/13/2023]
Abstract
Reliable and affordable access to electricity has become one of the basic needs for humans and is, as such, at the top of the development agenda. It contributes to socio-economic development by transforming the whole spectrum of people's lives-food, education, healthcare. It spurs new economic opportunities, thus improving livelihoods. Using a comprehensive dataset of pseudonymized mobile phone records, we analyse the impact of electrification on attractiveness for rural areas in Senegal. We extract communication and mobility flows from call detail records and show that electrification is positively and specifically correlated with centrality measures within the communication network and with the volume of incoming visitors. This increased influence is however circumscribed to a limited spatial extent, creating a complex competition with nearby areas. Nevertheless, we found that the volume of visitors between any two sites could be well predicted from the level of electrification at the destination and the living standard at the origin. In view of these results, we discuss how to obtain the best outcomes from a rural electrification planning strategy. We determine that electrifying clusters of rural sites is a better solution than centralizing electricity supplies to maximize the development of specifically targeted sites.
Collapse
Affiliation(s)
- Hadrien Salat
- Sociology and Economics of Networks and Services Department, Orange Innovation Research, 44 Avenue de la République, Châtillon 92320, France
- Future Cities Laboratory, Singapore-ETH Centre, ETH Zürich, 1 Create Way, CREATE Tower #06-01, Singapore 138602, Republic of Singapore
| | - Markus Schläpfer
- Future Cities Laboratory, Singapore-ETH Centre, ETH Zürich, 1 Create Way, CREATE Tower #06-01, Singapore 138602, Republic of Singapore
- School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore 639798, Republic of Singapore
| | - Zbigniew Smoreda
- Sociology and Economics of Networks and Services Department, Orange Innovation Research, 44 Avenue de la République, Châtillon 92320, France
| | - Stefania Rubrichi
- Sociology and Economics of Networks and Services Department, Orange Innovation Research, 44 Avenue de la République, Châtillon 92320, France
| |
Collapse
|
63
|
Predicting cell phone adoption metrics using machine learning and satellite imagery. TELEMATICS AND INFORMATICS 2021. [DOI: 10.1016/j.tele.2021.101622] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
64
|
Rathinam F, Khatua S, Siddiqui Z, Malik M, Duggal P, Watson S, Vollenweider X. Using big data for evaluating development outcomes: A systematic map. CAMPBELL SYSTEMATIC REVIEWS 2021; 17:e1149. [PMID: 37051451 PMCID: PMC8354555 DOI: 10.1002/cl2.1149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
BACKGROUND Policy makers need access to reliable data to monitor and evaluate the progress of development outcomes and targets such as sustainable development outcomes (SDGs). However, significant data and evidence gaps remain. Lack of resources, limited capacity within governments and logistical difficulties in collecting data are some of the reasons for the data gaps. Big data-that is digitally generated, passively produced and automatically collected-offers a great potential for answering some of the data needs. Satellite and sensors, mobile phone call detail records, online transactions and search data, and social media are some of the examples of big data. Integrating big data with the traditional household surveys and administrative data can complement data availability, quality, granularity, accuracy and frequency, and help measure development outcomes temporally and spatially in a number of new ways.The study maps different sources of big data onto development outcomes (based on SDGs) to identify current evidence base, use and the gaps. The map provides a visual overview of existing and ongoing studies. This study also discusses the risks, biases and ethical challenges in using big data for measuring and evaluating development outcomes. The study is a valuable resource for evaluators, researchers, funders, policymakers and practitioners in their effort to contributing to evidence informed policy making and in achieving the SDGs. OBJECTIVES Identify and appraise rigorous impact evaluations (IEs), systematic reviews and the studies that have innovatively used big data to measure any development outcomes with special reference to difficult contexts. SEARCH METHODS A number of general and specialised data bases and reporsitories of organisations were searched using keywords related to big data by an information specialist. SELECTION CRITERIA The studies were selected on basis of whether they used big data sources to measure or evaluate development outcomes. DATA COLLECTION AND ANALYSIS Data collection was conducted using a data extraction tool and all extracted data was entered into excel and then analysed using Stata. The data analysis involved looking at trends and descriptive statistics only. MAIN RESULTS The search yielded over 17,000 records, which we then screened down to 437 studies which became the foundation of our systematic map. We found that overall, there is a sizable and rapidly growing number of measurement studies using big data but a much smaller number of IEs. We also see that the bulk of the big data sources are machine-generated (mostly satellites) represented in the light blue. We find that satellite data was used in over 70% of the measurement studies and in over 80% of the IEs. AUTHORS' CONCLUSIONS This map gives us a sense that there is a lot of work being done to develop appropriate measures using big data which could subsequently be used in IEs. Information on costs, ethics, transparency is lacking in the studies and more work is needed in this area to understand the efficacies related to the use of big data. There are a number of outcomes which are not being studied using big data, either due to the lack to applicability such as education or due to lack of awareness about the new methods and data sources. The map points to a number of gaps as well as opportunities where future researchers can conduct research.
Collapse
|
65
|
Semantic Enhancement of Human Urban Activity Chain Construction Using Mobile Phone Signaling Data. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2021. [DOI: 10.3390/ijgi10080545] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Data-driven urban human activity mining has become a hot topic of urban dynamic modeling and analysis. Semantic activity chain modeling with activity purpose provides scientific methodological support for the analysis and decision-making of human behavior, urban planning, traffic management, green sustainable development, etc. However, the spatial and temporal uncertainty of the ubiquitous mobile sensing data brings a huge challenge for modeling and analyzing human activities. Existing approaches for modeling and identifying human activities based on massive social sensing data rely on a large number of valid supervised samples or limited prior knowledge. This paper proposes an effective methodology for building human activity chains based on mobile phone signaling data and labeling activity purpose semantics to analyze human activity patterns, spatiotemporal behavior, and urban dynamics. We fully verified the effectiveness and accuracy of the proposed method in human daily activity process construction and activity purpose identification through accuracy comparison and spatial-temporal distribution exploration. This study further confirms the possibility of using big data to observe urban human spatiotemporal behavior.
Collapse
|
66
|
Nice J, Nahusenay H, Eckert E, Eisele TP, Ashton RA. Estimating malaria chemoprevention and vector control coverage using program and campaign data: A scoping review of current practices and opportunities. J Glob Health 2021; 10:020413. [PMID: 33110575 PMCID: PMC7568932 DOI: 10.7189/jogh.10.020413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Background Accurate estimation of intervention coverage is a vital component of malaria program monitoring and evaluation, both for process evaluation (how well program targets are achieved), and impact evaluation (whether intervention coverage had an impact on malaria burden). There is growing interest in maximizing the utility of program data to generate interim estimates of intervention coverage in the periods between large-scale cross-sectional surveys (the gold standard). As such, this study aimed to identify relevant concepts and themes that may guide future optimization of intervention coverage estimation using routinely collected data, or data collected during and following intervention campaigns, with a particular focus on strategies to define the denominator. Methods We conducted a scoping review of current practices to estimate malaria intervention coverage for insecticide-treated nets (ITNs); indoor residual spray (IRS); intermittent preventive treatment in pregnancy (IPTp); mass drug administration (MDA); and seasonal malaria chemoprevention (SMC) interventions; case management was excluded. Multiple databases were searched for relevant articles published from January 1, 2015 to June 1, 2018. Additionally, we identified and included other guidance relevant to estimating population denominators, with a focus on innovative techniques. Results While program data have the potential to provide intervention coverage data, there are still substantial challenges in selecting appropriate denominators. The review identified a lack of consistency in how coverage was defined and reported for each intervention type, with denominator estimation methods not clearly or consistently reported, and denominator estimates rarely triangulated with other data sources to present the feasible range of denominator values and consequently the range of likely coverage estimates. Conclusions Though household survey-based estimates of intervention coverage remain the gold standard, efforts should be made to further standardize practices for generating interim measurements of intervention coverage from program data, and for estimating and reporting population denominators. This includes fully describing any projections or adjustments made to existing census or population data, exploring opportunities to validate available data by comparing with other sources, and explaining how the denominator has been restricted (or not) to reflect exclusion criteria.
Collapse
Affiliation(s)
- Johanna Nice
- MEASURE Evaluation, Centre for Applied Malaria Research and Evaluation, Tulane School of Public Health and Tropical Medicine, New Orleans, Louisiana, USA
| | - Honelgn Nahusenay
- MEASURE Evaluation, Centre for Applied Malaria Research and Evaluation, Tulane School of Public Health and Tropical Medicine, New Orleans, Louisiana, USA
| | - Erin Eckert
- U.S. President's Malaria Initiative, United States Agency for International Development, Washington, D.C., USA.,RTI International, Washington, D.C., USA
| | - Thomas P Eisele
- Centre for Applied Malaria Research and Evaluation, Tulane School of Public Health and Tropical Medicine, New Orleans, Louisiana, USA
| | - Ruth A Ashton
- MEASURE Evaluation, Centre for Applied Malaria Research and Evaluation, Tulane School of Public Health and Tropical Medicine, New Orleans, Louisiana, USA
| |
Collapse
|
67
|
Abstract
Abstract
Aggregated data from mobile network operators (MNOs) can provide snapshots of population mobility patterns in real time, generating valuable insights when other more traditional data sources are unavailable or out-of-date. The COVID-19 pandemic has highlighted the value of remotely-collected, high-frequency, localized data in inferring the economic impact of shocks to inform decision-making. However, proper protocols must be put in place to ensure end-to-end user-confidentiality and compliance with international best practice. We demonstrate how to build such a data pipeline, channeling data from MNOs through the national regulator to the analytical users, who in turn produce policy-relevant insights. The aggregated indicators analyzed offer a detailed snapshot of the decrease in mobility and increased out-migration from urban to rural areas during the COVID-19 lockdown. Recommendations based on lessons learned from this process can inform engagements with other regulators in creating data pipelines to inform policy-making.
Collapse
|
68
|
Pappalardo L, Ferres L, Sacasa M, Cattuto C, Bravo L. Evaluation of home detection algorithms on mobile phone data using individual-level ground truth. EPJ DATA SCIENCE 2021; 10:29. [PMID: 34094810 PMCID: PMC8170634 DOI: 10.1140/epjds/s13688-021-00284-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Accepted: 05/12/2021] [Indexed: 06/12/2023]
Abstract
Inferring mobile phone users' home location, i.e., assigning a location in space to a user based on data generated by the mobile phone network, is a central task in leveraging mobile phone data to study social and urban phenomena. Despite its widespread use, home detection relies on assumptions that are difficult to check without ground truth, i.e., where the individual who owns the device resides. In this paper, we present a dataset that comprises the mobile phone activity of sixty-five participants for whom the geographical coordinates of their residence location are known. The mobile phone activity refers to Call Detail Records (CDRs), eXtended Detail Records (XDRs), and Control Plane Records (CPRs), which vary in their temporal granularity and differ in the data generation mechanism. We provide an unprecedented evaluation of the accuracy of home detection algorithms and quantify the amount of data needed for each stream to carry out successful home detection for each stream. Our work is useful for researchers and practitioners to minimize data requests and maximize the accuracy of the home antenna location. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1140/epjds/s13688-021-00284-9.
Collapse
Affiliation(s)
- Luca Pappalardo
- Institute of Information Science and Technologies (ISTI), National Research Council (CNR), Pisa, Italy
| | - Leo Ferres
- Faculty of Engineering, Universidad del Desarrollo, Santiago, Chile
- Telefónica R&D, Santiago, Chile
- ISI Foundation, Turin, Italy
| | | | - Ciro Cattuto
- University of Turin, Turin, Italy
- ISI Foundation, Turin, Italy
| | - Loreto Bravo
- Faculty of Engineering, Universidad del Desarrollo, Santiago, Chile
- Telefónica R&D, Santiago, Chile
| |
Collapse
|
69
|
Assessment of the Dynamic Exposure to PM 2.5 Based on Hourly Cell Phone Location and Land Use Regression Model in Beijing. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18115884. [PMID: 34070868 PMCID: PMC8199116 DOI: 10.3390/ijerph18115884] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Revised: 05/24/2021] [Accepted: 05/26/2021] [Indexed: 12/04/2022]
Abstract
The spatiotemporal locations of large populations are difficult to clearly characterize using traditional exposure assessment, mainly due to their complicated daily intraurban activities. This study aimed to extract hourly locations for the total population of Beijing based on cell phone data and assess their dynamic exposure to ambient PM2.5. The locations of residents were located by the cellular base stations that were keeping in contact with their cell phones. The diurnal activity pattern of the total population was investigated through the dynamic spatial distribution of all of the cell phones. The outdoor PM2.5 concentration was predicted in detail using a land use regression (LUR) model. The hourly PM2.5 map was overlapped with the hourly distribution of people for dynamic PM2.5 exposure estimation. For the mobile-derived total population, the mean level of PM2.5 exposure was 89.5 μg/m3 during the period from 2013 to 2015, which was higher than that reported for the census population (87.9 μg/m3). The hourly activity pattern showed that more than 10% of the total population commuted into the center of Beijing (e.g., the 5th ring road) during the daytime. On average, the PM2.5 concentration at workplaces was generally higher than in residential areas. The dynamic PM2.5 exposure pattern also varied with seasons. This study exhibited the strengths of mobile location in deriving the daily spatiotemporal activity patterns of the population in a megacity. This technology would refine future exposure assessment, including either small group cohort studies or city-level large population assessments.
Collapse
|
70
|
Khatib EJ, Perles Roselló MJ, Miranda-Páez J, Giralt V, Barco R. Mass Tracking in Cellular Networks for the COVID-19 Pandemic Monitoring. SENSORS (BASEL, SWITZERLAND) 2021; 21:3424. [PMID: 34069091 PMCID: PMC8155839 DOI: 10.3390/s21103424] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Revised: 04/27/2021] [Accepted: 05/04/2021] [Indexed: 12/19/2022]
Abstract
The year 2020 was marked by the emergence of the COVID-19 pandemic. After months of uncontrolled spread worldwide, a clear conclusion is that controlling the mobility of the general population can slow down the propagation of the pandemic. Tracking the location of the population enables better use of mobility limitation policies and the prediction of potential hotspots, as well as improved alert services to individuals that may have been exposed to the virus. With mobility in their core functionality and a high degree of penetration of mobile devices within the general population, cellular networks are an invaluable asset for this purpose. This paper shows an overview of the possibilities offered by cellular networks for the massive tacking of the population at different levels. The major privacy concerns are also reviewed and a specific use case is shown, correlating mobility and number of cases in the province of Málaga (Spain).
Collapse
Affiliation(s)
- Emil J. Khatib
- Department of Communications Engineering, Universidad de Málaga, 29071 Málaga, Spain;
| | | | - Jesús Miranda-Páez
- Department of Psychobiology and Methodology of Behavioral Sciences, Universidad de Málaga, 29071 Málaga, Spain;
| | - Victoriano Giralt
- Digital Transformation Vicerectorate, Universidad de Málaga, Innovation Director, 29071 Málaga, Spain;
| | - Raquel Barco
- Department of Communications Engineering, Universidad de Málaga, 29071 Málaga, Spain;
| |
Collapse
|
71
|
Introducing participatory fairness in emergency communication can support self-organization for survival. Sci Rep 2021; 11:7209. [PMID: 33785786 PMCID: PMC8010119 DOI: 10.1038/s41598-021-86635-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Accepted: 03/18/2021] [Indexed: 11/14/2022] Open
Abstract
Participatory resilience of disaster-struck communities requires reliable communication for self-organized rescue, as conventional communication infrastructure is damaged. Disasters often lead to blackouts preventing citizens from charging their phones, leading to disparity in battery charges and a digital divide in communication opportunities. We propose a value-based emergency communication system based on participatory fairness, ensuring equal communication opportunities for all, regardless of inequality in battery charge. The proposed infrastructure-less emergency communication network automatically and dynamically (i) assigns high-battery phones as hubs, (ii) adapts the topology to changing battery charges, and (iii) self-organizes to remain robust and reliable when links fail or phones leave the network. The novelty of the proposed mobile protocol compared to mesh communication networks is demonstrated by comparative agent-based simulations. An evaluation using the Gini coefficient demonstrates that our network design results in fairer participation of all devices and a longer network lifetime, benefiting the community and its participants.
Collapse
|
72
|
Goel R, Sharma R, Aasa A. Understanding gender segregation through Call Data Records: An Estonian case study. PLoS One 2021; 16:e0248212. [PMID: 33765003 PMCID: PMC7993617 DOI: 10.1371/journal.pone.0248212] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Accepted: 02/09/2021] [Indexed: 11/26/2022] Open
Abstract
Understanding segregation plays a significant role in determining the development pathways of a country as it can help governmental and other concerned agencies to prepare better-targeted policies for the needed groups. However, inferring segregation through alternative data, apart from governmental surveys remains limited due to the non-availability of representative datasets. In this work, we utilize Call Data Records (CDR) provided by one of Estonia’s major telecom operators to research the complexities of social interaction and human behavior in order to explain gender segregation. We analyze the CDR with two objectives. First, we study gender segregation by exploring the social network interactions of the CDR. We find that the males are tightly linked which allows information to spread faster among males compared to females. Second, we perform the micro-analysis using various users’ characteristics such as age, language, and location. Our findings show that the prime working-age population (i.e., (24,54] years) is more segregated than others. We also find that the Estonian-speaking population (both males and females) are more likely to interact with other Estonian-speaking individuals of the same gender. Further to ensure the quality of this dataset, we compare the CDR data features with publicly available Estonian census datasets. We observe that the CDR dataset is indeed a good representative of the Estonian population, which indicates that the findings of this study reasonably reflect the reality of gender segregation in the Estonian Landscape.
Collapse
Affiliation(s)
- Rahul Goel
- Institute of Computer Science, University of Tartu, Tartu, Estonia
| | - Rajesh Sharma
- Institute of Computer Science, University of Tartu, Tartu, Estonia
| | - Anto Aasa
- Department of Geography, University of Tartu, Tartu, Estonia
| |
Collapse
|
73
|
Coastal Tourism Spatial Planning at the Regional Unit: Identifying Coastal Tourism Hotspots Based on Social Media Data. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2021. [DOI: 10.3390/ijgi10030167] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
There is an increasing need for spatial planning to manage coastal tourism, and applying social media data has emerged as an effective strategy to support coastal tourism spatial planning. Researchers and decision-makers require spatially explicit information that effectively reveals the current visitation state of the region. The purpose of this study is to identify coastal tourism hotspots considering appropriate spatial units in the regional scale using social media data to examine the advantages and limitations of applying spatial hotspots to spatial planning. Data from Flickr and Twitter with 30″ spatial resolution were obtained from four South Korean regions. Coastal tourism hotspots were then derived using Getis-Ord Gi. Comparing the derived hotspot maps with the visitation rate distribution maps, the derived hotspot maps sufficiently identified the spatial influences of visitors and tourist attractions applicable for spatial planning. As the spatial autocorrelation of social media data differs based on the spatial unit, coastal tourism hotspots according to spatial unit are inevitably different. Spatial hotspots derived from the appropriate spatial unit using social media data are useful for coastal tourism spatial planning.
Collapse
|
74
|
Farzanehfar A, Houssiau F, de Montjoye YA. The risk of re-identification remains high even in country-scale location datasets. PATTERNS (NEW YORK, N.Y.) 2021; 2:100204. [PMID: 33748793 PMCID: PMC7961185 DOI: 10.1016/j.patter.2021.100204] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Revised: 11/27/2020] [Accepted: 01/07/2021] [Indexed: 11/30/2022]
Abstract
Although anonymous data are not considered personal data, recent research has shown how individuals can often be re-identified. Scholars have argued that previous findings apply only to small-scale datasets and that privacy is preserved in large-scale datasets. Using 3 months of location data, we (1) show the risk of re-identification to decrease slowly with dataset size, (2) approximate this decrease with a simple model taking into account three population-wide marginal distributions, and (3) prove that unicity is convex and obtain a linear lower bound. Our estimates show that 93% of people would be uniquely identified in a dataset of 60M people using four points of auxiliary information, with a lower bound at 22%. This lower bound increases to 87% when five points are available. Taken together, our results show how the privacy of individuals is very unlikely to be preserved even in country-scale location datasets.
Collapse
Affiliation(s)
- Ali Farzanehfar
- Department of Computing, Imperial College London, London SW7 2AZ, UK
| | | | | |
Collapse
|
75
|
Goff DC, Khan SS, Lloyd-Jones D, Arnett DK, Carnethon MR, Labarthe DR, Loop MS, Luepker RV, McConnell MV, Mensah GA, Mujahid MS, O'Flaherty ME, Prabhakaran D, Roger V, Rosamond WD, Sidney S, Wei GS, Wright JS. Bending the Curve in Cardiovascular Disease Mortality: Bethesda + 40 and Beyond. Circulation 2021; 143:837-851. [PMID: 33617315 PMCID: PMC7905830 DOI: 10.1161/circulationaha.120.046501] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
More than 40 years after the 1978 Bethesda Conference on the Declining Mortality from Coronary Heart Disease provided the scientific community with a blueprint for systematic analysis to understand declining rates of coronary heart disease, there are indications the decline has ended or even reversed despite advances in our knowledge about the condition and treatment. Recent data show a more complex situation, with mortality rates for overall cardiovascular disease, including coronary heart disease and stroke, decelerating, whereas those for heart failure are increasing. To mark the 40th anniversary of the Bethesda Conference, the National Heart, Lung, and Blood Institute and the American Heart Association cosponsored the "Bending the Curve in Cardiovascular Disease Mortality: Bethesda + 40" symposium. The objective was to examine the immediate and long-term outcomes of the 1978 conference and understand the current environment. Symposium themes included trends and future projections in cardiovascular disease (in the United States and internationally), the evolving obesity and diabetes epidemics, and harnessing emerging and innovative opportunities to preserve and promote cardiovascular health and prevent cardiovascular disease. In addition, participant-led discussion explored the challenges and barriers in promoting cardiovascular health across the lifespan and established a potential framework for observational research and interventions that would begin in early childhood (or ideally in utero). This report summarizes the relevant research, policy, and practice opportunities discussed at the symposium.
Collapse
Affiliation(s)
- David Calvin Goff
- Division of Cardiovascular Sciences (D.C.G., G.S.W.), National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD
| | - Sadiya Sana Khan
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL (S.S.K., D.L-J., M.R.C., D.R.L.)
| | - Donald Lloyd-Jones
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL (S.S.K., D.L-J., M.R.C., D.R.L.)
| | - Donna K Arnett
- College of Public Health, University of Kentucky, Lexington (D.K.A.)
| | - Mercedes R Carnethon
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL (S.S.K., D.L-J., M.R.C., D.R.L.)
| | - Darwin R Labarthe
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL (S.S.K., D.L-J., M.R.C., D.R.L.)
| | - Matthew Shane Loop
- Department of Biostatistics (M.S.L.), Gillings School of Global Public Health, University of North Carolina Chapel Hill
| | - Russell V Luepker
- School of Public Health, University of Minnesota, Minneapolis (R.V.L.)
| | - Michael V McConnell
- Department of Medicine, Cardiovascular Medicine, School of Medicine, Stanford University, CA (M.V.M.)
- Google Health, Palo Alto, CA (M.V.M.)
| | - George A Mensah
- Center for Translation Research and Implementation Science (G.A.M.), National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD
| | - Mahasin S Mujahid
- Department of Epidemiology, School of Public Health, University of California, Berkeley (M.S.M.)
| | | | - Dorairaj Prabhakaran
- Public Health Foundation of India, Gurgaon (D.P.)
- Centre for Chronic Disease Control, New Delhi, India (D.P.)
- London School of Hygiene and Tropical Medicine, United Kingdom (D.P.)
| | - Véronique Roger
- Department of Cardiovascular Diseases, Mayo Clinic, Rochester, MN (V.R.)
| | - Wayne D Rosamond
- Department of Epidemiology (W.D.R.), Gillings School of Global Public Health, University of North Carolina Chapel Hill
| | - Stephen Sidney
- Division of Research, Kaiser Permanente Northern California, Oakland (S.S.)
| | - Gina S Wei
- Division of Cardiovascular Sciences (D.C.G., G.S.W.), National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD
| | - Janet S Wright
- Office of the Surgeon General, US Department of Health and Human Services, Washington, DC (J.S.W.)
| |
Collapse
|
76
|
Intraday Variation Mapping of Population Age Structure via Urban-Functional-Region-Based Scaling. REMOTE SENSING 2021. [DOI: 10.3390/rs13040805] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The spatial distribution of the population is uneven for various reasons, such as urban-rural differences and geographical conditions differences. As the basic element of the natural structure of the population, the age structure composition of populations also varies considerably across the world. Obtaining accurate and spatiotemporal population age structure maps is crucial for calculating population size at risk, analyzing populations mobility patterns, or calculating health and development indicators. During the past decades, many population maps in the form of administrative units and grids have been produced. However, these population maps are limited by the lack of information on the change of population distribution within a day and the age structure of the population. Urban functional regions (UFRs) are closely related to population mobility patterns, which can provide information about population variation intraday. Focusing on the area within the Beijing Fifth Ring Road, the political and economic center of Beijing, we showed how to use the temporal scaling factors obtained by analyzing the population survey sampling data and population dasymetric maps in different categories of UFRs to realize the intraday variation mapping of elderly individuals and children. The population dasymetric maps were generated on the basis of covariates related to population. In this article, 50 covariates were calculated from remote sensing data and geospatial data. However, not all covariates are associate with population distribution. In order to improve the accuracy of dasymetric maps and reduce the cost of mapping, it is necessary to select the optimal subset for the dasymetric model of elderly and children. The random forest recursive feature elimination (RF-RFE) algorithm was introduced to obtain the optimal subset of different age groups of people and generate the population dasymetric model in this article, as well as to screen out the optimal subset with 38 covariates and 26 covariates for the dasymetric models of the elderly and children, respectively. An accurate UFR identification method combining point of interest (POI) data and OpenStreetMap (OSM) road network data is also introduced in this article. The overall accuracy of the identification results of UFRs was 70.97%, which is quite accurate. The intraday variation maps of population age structure on weekdays and weekends were made within the Beijing Fifth Ring Road. Accuracy evaluation based on sampling data found that the overall accuracy was relatively high—R2 for each time period was higher than 0.5 and root mean square error (RMSE) was less than 0.05. On weekdays in particular, R2 for each time period was higher than 0.61 and RMSE was less than 0.02.
Collapse
|
77
|
Comparative Analysis of Geolocation Information through Mobile-Devices under Different COVID-19 Mobility Restriction Patterns in Spain. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2021. [DOI: 10.3390/ijgi10020073] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
The COVID-19 pandemic is changing the world in unprecedented and unpredictable ways. Human mobility, being the greatest facilitator for the spread of the virus, is at the epicenter of this change. In order to study mobility under COVID-19, to evaluate the efficiency of mobility restriction policies, and to facilitate a better response to future crisis, we need to understand all possible mobility data sources at our disposal. Our work studies private mobility sources, gathered from mobile-phones and released by large technological companies. These data are of special interest because, unlike most public sources, it is focused on individuals rather than on transportation means. Furthermore, the sample of society they cover is large and representative. On the other hand, these data are not directly accessible for anonymity reasons. Thus, properly interpreting its patterns demands caution. Aware of that, we explore the behavior and inter-relations of private sources of mobility data in the context of Spain. This country represents a good experimental setting due to both its large and fast pandemic peak and its implementation of a sustained, generalized lockdown. Our work illustrates how a direct and naive comparison between sources can be misleading, as certain days (e.g., Sundays) exhibit a directly adverse behavior. After understanding their particularities, we find them to be partially correlated and, what is more important, complementary under a proper interpretation. Finally, we confirm that mobile-data can be used to evaluate the efficiency of implemented policies, detect changes in mobility trends, and provide insights into what new normality means in Spain.
Collapse
|
78
|
Fiorio L, Zagheni E, Abel G, Hill J, Pestre G, Letouzé E, Cai J. Analyzing the Effect of Time in Migration Measurement Using Georeferenced Digital Trace Data. Demography 2021; 58:51-74. [PMID: 33834241 PMCID: PMC8055474 DOI: 10.1215/00703370-8917630] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
Georeferenced digital trace data offer unprecedented flexibility in migration estimation. Because of their high temporal granularity, many migration estimates can be generated from the same data set by changing the definition parameters. Yet despite the growing application of digital trace data to migration research, strategies for taking advantage of their temporal granularity remain largely underdeveloped. In this paper, we provide a general framework for converting digital trace data into estimates of migration transitions and for systematically analyzing their variation along a quasi-continuous time scale, analogous to a survival function. From migration theory, we develop two simple hypotheses regarding how we expect our estimated migration transition functions to behave. We then test our hypotheses on simulated data and empirical data from three platforms in two internal migration contexts: geotagged Tweets and Gowalla check-ins in the United States, and cell-phone call detail records in Senegal. Our results demonstrate the need for evaluating the internal consistency of migration estimates derived from digital trace data before using them in substantive research. At the same time, however, common patterns across our three empirical data sets point to an emergent research agenda using digital trace data to study the specific functional relationship between estimates of migration and time and how this relationship varies by geography and population characteristics.
Collapse
Affiliation(s)
- Lee Fiorio
- Department of Geography, University of Washington, Seattle, WA, USA
| | - Emilio Zagheni
- Max Planck Institute for Demographic Research, Rostock, Germany
| | - Guy Abel
- Asian Demographic Research Institute, Shanghai University, Shanghai, China.,Wittgenstein Centre (IIASA, VID/ÖAW, WU), International Institute for Applied Systems Analysis, Vienna, Austria
| | - Johnathan Hill
- Department of Geography, University of Washington, Seattle, WA, USA
| | | | - Emmanuel Letouzé
- Data-Pop Alliance, New York, NY, USA.,Massachusetts Institute of Technology Media Lab, Cambridge, MA, USA
| | - Jixuan Cai
- Department of Geography and Resource Management, The Chinese University of Hong Kong, Vienna, Austria.,Wittgenstein Centre (IIASA, VID/ÖAW, WU), International Institute for Applied Systems Analysis, Vienna, Austria
| |
Collapse
|
79
|
Yin L, Lin N, Zhao Z. Mining Daily Activity Chains from Large-Scale Mobile Phone Location Data. CITIES (LONDON, ENGLAND) 2021; 109:103013. [PMID: 33536696 PMCID: PMC7809620 DOI: 10.1016/j.cities.2020.103013] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Revised: 10/02/2020] [Accepted: 10/26/2020] [Indexed: 05/12/2023]
Abstract
Understanding residents' daily activity chains provides critical support for various applications in transportation, public health and many other related fields. Recently, mobile phone location datasets have been suggested for mining activity patterns because of their utility and large sample sizes. Although recently machine learning-based models seem to perform well in activity purpose inference using mobile phone location data, most of these models work as black boxes. To address these challenges, this study proposes a flexible white box method to mine human activity chains from large-scale mobile phone location data by integrating both the spatial and temporal features of daily activities with varying weights. We find that the frequency distribution of major activity chain patterns agrees well with the patterns derived based on a travel survey of Shenzhen and a state-of-the-art method. Moreover, a dataset covering over 16.5% of the city population can yield a reasonable outcome of the major activity patterns. The contributions of this study not only lie in offering an effective approach to mining daily activity chains from mobile phone location data but also involve investigating the impact of different data conditions on the model performance, which make using big trajectory data more practical for domain experts.
Collapse
Affiliation(s)
- Ling Yin
- Shenzhen Institutes of Advanced Technologies, Chinese Academy of Science, Shenzhen, China
| | - Nan Lin
- Shenzhen Institutes of Advanced Technologies, Chinese Academy of Science, Shenzhen, China
| | - Zhiyuan Zhao
- Academy of Digital China (Fujian), Fuzhou University, Fuzhou, China
- Key Lab of Spatial Data Mining and Information Sharing, Ministry of Education, Fuzhou, China
- National & Local Joint Engineering Research Center of Geo-spatial Information Technology, Fuzhou, China
| |
Collapse
|
80
|
Mahmud AS, Kabir MI, Engø-Monsen K, Tahmina S, Riaz BK, Hossain MA, Khanom F, Rahman MM, Rahman MK, Sharmin M, Hossain DM, Yasmin S, Ahmed MM, Lusha MAF, Buckee CO. Megacities as drivers of national outbreaks: The 2017 chikungunya outbreak in Dhaka, Bangladesh. PLoS Negl Trop Dis 2021; 15:e0009106. [PMID: 33529229 PMCID: PMC7880496 DOI: 10.1371/journal.pntd.0009106] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Revised: 02/12/2021] [Accepted: 01/04/2021] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND Several large outbreaks of chikungunya have been reported in the Indian Ocean region in the last decade. In 2017, an outbreak occurred in Dhaka, Bangladesh, one of the largest and densest megacities in the world. Population mobility and fluctuations in population density are important drivers of epidemics. Measuring population mobility during outbreaks is challenging but is a particularly important goal in the context of rapidly growing and highly connected cities in low- and middle-income countries, which can act to amplify and spread local epidemics nationally and internationally. METHODS We first describe the epidemiology of the 2017 chikungunya outbreak in Dhaka and estimate incidence using a mechanistic model of chikungunya transmission parametrized with epidemiological data from a household survey. We combine the modeled dynamics of chikungunya in Dhaka, with mobility estimates derived from mobile phone data for over 4 million subscribers, to understand the role of population mobility on the spatial spread of chikungunya within and outside Dhaka during the 2017 outbreak. RESULTS We estimate a much higher incidence of chikungunya in Dhaka than suggested by official case counts. Vector abundance, local demographics, and population mobility were associated with spatial heterogeneities in incidence in Dhaka. The peak of the outbreak in Dhaka coincided with the annual Eid holidays, during which large numbers of people traveled from Dhaka to other parts of the country. We show that travel during Eid likely resulted in the spread of the infection to the rest of the country. CONCLUSIONS Our results highlight the impact of large-scale population movements, for example during holidays, on the spread of infectious diseases. These dynamics are difficult to capture using traditional approaches, and we compare our results to a standard diffusion model, to highlight the value of real-time data from mobile phones for outbreak analysis, forecasting, and surveillance.
Collapse
Affiliation(s)
- Ayesha S. Mahmud
- Department of Demography, University of California, Berkeley, Berkeley, California, United States of America
- Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Md. Iqbal Kabir
- National Institute of Preventive and Social Medicine, Dhaka, Bangladesh
- Directorate General of Health Services, Dhaka, Bangladesh
| | | | - Sania Tahmina
- Directorate General of Health Services, Dhaka, Bangladesh
| | | | - Md. Akram Hossain
- National Institute of Preventive and Social Medicine, Dhaka, Bangladesh
| | - Fahmida Khanom
- National Institute of Preventive and Social Medicine, Dhaka, Bangladesh
| | | | | | | | | | | | | | | | - Caroline O. Buckee
- Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America
| |
Collapse
|
81
|
Research on Human Travel Correlation for Urban Transport Planning Based on Multisource Data. SENSORS 2020; 21:s21010195. [PMID: 33396731 PMCID: PMC7795117 DOI: 10.3390/s21010195] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Revised: 12/26/2020] [Accepted: 12/27/2020] [Indexed: 11/16/2022]
Abstract
With the rapid development of positioning techniques, a large amount of human travel trajectory data is collected. These datasets have become an effective data resource for obtaining urban traffic patterns. However, many traffic analyses are only based on a single dataset. It is difficult to determine whether a single-dataset-based result can meet the requirement of urban transport planning. In response to this problem, we attempted to obtain traffic patterns and population distributions from the perspective of multisource traffic data using license plate recognition (LPR) data and cellular signaling (CS) data. Based on the two kinds of datasets, identification methods of residents’ travel stay point are proposed. For LPR data, it was identified based on different vehicle speed thresholds at different times. For CS data, a spatiotemporal clustering algorithm based on time allocation was proposed to recognize it. We then used the correlation coefficient r and the significance test p-values to analyze the correlations between the CS and LPR data in terms of the population distribution and traffic patterns. We studied two real-world datasets from five working days of human mobility data and found that they were significantly correlated for the stay and move population distributions. Then, the analysis scale was refined to hour level. We also found that they still maintain a significant correlation. Finally, the origin–destination (OD) matrices between traffic analysis zones (TAZs) were obtained. Except for a few TAZs with poor correlations due to the fewer LPR records, the correlations of the other TAZs remained high. It showed that the population distribution and traffic patterns computed by the two datasets were fairly similar. Our research provides a method to improve the analysis of complex travel patterns and behaviors and provides opportunities for travel demand modeling and urban transport planning. The findings can also help decision-makers understand urban human mobility and can serve as a guide for urban management and transport planning.
Collapse
|
82
|
Aubourg T, Demongeot J, Vuillerme N. Novel statistical approach for assessing the persistence of the circadian rhythms of social activity from telephone call detail records in older adults. Sci Rep 2020; 10:21464. [PMID: 33293551 PMCID: PMC7722744 DOI: 10.1038/s41598-020-77795-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Accepted: 10/19/2020] [Indexed: 02/02/2023] Open
Abstract
How circadian rhythms of activity manifest themselves in social life of humans remains one of the most intriguing questions in chronobiology and a major issue for personalized medicine. Over the past years, substantial advances have been made in understanding the personal nature and the robustness—i.e. the persistence—of the circadian rhythms of social activity by the analysis of phone use. At this stage however, the consistency of such advances as their statistical validity remains unclear. The present paper has been specifically designed to address this issue. To this end, we propose a novel statistical procedure for the measurement of the circadian rhythms of social activity which is particularly well-suited for the existing framework of persistence analysis. Furthermore, we illustrate how this procedure works concretely by assessing the persistence of the circadian rhythms of telephone call activity from a 12-month call detail records (CDRs) dataset of adults over than 65 years. The results show the ability of our approach for assessing persistence with a statistical significance. In the field of CDRs analysis, this novel statistical approach can be used for completing the existing methods used to analyze the persistence of the circadian rhythms of a social nature. More importantly, it provides an opportunity to open up the analysis of CDRs for various domains of application in personalized medicine requiring access to statistical significance such as health care monitoring.
Collapse
Affiliation(s)
- Timothée Aubourg
- Univ. Grenoble Alpes, AGEIS, Grenoble, France. .,Orange Labs, Meylan, France. .,LabCom Telecom4Health, Univ. Grenoble Alpes & Orange Labs, Grenoble, France.
| | - Jacques Demongeot
- Univ. Grenoble Alpes, AGEIS, Grenoble, France.,LabCom Telecom4Health, Univ. Grenoble Alpes & Orange Labs, Grenoble, France.,Institut Universitaire de France, Paris, France
| | - Nicolas Vuillerme
- Univ. Grenoble Alpes, AGEIS, Grenoble, France.,LabCom Telecom4Health, Univ. Grenoble Alpes & Orange Labs, Grenoble, France.,Institut Universitaire de France, Paris, France
| |
Collapse
|
83
|
Dong L, Huang Z, Zhang J, Liu Y. Understanding the mesoscopic scaling patterns within cities. Sci Rep 2020; 10:21201. [PMID: 33273607 PMCID: PMC7712915 DOI: 10.1038/s41598-020-78135-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Accepted: 11/20/2020] [Indexed: 11/10/2022] Open
Abstract
Understanding quantitative relationships between urban elements is crucial for a wide range of applications. The observation at the macroscopic level demonstrates that the aggregated urban quantities (e.g., gross domestic product) scale systematically with population sizes across cities, also known as urban scaling laws. However, at the mesoscopic level, we lack an understanding of whether the simple scaling relationship holds within cities, which is a fundamental question regarding the spatial origin of scaling in urban systems. Here, by analyzing four extensive datasets covering millions of mobile phone users and urban facilities, we investigate the scaling phenomena within cities. We find that the mesoscopic infrastructure volume and socioeconomic activity scale sub- and super-linearly with the active population, respectively. For a same scaling phenomenon, however, the exponents vary in cities of similar population sizes. To explain these empirical observations, we propose a conceptual framework by considering the heterogeneous distributions of population and facilities, and the spatial interactions between them. Analytical and numerical results suggest that, despite the large number of complexities that influence urban activities, the simple interaction rules can effectively explain the observed regularity and heterogeneity in scaling behaviors within cities.
Collapse
Affiliation(s)
- Lei Dong
- Institute of Remote Sensing and Geographical Information Systems, School of Earth and Space Sciences, Peking University, Beijing, 100871, China.,Senseable City Lab, Department of Urban Studies and Planning, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Zhou Huang
- Institute of Remote Sensing and Geographical Information Systems, School of Earth and Space Sciences, Peking University, Beijing, 100871, China
| | - Jiang Zhang
- School of System Science, Beijing Normal University, Beijing, 100875, China
| | - Yu Liu
- Institute of Remote Sensing and Geographical Information Systems, School of Earth and Space Sciences, Peking University, Beijing, 100871, China.
| |
Collapse
|
84
|
Bergquist R, Kiani B, Manda S. First year with COVID-19: Assessment and prospects. GEOSPATIAL HEALTH 2020; 15. [PMID: 33461262 DOI: 10.4081/gh.2020.953] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Accepted: 10/28/2020] [Indexed: 06/12/2023]
Abstract
The vision of health for all by Dr. Halfdan Mahler, Director General of the World Health Organization (WHO) 1973 to 1988, guided public health approaches towards improving life for all those mired in poverty and disease. Research on the Neglected Tropical Diseases (NTDs) of the world's poor was advancing strongly when the coronavirus disease 2019 (COVID-19) struck. Although work on the NTDs did not grind to a halt, the situation is reminiscent of the author Stefan Zweig's passionate account of culture destruction in his book The World of Yesterday from 1941, which gives an insight as to how the war ended traditional life. His thoughts parallel the present situation; however, this time societies are not torn apart by war but instead isolated by a pandemic. It comes upon today's scientists to move fast to make COVID-19 less devastating than the Spanish flu of 1918-1920 that killed more than 3% of the world population...
Collapse
Affiliation(s)
| | - Behzad Kiani
- Department of Medical Informatics, School of Medicine, Mashhad University of Medical Sciences, Mashhad.
| | - Samuel Manda
- Biostatistics Unit, South African Medical Research Council; Department of Statistics, University of Pretoria, Pretoria.
| |
Collapse
|
85
|
Every step you take, we’ll be watching you: nudging and the ramifications of GPS technology. AI & SOCIETY 2020. [DOI: 10.1007/s00146-020-01098-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
86
|
Heterogeneity in social and epidemiological factors determines the risk of measles outbreaks. Proc Natl Acad Sci U S A 2020; 117:30118-30125. [PMID: 33203683 DOI: 10.1073/pnas.1920986117] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Political and environmental factors-e.g., regional conflicts and global warming-increase large-scale migrations, posing extraordinary societal challenges to policymakers of destination countries. A common concern is that such a massive arrival of people-often from a country with a disrupted healthcare system-can increase the risk of vaccine-preventable disease outbreaks like measles. We analyze human flows of 3.5 million (M) Syrian refugees in Turkey inferred from massive mobile-phone data to verify this concern. We use multilayer modeling of interdependent social and epidemic dynamics to demonstrate that the risk of disease reemergence in Turkey, the main host country, can be dramatically reduced by 75 to 90% when the mixing of Turkish and Syrian populations is high. Our results suggest that maximizing the dispersal of refugees in the recipient population contributes to impede the spread of sustained measles epidemics, rather than favoring it. Targeted vaccination campaigns and policies enhancing social integration of refugees are the most effective strategies to reduce epidemic risks for all citizens.
Collapse
|
87
|
Relationships between Local Green Space and Human Mobility Patterns during COVID-19 for Maryland and California, USA. SUSTAINABILITY 2020. [DOI: 10.3390/su12229401] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Human mobility is a significant factor for disease transmission. Little is known about how the environment influences mobility during a pandemic. The aim of this study was to investigate an effect of green space on mobility reductions during the early stage of the COVID-19 pandemic in Maryland and California, USA. For 230 minor civil divisions (MCD) in Maryland and 341 census county divisions (CCD) in California, we obtained mobility data from Facebook Data for Good aggregating information of people using the Facebook app on their mobile phones with location history active. The users’ movement between two locations was used to calculate the number of users that traveled into an MCD (or CCD) for each day in the daytime hours between 11 March and 26 April 2020. Each MCD’s (CCD’s) vegetation level was estimated as the average Enhanced Vegetation Index (EVI) level for 1 January through 31 March 2020. We calculated the number of state and local parks, food retail establishments, and hospitals for each MCD (CCD). Results showed that the daily percent changes in the number of travels declined during the study period. This mobility reduction was significantly lower in Maryland MCDs with state parks (p-value = 0.045), in California CCDs with local-scale parks (p-value = 0.048). EVI showed no association with mobility in both states. This finding has implications for the potential impacts of green space on mobility under an outbreak. Future studies are needed to explore these findings and to investigate changes in health effects of green space during a pandemic.
Collapse
|
88
|
Better coverage, better outcomes? Mapping mobile network data to official statistics using satellite imagery and radio propagation modelling. PLoS One 2020; 15:e0241981. [PMID: 33166359 PMCID: PMC7652289 DOI: 10.1371/journal.pone.0241981] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Accepted: 10/23/2020] [Indexed: 12/02/2022] Open
Abstract
Mobile sensing data has become a popular data source for geo-spatial analysis, however, mapping it accurately to other sources of information such as statistical data remains a challenge. Popular mapping approaches such as point allocation or voronoi tessellation provide only crude approximations of the mobile network coverage as they do not consider holes, overlaps and within-cell heterogeneity. More elaborate mapping schemes often require additional proprietary data operators are highly reluctant to share. In this paper, I use human settlement information extracted from publicly available satellite imagery in combination with stochastic radio propagation modelling techniques to account for that. I show in a simulation study and a real-world application on unemployment estimates in Senegal that better coverage approximations do not necessarily lead to better outcome predictions.
Collapse
|
89
|
Mapping the Urban Population in Residential Neighborhoods by Integrating Remote Sensing and Crowdsourcing Data. REMOTE SENSING 2020. [DOI: 10.3390/rs12193235] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Where urban dwellers live at a fine scale is essential for the planning of services and response to city emergencies. Currently, most existing population mapping approaches considered census data as observational data for specifying models. However, census data usually have low spatial resolution and low frequency. Here, we presented a framework for mapping populations in residential neighborhoods with 30 m spatial resolution with little dependency upon census data. The framework integrated remote sensing and crowdsourcing data. The observational populations and number of households at residential neighborhood scale were obtained from real-time crowdsourcing data instead of census data. We tested our framework in Beijing. We found that (1) the number of households from a real estate trade platform could be a good proxy for accurate observational population. (2) The accuracy of the mapping population in residential neighborhoods was reasonable. The mean absolute percentage error was 47.26% and the R2 was 0.78. (3) Our framework shows great potential in mapping the population in real time. Our findings expand the knowledge in estimating urban population. In addition, the proposed framework and approach provide an effective means to quantify population distribution data for cities, which is particularly important for many of the cities worldwide lacking census data at the residential neighborhood scale.
Collapse
|
90
|
Spatial Equity with Census Population Data vs. Floating Population Data: The Distribution of Earthquake Evacuation Shelters in Daegu, South Korea. SUSTAINABILITY 2020. [DOI: 10.3390/su12198046] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The spatial equity of outdoor evacuation sites designated for emergency evacuation must be secured. In particular, public administrators must ensure spatial equity in preparing for unpredictable evacuation demands, such as earthquakes. This study analyzed the spatial equity of earthquake evacuation shelters in Daegu, South Korea, by using population data at the local level by time- and date-based mobile phone location data (i.e., floating population data). We compared our analysis of the spatial equity and error rate in this case with census data. Ultimately, our results demonstrate that the use of census population data can cause significant error in evaluations of the equity of evacuation shelter accessibility when the floating population data acquired through mobile phone location data are assumed exact.
Collapse
|
91
|
Vannoni M, McKee M, Semenza JC, Bonell C, Stuckler D. Using volunteered geographic information to assess mobility in the early phases of the COVID-19 pandemic: a cross-city time series analysis of 41 cities in 22 countries from March 2nd to 26th 2020. Global Health 2020; 16:85. [PMID: 32967691 PMCID: PMC7509494 DOI: 10.1186/s12992-020-00598-9] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Accepted: 07/13/2020] [Indexed: 12/03/2022] Open
Abstract
Objectives Restricting mobility is a central aim for lowering contact rates and preventing COVID-19 transmission. Yet the impact on mobility of different non-pharmaceutical countermeasures in the earlier stages of the pandemic is not well-understood. Design Trends were evaluated using Citymapper’s mobility index covering 2nd to 26th March 2020, expressed as percentages of typical usage periods from 0% as the lowest and 100% as normal. China and India were not covered. Multivariate fixed effects models were used to estimate the association of policies restricting movement on mobility before and after their introduction. Policy restrictions were assessed using the Oxford COVID-19 Government Response Stringency Index as well as measures coding the timing and degree of school and workplace closures, transport restrictions, and cancellation of mass gatherings. Setting 41 cities worldwide. Main outcome measures Citymapper’s mobility index. Results Mobility declined in all major cities throughout March. Larger declines were seen in European than Asian cities. The COVID-19 Government Response Stringency Index was strongly associated with declines in mobility (r = − 0.75, p < 0.001). After adjusting for time-trends, we observed that implementing non-pharmaceutical countermeasures was associated with a decline of mobility of 10.0% for school closures (95% CI: 4.36 to 15.7%), 15.0% for workplace closures (95% CI: 10.2 to 19.8%), 7.09% for cancelling public events (95% CI: 1.98 to 12.2%), 18.0% for closing public transport (95% CI: 6.74 to 29.2%), 13.3% for restricting internal movements (95% CI: 8.85 to 17.8%) and 5.30% for international travel controls (95% CI: 1.69 to 8.90). In contrast, as expected, there was no association between population mobility changes and fiscal or monetary measures or emergency healthcare investment. Conclusions Understanding the effect of public policy on mobility in the early stages is crucial to slowing and reducing COVID-19 transmission. By using Citymapper’s mobility index, this work provides the first evidence about trends in mobility and the impacts of different policy interventions, suggesting that closure of public transport, workplaces and schools are particularly impactful.
Collapse
Affiliation(s)
- Matia Vannoni
- Department of Political Economy, King's College London, Bush House North East Wing, 30 Aldwych, London, WC2B 4BG, UK
| | - Martin McKee
- Faculty of Public Health and Policy, London School of Hygiene and Tropical Medicine, 15-17 Tavistock Place, London, WC1H 9SH, UK
| | - Jan C Semenza
- Dondena Centre for Research on Social Dynamics and Public Policy and Department of Social & Political Sciences, Bocconi University, Milan, Italy
| | - Chris Bonell
- European Centre for Disease Prevention and Control, Gustav III:s Boulevard 40, 169 73, Solna, Sweden
| | - David Stuckler
- Dondena Centre for Research on Social Dynamics and Public Policy and Department of Social & Political Sciences, Bocconi University, Milan, Italy.
| |
Collapse
|
92
|
Uncovering temporal changes in Europe's population density patterns using a data fusion approach. Nat Commun 2020; 11:4631. [PMID: 32934205 PMCID: PMC7493994 DOI: 10.1038/s41467-020-18344-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2020] [Accepted: 08/04/2020] [Indexed: 11/08/2022] Open
Abstract
The knowledge of the spatial and temporal distribution of human population is vital for the study of cities, disaster risk management or planning of infrastructure. However, information on the distribution of population is often based on place-of-residence statistics from official sources, thus ignoring the changing population densities resulting from human mobility. Existing assessments of spatio-temporal population are limited in their detail and geographical coverage, and the promising mobile-phone records are hindered by issues concerning availability and consistency. Here, we present a multi-layered dasymetric approach that combines official statistics with geospatial data from emerging sources to produce and validate a European Union-wide dataset of population grids taking into account intraday and monthly population variations at 1 km2 resolution. The results reproduce and systematically quantify known insights concerning the spatio-temporal population density structure of large European cities, whose daytime population we estimate to be, on average, 1.9 times higher than night time in city centers.
Collapse
|
93
|
Fricke RM, Wood SA, Martin DR, Olden JD. A bobber’s perspective on angler-driven vectors of invasive species transmission. NEOBIOTA 2020. [DOI: 10.3897/neobiota.60.54579] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Prevention of aquatic invasive species is a fundamental management challenge. With hundreds of millions of people participating in fishing trips each year, understanding angler movements that transmit invasive species can provide critical insight into the most effective locations and scales at which to apply preventative measures. Recent evidence suggests that mobile technologies provide new opportunities to understand different types of angler movement behaviour beyond what is possible with infrequently and sparsely conducted in-person boat surveys and mail questionnaires. Here we capitalise on data provided by ReelSonar’s iBobber, a sonar-enabled bobber with over 5 M recorded fishing locations, globally. By quantifying geographic patterns of fishing activities and assessing how these patterns change seasonally, we explore angler behaviour across the entire continental United States in terms of fishing frequency and distance travelled between sites and characterise the attributes of fished ecosystems. We found that iBobber users (anglers) undertook 66,918 trips to 20,049 different water-bodies over a two-year period. Anglers who use iBobber were more likely to visit larger, deeper and more urbanised water-bodies and these water-bodies were over five times more likely to be a reservoir compared to a lake. Inter-water-body travel road distances averaged 93 km (SD = 277 km; range < 1–300 km) and nearly half of these movements occurred over a timespan of two days or less, a timeframe that we show falls well within the desiccation tolerance window of many prevalent plant and animal invasive species. Our study offers novel insight into spatiotemporal patterns of angler behaviour well beyond the geographical and temporal extent of conventional ground-collected approaches and carries important implications for predicting and preventing future transmission of aquatic invasive species via recreational fishing.
Collapse
|
94
|
Verschuuren M, van Oers H. Population health monitoring: an essential public health field in motion. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz 2020; 63:1134-1142. [PMID: 32857173 DOI: 10.1007/s00103-020-03205-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
BACKGROUND Population health monitoring, the regular and institutionalized production and dissemination of information and knowledge about the health status of a population, is an essential element of public health. Nevertheless, while epidemiology and biostatistics, for example, are well-recognized disciplines, this does not (yet) apply to population health monitoring. Over the past decade, however, it has matured as a distinct field of expertise. OBJECTIVES This paper presents a comprehensive model for population health monitoring and describes its current status as a field of expertise. It concludes with an overview of the most important developments that are likely to shape the health information systems and population health monitoring practices of the future. RESULTS AND CONCLUSIONS Combining the information pyramid (an application of the data-information-knowledge-wisdom hierarchy), describing outputs, and a so-called monitoring chain, describing activities, results in a comprehensive model for population health monitoring. The steps of the activity chain can be viewed as a stairway by which the information pyramid is climbed, reaching evidence-informed policymaking at the top. Population health monitoring has several inherent strengths, such as its high societal relevance; its integrative, comprehensive, and structured approach; and the fact that it makes use of routinely collected data. In practice, however, secondary use of routine data is often hampered by technical, motivational, economic, political, ethical, and legal barriers. Important developments that will shape health information systems and population health monitoring practices of the future include digitalization and data-driven technology, citizen science, and the growing need for intersectoral approaches. Population health monitoring practice will need to adapt in order to counteract the risks and reap the benefits that these developments hold.
Collapse
Affiliation(s)
- Marieke Verschuuren
- Independent public health consultant, Kovelaarstraat 32, 3582GP, Utrecht, The Netherlands.
| | - Hans van Oers
- National Institute of Public Health and the Environment, Bilthoven, The Netherlands.,Tilburg University, Tilburg, The Netherlands
| |
Collapse
|
95
|
Using OpenStreetMap Data and Machine Learning to Generate Socio-Economic Indicators. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2020. [DOI: 10.3390/ijgi9090498] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Socio-economic indicators are key to understanding societal challenges. They disassemble complex phenomena to gain insights and deepen understanding. Specific subsets of indicators have been developed to describe sustainability, human development, vulnerability, risk, resilience and climate change adaptation. Nonetheless, insufficient quality and availability of data often limit their explanatory power. Spatial and temporal resolution are often not at a scale appropriate for monitoring. Socio-economic indicators are mostly provided by governmental institutions and are therefore limited to administrative boundaries. Furthermore, different methodological computation approaches for the same indicator impair comparability between countries and regions. OpenStreetMap (OSM) provides an unparalleled standardized global database with a high spatiotemporal resolution. Surprisingly, the potential of OSM seems largely unexplored in this context. In this study, we used machine learning to predict four exemplary socio-economic indicators for municipalities based on OSM. By comparing the predictive power of neural networks to statistical regression models, we evaluated the unhinged resources of OSM for indicator development. OSM provides prospects for monitoring across administrative boundaries, interdisciplinary topics, and semi-quantitative factors like social cohesion. Further research is still required to, for example, determine the impact of regional and international differences in user contributions on the outputs. Nonetheless, this database can provide meaningful insight into otherwise unknown spatial differences in social, environmental or economic inequalities.
Collapse
|
96
|
Wang L, Fan H, Wang Y. Improving population mapping using Luojia 1-01 nighttime light image and location-based social media data. THE SCIENCE OF THE TOTAL ENVIRONMENT 2020; 730:139148. [PMID: 32402976 DOI: 10.1016/j.scitotenv.2020.139148] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Revised: 04/16/2020] [Accepted: 04/29/2020] [Indexed: 06/11/2023]
Abstract
Fine-resolution population mapping, which is vital to urban planning, public health, and disaster management, has gained considerable attention in socioeconomic and environmental studies. Although population distribution has been considered highly correlated with urban facilities, the quantitative relationship between the two has yet to be revealed when considering huge heterogeneity. To address this problem, the present study proposed a novel population mapping method by adopting Luojia 1-01 nighttime light imagery, points of interest (POI), and social media check-in data. A grid-based attraction degree (AD) model was built to quantify the possibility of population concentration in each geographic unit with a comprehensive consideration of the distribution and the popularity of facilities. On the basis of kernel density estimation, 16 attraction indexes were extracted by matching POI and check-in data. Multiple variables were used to train a random forest model, through which fine-scale population mapping was conducted in Zhejiang, China. The comparison between demographic and WorldPop data proved the high accuracy of our approach (R2 = 0.75 and 0.58). To explore the characteristics of the model further, the most appropriate search distance (650 m) and acquisition time (19:00-08:00) of the check-in data were discussed. The contrast experiment revealed that the model could outperform those from previous studies on rural and suburban areas with a few check-in points and low AD; thus, the mapping error caused by heterogeneity considerably decreased. The results indicated the proposed method has great potential in fine-scale population mapping.
Collapse
Affiliation(s)
- Luyao Wang
- State Key Lab for Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, 129 Luoyu Road, Wuhan 430079, China; Center for Real Estate, Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, USA.
| | - Hong Fan
- State Key Lab for Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, 129 Luoyu Road, Wuhan 430079, China.
| | - Yankun Wang
- Research Institute for Smart Cities, Guangdong Key Laboratory of Urban Informatics, Shenzhen Key Laboratory of Spatial Smart Sensing and Services, School of Architecture and Urban Planning, Shenzhen University, 3688 Nanhai Road, Shenzhen 518061, China
| |
Collapse
|
97
|
Gender Gaps in the Use of Urban Space in Seoul: Analyzing Spatial Patterns of Temporary Populations Using Mobile Phone Data. SUSTAINABILITY 2020. [DOI: 10.3390/su12166481] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
This study aims to examine the gender gaps in the use of urban space in Seoul, Korea, to provide empirical evidence for urban planning for gender equality. We analyzed daily temporary populations that were estimated using mobile phone data. We used the total, women’s, and men’s temporary populations as well as the subtraction of the temporary population of men from that of women (SMW) as dependent variables. We first conducted a visual analysis on temporary population density using kernel density estimation and then conducted a further analysis using spatial autocorrelation indicators and spatial regression models. The results demonstrate that: (1) Temporary population patterns for women and men showed similarities in that both were larger in business areas than in residential areas, which means that a large number of women were engaged in economic activities like men; (2) the pattern for SMW showed the opposite, that is, women were more active in residential areas and areas where neighborhood retail shops, cultural facilities, parks, and department stores were easily accessible; and (3) both women’s temporary population and SMW had spatial autocorrelation and thus showed clustering patterns that can be helpful in urban planning for gender equality in Korea.
Collapse
|
98
|
A Method for the Estimation of Finely-Grained Temporal Spatial Human Population Density Distributions Based on Cell Phone Call Detail Records. REMOTE SENSING 2020. [DOI: 10.3390/rs12162572] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Estimating and mapping population distributions dynamically at a city-wide spatial scale, including those covering suburban areas, has profound, practical, applications such as urban and transportation planning, public safety warning, disaster impact assessment and epidemiological modelling, which benefits governments, merchants and citizens. More recently, call detail record (CDR) of mobile phone data has been used to estimate human population distributions. However, there is a key challenge that the accuracy of such a method is difficult to validate because there is no ground truth data for the dynamic population density distribution in time scales such as hourly. In this study, we present a simple and accurate method to generate more finely grained temporal-spatial population density distributions based upon CDR data. We designed an experiment to test our method based upon the use of a deep convolutional generative adversarial network (DCGAN). In this experiment, the highest spatial resolution of every grid cell is 125125 square metre, while the temporal resolution can vary from minutes to hours with varying accuracy. To demonstrate our method, we present an application of how to map the estimated population density distribution dynamically for CDR big data from Beijing, choosing a half hour as the temporal resolution. Finally, in order to cross-check previous studies that claim the population distribution at nighttime (from 8 p.m. to 8 a.m. on the next day) mapped by Beijing census data are similar to the ground truth data, we estimated the baseline distribution, first, based upon records in CDRs. Second, we estimate a baseline distribution based upon Global Navigation Satellite System (GNSS) data. The results also show the Root Mean Square Error (RMSE) is about 5000 while the two baseline distributions mentioned above have an RMSE of over 13,500. Our estimation method provides a fast and simple process to map people’s actual density distributions at a more finely grained, i.e., hourly, temporal resolution.
Collapse
|
99
|
Estimating small-area population density in Sri Lanka using surveys and Geo-spatial data. PLoS One 2020; 15:e0237063. [PMID: 32756580 PMCID: PMC7406065 DOI: 10.1371/journal.pone.0237063] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2019] [Accepted: 07/20/2020] [Indexed: 11/24/2022] Open
Abstract
Country-level census data are typically collected once every 10 years. However, conflicts, migration, urbanization, and natural disasters can rapidly shift local population patterns. This study demonstrates the feasibility of a “bottom-up”-method to estimate local population density in the between-census years by combining household surveys with contemporaneous geo-spatial data, including village-area and satellite imagery-based indicators. We apply this technique to the case of Sri Lanka using Poisson regression models based on variables selected using the Least Absolute Shrinkage and Selection Operator (LASSO). The model is estimated in villages sampled in the 2012/13 Household Income and Expenditure Survey, and is employed to obtain out-of-sample density estimates in the non-surveyed villages. These estimates approximate the census density accurately and are more precise than other bottom-up studies using similar geo-spatial data. While most open-source population products redistribute census population “top-down” from higher to lower spatial units using areal interpolation and dasymetric mapping techniques, these products become less accurate as the census itself ages. Our method circumvents the problem of the aging census by relying instead on more up-to-date household surveys. The collective evidence suggests that our method is cost effective in tracking local population density with greater frequency in the between-census years.
Collapse
|
100
|
|