1
|
Using Multi-Source Geospatial Information to Reduce the Saturation Problem of DMSP/OLS Nighttime Light Data. REMOTE SENSING 2022. [DOI: 10.3390/rs14143264] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
The DMSP/OLS Nighttime light (NTL) data directly reflect the spatial distribution and light intensity of artificial lighting from the Earth’s surface at night, and has become an emerging instrument for urbanization research, including in the monitoring of urban expansion, assessment of socio-economic vitality, and estimation of energy consumption and population. However, due to the imperfect sensor design of DMSP/OLS, the dynamic range of the digital number (DN) of NTL is limited (0, 63), leading to a significant saturation problem when describing the actual light intensity, especially in dense urban areas with high light intensity. This saturation problem masks spatial differences in light intensity and weakens the reliability of DMSP/OLS NTL data. Therefore, this study proposes a novel desaturation indicator that combines NDBI and POI, the Building and POI Density-Adjusted Nighttime Light Index (BPANTLI), to regulate the DMSP/OLS NTL saturation problem based on the spatial characteristics of urban structures and human activity intensity. The proposed method is applied to three urban agglomerations with the most severe light saturation issues in China. The geographical detector model is firstly utilized to quantify the effectiveness of NDBI and POI in reflecting the difference in light intensity distribution from the NTL potential saturation region (NTL DN value (53, 63)) and NTL unsaturation region (NTL DN value (0, 52)), so as to clarify the feasibility of developing the BPANTLI. The applicability of BPANTLI is validated through three aspects—comparison of the desaturation capacity and the performance of delineating light intensity; verification of the consistency of BPANTLI with radiometric calibration nighttime light product (RCNTL) and NPP/VIIRS data; and assessing the accuracy of the BPANTLI in estimating socio-economic parameters (GDP, electricity consumption, population density). The results indicate that the BPANTLI possesses superior capability in regulating the NTL saturation problem, achieving good performance in distinguishing inner-urban structures. The regulated results reveal a remarkably improved correspondence with the RCNTL and NPP/VIIRS data, providing a more realistic picture of the light intensity distribution. It is worth noting that, given the advantages of NDBI and POI vector data in spatial resolution, the BPANTLI established in this study can overcome the limitation of the spatial resolution of DMSP/OLS nighttime lighting data and achieve dynamic transformation of the spatial resolution. The higher spatial resolution desaturation results allow for a better characterization of the light intensity distribution. Moreover, the BPANTLI-regulated light intensity significantly improves the accuracy of estimating electricity consumption, GDP, and population density, which provides a valuable reference for urban socio-economic activity assessment. Thus, the BPANTLI proposed in this study can be considered as a reasonable desaturation method with a high application value.
Collapse
|
2
|
Psyllidis A, Gao S, Hu Y, Kim EK, McKenzie G, Purves R, Yuan M, Andris C. Points of Interest (POI): a commentary on the state of the art, challenges, and prospects for the future. COMPUTATIONAL URBAN SCIENCE 2022; 2:20. [PMID: 35789810 PMCID: PMC9239975 DOI: 10.1007/s43762-022-00047-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Accepted: 06/07/2022] [Indexed: 11/26/2022]
Abstract
In this commentary, we describe the current state of the art of points of interest (POIs) as digital, spatial datasets, both in terms of their quality and affordings, and how they are used across research domains. We argue that good spatial coverage and high-quality POI features - especially POI category and temporality information - are key for creating reliable data. We list challenges in POI geolocation and spatial representation, data fidelity, and POI attributes, and address how these challenges may affect the results of geospatial analyses of the built environment for applications in public health, urban planning, sustainable development, mobility, community studies, and sociology. This commentary is intended to shed more light on the importance of POIs both as standalone spatial datasets and as input to geospatial analyses.
Collapse
Affiliation(s)
- Achilleas Psyllidis
- Faculty of Industrial Design Engineering, Delft University of Technology, Landbergstraat 15, Delft, 2628CE the Netherlands
| | - Song Gao
- Department of Geography, University of Wisconsin, Madison, USA
| | - Yingjie Hu
- Department of Geography, University at Buffalo, Buffalo, USA
| | - Eun-Kyeong Kim
- Department of Geography, University of Zurich, Zurich, Switzerland
| | - Grant McKenzie
- Department of Geography, McGill University, Montreal, Canada
| | - Ross Purves
- Department of Geography, University of Zurich, Zurich, Switzerland
| | - May Yuan
- School of Economic, Political and Policy Sciences, University of Texas at Dallas, Richardson, USA
| | - Clio Andris
- School of City & Regional Planning and School of Interactive Computing, Georgia Institute of Technology, Atlanta, USA
| |
Collapse
|
3
|
A System for Aligning Geographical Entities from Large Heterogeneous Sources. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2022. [DOI: 10.3390/ijgi11020096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Aligning points of interest (POIs) from heterogeneous geographical data sources is an important task that helps extend map data with information from different datasets. This task poses several challenges, including differences in type hierarchies, labels (different formats, languages, and levels of detail), and deviations in the coordinates. Scalability is another major issue, as global-scale datasets may have tens or hundreds of millions of entities. In this paper, we propose the GeographicaL Entities AligNment (GLEAN) system for efficiently matching large geographical datasets based on spatial partitioning with an adaptable margin. In particular, we introduce a text similarity measure based on the local-context relevance of tokens used in combination with sentence embeddings. We then come up with a scalable type embedding model. Finally, we demonstrate that our proposed system can efficiently handle the alignment of large datasets while improving the quality of alignments using the proposed entity similarity measure.
Collapse
|
4
|
An End-to-End Point of Interest (POI) Conflation Framework. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2021. [DOI: 10.3390/ijgi10110779] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Point of interest (POI) data serves as a valuable source of semantic information for places of interest and has many geospatial applications in real estate, transportation, and urban planning. With the availability of different data sources, POI conflation serves as a valuable technique for enriching data quality and coverage by merging the POI data from multiple sources. This study proposes a novel end-to-end POI conflation framework consisting of six steps, starting with data procurement, schema standardisation, taxonomy mapping, POI matching, POI unification, and data verification. The feasibility of the proposed framework was demonstrated in a case study conducted in the eastern region of Singapore, where the POI data from five data sources was conflated to form a unified POI dataset. Based on the evaluation conducted, the resulting unified dataset was found to be more comprehensive and complete than any of the five POI data sources alone. Furthermore, the proposed approach for identifying POI matches between different data sources outperformed all baseline approaches with a matching accuracy of 97.6% with an average run time below 3 min when matching over 12,000 POIs to result in 8699 unique POIs, thereby demonstrating the framework’s scalability for large scale implementation in dense urban contexts.
Collapse
|
5
|
Towards Automatic Points of Interest Matching. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2020. [DOI: 10.3390/ijgi9050291] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Complementing information about particular points, places, or institutions, i.e., so-called Points of Interest (POIs) can be achieved by matching data from the growing number of geospatial databases; these include Foursquare, OpenStreetMap, Yelp, and Facebook Places. Doing this potentially allows for the acquisition of more accurate and more complete information about POIs than would be possible by merely extracting the information from each of the systems alone. Problem: The task of Points of Interest matching, and the development of an algorithm to perform this automatically, are quite challenging problems due to the prevalence of different data structures, data incompleteness, conflicting information, naming differences, data inaccuracy, and cultural and language differences; in short, the difficulties experienced in the process of obtaining (complementary) information about the POI from different sources are due, in part, to the lack of standardization among Points of Interest descriptions; a further difficulty stems from the vast and rapidly growing amount of data to be assessed on each occasion. Research design and contributions: To propose an efficient algorithm for automatic Points of Interest matching, we: (1) analyzed available data sources—their structures, models, attributes, number of objects, the quality of data (number of missing attributes), etc.—and defined a unified POI model; (2) prepared a fairly large experimental dataset consisting of 50,000 matching and 50,000 non-matching points, taken from different geographical, cultural, and language areas; (3) comprehensively reviewed metrics that can be used for assessing the similarity between Points of Interest; (4) proposed and verified different strategies for dealing with missing or incomplete attributes; (5) reviewed and analyzed six different classifiers for Points of Interest matching, conducting experiments and follow-up comparisons to determine the most effective combination of similarity metric, strategy for dealing with missing data, and POIs matching classifier; and (6) presented an algorithm for automatic Points of Interest matching, detailing its accuracy and carrying out a complexity analysis. Results and conclusions: The main results of the research are: (1) comprehensive experimental verification and numerical comparisons of the crucial Points of Interest matching components (similarity metrics, approaches for dealing with missing data, and classifiers), indicating that the best Points of Interest matching classifier is a combination of random forest algorithm coupled with marking of missing data and mixing different similarity metrics for different POI attributes; and (2) an efficient greedy algorithm for automatic POI matching. At a cost of just 3.5% in terms of accuracy, it allows for reducing POI matching time complexity by two orders of magnitude in comparison to the exact algorithm.
Collapse
|
6
|
Different Sourcing Point of Interest Matching Method Considering Multiple Constraints. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2020. [DOI: 10.3390/ijgi9040214] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Point of interest (POI) matching is critical but is the most technically difficult part of multi-source POI fusion. The accurate matching of POIs from different sources is important for the effective reuse of POI data. However, the existing research on POI matching usually adopts weak constraints, which leads to a low POI matching accuracy. To address the shortcomings of previous studies, this paper proposes a POI matching method with multiple determination constraints. First, according to various attributes (name, class, and spatial location), a new calculation model considering spatial topology, name role labeling, and bottom-up class constraints is established. In addition, the optimal threshold values corresponding to the different attribute constraints are determined. Second, according to the multiattribute constraint values and optimal thresholds, a constraint model with multiple strict determination constraints is proposed. Finally, actual POI data from Baidu Map and Gaode Map in Dongying city is used to validate the method. Comparing to the existing method, the accuracy and recall of the proposed method increase 0.3% and 7.1%, respectively. The experimental results demonstrate that the proposed POI matching method attains a high matching accuracy and high feasibility.
Collapse
|
7
|
Point of Interest Matching between Different Geospatial Datasets. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2019. [DOI: 10.3390/ijgi8100435] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Point of interest (POI) matching finds POI pairs that refer to the same real-world entity, which is the core issue in geospatial data integration. To address the low accuracy of geospatial entity matching using a single feature attribute, this study proposes a method that combines the D–S (Dempster–Shafer) evidence theory and a multiattribute matching strategy. During POI data preprocessing, this method calculates the spatial similarity, name similarity, address similarity, and category similarity between pairs from different geospatial datasets, using the multiattribute matching strategy. The similarity calculation results of these four types of feature attributes were used as independent evidence to construct the basic probability distribution. A multiattribute model was separately constructed using the improved combination rule of the D–S evidence theory, and a series of decision thresholds were set to give the final entity matching results. We tested our method with a dataset containing Baidu POIs and Gaode POIs from Beijing. The results showed the following—(1) the multiattribute matching model based on improved DS evidence theory had good performance in terms of precision, recall, and F1 for entity-matching from different datasets; (2) among all models, the model combining the spatial, name, and category (SNC) attributes obtained the best performance in the POI entity matching process; and (3) the method could effectively address the low precision of entity matching using a single feature attribute.
Collapse
|