1
|
Chow TE, Yip PSF, Wong KP. An integrated framework of mobile crowd estimation for the 2019, July 1st rally in Hong Kong. Multimed Tools Appl 2023:1-18. [PMID: 37362686 PMCID: PMC10152033 DOI: 10.1007/s11042-023-15417-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Revised: 05/24/2022] [Accepted: 04/18/2023] [Indexed: 06/28/2023]
Abstract
Traditional approach of mobile crowd estimation involves counting a group of individuals at a specific place, manually, in real-time. It is a laborious exercise that can be physically and mentally demanding. In Hong Kong, a large rally can last more than six hours, making the manual count method susceptible to human errors. While crowd counting using object detection and tracking has been well-established in computer vision, such application has remained relatively small scale within a controlled indoor setting (e.g. counting people at fixed gateways in a mall). No attempt to date has applied the automatic crowd counting method to count hundreds of thousands of people along an open stretch of rally route within the complex urban outdoor landscape. This research proposed an integrated approach that combines the capture-recapture method in statistics and a Convolutional Neural Network (CNN) method in computer vision to count the mobile crowd. The research teams implemented the integrative approach and counted 276,970 people with a 95% confidence interval of 263,663 to 290,276 in the 2019, July 1st Rally in Hong Kong. This work counted the attendance of a large-scale rally as a proof of concept to fill in a gap in the empirical studies. The intellectual merits and research findings shed useful insights to improve mobile population estimation and leverage alternative data sources to support related scientific applications.
Collapse
Affiliation(s)
- T. Edwin Chow
- Department of Geography and Environmental Studies, Texas State University, San Marcos, TX 78666 USA
| | - Paul S. F. Yip
- Department of Social Work and Social Administration, University of Hong Kong, Hong Kong SAR, China
| | | |
Collapse
|
2
|
Lyu L, Han R, Chen Z. Cascaded parallel crowd counting network with multi-resolution collaborative representation. APPL INTELL 2023; 53:3002-3016. [PMID: 35607431 PMCID: PMC9117858 DOI: 10.1007/s10489-022-03639-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/14/2022] [Indexed: 01/14/2023]
Abstract
Accurately estimating the size and density distribution of a crowd from images is of great importance to public safety and crowd management during the COVID-19 pandemic, but it is very challenging as it is affected by many complex factors, including perspective distortion and background noise information. In this paper, we propose a novel multi-resolution collaborative representation framework called the cascaded parallel network (CP-Net), consisting of three parallel scale-specific branches connected in a cascading mode. In the framework, the three cascaded multi-resolution branches efficiently capture multi-scale features through their specific receptive fields. Additionally, multi-level feature fusion and information filtering are performed continuously on each branch to resist noise interference and perspective distortion. Moreover, we design an information exchange module across independent branches to refine the features extracted by each specific branch and deal with perspective distortion by using complementary information of multiple resolutions. To further improve the robustness of the network to scale variance and generate high-quality density maps, we construct a multi-receptive field fusion module to aggregate multi-scale features more comprehensively. The performance of our proposed CP-Net is verified on the challenging counting datasets (UCF_CC_50, UCF-QNRF, Shanghai Tech A&B, and WorldExpo'10), and the experimental results demonstrate the superiority of the proposed method.
Collapse
Affiliation(s)
- Lei Lyu
- grid.410585.d0000 0001 0495 1805School of Information Science and Engineering, Shandong Normal University, Jinan, 250358 China ,Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology, Jinan, 250358 China
| | - Run Han
- grid.410585.d0000 0001 0495 1805School of Information Science and Engineering, Shandong Normal University, Jinan, 250358 China ,Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology, Jinan, 250358 China
| | - Ziming Chen
- Shandong Zhengzhong Information Technology Co., LTD, Jinan, 250014 China ,Shandong Digital Applied Science Research Institute Co.,LTD, Jinan, 250101 China
| |
Collapse
|
3
|
Peng S, Yin B, Yang Q, He Q, Wang L. Exploring density rectification and domain adaption method for crowd counting. Neural Comput Appl 2023; 35:3551-3569. [PMID: 36267471 PMCID: PMC9568950 DOI: 10.1007/s00521-022-07917-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Accepted: 09/30/2022] [Indexed: 01/31/2023]
Abstract
Crowd counting has received increasing attention due to its important roles in multiple fields, such as social security, commercial applications, epidemic prevention and control. To this end, we explore two critical issues that seriously affect the performance of crowd counting including nonuniform crowd density distribution and cross-domain problems. Aiming at the nonuniform crowd density distribution issue, we propose a density rectifying network (DRNet) that consists of several dual-layer pyramid fusion modules (DPFM) and a density rectification map (DRmap) auxiliary learning module. The proposed DPFM is embedded into DRNet to integrate multi-scale crowd density features through dual-layer pyramid fusion. The devised DRmap auxiliary learning module further rectifies the incorrect crowd density estimation by adaptively weighting the initial crowd density maps. With respect to the cross-domain issue, we develop a domain adaptation method of randomly cutting mixed dual-domain images, which learns domain-invariance features and decreases the domain gap between the source domain and the target domain from global and local perspectives. Experimental results indicate that the devised DRNet achieves the best mean absolute error (MAE) and competitive mean squared error (MSE) compared with other excellent methods on four benchmark datasets. Additionally, a series of cross-domain experiments are conducted to demonstrate the effectiveness of the proposed domain adaption method. Significantly, when the A and B parts of the Shanghaitech dataset are the source domain and target domain respectively, the proposed domain adaption method decreases the MAE of DRNet by 47.6 % .
Collapse
Affiliation(s)
- Sifan Peng
- grid.59053.3a0000000121679639Department of Automation, University of Science and Technology of China, Huangshan Road, Hefei, 230027 Anhui China
| | - Baoqun Yin
- grid.59053.3a0000000121679639Department of Automation, University of Science and Technology of China, Huangshan Road, Hefei, 230027 Anhui China
| | - Qianqian Yang
- grid.59053.3a0000000121679639Department of Automation, University of Science and Technology of China, Huangshan Road, Hefei, 230027 Anhui China
| | - Qing He
- grid.59053.3a0000000121679639Department of Automation, University of Science and Technology of China, Huangshan Road, Hefei, 230027 Anhui China
| | - Luyang Wang
- grid.59053.3a0000000121679639Department of Automation, University of Science and Technology of China, Huangshan Road, Hefei, 230027 Anhui China
| |
Collapse
|
4
|
Li P, Zhang M, Wan J, Jiang M. DMPNet: densely connected multi-scale pyramid networks for crowd counting. PeerJ Comput Sci 2022; 8:e902. [PMID: 35494810 PMCID: PMC9044264 DOI: 10.7717/peerj-cs.902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 02/07/2022] [Indexed: 06/14/2023]
Abstract
Crowd counting has been widely studied by deep learning in recent years. However, due to scale variation caused by perspective distortion, crowd counting is still a challenging task. In this paper, we propose a Densely Connected Multi-scale Pyramid Network (DMPNet) for count estimation and the generation of high-quality density maps. The key component of our network is the Multi-scale Pyramid Network (MPN), which can extract multi-scale features of the crowd effectively while keeping the resolution of the input feature map and the number of channels unchanged. To increase the information transfer between the network layer, we used dense connections to connect multiple MPNs. In addition, we also designed a novel loss function, which can help our model achieve better convergence. To evaluate our method, we conducted extensive experiments on three challenging benchmark crowd counting datasets. Experimental results show that compared with the state-of-the-art algorithms, DMPNet performs well in both parameters and results. The code is available at: https://github.com/lpfworld/DMPNet.
Collapse
Affiliation(s)
- Pengfei Li
- Computer & Software School, Hangzhou Dianzi University, Hangzhou, Zhejiang, China
| | - Min Zhang
- Computer & Software School, Hangzhou Dianzi University, Hangzhou, Zhejiang, China
| | - Jian Wan
- Computer & Software School, Hangzhou Dianzi University, Hangzhou, Zhejiang, China
| | - Ming Jiang
- Computer & Software School, Hangzhou Dianzi University, Hangzhou, Zhejiang, China
| |
Collapse
|
5
|
Zhou F, Zhao H, Zhang Y, Zhang Q, Liang L, Li Y, Duan Z. COMAL: compositional multi-scale feature enhanced learning for crowd counting. Multimed Tools Appl 2022; 81:20541-20560. [PMID: 35291715 PMCID: PMC8914450 DOI: 10.1007/s11042-022-12249-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/23/2021] [Revised: 06/11/2021] [Accepted: 01/14/2022] [Indexed: 06/14/2023]
Abstract
Accurately modeling the crowd's head scale variations is an effective way to improve the counting accuracy of the crowd counting methods. Most counting networks apply a multi-branch network structure to obtain different scales of head features. Although they have achieved promising results, they do not perform very well on the extreme scale variation scene due to the limited scale representability. Meanwhile, these methods are prone to recognize background objects as foreground crowds in complex scenes due to the limited context and high-level semantic information. We propose a compositional multi-scale feature enhanced learning approach (COMAL) for crowd counting to handle the above limitations. COMAL enhances the multi-scale feature representations from three aspects: (1) The semantic enhanced module (SEM) is developed for embedding the high-level semantic information to the multi-scale features; (2) The diversity enhanced module (DEM) is proposed to enrich the variety of crowd features' different scales; (3) The context enhanced module (CEM) is designed for strengthening the multi-scale features with more context information. Based on the proposed COMAL, we develop a crowd counting network under the encoder-decoder framework and perform extensive experiments on ShanghaiTech, UCF_CC_50, and UCF-QNRF datasets. Qualitative and quantitive results demonstrate the effectiveness of the proposed COMAL.
Collapse
Affiliation(s)
- Fangbo Zhou
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Huailin Zhao
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Yani Zhang
- School of Computer Science and Information Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Qing Zhang
- School of Computer Science and Information Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Lanjun Liang
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Yaoyao Li
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Zuodong Duan
- Science and Technology on Electromechanical Dynamic Control Laboratory, School of Mechatronical Engineering, Beijing Institute of Technology, Beijing, China
| |
Collapse
|
6
|
Abstract
Developments in technology have facilitated the emergence of new crowd counting organisations. Some of the organisations have established platforms to disseminate their data, making it available to researchers for the first time. These databases promise to increase the quality and quantity of research in various fields. In the late 2010s, specialist crowd counting organisations emerged with the sole purpose of counting crowds at protests and disseminating the results, sometimes in a purely partisan manner. Because of the contemporary relevance of protest behaviour, we frame our discussion within this context. For social scientists considering the utilisation of these new databases, it is essential that crowd numbers be linked to underlying human behaviour in a way that promises a chain of connections to investigate and explore. We use behavioural economics to show why relative crowd size may be important for human decision-makers. And we show how the significance of relative crowd size relates to other aspects of the human decision-making process, including risk preferences and probability assessments. Far from being a theory of protest behaviour, we present a behavioural economics-based primer for empirical researchers and social scientists engaging with newly available crowd counting data. The conclusions may apply in other contexts and might be extended to encompass specific types of behaviour, including aggression and violence. Indeed, the conclusions may guide the analysis of the emergence of the crowd counting organisations themselves.
Collapse
Affiliation(s)
- Peter J Phillips
- Faculty of Business, Education, Law and Arts, University of Southern Queensland, Toowoomba, QLD 4350 Australia
| | - Gabriela Pohl
- Faculty of Business, Education, Law and Arts, University of Southern Queensland, Toowoomba, QLD 4350 Australia
| |
Collapse
|