1
|
Cai H, Lan L, Zhang J, Zhang X, Luo Z. SiamDF: Tracking training data-free siamese tracker. Neural Netw 2023; 165:705-720. [PMID: 37385024 DOI: 10.1016/j.neunet.2023.06.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 03/19/2023] [Accepted: 06/06/2023] [Indexed: 07/01/2023]
Abstract
Much progress has been made in siamese tracking, primarily benefiting from increasing huge training data. However, very little attention has been really paid to the role of huge training data in learning an effective siamese tracker. In this study, we undertake an in-depth analysis of this issue from a novel optimization perspective, and observe that training data is particularly adept at background suppression, thereby refining target representation. Inspired by this insight, we present a data-free siamese tracking algorithm named SiamDF, which requires only a pre-trained backbone and no further fine-tuning on additional training data. Particularly, to suppress background distractors, we separately improve two branches of siamese tracking by retaining the pure target region as target input with the removal of template background, and by exploring an efficient inverse transformation to maintain the constant aspect ratio of target state in search region. Besides, we further promote the center displacement prediction of the entire backbone by eliminating its spatial stride deviations caused by convolution-like quantification operations. Our experimental results on several popular benchmarks demonstrate that SiamDF, free from both offline fine-tuning and online update, achieves impressive performance compared to well-established unsupervised and supervised tracking methods.
Collapse
Affiliation(s)
- Huayue Cai
- Institute for Quantum & State Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha, China
| | - Long Lan
- Institute for Quantum & State Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha, China
| | - Jing Zhang
- School of Computer Science, University of Sydney, Sydney, Australia
| | - Xiang Zhang
- Institute for Quantum & State Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha, China; Laboratory of Digitizing Software for Frontier Equipment, National University of Defense Technology, Changsha, China.
| | - Zhigang Luo
- Science and Technology on Parallel and Distributed Laboratory, College of Computer, National University of Defense Technology, Changsha, China
| |
Collapse
|
2
|
Liu Y, Yan H, Zhang W, Li M, Liu L. An adaptive spatiotemporal correlation filtering visual tracking method. PLoS One 2023; 18:e0279240. [PMID: 36607906 PMCID: PMC9821422 DOI: 10.1371/journal.pone.0279240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Accepted: 12/03/2022] [Indexed: 01/07/2023] Open
Abstract
Discriminative correlation filter (DCF) tracking algorithms are commonly used for visual tracking. However, we observed that different spatio-temporal targets exhibit varied visual appearances, and most DCF-based trackers neglect to exploit this spatio-temporal information during the tracking process. To address the above-mentioned issues, we propose a three-way adaptive spatio-temporal correlation filtering tracker, named ASCF, that makes fuller use of the spatio-temporal information during tracking. To be specific, we extract rich local and global visual features based on the Conformer network, establish three correlation filters at different spatio-temporal locations during the tracking process, and the three correlation filters independently track the target. Then, to adaptively select the correlation filter to achieve target tracking, we employ the average peak-to-correlation energy (APCE) and the peak-to-sidelobe ratio (PSR) to measure the reliability of the tracking results. In addition, we propose an adaptive model update strategy that adjusts the update frequency of the three correlation filters in different ways to avoid model drift due to the introduction of similar objects or background noise. Extensive experimental results on five benchmarks demonstrate that our algorithm achieves excellent performance compared to state-of-the-art trackers.
Collapse
Affiliation(s)
- Yuhan Liu
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| | - He Yan
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
- * E-mail:
| | - Wei Zhang
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| | - Mengxue Li
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| | - Lingkun Liu
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| |
Collapse
|
3
|
Zhang Y, Wang C, Shi Q, Feng Y, Chen C. Adversarial gradient-based meta learning with metric-based test. Knowl Based Syst 2023. [DOI: 10.1016/j.knosys.2023.110312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
4
|
Yuan D, Shu X, Liu Q, Zhang X, He Z. Robust thermal infrared tracking via an adaptively multi-feature fusion model. Neural Comput Appl 2023; 35:3423-3434. [PMID: 36245795 PMCID: PMC9553631 DOI: 10.1007/s00521-022-07867-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Accepted: 09/21/2022] [Indexed: 01/31/2023]
Abstract
When dealing with complex thermal infrared (TIR) tracking scenarios, the single category feature is not sufficient to portray the appearance of the target, which drastically affects the accuracy of the TIR target tracking method. In order to address these problems, we propose an adaptively multi-feature fusion model (AMFT) for the TIR tracking task. Specifically, our AMFT tracking method adaptively integrates hand-crafted features and deep convolutional neural network (CNN) features. In order to accurately locate the target position, it takes advantage of the complementarity between different features. Additionally, the model is updated using a simple but effective model update strategy to adapt to changes in the target during tracking. In addition, a simple but effective model update strategy is adopted to adapt the model to the changes of the target during the tracking process. We have shown through ablation studies that the adaptively multi-feature fusion model in our AMFT tracking method is very effective. Our AMFT tracker performs favorably on PTB-TIR and LSOTB-TIR benchmarks compared with state-of-the-art trackers.
Collapse
Affiliation(s)
- Di Yuan
- Guangzhou Institute of Technology, Xidian University, Guangzhou, 510555 China
| | - Xiu Shu
- School of Science, Harbin Institute of Technology, Shenzhen, 518055 China
| | - Qiao Liu
- National Center for Applied Mathematics in Chongqing, Chongqing Normal University, Chongqing, 401331 China
| | - Xinming Zhang
- School of Science, Harbin Institute of Technology, Shenzhen, 518055 China
| | - Zhenyu He
- School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, 518055 China
| |
Collapse
|
5
|
Liang W, Ding D, Wei G. Siamese visual tracking combining granular level multi-scale features and global information. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109435] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
6
|
Huang B, Xu T, Shen Z, Jiang S, Zhao B, Bian Z. SiamATL: Online Update of Siamese Tracking Network via Attentional Transfer Learning. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:7527-7540. [PMID: 33417585 DOI: 10.1109/tcyb.2020.3043520] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Visual object tracking with semantic deep features has recently attracted much attention in computer vision. Especially, Siamese trackers, which aim to learn a decision making-based similarity evaluation, are widely utilized in the tracking community. However, the online updating of the Siamese fashion is still a tricky issue due to the limitation, which is a tradeoff between model adaption and degradation. To address such an issue, in this article, we propose a novel attentional transfer learning-based Siamese network (SiamATL), which fully exploits the previous knowledge to inspire the current tracker learning in the decision-making module. First, we explicitly model the template and surroundings by using an attentional online update strategy to avoid template pollution. Then, we introduce an instance-transfer discriminative correlation filter (ITDCF) to enhance the distinguishing ability of the tracker. Finally, we suggest a mutual compensation mechanism that integrates cross-correlation matching and ITDCF detection into the decision-making subnetwork to achieve online tracking. Comprehensive experiments demonstrate that our approach outperforms state-of-the-art tracking algorithms on multiple large-scale tracking datasets.
Collapse
|
7
|
Lightweight Target-Aware Attention Learning Network-Based Target Tracking Method. MATHEMATICS 2022. [DOI: 10.3390/math10132299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Siamese network trackers based on pre-trained depth features have achieved good performance in recent years. However, the pre-trained depth features are trained in advance on large-scale datasets, which contain feature information of a large number of objects. There may be a pair of interference and redundant information for a single tracking target. To learn a more accurate target feature information, this paper proposes a lightweight target-aware attention learning network to learn the most effective channel features of the target online. The lightweight network uses a designed attention learning loss function to learn a series of channel features with weights online with no complex parameters. Compared with the pre-trained features, the channel features with weights can represent the target more accurately. Finally, the lightweight target-aware attention learning network is unified into a Siamese tracking network framework to implement target tracking effectively. Experiments on several datasets demonstrate that the tracker proposed in this paper has good performance.
Collapse
|
8
|
Multiple Traffic Target Tracking with Spatial-Temporal Affinity Network. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:9693767. [PMID: 35655505 PMCID: PMC9152393 DOI: 10.1155/2022/9693767] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 04/12/2022] [Accepted: 05/06/2022] [Indexed: 11/18/2022]
Abstract
Traffic target tracking is a core task in intelligent transportation system because it is useful for scene understanding and vehicle autonomous driving. Most state-of-the-art (SOTA) multiple object tracking (MOT) methods adopt a two-step procedure: object detection followed by data association. The object detection has made great progress with the development of deep learning. However, the data association still heavily depends on hand crafted constraints, such as appearance, shape, and motion, which need to be elaborately trained for a special object. In this study, a spatial-temporal encoder-decoder affinity network is proposed for multiple traffic targets tracking, aiming to utilize the power of deep learning to learn a robust spatial-temporal affinity feature of the detections and tracklets for data association. The proposed spatial-temporal affinity network contains a two-stage transformer encoder module to encode the features of the detections and the tracked targets at the image level and the tracklet level, aiming to capture the spatial correlation and temporal history information. Then, a spatial transformer decoder module is designed to compute the association affinity, where the results from the two-stage transformer encoder module are fed back to fully capture and encode the spatial and temporal information from the detections and the tracklets of the tracked targets. Thus, efficient affinity computation can be applied to perform data association in online tracking. To validate the effectiveness of the proposed method, three popular multiple traffic target tracking datasets, KITTI, UA-DETRAC, and VisDrone, are used for evaluation. On the KITTI dataset, the proposed method is compared with 15 SOTA methods and achieves 86.9% multiple object tracking accuracy (MOTA) and 85.71% multiple object tracking precision (MOTP). On the UA-DETRAC dataset, 12 SOTA methods are used to compare with the proposed method, and the proposed method achieves 20.82% MOTA and 35.65% MOTP, respectively. On the VisDrone dataset, the proposed method is compared with 10 SOTA trackers and achieves 40.5% MOTA and 74.1% MOTP, respectively. All those experimental results show that the proposed method is competitive to the state-of-the-art methods by obtaining superior tracking performance.
Collapse
|
9
|
|
10
|
Technical Diagnosis on Elite Female Discus Athletes Based on Grey Relational Analysis. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:8504369. [PMID: 35463289 PMCID: PMC9033319 DOI: 10.1155/2022/8504369] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Revised: 03/21/2022] [Accepted: 03/24/2022] [Indexed: 11/29/2022]
Abstract
The personalized training of elite athletes is the key to the breakthrough of Chinese track and field events in the Tokyo Olympic Games. The brilliance of Chinese women's throwing benefits from the closed-loop personalized training system. Training starts from the construction of accurate personalized technology and physical fitness model. This paper introduces the concept of champion model and puts forward a targeted technical training system based on the differentiation research of champion model. This paper mainly studies and analyzes some important technical parameters and achievements of elite female discus athletes aged 2018∼2021 by using the methods of Pearson, partial correlation and grey correlation analysis. We select from several technical parameters with significant differences, choose from several technical parameters that have significant difference, and calculate the correlation parameters and results. The results show that the influence degree of these technical parameters is as follows: torso angle of right foot touching the ground, discus release angle, discus release speed, shoulder and arm passing angle of left foot off the ground, discus moving distance of double support, center of mass moving distance of double support, and time of the second single support stage. This is different from our view that the hand speed is the most important, so the training of elite athletes should be more refined and specialized to promote the improvement of their performance. Through the application of technical diagnosis model in Chen Yang in a period of time, Chen Yang got her best result (65.14 m) in Chongqing Athletics Championship, which verified the success of the technical diagnosis model and application in this study.
Collapse
|
11
|
Fast Tracking Algorithm Based on Spatial Regularization Correlation Filter. INFORMATION 2022. [DOI: 10.3390/info13040184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
To solve the problem of the redundant number of training samples in a correlation filter-based tracking algorithm, the training samples were implicitly extended by circular shifts of the given target patches, and all the extended samples were used as negative samples for the fast online learning of the filter. Since all these shifted patches were not true negative samples of the target, the tracking process suffered from boundary effects, especially in challenging situations such as occlusion and background clutter, which can significantly impair the tracking performance of the tracker. Spatial regularization in the SRDCF tracking algorithm is an effective way to mitigate boundary effects, but it comes at the cost of highly increased time complexity, resulting in a very slow tracking speed of the SRDCF algorithm that cannot achieve a real-time tracking effect. To address this issue, we proposed a fast-tracking algorithm based on spatially regularized correlation filters that efficiently optimized the solved filters by replacing the Gauss–Seidel method in the SRDCF algorithm with the alternating direction multiplier method. The problem of slow speed in the SRDCF tracking algorithm improved, and the improved FSRCF algorithm achieved real-time tracking. An adaptive update mechanism was proposed by using the feedback from the high confidence tracking results to avoid model corruption. That is, a robust confidence evaluation criterion was introduced in the model update phase, which combined the maximum response criterion and the average peak correlation energy APCE criterion to determine whether to update the filter, thereby avoiding filter model drift and improving the target tracking accuracy and speed. We conducted extensive experiments on datasets OTB-2015, OTB-2013, UAV123, and TC128, and the experimental results show that the proposed algorithm exhibits a more stable and accurate tracking performance in the presence of occlusion and background clutter during tracking.
Collapse
|
12
|
Real-Time Ship Tracking under Challenges of Scale Variation and Different Visibility Weather Conditions. JOURNAL OF MARINE SCIENCE AND ENGINEERING 2022. [DOI: 10.3390/jmse10030444] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Visual ship tracking provides crucial kinematic traffic information to maritime traffic participants, which helps to accurately predict ship traveling behaviors in the near future. Traditional ship tracking models obtain a satisfactory performance by exploiting distinct features from maritime images, which may fail when the ship scale varies in image sequences. Moreover, previous frameworks have not paid much attention to weather condition interferences (e.g., visibility). To address this challenge, we propose a scale-adaptive ship tracking framework with the help of a kernelized correlation filter (KCF) and a log-polar transformation operation. First, the proposed ship tracker employs a conventional KCF model to obtain the raw ship position in the current maritime image. Second, both the previous step output and ship training sample are transformed into a log-polar coordinate system, which are further processed with the correlation filter to determine ship scale factor and to suppress the negative influence of the weather conditions. We verify the proposed ship tracker performance on three typical maritime scenarios under typical navigational weather conditions (i.e., sunny, fog). The findings of the study can help traffic participants efficiently obtain maritime situation awareness information from maritime videos, in real time, under different visibility weather conditions.
Collapse
|
13
|
He X, Chen CYC. Exploring reliable visual tracking via target embedding network. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.108584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
14
|
Adaptive segmentation model for liver CT images based on neural network and level set method. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.01.081] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
15
|
Zhang H, Chen J, Nie G, Lin Y, Yang G, Zhang W(C. Light regression memory and multi-perspective object special proposals for abrupt motion tracking. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.107127] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
16
|
Yadav SK, Tiwari K, Pandey HM, Akbar SA. A review of multimodal human activity recognition with special emphasis on classification, applications, challenges and future directions. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.106970] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
|
17
|
Yuan C, Yang L. Robust twin extreme learning machines with correntropy-based metric. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2020.106707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|