1
|
Yu M, Li W, Yu Y, Zhao Y, Xiao L, Lauschke VM, Cheng Y, Zhang X, Wang Y. Deep learning large-scale drug discovery and repurposing. NATURE COMPUTATIONAL SCIENCE 2024; 4:600-614. [PMID: 39169261 DOI: 10.1038/s43588-024-00679-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Accepted: 07/17/2024] [Indexed: 08/23/2024]
Abstract
Large-scale drug discovery and repurposing is challenging. Identifying the mechanism of action (MOA) is crucial, yet current approaches are costly and low-throughput. Here we present an approach for MOA identification by profiling changes in mitochondrial phenotypes. By temporally imaging mitochondrial morphology and membrane potential, we established a pipeline for monitoring time-resolved mitochondrial images, resulting in a dataset comprising 570,096 single-cell images of cells exposed to 1,068 United States Food and Drug Administration-approved drugs. A deep learning model named MitoReID, using a re-identification (ReID) framework and an Inflated 3D ResNet backbone, was developed. It achieved 76.32% Rank-1 and 65.92% mean average precision on the testing set and successfully identified the MOAs for six untrained drugs on the basis of mitochondrial phenotype. Furthermore, MitoReID identified cyclooxygenase-2 inhibition as the MOA of the natural compound epicatechin in tea, which was successfully validated in vitro. Our approach thus provides an automated and cost-effective alternative for target identification that could accelerate large-scale drug discovery and repurposing.
Collapse
Affiliation(s)
- Min Yu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| | | | - Yunru Yu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| | - Yu Zhao
- School of Basic Medical Sciences, Zhejiang Chinese Medical University, Hangzhou, China
| | - Lizhi Xiao
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
| | - Volker M Lauschke
- Department of Physiology and Pharmacology, Karolinska Institute, Stockholm, Sweden
| | - Yiyu Cheng
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China.
- State Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing, China.
| | - Xingcai Zhang
- Department of Materials Science and Engineering, Stanford University, Stanford, CA, USA.
| | - Yi Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China.
- State Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing, China.
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Hangzhou, China.
- Center for system biology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
2
|
Chen X, Wang J, Liu Q. The visual statistical learning overcomes scene dissimilarity through an independent clustering process. J Vis 2024; 24:5. [PMID: 39110583 PMCID: PMC11314707 DOI: 10.1167/jov.24.8.5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 07/03/2024] [Indexed: 08/11/2024] Open
Abstract
Contextual cueing is a phenomenon of visual statistical learning observed in visual search tasks. Previous research has found that the degree of deviation of items from its centroid, known as variability, determines the extent of generalization for that repeated scene. Introducing variability increases dissimilarity between multiple occurrences of the same repeated layout significantly. However, current theories do not explain the mechanisms that help to overcome this dissimilarity during contextual cue learning. We propose that the cognitive system initially abstracts specific scenes into scene layouts through an automatic clustering unrelated to specific repeated scenes, and subsequently uses these abstracted scene layouts for contextual cue learning. Experiment 1 indicates that introducing greater variability in search scenes leads to a hindering in the contextual cue learning. Experiment 2 further establishes that conducting extensive visual searches involving spatial variability in entirely novel scenes facilitates subsequent contextual cue learning involving corresponding scene variability, confirming that learning clustering knowledge precedes the contextual cue learning and is independent of specific repeated scenes. Overall, this study demonstrates the existence of multiple levels of learning in visual statistical learning, where item-level learning can serve as material for layout-level learning, and the generalization reflects the constraining role of item-level knowledge on layout-level knowledge.
Collapse
Affiliation(s)
- Xiaoyu Chen
- Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian, China
| | - Jie Wang
- Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian, China
| | - Qiang Liu
- Institute of Brain and Psychological Sciences, Sichuan Normal University, Chengdu, China
| |
Collapse
|
3
|
Andrews TJ, Rogers D, Mileva M, Watson DM, Wang A, Burton AM. A narrow band of image dimensions is critical for face recognition. Vision Res 2023; 212:108297. [PMID: 37527594 DOI: 10.1016/j.visres.2023.108297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 07/07/2023] [Accepted: 07/12/2023] [Indexed: 08/03/2023]
Abstract
A key challenge in human and computer face recognition is to differentiate information that is diagnostic for identity from other sources of image variation. Here, we used a combined computational and behavioural approach to reveal critical image dimensions for face recognition. Behavioural data were collected using a sorting and matching task with unfamiliar faces and a recognition task with familiar faces. Principal components analysis was used to reveal the dimensions across which the shape and texture of faces in these tasks varied. We then asked which image dimensions were able to predict behavioural performance across these tasks. We found that the ability to predict behavioural responses in the unfamiliar face tasks increased when the early PCA dimensions (i.e. those accounting for most variance) of shape and texture were removed from the analysis. Image similarity also predicted the output of a computer model of face recognition, but again only when the early image dimensions were removed from the analysis. Finally, we found that recognition of familiar faces increased when the early image dimensions were removed, decreased when intermediate dimensions were removed, but then returned to baseline recognition when only later dimensions were removed. Together, these findings suggest that early image dimensions reflect ambient changes, such as changes in viewpoint or lighting, that do not contribute to face recognition. However, there is a narrow band of image dimensions for shape and texture that are critical for the recognition of identity in humans and computer models of face recognition.
Collapse
Affiliation(s)
| | - Daniel Rogers
- Department of Psychology, University of York, York YO10 5DD, UK
| | - Mila Mileva
- Department of Psychology, University of York, York YO10 5DD, UK
| | - David M Watson
- Department of Psychology, University of York, York YO10 5DD, UK
| | - Ao Wang
- Department of Psychology, University of York, York YO10 5DD, UK
| | - A Mike Burton
- Department of Psychology, University of York, York YO10 5DD, UK
| |
Collapse
|
4
|
Li X, Tyagi A. Block-Active ADMM to Minimize NMF with Bregman Divergences. SENSORS (BASEL, SWITZERLAND) 2023; 23:7229. [PMID: 37631765 PMCID: PMC10459034 DOI: 10.3390/s23167229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Revised: 08/10/2023] [Accepted: 08/16/2023] [Indexed: 08/27/2023]
Abstract
Over the last ten years, there has been a significant interest in employing nonnegative matrix factorization (NMF) to reduce dimensionality to enable a more efficient clustering analysis in machine learning. This technique has been applied in various image processing applications within the fields of computer vision and sensor-based systems. Many algorithms exist to solve the NMF problem. Among these algorithms, the alternating direction method of multipliers (ADMM) and its variants are one of the most popular methods used in practice. In this paper, we propose a block-active ADMM method to minimize the NMF problem with general Bregman divergences. The subproblems in the ADMM are solved iteratively by a block-coordinate-descent-type (BCD-type) method. In particular, each block is chosen directly based on the stationary condition. As a result, we are able to use much fewer auxiliary variables and the proposed algorithm converges faster than the previously proposed algorithms. From the theoretical point of view, the proposed algorithm is proved to converge to a stationary point sublinearly. We also conduct a series of numerical experiments to demonstrate the superiority of the proposed algorithm.
Collapse
Affiliation(s)
| | - Akhilesh Tyagi
- Department of Electrical and Computer Engineering, Iowa State University, Ames, IA 50010, USA;
| |
Collapse
|
5
|
Balas B, Sandford A, Ritchie K. Not the norm: Face likeness is not the same as similarity to familiar face prototypes. Iperception 2023; 14:20416695231171355. [PMID: 37151573 PMCID: PMC10161317 DOI: 10.1177/20416695231171355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Accepted: 04/06/2023] [Indexed: 05/09/2023] Open
Abstract
Face images depicting the same individual can differ substantially from one another. Ecological variation in pose, expression, lighting, and other sources of appearance variability complicates the recognition and matching of unfamiliar faces, but acquired familiarity leads to the ability to cope with these challenges. Among the many ways that face of the same individual can vary, some images are judged to be better likenesses of familiar individuals than others. Simply put, these images look more like the individual under consideration than others. But what does it mean for an image to be a better likeness than another? Does likeness entail typicality, or is it something distinct from this? We examined the relationship between the likeness of face images and the similarity of those images to average images of target individuals using a set of famous faces selected for reciprocal familiarity/unfamiliarity across US and UK participants. We found that though likeness judgments are correlated with similarity-to-prototype judgments made by both familiar and unfamiliar participants, this correlation was smaller than the correlation between similarity judgments made by different participant groups. This implies that while familiarity weakens the relationship between likeness and similarity-to-prototype judgments, it does not change similarity-to-prototype judgments to the same degree.
Collapse
Affiliation(s)
- Benjamin Balas
- Psychology Department, North Dakota State University, Fargo, ND, USA
| | - Adam Sandford
- Psychology Department, University of Guelph-Humber, Toronto, Ontario, Canada
| | - Kay Ritchie
- School of Psychology, University of Lincoln, Lincoln, UK
| |
Collapse
|
6
|
Cai X, Liu L, Zhu L, Zhang H. Dual-modality hard mining triplet-center loss for visible infrared person re-identification. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.106772] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
7
|
|
8
|
Hayes S. Analysing Texture in Portraits. Perception 2020; 49:1283-1310. [PMID: 33302773 DOI: 10.1177/0301006620975705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
This case study is an initial exploration as to whether the depiction of texture in a set of portraits, all portraying the same Sitter, is related to the familiar likeness assessments reported in a companion paper containing a principal component analysis (PCA) of the portraits' depiction of shape. Somewhat unexpectedly, a texture PCA failed to discriminate the high from low likeness portraits, despite experimentation with different pre-processing methods to reduce the portraits' high level of uninformative, image-level texture variability. There were some findings arising from these analyses, and while only indicative at this stage, include that linear histogram matching is effective in reducing variability in portrait brightness; that depicting, and perhaps exaggerating, shading relating to lighting direction may enhance portrait likeness; and, that whether minimised or exaggerated, lighting direction can be portrayed somewhat anomalously. Furthermore, and in agreement with findings from photographs, shape and texture were not found to be independent variables, and shape-free image registration, while very usefully enabling a comparison of closely corresponding pixel coordinate values, could itself be a confounding factor for undertaking a texture PCA with portraits produced under relatively ambient conditions.
Collapse
|
9
|
Wang Z. Eco-tourism benefit evaluation of Yellow River based on principal component analysis. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2020. [DOI: 10.3233/jifs-189288] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
In the era of big data transformation, with the emergence of COVID-19, tourism has been given more social responsibilities. Tourism construction in the Yellow River Basin is an indispensable part of tourism construction in China. This paper analyzes the existing eco-tourism resources in Kaifeng City and Shandong Province, as well as the necessity and construction conditions of developing tourism. In this paper, principal component analysis is used to analyze the resource conditions, regional conditions and environmental conditions of the Yellow River tourism resources. The comprehensive evaluation model and index system of tourism resources are constructed. Big data transformation has been realized. The purpose of this paper is to clarify the current situation and potential of tourism in the Yellow River Basin, and to provide reference for the development of tourism in the Yellow River Basin during COVID-19.
Collapse
Affiliation(s)
- Zhenpeng Wang
- KaiFeng University, Kaifeng DongJing Avenue, Henan, China
| |
Collapse
|
10
|
Deep Learning of Appearance Affinity for Multi-Object Tracking and Re-Identification: A Comparative View. ELECTRONICS 2020. [DOI: 10.3390/electronics9111757] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Recognizing the identity of a query individual in a surveillance sequence is the core of Multi-Object Tracking (MOT) and Re-Identification (Re-Id) algorithms. Both tasks can be addressed by measuring the appearance affinity between people observations with a deep neural model. Nevertheless, the differences in their specifications and, consequently, in the characteristics and constraints of the available training data for each one of these tasks, arise from the necessity of employing different learning approaches to attain each one of them. This article offers a comparative view of the Double-Margin-Contrastive and the Triplet loss function, and analyzes the benefits and drawbacks of applying each one of them to learn an Appearance Affinity model for Tracking and Re-Identification. A batch of experiments have been conducted, and their results support the hypothesis concluded from the presented study: Triplet loss function is more effective than the Contrastive one when an Re-Id model is learnt, and, conversely, in the MOT domain, the Contrastive loss can better discriminate between pairs of images rendering the same person or not.
Collapse
|
11
|
|
12
|
Liang G, Lan X, Chen X, Zheng K, Wang S, Zheng N. Cross-View Person Identification Based on Confidence-Weighted Human Pose Matching. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2019; 28:3821-3835. [PMID: 30794171 DOI: 10.1109/tip.2019.2899782] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Cross-view person identification (CVPI) from multiple temporally synchronized videos taken by multiple wearable cameras from different, varying views is a very challenging but important problem, which has attracted more interest recently. Current state-of-the-art performance of CVPI is achieved by matching appearance and motion features across videos, while the matching of pose features does not work effectively given the high inaccuracy of the 3D pose estimation on videos/images collected in the wild. To address this problem, we first introduce a new metric of confidence to the estimated location of each human-body joint in 3D human pose estimation. Then, a mapping function, which can be hand-crafted or learned directly from the datasets, is proposed to combine the inaccurately estimated human pose and the inferred confidence metric to accomplish CVPI. Specifically, the joints with higher confidence are weighted more in the pose matching for CVPI. Finally, the estimated pose information is integrated into the appearance and motion features to boost the CVPI performance. In the experiments, we evaluate the proposed method on three wearable-camera video datasets and compare the performance against several other existing CVPI methods. The experimental results show the effectiveness of the proposed confidence metric, and the integration of pose, appearance, and motion produces a new state-of-the-art CVPI performance.
Collapse
|
13
|
Power Quality Disturbance Monitoring and Classification Based on Improved PCA and Convolution Neural Network for Wind-Grid Distribution Systems. ENERGIES 2019. [DOI: 10.3390/en12071280] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The excessive use of power semiconductor devices in a grid utility increases the malfunction of the control system, produces power quality disturbances (PQDs) and reduces the electrical component life. The present work proposes a novel algorithm based on Improved Principal Component Analysis (IPCA) and 1-Dimensional Convolution Neural Network (1-D-CNN) for detection and classification of PQDs. Firstly, IPCA is used to extract the statistical features of PQDs such as Root Mean Square, Skewness, Range, Kurtosis, Crest Factor, Form Factor. IPCA is decomposed into four levels. The principal component (PC) is obtained by IPCA, and it contains a maximum amount of original data as compare to PCA. 1-D-CNN is also used to extract features such as mean, energy, standard deviation, Shannon entropy, and log-energy entropy. The statistical analysis is employed for optimal feature selection. Secondly, these improved features of the PQDs are fed to the 1-D-CNN-based classifier to gain maximum classification accuracy. The proposed IPCA-1-D-CNN is utilized for classification of 12 types of synthetic and simulated single and multiple PQDs. The simulated PQDs are generated from a modified IEEE bus system with wind energy penetration in the balanced distribution system. Finally, the proposed IPCA-1-D-CNN algorithm has been tested with noise (50 dB to 20 dB) and noiseless environment. The obtained results are compared with SVM and other existing techniques. The comparative results show that the proposed method gives significantly higher classification accuracy.
Collapse
|
14
|
Zhong W, Jiang L, Zhang T, Ji J, Xiong H. Combining multilevel feature extraction and multi-loss learning for person re-identification. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2019.01.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
15
|
Srivastava M, Suvarna S, Srivastava A, Bharathiraja S. Automated emergency paramedical response system. Health Inf Sci Syst 2018; 6:22. [PMID: 30483400 DOI: 10.1007/s13755-018-0061-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2018] [Accepted: 10/19/2018] [Indexed: 11/29/2022] Open
Abstract
With the evolution of technology, the fields of medicine and science have also witnessed numerous advancements. In medical emergencies, a few minutes can be the difference between life and death. The obstacles encountered while providing medical assistance can be eliminated by ensuring quicker care and accessible systems. To this effect, the proposed end-to-end system-automated emergency paramedical response system (AEPRS) is semi-autonomous and utilizes aerial distribution by drones, for providing medical supplies on site in cases of paramedical emergencies as well as for patients with a standing history of diseases. Security of confidential medical information is a major area of concern for patients. Confidentiality has been achieved by using decentralised distributed computing to ensure security for the users without involving third-party institutions. AEPRS focuses not only on urban areas but also on semi-urban and rural areas. In urban areas where access to internet is widely available, a healthcare chatbot caters to the individual users and provides a diagnosis based on the symptoms provided by the patients. In semi-urban and rural areas, community hospitals have the option of providing specialised healthcare in spite of the absence of a specialised doctor. Additionally, object recognition and face recognition by using the concept of edge AI enables deep neural networks to run on the edge, without the need for GPU or internet connectivity to connect to the cloud. AEPRS is an airborne emergency medical supply delivery system. It uses the data entered by the user to deduce the best possible solution, in case of an alerted emergency situation and responds to the user accordingly.
Collapse
Affiliation(s)
- Mashrin Srivastava
- School of Computing Science and Engineering, Vellore Institute of Technology, Chennai, India
| | - Saumya Suvarna
- School of Computing Science and Engineering, Vellore Institute of Technology, Chennai, India
| | - Apoorva Srivastava
- School of Computing Science and Engineering, Vellore Institute of Technology, Chennai, India
| | - S Bharathiraja
- School of Computing Science and Engineering, Vellore Institute of Technology, Chennai, India
| |
Collapse
|
16
|
Ahmad F, Dar WM. Classification of Alzheimer's Disease Stages: An Approach Using PCA-Based Algorithm. Am J Alzheimers Dis Other Demen 2018; 33:433-439. [PMID: 30058341 PMCID: PMC10852453 DOI: 10.1177/1533317518790038] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Early diagnosis of Alzheimer's disease (AD) allows individuals and their health managers to manage healthier medication. We proposed an approach for classification of AD stages, with respect to principal component analysis (PCA)-based algorithm. The PCA has been extensively applied as the most auspicious face-recognition algorithm. For the proposed algorithm, 100 images of 10 children were transformed for feature extraction and covariance matrix was constructed to obtain eigenvalues. The eigenvector provided a useful framework for face recognition. For the classification of AD stages, magnetic resonance imaging (MRI) and functional magnetic resonance imaging (fMRI) data were obtained from Alzheimer's Disease Neuroimaging Initiative database. Hippocampus is one of the most affected regions by AD. Thus, we selected clusters of voxels from the "hippocampus" of AD screening stage (mild cognitive impairment), AD stage 1, AD stage 2, and AD stage 3. By using eigenvectors corresponding to maximum eigenvalues of fMRI data, the purposed algorithm classified the voxels of AD stages effectively.
Collapse
Affiliation(s)
- Fayyaz Ahmad
- Department of Statistics, University of Gujrat, Gujrat, Pakistan
| | | |
Collapse
|
17
|
An Efficient Multiscale Scheme Using Local Zernike Moments for Face Recognition. APPLIED SCIENCES-BASEL 2018. [DOI: 10.3390/app8050827] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
18
|
Ragab A, de Carné de Carnavalet X, Yacout S, Ouali MS. Face recognition using multi-class Logical Analysis of Data. PATTERN RECOGNITION AND IMAGE ANALYSIS 2017. [DOI: 10.1134/s1054661817020092] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
19
|
Wu A, Zheng WS, Lai JH. Robust Depth-Based Person Re-Identification. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2017; 26:2588-2603. [PMID: 28252397 DOI: 10.1109/tip.2017.2675201] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Person re-identification (re-id) aims to match people across non-overlapping camera views. So far the RGB-based appearance is widely used in most existing works. However, when people appeared in extreme illumination or changed clothes, the RGB appearance-based re-id methods tended to fail. To overcome this problem, we propose to exploit depth information to provide more invariant body shape and skeleton information regardless of illumination and color change. More specifically, we exploit depth voxel covariance descriptor and further propose a locally rotation invariant depth shape descriptor called Eigen-depth feature to describe pedestrian body shape. We prove that the distance between any two covariance matrices on the Riemannian manifold is equivalent to the Euclidean distance between the corresponding Eigen-depth features. Furthermore, we propose a kernelized implicit feature transfer scheme to estimate Eigen-depth feature implicitly from RGB image when depth information is not available. We find that combining the estimated depth features with RGB-based appearance features can sometimes help to better reduce visual ambiguities of appearance features caused by illumination and similar clothes. The effectiveness of our models was validated on publicly available depth pedestrian datasets as compared to related methods for re-id.
Collapse
|
20
|
Choi S, Shin JH, Lee J, Sheridan P, Lu WD. Experimental Demonstration of Feature Extraction and Dimensionality Reduction Using Memristor Networks. NANO LETTERS 2017; 17:3113-3118. [PMID: 28437615 DOI: 10.1021/acs.nanolett.7b00552] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Memristors have been considered as a leading candidate for a number of critical applications ranging from nonvolatile memory to non-Von Neumann computing systems. Feature extraction, which aims to transform input data from a high-dimensional space to a space with fewer dimensions, is an important technique widely used in machine learning and pattern recognition applications. Here, we experimentally demonstrate that memristor arrays can be used to perform principal component analysis, one of the most commonly used feature extraction techniques, through online, unsupervised learning. Using Sanger's rule, that is, the generalized Hebbian algorithm, the principal components were obtained as the memristor conductances in the network after training. The network was then used to analyze sensory data from a standard breast cancer screening database with high classification success rate (97.1%).
Collapse
Affiliation(s)
- Shinhyun Choi
- Department of Electrical Engineering and Computer Science, University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Jong Hoon Shin
- Department of Electrical Engineering and Computer Science, University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Jihang Lee
- Department of Electrical Engineering and Computer Science, University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Patrick Sheridan
- Department of Electrical Engineering and Computer Science, University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Wei D Lu
- Department of Electrical Engineering and Computer Science, University of Michigan , Ann Arbor, Michigan 48109, United States
| |
Collapse
|
21
|
InterFace: A software package for face image warping, averaging, and principal components analysis. Behav Res Methods 2016; 49:2002-2011. [DOI: 10.3758/s13428-016-0837-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
22
|
|
23
|
Rama Varior R. Learning Invariant Color Features for Person Reidentification. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2016; 25:3395-3410. [PMID: 26890868 DOI: 10.1109/tip.2016.2531280] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Matching people across multiple camera views known as person reidentification is a challenging problem due to the change in visual appearance caused by varying lighting conditions. The perceived color of the subject appears to be different under different illuminations. Previous works use color as it is or address these challenges by designing color spaces focusing on a specific cue. In this paper, we propose an approach for learning color patterns from pixels sampled from images across two camera views. The intuition behind this work is that, even though varying lighting conditions across views affect the pixel values of the same color, the final representation of a particular color should be stable and invariant to these variations, i.e., they should be encoded with the same values. We model color feature generation as a learning problem by jointly learning a linear transformation and a dictionary to encode pixel values. We also analyze different photometric invariant color spaces as well as popular color constancy algorithm for person reidentification. Using color as the only cue, we compare our approach with all the photometric invariant color spaces and show superior performance over all of them. Combining with other learned low-level and high-level features, we obtain promising results in VIPeR, Person Re-ID 2011, and CAVIAR4REID data sets.
Collapse
|
24
|
Shi SC, Guo CC, Lai JH, Chen SZ, Hu XJ. Person re-identification with multi-level adaptive correspondence models. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2015.05.072] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
25
|
Gargiulo F, Fratini A, Sansone M, Sansone C. Subject identification via ECG fiducial-based systems: influence of the type of QT interval correction. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2015; 121:127-136. [PMID: 26143963 DOI: 10.1016/j.cmpb.2015.05.012] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2014] [Revised: 05/20/2015] [Accepted: 05/28/2015] [Indexed: 06/04/2023]
Abstract
Electrocardiography (ECG) has been recently proposed as biometric trait for identification purposes. Intra-individual variations of ECG might affect identification performance. These variations are mainly due to Heart Rate Variability (HRV). In particular, HRV causes changes in the QT intervals along the ECG waveforms. This work is aimed at analysing the influence of seven QT interval correction methods (based on population models) on the performance of ECG-fiducial-based identification systems. In addition, we have also considered the influence of training set size, classifier, classifier ensemble as well as the number of consecutive heartbeats in a majority voting scheme. The ECG signals used in this study were collected from thirty-nine subjects within the Physionet open access database. Public domain software was used for fiducial points detection. Results suggested that QT correction is indeed required to improve the performance. However, there is no clear choice among the seven explored approaches for QT correction (identification rate between 0.97 and 0.99). MultiLayer Perceptron and Support Vector Machine seemed to have better generalization capabilities, in terms of classification performance, with respect to Decision Tree-based classifiers. No such strong influence of the training-set size and the number of consecutive heartbeats has been observed on the majority voting scheme.
Collapse
Affiliation(s)
- Francesco Gargiulo
- University of Naples Federico II, Department of Electrical Engineering and Information Technologies, via Claudio 21, 80125 Naples, Italy
| | - Antonio Fratini
- Aston University, School of Life and Health Sciences, Aston Triangle, B4 7ET Birmingham, United Kingdom.
| | - Mario Sansone
- University of Naples Federico II, Department of Electrical Engineering and Information Technologies, via Claudio 21, 80125 Naples, Italy
| | - Carlo Sansone
- University of Naples Federico II, Department of Electrical Engineering and Information Technologies, via Claudio 21, 80125 Naples, Italy.
| |
Collapse
|
26
|
Brain diffusivity pattern is individual-specific information. Neuroscience 2015; 301:395-402. [DOI: 10.1016/j.neuroscience.2015.06.032] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Revised: 06/08/2015] [Accepted: 06/17/2015] [Indexed: 12/20/2022]
|
27
|
FERNÁNDEZ-MARTÍNEZ JUANLUIS, CERNEA ANA. NUMERICAL ANALYSIS AND COMPARISON OF SPECTRAL DECOMPOSITION METHODS IN BIOMETRIC APPLICATIONS. INT J PATTERN RECOGN 2014. [DOI: 10.1142/s0218001414560011] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Face recognition is a challenging problem in computer vision and artificial intelligence. One of the main challenges consists in establishing a low-dimensional feature representation of the images having enough discriminatory power to perform high accuracy classification. Different methods of supervised and unsupervised classification can be found in the literature, but few numerical comparisons among them have been performed on the same computing platform. In this paper, we perform this kind of comparison, revisiting the main spectral decomposition methods for face recognition. We also introduce for the first time, the use of the noncentered PCA and the 2D discrete Chebyshev transform for biometric applications. Faces are represented by their spectral features, that is, their projections onto the different spectral basis. Classification is performed using different norms and/or the cosine defined by the Euclidean scalar product in the space of spectral attributes. Although this constitutes a simple algorithm of unsupervised classification, several important conclusions arise from this analysis: (1) All the spectral methods provide approximately the same accuracy when they are used with the same energy cutoff. This is an important conclusion since many publications try to promote one specific spectral method with respect to other methods. Nevertheless, there exist small variations on the highest median accuracy rates: PCA, 2DPCA and DWT perform better in this case. Also all the covariance-free spectral decomposition techniques based on single images (DCT, DST, DCHT, DWT, DWHT, DHT) are very interesting since they provide high accuracies and are not computationally expensive compared to covariance-based techniques. (2) The use of local spectral features generally provide higher accuracies than global features for the spectral methods which use the whole training database (PCA, NPCA, 2DPCA, Fisher's LDA, ICA). For the methods based on orthogonal transformations of single images, global features calculated using the whole size of the images appear to perform better. (3) The distance criterion generally provides a higher accuracy than the cosine criterion. The use of other p-norms (p > 2) provides similar results to the Euclidean norm, nevertheless some methods perform better. (4) No spectral method can provide 100% accuracy by itself. Therefore, other kind of attributes and supervised learning algorithms are needed. These results are coherent for the ORL and FERET databases. Finally, although this comparison has been performed for the face recognition problem, it could be generalized to other biometric authentication problems.
Collapse
Affiliation(s)
| | - ANA CERNEA
- Mathematics Department, Oviedo University, C/ Calvo Sotelo s/n 33007 Oviedo, Spain
| |
Collapse
|
28
|
Hussein Al-Arashi W, Ibrahim H, Azmin Suandi S. Optimizing principal component analysis performance for face recognition using genetic algorithm. Neurocomputing 2014. [DOI: 10.1016/j.neucom.2013.08.022] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
29
|
Kainerstorfer JM, Polizzotto MN, Uldrick TS, Rahman R, Hassan M, Najafizadeh L, Ardeshirpour Y, Wyvill KM, Aleman K, Smith PD, Yarchoan R, Gandjbakhche AH. Evaluation of non-invasive multispectral imaging as a tool for measuring the effect of systemic therapy in Kaposi sarcoma. PLoS One 2013; 8:e83887. [PMID: 24386302 PMCID: PMC3873970 DOI: 10.1371/journal.pone.0083887] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2013] [Accepted: 11/09/2013] [Indexed: 11/18/2022] Open
Abstract
Diffuse multi-spectral imaging has been evaluated as a potential non-invasive marker of tumor response. Multi-spectral images of Kaposi sarcoma skin lesions were taken over the course of treatment, and blood volume and oxygenation concentration maps were obtained through principal component analysis (PCA) of the data. These images were compared with clinical and pathological responses determined by conventional means. We demonstrate that cutaneous lesions have increased blood volume concentration and that changes in this parameter are a reliable indicator of treatment efficacy, differentiating responders and non-responders. Blood volume decreased by at least 20% in all lesions that responded by clinical criteria and increased in the two lesions that did not respond clinically. Responses as assessed by multi-spectral imaging also generally correlated with overall patient clinical response assessment, were often detectable earlier in the course of therapy, and are less subject to observer variability than conventional clinical assessment. Tissue oxygenation was more variable, with lesions often showing decreased oxygenation in the center surrounded by a zone of increased oxygenation. This technique could potentially be a clinically useful supplement to existing response assessment in KS, providing an early, quantitative, and non-invasive marker of treatment effect.
Collapse
Affiliation(s)
- Jana M. Kainerstorfer
- Section on Analytical and Functional Biophotonics, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Mark N. Polizzotto
- HIV and AIDS Malignancy Branch, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Thomas S. Uldrick
- HIV and AIDS Malignancy Branch, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Rafa Rahman
- Section on Analytical and Functional Biophotonics, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Moinuddin Hassan
- Section on Analytical and Functional Biophotonics, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Laleh Najafizadeh
- Section on Analytical and Functional Biophotonics, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Yasaman Ardeshirpour
- Section on Analytical and Functional Biophotonics, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Kathleen M. Wyvill
- HIV and AIDS Malignancy Branch, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Karen Aleman
- HIV and AIDS Malignancy Branch, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Paul D. Smith
- Biomedical Instrumentation and Multiscale Imaging Section, Laboratory of Cellular Imaging and Macromolecular Biophysics, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Robert Yarchoan
- HIV and AIDS Malignancy Branch, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Amir H. Gandjbakhche
- Section on Analytical and Functional Biophotonics, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland, United States of America
- * E-mail:
| |
Collapse
|
30
|
Chaudhary A, Chaudhury S. Smart space construction: Integration of robots in a visual sensor network. 2013 10TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI) 2013. [DOI: 10.1109/urai.2013.6677455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/19/2023]
|
31
|
|
32
|
Kviatkovsky I, Adam A, Rivlin E. Color invariants for person reidentification. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2013; 35:1622-1634. [PMID: 23681991 DOI: 10.1109/tpami.2012.246] [Citation(s) in RCA: 71] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
We revisit the problem of specific object recognition using color distributions. In some applications--such as specific person identification--it is highly likely that the color distributions will be multimodal and hence contain a special structure. Although the color distribution changes under different lighting conditions, some aspects of its structure turn out to be invariants. We refer to this structure as an intradistribution structure, and show that it is invariant under a wide range of imaging conditions while being discriminative enough to be practical. Our signature uses shape context descriptors to represent the intradistribution structure. Assuming the widely used diagonal model, we validate that our signature is invariant under certain illumination changes. Experimentally, we use color information as the only cue to obtain good recognition performance on publicly available databases covering both indoor and outdoor conditions. Combining our approach with the complementary covariance descriptor, we demonstrate results exceeding the state-of-the-art performance on the challenging VIPeR and CAVIAR4REID databases.
Collapse
Affiliation(s)
- Igor Kviatkovsky
- Department of Computer Science, Technion-Israel Institute of Technology, Technion City, Haifa 32000, Israel.
| | | | | |
Collapse
|
33
|
Abraham J, Champod C, Lennard C, Roux C. Spatial analysis of corresponding fingerprint features from match and close non-match populations. Forensic Sci Int 2012; 230:87-98. [PMID: 23153799 DOI: 10.1016/j.forsciint.2012.10.034] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2012] [Revised: 09/29/2012] [Accepted: 10/18/2012] [Indexed: 11/30/2022]
Abstract
The development of statistical models for forensic fingerprint identification purposes has been the subject of increasing research attention in recent years. This can be partly seen as a response to a number of commentators who claim that the scientific basis for fingerprint identification has not been adequately demonstrated. In addition, key forensic identification bodies such as ENFSI [1] and IAI [2] have recently endorsed and acknowledged the potential benefits of using statistical models as an important tool in support of the fingerprint identification process within the ACE-V framework. In this paper, we introduce a new Likelihood Ratio (LR) model based on Support Vector Machines (SVMs) trained with features discovered via morphometric and spatial analyses of corresponding minutiae configurations for both match and close non-match populations often found in AFIS candidate lists. Computed LR values are derived from a probabilistic framework based on SVMs that discover the intrinsic spatial differences of match and close non-match populations. Lastly, experimentation performed on a set of over 120,000 publicly available fingerprint images (mostly sourced from the National Institute of Standards and Technology (NIST) datasets) and a distortion set of approximately 40,000 images, is presented, illustrating that the proposed LR model is reliably guiding towards the right proposition in the identification assessment of match and close non-match populations. Results further indicate that the proposed model is a promising tool for fingerprint practitioners to use for analysing the spatial consistency of corresponding minutiae configurations.
Collapse
Affiliation(s)
- Joshua Abraham
- Centre for Forensic Science, University of Technology Sydney, PO Box 123, Broadway NSW 2007, Australia.
| | | | | | | |
Collapse
|
34
|
Smeets D, Claes P, Hermans J, Vandermeulen D, Suetens P. A Comparative Study of 3-D Face Recognition Under Expression Variations. ACTA ACUST UNITED AC 2012. [DOI: 10.1109/tsmcc.2011.2174221] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
35
|
CAO JIANGTAO, KUBOTA NAOYUKI, LI PING, LIU HONGHAI. THE VISUAL-AUDIO INTEGRATED RECOGNITION METHOD FOR USER AUTHENTICATION SYSTEM OF PARTNER ROBOTS. INT J HUM ROBOT 2012. [DOI: 10.1142/s0219843611002678] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Some of noncontact biometric ways have been used for user authentication system of partner robots, such as visual-based recognition methods and speech recognition. However, the methods of visual-based recognition are sensitive to the light noise and speech recognition systems are perturbed to the acoustic environment and sound noise. Inspiring from the human's capability of compensating visual information (looking) with audio information (hearing), a visual-audio integrating method is proposed to deal with the disturbance of light noise and to improve the recognition accuracy. Combining with the PCA-based and 2DPCA-based face recognition, a two-stage speaker recognition algorithm is used to extract useful personal identity information from speech signals. With the statistic properties of visual background noise, the visual-audio integrating method is performed to draw the final decision. The proposed method is evaluated on a public visual-audio dataset VidTIMIT and a partner robot authentication system. The results verified the visual-audio integrating method can obtain satisfied recognition results with strong robustness.
Collapse
Affiliation(s)
- JIANGTAO CAO
- School of Information and Control Engineering, Liaoning Shihua University, Fushun, China
| | - NAOYUKI KUBOTA
- Graduate School of System Design, Tokyo Metropolitan University, Japan
| | - PING LI
- School of Information and Control Engineering, Liaoning Shihua University,Fushun, China
| | - HONGHAI LIU
- School of Creative Technology, University of Portsmouth, UK
| |
Collapse
|
36
|
Uttam S, Goodman NA, Neifeld MA. Feature-specific difference imaging. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2012; 21:638-652. [PMID: 21859603 DOI: 10.1109/tip.2011.2165549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Difference images quantify changes in the object scene over time. In this paper, we use the feature-specific imaging paradigm to present methods for estimating a sequence of difference images from a sequence of compressive measurements of the object scene. Our goal is twofold. First is to design, where possible, the optimal sensing matrix for taking compressive measurements. In scenarios where such sensing matrices are not tractable, we consider plausible candidate sensing matrices that either use the available a priori information or are nonadaptive. Second, we develop closed-form and iterative techniques for estimating the difference images. We specifically look at l(2)- and l(1)-based methods. We show that l(2)-based techniques can directly estimate the difference image from the measurements without first reconstructing the object scene. This direct estimation exploits the spatial and temporal correlations between the object scene at two consecutive time instants. We further develop a method to estimate a generalized difference image from multiple measurements and use it to estimate the sequence of difference images. For l(1)-based estimation, we consider modified forms of the total-variation method and basis pursuit denoising. We also look at a third method that directly exploits the sparsity of the difference image. We present results to show the efficacy of these techniques and discuss the advantages of each.
Collapse
Affiliation(s)
- Shikhar Uttam
- School of Medicine, University of Pittsburgh, Pittsburgh, PA 15261, USA.
| | | | | |
Collapse
|
37
|
|
38
|
Local Descriptors Encoded by Fisher Vectors for Person Re-identification. COMPUTER VISION – ECCV 2012. WORKSHOPS AND DEMONSTRATIONS 2012. [DOI: 10.1007/978-3-642-33863-2_41] [Citation(s) in RCA: 151] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
39
|
Ben Jemaa Y, Derbel A, Ben Jmaa A. 2DPCA fractal features and genetic algorithm for efficient face representation and recognition. EURASIP JOURNAL ON INFORMATION SECURITY 2011. [DOI: 10.1186/1687-417x-2011-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
40
|
Poon B, Ashraful Amin M, Yan H. Performance evaluation and comparison of PCA Based human face recognition methods for distorted images. INT J MACH LEARN CYB 2011. [DOI: 10.1007/s13042-011-0023-2] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
41
|
Căleanu CD, Mao X, Pradel G, Moga S, Xue Y. Combined pattern search optimization of feature extraction and classification parameters in facial recognition. Pattern Recognit Lett 2011. [DOI: 10.1016/j.patrec.2011.03.019] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
42
|
Singh C, Mittal N, Walia E. Face recognition using Zernike and complex Zernike moment features. PATTERN RECOGNITION AND IMAGE ANALYSIS 2011. [DOI: 10.1134/s1054661811010044] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
43
|
Douzal-Chouakria A, Billard L, Diday E. Principal component analysis for interval-valued observations. Stat Anal Data Min 2011. [DOI: 10.1002/sam.10118] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
44
|
|
45
|
Narayanan NK, Kabeer V. Face Recognition Using Nonlinear Feature Parameter and Artificial Neural Network. INT J COMPUT INT SYS 2010. [DOI: 10.1080/18756891.2010.9727723] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022] Open
|
46
|
Venkat I, De Wilde P. Robust Gait Recognition by Learning and Exploiting Sub-gait Characteristics. Int J Comput Vis 2010. [DOI: 10.1007/s11263-010-0362-6] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
47
|
Kainerstorfer JM, Ehler M, Amyot F, Hassan M, Demos SG, Chernomordik V, Hitzenberger CK, Gandjbakhche AH, Riley JD. Principal component model of multispectral data for near real-time skin chromophore mapping. JOURNAL OF BIOMEDICAL OPTICS 2010; 15:046007. [PMID: 20799809 PMCID: PMC2929259 DOI: 10.1117/1.3463010] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2010] [Revised: 05/24/2010] [Accepted: 05/24/2010] [Indexed: 05/23/2023]
Abstract
Multispectral images of skin contain information on the spatial distribution of biological chromophores, such as blood and melanin. From this, parameters such as blood volume and blood oxygenation can be retrieved using reconstruction algorithms. Most such approaches use some form of pixelwise or volumetric reconstruction code. We explore the use of principal component analysis (PCA) of multispectral images to access blood volume and blood oxygenation in near real time. We present data from healthy volunteers under arterial occlusion of the forearm, experiencing ischemia and reactive hyperemia. Using a two-layered analytical skin model, we show reconstruction results of blood volume and oxygenation and compare it to the results obtained from our new spectral analysis based on PCA. We demonstrate that PCA applied to multispectral images gives near equivalent results for skin chromophore mapping and quantification with the advantage of being three orders of magnitude faster than the reconstruction algorithm.
Collapse
Affiliation(s)
- Jana M Kainerstorfer
- National Institutes of Health, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Section on Analytical and Functional Biophotonics (PPITS/SAFB), Bethesda, Maryland 20892, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Phillips PJ, Scruggs WT, O'Toole AJ, Flynn PJ, Bowyer KW, Schott CL, Sharpe M. FRVT 2006 and ICE 2006 large-scale experimental results. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2010; 32:831-846. [PMID: 20299708 DOI: 10.1109/tpami.2009.59] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
This paper describes the large-scale experimental results from the Face Recognition Vendor Test (FRVT) 2006 and the Iris Challenge Evaluation (ICE) 2006. The FRVT 2006 looked at recognition from high-resolution still frontal face images and 3D face images, and measured performance for still frontal face images taken under controlled and uncontrolled illumination. The ICE 2006 evaluation reported verification performance for both left and right irises. The images in the ICE 2006 intentionally represent a broader range of quality than the ICE 2006 sensor would normally acquire. This includes images that did not pass the quality control software embedded in the sensor. The FRVT 2006 results from controlled still and 3D images document at least an order-of-magnitude improvement in recognition performance over the FRVT 2002. The FRVT 2006 and the ICE 2006 compared recognition performance from high-resolution still frontal face images, 3D face images, and the single-iris images. On the FRVT 2006 and the ICE 2006 data sets, recognition performance was comparable for high-resolution frontal face, 3D face, and the iris images. In an experiment comparing human and algorithms on matching face identity across changes in illumination on frontal face images, the best performing algorithms were more accurate than humans on unfamiliar faces.
Collapse
Affiliation(s)
- P Jonathon Phillips
- National Institute of Standards and Technology (NIST), Gaithersburg, MD 20899, USA.
| | | | | | | | | | | | | |
Collapse
|
49
|
Furl N, Phillips PJ, O'Toole AJ. Face recognition algorithms and the other-race effect: computational mechanisms for a developmental contact hypothesis. Cogn Sci 2010. [DOI: 10.1207/s15516709cog2606_4] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022]
|
50
|
|