1
Chang D, Pang K, Du R, Tong Y, Song YZ, Ma Z, Guo J. Making a Bird AI Expert Work for You and Me. IEEE Trans Pattern Anal Mach Intell 2023; 45:12068-12084. [PMID: 37159309 DOI: 10.1109/tpami.2023.3274593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/11/2023]
Abstract
As powerful as fine-grained visual classification (FGVC) is, responding to your query with a bird name of "Whip-poor-will" or "Mallard" probably does not make much sense. This, however commonly accepted in the literature, underlines a fundamental question interfacing AI and humans: what constitutes transferable knowledge for humans to learn from AI? This paper sets out to answer this very question using FGVC as a test bed. Specifically, we envisage a scenario where a trained FGVC model (the AI expert) functions as a knowledge provider in enabling average people (you and me) to become better domain experts ourselves. Assuming an AI expert trained using expert human labels, we anchor our focus on asking and providing solutions for two questions: (i) what is the best transferable knowledge we can extract from AI, and (ii) what is the most practical means to measure the gains in expertise given that knowledge? We propose to represent knowledge as highly discriminative visual regions that are expert-exclusive and instantiate it via a novel multi-stage learning framework. A human study of 15,000 trials shows our method is able to consistently help people of divergent bird expertise to recognise once unrecognisable birds. We further propose a crude but benchmarkable metric, TEMI, and therefore allow future efforts in this direction to be comparable to ours without the need for large-scale human studies.
2
Marris JE, Perfors A, Mitchell D, Wang W, McCusker MW, Lovell TJH, Gibson RN, Gaillard F, Howe PDL. Evaluating the effectiveness of different perceptual training methods in a difficult visual discrimination task with ultrasound images. Cogn Res Princ Implic 2023; 8:19. [PMID: 36940041 PMCID: PMC10027970 DOI: 10.1186/s41235-023-00467-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/30/2021] [Accepted: 01/16/2023] [Indexed: 03/21/2023] [Open Access]
Abstract
Recent work has shown that perceptual training can be used to improve the performance of novices in real-world visual classification tasks with medical images, but it is unclear which perceptual training methods are the most effective, especially for difficult medical image discrimination tasks. We investigated several different perceptual training methods with medically naïve participants in a difficult radiology task: identifying the degree of hepatic steatosis (fatty infiltration of the liver) in liver ultrasound images. In Experiment 1a (N = 90), participants completed four sessions of standard perceptual training, and participants in Experiment 1b (N = 71) completed four sessions of comparison training. There was a significant post-training improvement for both types of training, although performance was better when the trained task aligned with the task participants were tested on. In both experiments, performance initially improved rapidly, with learning becoming more gradual after the first training session. In Experiment 2 (N = 200), we explored the hypothesis that performance could be improved by combining perceptual training with explicit annotated feedback presented in a stepwise fashion. Although participants improved in all training conditions, performance was similar regardless of whether participants were given annotations, underwent training in a stepwise fashion, both, or neither. Overall, we found that perceptual training can rapidly improve performance on a difficult radiology task, albeit not to a level comparable to expert performance, and that similar levels of performance were achieved across the perceptual training paradigms we compared.
Affiliation(s)
- Jessica E Marris
  - Melbourne School of Psychological Sciences, University of Melbourne, Parkville, Australia
- Andrew Perfors
  - Melbourne School of Psychological Sciences, University of Melbourne, Parkville, Australia
- David Mitchell
  - Radiology, Sligo University Hospital, Sligo, Ireland
  - Department of Radiology, The Royal Melbourne Hospital, Parkville, Australia
- Wayland Wang
  - Department of Radiology, The Royal Melbourne Hospital, Parkville, Australia
- Mark W McCusker
  - Department of Radiology, The Royal Melbourne Hospital, Parkville, Australia
  - Department of Radiology, University of Melbourne, Parkville, Australia
- Robert N Gibson
  - Department of Radiology, The Royal Melbourne Hospital, Parkville, Australia
  - Department of Radiology, University of Melbourne, Parkville, Australia
- Frank Gaillard
  - Department of Radiology, The Royal Melbourne Hospital, Parkville, Australia
  - Department of Radiology, University of Melbourne, Parkville, Australia
- Piers D L Howe
  - Melbourne School of Psychological Sciences, University of Melbourne, Parkville, Australia
3
The Potential of Using an Eye Tracker in Architectural Education: Three Perspectives for Ordinary Users, Students and Lecturers. Buildings 2021. [DOI: 10.3390/buildings11060245] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Indexed: 11/16/2022]
Abstract
The aim of this paper is to discuss the potential of eye trackers as tools providing diversified support for the architectural education of future designers. The possibility to track eye movement guaranteed by this group of devices enables an extension of knowledge on the non-professional perception of architectural creations. It also allows people to monitor progress while learning and verify a project’s assumptions as well as provide lecturers with an opportunity to optimize didactic methods. The paper includes authors’ ideas for modifications of teaching methods applied at technical universities. It is a result of an analysis of research related to the perception of urban designs and architectural objects—research during which noticeable differences were observed between how experts and non-professionals perceive these structures. What also contributed to the contents of this paper was a comprehension of the wide range of eye tracking studies examining the level of acquisition of specialist professional skills. The presented ideas are also based on the analysis and adaptation of eye-tracking research conducted by scientists within other areas of life.
4
Busey TA, Heise N, Hicklin RA, Ulery BT, Buscaglia J. Characterizing missed identifications and errors in latent fingerprint comparisons using eye-tracking data. PLoS One 2021; 16:e0251674. [PMID: 34029319 PMCID: PMC8143401 DOI: 10.1371/journal.pone.0251674] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 01/22/2021] [Accepted: 04/29/2021] [Indexed: 11/18/2022] [Open Access]
Abstract
Latent fingerprint examiners sometimes come to different conclusions when comparing fingerprints, and eye-gaze behavior may help explain these outcomes. Missed identifications (missed IDs) are inconclusive, exclusion, or No Value determinations reached when the consensus of other examiners is an identification. To determine the relation between examiner behavior and missed IDs, we collected eye-gaze data from 121 latent print examiners as they completed a total of 1444 difficult (latent-exemplar) comparisons. We extracted metrics from the gaze data that serve as proxies for underlying perceptual and cognitive capacities. We used these metrics to characterize potential mechanisms of missed IDs: Cursory Comparison and Mislocalization. We find that missed IDs are associated with shorter comparison times, fewer regions visited, and fewer attempted correspondences between the compared images. Latent print comparisons resulting in erroneous exclusions (a subset of missed IDs) are also more likely to have fixations in different regions and less accurate correspondence attempts than those comparisons resulting in identifications. We also use our derived metrics to describe one atypical examiner who made six erroneous identifications, four of which were on comparisons intended to be straightforward exclusions. The present work helps identify the degree to which missed IDs can be explained using eye-gaze behavior, and the extent to which missed IDs depend on cognitive and decision-making factors outside the domain of eye-tracking methodologies.
Affiliation(s)
- Thomas A. Busey
  - Psychological and Brain Sciences, Indiana University, Bloomington, Indiana, United States of America
- Nicholas Heise
  - Intelligence and Analytics, Noblis, Reston, Virginia, United States of America
- R. Austin Hicklin
  - Intelligence and Analytics, Noblis, Reston, Virginia, United States of America
- Bradford T. Ulery
  - Intelligence and Analytics, Noblis, Reston, Virginia, United States of America
- JoAnn Buscaglia
  - Research and Support Unit, Federal Bureau of Investigation Laboratory, Quantico, Virginia, United States of America
5
Robson SG, Tangen JM, Searston RA. The effect of expertise, target usefulness and image structure on visual search. Cogn Res Princ Implic 2021; 6:16. [PMID: 33709197 PMCID: PMC7977019 DOI: 10.1186/s41235-021-00282-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 07/31/2020] [Accepted: 02/19/2021] [Indexed: 11/18/2022] [Open Access]
Abstract
Experts outperform novices on many cognitive and perceptual tasks. Extensive training has tuned experts to the most relevant information in their specific domain, allowing them to make decisions quickly and accurately. We compared a group of fingerprint examiners to a group of novices on their ability to search for information in fingerprints across two experiments: one where participants searched for target features within a single fingerprint and another where they searched for points of difference between two fingerprints. In both experiments, we also varied how useful the target feature was and whether participants searched for these targets in a typical fingerprint or one that had been scrambled. Experts more efficiently located targets when searching for them in intact but not scrambled fingerprints. In Experiment 1, we also found that experts located target features classified as more useful more efficiently than novices did, but this expert-novice difference was not present when the target feature was classified as less useful. The usefulness of the target may therefore have influenced the search strategies that participants used, and the visual search advantages that experts display appear to depend on their vast experience with visual regularity in fingerprints. These results align with a domain-specific account of expertise and suggest that perceptual training ought to involve learning to attend to task-critical features.
Affiliation(s)
- Samuel G Robson
  - School of Psychology, The University of Queensland, St Lucia, 4072, QLD, Australia
- Jason M Tangen
  - School of Psychology, The University of Queensland, St Lucia, 4072, QLD, Australia
- Rachel A Searston
  - School of Psychology, The University of Adelaide, Adelaide, 5005, SA, Australia
6
Robson SG, Searston RA, Edmond G, McCarthy DJ, Tangen JM. An expert–novice comparison of feature choice. Applied Cognitive Psychology 2020. [DOI: 10.1002/acp.3676] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 11/10/2022]
Affiliation(s)
- Samuel G. Robson
  - School of Psychology, The University of Queensland, Brisbane, Queensland, Australia
- Rachel A. Searston
  - School of Psychology, The University of Adelaide, Adelaide, South Australia, Australia
- Gary Edmond
  - School of Law, University of New South Wales, Sydney, New South Wales, Australia
- Duncan J. McCarthy
  - Forensic Services Branch, Queensland Police Service, Brisbane, Queensland, Australia
- Jason M. Tangen
  - School of Psychology, The University of Queensland, Brisbane, Queensland, Australia
7
van Lopik K, Sinclair M, Sharpe R, Conway P, West A. Developing augmented reality capabilities for industry 4.0 small enterprises: Lessons learnt from a content authoring case study. Comput Ind 2020. [DOI: 10.1016/j.compind.2020.103208] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Indexed: 11/26/2022]
8
Wisniewski MG, Radell ML, Church BA, Mercado E. Benefits of fading in perceptual learning are driven by more than dimensional attention. PLoS One 2017; 12:e0180959. [PMID: 28723976 PMCID: PMC5516993 DOI: 10.1371/journal.pone.0180959] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Received: 10/11/2016] [Accepted: 06/23/2017] [Indexed: 11/23/2022] [Open Access]
Abstract
Individuals learn to classify percepts effectively when the task is initially easy and then gradually increases in difficulty. Some suggest that this is because easy-to-discriminate events help learners focus attention on discrimination-relevant dimensions. Here, we tested whether such attentional-spotlighting accounts are sufficient to explain easy-to-hard effects in auditory perceptual learning. In two experiments, participants were trained to discriminate periodic, frequency-modulated (FM) tones in two separate frequency ranges (300–600 Hz or 3000–6000 Hz). In one frequency range, sounds gradually increased in similarity as training progressed. In the other, stimulus similarity was constant throughout training. After training, participants showed better performance in their progressively trained frequency range, even though the discrimination-relevant dimension across ranges was the same. Learning theories that posit experience-dependent changes in stimulus representations and/or the strengthening of associations with differential responses predict the observed specificity of easy-to-hard effects, whereas attentional-spotlighting theories do not. Calibrating the difficulty and temporal sequencing of training experiences to support more incremental representation-based learning can enhance the effectiveness of practice beyond any benefits gained from explicitly highlighting relevant dimensions.
Affiliation(s)
- Matthew G. Wisniewski
  - 711 Human Performance Wing, U.S. Air Force Research Laboratory, Wright-Patterson Air Force Base, Ohio, United States of America
- Milen L. Radell
  - Department of Psychology, Niagara University, Lewiston, New York, United States of America
- Barbara A. Church
  - Department of Psychology, Language Research Center, Georgia State University, Georgia, United States of America
- Eduardo Mercado
  - Department of Psychology, University at Buffalo, The State University of New York, Buffalo, New York, United States of America
9
Roads B, Mozer MC, Busey TA. Correction: Using Highlighting to Train Attentional Expertise. PLoS One 2016; 11:e0149368. [PMID: 26859384 PMCID: PMC4747669 DOI: 10.1371/journal.pone.0149368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/21/2022] [Open Access]