Elazary L, Itti L. A Bayesian model for efficient visual search and recognition.
Vision Res 2010;
50:1338-52. [PMID:
20080120 DOI:
10.1016/j.visres.2010.01.002]
[Citation(s) in RCA: 83] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2009] [Revised: 11/13/2009] [Accepted: 01/05/2010] [Indexed: 11/30/2022]
Abstract
Humans employ interacting bottom-up and top-down processes to significantly speed up search and recognition of particular targets. We describe a new model of attention guidance for efficient and scalable first-stage search and recognition with many objects (117,174 images of 1147 objects were tested, and 40 satellite images). Performance for recognition is on par or better than SIFT and HMAX, while being, respectively, 1500 and 279 times faster. The model is also used for top-down guided search, finding a desired object in a 5x5 search array within four attempts, and improving performance for finding houses in satellite images.
Collapse