1
|
|
2
|
Arbib MA. Towards a Computational Comparative Neuroprimatology: Framing the language-ready brain. Phys Life Rev 2015; 16:1-54. [PMID: 26482863 DOI: 10.1016/j.plrev.2015.09.003] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2015] [Revised: 09/11/2015] [Accepted: 09/22/2015] [Indexed: 10/23/2022]
Abstract
We make the case for developing a Computational Comparative Neuroprimatology to inform the analysis of the function and evolution of the human brain. First, we update the mirror system hypothesis on the evolution of the language-ready brain by (i) modeling action and action recognition and opportunistic scheduling of macaque brains to hypothesize the nature of the last common ancestor of macaque and human (LCA-m); and then we (ii) introduce dynamic brain modeling to show how apes could acquire gesture through ontogenetic ritualization, hypothesizing the nature of evolution from LCA-m to the last common ancestor of chimpanzee and human (LCA-c). We then (iii) hypothesize the role of imitation, pantomime, protosign and protospeech in biological and cultural evolution from LCA-c to Homo sapiens with a language-ready brain. Second, we suggest how cultural evolution in Homo sapiens led from protolanguages to full languages with grammar and compositional semantics. Third, we assess the similarities and differences between the dorsal and ventral streams in audition and vision as the basis for presenting and comparing two models of language processing in the human brain: A model of (i) the auditory dorsal and ventral streams in sentence comprehension; and (ii) the visual dorsal and ventral streams in defining "what language is about" in both production and perception of utterances related to visual scenes provide the basis for (iii) a first step towards a synthesis and a look at challenges for further research.
Collapse
|
3
|
Barrès V, Lee J. Template Construction Grammar: From Visual Scene Description to Language Comprehension and Agrammatism. Neuroinformatics 2013; 12:181-208. [DOI: 10.1007/s12021-013-9197-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
4
|
Michaelsen E, Doktorski L, Luetjen K. An accumulating interpreter for cognitive vision production systems. PATTERN RECOGNITION AND IMAGE ANALYSIS 2012. [DOI: 10.1134/s1054661812030066] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
5
|
Michaelsen E, Jäger K, Roschkowski D, Doktorski L, Arens M. On the semantics of object-oriented landmark recognition. PATTERN RECOGNITION AND IMAGE ANALYSIS 2012. [DOI: 10.1134/s1054661812010270] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
6
|
Abstract
In this paper, we study the problem of recovering approximate shape from the shading of a three-dimensional object in a single image when knowledge about the object is available. The application of knowledge-based methods to low-level image processing tasks will help overcome problems that arise from processing images using a pixel-based approach. Shape-from-shading has generally been approached by precognitive vision methods where a standard operator is applied to the image based on assumptions about the imaging process and generic properties of what appears. This paper explores some advantages of applying knowledge and hypotheses about what appears in the image. The knowledge and hypotheses used here come from domain knowledge and edge-matching. Specifically, we are able to find solutions to some problems that cannot be solved by other methods and gain advantages in terms of computation speed over similar approaches. Further, we can fully automate the derivation of the approximate shape of an object. This paper demonstrates the efficacy of using knowledge in the basic operation of an early vision operator, and so introduces a new paradigm for computer vision that may be applied to other early vision operators.
Collapse
Affiliation(s)
- NICK BARNES
- Computer Vision and Machine Intelligence Lab (CVMIL), Department of Computer Science, The University of Melbourne, 221 Bouverie Street, Carlton, Victoria, Australia, 3053, Australia
| | - ZHI-QIANG LIU
- Computer Vision and Machine Intelligence Lab (CVMIL), Department of Computer Science, The University of Melbourne, 221 Bouverie Street, Carlton, Victoria, Australia, 3053, Australia
| |
Collapse
|
7
|
AHLRICHS U, PAULUS D, NIEMANN H. KNOWLEDGE-BASED SCENE EXPLORATION USING COMPUTER VISION AND LEARNED ANALYSIS STRATEGIES. INT J PATTERN RECOGN 2011. [DOI: 10.1142/s021800140400337x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
In this contribution we demonstrate how the task of visual scene exploration can be solved by a knowledge-based vision system. During scene exploration, the system searches for a fixed number of a priori known objects in a static scene. If not all objects are visible using the initial camera set-up, the camera parameters have to be adjusted and the camera has to be moved by the system. This problem is reduced to the choice of optimal camera actions. The information about the objects and the camera actions is uniformly represented in a semantic network. In addition, a control algorithm is provided that finds the optimal assignment from objects to parts of a scene based on a suitable analysis strategy. This strategy is acquired by the system itself using reinforcement learning methods. The paper focuses on aspects of knowledge representation concerning the integration of camera actions and on the integration of reinforcement learning methods in a semantic network formalism and applies them in a realistic setup. Experiments are shown for images of two office rooms.
Collapse
Affiliation(s)
- U. AHLRICHS
- University Erlangen–Nuremberg, Martensstr. 3, 91058 Erlangen, Germany
| | - D. PAULUS
- University Erlangen–Nuremberg, Martensstr. 3, 91058 Erlangen, Germany
| | - H. NIEMANN
- University Erlangen–Nuremberg, Martensstr. 3, 91058 Erlangen, Germany
| |
Collapse
|
8
|
Michaelsen E, Doktorski L, Luetjen K. An accumulating interpreter for cognitive vision production systems. PATTERN RECOGNITION AND IMAGE ANALYSIS 2011. [DOI: 10.1134/s1054661811020775] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
9
|
Michaelsen E, Stilla U, Soergel U, Doktorski L. Extraction of building polygons from SAR images: Grouping and decision-level in the GESTALT system. Pattern Recognit Lett 2010. [DOI: 10.1016/j.patrec.2009.10.004] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
10
|
Status and Development of Natural Scene Understanding for Vision-based Outdoor Moblie Robot. ACTA ACUST UNITED AC 2010. [DOI: 10.3724/sp.j.1004.2010.00001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
11
|
Bovenkamp EGP, Dijkstra J, Bosch JG, Reiber JHC. User-agent cooperation in multiagent IVUS image segmentation. IEEE TRANSACTIONS ON MEDICAL IMAGING 2009; 28:94-105. [PMID: 19116192 DOI: 10.1109/tmi.2008.927351] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Automated interpretation of complex images requires elaborate knowledge and model-based image analysis, but often needs interaction with an expert as well. This research describes expert interaction with a multiagent image interpretation system using only a restricted vocabulary of high-level user interactions. The aim is to minimize inter- and intra-observer variability by keeping the total number of interactions as low and simple as possible. The multiagent image interpretation system has elaborate high-level knowledge-based control over low-level image segmentation algorithms. Agents use contextual knowledge to keep the number of interactions low but, when in doubt, present the user with the most likely interpretation of the situation. The user, in turn, can correct, supplement, and/or confirm the results of image-processing agents. This is done at a very high level of abstraction such that no knowledge of the underlying segmentation methods, parameters or agent functioning is needed. High-level interaction thereby replaces more traditional contour correction methods like inserting points and/or (re)drawing contours. This makes it easier for the user to obtain good results, while inter- and intra-observer variability are kept minimal, since the image segmentation itself remains under control of image-processing agents. The system has been applied to intravascular ultrasound (IVUS) images. Experiments show that with an average of 2-3 high-level user interactions per correction, segmentation results substantially improve while the variation is greatly reduced. The achieved level of accuracy and repeatability is equivalent to that of manual drawing by an expert.
Collapse
Affiliation(s)
- E G P Bovenkamp
- Division of Image Processing, Departmentof Radiology, Leiden University Medical Center, 2300RC Leiden, TheNetherlands.
| | | | | | | |
Collapse
|
12
|
Describing visual scenes: Towards a neurolinguistics based on construction grammar. Brain Res 2008; 1225:146-62. [DOI: 10.1016/j.brainres.2008.04.075] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2008] [Revised: 04/07/2008] [Accepted: 04/21/2008] [Indexed: 11/20/2022]
|
13
|
Michaelsen E, Thiele A, Cadario E, Soergel U. Building extraction based on stereo analysis of high-resolution SAR images taken from orthogonal aspect directions. PATTERN RECOGNITION AND IMAGE ANALYSIS 2008. [DOI: 10.1134/s1054661808020077] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
14
|
Markou M, Singh M, Singh S. Neural network analysis of MINERVA scene image benchmark. Neural Comput Appl 2006. [DOI: 10.1007/s00521-005-0004-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
15
|
A Learning Framework for Object Recognition on Image Understanding. PATTERN RECOGNITION AND IMAGE ANALYSIS 2005. [DOI: 10.1007/11492542_39] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
16
|
More Principled Design of Pervasive Computing Systems. ENGINEERING HUMAN COMPUTER INTERACTION AND INTERACTIVE SYSTEMS 2005. [DOI: 10.1007/11431879_20] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
|
17
|
Cooperative agents society organized as an irregular pyramid: A mammography segmentation application. Pattern Recognit Lett 2003. [DOI: 10.1016/s0167-8655(03)00077-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
18
|
|
19
|
|
20
|
Vivarelli F, Williams CK. Comparing Bayesian neural network algorithms for classifying segmented outdoor images. Neural Netw 2001; 14:427-37. [PMID: 11411630 DOI: 10.1016/s0893-6080(01)00024-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
In this paper we investigate the Bayesian training of neural networks for region labelling of segmented outdoor scenes; the data are drawn from the Sowerby Image Database of British Aerospace. Neural networks are trained with two Bayesian methods, (i) the evidence framework of MacKay (1992a,b) and (ii) a Markov Chain Monte Carlo method due to Neal (1996). The performance of the two methods is compared to evaluating the empirical learning curves of neural networks trained with the two methods. We also investigate the use of the Automatic Relevance Determination method for input feature selection.
Collapse
Affiliation(s)
- F Vivarelli
- The Knowledge Lab-NCR Financial Solutions, London, UK.
| | | |
Collapse
|
21
|
The metaphorical brains. ARTIF INTELL 1998. [DOI: 10.1016/s0004-3702(98)00028-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
22
|
|
23
|
Campbell NW, Thomas BT, Troscianko T. Automatic segmentation and classification of outdoor images using neural networks. Int J Neural Syst 1997; 8:137-44. [PMID: 9228585 DOI: 10.1142/s0129065797000161] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The paper describes how neural networks may be used to segment and label objects in images. A self-organising feature map is used for the segmentation phase, and we quantify the quality of the segmentations produced as well as the contribution made by colour and texture features. A multi-layer perception is trained to label the regions produced by the segmentation process. It is shown that 91.1% of the image area is correctly classified into one of eleven categories which include cars, houses, fences, roads, vegetation and sky.
Collapse
Affiliation(s)
- N W Campbell
- Advanced Computing Research Centre, University of Bristol, UK.
| | | | | |
Collapse
|
24
|
Morino V, Foresti G, Regazzoni C. A distributed probabilistic system for adaptive regulation of image processing parameters. ACTA ACUST UNITED AC 1996; 26:1-20. [DOI: 10.1109/3477.484434] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
25
|
|
26
|
Leow WK, Miikkulainen R. VISOR: Schema-based scene analysis with structured neural networks. Neural Process Lett 1994. [DOI: 10.1007/bf02310938] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
27
|
|
28
|
Fierens F, Van Cleynenbreugel J, Suetens P, Oosterlinck A. Iconic representation of visual data and models. Pattern Recognit Lett 1991. [DOI: 10.1016/0167-8655(91)90076-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
29
|
Split-and-merge image segmentation based on localized feature analysis and statistical tests. ACTA ACUST UNITED AC 1991. [DOI: 10.1016/1049-9652(91)90030-n] [Citation(s) in RCA: 59] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
30
|
|
31
|
Fennema C, Hanson A, Riseman E, Beveridge J, Kumar R. Model-directed mobile robot navigation. ACTA ACUST UNITED AC 1990. [DOI: 10.1109/21.61206] [Citation(s) in RCA: 50] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
32
|
|
33
|
|
34
|
Riseman EM, Hanson AR. Computer vision research at the University of Massachusetts?Themes and progress. Int J Comput Vis 1989. [DOI: 10.1007/bf00158164] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
35
|
|