A Nonparametric Approach to Bottom-Up Visual Saliency
PDF WebThis paper addresses the bottom-up influence of local image information on human eye movements. Most existing computational models use a set of biologically plausible linear filters, e.g., Gabor or Difference-of-Gaussians filters as a front-end, the outputs of which are nonlinearly combined into a real number that indicates visual saliency. Unfortunately, this requires many design parameters such as the number, type, and size of the front-end filters, as well as the choice of nonlinearities, weighting and normalization schemes etc., for which biological plausibility cannot always be justified. As a result, these parameters have to be chosen in a more or less ad hoc way. Here, we propose to emph{learn} a visual saliency model directly from human eye movement data. The model is rather simplistic and essentially parameter-free, and therefore contrasts recent developments in the field that usually aim at higher prediction rates at the cost of additional parameters and increasing model complexity. Experimental results show that - despite the lack of any biological prior knowledge - our model performs comparably to existing approaches, and in fact learns image features that resemble findings from several previous studies. In particular, its maximally excitatory stimuli have center-surround structure, similar to receptive fields in the early human visual system.
| Author(s): | Kienzle, W. and Wichmann, FA. and Schölkopf, B. and Franz, MO. |
| Links: | |
| Book Title: | Advances in Neural Information Processing Systems 19 |
| Journal: | Advances in Neural Information Processing Systems 19: Proceedings of the 2006 Conference |
| Pages: | 689-696 |
| Year: | 2007 |
| Month: | September |
| Day: | 0 |
| Editors: | B Sch{\"o}lkopf and J Platt and T Hofmann |
| Publisher: | MIT Press |
| BibTeX Type: | Conference Paper (inproceedings) |
| Address: | Cambridge, MA, USA |
| Event Name: | 20th Annual Conference on Neural Information Processing Systems (NIPS 2006) |
| Event Place: | Vancouver, BC, Canada |
| Digital: | 0 |
| Electronic Archiving: | grant_archive |
| ISBN: | 0-262-19568-2 |
| Language: | en |
| Organization: | Max-Planck-Gesellschaft |
| School: | Biologische Kybernetik |
BibTeX
@inproceedings{4147,
title = {A Nonparametric Approach to Bottom-Up Visual Saliency},
journal = {Advances in Neural Information Processing Systems 19: Proceedings of the 2006 Conference},
booktitle = {Advances in Neural Information Processing Systems 19},
abstract = {This paper addresses the bottom-up influence of local image
information on human eye movements. Most existing computational models use a set of biologically plausible linear filters, e.g., Gabor or Difference-of-Gaussians filters as a front-end, the outputs of which are nonlinearly combined into a real number that indicates visual saliency. Unfortunately, this requires many design parameters such as the number, type, and size of the
front-end filters, as well as the choice of nonlinearities,
weighting and normalization schemes etc., for which biological plausibility cannot always be justified. As a result, these parameters have to be chosen in a more or less ad hoc way. Here, we propose to emph{learn} a visual saliency model directly from human eye movement data. The model is rather simplistic and essentially parameter-free, and therefore contrasts recent developments in the field that usually aim at higher prediction rates at the cost of additional parameters and increasing model complexity. Experimental results show that - despite the lack of
any biological prior knowledge - our model performs comparably to existing approaches, and in fact learns image features that resemble findings from several previous studies. In particular, its maximally excitatory stimuli have center-surround structure, similar to receptive fields in the early human visual system.},
pages = {689-696},
editors = {B Sch{\"o}lkopf and J Platt and T Hofmann},
publisher = {MIT Press},
organization = {Max-Planck-Gesellschaft},
school = {Biologische Kybernetik},
address = {Cambridge, MA, USA},
month = sep,
year = {2007},
author = {Kienzle, W. and Wichmann, FA. and Sch{\"o}lkopf, B. and Franz, MO.},
month_numeric = {9}
}
