What makes a good model of natural images?

Abstract

Many low-level vision algorithms assume a prior probability over images, and there has been great interest in trying to learn this prior from examples. Since images are highly non-Gaussian, high-dimensional, continuous signals, learning their distribution presents a tremendous computational challenge. Perhaps the most successful recent algorithm is the Fields of Experts (FOE) [20] model, which has shown impressive performance by modeling image statistics with a product of potentials defined on filter outputs. However, as in previous models of images based on filter outputs [30], calculating the probability of an image given the model requires evaluating an intractable partition function. This makes learning very slow (it requires Monte Carlo sampling at every step) and makes it virtually impossible to compare the likelihood of two different models. Given this computational difficulty, it is hard to say whether nonintuitive features learned by such models represent a true property of natural images or an artifact of the approximations used during learning. In this paper we present (1) tractable lower and upper bounds on the partition function of models based on filter outputs and (2) efficient learning algorithms that do not require any sampling. Our results are based on recent results in machine learning that deal with Gaussian potentials. We extend these results to non-Gaussian potentials and derive a novel basis rotation algorithm for approximating the maximum likelihood filters. Our results allow us to (1) rigorously compare the likelihood of different models and (2) calculate high likelihood models of natural image statistics in a matter of minutes. Applying our results to previous models shows that the nonintuitive features are not an artifact of the learning process but rather are capturing robust properties of natural images.
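To make the role of the partition function concrete, here is a minimal sketch of the unnormalized log-probability of an image under a product of potentials on filter outputs. It is an illustration only, not the paper's algorithm: the Student-t style potential, the two derivative filters, the exponents in `alphas`, and the function names are all assumptions chosen for the example. The quantity it returns is easy to compute; the missing term, -log Z, is the intractable part that the bounds in the paper address.

```python
import numpy as np


def filter_responses(image, filt):
    """Valid-mode 2D correlation of `image` with a small filter (no padding)."""
    H, W = image.shape
    h, w = filt.shape
    out = np.empty((H - h + 1, W - w + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + h, j:j + w] * filt)
    return out


def unnormalized_log_prob(image, filters, alphas):
    """Sum over filters and positions of log psi(y), where
    psi(y) = (1 + y^2 / 2) ** (-alpha) is an assumed Student-t style potential.

    The result omits -log Z, the log partition function, so values are
    comparable across images for one model but not across models.
    """
    total = 0.0
    for filt, alpha in zip(filters, alphas):
        y = filter_responses(image, filt)
        total += -alpha * np.sum(np.log1p(0.5 * y ** 2))
    return total


# Hypothetical model: horizontal and vertical first-derivative filters.
filters = [np.array([[1.0, -1.0]]), np.array([[1.0], [-1.0]])]
alphas = [1.0, 1.0]

flat = np.zeros((8, 8))                                # constant image
noisy = np.random.default_rng(0).standard_normal((8, 8))  # white noise

flat_score = unnormalized_log_prob(flat, filters, alphas)
noisy_score = unnormalized_log_prob(noisy, filters, alphas)
```

With these assumed filters, the constant image produces zero filter responses and therefore the highest possible score, while the noise image scores lower. Comparing two different filter sets on the same image would require Z for each model, which is why likelihood comparison between models is hard without bounds on the partition function.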

Keywords

Computer science, Likelihood function, Gaussian, Algorithm, Artificial intelligence, Filter (signal processing), Importance sampling, Monte Carlo method, Partition (number theory), Machine learning, Mathematics, Estimation theory, Statistics, Computer vision

Affiliated Institutions

Hebrew University of Jerusalem IL

Publication Info

Year: 2007
Type: article
Pages: 1-8
Citations: 310
Access: Closed
