Object class recognition by unsupervised scale-invariant learning

R. Fergus, P. Perona, A. Zisserman

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present a method to learn and recognize object class models from unlabeled and unsegmented cluttered scenes in a scale invariant manner. Objects are modeled as flexible constellations of parts. A probabilistic representation is used for all aspects of the object: shape, appearance, occlusion and relative scale. An entropy-based feature detector is used to select regions and their scale within the image. In learning the parameters of the scale-invariant object model are estimated. This is done using expectation-maximization in a maximum-likelihood setting. In recognition, this model is used in a Bayesian manner to classify images. The flexible nature of the model is demonstrated by excellent results over a range of datasets including geometrically constrained classes (e.g. faces, cars) and flexible objects (such as animals).

Original languageEnglish (US)
Title of host publicationProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Volume2
StatePublished - 2003
Event2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Madison, WI, United States
Duration: Jun 18 2003Jun 20 2003

Other

Other2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition
CountryUnited States
CityMadison, WI
Period6/18/036/20/03

Fingerprint

Maximum likelihood
Animals
Entropy
Railroad cars
Detectors

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Software
  • Control and Systems Engineering
  • Electrical and Electronic Engineering

Cite this

Fergus, R., Perona, P., & Zisserman, A. (2003). Object class recognition by unsupervised scale-invariant learning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Vol. 2)

Object class recognition by unsupervised scale-invariant learning. / Fergus, R.; Perona, P.; Zisserman, A.

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Vol. 2 2003.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Fergus, R, Perona, P & Zisserman, A 2003, Object class recognition by unsupervised scale-invariant learning. in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. vol. 2, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Madison, WI, United States, 6/18/03.
Fergus R, Perona P, Zisserman A. Object class recognition by unsupervised scale-invariant learning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Vol. 2. 2003
Fergus, R. ; Perona, P. ; Zisserman, A. / Object class recognition by unsupervised scale-invariant learning. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Vol. 2 2003.
@inproceedings{6da26d38a9ab43b8aa10989420c237f1,
title = "Object class recognition by unsupervised scale-invariant learning",
abstract = "We present a method to learn and recognize object class models from unlabeled and unsegmented cluttered scenes in a scale invariant manner. Objects are modeled as flexible constellations of parts. A probabilistic representation is used for all aspects of the object: shape, appearance, occlusion and relative scale. An entropy-based feature detector is used to select regions and their scale within the image. In learning the parameters of the scale-invariant object model are estimated. This is done using expectation-maximization in a maximum-likelihood setting. In recognition, this model is used in a Bayesian manner to classify images. The flexible nature of the model is demonstrated by excellent results over a range of datasets including geometrically constrained classes (e.g. faces, cars) and flexible objects (such as animals).",
author = "R. Fergus and P. Perona and A. Zisserman",
year = "2003",
language = "English (US)",
volume = "2",
booktitle = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

}

TY - GEN

T1 - Object class recognition by unsupervised scale-invariant learning

AU - Fergus, R.

AU - Perona, P.

AU - Zisserman, A.

PY - 2003

Y1 - 2003

N2 - We present a method to learn and recognize object class models from unlabeled and unsegmented cluttered scenes in a scale invariant manner. Objects are modeled as flexible constellations of parts. A probabilistic representation is used for all aspects of the object: shape, appearance, occlusion and relative scale. An entropy-based feature detector is used to select regions and their scale within the image. In learning the parameters of the scale-invariant object model are estimated. This is done using expectation-maximization in a maximum-likelihood setting. In recognition, this model is used in a Bayesian manner to classify images. The flexible nature of the model is demonstrated by excellent results over a range of datasets including geometrically constrained classes (e.g. faces, cars) and flexible objects (such as animals).

AB - We present a method to learn and recognize object class models from unlabeled and unsegmented cluttered scenes in a scale invariant manner. Objects are modeled as flexible constellations of parts. A probabilistic representation is used for all aspects of the object: shape, appearance, occlusion and relative scale. An entropy-based feature detector is used to select regions and their scale within the image. In learning the parameters of the scale-invariant object model are estimated. This is done using expectation-maximization in a maximum-likelihood setting. In recognition, this model is used in a Bayesian manner to classify images. The flexible nature of the model is demonstrated by excellent results over a range of datasets including geometrically constrained classes (e.g. faces, cars) and flexible objects (such as animals).

UR - http://www.scopus.com/inward/record.url?scp=0041940256&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0041940256&partnerID=8YFLogxK

M3 - Conference contribution

VL - 2

BT - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

ER -