Abstract

Natural scene classification is a fundamental challenge in computer vision. By far, the majority of studies have limited their scope to scenes from single image stills and thereby ignore potentially informative temporal cues. The current paper is concerned with determining the degree of performance gain in considering short videos for recognizing natural scenes. Towards this end, the impact of multiscale orientation measurements on scene classification is systematically investigated, as related to: (i) spatial appearance, (ii) temporal dynamics and (iii) joint spatial appearance and dynamics. These measurements in visual space, x-y, and spacetime, x-y-t, are recovered by a bank of spatiotemporal oriented energy filters. In addition, a new data set is introduced that contains 420 image sequences spanning fourteen scene categories, with temporal scene information due to objects and surfaces decoupled from camera-induced ones. This data set is used to evaluate classification performance of the various orientation-related representations, as well as state-of-the-art alternatives. It is shown that a notable performance increase is realized by spatiotemporal approaches in comparison to purely spatial or purely temporal methods.

Keywords

Orientation (vector space)Artificial intelligenceComputer scienceSet (abstract data type)Computer visionScene statisticsPattern recognition (psychology)Space (punctuation)Natural (archaeology)GeographyMathematicsPerception

Affiliated Institutions

Related Publications

NeRF

We present a method that achieves state-of-the-art results for synthesizing novel views of complex scenes by optimizing an underlying continuous volumetric scene function using ...

2021 Communications of the ACM 4497 citations

Publication Info

Year
2012
Type
article
Pages
1306-1313
Citations
127
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

127
OpenAlex

Cite This

Konstantinos G. Derpanis, Matthieu Lecce, Kostas Daniilidis et al. (2012). Dynamic scene understanding: The role of orientation features in space and time in scene classification. , 1306-1313. https://doi.org/10.1109/cvpr.2012.6247815

Identifiers

DOI
10.1109/cvpr.2012.6247815