Learning to recognize objects in egocentric activities

Alireza Fathi; Xiaofeng Ren; James M. Rehg

doi:10.1109/cvpr.2011.5995444

Abstract

This paper addresses the problem of learning object models from egocentric video of household activities, using extremely weak supervision. For each activity sequence, we know only the names of the objects which are present within it, and have no other knowledge regarding the appearance or location of objects. The key to our approach is a robust, unsupervised bottom up segmentation method, which exploits the structure of the egocentric domain to partition each frame into hand, object, and background categories. By using Multiple Instance Learning to match object instances across sequences, we discover and localize object occurrences. Object representations are refined through transduction and object-level classifiers are trained. We demonstrate encouraging results in detecting novel object instances using models produced by weakly-supervised learning.

Keywords

Artificial intelligenceObject (grammar)Computer scienceExploitSegmentationDomain (mathematical analysis)Partition (number theory)Cognitive neuroscience of visual object recognitionComputer visionLearning objectFrame (networking)Pattern recognition (psychology)Machine learningMathematics

Affiliated Institutions

Related Publications

Discovering objects and their location in images

Josef Šivic , Bryan Russell , Alexei A. Efros +2 more

We seek to discover the object categories depicted in a set of unlabelled images. We achieve this using a model developed in the statistical text literature: probabilistic Laten...

2005 980 citations

The Role of Context for Object Detection and Semantic Segmentation in the Wild

Roozbeh Mottaghi , Xianjie Chen , Xiaobai Liu +5 more

In this paper we study the role of context in existing state-of-the-art detection and segmentation approaches. Towards this goal, we label every pixel of PASCAL VOC 2010 detecti...

2014 1424 citations

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick , Jeff Donahue , Trevor Darrell +1 more

Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that...

2014 30615 citations

Deformable Part Descriptors for Fine-Grained Recognition and Attribute Prediction

Ning Zhang , Ryan Farrell , Forrest Iandola +1 more

Recognizing objects in fine-grained domains can be extremely challenging due to the subtle differences between subcategories. Discriminative markings are often highly localized,...

2013 201 citations

Self-taught object localization with deep networks

Loris Bazzani , Alessandra Bergamo , Dragomir Anguelov +1 more

This paper introduces self-taught object localization, a novel approach that leverages deep convolutional networks trained for whole-image recognition to localize objects in ima...

2016 144 citations

Publication Info

Year: 2011
Type: article
Citations: 534
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Learning to recognize objects in egocentric activities

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

534

OpenAlex

Cite This

APA Style

                            
                                    Alireza Fathi, 
                                
                                    Xiaofeng Ren, 
                                
                                    James M. Rehg
                                
                            (2011). 
                            Learning to recognize objects in egocentric activities. 
                            
                            .
                            https://doi.org/10.1109/cvpr.2011.5995444

Identifiers

DOI: 10.1109/cvpr.2011.5995444