PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud

Abstract

In this paper, we propose PointRCNN for 3D object detection from raw point cloud. The whole framework is composed of two stages: stage-1 for the bottom-up 3D proposal generation and stage-2 for refining proposals in the canonical coordinates to obtain the final detection results. Instead of generating proposals from RGB image or projecting point cloud to bird's view or voxels as previous methods do, our stage-1 sub-network directly generates a small number of high-quality 3D proposals from point cloud in a bottom-up manner via segmenting the point cloud of the whole scene into foreground points and background. The stage-2 sub-network transforms the pooled points of each proposal to canonical coordinates to learn better local spatial features, which is combined with global semantic features of each point learned in stage-1 for accurate box refinement and confidence prediction. Extensive experiments on the 3D detection benchmark of KITTI dataset show that our proposed architecture outperforms state-of-the-art methods with remarkable margins by using only point cloud as input. The code is available at https://github.com/sshaoshuai/PointRCNN.

Keywords

Point cloudComputer scienceBenchmark (surveying)Artificial intelligenceObject detectionComputer visionPoint (geometry)Object (grammar)RGB color modelCloud computingDeep learningData miningPattern recognition (psychology)MathematicsGeography

Affiliated Institutions

Chinese University of Hong Kong HK

Related Publications

VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection

Yin Zhou , Oncel Tuzel

Accurate detection of objects in 3D point clouds is a central problem in many applications, such as autonomous navigation, housekeeping robots, and augmented/virtual reality. To...

2018 4245 citations

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick , Jeff Donahue , Trevor Darrell +1 more

Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that...

2014 30615 citations

Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition

Dong Yu , Sabato Marco Siniscalchi , Li Deng +1 more

Generation of high-precision sub-phonetic attribute (also known as phonological features) and phone lattices is a key frontend component for detection-based bottom-up speech rec...

2012 64 citations

FCOS: Fully Convolutional One-Stage Object Detection

Zhi Tian , Chunhua Shen , Hao Chen +1 more

We propose a fully convolutional one-stage object detector (FCOS) to solve object detection in a per-pixel prediction fashion, analogue to semantic segmentation. Almost all stat...

2019 5672 citations

Deep High-Resolution Representation Learning for Human Pose Estimation

Ke Sun , Bin Xiao , Dong Liu +1 more

In this paper, we are interested in the human pose estimation problem with a focus on learning reliable high-resolution representations. Most existing methods recover high-resol...

2019 5184 citations

Publication Info

Year: 2019
Type: article
Pages: 770-779
Citations: 2773
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

2773

OpenAlex

Cite This

APA Style

                            
                                    Shaoshuai Shi, 
                                
                                    Xiaogang Wang, 
                                
                                    Hongsheng Li
                                
                            (2019). 
                            PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud. 
                            
                            , 770-779.
                            https://doi.org/10.1109/cvpr.2019.00086

Identifiers

DOI: 10.1109/cvpr.2019.00086