Abstract

We propose pixelNeRF, a learning framework that predicts a continuous neural scene representation conditioned on one or few input images. The existing approach for constructing neural radiance fields [27] involves optimizing the representation to every scene independently, requiring many calibrated views and significant compute time. We take a step towards resolving these shortcomings by introducing an architecture that conditions a NeRF on image inputs in a fully convolutional manner. This allows the network to be trained across multiple scenes to learn a scene prior, enabling it to perform novel view synthesis in a feed-forward manner from a sparse set of views (as few as one). Leveraging the volume rendering approach of NeRF, our model can be trained directly from images with no explicit 3D supervision. We conduct extensive experiments on ShapeNet benchmarks for single image novel view synthesis tasks with held-out objects as well as entire unseen categories. We further demonstrate the flexibility of pixelNeRF by demonstrating it on multi-object ShapeNet scenes and real scenes from the DTU dataset. In all cases, pixelNeRF outperforms current state-of-the-art baselines for novel view synthesis and single image 3D reconstruction. For the video and code, please visit the project website:https://alexyu.net/pixelnerf.

Keywords

Computer scienceRadianceRendering (computer graphics)Artificial intelligenceConvolutional neural networkRepresentation (politics)Artificial neural networkSet (abstract data type)Computer visionView synthesisCode (set theory)Image synthesisObject (grammar)Image (mathematics)Flexibility (engineering)

Affiliated Institutions

Related Publications

NeRF

We present a method that achieves state-of-the-art results for synthesizing novel views of complex scenes by optimizing an underlying continuous volumetric scene function using ...

2021 Communications of the ACM 4497 citations

Publication Info

Year
2021
Type
article
Pages
4576-4585
Citations
1273
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1273
OpenAlex

Cite This

Alex Yu, Vickie Ye, Matthew Tancik et al. (2021). pixelNeRF: Neural Radiance Fields from One or Few Images. , 4576-4585. https://doi.org/10.1109/cvpr46437.2021.00455

Identifiers

DOI
10.1109/cvpr46437.2021.00455