DA4NeRF: Depth-aware Augmentation technique for Neural Radiance Fields

Razavi Khosroshahi, Hamed; Sancho Aragon, Jaime; Bang, Gun; Lafruit, Gauthier; Juarez, Eduardo; Teratani, Mehrdad

doi:doi/10.1016/j.jvcir.2024.104365

Citer

DA4NeRF: Depth-aware Augmentation technique for Neural Radiance Fields

par Razavi Khosroshahi, Hamed

;Sancho Aragon, Jaime ;Bang, Gun ;Lafruit, Gauthier

;Juarez, Eduardo ;Teratani, Mehrdad

Référence Journal of visual communication and image representation, 107, 104365
Publication Publié, 2025-03

Article révisé par les pairs

Résumé :

Neural Radiance Fields (NeRF) demonstrate impressive capabilities in rendering novel views of specific scenes by learning an implicit volumetric representation from posed RGB images without any depth information. View synthesis is the computational process of synthesizing novel images of a scene from different viewpoints, based on a set of existing images. One big problem is the need for a large number of images in the training datasets for neural network-based view synthesis frameworks. The challenge of data augmentation for view synthesis applications has not been addressed yet. NeRF models require comprehensive scene coverage in multiple views to accurately estimate radiance and density at any point. In cases without sufficient coverage of scenes with different viewing directions, cannot effectively interpolate or extrapolate unseen scene parts. In this paper, we introduce a new pipeline to tackle this data augmentation problem using depth data. We use MPEG's Depth Estimation Reference Software and Reference View Synthesizer to add novel non-existent views to the training sets needed for the NeRF framework. Experimental results show that our approach improves the quality of the rendered images using NeRF's model. The average quality increased by 6.4 dB in terms of Peak Signal-to-Noise Ratio (PSNR), with the highest increase being 11 dB. Our approach not only adds the ability to handle the sparsely captured multiview content to be used in the NeRF framework, but also makes NeRF more accurate and useful for creating high-quality virtual views.

Référencement	Visibilité	Pérennité	Facilité
Les publications encodées constituent la bibliographie académique de l'Université.	Les documents déposés sont indexés par les moteurs de recherche (Google Scholar,…).	Les documents déposés en open-access sont archivés au sein du réseau de préservation SAFE-PLN (www.safepln.org).	Les listes de publications sont compatibles avec le CV-ULB, le FNRS et accessibles sur le web.

DA4NeRF: Depth-aware Augmentation technique for Neural Radiance Fields

Documents en relation

DI-fusion