PCM-NeRF: Probabilistic Camera Modeling for Neural Radiance Fields under Pose Uncertainty

Shravan Venkatraman*1, Rakesh Raj Madavan*2, Pavan Kumar Sathya Venkatesh*3
1Mohamed bin Zayed University of AI, UAE 2University of Amsterdam, The Netherlands 3University of Massachusetts, Amherst, USA

CVPRW (GenRecon3D) 2026

PCM-NeRF, a probabilistic framework that represents each camera pose as a distribution with learnable uncertainty, automatically damping gradient updates for unreliable views and preventing outlier cameras from destabilizing neural surface reconstruction.


PCM-NeRF Teaser Left PCM-NeRF Teaser Right

Abstract

Neural surface reconstruction methods typically treat camera poses as fixed values, assuming perfect accuracy from Structure-from-Motion (SfM) systems. This assumption breaks down with imperfect pose estimates, leading to distorted or incomplete reconstructions. We present PCM-NeRF, a probabilistic framework that augments neural surface reconstruction with per-camera learnable uncertainty, built directly on SG-NeRF. Rather than treating all cameras equally throughout optimization, we represent each pose as a distribution with a learnable mean and variance, initialized from SfM correspondence quality. An uncertainty regularization loss couples the learned variance to view confidence — derived from both geometric correspondences and rendering quality — and the resulting uncertainty directly modulates the effective pose learning rate: uncertain cameras receive damped gradient updates, preventing poorly initialized views from corrupting the reconstruction. Experiments on challenging scenes with severe pose outliers demonstrate that PCM-NeRF consistently outperforms state-of-the-art methods in both Chamfer Distance and F-Score, particularly for geometrically complex structures, without requiring foreground masks.


Contributions

  • We introduce a probabilistic per-camera pose representation for NSR in which each pose is modeled as a distribution in SE(3) with a learnable mean and per-axis variance, initialized from SfM correspondence quality and evolved jointly with the scene.
  • We derive an uncertainty-modulated pose learning rate that inversely scales each camera's effective gradient step by its learned uncertainty, automatically preventing outlier views from destabilizing reconstruction without manual filtering or threshold selection.
  • We propose a cross-modal consistency loss that aligns learned variance parameters with view confidence derived from both geometric correspondences and rendering quality, anchoring uncertainty to observable evidence rather than rendering gradients alone.
  • We demonstrate state-of-the-art surface reconstruction on challenging outlier-pose scenes, achieving the lowest Chamfer Distance on 6 of 8 scenes and the highest F-Score on 6 of 8 scenes.

PCM-NeRF Pipeline Overview

PCM-NeRF: Method

PCM-NeRF builds directly on SG-NeRF, extending gradient-based pose refinement with a lightweight probabilistic pose representation. Each camera pose Pi is modeled as a distribution in SE(3) with learnable mean μi = (ri, ti) and diagonal covariance Σi = diag(σ²r,i, σ²t,i), parameterized in log-space for numerical stability. All rendering uses the mean pose directly — adding no inference-time overhead — while the variance parameters are optimized through the uncertainty regularization loss.

View reliability is estimated from SfM correspondence density and a rolling buffer of per-camera PSNR scores, blended as γi(t) = (1−α)γi(0) + α·γ̂i(t) with α = 0.7. The uncertainty regularization loss encourages σ̄i ≈ (1−γi), and the learned uncertainty modulates the effective pose learning rate: ηipose = η₀ / (1 + σ̄i · κ), where κ = 5.0. Geometric consistency is enforced via the volumetric IoU loss over Mixture-of-Gaussian density distributions along matched rays, inherited from SG-NeRF.


Uncertainty Feedback Loop

Qualitative Results


Qualitative Comparisons

Qualitative comparison across Clock, Deaf, and Farmer scenes. PCM-NeRF recovers finer geometric detail compared to all baselines, particularly in regions where outlier pose initialization corrupts reconstruction.


Hyperparameter Sensitivity


Weight Sensitivity Analysis

Effect of IoU weight (λIoU) and uncertainty weight (λunc) on Chamfer Distance and F-Score. Best results are achieved at λIoU = 0.2 and λunc = 0.05.


Ablation Study: Component Contributions


Results averaged across all 8 scenes. Both the probabilistic uncertainty module and the feature correspondence structure contribute positively; the best performance is achieved only when both are active together.

Prob. Uncertainty Feature Corr. CD ↓ F-Score ↑
1.180.72
1.330.68
0.460.85
0.320.93

Quantitative Comparison


Chamfer Distance (↓) across all 8 scenes. PCM-NeRF achieves the best result on 6 of 8 scenes, reducing mean CD by 28.8% relative to SG-NeRF and improving mean F-Score from 0.85 to 0.93.

Method BabyBearBellClock DeafFarmerPavilionSculpture Mean CD ↓Mean F ↑
NeuS 0.690.313.331.16 0.552.490.290.66 1.190.73
Neuralangelo 0.700.650.38 0.594.891.950.31
BARF 1.080.283.310.19 0.462.130.380.57 1.050.73
SCNeRF 1.190.273.741.33 0.461.450.230.81 1.190.66
GARF 2.042.253.090.50 0.591.580.960.57 1.450.70
L2G-NeRF 1.150.291.260.24 0.402.180.460.37 0.790.76
Joint-TensoRF 3.111.222.490.36 0.362.511.350.70 1.510.59
SG-NeRF 0.560.250.980.15 0.450.870.200.22 0.460.85
Ours (PCM-NeRF) 0.250.240.510.16 0.230.790.240.20 0.330.93

Chamfer Distance ↓ per scene. Bold = best per column. PCM-NeRF achieves the lowest CD on 6 of 8 scenes.


BibTeX

@inproceedings{pcmnerf2026,
  title     = {PCM-NeRF: Probabilistic Camera Modeling for Neural Radiance Fields under Pose Uncertainty},
  author    = {Venkatraman, Shravan and Madavan, Rakesh Raj and {Pavan Kumar S}},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)},
  year      = {2026}
}