The implementation of VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field which provides a a new baseline of applying NeRFs to visual relocalization task. We use zip-nerf as implicit learning map (ILM) and 3D reconstruction as explicit geometric map (EGM) for efficient localization. In the localization process, we use SFD2 as local features, IMP as matcher, and online rendering of patches for matching. Results on 7Scenes and CambridgeLandmarks are promising and much better than previous LENS and NeRF-loc. However, the pose accuracy on Aachen dataset is not satisfying because of the poor quality of rendered images.
7Scenes dataset (patch size=15, median position (cm), rotation (deg) errors and average percentage of poses within error of 5cm, 5deg)
| chess | fire | heads | office | pumpkin | kitchen | stairs | Average (%) | |
|---|---|---|---|---|---|---|---|---|
| LENS | 3, 1.3 | 10, 3.7 | 7, 5.8 | 7, 1.9 | 8, 2.2 | 9, 2.2 | 14, 3.6 | - |
| NeRF-loc | 2, 1.1 | 2, 1.1 | 1, 1.9 | 2, 1.1 | 3, 1.3 | 3, 1.5 | 3, 1.3 | 89.5 |
| ACE | 2, 1.1 | 2, 1.8 | 2, 1.1 | 3, 1.4 | 3, 1.3 | 3, 1.3 | 3, 1.2 | 97.1 |
| SP+SG | 0, 0.1 | 1, 0.2 | 0, 0.2 | 1, 0.2 | 1, 0.1 | 0, 0.1 | 2, 0.6 | 95.7 |
| SFD2+IMP | 0, 0.1 | 1, 0.2 | 0, 0.2 | 1, 0.2 | 1, 0.2 | 0, 0 | 2, 0.5 | 95.7 |
| VRS-NeRF | 0, 0.1 | 1, 0.2 | 0, 0.2 | 1, 0.2 | 1, 0.2 | 0, 0.1 | 3, 0.8 | 93.1 |
CambridgeLandmarks (patch size=15, median position (cm), rotation (deg) errors and average percentage of poses within error of 25cm, 2deg)
| Kings College | Great Court | Old Hospital | Shop Facade | St Mary Church | Average (%) | |
|---|---|---|---|---|---|---|
| LENS | 33, 0.5 | - | 44, 0.9 | 27, 1.6 | 53, 1.6 | - |
| NeRF-loc | 7, 0.2 | 25, 0.1 | 18, 0.4 | 11, 0.2 | 4, 0.2 | - |
| ACE | 18, 0.4 | 42, 0.2 | 31, 0.6 | 5, 0.3 | 19, 0.6 | 54.68 |
| SP+SG | 7, 0.1 | 12, 0.1 | 9, 0.2 | 2, 0.1 | 4, 0.1 | 89.4 |
| SFD2+IMP | 7, 0.1 | 11, 0.1 | 10, 0.2 | 2, 0.1 | 4, 0.1 | 89.1 |
| VRS-NeRF | 9, 0.1 | - | 11, 0.2 | 2, 0.1 | 5, 0.2 | 89.3 |
| Day | Night | |
|---|---|---|
| ESAC | 42.6 / 59.6 / 75.5 | 3.1 / 9.2 / 11.2 |
| HSCNet | 71.1 / 81.9 / 91.7 | 32.7 / 43.9 / 65.3 |
| SP+SPG | 89.6 / 95.4 / 98.8 | 86.7 / 93.9 / 100.0 |
| SFD2+IMP | 89.7 / 96.5 / 98.9 | 84.7 / 94.9 / 100.0 |
| VRS-NeRF (15) | 60.8 / 67.8 / 73.1 | 19.4 / 22.4 / 25.5 |
| VRS-NeRF (31) | 70.1 / 76.9 / 80.9 | 44.9 / 51.0 / 62.2 |
@article{xue2024vrs, author = {Fei Xue and Ignas Budvytis and Daniel Olmeda Reino and Roberto Cipolla}, title = {VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field}, booktitle = {ECCVW}, year = {2024} } @inproceedings{sfd22023, title={{SFD2: Semantic-guided Feature Detection and Description}}, author={Xue, Fei and Budvytis, Ignas and Cipolla, Roberto}, booktitle={CVPR}, year={2023} } @inproceedings{imp2023, title={IMP: Iterative Matching and Pose Estimation with Adaptive Pooling}, author={Xue, Fei and Budvytis, Ignas and Cipolla, Roberto}, booktitle={CVPR}, year={2023} } @inproceedings{barron2023zipnerf, title={Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields}, author={Jonathan T. Barron and Ben Mildenhall and Dor Verbin and Pratul P. Srinivasan and Peter Hedman}, booktitle={ICCV}, year={2023} } 


