MOISST: Multimodal Optimization of Implicit Scene for SpatioTemporal calibration
IROS 2023

Quentin Herau
Nathan Piasco
Moussab Bennehar
Luis Roldão
Dzmitry Tsishkou
Cyrille Migniot
Pascal Vasseur
Cédric Demonceaux
[Paper]

Effect of calibration on novel view synthesis: training positions in red, ground truth positions in green, reference position in blue. The RGB images (top) and depth maps (middle) are rendered from an implicit neural 3D scene trained from non-calibrated (left) and calibrated with MOISST (right) sensors.


Abstract

With the recent advances in autonomous driving and the decreasing cost of LiDARs, the use of multimodal sensor systems is on the rise. However, in order to make use of the information provided by a variety of complimentary sensors, it is necessary to accurately calibrate them. We take advantage of recent advances in computer graphics and implicit volumetric scene representation to tackle the problem of multi-sensor spatial and temporal calibration. Thanks to a new formulation of the Neural Radiance Field (NeRF) optimization, we are able to jointly optimize calibration parameters along with scene representation based on radiometric and geometric measurements. Our method enables accurate and robust calibration from data captured in uncontrolled and unstructured urban environments, making our solution more scalable than existing calibration solutions. We demonstrate the accuracy and robustness of our method in urban scenes typically encountered in autonomous driving scenarios.



Supplementary video


Framework

Overview of MOISST optimization framework: First, the model is initialized with rays generated using rough spatial and temporal calibration priors in addition to the reference frame trajectory. After each optimization step, the rays are regenerated and fed to the NeRF model. We then render RGB images and depth maps which are used along the ground truth ones to compute the losses and propagate the gradients. Gradient descent algorithm is finally used to optimize both NeRF and calibration parameters.



Cite

Q. Herau, N. Piasco, M. Bennehar, L. Roldão, D. Tsishkou, C. Migniot, P. Vasseur, C. Demonceaux.
MOISST: Multimodal Optimization of Implicit Scene for SpatioTemporal calibration.
(hosted on IEEE)


[Bibtex]