Here we show rendered views from the variants of our model, trained on DL3DV and evaluated zero-shot on Tanks & Temples, against 3DGS which is optimized per scene for ~13 minutes.
Hover over the labels at the left to select which output to view. Hold mouse down anywhere to flip to ground truth for comparison.