Light3R-SfM: A Scalable and Efficient Feed-Forward Approach to Structure-from-Motion
Structure-from-motion (SfM) focuses on recovering camera positions and building 3D scenes from multiple images. This process is important for tasks like 3D reconstruction and novel view synthesis. A major challenge comes from processing large image collections efficiently while maintaining accuracy. Several approaches rely on the optimization of camera poses and scene geometry. However, these have usually increased computational costs substantially, and scaling SfM for large datasets remains challenging due to the sensitivity of balancing speed, accuracy, and memory consumption. Currently, SfM methods follow two main approaches: incremental and global. Incremental methods build 3D scenes step by step, starting from two images, while global methods align all cameras at once before reconstruction. Both rely on feature detection, matching, 3D triangulation, and optimization, leading to high computational costs and memory usage. Some learning-based methods improve accuracy but stru...