Abstract: Visual simultaneous localization and mapping (SLAM) systems that assume static environments often struggle with dynamic objects, resulting in degraded localization robustness. To address ...
Abstract: 3D semantic segmentation plays a fundamental and crucial role to understand 3D scenes. While contemporary state-of-the-art techniques predominantly concentrate on elevating the overall ...
A fundamental challenge for GUI agents is robustly grounding natural language instructions, which requires not only precise spatial alignment (locating elements accurately) but also correct semantic ...