system for sparse semantic scene understanding using VLMs. Architect a cloud-retrieval pipeline consisting of scene... representation storage, localization and data retrieval, as well VLM-based map creation. Implement the system using off-the-shelf...