Abstract: Vision-Language Models (VLMs) have advanced cross-modal understanding and generation, yet their domain adaptability remains limited. To address the lack of high-quality captions for fish ...
Abstract: This paper addresses the problem of robust end-effector pose control for micro-nano free-flying space robots operating in proximity to targets. To tackle this issue, we propose a ...