ByteDance's UI-TARS-1.5-7B is a Qwen2.5-VL fine-tune specifically for GUI understanding. Apache 2.0, 7B, runs on vLLM natively ScreenSpot-Pro 49.6%, ScreenSpot-V2 94.2%, OSWorld 27.5% Inherits Qwen2.5 ...
This is the github repository of GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent. In this work, we propose GUI-explorer. It synergizes two key components: ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results