[AAAI 2026]Release of code, datasets and model for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for Generalized GUI Agents
agent vision-language-model vision-language-action computer-use gui-agent vision-language-action-model computer-use-agent tongui
-
Updated
Dec 1, 2025 - HTML