ShowUI: One Vision-Language-Action Model for GUI Visual Agent
Paper
•
2411.17465
•
Published
•
89
One Vision-Language-Action Model for GUI Agent
Generate clickable coordinates on a screenshot
Totally Free + Zero Barriers + No Login Required