UI-TARS-7B-SFT

Next-generation native GUI proxy model that seamlessly interacts with graphical user interfaces.

CommonProductProductivityMulti-modal interactionAutomation
UI-TARS, developed by ByteDance's research team, is a next-generation native GUI proxy model aimed at seamless interaction with graphical user interfaces leveraging human-like perception, reasoning, and action capabilities. This model integrates all key components such as perception, reasoning, localization, and memory, enabling end-to-end task automation without predefined workflows or manual rules. Its main advantages include powerful multi-modal interaction capabilities, high-precision visual perception and semantic understanding, and excellent performance across various complex task scenarios. This model is particularly suitable for automation of GUI interactions, such as in automated testing and smart office environments, significantly improving work efficiency.
Visit

UI-TARS-7B-SFT Visit Over Time

Monthly Visits

21315886

Bounce Rate

45.50%

Page per Visit

5.2

Visit Duration

00:05:02

UI-TARS-7B-SFT Visit Trend

UI-TARS-7B-SFT Visit Geography

UI-TARS-7B-SFT Traffic Sources

UI-TARS-7B-SFT Alternatives