An AI agent capability where the model directly controls a desktop or browser—clicking buttons, typing text, navigating menus, and reading screens like a human user. Computer use agents can operate any software, even without an API, by interacting with the GUI. This enables automation of legacy systems, complex SaaS workflows, and tasks that span multiple applications without custom integrations.
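The interaction described above is often structured as a loop: observe the screen, decide on an action, perform it, and repeat. The following is a minimal sketch of that loop, not any vendor's actual API. `FakeDesktop`, `Action`, `scripted_policy`, and `run_agent` are all hypothetical names introduced for illustration. The "screen" is a text string standing in for a screenshot, and a fixed script stands in for the model.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Action:
    kind: str          # "click", "type", or "done"
    target: str = ""   # label of the on-screen element to click
    text: str = ""     # text to type, if any


@dataclass
class FakeDesktop:
    """Hypothetical stand-in for a real GUI: a login form with two fields."""
    fields: dict = field(default_factory=lambda: {"username": "", "password": ""})
    focused: Optional[str] = None
    logged_in: bool = False

    def screenshot(self) -> str:
        # A real agent would receive pixels; here the screen is plain text.
        if self.logged_in:
            return "Welcome page"
        return "Login form: [username] [password] [Sign in]"

    def apply(self, action: Action) -> None:
        # Translate an abstract action into a change in GUI state.
        if action.kind == "click" and action.target in self.fields:
            self.focused = action.target
        elif action.kind == "click" and action.target == "Sign in":
            self.logged_in = all(self.fields.values())
        elif action.kind == "type" and self.focused is not None:
            self.fields[self.focused] += action.text


def scripted_policy(screen: str, history: List[Action]) -> Action:
    """Placeholder for the model: chooses the next action from what it sees."""
    if "Welcome" in screen:
        return Action("done")
    plan = [
        Action("click", target="username"),
        Action("type", text="alice"),
        Action("click", target="password"),
        Action("type", text="s3cret"),
        Action("click", target="Sign in"),
    ]
    return plan[len(history)] if len(history) < len(plan) else Action("done")


def run_agent(
    desktop: FakeDesktop,
    policy: Callable[[str, List[Action]], Action],
    max_steps: int = 10,
) -> List[Action]:
    history: List[Action] = []
    for _ in range(max_steps):
        screen = desktop.screenshot()      # observe
        action = policy(screen, history)   # decide
        if action.kind == "done":
            break
        desktop.apply(action)              # act
        history.append(action)
    return history


if __name__ == "__main__":
    desktop = FakeDesktop()
    steps = run_agent(desktop, scripted_policy)
    print(len(steps), desktop.logged_in)
```

In a real system the policy would be a model call that receives an actual screenshot, and `apply` would drive the mouse and keyboard. The `max_steps` cap is one simple way to keep such a loop from running indefinitely.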