What is a Computer Use Agent?
You might’ve heard of Gemini Computer Use, Claude Computer Use, or OpenAI’s Computer Using Agent.
These are powerful tools that can convert natural language into actions on the computer. However, you’d otherwise need to write your own code to convert these actions into Playwright commands.
Stagehand not only handles the execution of Computer Use outputs, but also lets you hot-swap between Google, OpenAI, and Anthropic models with one line of code. You can find more information on the performance of different computer use models by visiting our evals page.
How to use a Computer Use Agent in Stagehand
Stagehand lets you use Computer Use Agents with one line of code:IMPORTANT! Configure your browser dimensionsComputer Use Agents will often return XY-coordinates to click on the screen, so you’ll need to configure your browser dimensions.If not specified, the default browser dimensions are 1288 x 711. You can also configure the browser dimensions in the
browserbaseSessionCreateParams or localBrowserLaunchOptions options.Configuring browser dimensions
Browser configuration differs by environment:- BROWSERBASE
- LOCAL
Direct your Computer Use Agent
Callexecute on the agent to assign a task to the agent.
maxSteps:
Select Your Computer Use Model
Stagehand supports computer use models from Google, Anthropic, and OpenAI. You can find all supported models on the models page.- Google
- Anthropic
- OpenAI

