Interact with a website
You can use Stagehand to intelligently interact with a website using AI
Executing actions
Stagehand has an act()
function that can be used to execute actions on a page using natural language. Here’s an example of Stagehand to find jobs on LinkedIn:
This workflow is as simple as the following lines of code:
The page
object extends the Playwright page object, so you can use any of the Playwright page methods with it.
Read structured data from the page
You can use the extract()
method to extract structured data from the page. Here’s an example of how to extract the job title from the job posting:
Stagehand uses Zod to help you define the schema of the data to be extracted.
Preview/Cache an action
Sometimes you want to preview an action before it’s executed. You can do this by calling page.observe()
before act()
.
action
will be a JSON object that describes the action to be taken.
For more on caching, see the caching docs.
What actions can I take?
Stagehand maps natural language to Playwright actions.
We generally support the following actions:
Action | Description |
---|---|
scrollIntoView | Scrolls an element into the visible area of the browser window |
scrollTo | Scrolls to a specific percentage of the page height |
fill | Fills in form fields with specified text |
type | Types text into input fields (alias for fill ) |
press | Simulates pressing keyboard keys |
click | Clicks on elements matching the specified selector |
nextChunk | Scrolls the height of the viewport by 100% |
prevChunk | Scrolls the height of the viewport by -100% |
Each of these actions can be triggered using natural language commands. For example: