Skip to main content

Browser AI Agent

AI agent that remotely controls browsers using vision-based automation

Properties

PropertyValue
Typebrowser-ai-agent
FamilyAI
GroupBrowser

Parameters

NameLabelTypeDescriptionRequired
modelModelSelectAI model to use for vision-based automationYes
taskBrowser TaskStringThe task to perform in the browser (e.g., ‘Click the Sign In button’)Yes
systemPromptSystem PromptStringSystem prompt for the AI agentNo
maxStepsMax StepsNumberMaximum number of automation steps to takeNo
temperatureTemperatureNumberAI creativity level (lower = more deterministic)No

Outputs

NameLabelTypeDescription
successsuccessbooleanWhether the task was completed successfully
stepsExecutedstepsExecutednumberNumber of steps executed
actionLogactionLogarrayLog of all actions taken with screenshots
finalScreenshotfinalScreenshotstringURL of the final screenshot
htmlReporthtmlReportstringHTML report with step-by-step screenshots
animationScriptsanimationScriptsarrayAnimation data for video playback