
Why Skyvern?
Traditional approaches to browser automations required writing custom scripts for websites, often relying on DOM parsing and XPath-based interactions which would break whenever the website layouts changed. Instead of only relying on code-defined XPath interactions, Skyvern relies on prompts in addition to computer vision and LLMs to the mix to parse items in the viewport in real-time, create a plan for interaction and interact with them. This approach gives us a few advantages:- Skyvern can operate on websites itβs never seen before, as itβs able to map visual elements to actions necessary to complete a workflow, without any customized code
- Skyvern is resistant to website layout changes, as there are no pre-determined XPaths or other selectors our system is looking for while trying to navigate
- Skyvern is able to take a single workflow and apply it to a large number of websites, as itβs able to reason through the interactions necessary to complete the workflow
- Skyvern leverages LLMs to reason through interactions to ensure we can cover complex situations. Examples include:
- If you wanted to get an auto insurance quote from Geico, the answer to a common question βWere you eligible to drive at 18?β could be inferred from the driver receiving their license at age 16
- If you were doing competitor analysis, itβs understanding that an Arnold Palmer 22 oz can at 7/11 is almost definitely the same product as a 23 oz can at Gopuff (even though the sizes are slightly different, which could be a rounding error!)
How it works
Skyvern was inspired by the Task-Driven autonomous agent design popularized by BabyAGI and AutoGPT β with one major bonus: we give Skyvern the ability to interact with websites using browser automation libraries like Playwright.

Skyvern use-cases
Automatically apply to jobs
Watch Skyvern automatically apply to jobs
Automate e-commerce transactions
Watch Skyvern automate purchases on e-commerce websites
Interact with government websites
Watch Skyvern automate interacting with government websites
Generate insurance quotes
Watch Skyvern navigate complex multi-page forms in any language

