Expect

Expect is a skill for testing your agent's code in a real browser.

Demo →

Getting Started

1. Ask your agent to fetch https://www.expect.dev/ and run the init script.
2. Run `/expect` inside Claude Code, Codex, or another supported agent.
3. Expect spawns subagents that simulate real logged-in users to find issues and regressions.
4. Your agent fixes any issues Expect finds, then re-runs Expect to verify the fixes.

FAQ

1. What is Expect?

A skill that reads your git changes, generates a test plan, and runs it in a real browser with Playwright. It hooks into your existing agent (for example Claude Code, Codex, or Cursor) and runs entirely on your machine. It checks for performance (long animation frames, INP, LCP), security (npm dependencies, CSRF attacks, other vulnerabilities), design issues (broken hover states, links, buttons), and app completeness (missing metadata, dead links).

2. Why not just use Puppeteer, Playwright, or Cypress?

Instead of writing scripts, maintaining selectors, and wiring up assertions, Expect reads your code changes and tests them in a real browser automatically. It's like giving your agent QA superpowers.

3. How is this different from computer-use agents?

General-purpose browser tools rely on screenshots and mouse coordinates. Expect is purpose-built for testing: it uses Playwright for fast DOM automation, reads your code changes, generates a test plan, and runs it with your real cookies, then reports back what's broken so the agent can fix it.

4. Does it work in CI?

Yes. Pass `--ci` to run in CI mode, or use the `add github-action` command to set up a workflow that tests every PR. In CI mode, Expect runs headless, skips cookie extraction, auto-approves the plan, and enforces a 30-minute timeout.
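
As a rough illustration of what CI mode changes, the sketch below models the four effects listed above as a plain settings object. The `RunSettings` type, the `defaults` value, and `applyCiMode` are hypothetical names invented for this example and are not part of Expect's API. The default for cookie extraction is an assumption inferred from the existence of `--no-cookies`, and the sketch does not model how an explicit `--timeout` interacts with `--ci`.

```typescript
// Hypothetical model of the run settings affected by CI mode (not Expect's real types).
interface RunSettings {
  browserMode: "headed" | "headless";
  cookies: boolean; // whether system browser cookies are extracted
  autoApprove: boolean; // whether the plan runs without confirmation
  timeoutMs?: number; // execution timeout in milliseconds
}

// 30 minutes expressed in milliseconds, the unit --timeout uses.
const CI_TIMEOUT_MS = 30 * 60 * 1000;

// Assumed non-CI defaults: headed browser (per the Options table), cookies on.
const defaults: RunSettings = {
  browserMode: "headed",
  cookies: true,
  autoApprove: false,
};

// Apply the four CI-mode effects described in the FAQ answer.
function applyCiMode(settings: RunSettings): RunSettings {
  return {
    ...settings,
    browserMode: "headless",
    cookies: false,
    autoApprove: true,
    timeoutMs: CI_TIMEOUT_MS,
  };
}

const ciSettings = applyCiMode(defaults);
console.log(ciSettings);
```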

5. Does it support mobile testing?

Coming soon.

6. Is there a hosted or enterprise version?

Coming soon. Email aiden@million.dev if you have questions or ideas.

Options

| Flag | Description | Default |
| --- | --- | --- |
| `-m, --message <instruction>` | Natural language instruction for what to test | - |
| `-f, --flow <slug>` | Reuse a saved flow by its slug | - |
| `-y, --yes` | Run immediately without confirmation | - |
| `-a, --agent <provider>` | Agent provider (`claude`, `codex`, `copilot`, `gemini`, `cursor`, `opencode`, `droid`, `pi`) | auto-detect |
| `-t, --target <target>` | What to test: `unstaged`, `branch`, or `changes` | `changes` |
| `-u, --url <urls...>` | Base URL(s) for the dev server (skips port picker) | - |
| `--browser-mode <mode>` | Browser mode: `headed` or `headless` | `headed` |
| `--cdp <url>` | Connect to an existing Chrome via CDP WebSocket URL | - |
| `--profile <name>` | Reuse a Chrome profile by name (e.g. Default) | - |
| `--no-cookies` | Skip system browser cookie extraction | - |
| `--ci` | Force CI mode: headless, no cookies, auto-yes, 30-min timeout | - |
| `--timeout <ms>` | Execution timeout in milliseconds | - |
| `--output <format>` | Output format: `text` or `json` | `text` |
| `--verbose` | Enable verbose logging | - |
| `-v, --version` | Print version | - |
| `-h, --help` | Display help | - |
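
If you launch Expect from a script, it can help to assemble the flags in one place. The sketch below is a minimal example, not part of Expect: `ExpectOptions` and `buildArgs` are hypothetical names, and only the flag spellings come from the table above. It returns the argument list only; how the executable is invoked is left to the caller, since this README does not state it.

```typescript
// Hypothetical option bag; field names are illustrative, flags come from the Options table.
interface ExpectOptions {
  message?: string;
  flow?: string;
  yes?: boolean;
  agent?: string;
  target?: "unstaged" | "branch" | "changes";
  urls?: string[];
  browserMode?: "headed" | "headless";
  cdp?: string;
  profile?: string;
  noCookies?: boolean;
  ci?: boolean;
  timeoutMs?: number;
  output?: "text" | "json";
  verbose?: boolean;
}

// Translate the option bag into a flat argument list, one flag at a time.
function buildArgs(o: ExpectOptions): string[] {
  const args: string[] = [];
  if (o.message !== undefined) args.push("--message", o.message);
  if (o.flow !== undefined) args.push("--flow", o.flow);
  if (o.yes) args.push("--yes");
  if (o.agent !== undefined) args.push("--agent", o.agent);
  if (o.target !== undefined) args.push("--target", o.target);
  // --url accepts one or more values after a single flag.
  if (o.urls && o.urls.length > 0) args.push("--url", ...o.urls);
  if (o.browserMode !== undefined) args.push("--browser-mode", o.browserMode);
  if (o.cdp !== undefined) args.push("--cdp", o.cdp);
  if (o.profile !== undefined) args.push("--profile", o.profile);
  if (o.noCookies) args.push("--no-cookies");
  if (o.ci) args.push("--ci");
  if (o.timeoutMs !== undefined) args.push("--timeout", String(o.timeoutMs));
  if (o.output !== undefined) args.push("--output", o.output);
  if (o.verbose) args.push("--verbose");
  return args;
}

// Example: a non-interactive run against the current branch with JSON output.
const exampleArgs = buildArgs({
  message: "test the checkout flow",
  yes: true,
  target: "branch",
  browserMode: "headless",
  output: "json",
});
console.log(exampleArgs);
```

Keeping the result as an array rather than a joined string avoids quoting problems when the message contains spaces.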

Supported Agents

Expect works with the following coding agents. It auto-detects which agents are installed on your PATH. If multiple are available, it defaults to the first one found. Use `-a <provider>` to pick a specific agent.

| Agent | Flag | Install |
| --- | --- | --- |
| Claude Code | `-a claude` | `npm install -g @anthropic-ai/claude-code` |
| Codex | `-a codex` | `npm install -g @openai/codex` |
| GitHub Copilot | `-a copilot` | `npm install -g @github/copilot` |
| Gemini CLI | `-a gemini` | `npm install -g @google/gemini-cli` |
| Cursor | `-a cursor` | cursor.com |
| OpenCode | `-a opencode` | `npm install -g opencode-ai` |
| Factory Droid | `-a droid` | `npm install -g droid` |
| Pi | `-a pi` | `npm install -g @mariozechner/pi-coding-agent` |
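
The selection rule described above (an explicit `-a` choice wins, otherwise the first detected agent is used) can be sketched as a small pure function. This is an illustration only: `pickAgent` and `DETECTION_ORDER` are hypothetical names, and the detection order shown simply follows the table, which is an assumption because the README does not say in what order Expect searches. The sketch also does not check that an explicitly requested agent is installed.

```typescript
type Provider =
  | "claude"
  | "codex"
  | "copilot"
  | "gemini"
  | "cursor"
  | "opencode"
  | "droid"
  | "pi";

// Assumed search order (table order); the real order is not documented here.
const DETECTION_ORDER: Provider[] = [
  "claude",
  "codex",
  "copilot",
  "gemini",
  "cursor",
  "opencode",
  "droid",
  "pi",
];

// Return the explicitly requested provider if given, otherwise the first
// provider in DETECTION_ORDER that is installed, or undefined if none is.
function pickAgent(
  installed: ReadonlySet<Provider>,
  requested?: Provider,
): Provider | undefined {
  if (requested !== undefined) return requested;
  return DETECTION_ORDER.find((provider) => installed.has(provider));
}

const installedAgents = new Set<Provider>(["gemini", "codex"]);
console.log(pickAgent(installedAgents)); // first match in the assumed order
console.log(pickAgent(installedAgents, "gemini")); // explicit choice
```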

Resources & Contributing Back

Want to try it out? Check out our demo.

Find a bug? Head over to our issue tracker and we'll do our best to help. We love pull requests, too!

We expect all contributors to abide by the terms of our Code of Conduct.

→ Start contributing on GitHub
