Step 1: Create a Project
A project is the top-level container for everything — builds, devices, scenarios, and runs.
- Name — A clear name for your game or app (e.g., “QuickHits Casino”, “Animals & Coins”)
- Prefix — A short 2–5 letter code (e.g.,
QHC,INNP). Shows up on run IDs to keep things organized. If you skip it, one is generated from the project name - Description — Optional notes about the project
Step 2: Upload a Build
A build is a specific version of your game that you want to test.
- Label — A human-readable version name (e.g., “v2.4.1-staging”, “Release 3.65”)
- Platform — Android, iOS, or Web
- File — The app binary (
.apkfor Android,.ipafor iOS, or a URL for web) - Build number — Optional numeric identifier from your CI system
| Build Status | Meaning |
|---|---|
| Processing | Build is being validated and prepared |
| Ready | Build is available for runs |
| Error | Something went wrong during processing — check the file format and re-upload |
Step 3: Set Up Devices
Devices are the real phones or emulators your tests will run on.
- Device name — e.g., “Samsung Galaxy S23”, “iPhone 15”
- Platform — Android, iOS, or Web (must match your build)
- Manufacturer / Model / OS — Helps you pick the right device for your testing needs
- Availability — Shows whether the device is currently available in the farm
Step 4: Configure the Agent
The agent template defines how the AI agent behaves during test runs.
- Name — A label for this agent configuration (e.g., “Default Agent”, “Casino Expert”)
- Guidelines — Behavioral rules for the agent:
- “Always dismiss cookie banners before testing”
- “Accept terms of service when prompted”
- “Decline or skip all login/sign-up requests”
- “Close pop-ups unless instructed to interact with them”
- Capabilities — The tools the agent can use during every scenario (tap, scroll, swipe, type text, etc.). These are set once at the agent template level — you can’t choose different tools per scenario or per run
Step 5: Write Your Knowledge
Knowledge is the game context document that tells the agent what your game is and how it works.
- Describe each screen the agent might encounter
- Explain key game mechanics and terminology
- Note any non-obvious UI patterns (hidden menus, long-press actions, swipe gestures)
Step 6: Create Skills
Skills are short behavioral instructions that tell the agent how to handle specific situations.
- Name + Description — e.g., “Cheat Menu Usage”, “Video Ad Handling”
- Body — Step-by-step instructions for a specific behavior
- Scope — Can be set as project defaults (always included) or selected per scenario
- “How to use the debug tool: tap Settings, scroll down, tap Debug, enter the code…”
- “When a video ad appears, wait for 15 seconds, then look for the X button to appear and close it”
Step 7: Create a Scenario with Checkpoints
A scenario is the actual test plan. Checkpoints are the milestones the agent must reach.
- Give it a clear name — e.g., “Complete First Island”, “Purchase Shop Item”
- Set preconditions — What state should the game be in when the test starts?
- Add checkpoints in order — Each checkpoint needs:
- Goal (required) — What the agent should achieve: “Close the daily bonus popup”
- Instructions (optional) — How to achieve it, if the path isn’t obvious
- Timeout — How long the agent should try (default: 5 minutes)
Step 8: Add Verifications (Optional)
Verifications are automated quality checks that run after the test completes.
- Query — “Is the HUD fully visible and not overlapping other UI elements?”
- Scope — Which part of the run to examine (specific checkpoint, time window, or the entire run)
- Severity — How serious a failure would be (blocker, critical, major, minor, info)
Step 9: Start the Run
You’re ready. Start a new run by selecting:- Scenario — The test plan you just created
- Build — Which version of your game (must be in Ready status)
- Device — Which device to run on (platform must match the build)
- Skills — Which skills to include (project defaults are pre-selected)
- Mode — Formal run or Playground
- Formal run — Runs to completion, device released automatically. Best for structured testing
- Playground — Agent pauses when done, device stays alive for you to explore. Best for debugging and iteration
Step 10: Monitor and Review
Once the run starts, you can watch it in real time:- Live device screen — See exactly what the agent sees, streamed to your browser
- Interactive control — The device is fully interactive. Click, drag, and swipe on the screen to help the agent if it gets stuck — it will continue on its own from wherever you leave it
- Checkpoint progress — A live checklist showing which milestones have been reached
- Agent reasoning — The agent’s thought process in real time
- Pass/Fail status — Did all required checkpoints pass?
- Video recording — Full replay of the device screen
- Checkpoint details — Screenshot and timestamp for each checkpoint reached or failed
- Verification results — If you added verifications, see which assertions passed or failed with supporting evidence
- Agent logs — The full reasoning trace and action history
Quick Checklist
| Step | What | Where | Required? |
|---|---|---|---|
| 1 | Create a project | Projects | Yes |
| 2 | Upload a build | Builds | Yes |
| 3 | Set up devices | Devices | Yes |
| 4 | Configure the agent | Admin → Agent Templates | Yes (defaults usually work) |
| 5 | Write knowledge | Knowledge | Recommended |
| 6 | Create skills | Skills | Optional |
| 7 | Create a scenario | Scenarios | Yes |
| 8 | Add verifications | Scenario → Verifications | Optional |
| 9 | Start the run | Scenario → New Run | Yes |
| 10 | Monitor and review | Run screen | — |