Running a Test End-to-End

This guide walks you through every step of running a test on Duzz — from initial project setup to reviewing results. Follow these steps in order and you’ll have your first test running in minutes.

Step 1: Create a Project

A project is the top-level container for everything — builds, devices, scenarios, and runs.

Every test starts with a project. A project typically maps to one game or app.

Name — A clear name for your game or app (e.g., “QuickHits Casino”, “Animals & Coins”)
Prefix — A short 2–5 letter code (e.g., QHC, INNP). Shows up on run IDs to keep things organized. If you skip it, one is generated from the project name
Description — Optional notes about the project

Think of the project as the home base for everything related to one game. All builds, scenarios, devices, and results live under it.

Step 2: Upload a Build

A build is a specific version of your game that you want to test.

Before you can run any test, you need at least one build uploaded.

Label — A human-readable version name (e.g., “v2.4.1-staging”, “Release 3.65”)
Platform — Android, iOS, or Web
File — The app binary (.apk for Android, .ipa for iOS, or a URL for web)
Build number — Optional numeric identifier from your CI system

After uploading, the build goes through a short processing step. Once the status shows Ready, it can be used in runs.

Build Status	Meaning
Processing	Build is being validated and prepared
Ready	Build is available for runs
Error	Something went wrong during processing — check the file format and re-upload

Make sure the build platform matches the device you plan to run on. An Android build won’t work on an iOS device.

Step 3: Set Up Devices

Devices are the real phones or emulators your tests will run on.

Duzz uses a cloud device farm to provide remote devices. You allocate devices to your project from the available inventory.

Device name — e.g., “Samsung Galaxy S23”, “iPhone 15”
Platform — Android, iOS, or Web (must match your build)
Manufacturer / Model / OS — Helps you pick the right device for your testing needs
Availability — Shows whether the device is currently available in the farm

You don’t need to configure the device itself — Duzz handles installing the build, launching the app, and setting up the video stream automatically.

Allocate at least one device per platform you want to test on. You can always add more later for parallel testing.

Step 4: Configure the Agent

The agent template defines how the AI agent behaves during test runs.

Every project has at least one agent template. This is where you shape the agent’s personality and knowledge.

Name — A label for this agent configuration (e.g., “Default Agent”, “Casino Expert”)
Guidelines — Behavioral rules for the agent:
- “Always dismiss cookie banners before testing”
- “Accept terms of service when prompted”
- “Decline or skip all login/sign-up requests”
- “Close pop-ups unless instructed to interact with them”
Capabilities — The tools the agent can use during every scenario (tap, scroll, swipe, type text, etc.). These are set once at the agent template level — you can’t choose different tools per scenario or per run

Most users start with the default agent template and only customize guidelines once they see how the agent behaves with their specific game.

Guidelines are your most powerful lever. If the agent keeps doing something you don’t want (like tapping on ads), add a guideline to correct it.

Step 5: Write Your Knowledge

Knowledge is the game context document that tells the agent what your game is and how it works.

This is a markdown document that describes your game at a high level — mechanics, UI layout, terminology, expected behaviors. It’s always included in every run.

Describe each screen the agent might encounter
Explain key game mechanics and terminology
Note any non-obvious UI patterns (hidden menus, long-press actions, swipe gestures)

Knowledge is versioned and tied to builds, so you can track how your game documentation evolves alongside your app.

Good knowledge dramatically improves agent performance. The more the agent understands about your game, the better it navigates and tests.

Step 6: Create Skills

Skills are short behavioral instructions that tell the agent how to handle specific situations.

While knowledge describes the game world, skills describe specific actions or techniques.

Name + Description — e.g., “Cheat Menu Usage”, “Video Ad Handling”
Body — Step-by-step instructions for a specific behavior
Scope — Can be set as project defaults (always included) or selected per scenario

Examples:

“How to use the debug tool: tap Settings, scroll down, tap Debug, enter the code…”
“When a video ad appears, wait for 15 seconds, then look for the X button to appear and close it”

Skills are optional but powerful. If the agent keeps struggling with a specific game mechanic, write a skill for it.

Step 7: Create a Scenario with Checkpoints

A scenario is the actual test plan. Checkpoints are the milestones the agent must reach.

This is the core of your test definition. For each scenario:

Give it a clear name — e.g., “Complete First Island”, “Purchase Shop Item”
Set preconditions — What state should the game be in when the test starts?
Add checkpoints in order — Each checkpoint needs:
- Goal (required) — What the agent should achieve: “Close the daily bonus popup”
- Instructions (optional) — How to achieve it, if the path isn’t obvious
- Timeout — How long the agent should try (default: 5 minutes)

Start simple. Your first scenario should have 2–3 checkpoints testing one straightforward flow. You can always add complexity later.

Step 8: Add Verifications (Optional)

Verifications are automated quality checks that run after the test completes.

While checkpoints track whether the agent reached each goal, verifications check whether the game behaved correctly along the way. For each verification, add assertions — natural-language questions evaluated against the run’s recorded artifacts:

Query — “Is the HUD fully visible and not overlapping other UI elements?”
Scope — Which part of the run to examine (specific checkpoint, time window, or the entire run)
Severity — How serious a failure would be (blocker, critical, major, minor, info)

Verifications are optional for your first run. Start with checkpoints only, then add verifications once you’re comfortable with the basics.

Step 9: Start the Run

You’re ready. Start a new run by selecting:

Scenario — The test plan you just created
Build — Which version of your game (must be in Ready status)
Device — Which device to run on (platform must match the build)
Skills — Which skills to include (project defaults are pre-selected)
Mode — Formal run or Playground
- Formal run — Runs to completion, device released automatically. Best for structured testing
- Playground — Agent pauses when done, device stays alive for you to explore. Best for debugging and iteration

Hit Start and the run begins.

Step 10: Monitor and Review

Once the run starts, you can watch it in real time:

Live device screen — See exactly what the agent sees, streamed to your browser
Interactive control — The device is fully interactive. Click, drag, and swipe on the screen to help the agent if it gets stuck — it will continue on its own from wherever you leave it
Checkpoint progress — A live checklist showing which milestones have been reached
Agent reasoning — The agent’s thought process in real time

When the run completes, review the results:

Pass/Fail status — Did all required checkpoints pass?
Video recording — Full replay of the device screen
Checkpoint details — Screenshot and timestamp for each checkpoint reached or failed
Verification results — If you added verifications, see which assertions passed or failed with supporting evidence
Agent logs — The full reasoning trace and action history

Quick Checklist

Step	What	Where	Required?
1	Create a project	Projects	Yes
2	Upload a build	Builds	Yes
3	Set up devices	Devices	Yes
4	Configure the agent	Admin → Agent Templates	Yes (defaults usually work)
5	Write knowledge	Knowledge	Recommended
6	Create skills	Skills	Optional
7	Create a scenario	Scenarios	Yes
8	Add verifications	Scenario → Verifications	Optional
9	Start the run	Scenario → New Run	Yes
10	Monitor and review	Run screen	—

Getting Started

Core Concepts

Running a Test End-to-End

Step 1: Create a Project

Step 2: Upload a Build

Step 3: Set Up Devices

Step 4: Configure the Agent

Step 5: Write Your Knowledge

Step 6: Create Skills

Step 7: Create a Scenario with Checkpoints

Step 8: Add Verifications (Optional)

Step 9: Start the Run

Step 10: Monitor and Review

Quick Checklist

Getting Started

Core Concepts

​Step 1: Create a Project

​Step 2: Upload a Build

​Step 3: Set Up Devices

​Step 4: Configure the Agent

​Step 5: Write Your Knowledge

​Step 6: Create Skills

​Step 7: Create a Scenario with Checkpoints

​Step 8: Add Verifications (Optional)

​Step 9: Start the Run

​Step 10: Monitor and Review

​Quick Checklist

Step 1: Create a Project

Step 2: Upload a Build

Step 3: Set Up Devices

Step 4: Configure the Agent

Step 5: Write Your Knowledge

Step 6: Create Skills

Step 7: Create a Scenario with Checkpoints

Step 8: Add Verifications (Optional)

Step 9: Start the Run

Step 10: Monitor and Review

Quick Checklist