Updated 2026-01-11 - Runtime verification support for agent loops with assertion predicates and task completion tracking.
Think Jest for AI web agents. AgentRuntime provides Jest-style semantic assertions so agents can verify what changed on the page instead of guessing when they're "done".
Instead of trusting that an agent probably clicked the right thing, Sentience verifies outcomes using structured snapshots from a live, rendered browser (post-SPA hydration).
Sentience does not parse static HTML and does not rely on vision by default: it snapshots the rendered DOM and layout from a real browser after SPA hydration. This works reliably on JS-heavy applications where static HTML scraping fails.
v1 — Deterministic assertions
v2 — Reliability & recovery: `.eventually()` retries with bounded backoff
With structured snapshots, 3B–14B local models are viable. Larger models improve planning and recovery, not DOM parsing.
Vision models are used only after structural snapshot signals are exhausted, never by default. Assertions remain invariant; only the perception layer changes.
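The bounded-backoff retry behind `.eventually()` can be sketched as a standalone helper. This is illustrative only, not the SDK's implementation, and the parameter names are assumptions:

```python
import time

def eventually(check, retries=5, base_delay=0.2, max_delay=2.0):
    """Re-run `check` until it returns truthy, sleeping with bounded
    exponential backoff between attempts. Returns True on success,
    False once the retry budget is exhausted."""
    for attempt in range(retries):
        if check():
            return True
        # Delay doubles each attempt but is capped at max_delay
        time.sleep(min(base_delay * 2 ** attempt, max_delay))
    return False
```

The bound matters for agent loops: unbounded backoff can stall a run indefinitely, while a capped delay keeps total retry time predictable.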
The AgentRuntime class provides a thin runtime wrapper that combines a browser backend, a tracer, snapshots, and assertion predicates.
It's designed for agent verification loops where you need to repeatedly take snapshots, execute actions, and verify results.
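That loop can be sketched as follows. The `action` callables and the retry policy are illustrative assumptions; only the `begin_step`, `snapshot`, and `assert_` calls come from the API documented below:

```python
async def run_verification_loop(runtime, steps, max_attempts=2):
    """Sketch of a verification loop: for each step, act, snapshot,
    then assert, retrying the step if its required assertion fails.

    `steps` is a list of (goal, action, predicate) tuples, where
    `action` is an async callable performing the browser action and
    `predicate` is a Sentience assertion predicate.
    """
    for goal, action, predicate in steps:
        for attempt in range(max_attempts):
            runtime.begin_step(goal)
            await action()            # e.g. click, type, navigate
            await runtime.snapshot()  # refresh page state
            if runtime.assert_(predicate, label=goal, required=True):
                break                 # step verified; move on
    return runtime.is_task_done
```
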
New in 2026-01-11: AgentRuntime is now framework-agnostic and accepts any browser implementing the BrowserBackend protocol. This allows integration with browser-use, Playwright, or any CDP-based browser through a single backend parameter.
```python
from browser_use import BrowserSession, BrowserProfile

from sentience import exists, get_extension_dir, url_matches
from sentience.agent_runtime import AgentRuntime
from sentience.backends import BrowserUseAdapter
from sentience.tracing import Tracer, JsonlTraceSink

# 1. Setup browser-use with the Sentience extension
profile = BrowserProfile(args=[f"--load-extension={get_extension_dir()}"])
session = BrowserSession(browser_profile=profile)
await session.start()

# 2. Create a backend from the browser-use session
adapter = BrowserUseAdapter(session)
backend = await adapter.create_backend()

# 3. Navigate using browser-use
page = await session.get_current_page()
await page.goto("https://example.com")

# 4. Create AgentRuntime with the backend
sink = JsonlTraceSink("trace.jsonl")
tracer = Tracer(run_id="test-run", sink=sink)
runtime = AgentRuntime(backend=backend, tracer=tracer)

# 5. Use the runtime for a verification loop
await runtime.snapshot()
runtime.assert_(url_matches(r"example\.com"), label="on_homepage")
runtime.assert_(exists("role=button"), label="has_buttons")

# 6. Check task completion
if runtime.assert_done(exists("text~'Success'"), label="task_complete"):
    print("Task completed!")
```

```python
# Same setup as above, but with an API key for smart element ranking
runtime = AgentRuntime(
    backend=backend,
    tracer=tracer,
    sentience_api_key="sk_pro_xxxxx",  # Enables Gateway refinement
)

# Snapshots now use server-side ML ranking/filtering
await runtime.snapshot()  # Elements are refined by the Gateway
```

```python
from sentience.agent_runtime import AgentRuntime

# New: using the backend parameter (recommended)
runtime = AgentRuntime(
    backend=backend,            # Any BrowserBackend implementation
    tracer=tracer,              # Tracer for event emission
    sentience_api_key="sk_...", # Optional: Pro/Enterprise Gateway refinement
)
```

Parameters:
- `backend` - Any browser implementing the `BrowserBackend` protocol (browser-use, Playwright, CDP-based)
- `tracer` - Tracer for emitting verification events
- `sentience_api_key` (optional) - API key for Pro/Enterprise tier Gateway refinement

For existing `AsyncSentienceBrowser` users, use the factory method:
```python
from sentience import AsyncSentienceBrowser
from sentience.agent_runtime import AgentRuntime

async with AsyncSentienceBrowser() as browser:
    page = await browser.new_page()
    await page.goto("https://example.com")

    # Use the factory method for backward compatibility
    runtime = await AgentRuntime.from_sentience_browser(
        browser=browser,
        page=page,
        tracer=tracer,
    )
    await runtime.snapshot()
```

| Property | Type | Description |
|---|---|---|
| `step_id` / `stepId` | `string \| null` | Current step identifier |
| `step_index` / `stepIndex` | `number` | Current step index (0-based) |
| `last_snapshot` / `lastSnapshot` | `Snapshot \| null` | Most recent snapshot |
| `is_task_done` / `isTaskDone` | `boolean` | Whether the task is complete |
The BrowserBackend protocol defines the minimal interface required for browser integration. Any browser framework can work with AgentRuntime by implementing this protocol.
| Method | Description |
|---|---|
| `eval(expression)` | Execute JavaScript in the page context |
| `call(fn, args)` | Call a JavaScript function with arguments |
| `get_url()` | Get the current page URL |
| `screenshot_png()` | Capture a viewport screenshot |
| `mouse_click()` | Perform a mouse click action |
| `mouse_move()` | Move the mouse to coordinates |
| `wheel()` | Scroll using the mouse wheel |
| `type_text()` | Send keyboard input |
| `wait_ready_state()` | Wait for the document ready state |
| `refresh_page_info()` | Get viewport and scroll info |
| `get_layout_metrics()` | Get page layout metrics |
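Any framework can plug in by satisfying this protocol. The sketch below reconstructs the protocol shape from the method table above; the real definition lives in `sentience.backends` and its exact signatures may differ:

```python
from typing import Any, Protocol, runtime_checkable

@runtime_checkable
class BrowserBackendSketch(Protocol):
    """Minimal reconstruction of the BrowserBackend surface.
    Signatures are assumptions inferred from the method table."""
    async def eval(self, expression: str) -> Any: ...
    async def get_url(self) -> str: ...
    async def screenshot_png(self) -> bytes: ...
    async def wait_ready_state(self) -> None: ...
    # mouse_click, mouse_move, wheel, type_text,
    # refresh_page_info, get_layout_metrics omitted for brevity

class InMemoryBackend:
    """Toy implementation showing the shape; a real backend would
    drive an actual browser via CDP, Playwright, etc."""
    def __init__(self, url: str = "about:blank"):
        self._url = url
    async def eval(self, expression: str) -> Any:
        return None  # a real backend evaluates JS in the page
    async def get_url(self) -> str:
        return self._url
    async def screenshot_png(self) -> bytes:
        return b""
    async def wait_ready_state(self) -> None:
        return None
```
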
The SDK provides built-in backend implementations:
| Backend | Use Case |
|---|---|
| `BrowserUseAdapter` | browser-use integration via `CDPBackendV0` |
| `PlaywrightBackend` | Direct Playwright usage |
| `CDPBackendV0` | Low-level CDP-based browser control |
Takes a snapshot of the current page state. Updates lastSnapshot which is used as context for assertions.
```python
# Take a snapshot (required before element assertions)
snap = runtime.snapshot()
print(f"Found {len(snap.elements)} elements")
```

Returns: `Snapshot` - Current page state
Begins a new verification step. Generates a new step ID, clears previous assertions, and increments step index.
```python
# Begin a new step
step_id = runtime.begin_step("Navigate to checkout")
print(f"Step ID: {step_id}")

# Or with an explicit step index
step_id = runtime.begin_step("Verify cart", step_index=2)
```

Parameters:
- `goal` (string) - Description of what this step aims to achieve
- `step_index` / `stepIndex` (number, optional) - Explicit step index (auto-increments if omitted)

Returns: `string` - Generated step ID
Evaluates an assertion predicate against the current snapshot state. Results are accumulated for the step and emitted as verification events.
```python
# URL assertion
url_ok = runtime.assert_(url_contains("checkout"), "on_checkout_page")

# Element assertion
has_btn = runtime.assert_(exists("role=button text~'Pay'"), "has_pay_button")

# Required assertion (gates step success)
ready = runtime.assert_(
    all_of(url_contains("checkout"), exists("role=button")),
    "checkout_ready",
    required=True,
)
```

Parameters:
- `predicate` (Predicate) - Predicate function to evaluate
- `label` (string) - Human-readable label for this assertion
- `required` (boolean, optional) - If true, gates step success (default: false)

Returns: `boolean` - True if the assertion passed
Asserts task completion with a required assertion. When passed, marks the task as done.
```python
# Check if the task goal is achieved
if runtime.assert_done(exists("text~'Order Confirmed'"), "order_placed"):
    print("Order successfully placed!")
    # runtime.is_task_done is now True
```

Parameters:
- `predicate` (Predicate) - Predicate function to evaluate
- `label` (string) - Human-readable label for this assertion

Returns: `boolean` - True if the task is complete
```python
from sentience import url_matches, url_contains

# Regex match on the URL
runtime.assert_(url_matches(r"https://.*\.example\.com"), "is_https")

# Substring match on the URL
runtime.assert_(url_contains("checkout"), "on_checkout")
```

```python
from sentience import exists, not_exists, element_count

# Element exists (using query syntax)
runtime.assert_(exists("role=button text~'Submit'"), "has_submit")

# Element does not exist
runtime.assert_(not_exists("text~'Error'"), "no_errors")

# Element count check
runtime.assert_(element_count("role=listitem", min=5), "has_items")
```

```python
from sentience import all_of, any_of

# All conditions must pass
runtime.assert_(
    all_of(
        url_contains("checkout"),
        exists("role=button text~'Pay'"),
        not_exists("text~'Error'"),
    ),
    "checkout_ready",
)

# Any condition may pass
runtime.assert_(
    any_of(
        exists("text~'Success'"),
        exists("text~'Complete'"),
    ),
    "task_done",
)
```

```python
from sentience import custom
from sentience.verification import AssertContext, AssertOutcome

def my_predicate(ctx: AssertContext) -> AssertOutcome:
    # Custom logic using ctx.snapshot and ctx.url
    has_items = len(ctx.snapshot.elements) > 10
    return AssertOutcome(
        passed=has_items,
        reason="Found sufficient elements" if has_items else "Too few elements",
        details={"element_count": len(ctx.snapshot.elements)},
    )

runtime.assert_(custom(my_predicate), "custom_check")
```

```python
# Check if all assertions in the current step passed
if runtime.all_assertions_passed():
    print("All assertions passed!")

# Check if all required assertions passed
if runtime.required_assertions_passed():
    print("All required assertions passed!")

# Check if the task is done
if runtime.is_task_done:
    print("Task complete!")
```

Retrieve accumulated assertions for inclusion in trace step_end events:
```python
# Get assertions data for the step_end event
assertions_data = runtime.get_assertions_for_step_end()
print(f"Assertions: {assertions_data['assertions']}")
print(f"Task done: {assertions_data.get('task_done', False)}")
print(f"Task label: {assertions_data.get('task_done_label')}")

# Flush and clear assertions for the next step
assertions = runtime.flush_assertions()
```

For multi-task runs, reset the task completion state:
```python
# Reset task_done state for the next task
runtime.reset_task_done()
```

Assertions are automatically emitted as verification events to the tracer, making them visible in the Studio timeline.
```json
{
  "type": "verification",
  "data": {
    "kind": "assert",
    "label": "on_checkout_page",
    "passed": true,
    "required": false,
    "reason": "URL contains 'checkout'",
    "details": { "url": "https://example.com/checkout" }
  },
  "step_id": "abc-123"
}
```
When `assert_done()` passes:

```json
{
  "type": "verification",
  "data": {
    "kind": "task_done",
    "label": "order_placed",
    "passed": true
  },
  "step_id": "abc-123"
}
```
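Because `JsonlTraceSink` writes one JSON object per line, these events are easy to inspect offline. The helper below is a sketch that assumes only what the payloads above show: one JSON event per line with a top-level `type` field, a `data.passed` flag, and a `step_id`:

```python
import json
from collections import Counter

def verification_events(path):
    """Yield verification events from a JSONL trace file."""
    with open(path) as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            event = json.loads(line)
            if event.get("type") == "verification":
                yield event

def failed_assertions_by_step(path):
    """Count failed assertions per step_id, e.g. to spot flaky steps."""
    counts = Counter()
    for event in verification_events(path):
        if not event.get("data", {}).get("passed", False):
            counts[event.get("step_id")] += 1
    return counts
```
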
```python
from sentience import (
    AgentRuntime,
    SentienceBrowser,
    all_of,
    exists,
    not_exists,
    url_contains,
    url_matches,
)
from sentience.tracer_factory import create_tracer

def main():
    # Setup
    tracer = create_tracer(api_key="sk_...", run_id="verification-demo", upload_trace=False)
    browser = SentienceBrowser(api_key="sk_...", headless=False)
    browser.start()
    try:
        runtime = AgentRuntime(browser, browser.page, tracer)

        # Navigate
        browser.page.goto("https://example.com")
        browser.page.wait_for_load_state("networkidle")

        # Step 1: Verify the page loaded
        runtime.begin_step("Verify page loaded correctly")
        runtime.snapshot()

        # Run assertions
        runtime.assert_(url_contains("example.com"), "on_example_domain")
        runtime.assert_(url_matches(r"https://.*example\.com"), "url_is_https")
        runtime.assert_(exists("role=heading"), "has_heading")
        runtime.assert_(not_exists("text~'Error'"), "no_error_message")

        # Combined assertion
        runtime.assert_(
            all_of(url_contains("example"), exists("role=link")),
            "page_fully_ready",
        )

        # Check task completion
        if runtime.assert_done(exists("text~'Example Domain'"), "reached_example_page"):
            print("Task completed!")

        # Summary
        print(f"All passed: {runtime.all_assertions_passed()}")
        print(f"Task complete: {runtime.is_task_done}")
    finally:
        tracer.close(blocking=True)
        browser.close()

if __name__ == "__main__":
    main()
```

| Predicate | Description |
|---|---|
| `url_matches(pattern)` / `urlMatches(pattern)` | URL matches a regex pattern |
| `url_contains(substring)` / `urlContains(substring)` | URL contains a substring |
| `exists(query)` | An element matching the query exists in the snapshot |
| `not_exists(query)` / `notExists(query)` | No element matching the query exists |
| `element_count(query, min, max)` / `elementCount(query, opts)` | Element count within range |
| `all_of(...predicates)` / `allOf(...predicates)` | All predicates must pass |
| `any_of(...predicates)` / `anyOf(...predicates)` | Any predicate may pass |
| `custom(fn)` | Custom predicate function |
| Property | Type | Description |
|---|---|---|
| `passed` | `boolean` | Whether the assertion passed |
| `reason` | `string` | Human-readable explanation |
| `details` | `object` | Additional context data |
| Property | Type | Description |
|---|---|---|
| `snapshot` | `Snapshot \| null` | Current page snapshot |
| `url` | `string \| null` | Current page URL |
| `step_id` / `stepId` | `string \| null` | Current step identifier |
If you're interested in using Sentience assertions for enterprise QA workflows — pre-release validation, regression testing, and monitoring critical user flows — see AI-Driven QA with Sentience.