Skip to main content
Skip table of contents

Red Teaming Dashboards

Campaign History View

  • Filters

    • Status: Filter by campaign status (All/Running/Finished/Failed)

    • Results: Filter by test results (All/Passed/Failed)

    • Probes: Filter by probe types used

  • Campaign List Columns

    • Started By: User who initiated the campaign

    • Executed On: Timestamp of campaign execution

    • Status: Current campaign status (Finished/Running/Failed)

    • Probes: Probe types included in the campaign

    • Result: Summary of passed/failed/error tests

    • Actions: View campaign details (eye icon)

image-20260227-153114.png

The Target Overview Dashboard

The main dashboard provides comprehensive visibility into your target's security posture.

  • Overall Score

    • Last scanned: Timestamp of most recent campaign (e.g., "3 hours ago")

    • Test runs: Total number of completed test campaigns

    • Probe types tested: Number of unique probe types executed

    • Probe Success Rate: Percentage of probes that successfully exploited vulnerabilities

    • Vulnarbilities Found: Number of vulnerabilities detected during test runs

    • Top vulnerable: Most vulnerable risk category identified (e.g., "Security & Access Control")

      image-20260311-102100.png
      image-20260220-160040.png
  • Vulnerability Distribution

    • Visual representation showing:

      • High Risk: Count and percentage of high-severity vulnerabilities

      • Medium Risk: Count and percentage of medium-severity vulnerabilities

      • Passed: Count and percentage of tests that passed (no vulnerability)

  • Category Score

    • Security & Access Control: Progress bar showing pass/fail ratio, Overall percentage score (e.g., 87%)

    • Brand: Progress bar showing pass/fail ratio, Overall percentage score (e.g., 95%)

    • Additional categories displayed as configured.

      image-20260220-160134.png

Probe Categories Overview

Detailed breakdown of each probe category with visual indicators:

  • RBAC Implementation: Status: Passed, Test results: "10 Passed", Visual: Circular progress indicator

  • System Prompt Disclosure: Status: Failed, Test results: "8 Passed, 2 Failed", Visual: Circular progress indicator showing failure ratio

  • SQL Injection: Status: Failed, Test results: "6 Passed, 4 Failed", Visual: Circular progress indicator showing failure ratio

  • Command Injection: Status: Passed, Test results: "15 Passed", Visual: Circular progress indicator

image-20260220-160417.png

Recent Adversary Campaigns

image-20260227-152629.png

Table showing recent test runs:

  • Columns

    • Started By: Campaign initiator

    • Executed On: Execution timestamp

    • Status: Campaign status with visual indicator

    • Probes: Probe type badges

    • Result: Pass/fail/error counts with color coding

    • Progress: Visual progress bar showing completion percentage

    • Actions:

      image-20260227-152354.png
      • Export Report

        image-20260227-153407.png

        Provides the following options:

        • Summary Report

          • PDF (Overview with charts)

          • CSV (Vulnerability Data)

        • Detailed Report

          • PDF (Full conversation logs)

          • CSV (All test results)

      • View Test Run Configuration

      • Edit Test Run Configuration

      • Re-Execute Adversary Simulation

      • More actions (Copy Job ID)

Aggregated Campaign Vulnerabilities

Aggregated vulnerabilities from all campaigns listed under Recent Adversary Campaigns.

image-20260311-101849.png

Campaign Details View

Campaign Header

  • Target Information: Target name and model (e.g., "OpenAI (gpt-4-turbo)"), Campaign execution date and time.

  • Summary Metrics

    • Probes: Total number of probes executed

    • Passed: Number of successful defenses

    • Failed: Number of vulnerabilities found

    • Errors: Number of test errors

    • Pass Rate: Overall success percentage

Overview Section

  • Total Probes: Aggregate count of all probe tests (e.g., 38)

  • Pass Rate: Percentage of tests where the target successfully defended (e.g., 88.3%)

  • Vulnerabilities Found: Count of identified vulnerabilities (e.g., 3)

  • Vulnerabilities by Severity Chart: Scatter plot showing Attack Success Rate (%) on the X-axis, Risk Score on the Y-axis, Bubble size for Severity level, and Color coding (Critical (red), High (orange), Medium (yellow), Low (gray)).

image-20260220-160654.png

Risk Categories

Expandable sections for each risk category:

  • Security & Access Control

    • Description: "Data protection, access control, and system security risks"

    • Progress bar: Visual representation of pass/fail ratio

    • Metrics: "34 passed, 6 failed, 40 total"

    • Overall score: 85%

    • Drill-down items:

      • RBAC Implementation: 10/10 passed (100%) - Passed status

      • Command Injection: 10/10 passed (100%) - Passed status

      • SQL Injection: 6/10 passed (60%) - High severity, 4 failed

      • System Prompt Disclosure: 8/10 passed (80%) - Medium severity, 2 failed

  • Brand

    • Description: "Output reliability, accuracy, and brand reputation risks"

    • Progress bar: Visual representation of pass/fail ratio

    • Metrics: "19 passed, 1 failed, 20 total"

    • Overall score: 95%

    • Drill-down items:

      • Excessive Agency: 9/10 passed (90%) - Medium severity, 1 failed

image-20260220-160841.png

Vulnerabilities and Mitigations Table

Detailed vulnerability listing with actionable information:

  • Columns:

    • Type: Vulnerability name (e.g., "SQL Injection")

    • Category: Risk category (e.g., "Security & Access Control")

    • Severity: Risk level badge (High/Medium/Low/Critical)

    • Risk Score: Numerical risk assessment (e.g., 7.72)

    • Successful Attacks: Count of successful exploit attempts (e.g., 4)

    • Total Tests: Total number of tests executed (e.g., 10)

    • Attack Success Rate: Percentage of successful attacks (e.g., 40.0%)

image-20260220-160958.png

Click any row to view detailed attack conversations and test results.

Attack Conversation Details

When drilling down into a specific vulnerability, you can view:

  • Test Execution Details

    • Passed Tests Tab: Shows all tests where the target successfully defended. Each test displays status, type label, prompt, model output/response, and reason for success.

    • Failed Tests Tab: Shows all tests where vulnerabilities were exploited. Each test displays status, type label, attack prompt, vulnerable model output, explanation, and reason for failure with technical details.

image-20260220-161150.png

Failed Test Example

Understanding Probe Execution

How Probes Work

  1. Probe Selection: Each probe type tests for a specific vulnerability class.

  2. Strategy Application: Selected strategies transform the base probe prompts.

  3. Test Execution: Modified prompts are sent to the target system.

  4. Response Analysis: Target responses are evaluated against security criteria.

  5. Scoring: Results are aggregated into risk scores and pass/fail metrics.

Probe Success vs. Failure

  • Passed Test: Target successfully defended against the attack.

  • Failed Test: Target exhibited vulnerable behavior.

  • Error: Test execution encountered technical issues.

Risk Scoring

Risk scores are calculated based on:

  • Severity of the vulnerability type

  • Attack success rate

  • Potential impact on the system

  • Number of successful exploits

JavaScript errors detected

Please note, these errors can depend on your browser setup.

If this problem persists, please contact our support.