Red Teaming Dashboards
Campaign History View
Filters
Status: Filter by campaign status (All/Running/Finished/Failed)
Results: Filter by test results (All/Passed/Failed)
Probes: Filter by probe types used
Campaign List Columns
Started By: User who initiated the campaign
Executed On: Timestamp of campaign execution
Status: Current campaign status (Finished/Running/Failed)
Probes: Probe types included in the campaign
Result: Summary of passed/failed/error tests
Actions: View campaign details (eye icon)

The Target Overview Dashboard
The main dashboard provides comprehensive visibility into your target's security posture.
Overall Score
Last scanned: Timestamp of most recent campaign (e.g., "3 hours ago")
Test runs: Total number of completed test campaigns
Probe types tested: Number of unique probe types executed
Probe Success Rate: Percentage of probes that successfully exploited vulnerabilities
Vulnarbilities Found: Number of vulnerabilities detected during test runs
Top vulnerable: Most vulnerable risk category identified (e.g., "Security & Access Control")


Vulnerability Distribution
Visual representation showing:
High Risk: Count and percentage of high-severity vulnerabilities
Medium Risk: Count and percentage of medium-severity vulnerabilities
Passed: Count and percentage of tests that passed (no vulnerability)
Category Score
Security & Access Control: Progress bar showing pass/fail ratio, Overall percentage score (e.g., 87%)
Brand: Progress bar showing pass/fail ratio, Overall percentage score (e.g., 95%)
Additional categories displayed as configured.

Probe Categories Overview
Detailed breakdown of each probe category with visual indicators:
RBAC Implementation: Status: Passed, Test results: "10 Passed", Visual: Circular progress indicator
System Prompt Disclosure: Status: Failed, Test results: "8 Passed, 2 Failed", Visual: Circular progress indicator showing failure ratio
SQL Injection: Status: Failed, Test results: "6 Passed, 4 Failed", Visual: Circular progress indicator showing failure ratio
Command Injection: Status: Passed, Test results: "15 Passed", Visual: Circular progress indicator

Recent Adversary Campaigns

Table showing recent test runs:
Columns
Started By: Campaign initiator
Executed On: Execution timestamp
Status: Campaign status with visual indicator
Probes: Probe type badges
Result: Pass/fail/error counts with color coding
Progress: Visual progress bar showing completion percentage
Actions:

Export Report

Provides the following options:
Summary Report
PDF (Overview with charts)
CSV (Vulnerability Data)
Detailed Report
PDF (Full conversation logs)
CSV (All test results)
View Test Run Configuration
Edit Test Run Configuration
Re-Execute Adversary Simulation
More actions (Copy Job ID)
Aggregated Campaign Vulnerabilities
Aggregated vulnerabilities from all campaigns listed under Recent Adversary Campaigns.

Campaign Details View
Campaign Header
Target Information: Target name and model (e.g., "OpenAI (gpt-4-turbo)"), Campaign execution date and time.
Summary Metrics
Probes: Total number of probes executed
Passed: Number of successful defenses
Failed: Number of vulnerabilities found
Errors: Number of test errors
Pass Rate: Overall success percentage
Overview Section
Total Probes: Aggregate count of all probe tests (e.g., 38)
Pass Rate: Percentage of tests where the target successfully defended (e.g., 88.3%)
Vulnerabilities Found: Count of identified vulnerabilities (e.g., 3)
Vulnerabilities by Severity Chart: Scatter plot showing Attack Success Rate (%) on the X-axis, Risk Score on the Y-axis, Bubble size for Severity level, and Color coding (Critical (red), High (orange), Medium (yellow), Low (gray)).

Risk Categories
Expandable sections for each risk category:
Security & Access Control
Description: "Data protection, access control, and system security risks"
Progress bar: Visual representation of pass/fail ratio
Metrics: "34 passed, 6 failed, 40 total"
Overall score: 85%
Drill-down items:
RBAC Implementation: 10/10 passed (100%) - Passed status
Command Injection: 10/10 passed (100%) - Passed status
SQL Injection: 6/10 passed (60%) - High severity, 4 failed
System Prompt Disclosure: 8/10 passed (80%) - Medium severity, 2 failed
Brand
Description: "Output reliability, accuracy, and brand reputation risks"
Progress bar: Visual representation of pass/fail ratio
Metrics: "19 passed, 1 failed, 20 total"
Overall score: 95%
Drill-down items:
Excessive Agency: 9/10 passed (90%) - Medium severity, 1 failed

Vulnerabilities and Mitigations Table
Detailed vulnerability listing with actionable information:
Columns:
Type: Vulnerability name (e.g., "SQL Injection")
Category: Risk category (e.g., "Security & Access Control")
Severity: Risk level badge (High/Medium/Low/Critical)
Risk Score: Numerical risk assessment (e.g., 7.72)
Successful Attacks: Count of successful exploit attempts (e.g., 4)
Total Tests: Total number of tests executed (e.g., 10)
Attack Success Rate: Percentage of successful attacks (e.g., 40.0%)

Click any row to view detailed attack conversations and test results.
Attack Conversation Details
When drilling down into a specific vulnerability, you can view:
Test Execution Details
Passed Tests Tab: Shows all tests where the target successfully defended. Each test displays status, type label, prompt, model output/response, and reason for success.
Failed Tests Tab: Shows all tests where vulnerabilities were exploited. Each test displays status, type label, attack prompt, vulnerable model output, explanation, and reason for failure with technical details.

Failed Test Example
Understanding Probe Execution
How Probes Work
Probe Selection: Each probe type tests for a specific vulnerability class.
Strategy Application: Selected strategies transform the base probe prompts.
Test Execution: Modified prompts are sent to the target system.
Response Analysis: Target responses are evaluated against security criteria.
Scoring: Results are aggregated into risk scores and pass/fail metrics.
Probe Success vs. Failure
Passed Test: Target successfully defended against the attack.
Failed Test: Target exhibited vulnerable behavior.
Error: Test execution encountered technical issues.
Risk Scoring
Risk scores are calculated based on:
Severity of the vulnerability type
Attack success rate
Potential impact on the system
Number of successful exploits