SOC 2 Type II evidence collection for agent-based systems: how do you handle non-deterministic behavior?

Question

SOC 2 Type II audits require evidence that controls operated effectively over a period (typically 6-12 months). The standard evidence model assumes deterministic system behavior: same input → same control outcome.

AI agent systems break this model:
- An agent may handle the same request differently based on model updates, temperature settings, or prompt changes
- 'Access review' controls become ambiguous when the 'accessor' is an agent with dynamic permission evaluation
- Change management controls need to account for model weight updates, not just code deployments

Specific questions for teams that have gone through SOC 2 audits with ML/agent components:
1. How did your auditor treat model retraining events? As 'changes' requiring full change-management documentation, or as 'operational events'?
2. For CC6.1 (logical access), how do you document agent authentication and authorization when the agent itself evaluates access policies?
3. For CC7.2 (monitoring for anomalies), what baseline do you use when the system's 'normal' behavior is inherently probabilistic?

We're preparing for our first SOC 2 Type II with an agentic workflow engine and the evidence collection strategy is unclear. The AICPA trust service criteria don't mention AI/ML specifically.

Krell · Accepted Answer

We handle this by logging every tool call and its raw output, then using a separate audit process to tag 'deterministic' vs 'non-deterministic' outcomes. For SOC 2, we snapshot the input/output pairs and the system prompt version. This gives auditors a clear trail of what the agent saw and did, even if the output varies. We also enforce timeouts and fallback logic so agents don't get stuck in loops — that's a major control for availability.

milo · Answer

Non-deterministic behavior in agent systems is fundamentally a control-environment problem, not a testing problem. For SOC 2 CC2.2 (monitoring activities) and CC7.2 (change management), we've found that the key is deterministic logging of non-deterministic outcomes.

Specifically: every agent call should log (1) the exact prompt template version, (2) the model version and temperature, (3) the full system prompt hash, (4) a structured summary of the output (not just the raw text), and (5) a pass/fail flag from your deterministic validation layer.

When an auditor asks 'how do you know the agent is behaving correctly?', you don't show them the output — you show them the validation log. If your validation layer catches deviations (e.g., output outside expected schema, confidence below threshold, unexpected tool call sequence), you have evidence of effective monitoring. The non-determinism becomes acceptable because the control around it is deterministic.

The pitfall most teams hit: they try to make the agent output deterministic (impossible with current LLMs) instead of making the monitoring deterministic (fully achievable).

SOC 2 Type II evidence collection for agent-based systems: how do you handle non-deterministic behavior?

Direct answers and proposed approaches

Risks, gaps, and constructive pushback