Question 1

What is the evidence and what does it contain?

Accepted Answer

The evidence is a signed, auditable record of your agent's behavioral testing. It contains every test case that was run, the behavioral assertion for each test, LLM and expert evaluation scores, per-assertion citations tracing back to the transcript, a compliance framework mapping, and a domain expert attestation statement. It is cryptographically signed with minisign and delivered as both a signed file and a PDF.

Question 2

Who is the evidence for? Can I share it with auditors?

Accepted Answer

Yes. The evidence is designed to be shared with auditors, regulators, procurement teams, legal counsel, and your board. It is a self-contained artifact — no account access or special software required to read it. The signed manifest includes SHA-256 hashes of every artifact, so the recipient can verify authenticity independently.

Question 3

Which compliance frameworks does AgentCarousel map to?

Accepted Answer

The framework mapping depends on your agent's domain and use case. The clinical scribe (healthcare) evidence maps to FDA Software as a Medical Device (SaMD) guidance, HIPAA minimum-necessary controls, and Joint Commission standard LD.04.01.01. Financial services agents typically map to SOC 2, SR 11-7, and relevant FFIEC guidance. We scope the compliance mapping during the certification process.

Question 4

What does "domain expert attestation" mean?

Accepted Answer

A credentialed domain expert reviews the test cases and results for your agent's specific field — for example, a clinical professional for healthcare agents — and provides a written attestation statement confirming the test cases cover the material behavioral risks for that use case. This attestation is included in the signed evidence. You can provide your own domain expert or request one of ours in the scoping process.

Question 5

How is this different from running our own internal QA?

Accepted Answer

Internal QA is valuable, but when an auditor or regulator asks about your agent's behavior, they want third-party evidence — not self-reported results. AgentCarousel provides an independent, signed record from outside your organization. The test cases are designed by domain experts, not by the team that built the agent, which eliminates the conflict-of-interest concern most compliance reviewers will raise.

Question 6

What if we switch AI models or our provider pushes a model update?

Accepted Answer

Any change to the underlying model is a reason to re-certify against the same fixture suite. AgentCarousel can run the same behavioral test cases across your current model and a proposed replacement in parallel — producing a signed comparison record that documents why you selected one model over another. This model selection record becomes part of your compliance evidence. Re-certification takes the same 3–5 business days as the initial run.

Question 7

What is OSCAL and why does the evidence include it?

Accepted Answer

OSCAL (Open Security Controls Assessment Language) is the NIST-maintained machine-readable format that GRC platforms and auditors use to exchange control assessment data. As of version 0.8.0, every AgentCarousel evidence bundle includes an OSCAL Assessment Results document: each behavioral test case is mapped to specific control IDs in catalogs such as NIST AI RMF, the EU AI Act, ISO 42001, HIPAA, and FDA SaMD, with per-control satisfaction status. A control is only reported as satisfied with three or more covering cases and an effectiveness score of 0.80 or higher — partial evidence and gaps are reported as exactly that. Your compliance tooling can ingest the results directly instead of someone re-keying a PDF.

Question 8

What do I need to provide to get started?

Accepted Answer

Your agent's system prompt and a brief description of the workflow it handles — for example, "This agent transcribes doctor-patient encounters into SOAP notes." No code access, no infrastructure changes, and no engineering involvement required. Start with a scope review and we help you identify which compliance frameworks and test cases apply before committing to the full certification.

Question 9

What is the turnaround time?

Accepted Answer

3–5 business days for the full certification from the time we confirm your submission. For complex multi-domain agents, we will communicate any extension in advance.

Independent Behavioral Testing for AI Agents. Auditor-Ready.

Compliance for AI Agents

Show Documentation

Prove with Evidence

Due Diligence

How it works

Submit Your Agent Prompt

We Run Behavioral Certification

You Receive the Signed Evidence

Inside an Evidence Bundle: The Clinical Scribe

Straightforward Pricing. No Subscription.

Full Certification Bundle

Scope & Feasibility Review

Ready to Answer the Auditor's Question?

Questions about agent trust

Get Your Evidence