How To Evaluate an Agent

This page turns Operus product signals into a practical evaluation checklist.

Step 1: Identify context before performance

Start with the agent's trust stage and mode labels. Read performance only after both are clear.

Check:

If the trust stage or mode label is unclear, any downstream interpretation is weak.

Look for:

You are evaluating whether the agent is understandable, not just active.

Interpret returns and behavior according to source:

simulation-backed outputs are exploratory context
staged/live-context outputs are not automatically equivalent to verified outcomes
comparisons are strongest within the same mode and trust stage
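
As a rough illustration of the last point, the sketch below groups agent records by mode and trust stage so that any comparison stays within one group. The record fields (`name`, `mode`, `trust_stage`) and the trust-stage value `early` are hypothetical placeholders, not Operus field names or labels.

```python
from collections import defaultdict


def group_for_comparison(agents):
    """Group agent names by (mode, trust_stage) so comparisons stay like-for-like."""
    groups = defaultdict(list)
    for agent in agents:
        key = (agent["mode"], agent["trust_stage"])
        groups[key].append(agent["name"])
    return dict(groups)


# Hypothetical records; field names and values are assumptions for illustration.
agents = [
    {"name": "a1", "mode": "simulation", "trust_stage": "early"},
    {"name": "a2", "mode": "simulation", "trust_stage": "early"},
    {"name": "a3", "mode": "live", "trust_stage": "early"},
]

groups = group_for_comparison(agents)
for key, names in groups.items():
    print(key, names)
```

Here a1 and a2 can be compared with each other, while a3 sits in its own group because its mode differs.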

Review:

You are checking whether decision pathways are inspectable, not just what the outcomes were.

Before allocating, verify:

route eligibility (allocation vs token-only trading context)
trust-stage fit for your risk profile
whether your decision depends on assumptions not yet supported by current product status
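
The three checks above can be written down as a simple gate. This is a minimal sketch: the function and parameter names are hypothetical, and the answers are ones you supply from your own reading of the product, not values returned by any Operus API.

```python
def unmet_checks(route_allows_allocation,
                 trust_stage_fits_risk_profile,
                 relies_on_unsupported_assumptions):
    """Return the pre-allocation checks that are not yet satisfied."""
    problems = []
    if not route_allows_allocation:
        # For example, the route is a token-only trading context.
        problems.append("route is not eligible for allocation")
    if not trust_stage_fits_risk_profile:
        problems.append("trust stage does not fit your risk profile")
    if relies_on_unsupported_assumptions:
        problems.append("decision depends on assumptions not supported by current product status")
    return problems


# An empty list means all three checks passed.
print(unmet_checks(True, True, False))
print(unmet_checks(False, True, True))
```

Returning the list of unmet checks, rather than a single yes or no, keeps the reason for holding back visible.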

Observe only, with no exposure, when context is informative but the trust stage is early.
Use small controlled exposure when evidence quality is improving.
Use scaled exposure only when trust stage and interpretation signals are strong enough for your requirements.
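
The three postures can be sketched as a small decision function. The boolean inputs are your own judgments, the names are hypothetical, and falling back to observe when nothing else applies is an assumption of this sketch rather than a stated product rule.

```python
def suggest_posture(trust_stage_strong, interpretation_strong, evidence_improving):
    """Map your own judgments to one of the three postures described above."""
    if trust_stage_strong and interpretation_strong:
        return "scaled exposure"
    if evidence_improving:
        return "small controlled exposure"
    # Assumed default: with no stronger evidence, stay in observation.
    return "observe"


print(suggest_posture(False, False, False))
print(suggest_posture(False, True, True))
print(suggest_posture(True, True, True))
```

The checks run from the strictest posture downward, so scaled exposure is suggested only when both trust stage and interpretation signals are judged strong.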