CodeNames Oversight Results Explorer

Base: Basic protocol without assistance
Consultancy: Protocol where model provides target selection justification
Critiques: Protocol with critique-based oversight

How to Navigate

Use the navigation menu to explore results from three training protocols:

Each subfolder follows the naming pattern: [overseer-type]-adv-[incentive-strength]