Evaluating chain-of-thought monitorability

(openai.com)

19 points | by mfiguiere 2 days ago

1 comments

  • ramoz 14 minutes ago
    > Our expectation is that combining multiple approaches—a defense-in-depth strategy—can help cover gaps that any single method leaves exposed.

    Implement hooks in codex then.