Deep-mode allows bounded local computation but must NOT use WebFetch or WebSearch. Finding the solution on AoPS is not solving the problem. Adds explicit NO WEB prompt block and orchestrator self-restraint note. Found by Ralph's test run (skill solved 5/6 then started fetching dgrozev.wordpress.com and artofproblemsolving.com for P6).
math-olympiad
Competition math solver with adversarial verification.
The problem
Self-verification gets fooled. A verifier that sees the reasoning is biased toward agreement. arXiv:2503.21934 ("Proof or Bluff") reported that an 85.7% self-verified IMO success rate drops to <5% under human grading.
The approach
- Context-isolated verification: verifier sees only the clean proof, never the reasoning trace
- Pattern-armed adversarial checks: not "is this correct?" but "does this accidentally prove RH?" / "extract the general lemma, find a 2×2 counterexample"
- Calibrated abstention: says "no confident solution" rather than bluff
- Presentation pass: produces clean LaTeX/PDF after verification passes
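
As a rough illustration of the first three bullets, here is a minimal Python sketch. It is not the skill's actual implementation: `Attempt`, `ADVERSARIAL_PROMPTS`, `find_flaws`, `solve_or_abstain`, and the `Verifier` callable are all hypothetical names, and the verifier is assumed to be any function that takes a clean proof plus one adversarial prompt and returns `True` when it finds a flaw.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

ABSTAIN = "no confident solution"

# Hypothetical adversarial prompts, paraphrasing the examples in the list above.
ADVERSARIAL_PROMPTS = (
    "Does this argument accidentally prove RH?",
    "Extract the general lemma and look for a 2x2 counterexample.",
)

# Assumed interface: (clean_proof, adversarial_prompt) -> True if a flaw was found.
Verifier = Callable[[str, str], bool]


@dataclass(frozen=True)
class Attempt:
    reasoning_trace: str  # scratch work; deliberately never passed to the verifier
    clean_proof: str      # the only text the verifier receives


def find_flaws(attempt: Attempt, verifier: Verifier) -> List[str]:
    """Run every adversarial prompt against the clean proof only."""
    return [p for p in ADVERSARIAL_PROMPTS if verifier(attempt.clean_proof, p)]


def solve_or_abstain(attempts: Sequence[Attempt], verifier: Verifier) -> str:
    """Return the first proof that survives all checks, else abstain."""
    for attempt in attempts:
        if not find_flaws(attempt, verifier):
            return attempt.clean_proof
    return ABSTAIN
```

The point of the design is that `find_flaws` has no access path to `reasoning_trace`, so the verifier cannot be talked into agreement by the solver's own narrative, and `solve_or_abstain` returns the abstention string rather than an unverified proof.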
Validation
17/18 IMO+Putnam 2025 problems solved, 0 false positives, 2 novel proofs found. See the skill's eval data in the anthropic monorepo.
Install
/plugin install math-olympiad@claude-plugins-official
Use
> Solve this IMO problem: [statement]
The skill auto-triggers on "IMO", "Putnam", "olympiad", "verify this proof", etc.