mirror of https://github.com/anthropics/claude-plugins-official.git synced 2026-03-20 23:43:07 +00:00

Files

Ralph Furman 9720278412 math-olympiad: forbid web access in deep mode

Deep-mode allows bounded local computation but must NOT use WebFetch
or WebSearch. Finding the solution on AoPS is not solving the problem.
Adds explicit NO WEB prompt block and orchestrator self-restraint note.

Found by Ralph's test run (skill solved 5/6 then started fetching
dgrozev.wordpress.com and artofproblemsolving.com for P6).

2026-03-20 00:39:08 +00:00

.claude-plugin

Add math-olympiad skill — adversarial verification for competition math

2026-03-19 23:17:36 +00:00

skills/math-olympiad

math-olympiad: forbid web access in deep mode

2026-03-20 00:39:08 +00:00

README.md

Add math-olympiad skill — adversarial verification for competition math

2026-03-19 23:17:36 +00:00

README.md

math-olympiad

Competition math solver with adversarial verification.

The problem

Self-verification gets fooled. A verifier that sees the reasoning is biased toward agreement. arXiv:2503.21934 ("Proof or Bluff") showed that an 85.7% self-verified IMO success rate drops to under 5% under human grading.

The approach

Context-isolated verification: the verifier sees only the clean proof, never the reasoning trace
Pattern-armed adversarial checks: not "is this correct?" but "does this accidentally prove the Riemann Hypothesis (RH)?" or "extract the general lemma, find a 2×2 counterexample"
Calibrated abstention: says "no confident solution" rather than bluffing
Presentation pass: produces clean LaTeX/PDF once verification passes
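
The first three ideas above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the skill's actual implementation: `Attempt`, `Verifier`, `verify_isolated`, `solve`, and the wording of `ADVERSARIAL_CHECKS` are all hypothetical names introduced here, and the verifier is assumed to be any callable that returns `True` when a proof survives a check.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical adversarial prompts, paraphrasing the patterns named in this README.
ADVERSARIAL_CHECKS = [
    "Does this argument accidentally prove the Riemann Hypothesis?",
    "Extract the general lemma and look for a 2x2 counterexample.",
]


@dataclass
class Attempt:
    reasoning_trace: str  # scratch work; never passed to the verifier
    clean_proof: str      # the only text the verifier is given


# Assumed interface: (check_prompt, proof_text) -> True if the proof survives.
Verifier = Callable[[str, str], bool]


def verify_isolated(attempt: Attempt, verifier: Verifier) -> bool:
    """Run every adversarial check against the clean proof only."""
    return all(verifier(check, attempt.clean_proof) for check in ADVERSARIAL_CHECKS)


def solve(attempts: List[Attempt], verifier: Verifier) -> str:
    """Return the first proof that survives all checks, otherwise abstain."""
    for attempt in attempts:
        if verify_isolated(attempt, verifier):
            return attempt.clean_proof
    return "no confident solution"
```

In this sketch the isolation is structural: `verify_isolated` passes only `clean_proof` to the verifier, so the reasoning trace cannot bias it. Abstention is the fallback return value of `solve` when no attempt survives every check.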

Validation

17/18 IMO+Putnam 2025 problems solved, 0 false positives, 2 novel proofs found. See the skill's eval data in the anthropic monorepo.

Install

/plugin install math-olympiad@claude-plugins-official

Use

> Solve this IMO problem: [statement]

The skill auto-triggers on "IMO", "Putnam", "olympiad", "verify this proof", etc.
