mirror of https://github.com/anthropics/claude-plugins-official.git synced 2026-03-20 11:33:08 +00:00

Files

Ralph Furman c3f6d9e9fa Fix YAML frontmatter — quote description, replace colon with em-dash

Unquoted colon in 'calibrated confidence: will say' broke strict YAML parsing.
CC's parser is lenient but fix for robustness.

2026-03-20 02:07:27 +00:00

.claude-plugin: Add math-olympiad skill — adversarial verification for competition math (2026-03-19 23:17:36 +00:00)

skills/math-olympiad: Fix YAML frontmatter — quote description, replace colon with em-dash (2026-03-20 02:07:27 +00:00)

README.md: Add math-olympiad skill — adversarial verification for competition math (2026-03-19 23:17:36 +00:00)

README.md

math-olympiad

Competition math solver with adversarial verification.

The problem

Self-verification gets fooled: a verifier that sees the reasoning is biased toward agreeing with it. arXiv:2503.21934 ("Proof or Bluff") showed that an 85.7% self-verified IMO success rate drops to <5% under human grading.

The approach

Context-isolated verification: verifier sees only the clean proof, never the reasoning trace
Pattern-armed adversarial checks: the verifier asks not "is this correct?" but "does this accidentally prove RH (the Riemann Hypothesis)?" / "extract the general lemma, find a 2×2 counterexample"
Calibrated abstention: says "no confident solution" rather than bluffing
Presentation pass: produces clean LaTeX/PDF after verification passes
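
The first three steps above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the skill's actual implementation: `Attempt`, `ADVERSARIAL_CHECKS`, `verify_isolated`, `solve`, and `toy_verifier` are all hypothetical names, and the toy verifier stands in for what would presumably be a fresh model call with no access to the solver's context.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Attempt:
    reasoning_trace: str  # scratch work; never passed to the verifier
    clean_proof: str      # the only text the verifier is allowed to see


# Hypothetical adversarial prompts, paraphrasing the checks listed above.
ADVERSARIAL_CHECKS = [
    "Does this argument, if valid, accidentally prove a famous open problem?",
    "Extract the most general lemma used and look for a 2x2 counterexample.",
]

# A verifier takes (clean_proof, check) and returns True if the proof survives.
Verifier = Callable[[str, str], bool]


def verify_isolated(attempt: Attempt, verifier: Verifier) -> bool:
    """Run every adversarial check against the clean proof only."""
    return all(verifier(attempt.clean_proof, check) for check in ADVERSARIAL_CHECKS)


def solve(attempts: List[Attempt], verifier: Verifier) -> Optional[str]:
    """Return the first proof that survives all checks, or None to abstain."""
    for attempt in attempts:
        if verify_isolated(attempt, verifier):
            return attempt.clean_proof
    return None  # calibrated abstention: no confident solution


def toy_verifier(clean_proof: str, check: str) -> bool:
    # Assumption: a crude stand-in that rejects hand-waving; a real verifier
    # would evaluate the proof against the specific check it is given.
    return "clearly" not in clean_proof.lower()


attempts = [
    Attempt("guessing the pattern holds", "Clearly the claim holds for all n."),
    Attempt("tried induction, fixed the base case",
            "By induction on n, with base case n = 1."),
]

result = solve(attempts, toy_verifier)
print(result if result is not None else "no confident solution")
```

The design point is that `verify_isolated` only ever reads `clean_proof`, so nothing in the reasoning trace can bias the verdict, and `solve` returns `None` instead of falling back to an unverified attempt.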

Validation

17/18 IMO+Putnam 2025 problems solved, 0 false positives, 2 novel proofs found. See the skill's eval data in the anthropic monorepo.

Install

/plugin install math-olympiad@claude-plugins-official

Use

> Solve this IMO problem: [statement]

The skill auto-triggers on "IMO", "Putnam", "olympiad", "verify this proof", etc.
