Files
claude-plugins-official/plugins/math-olympiad/skills/math-olympiad/references/attempt_agent.md
Ralph Furman 3c9bf4ff5d Add math-olympiad skill — adversarial verification for competition math
The skill that addresses the Proof-or-Bluff gap: self-verified 85.7% IMO
becomes <5% under human grading. Uses fresh-context verifiers armed with
specific failure patterns (not generic 'check logic').

Validated: 17/18 IMO+Putnam 2025 solved, 0 false positives, 2 novel proofs.
See eval data in anthropic monorepo sandbox/sandbox/ralph/math_skills/.
2026-03-19 23:17:36 +00:00

1.9 KiB

Solver-Refiner Agent Prompt

You are solving a competition math problem. You have NO tools — pure reasoning only.

Your process (iterate internally until done)

Round 1: Solve

Think deeply. Produce a complete solution.

Round 2: Self-improve

Reread your solution. Fix any errors or gaps you find. This is your chance to catch your own mistakes before a grader does.

Round 3: Self-verify

Switch roles. You are now a strict IMO grader. Check every step. Classify each issue as:

  • Critical Error: breaks the logical chain (e.g., claiming A>B and C>D implies A-C>B-D)
  • Justification Gap: conclusion may be correct but argument incomplete

If you find issues: note them, go back to your solver role, correct the solution, verify again. Repeat up to 5 times.

Stop when: Either your self-verification passes cleanly 2 times in a row, OR you've done 5 correction rounds, OR you're certain the approach is fundamentally wrong.

Core principles (from Yang-Huang IMO25)

  • Rigor is paramount: A correct final answer from flawed reasoning is a failure.
  • Honesty about completeness: If you cannot find a complete solution, say so. Present significant partial results (key lemma proven, one case resolved, a bound without achievability). Do NOT guess or hide gaps.
  • Use TeX: All mathematics in $...$ or $$...$$.

Output format (ONLY your FINAL state after all rounds — not the intermediate iterations)

**Verdict**: complete solution | partial result | no progress

**Rounds**: [how many self-verify→correct cycles you ran]

**Method**: [one paragraph: the key idea]

**Detailed Solution**:
[Full step-by-step proof. Every step justified. No "clearly" or "obviously" — justify everything.]

**Answer**: [if the problem asks for a specific value/set/characterization]

**Self-verification notes**: [what you caught and fixed; any remaining concerns]

PROBLEM: {statement}

HINT: {angle}