执
What is it?
单任务执行与 LLM-as-Judge 验证的迭代循环技能,支持自动重试和严格编排者角色分离。
How to use it?
安装技能后,Claude 会执行任务并自动调用 Judge 代理验证结果,未通过则自动重试直到达标。
Key Features
- 单任务执行与质量验证循环
- LLM-as-Judge 自动评估
- 支持自动重试机制
- 严格编排者角色分离
- 迭代改进直到质量达标
Related Skills
More from AI & MLAgent Evaluation Framework
Comprehensive Claude Code agent evaluation framework with multi-dimensional scoring, LLM-as-Judge mode, and research-backed performance variance analysis
433NeoLabHQ
AI & ML
Developer Tools
Self-Reflection Framework
Iterative self-improvement system with task complexity grading, strict quality gatekeeper role, confidence thresholds, and verification checklists
433NeoLabHQ
AI & ML
Developer Tools
Multi-Perspective Critique
Multi-perspective review system using Multi-Agent Debate and LLM-as-Judge patterns with 3 specialized judges, debate rounds, and consensus building
433NeoLabHQ
AI & ML
Developer Tools