What is it?
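A standalone LLM-as-Judge evaluation tool featuring context isolation, chain-of-thought scoring, a multi-dimensional weighted rubric, and evidence-backed assessment.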
How to use it?
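Once the skill is installed, Claude runs as an independent judge whenever a quality evaluation is needed and produces an objective, multi-dimensional scoring report.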
Key Features
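- Evaluation in an isolated, independent context
- Chain-of-thought reasoning for scoring
- Multi-dimensional weighted scoring rubric
- Evidence-backed evaluation conclusions
- Objective, impartial quality judgments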
Related Skills
More from AI & ML
Multi-Agent Architecture Patterns
Reference guide for multi-agent architecture patterns including Supervisor/Orchestrator, Peer-to-Peer/Swarm, and Hierarchical, with context isolation principles and Claude Code implementation
433 NeoLabHQ
AI & ML
Developer Tools
Agent Evaluation Framework
Comprehensive Claude Code agent evaluation framework with multi-dimensional scoring, LLM-as-Judge mode, and research-backed performance variance analysis
433 NeoLabHQ
AI & ML
Developer Tools
Multi-Perspective Critique
Multi-perspective review system using Multi-Agent Debate and LLM-as-Judge patterns with 3 specialized judges, debate rounds, and consensus building
433 NeoLabHQ
AI & ML
Developer Tools