L
LLM-as-Judge 평가 도구
컨텍스트 격리, Chain-of-Thought 점수, 다차원 가중 루브릭, 증거 기반 평가를 갖춘 독립형 LLM-as-Judge 평가 도구
byNeoLabHQ
What is it?
컨텍스트 격리, Chain-of-Thought 점수화, 다차원 가중치 루브릭, 증거 기반 평가가 포함된 독립형 LLM-as-Judge 평가 도구입니다.
How to use it?
Claude 환경에 설치하면 LLM-as-Judge 평가자 관련 작업 시 자동으로 스킬 지침을 적용합니다.
Key Features
- 컨텍스트 격리를 갖춘 독립형 LLM-as-Judge 평가
- Chain-of-Thought 점수화
- 다차원 가중치 루브릭
- 증거 기반 평가
- Claude 개발 워크플로우와 원활한 통합
Related Skills
More from AI & MLMulti-Agent Architecture Patterns
Reference guide for multi-agent architecture patterns including Supervisor/Orchestrator, Peer-to-Peer/Swarm, and Hierarchical, with context isolation principles and Claude Code implementation
433NeoLabHQ
AI & ML
Developer Tools
Agent Evaluation Framework
Comprehensive Claude Code agent evaluation framework with multi-dimensional scoring, LLM-as-Judge mode, and research-backed performance variance analysis
433NeoLabHQ
AI & ML
Developer Tools
Multi-Perspective Critique
Multi-perspective review system using Multi-Agent Debate and LLM-as-Judge patterns with 3 specialized judges, debate rounds, and consensus building
433NeoLabHQ
AI & ML
Developer Tools