LLM-as-Judgeエバリュエーター

Name: LLM-as-Judgeエバリュエーター
Author: NeoLabHQ

コンテキスト分離、Chain-of-Thoughtスコアリング、多次元加重ルーブリック、エビデンスに裏付けされた評価を備えたスタンドアロンのLLM-as-Judge評価ツール

byNeoLabHQ

Home/AI & ML/LLM-as-Judgeエバリュエーター

What is it?

LLM-as-Judgeパターンを使用して出力を評価し、品質スコア、改善提案、合格/不合格の判定を行うジャッジスキルです。

How to use it?

Claude環境にインストールすると、出力評価関連の作業時に自動的にスキルのガイドラインを適用します。

Key Features

LLM-as-Judgeパターンによる出力評価、品質スコアリング、改善提案
Claude開発ワークフローとのシームレスな統合
ジャッジパターンの包括的なガイドラインとベストプラクティス

View on GitHub

GitHub Stats

Stars

Forks

Last Update

Author

NeoLabHQ

License

GPL-3.0

Version

1.0.0

Features

Related Skills

Multi-Agent Architecture Patterns

Reference guide for multi-agent architecture patterns including Supervisor/Orchestrator, Peer-to-Peer/Swarm, and Hierarchical, with context isolation principles and Claude Code implementation

433NeoLabHQ

AI & ML

Developer Tools

Agent Evaluation Framework

Comprehensive Claude Code agent evaluation framework with multi-dimensional scoring, LLM-as-Judge mode, and research-backed performance variance analysis

433NeoLabHQ

AI & ML

Developer Tools