LLM 评判工具

Name: LLM 评判工具
Author: NeoLabHQ

带上下文隔离和思维链推理的独立 LLM 评判评估工具

byNeoLabHQ

Home/AI & ML/LLM 评判工具

What is it?

独立的 LLM-as-Judge 评估工具，含上下文隔离、链式思维评分、多维加权评分标准和证据支撑评估。

How to use it?

安装技能后，Claude 会在需要质量评估时作为独立评判者运行，提供客观的多维度评分报告。

Key Features

独立上下文隔离评估
链式思维推理评分
多维加权评分标准
证据支撑的评估结论
客观公正的质量判断

View on GitHub

GitHub Stats

Stars

Forks

Last Update

Author

NeoLabHQ

License

GPL-3.0

Version

1.0.0

Features

Related Skills

Multi-Agent Architecture Patterns

Reference guide for multi-agent architecture patterns including Supervisor/Orchestrator, Peer-to-Peer/Swarm, and Hierarchical, with context isolation principles and Claude Code implementation

433NeoLabHQ

AI & ML

Developer Tools

Agent Evaluation Framework

Comprehensive Claude Code agent evaluation framework with multi-dimensional scoring, LLM-as-Judge mode, and research-backed performance variance analysis

433NeoLabHQ

AI & ML

Developer Tools