# Llm Evaluator Pro

by aiwithabidi · skill · 7 votes

## LLM Evaluator ⚖️

LLM-as-a-Judge evaluation system powered by Langfuse. Uses GPT-5-nano to score AI outputs.

### When to Use

- Evaluating the quality of search results or AI responses
- Scoring traces

## AI Summary

This tool evaluates the quality of AI responses by scoring them on relevance, accuracy, and helpfulness with an LLM judge, and it integrates with Langfuse for trace tracking.
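The listing doesn't include the skill's source, but the flow it describes — a judge model scores an output, and the scores are attached to a Langfuse trace — might look roughly like the minimal sketch below. It assumes the OpenAI Python SDK and the v2-style `langfuse.score()` call; `judge_response` and the prompt are illustrative, not the skill's actual API.

```python
# Hypothetical sketch of an LLM-as-a-Judge flow, NOT the skill's real code.
# Assumes OpenAI Python SDK >= 1.0 and the Langfuse v2-style Python SDK.
import json

from langfuse import Langfuse
from openai import OpenAI

client = OpenAI()
langfuse = Langfuse()  # reads LANGFUSE_* keys from the environment

JUDGE_PROMPT = (
    "Rate the assistant answer below for relevance, accuracy, and "
    "helpfulness, each from 0.0 to 1.0. Reply with JSON only, e.g. "
    '{"relevance": 0.9, "accuracy": 0.8, "helpfulness": 0.7}.'
)

def judge_response(question: str, answer: str, trace_id: str) -> dict:
    """Score one answer with the judge model and record the scores in Langfuse."""
    completion = client.chat.completions.create(
        model="gpt-5-nano",  # judge model named in the listing
        messages=[
            {"role": "system", "content": JUDGE_PROMPT},
            {"role": "user", "content": f"Question:\n{question}\n\nAnswer:\n{answer}"},
        ],
    )
    scores = json.loads(completion.choices[0].message.content)
    for name, value in scores.items():
        # Attach each criterion as a numeric score on the evaluated trace.
        langfuse.score(trace_id=trace_id, name=name, value=float(value))
    return scores
```

Emitting one named score per criterion (rather than a single averaged number) lets Langfuse filter and chart relevance, accuracy, and helpfulness independently across traces.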

## Install

`claw install aiwithabidi/llm-evaluator-pro`

## Security Analysis

**Security Score: 6/10.** Composite score from AI analysis of code safety, publisher trust, scope clarity, permission surface, and community signals. This is a preliminary score; detailed analysis is pending.

**Verdict: review.** Derived from the security score: Safe (7+) · Review (5-6) · Suspicious (3-4) · Malicious (1-2).

**Risk Level: N/A.** Overall risk assessment scale: Low (safe to use), Medium (review recommended), High (use with caution), Critical (do not use).

This entry has preliminary scoring; detailed multi-criteria analysis is in progress.
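For concreteness, here is how the published thresholds map a score to a verdict; the function is an illustration of the banding above, not ClawGrid's actual code.

```python
# Illustrative mapping of ClawGrid's published verdict thresholds;
# the function name and structure are assumptions, not ClawGrid's code.
def verdict(score: int) -> str:
    """Map a 1-10 security score to its verdict band."""
    if score >= 7:
        return "Safe"
    if score >= 5:
        return "Review"
    if score >= 3:
        return "Suspicious"
    return "Malicious"

print(verdict(6))  # -> "Review", matching this entry's 6/10 score
```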

## Repository Insights

- Contributors: 0
- Repository size: 0 KB

## Frequently Asked Questions

### What is Llm Evaluator Pro?

It evaluates the quality of AI responses by scoring them on relevance, accuracy, and helpfulness with an LLM judge, and it integrates with Langfuse for trace tracking.

### Is Llm Evaluator Pro safe to use?

Llm Evaluator Pro has been analyzed by ClawGrid's security engine and rated "review" with a security score of 6/10. See the Security Analysis section above for details.

### How do I find more AI & LLMs tools?

Browse all AI & LLMs tools on ClawGrid, or explore all skills and agents.
