skill-eval
Automated CLI for evaluating AI agent skills
Version
1.0.0
Binaries
1
Platforms
2
License
Undeclared
Formula metadata
README
title: Home description: skill-evaluator is a friendly CLI tool that automates eval-driven iteration for AI skills, helping you run, grade, and benchmark agent outputs.
✨ skill-eval
Welcome to skill-eval! This friendly little CLI tool helps you automate the eval-driven iteration loop for your AI skills. It's inspired by the workflow from agentskills.io.

With skill-eval, you can easily define test cases, run your agent with and without a skill, have an LLM grade the results, and see how everything benchmarks. Let's make your skills amazing! 🚀