Skip to main content

skill-eval

Automated CLI for evaluating AI agent skills

Version 1.0.0 1binary License undeclared

Last synced:

Version
1.0.0
Binaries
1
Platforms
2
License
Undeclared

Formula metadata

Version
1.0.0
Homepage
https://github.com/matt-riley/skill-evaluator
Formula file
Formula/skill-eval.rb
Install
brew install matt-riley/tools/skill-eval
Binary name
skill-eval
Platforms
macOS (Intel, Apple Silicon), Linux (x86_64, arm64)
License
Not declared in the formula

README


title: Home description: skill-evaluator is a friendly CLI tool that automates eval-driven iteration for AI skills, helping you run, grade, and benchmark agent outputs.

✨ skill-eval

Welcome to skill-eval! This friendly little CLI tool helps you automate the eval-driven iteration loop for your AI skills. It's inspired by the workflow from agentskills.io.

Skill Evaluator in action!

With skill-eval, you can easily define test cases, run your agent with and without a skill, have an LLM grade the results, and see how everything benchmarks. Let's make your skills amazing! 🚀