🤖AI & LLM
370
70

ai-evals

Help users create and run AI evaluations. Use when someone is building evals for LLM products, measuring model quality, creating test cases, designing rubrics, or trying to systematically measure AI output quality.

#eval#llm#testing#quality-management
Share
Quick Install
>_npx skills add refoundai/lenny-skills
Documentation
Loading documentation...
Repository
Repositoryrefoundai/lenny-skills
Stars370
Last UpdatedJan 31, 2026
Related Skills
271,400
6,331

find-skills

Helps users discover and install agent skills based on their queries.

vercel-labs
vercel-labs/skills
46,800
19,561

agent-browser

A CLI tool for AI agents to automate browser tasks like navigation, form filling, and data scraping.

vercel-labs
vercel-labs/agent-browser
34,600
79,803

browser-use

Automates browser interactions for web testing, form filling, screenshots, and data extraction.

browser-use
browser-use/browser-use
32,600
86,065

skill-creator

A guide for creating effective AI skills that extend Claude's capabilities with specialized knowledge, workflows, or tool integrations.

anthropics
anthropics/skills
24,400
55,506

brainstorming

A skill for brainstorming and exploring user intent before implementing creative work.

obra
obra/superpowers