# Promptfoo: LLM evals & red teaming
promptfoo is a developer-friendly local tool for testing LLM applications. Stop the trial-and-error approach - start shipping secure, reliable AI apps.
Website · Getting Started · Red Teaming · Documentation · Discord
## Quick Start

```sh
# Install and initialize project
npx promptfoo@latest init

# Run your first evaluation
npx promptfoo eval
```
See Getting Started (evals) or Red Teaming (vulnerability scanning) for more.
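Running `init` scaffolds a `promptfooconfig.yaml`, which `eval` reads. A minimal sketch follows; the model ID, prompt, and test case are illustrative:

```yaml
# promptfooconfig.yaml: one prompt, one provider, one test case
prompts:
  - "Translate this to French: {{text}}"

providers:
  - openai:gpt-4o-mini

tests:
  - vars:
      text: "Hello, world"
    assert:
      # Case-insensitive substring check on the model output
      - type: icontains
        value: "bonjour"
```

`npx promptfoo eval` runs every prompt/provider/test combination, and `npx promptfoo view` opens the results in a browser.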
## What can you do with Promptfoo?
- Test your prompts and models with automated evaluations
- Secure your LLM apps with red teaming and vulnerability scanning
- Compare models side-by-side (OpenAI, Anthropic, Azure, Bedrock, Ollama, and more; see the config sketch after this list)
- Automate checks in CI/CD
- Review pull requests for LLM-related security and compliance issues with code scanning
- Share results with your team
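Side-by-side comparison, for example, is just a matter of listing multiple providers: every prompt and test case runs against each one. A sketch, with example provider IDs (not a fixed list):

```yaml
# Each prompt and test case runs against all listed providers,
# so outputs appear next to each other in the results grid
providers:
  - openai:gpt-4o-mini
  - anthropic:messages:claude-3-5-sonnet-20241022
  - ollama:chat:llama3.1
```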
Here's what it looks like in action: promptfoo includes a web viewer for exploring eval results, works on the command line, and can generate security vulnerability reports.
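Red teaming is configured with a `redteam` block in the same config file. A rough sketch, assuming the documented plugin and strategy names; the target, purpose, and selections here are illustrative:

```yaml
targets:
  - openai:gpt-4o-mini

redteam:
  # Describes the app so generated attacks are relevant to it
  purpose: "Customer support agent for a retail store"
  plugins:
    - pii        # probe for personal-data leakage
    - contracts  # probe for unauthorized commitments
  strategies:
    - jailbreak
    - prompt-injection
```

`npx promptfoo redteam run` generates adversarial test cases and executes them against the target, and `npx promptfoo redteam report` renders the vulnerability report.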
## Why Promptfoo?
- 🚀 Developer-first: Fast, with features like live reload and caching
- 🔒 Private: LLM evals run 100% locally - your prompts never leave your machine
- 🔧 Flexible: Works with any LLM API or programming language
- 💪 Battle-tested: Powers LLM apps serving 10M+ users in production
- 📊 Data-driven: Make decisions based on metrics, not gut feel
- 🤝 Open source: MIT licensed, with an active community
## Learn More

See the Documentation for guides, configuration reference, and examples.
## Contributing
We welcome contributions! Check out our contributing guide to get started.
Join our Discord community for help and discussion.