Using the web viewer

After running an eval, view results in your browser:

npx promptfoo@latest view

See promptfoo view for CLI options.

Keyboard Shortcuts

Shortcut         Action
Ctrl+K / Cmd+K   Open eval selector
Esc              Clear search
Shift (hold)     Show extra cell actions

Toolbar

  • Eval selector - Switch between evals
  • Zoom - Scale columns (50%-200%)
  • Display mode - Filter: All, Failures, Passes, Errors, Different, Highlights
  • Search - Text or regex
  • Filters - By metrics, metadata, pass/fail. Operators: =, contains, >, <

Table Settings

  • Columns - Toggle variable and prompt visibility
  • Truncation - Max text length, word wrap
  • Rendering - Markdown, JSON prettification
  • Inference details - Tokens, latency, cost, tokens/sec
  • Media - Image size limits; double-click for lightbox

Cell Actions

Hover over a cell to reveal its actions. Hold Shift to show more:

Action         Description
🔍 Details     Full output, prompt, variables, grading results
👍 Pass        Mark as passed (score = 1.0)
👎 Fail        Mark as failed (score = 0.0)
🔢 Score       Set custom score (0-1)
✏️ Comment     Add notes
⭐ Highlight   Mark for review (Shift)
📋 Copy        Copy to clipboard (Shift)
🔗 Share       Link to this output (Shift)

Ratings and comments persist and are included in exports, so you can use them to build training datasets.

Eval Actions

  • Edit name - Rename eval
  • Edit and re-run - Open in eval creator
  • Compare - Diff against another eval (green = added, red = removed)
  • View YAML - Show config
  • Download - Opens export dialog:
      Export              Use case
      YAML config         Re-run the eval
      Failed tests only   Debug failures
      CSV / JSON          Analysis, reporting
      DPO JSON            Preference training data
      Human Eval YAML     Human labeling workflows
      Burp payloads       Security testing (red team only)
  • Copy - Duplicate eval
  • Share - Generate URL (see Sharing)
  • Delete

Results Charts

Toggle with Show Charts.

Pass Rate

Percentage of tests where all assertions passed.
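As a rough illustration of the definition above, the following Python sketch computes a pass rate from per-test assertion outcomes. The `pass_rate` helper and the list-of-booleans input are hypothetical, chosen only for illustration; they are not promptfoo's export format or internal code.

```python
def pass_rate(tests):
    """Return the percentage of tests whose assertions all passed.

    `tests` is a hypothetical structure: one list of booleans per test,
    one boolean per assertion.
    """
    if not tests:
        return 0.0
    passed = sum(1 for assertions in tests if all(assertions))
    return 100.0 * passed / len(tests)


results = [
    [True, True],   # every assertion passed: counts as a pass
    [True, False],  # one assertion failed: counts as a failure
    [True],
    [False],
]
print(pass_rate(results))  # 50.0
```

A single failing assertion is enough to count the whole test as failed, which is why the pass rate can fall well below the average score.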

Score Distribution

Histogram of scores per prompt. Each test's score is the mean of its assertion scores. See weighted assertions.
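The arithmetic behind a single test score can be sketched as follows. This is a minimal sketch, assuming that weights turn the plain mean into a weighted mean and that a missing weight defaults to 1; the `case_score` helper and the dictionary keys are illustrative names, not promptfoo's API.

```python
def case_score(assertions):
    """Weighted mean of assertion scores for one test.

    Each assertion is a dict with a "score" in [0, 1] and an optional
    "weight" (assumed to default to 1).
    """
    total_weight = sum(a.get("weight", 1) for a in assertions)
    if total_weight == 0:
        return 0.0
    weighted_sum = sum(a["score"] * a.get("weight", 1) for a in assertions)
    return weighted_sum / total_weight


# Equal weights: a plain mean.
print(case_score([{"score": 1.0}, {"score": 0.5}]))  # 0.75

# The failing assertion carries three times the weight of the passing one.
print(case_score([{"score": 1.0, "weight": 1}, {"score": 0.0, "weight": 3}]))  # 0.25
```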

Scatter Plot

Compare two prompts head-to-head. Click to select prompts.

  • Green = Prompt 2 scored higher
  • Red = Prompt 1 scored higher
  • Gray = Same score
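The color legend above amounts to a three-way comparison per test. The sketch below restates it in Python; the `point_color` function and its tolerance parameter are hypothetical and only mirror the legend, not the viewer's actual rendering code.

```python
def point_color(prompt1_score, prompt2_score, tolerance=1e-9):
    """Map a pair of scores for the same test to the legend color."""
    if abs(prompt1_score - prompt2_score) <= tolerance:
        return "gray"   # same score
    if prompt2_score > prompt1_score:
        return "green"  # Prompt 2 scored higher
    return "red"        # Prompt 1 scored higher


pairs = [(0.5, 1.0), (1.0, 0.25), (0.75, 0.75)]
print([point_color(p1, p2) for p1, p2 in pairs])  # ['green', 'red', 'gray']
```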

Sharing

Eval actions → Share generates a URL.

Cloud

Free at promptfoo.app. Links are private to your organization.

promptfoo auth login -k YOUR_API_KEY
promptfoo share

Self-hosted

For self-hosted deployments:

promptfooconfig.yaml
sharing:
  apiBaseUrl: http://your-server:3000
  appBaseUrl: http://your-server:3000

Or set via API Settings in the top-right menu. See sharing docs for auth and CI/CD.

URL Parameters

Viewer state syncs to the URL, so you can bookmark or share filtered views:

Parameter    Values
filterMode   all, failures, passes, errors, different, highlights
search       Any text

Example: /eval/abc123?filterMode=failures&search=timeout
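If you generate such links from a script, for example to post a failures-only view in a CI comment, a small helper can assemble them. This is a sketch under assumptions: the `viewer_url` function is hypothetical, the base URL `http://localhost:15500` is a stand-in for wherever your viewer is served, and only the two parameters listed above are used.

```python
from urllib.parse import urlencode

# Values taken from the filterMode row of the parameter table.
FILTER_MODES = {"all", "failures", "passes", "errors", "different", "highlights"}


def viewer_url(base_url, eval_id, filter_mode="all", search=None):
    """Build a viewer link with the filterMode and search query parameters."""
    if filter_mode not in FILTER_MODES:
        raise ValueError(f"unknown filterMode: {filter_mode!r}")
    params = {"filterMode": filter_mode}
    if search:
        params["search"] = search  # urlencode escapes spaces and symbols
    return f"{base_url.rstrip('/')}/eval/{eval_id}?{urlencode(params)}"


# Assumed base URL; substitute the address of your own viewer.
print(viewer_url("http://localhost:15500", "abc123", "failures", "timeout"))
# http://localhost:15500/eval/abc123?filterMode=failures&search=timeout
```

Using `urlencode` rather than string concatenation keeps search terms with spaces or regex characters valid in the URL.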