Skip to main content

Javascript assertions

The javascript assertion allows you to provide a custom JavaScript function to validate the LLM output.

A variable named output is injected into the context. The function should return true if the output passes the assertion, and false otherwise. If the function returns a number, it will be treated as a score.

You can use any valid JavaScript code in your function. The output of the LLM is provided as the output variable:

assert:
- type: javascript
value: "output.includes('Hello, World!')"

In the example above, the javascript assertion checks if the output includes the string "Hello, World!". If it does, the assertion passes and a score of 1 is recorded. If it doesn't, the assertion fails and a score of 0 is returned.

If you want to return a custom score, your function should return a number. For example:

assert:
- type: javascript
value: Math.log(output.length) * 10
threshold: 0.5 # any value above 0.5 will pass

In the example above, the longer the output, the higher the score.

If your function throws an error, the assertion will fail and the error message will be included in the reason for the failure. For example:

assert:
- type: javascript
value: |
if (errorCase) {
throw new Error('This is an error');
}
return {
pass: false,
score: 0,
reason: 'Assertion failed',
};

Handling objects

If the LLM outputs a JSON object (such as in the case of tool/function calls), then output will already be parsed as an object:

assert:
- type: javascript
value: output[0].function.name === 'get_current_weather'

Return type

The return value of your Javascript function can be a boolean, number, or a GradingResult:

type JavascriptAssertionResult = boolean | number | GradingResult;

// Used for more complex results
interface GradingResult {
pass: boolean;
score: number;
reason: string;
componentResults?: GradingResult[];
}

If componentResults is set, a table of assertion details will be shown in the test output modal in the Eval view.

Multiline functions

Javascript assertions support multiline strings:

assert:
- type: javascript
value: |
// Insert your scoring logic here...
if (output === 'Expected output') {
return {
pass: true,
score: 0.5,
};
}
return {
pass: false,
score: 0,
reason: 'Assertion failed',
};

Using test context

The context variable contains the prompt and test case variables:

interface AssertContext {
// Raw prompt sent to LLM
prompt: string;

// Test case variables
vars: Record<string, string | object>;
}

For example, if the test case has a var example, access it in your JavaScript function like this:

tests:
- description: 'Test with context'
vars:
example: 'Example text'
assert:
- type: javascript
value: 'output.includes(context.vars.example)'

You can also use the context variable to perform more complex checks. For example, you could check if the output is longer than a certain length defined in your test case variables:

tests:
- description: 'Test with context'
vars:
min_length: 10
assert:
- type: javascript
value: 'output.length >= context.vars.min_length'

External script

To reference an external file, use the file:// prefix:

assert:
- type: javascript
value: file://relative/path/to/script.js
config:
maximumOutputSize: 10

You can specify a particular function to use by appending it after a colon:

assert:
- type: javascript
value: file://relative/path/to/script.js:customFunction

The JavaScript file must export an assertion function. Here are examples:

// Default export
module.exports = (output, context) => {
return output.length > 10;
};
// Named exports
module.exports.customFunction = (output, context) => {
return output.includes('specific text');
};

Here's an example using configuration data defined in the assertion's YAML file:

module.exports = (output, context) => {
return output.length <= context.config.maximumOutputSize;
};

Here's a more complex example that uses an async function to hit an external validation service:

const VALIDATION_ENDPOINT = 'https://example.com/api/validate';

async function evaluate(modelResponse) {
try {
const response = await fetch(VALIDATION_ENDPOINT, {
method: 'POST',
headers: {
'Content-Type': 'text/plain',
},
body: modelResponse,
});

const data = await response.json();
return data;
} catch (error) {
throw error;
}
}

async function main(output, context) {
const success = await evaluate(output);
console.log(`success: ${testResult}`);
return success;
}

module.exports = main;

You can also return complete GradingResult objects. For example:

module.exports = (output, context) => {
console.log('Prompt:', context.prompt);
console.log('Vars', context.vars.topic);

// You can return a bool...
// return output.toLowerCase().includes('bananas');

// A score (where 0 = Fail)...
// return 0.5;

// Or an entire grading result, which can be simple...
let result = {
pass: output.toLowerCase().includes('bananas'),
score: 0.5,
reason: 'Contains banana',
};

// Or include nested assertions...
result = {
pass: true,
score: 0.75,
reason: 'Looks good to me',
componentResults: [
{
pass: output.toLowerCase().includes('bananas'),
score: 0.5,
reason: 'Contains banana',
namedScores: {
'Uses banana': 1.0,
},
},
{
pass: output.toLowerCase().includes('yellow'),
score: 0.5,
reason: 'Contains yellow',
namedScores: {
Yellowish: 0.66,
},
},
],
};

return result;
};

ES modules

ES modules are supported, but must have a .mjs file extension. Alternatively, if you are transpiling Javascript or Typescript, we recommend pointing promptfoo to the transpiled plain Javascript output.

Other assertion types

For more info on assertions, see Test assertions.