HTTP/HTTPS API
Setting the provider ID to a URL sends an HTTP request to the endpoint. This provides a general-purpose way to use any HTTP endpoint for inference.
The provider configuration allows you to construct the HTTP request and extract the inference result from the response.
providers:
- id: https
config:
url: 'https://example.com/generate'
method: 'POST'
headers:
'Content-Type': 'application/json'
body:
myPrompt: '{{prompt}}'
transformResponse: 'json.output' # Extract the "output" field from the response
The placeholder variable {{prompt}}
will be replaced with the final prompt for the test case. You can also reference test variables as you construct the request:
providers:
- id: https
config:
url: 'https://example.com/generateTranslation'
body:
prompt: '{{prompt}}'
model: '{{model}}'
translate: '{{language}}'
tests:
- vars:
model: 'gpt-4o-mini'
language: 'French'
body
can be a string or JSON object. If the body is a string, the Content-Type
header defaults to text/plain
unless specified otherwise. If the body is an object, then content type is automatically set to application/json
.
JSON Example
providers:
- id: https
config:
url: 'https://example.com/generateTranslation'
body:
model: '{{model}}'
translate: '{{language}}'
Form-data Example
providers:
- id: https
config:
headers:
'Content-Type': 'application/x-www-form-urlencoded'
body: 'model={{model}}&translate={{language}}'
Sending a raw HTTP request
You can also send a raw HTTP request by specifying the request
property in the provider configuration. This allows you to have full control over the request, including headers and body.
Here's an example of how to use the raw HTTP request feature:
providers:
- id: https
config:
request: |
POST /v1/completions HTTP/1.1
Host: api.example.com
Content-Type: application/json
Authorization: Bearer {{api_key}}
{
"model": "llama3.1-405b-base",
"prompt": "{{prompt}}",
"max_tokens": 100
}
transformResponse: 'json.content' # extract the "content" field from the response
In this example:
- The
request
property contains a raw HTTP request, including the method, path, headers, and body. - You can use template variables like
{{api_key}}
and{{prompt}}
within the raw request. These will be replaced with actual values when the request is sent. - The
transformResponse
property is used to extract the desired information from the JSON response.
You can also load the raw request from an external file using the file://
prefix:
providers:
- id: https
config:
request: file://path/to/request.txt
transformResponse: 'json.text'
This path is relative to the directory containing the Promptfoo config file.
Then create a file at path/to/request.txt
:
POST /api/generate HTTP/1.1
Host: example.com
Content-Type: application/json
{"prompt": "Tell me a joke"}
Nested objects
Nested objects are supported and should be passed to the dump
function.
providers:
- id: https
config:
url: 'https://example.com/generateTranslation'
body:
messages: '{{messages | dump}}'
model: '{{model}}'
translate: '{{language}}'
tests:
- vars:
messages:
- role: 'user'
content: 'foobar'
- role: 'assistant'
content: 'baz'
model: 'gpt-4o-mini'
language: 'French'
Note that any valid JSON string within body
will be converted to a JSON object.
Query parameters
Query parameters can be specified in the provider config using the queryParams
field. These will be appended to the URL as GET parameters.
providers:
- id: https
config:
url: 'https://example.com/search'
method: 'GET'
queryParams:
q: '{{prompt}}'
foo: 'bar'
Using as a library
If you are using promptfoo as a node library, you can provide the equivalent provider config:
{
// ...
providers: [{
id: 'https',
config: {
url: 'https://example.com/generate',
method: 'POST',
headers: {
'Content-Type': 'application/json',
},
body: {
foo: '{{bar}}',
},
transformResponse: (json) => json.output,
}
}],
}
Request Transform
Request transform modifies your prompt after it is rendered but before it is sent to a provider API. This allows you to:
- Format prompts into specific message structures
- Add metadata or context
- Handle nuanced message formats for multi-turn conversations
Basic Usage
providers:
- id: https
config:
url: 'https://api.example.com/chat'
transformRequest: '{"message": "{{prompt}}"}'
body:
user_message: '{{prompt}}'
Transform Types
String Template
Use Nunjucks templates to transform the prompt:
transformRequest: '{"text": "{{prompt}}"}'
JavaScript Function
Define a function that transforms the prompt:
transformRequest: (prompt) => JSON.stringify({ text: prompt, timestamp: Date.now() });
File-based Transform
Load a transform from an external file:
transformRequest: 'file://transforms/request.js'
Example transform file (transforms/request.js):
module.exports = (prompt) => {
return {
text: prompt,
metadata: {
timestamp: Date.now(),
version: '1.0',
},
};
};
You can also specify a specific function to use:
transformRequest: 'file://transforms/request.js:transformRequest'
Response Transform
The transformResponse
option allows you to extract and transform the API response. If no transformResponse
is specified, the provider will attempt to parse the response as JSON. If JSON parsing fails, it will return the raw text response.
You can override this behavior by specifying a transformResponse
in the provider config. The transformResponse
can be one of the following:
- A string containing a JavaScript expression
- A function
- A file path (prefixed with
file://
) to a JavaScript module
Parsing a JSON response
By default, the entire response is returned as the output. If your API responds with a JSON object and you want to pick out a specific value, use the transformResponse
property to set a JavaScript snippet that manipulates the provided json
object.
For example, this transformResponse
configuration:
providers:
- id: https
config:
url: 'https://example.com/openai-compatible/chat/completions'
# ...
transformResponse: 'json.choices[0].message.content'
Extracts the message content from this response:
{
"id": "chatcmpl-abc123",
"object": "chat.completion",
"created": 1677858242,
"model": "gpt-4o-mini",
"usage": {
"prompt_tokens": 13,
"completion_tokens": 7,
"total_tokens": 20
},
"choices": [
{
"message": {
"role": "assistant",
"content": "\n\nThis is a test!"
},
"logprobs": null,
"finish_reason": "stop",
"index": 0
}
]
}
Parsing a text response
If your API responds with a text response, you can use the transformResponse
property to set a JavaScript snippet that manipulates the provided text
object.
For example, this transformResponse
configuration:
providers:
- id: https
config:
url: 'https://example.com/api'
# ...
transformResponse: 'text.slice(11)'
Extracts the message content "hello world" from this response:
Assistant: hello world
Response Parser Types
String parser
You can use a string containing a JavaScript expression to extract data from the response:
providers:
- id: https
config:
url: 'https://example.com/api'
transformResponse: 'json.choices[0].message.content'
This expression will be evaluated with two variables available:
json
: The parsed JSON response (if the response is valid JSON)text
: The raw text response
Function parser
When using promptfoo as a Node.js library, you can provide a function as the response parser:
{
providers: [{
id: 'https',
config: {
url: 'https://example.com/generate',
transformResponse: (json, text) => {
// Custom parsing logic
return json.choices[0].message.content;
},
}
}],
}
File-based parser
You can use a JavaScript file as a response parser by specifying the file path with the file://
prefix. The file path is resolved relative to the directory containing the promptfoo configuration file.
providers:
- id: https
config:
url: 'https://example.com/api'
transformResponse: 'file://path/to/parser.js'
The parser file should export a function that takes two arguments (json
and text
) and returns the parsed output. For example:
module.exports = (json, text) => {
return json.choices[0].message.content;
};
You can also use a default export:
export default (json, text) => {
return json.choices[0].message.content;
};
You can also specify a function name to be imported from a file:
providers:
- id: https
config:
url: 'https://example.com/api'
transformResponse: 'file://path/to/parser.js:parseResponse'
This will import the function parseResponse
from the file path/to/parser.js
.
Session management
Server-side session management
When using an HTTP provider with multi-turn redteam attacks like GOAT and Crescendo, you may need to maintain session IDs between rounds. The HTTP provider will automatically extract the session ID from the response headers and store it in the vars
object.
Create a session parser that extracts the session ID from the response headers and returns it. All of the same formats of response parsers are supported.
The input to the session parser is an object with a headers
field, which contains the response headers:
{
headers: Record<string, string>;
}
Simple header parser:
sessionParser: 'set-cookie'
The parser can take a string, file or function like the response parser. If you just include a string, it will be treated as a field on the headers
object.
Then you need to set the session ID in the vars
object for the next round:
providers:
- id: https
config:
url: 'https://example.com/api'
headers:
'Cookie': '{{sessionId}}'
Client-side session management
If you want the Promptfoo client to send a unique session or conversation ID with each test case, you can add a transformVars
option to your Promptfoo or redteam config. This is useful for multi-turn evals or multi-turn redteam attacks where the provider maintains a conversation state.
For example:
defaultTest:
options:
transformVars: '{ ...vars, sessionId: context.uuid }'
Now you can use the sessionId
variable in your HTTP target config:
providers:
- id: https
config:
url: 'https://example.com/api'
headers:
'x-promptfoo-session': '{{sessionId}}'
body:
user_message: '{{prompt}}'
Request Retries
The HTTP provider automatically retries failed requests in the following scenarios:
- Rate limiting (HTTP 429)
- Server errors
- Network failures
By default, it will attempt up to 4 retries with exponential backoff. You can configure the maximum number of retries using the maxRetries
option:
providers:
- id: http
config:
url: https://api.example.com/v1/chat
maxRetries: 2 # Override default of 4 retries
Reference
Supported config options:
Option | Type | Description |
---|---|---|
url | string | The URL to send the HTTP request to. If not provided, the id of the provider will be used as the URL. |
request | string | A raw HTTP request to send. This will override the url , method , headers , body , and queryParams options. |
method | string | HTTP method (GET, POST, etc). Defaults to POST if body is provided, GET otherwise. |
headers | Record<string, string> | Key-value pairs of HTTP headers to include in the request. |
body | object | string | The request body. For POST requests, objects are automatically stringified as JSON. |
queryParams | Record<string, string> | Key-value pairs of query parameters to append to the URL. |
transformRequest | string | Function | A function, string template, or file path to transform the prompt before sending it to the API. |
transformResponse | string | Function | Transforms the API response using a JavaScript expression (e.g., 'json.result'), function, or file path (e.g., 'file://parser.js'). Replaces the deprecated responseParser field. |
maxRetries | number | Maximum number of retry attempts for failed requests. Defaults to 4. |
validateStatus | Function | A function that takes a status code and returns a boolean indicating if the response should be treated as successful. By default, accepts all status codes. |
Note: All string values in the config (including those nested in headers
, body
, and queryParams
) support Nunjucks templating. This means you can use the {{prompt}}
variable or any other variables passed in the test context.
In addition to a full URL, the provider id
field accepts http
or https
as values.
Configuration Generator
Use the generator below to create an HTTP provider configuration based on your endpoint:
Example Request
Example Response
Error Handling
The HTTP provider throws errors for:
- Network errors or request failures
- Invalid response parsing
- Session parsing errors
- Invalid request configurations
- Status codes that fail the configured validation (if
validateStatus
is set)
By default, all response status codes are accepted. You can customize this using the validateStatus
option:
providers:
- id: https
config:
url: 'https://example.com/api'
# Function-based validation
validateStatus: (status) => status < 500 # Accept any status below 500
# Or string-based expression
validateStatus: 'status >= 200 && status <= 299' # Accept only 2xx responses
# Or load from file
validateStatus: 'file://validators/status.js' # Load default export
validateStatus: 'file://validators/status.js:validateStatus' # Load specific function
Example validator file (validators/status.js
):
export default (status) => status < 500;
// Or named export
export function validateStatus(status) {
return status < 500;
}
The provider automatically retries certain errors (like rate limits) based on maxRetries
, while other errors are thrown immediately.