Skip to content

アプリケーション内で をクリックすると、お使いのDataRobotバージョンに関する全プラットフォームドキュメントにアクセスできます。

LLM Test Results (GenAI)

This page outlines the operations, endpoints, parameters, and example requests and responses for the LLM Test Results (GenAI).

GET /api/v2/genai/llmTestResults/

List LLM test results.

Code samples

# You can also use wget
curl -X GET https://app.datarobot.com/api/v2/genai/llmTestResults/ \
  -H "Accept: application/json" \
  -H "Authorization: Bearer {access-token}"

Parameters

Name In Type Required Description
llmTestConfigurationId query string false LLM Test Configuration ID.
llmBlueprintId query string false LLM Blueprint ID.
llmTestSuiteId query string false LLM Test Suite ID.
offset query integer false Skip the specified number of values.
limit query integer false Retrieve only the specified number of values.

Example responses

200 Response

{
  "count": 0,
  "data": [
    {
      "creationDate": "2019-08-24T14:15:22Z",
      "creationUserId": "string",
      "creationUserName": "string",
      "errorMessage": "An unknown error occurred while generating a LLM test result.",
      "errorResolution": "Please validate that the submitted LLM Test Configuration is valid. If so, try submitting a new LLM Test Result. Otherwise contact the DataRobot team for assistance.",
      "executionStatus": "NEW",
      "gradingResult": "PASS",
      "id": "string",
      "insightEvaluationResults": [
        {
          "aggregationType": "average",
          "aggregationValue": 0,
          "chatId": "string",
          "chatName": "string",
          "evaluationDatasetConfigurationId": "string",
          "evaluationDatasetName": "string",
          "evaluationName": "string",
          "executionStatus": "NEW",
          "gradingResult": "PASS",
          "id": "string",
          "insightGradingCriteria": {
            "passThreshold": 100
          },
          "lastUpdateDate": "2019-08-24T14:15:22Z",
          "llmTestResultId": "string",
          "maxNumPrompts": 0,
          "metricName": "string",
          "promptSamplingStrategy": "random_without_replacement"
        }
      ],
      "llmBlueprintId": "string",
      "llmBlueprintSnapshot": {
        "description": "string",
        "id": "string",
        "llmId": "azure-openai-gpt-3.5-turbo",
        "llmSettings": {
          "maxCompletionLength": 0,
          "systemPrompt": "string",
          "temperature": 0,
          "topP": 0
        },
        "name": "string",
        "playgroundId": "string",
        "promptType": "CHAT_HISTORY_AWARE",
        "snapshotDate": "2019-08-24T14:15:22Z",
        "vectorDatabaseId": "string",
        "vectorDatabaseSettings": {
          "addNeighborChunks": false,
          "maxDocumentsRetrievedPerPrompt": 1,
          "maxTokens": 1
        }
      },
      "llmTestConfigurationId": "string",
      "llmTestConfigurationName": "string",
      "llmTestGradingCriteria": {
        "passThreshold": 100
      },
      "llmTestSuiteId": "string",
      "passPercentage": 0,
      "useCaseId": "string"
    }
  ],
  "next": "string",
  "previous": "string",
  "totalCount": 0
}

Responses

Status Meaning Description Schema
200 OK Successful Response ListLLMTestResultResponse
422 Unprocessable Entity Validation Error HTTPValidationErrorResponse

To perform this operation, you must be authenticated by means of one of the following methods:

BearerAuth

POST /api/v2/genai/llmTestResults/

Create a new LLM test result.

Code samples

# You can also use wget
curl -X POST https://app.datarobot.com/api/v2/genai/llmTestResults/ \
  -H "Content-Type: application/json" \
  -H "Accept: application/json" \
  -H "Authorization: Bearer {access-token}" \
  -d '{CreateLLMTestResultRequest}'

Body parameter

{
  "llmBlueprintId": "string",
  "llmTestConfigurationId": "string"
}

Parameters

Name In Type Required Description
body body CreateLLMTestResultRequest true none

Example responses

202 Response

{
  "creationDate": "2019-08-24T14:15:22Z",
  "creationUserId": "string",
  "creationUserName": "string",
  "errorMessage": "An unknown error occurred while generating a LLM test result.",
  "errorResolution": "Please validate that the submitted LLM Test Configuration is valid. If so, try submitting a new LLM Test Result. Otherwise contact the DataRobot team for assistance.",
  "executionStatus": "NEW",
  "gradingResult": "PASS",
  "id": "string",
  "insightEvaluationResults": [
    {
      "aggregationType": "average",
      "aggregationValue": 0,
      "chatId": "string",
      "chatName": "string",
      "evaluationDatasetConfigurationId": "string",
      "evaluationDatasetName": "string",
      "evaluationName": "string",
      "executionStatus": "NEW",
      "gradingResult": "PASS",
      "id": "string",
      "insightGradingCriteria": {
        "passThreshold": 100
      },
      "lastUpdateDate": "2019-08-24T14:15:22Z",
      "llmTestResultId": "string",
      "maxNumPrompts": 0,
      "metricName": "string",
      "promptSamplingStrategy": "random_without_replacement"
    }
  ],
  "llmBlueprintId": "string",
  "llmBlueprintSnapshot": {
    "description": "string",
    "id": "string",
    "llmId": "azure-openai-gpt-3.5-turbo",
    "llmSettings": {
      "maxCompletionLength": 0,
      "systemPrompt": "string",
      "temperature": 0,
      "topP": 0
    },
    "name": "string",
    "playgroundId": "string",
    "promptType": "CHAT_HISTORY_AWARE",
    "snapshotDate": "2019-08-24T14:15:22Z",
    "vectorDatabaseId": "string",
    "vectorDatabaseSettings": {
      "addNeighborChunks": false,
      "maxDocumentsRetrievedPerPrompt": 1,
      "maxTokens": 1
    }
  },
  "llmTestConfigurationId": "string",
  "llmTestConfigurationName": "string",
  "llmTestGradingCriteria": {
    "passThreshold": 100
  },
  "llmTestSuiteId": "string",
  "passPercentage": 0,
  "useCaseId": "string"
}

Responses

Status Meaning Description Schema
202 Accepted Successful Response LLMTestResultResponse
422 Unprocessable Entity Validation Error HTTPValidationErrorResponse

To perform this operation, you must be authenticated by means of one of the following methods:

BearerAuth

DELETE /api/v2/genai/llmTestResults/{llmTestResultId}/

Delete an existing LLM test result.

Code samples

# You can also use wget
curl -X DELETE https://app.datarobot.com/api/v2/genai/llmTestResults/{llmTestResultId}/ \
  -H "Accept: application/json" \
  -H "Authorization: Bearer {access-token}"

Parameters

Name In Type Required Description
llmTestResultId path string true The ID of the LLM Test Result to delete.

Example responses

422 Response

{
  "detail": [
    {
      "loc": [
        "string"
      ],
      "msg": "string",
      "type": "string"
    }
  ]
}

Responses

Status Meaning Description Schema
204 No Content Successful Response None
422 Unprocessable Entity Validation Error HTTPValidationErrorResponse

To perform this operation, you must be authenticated by means of one of the following methods:

BearerAuth

GET /api/v2/genai/llmTestResults/{llmTestResultId}/

Retrieve an existing LLM test result.

Code samples

# You can also use wget
curl -X GET https://app.datarobot.com/api/v2/genai/llmTestResults/{llmTestResultId}/ \
  -H "Accept: application/json" \
  -H "Authorization: Bearer {access-token}"

Parameters

Name In Type Required Description
llmTestResultId path string true The ID of the LLM Test Result to retrieve.

Example responses

200 Response

{
  "creationDate": "2019-08-24T14:15:22Z",
  "creationUserId": "string",
  "creationUserName": "string",
  "errorMessage": "An unknown error occurred while generating a LLM test result.",
  "errorResolution": "Please validate that the submitted LLM Test Configuration is valid. If so, try submitting a new LLM Test Result. Otherwise contact the DataRobot team for assistance.",
  "executionStatus": "NEW",
  "gradingResult": "PASS",
  "id": "string",
  "insightEvaluationResults": [
    {
      "aggregationType": "average",
      "aggregationValue": 0,
      "chatId": "string",
      "chatName": "string",
      "evaluationDatasetConfigurationId": "string",
      "evaluationDatasetName": "string",
      "evaluationName": "string",
      "executionStatus": "NEW",
      "gradingResult": "PASS",
      "id": "string",
      "insightGradingCriteria": {
        "passThreshold": 100
      },
      "lastUpdateDate": "2019-08-24T14:15:22Z",
      "llmTestResultId": "string",
      "maxNumPrompts": 0,
      "metricName": "string",
      "promptSamplingStrategy": "random_without_replacement"
    }
  ],
  "llmBlueprintId": "string",
  "llmBlueprintSnapshot": {
    "description": "string",
    "id": "string",
    "llmId": "azure-openai-gpt-3.5-turbo",
    "llmSettings": {
      "maxCompletionLength": 0,
      "systemPrompt": "string",
      "temperature": 0,
      "topP": 0
    },
    "name": "string",
    "playgroundId": "string",
    "promptType": "CHAT_HISTORY_AWARE",
    "snapshotDate": "2019-08-24T14:15:22Z",
    "vectorDatabaseId": "string",
    "vectorDatabaseSettings": {
      "addNeighborChunks": false,
      "maxDocumentsRetrievedPerPrompt": 1,
      "maxTokens": 1
    }
  },
  "llmTestConfigurationId": "string",
  "llmTestConfigurationName": "string",
  "llmTestGradingCriteria": {
    "passThreshold": 100
  },
  "llmTestSuiteId": "string",
  "passPercentage": 0,
  "useCaseId": "string"
}

Responses

Status Meaning Description Schema
200 OK Successful Response LLMTestResultResponse
422 Unprocessable Entity Validation Error HTTPValidationErrorResponse

To perform this operation, you must be authenticated by means of one of the following methods:

BearerAuth

Schemas

AggregationType

"average"

AggregationType

Properties

Name Type Required Restrictions Description
AggregationType string false The type of the metric aggregation.

Enumerated Values

Property Value
AggregationType [average, percentYes, classPercentCoverage, ngramImportance, guardConditionPercentYes]

AggregationValue

{
  "item": "string",
  "value": 0
}

AggregationValue

Properties

Name Type Required Restrictions Description
item string true The name of the item.
value number true The value associated with the item.

CommonLLMSettings

{
  "maxCompletionLength": 0,
  "systemPrompt": "string",
  "temperature": 0,
  "topP": 0
}

CommonLLMSettings

Properties

Name Type Required Restrictions Description
maxCompletionLength integer¦null false Maximum number of tokens allowed in the completion. The combined count of this value and prompt tokens must be below the model's maximum context size, where prompt token count is comprised of system prompt, user prompt, recent chat history, and vector database citations.
systemPrompt string¦null false maxLength: 500000
System prompt guides the style of the LLM response. It is a "universal" prompt, prepended to all individual prompts.
temperature number¦null false Temperature controls the randomness of model output, where higher values return more diverse output and lower values return more deterministic results.
topP number¦null false Top P sets a threshold that controls the selection of words included in the response, based on a cumulative probability cutoff for token selection. For example, 0.2 considers only the top 20% probability mass. Higher numbers return more diverse options for outputs.

CreateLLMTestResultRequest

{
  "llmBlueprintId": "string",
  "llmTestConfigurationId": "string"
}

CreateLLMTestResultRequest

Properties

Name Type Required Restrictions Description
llmBlueprintId string true The LLM Blueprint ID associated with the LLM Test result.
llmTestConfigurationId string true The use case ID associated with the LLM Test result.

CustomModelLLMSettings

{
  "externalLlmContextSize": 128,
  "systemPrompt": "string",
  "validationId": "string"
}

CustomModelLLMSettings

Properties

Name Type Required Restrictions Description
externalLlmContextSize integer¦null false maximum: 128000
minimum: 128
The external LLM's context size, in tokens. This value is only used for pruning documents supplied to the LLM when a vector database is associated with the LLM blueprint. It does not affect the external LLM's actual context size in any way and is not supplied to the LLM.
systemPrompt string¦null false maxLength: 500000
System prompt guides the style of the LLM response. It is a "universal" prompt, prepended to all individual prompts.
validationId string¦null false The validation ID of the custom model LLM.

ExecutionStatus

"NEW"

ExecutionStatus

Properties

Name Type Required Restrictions Description
ExecutionStatus string false Job and entity execution status.

Enumerated Values

Property Value
ExecutionStatus [NEW, RUNNING, COMPLETED, REQUIRES_USER_INPUT, SKIPPED, ERROR]

GradingResult

"PASS"

GradingResult

Properties

Name Type Required Restrictions Description
GradingResult string false Grading result.

Enumerated Values

Property Value
GradingResult [PASS, FAIL]

HTTPValidationErrorResponse

{
  "detail": [
    {
      "loc": [
        "string"
      ],
      "msg": "string",
      "type": "string"
    }
  ]
}

HTTPValidationErrorResponse

Properties

Name Type Required Restrictions Description
detail [ValidationError] false none

InsightEvaluationResultResponse

{
  "aggregationType": "average",
  "aggregationValue": 0,
  "chatId": "string",
  "chatName": "string",
  "evaluationDatasetConfigurationId": "string",
  "evaluationDatasetName": "string",
  "evaluationName": "string",
  "executionStatus": "NEW",
  "gradingResult": "PASS",
  "id": "string",
  "insightGradingCriteria": {
    "passThreshold": 100
  },
  "lastUpdateDate": "2019-08-24T14:15:22Z",
  "llmTestResultId": "string",
  "maxNumPrompts": 0,
  "metricName": "string",
  "promptSamplingStrategy": "random_without_replacement"
}

InsightEvaluationResultResponse

Properties

Name Type Required Restrictions Description
aggregationType AggregationType¦null false Aggregation type.
aggregationValue any false Aggregation value. None indicates that the aggregation failed.

anyOf

Name Type Required Restrictions Description
» anonymous number false none

or

Name Type Required Restrictions Description
» anonymous [AggregationValue] false [An individual record in an itemized metric aggregation.]

continued

Name Type Required Restrictions Description
chatId string true Chat ID.
chatName string¦null true maxLength: 5000
Chat name.
evaluationDatasetConfigurationId string¦null false Evaluation dataset configuration ID.
evaluationDatasetName string¦null false maxLength: 5000
Evaluation dataset name.
evaluationName string true maxLength: 5000
Evaluation name.
executionStatus ExecutionStatus false The execution status of the insight evaluation result.
gradingResult GradingResult¦null false The grading result for this insight evaluation result. If not specified, execution status is not COMPLETED.
id string true Insight evaluation result ID.
insightGradingCriteria InsightGradingCriteria true Insight grading criteria.
lastUpdateDate string(date-time) true Last update date of the insight evaluation result (ISO 8601 formatted).
llmTestResultId string true LLM test result ID this insight evaluation result is associated to.
maxNumPrompts integer true Number of prompts used in evaluation.
metricName string¦null false Name of the metric.
promptSamplingStrategy PromptSamplingStrategy true Prompt sampling strategy for maxNumPrompts.

InsightGradingCriteria

{
  "passThreshold": 100
}

InsightGradingCriteria

Properties

Name Type Required Restrictions Description
passThreshold integer true maximum: 100
minimum: 0
The percentage threshold for Pass result. Greater than or equal to this threshold indicates a Pass.

LLMBlueprintSnapshot

{
  "description": "string",
  "id": "string",
  "llmId": "azure-openai-gpt-3.5-turbo",
  "llmSettings": {
    "maxCompletionLength": 0,
    "systemPrompt": "string",
    "temperature": 0,
    "topP": 0
  },
  "name": "string",
  "playgroundId": "string",
  "promptType": "CHAT_HISTORY_AWARE",
  "snapshotDate": "2019-08-24T14:15:22Z",
  "vectorDatabaseId": "string",
  "vectorDatabaseSettings": {
    "addNeighborChunks": false,
    "maxDocumentsRetrievedPerPrompt": 1,
    "maxTokens": 1
  }
}

LLMBlueprintSnapshot

Properties

Name Type Required Restrictions Description
description string true The description of the LLMBlueprint at the time of snapshotting.
id string true The ID of the LLMBlueprint for which the snapshot was produced.
llmId LanguageModelTypeId¦null false The ID of the LLM selected for this LLM blueprint.
llmSettings any false A key/value dictionary of LLM settings.

anyOf

Name Type Required Restrictions Description
» anonymous CommonLLMSettings false The settings that are available for all non-custom LLMs.

or

Name Type Required Restrictions Description
» anonymous CustomModelLLMSettings false The settings that are available for custom model LLMs.

continued

Name Type Required Restrictions Description
name string true The name of the LLMBlueprint at the time of snapshotting.
playgroundId string true The playground id of the LLMBlueprint.
promptType PromptType true The prompting type of the LLMBlueprint at the time of snapshotting.
snapshotDate string(date-time) false The date when the snapshot was produced.
vectorDatabaseId string¦null false The ID of the vector database linked to this LLM blueprint.
vectorDatabaseSettings VectorDatabaseSettings¦null false A key/value dictionary of vector database settings.

LLMTestGradingCriteria

{
  "passThreshold": 100
}

LLMTestGradingCriteria

Properties

Name Type Required Restrictions Description
passThreshold integer true maximum: 100
minimum: 0
The percentage threshold for Pass results across dataset-insight pairs.

LLMTestResultErrorMessages

"An unknown error occurred while generating a LLM test result."

LLMTestResultErrorMessages

Properties

Name Type Required Restrictions Description
LLMTestResultErrorMessages string false Error messages for LLM Test Result.

Enumerated Values

Property Value
LLMTestResultErrorMessages [An unknown error occurred while generating a LLM test result., One or more types for the aggregations produced are invalid for the LLM test result., An error occurred while aggregating the LLM test results., One or more LLM blueprints specified in the LLM Test configuration are invalid and/or not found., One or more evaluation datasets specified in the LLM Test configuration are invalid and/or not found., The LLM Test configuration is invalid and/or not found., An error occurred while uploading the out-of-the-box dataset specified in the LLM Test Result.]

LLMTestResultErrorResolutions

"Please validate that the submitted LLM Test Configuration is valid. If so, try submitting a new LLM Test Result. Otherwise contact the DataRobot team for assistance."

LLMTestResultErrorResolutions

Properties

Name Type Required Restrictions Description
LLMTestResultErrorResolutions string false Error resolutions for LLM Test Result.

Enumerated Values

Property Value
LLMTestResultErrorResolutions [Please validate that the submitted LLM Test Configuration is valid. If so, try submitting a new LLM Test Result. Otherwise contact the DataRobot team for assistance., Please ensure that the metrics selected in the LLM Test Configuration utilized for the LLM Test Result can produce a supported aggregation type. After correcting the LLM Test Configuration, try submitting a new LLM Test Result. If the issue persists, contact the DataRobot team for assistance., Please validate that the submitted LLM Test Configuration is valid. Please verify that the dataset evaluations in the LLM Test Configuration are valid for compliance testing. After correcting the LLM Test Configuration, if needed, try submitting a new LLM Test Result. Otherwise contact the DataRobot team for assistance., Please ensure that the LLM blueprints specified in the LLM Test Configuration still exist and are valid. If the issue persists, contact the DataRobot team for assistance., Please ensure that the evaluation datasets specified in the LLM Test Configuration still exist and are valid. If the issue persists, contact the DataRobot team for assistance., Please ensure that the LLM Test Configuration still exists and is valid. If the issue persists, contact the DataRobot team for assistance., Please ensure that the out-of-the-box dataset specified in the LLM Test Result is valid. Please try to re-submit the LLM Test Result as this may have been a temporary issue. If the issue persists, contact the DataRobot team for assistance.]

LLMTestResultResponse

{
  "creationDate": "2019-08-24T14:15:22Z",
  "creationUserId": "string",
  "creationUserName": "string",
  "errorMessage": "An unknown error occurred while generating a LLM test result.",
  "errorResolution": "Please validate that the submitted LLM Test Configuration is valid. If so, try submitting a new LLM Test Result. Otherwise contact the DataRobot team for assistance.",
  "executionStatus": "NEW",
  "gradingResult": "PASS",
  "id": "string",
  "insightEvaluationResults": [
    {
      "aggregationType": "average",
      "aggregationValue": 0,
      "chatId": "string",
      "chatName": "string",
      "evaluationDatasetConfigurationId": "string",
      "evaluationDatasetName": "string",
      "evaluationName": "string",
      "executionStatus": "NEW",
      "gradingResult": "PASS",
      "id": "string",
      "insightGradingCriteria": {
        "passThreshold": 100
      },
      "lastUpdateDate": "2019-08-24T14:15:22Z",
      "llmTestResultId": "string",
      "maxNumPrompts": 0,
      "metricName": "string",
      "promptSamplingStrategy": "random_without_replacement"
    }
  ],
  "llmBlueprintId": "string",
  "llmBlueprintSnapshot": {
    "description": "string",
    "id": "string",
    "llmId": "azure-openai-gpt-3.5-turbo",
    "llmSettings": {
      "maxCompletionLength": 0,
      "systemPrompt": "string",
      "temperature": 0,
      "topP": 0
    },
    "name": "string",
    "playgroundId": "string",
    "promptType": "CHAT_HISTORY_AWARE",
    "snapshotDate": "2019-08-24T14:15:22Z",
    "vectorDatabaseId": "string",
    "vectorDatabaseSettings": {
      "addNeighborChunks": false,
      "maxDocumentsRetrievedPerPrompt": 1,
      "maxTokens": 1
    }
  },
  "llmTestConfigurationId": "string",
  "llmTestConfigurationName": "string",
  "llmTestGradingCriteria": {
    "passThreshold": 100
  },
  "llmTestSuiteId": "string",
  "passPercentage": 0,
  "useCaseId": "string"
}

LLMTestResultResponse

Properties

Name Type Required Restrictions Description
creationDate string(date-time) true LLM test result creation date (ISO 8601 formatted).
creationUserId string true ID of the user that created this LLM test result.
creationUserName string true The name of the user who created this LLM result.
errorMessage LLMTestResultErrorMessages¦null false The error message if the LLM Test Result failed.
errorResolution LLMTestResultErrorResolutions¦null false The error resolution message if the LLM Test Result failed.
executionStatus ExecutionStatus true The LLM Test execution status.
gradingResult GradingResult¦null false The grading result based on the llm test grading criteria. If not specified, execution status is not COMPLETED.
id string true LLM test result ID.
insightEvaluationResults [InsightEvaluationResultResponse] true The Insight evaluation results.
llmBlueprintId string true LLM Blueprint ID.
llmBlueprintSnapshot LLMBlueprintSnapshot true A snapshot of the llm blueprint entity at the time of LLM Test execution.
llmTestConfigurationId string true LLM test configuration ID this LLM result is associated to.
llmTestConfigurationName string¦null true maxLength: 5000
minLength: 1
minLength: 1
Name of the LLM test configuration this LLM result is associated to.
llmTestGradingCriteria LLMTestGradingCriteria true LLM test grading criteria.
llmTestSuiteId string¦null false LLM test suite ID to which the LLM test configuration is associated to.
passPercentage number¦null false The percentage of underlying insight evaluation results that have a PASS grading result. If not specified, execution status is not COMPLETED.
useCaseId string true Use case ID this LLM test result belongs to.

LanguageModelTypeId

"azure-openai-gpt-3.5-turbo"

LanguageModelTypeId

Properties

Name Type Required Restrictions Description
LanguageModelTypeId string false The ID that defines the type of the LLM.

Enumerated Values

Property Value
LanguageModelTypeId [azure-openai-gpt-3.5-turbo, azure-openai-gpt-3.5-turbo-16k, azure-openai-gpt-4, azure-openai-gpt-4-32k, azure-openai-gpt-4-turbo, azure-openai-gpt-4-o, amazon-titan, anthropic-claude-2, anthropic-claude-3-haiku, anthropic-claude-3-sonnet, anthropic-claude-3-opus, google-bison, google-gemini-1.5-flash, google-gemini-1.5-pro, custom-model]

ListLLMTestResultResponse

{
  "count": 0,
  "data": [
    {
      "creationDate": "2019-08-24T14:15:22Z",
      "creationUserId": "string",
      "creationUserName": "string",
      "errorMessage": "An unknown error occurred while generating a LLM test result.",
      "errorResolution": "Please validate that the submitted LLM Test Configuration is valid. If so, try submitting a new LLM Test Result. Otherwise contact the DataRobot team for assistance.",
      "executionStatus": "NEW",
      "gradingResult": "PASS",
      "id": "string",
      "insightEvaluationResults": [
        {
          "aggregationType": "average",
          "aggregationValue": 0,
          "chatId": "string",
          "chatName": "string",
          "evaluationDatasetConfigurationId": "string",
          "evaluationDatasetName": "string",
          "evaluationName": "string",
          "executionStatus": "NEW",
          "gradingResult": "PASS",
          "id": "string",
          "insightGradingCriteria": {
            "passThreshold": 100
          },
          "lastUpdateDate": "2019-08-24T14:15:22Z",
          "llmTestResultId": "string",
          "maxNumPrompts": 0,
          "metricName": "string",
          "promptSamplingStrategy": "random_without_replacement"
        }
      ],
      "llmBlueprintId": "string",
      "llmBlueprintSnapshot": {
        "description": "string",
        "id": "string",
        "llmId": "azure-openai-gpt-3.5-turbo",
        "llmSettings": {
          "maxCompletionLength": 0,
          "systemPrompt": "string",
          "temperature": 0,
          "topP": 0
        },
        "name": "string",
        "playgroundId": "string",
        "promptType": "CHAT_HISTORY_AWARE",
        "snapshotDate": "2019-08-24T14:15:22Z",
        "vectorDatabaseId": "string",
        "vectorDatabaseSettings": {
          "addNeighborChunks": false,
          "maxDocumentsRetrievedPerPrompt": 1,
          "maxTokens": 1
        }
      },
      "llmTestConfigurationId": "string",
      "llmTestConfigurationName": "string",
      "llmTestGradingCriteria": {
        "passThreshold": 100
      },
      "llmTestSuiteId": "string",
      "passPercentage": 0,
      "useCaseId": "string"
    }
  ],
  "next": "string",
  "previous": "string",
  "totalCount": 0
}

ListLLMTestResultResponse

Properties

Name Type Required Restrictions Description
count integer true The number of records on this page.
data [LLMTestResultResponse] true The list of records.
next string¦null true The URL to the next page, or null if there is no such page.
previous string¦null true The URL to the previous page, or null if there is no such page.
totalCount integer true The total number of records.

PromptSamplingStrategy

"random_without_replacement"

PromptSamplingStrategy

Properties

Name Type Required Restrictions Description
PromptSamplingStrategy string false The prompt sampling strategy for the evaluation dataset configuration.

Enumerated Values

Property Value
PromptSamplingStrategy [random_without_replacement, first_n_rows]

PromptType

"CHAT_HISTORY_AWARE"

PromptType

Properties

Name Type Required Restrictions Description
PromptType string false Determines whether chat history is submitted as context to the user prompt.

Enumerated Values

Property Value
PromptType [CHAT_HISTORY_AWARE, ONE_TIME_PROMPT]

ValidationError

{
  "loc": [
    "string"
  ],
  "msg": "string",
  "type": "string"
}

ValidationError

Properties

Name Type Required Restrictions Description
loc [anyOf] true none

anyOf

Name Type Required Restrictions Description
» anonymous string false none

or

Name Type Required Restrictions Description
» anonymous integer false none

continued

Name Type Required Restrictions Description
msg string true none
type string true none

VectorDatabaseSettings

{
  "addNeighborChunks": false,
  "maxDocumentsRetrievedPerPrompt": 1,
  "maxTokens": 1
}

VectorDatabaseSettings

Properties

Name Type Required Restrictions Description
addNeighborChunks boolean false Add neighboring chunks to those that the similarity search retrieves, such that when selected, search returns i, i-1, and i+1.
maxDocumentsRetrievedPerPrompt integer¦null false maximum: 10
minimum: 1
The maximum number of chunks to retrieve from the vector database.
maxTokens integer¦null false maximum: 51200
minimum: 1
The maximum number of tokens to retrieve from the vector database.

更新しました January 24, 2025