Authorizations
Your workspace API token. You can find it in Workspace Settings
Body
The ID of the model whose settings you want to update.
The settings you want to update for the model.
curl --location 'https://api.inferless.com/rest/model/settings/update/' \
--header 'Authorization: <workspace-token>' \
--header 'Content-Type: application/json' \
--data '{
"model_id": "<model-id>",
"data": {
"min_replica": 0,
"max_replica": 2,
"scale_down_delay": 30,
"inference_time": 120,
"is_dedicated": false,
"machine_type": "T4",
"container_concurrency": 10,
"is_input_output_enabled": false
}
}'
{
"status": "success",
"details": "Model updated successfully"
}
This endpoint updates the settings of a model. You can configure Min/Max Replicas, Timeout and Concurrency Settings
curl --location 'https://api.inferless.com/rest/model/settings/update/' \
--header 'Authorization: <workspace-token>' \
--header 'Content-Type: application/json' \
--data '{
"model_id": "<model-id>",
"data": {
"min_replica": 0,
"max_replica": 2,
"scale_down_delay": 30,
"inference_time": 120,
"is_dedicated": false,
"machine_type": "T4",
"container_concurrency": 10,
"is_input_output_enabled": false
}
}'
{
"status": "success",
"details": "Model updated successfully"
}
Show properties
curl --location 'https://api.inferless.com/rest/model/settings/update/' \
--header 'Authorization: <workspace-token>' \
--header 'Content-Type: application/json' \
--data '{
"model_id": "<model-id>",
"data": {
"min_replica": 0,
"max_replica": 2,
"scale_down_delay": 30,
"inference_time": 120,
"is_dedicated": false,
"machine_type": "T4",
"container_concurrency": 10,
"is_input_output_enabled": false
}
}'
{
"status": "success",
"details": "Model updated successfully"
}