curl --location 'https://api.inferless.com/rest/model/settings/update/' \
--header 'Authorization: <workspace-token>' \
--header 'Content-Type: application/json' \
--data '{
"model_id": "<model-id>",
"data": {
"min_replica": 0,
"max_replica": 2,
"scale_down_delay": 30,
"inference_time": 120,
"is_dedicated": false,
"machine_type": "T4",
"container_concurrency": 10,
"is_input_output_enabled": false
}
}'
{
"status": "success",
"details": "Model updated successfully"
}
This endpoint updates the settings of a model. You can configure Min/Max Replicas, Timeout and Concurrency Settings
curl --location 'https://api.inferless.com/rest/model/settings/update/' \
--header 'Authorization: <workspace-token>' \
--header 'Content-Type: application/json' \
--data '{
"model_id": "<model-id>",
"data": {
"min_replica": 0,
"max_replica": 2,
"scale_down_delay": 30,
"inference_time": 120,
"is_dedicated": false,
"machine_type": "T4",
"container_concurrency": 10,
"is_input_output_enabled": false
}
}'
{
"status": "success",
"details": "Model updated successfully"
}
Show properties
curl --location 'https://api.inferless.com/rest/model/settings/update/' \
--header 'Authorization: <workspace-token>' \
--header 'Content-Type: application/json' \
--data '{
"model_id": "<model-id>",
"data": {
"min_replica": 0,
"max_replica": 2,
"scale_down_delay": 30,
"inference_time": 120,
"is_dedicated": false,
"machine_type": "T4",
"container_concurrency": 10,
"is_input_output_enabled": false
}
}'
{
"status": "success",
"details": "Model updated successfully"
}