Supported Frameworks
Below are the supported frameworks for models that can be uploaded into Inferless.
- PyTorch
- TensorFlow
- ONNX (currently not supported)
File structure requirements
If the model in SageMaker is:
- PyTorch
- The model file must be named model.pt.
- By default, AWS SageMaker creates it as model.pth.
- To produce model.pt instead, please follow the instructions below.
- The training script used in SageMaker typically saves the model using torch.save.
- The Inferless PyTorch runtime, however, needs a model.pt file, which can be created by modifying the training script so that it writes model.pt instead of model.pth.
- Once this is done, the estimator.fit function trains the model and saves the trained model as model.pt.
- The model can then be uploaded to S3 using the estimator.deploy function.
- You can now upload the model to Inferless by entering its ARN link in the model import option.
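The training-script change described for the PyTorch case might look like the minimal sketch below. The helper names `artifact_path` and `save_for_inferless` are hypothetical and not part of SageMaker or Inferless. The sketch also assumes that the runtime expects a TorchScript archive, which this page does not state; if only the file name matters, replacing the two TorchScript lines with `torch.save(model, path)` would be the simpler variant.

```python
import os
from typing import Optional


def artifact_path(model_dir: Optional[str] = None, filename: str = "model.pt") -> str:
    """Return the full path the training script should write the model to."""
    # SageMaker exposes the model output directory as SM_MODEL_DIR;
    # /opt/ml/model is its usual value inside a training container.
    if model_dir is None:
        model_dir = os.environ.get("SM_MODEL_DIR", "/opt/ml/model")
    return os.path.join(model_dir, filename)


def save_for_inferless(model, model_dir: Optional[str] = None) -> str:
    """Write `model` as model.pt and return the path (hypothetical helper)."""
    import torch  # imported lazily so artifact_path works without PyTorch

    path = artifact_path(model_dir)
    # ASSUMPTION: the runtime loads a TorchScript archive. The source only
    # says that a file named model.pt is required.
    scripted = torch.jit.script(model)
    scripted.save(path)
    return path
```

In a training script, calling `save_for_inferless(model)` at the end of training would take the place of the existing `torch.save` call that writes model.pth.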
- TensorFlow
- No additional steps or file modifications are needed. The ARN link of the generated model can be used directly for import.
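Before pasting an ARN into the import form, it can help to check that it is well formed. The sketch below assumes the ARN refers to a SageMaker model resource, which follows the general AWS pattern `arn:<partition>:sagemaker:<region>:<account-id>:model/<name>`; this page does not say which resource type Inferless expects, and `parse_model_arn` is a hypothetical helper, not an Inferless or AWS function.

```python
import re
from typing import NamedTuple, Optional

# ASSUMPTION: the ARN identifies a SageMaker *model* resource.
_MODEL_ARN = re.compile(
    r"^arn:(?P<partition>aws[a-z-]*):sagemaker:"
    r"(?P<region>[a-z0-9-]+):(?P<account>\d{12}):"
    r"model/(?P<name>[A-Za-z0-9-]+)$"
)


class ModelArn(NamedTuple):
    partition: str
    region: str
    account: str
    name: str


def parse_model_arn(arn: str) -> Optional[ModelArn]:
    """Split a SageMaker model ARN into its parts, or return None if malformed."""
    match = _MODEL_ARN.match(arn.strip())
    if match is None:
        return None
    return ModelArn(**match.groupdict())
```

A `None` result usually points to a copy-paste problem, such as an S3 URI or an endpoint ARN being supplied in place of the model ARN.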