s3://infer-global-models/sample-module/gpt2-medium-1.5gb.zip
"Add a custom model"
button that you see on the top right. An import wizard will open up.Click on Add Model
AWS S3
as the method of uploading.
AWS account
. This is a mandatory step as this helps us download the file from your AWS account.
After you have selected Repo
Cloud URL
, which is a S3 Link of the Model file from AWS.
Enter the details as noted
Automatic rebuild
for your model, enable it
Set runtime and configuration
Deploy
to start the model import process.
Review all the details carefully before proceeding
"In Progress/Failed"
tab.View the model under `In-Progress/ Failed`
-> API -> Inference Endpoint details.
Here you would find the API endpoints that can be called. You can click on the copy button on the right and can call your model.
Under the API Tab, you can view the API endpoint details.
Sample for now