There are several ways to import your model, but for the purpose of this example, we will be using Hugging Face. By the end of this tutorial, you will have the ability to deploy a Hugging Face model in Inferless.
"Model Name"
(you can also use the copy button), Task Type
, Model Framework,
and Model Type
. These will be required for the next steps."HuggingFace"
button that you see on dashboard. An import wizard will open up.Click on Add Model
model name
(The name that you wish to call your model), Choose the model type
(Eg: Transformer), Choose the task type
(Eg: Text generation) and Huggingface model name
.Enter the details as noted
Automatic rebuild
for your model, enable it
Modify the code
Set runtime and configuration
Deploy
to start the model import process.
Review all the details carefully before proceeding
"In Progress/Failed"
tab.View the model under `In-Progress/ Failed`
-> API -> Inference Endpoint details.
Here you would find the API endpoints that can be called. You can click on the copy button on the right and can call your model.
Under the API Tab, you can view the API endpoint details.