transformer
or a diffuser
the based model from Hugging face.
Hugging Face
-> Copy and create a repo in GitHub
-> Load the model repo into Inferless.
You can use the imported GitHub Repo to change the pre-processing and post-processing code.
"Model Name"
(you can also use the copy button), Task Type
, Model Framework,
and Model Type
. These will be required for the next steps.
The fields to be copied/noted are mentioned in RED.
"Huggingface"
. An import wizard will open up.Click on Add Model
model name
(The name that you wish to call your model), Choose the model type
(Eg: Transformer), Choose the task type
(Eg: Text generation) and Huggingface model name
.Enter the details as noted
Automatic rebuild
for your model, enable it
Enter the details as noted
Set runtime and configuration
Deploy
to start the model import process.
Review all the details carefully before proceeding
"In Progress/Failed"
tab.View the model under `In-Progress/ Failed`
"My Model" -> API -> Inference Endpoint details.
Here you would find the API endpoints that can be called. You can click on the copy button on the right and can call your model.
Under the API Tab, you can view the API endpoint details.
Sample for now