Login to Inferless.com console and copy the CLI keys from Keys section here
Now install Inferless CLI package using this command
Login to Inferless CLI using this command and paste the CLI keys here, you’ll be logged in
To deploy a model with Inferless you would ideally require 3 files
app.py
is a Python file that plays a crucial role in setting up and running models on the Inferless platform. It typically contains a class with three main functions:input_schema.py
is a python file specifies the input parameters for your model’s API calls.config.yaml(Runtime)
refers to the software and dependencies you can add to your runtime environment to support your model’s specific needs.To get started with your first deployment we will provide you app.py
and input_schema.py
. Run this command to download the files
Now that the files are downloaded. Initialise the model using this command
Once the model is initialised. Deploy it using this command
Hurray! You’ve successfully deployed your first model with us
Runtime in Inferless refers to the environment and configuration in which your model runs. It includes:
Create a new file called inferless_runtime_config.yaml
with below data
Run the below command to create the runtime
Deploy with runtime
Volumes in Inferless are NFS-like writable storage spaces that can be connected to multiple replicas simultaneously. They serve several key purposes:
To create a new volume. Run this command
New volume will be created and you will be shown the infer_path
where you can store the weights. Make sure you keep this handy
Once the volume is created. Upload the weights using this command and paste the infer_path
in the destination
Your files will be copied to the server and your volume is ready to be used.
To use these weights, in app.py
you can specify the mount path from where the weights can be accessed (Eg: /var/nfs-mount/<volume_name>
)
Run the command to create a new model using the above model weights
Once the model is initialized. Deploy it using this command (Make sure the path defined inside app.py
and here in the command should be same)