Deploy a model in production

Once you have trained your model, you can deploy it in production. This section provides several guides in how to deploy your model in different environments.

  Overview

Start here to get an overview of the different deployment options.

Deployment options in AI4OS
 Deploy in the platform (serverless)

Deploy your model in the platform using the serverless option, using a shared serverless environment.

Deploy a model on the AI4OS Inference platform using the Dashboard
Deploy in the platform (dedicated)

Deploy your model in the platform using a dedicated deployment and a load balancer.

Deploy a model on AI4OS dedicated resources using the Dashboard
 Deploy in your cloud

Deploy your model in your cloud using the provided Docker image.

Deploy a model on your own cloud resources
 Manual serverless deployment

Deploy your model in the platform using the serverless option, but manually configuring the deployment. This is an advanced option.

Manually deploy a serverless inference endpoint