How to
Deploy on Amazon Elastic Kubernetes Service (EKS)
LLMstudio is available as a container service on AWS Marketplace. This tutorial shows how to deploy the containerized application on EKS.
Prerequisites
To follow this tutorial, you need to:
- Have the AWSMarketplaceFullAccess policy attached to your AWS account
- Have a valid subscription to LLMstudio on AWS Marketplace
Set Up Prerequisite Command-Line Tools
- Open AWS CloudShell
- Run the following commands to install the prerequisite command-line tools:
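Before or after installing, it can help to confirm which tools are on the PATH. The sketch below assumes the prerequisite tools are `aws`, `kubectl`, and `eksctl`; that list is an assumption based on the steps in this tutorial, so adjust it to match your setup.

```shell
# Report which of the assumed prerequisite tools are missing from the PATH.
# The tool list (aws, kubectl, eksctl) is an assumption, not taken from the product docs.
missing=""
for tool in aws kubectl eksctl; do
  command -v "$tool" >/dev/null 2>&1 || missing="$missing $tool"
done

if [ -n "$missing" ]; then
  echo "Missing tools:$missing"
else
  echo "All tools found"
fi
```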
Pull Images from ECR
When you subscribe to the product on AWS Marketplace, you’ll be given a command to pull the images from ECR. After running that command, you’ll get the ECR repository URL and the image version, which you need to set as environment variables.
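A minimal sketch of setting those variables follows. The variable names `LLMSTUDIO_IMAGE_REPO` and `LLMSTUDIO_IMAGE_TAG` are hypothetical, and the values are placeholders to replace with the repository URL and version from your own pull command.

```shell
# Hypothetical variable names; replace the placeholder values with the
# ECR repository URL and image version shown for your subscription.
export LLMSTUDIO_IMAGE_REPO="<ecr-repository-url>"
export LLMSTUDIO_IMAGE_TAG="<image-version>"

echo "Image: ${LLMSTUDIO_IMAGE_REPO}:${LLMSTUDIO_IMAGE_TAG}"
```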
Create Cluster
- Create a file called cluster.yaml with the following format:
- Run the following command to create an EKS cluster.
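The two steps above might look like the following sketch. It assumes `eksctl` is the tool used to create the cluster; the cluster name, region, node group name, instance type, and capacity are illustrative values, not recommendations from the product documentation.

```shell
# Write an eksctl ClusterConfig. All names and sizes below are illustrative.
cat > cluster.yaml <<'EOF'
apiVersion: eksctl.io/v1alpha5
kind: ClusterConfig
metadata:
  name: llmstudio-cluster
  region: us-east-1
managedNodeGroups:
  - name: llmstudio-nodes
    instanceType: t3.medium
    desiredCapacity: 1
EOF

# Create the cluster described by the config file (defaults to cluster.yaml).
create_cluster() {
  eksctl create cluster -f "${1:-cluster.yaml}"
}
```

Calling `create_cluster` then starts cluster creation; it needs `eksctl` and valid AWS credentials in the current shell.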
Wait for EKS Cluster
- It takes 10-20 minutes for the EKS cluster to be created.
- You will see a prompt in the CLI when the cluster is ready.
Create Deployment File
- Create a file called deployment.yaml with the following format:
- Add the environment variables you need, e.g., the API keys for the models you plan to use:
- Run the following command to create a deployment file for your container image with the environment variables set:
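One way to sketch these steps is to generate the manifest from a shell heredoc so the environment variables are substituted as the file is written. The variable names, the `llmstudio` resource names and label, and the `OPENAI_API_KEY` entry are all hypothetical; the product's actual manifest may require other fields, such as container ports.

```shell
# Fall back to placeholders if the (hypothetical) variables are not set.
: "${LLMSTUDIO_IMAGE_REPO:=<ecr-repository-url>}"
: "${LLMSTUDIO_IMAGE_TAG:=<image-version>}"
: "${OPENAI_API_KEY:=<your-api-key>}"

# The unquoted EOF delimiter lets the shell expand the variables in the manifest.
cat > deployment.yaml <<EOF
apiVersion: apps/v1
kind: Deployment
metadata:
  name: llmstudio
spec:
  replicas: 1
  selector:
    matchLabels:
      app: llmstudio
  template:
    metadata:
      labels:
        app: llmstudio
    spec:
      containers:
        - name: llmstudio
          image: ${LLMSTUDIO_IMAGE_REPO}:${LLMSTUDIO_IMAGE_TAG}
          env:
            - name: OPENAI_API_KEY
              value: "${OPENAI_API_KEY}"
EOF
```

Writing API keys directly into the manifest keeps the example short; a Kubernetes Secret is usually the safer choice outside of a quick trial.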
Deploy Container Image
Run the following command to deploy the container image into the EKS cluster:
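A minimal sketch of the deploy step, assuming the manifest is applied with `kubectl apply`. The helper name `deploy_manifest` is hypothetical; it only adds a check that the file exists before applying it.

```shell
# Apply a manifest to the current cluster (defaults to deployment.yaml).
deploy_manifest() {
  manifest="${1:-deployment.yaml}"
  if [ ! -f "$manifest" ]; then
    echo "Manifest not found: $manifest" >&2
    return 1
  fi
  kubectl apply -f "$manifest"
}
```

For example, `deploy_manifest deployment.yaml` applies the file created in the previous step.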
Check Pod Status
Once the container starts, check the pod status with the following command:
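The status check can be sketched with `kubectl get pods`. The helper below also polls until the pod reports `Running`; it assumes the pods carry a hypothetical label such as `app=llmstudio`, and the function name, attempt count, and delay are illustrative.

```shell
# Poll until the first pod matching the selector reports phase "Running".
# Arguments: label selector, max attempts (default 30), delay in seconds (default 10).
wait_for_running() {
  selector="$1"
  attempts="${2:-30}"
  delay="${3:-10}"
  i=0
  while [ "$i" -lt "$attempts" ]; do
    phase=$(kubectl get pods -l "$selector" -o jsonpath='{.items[0].status.phase}' 2>/dev/null)
    if [ "$phase" = "Running" ]; then
      # Print the usual pod table once the pod is up.
      kubectl get pods -l "$selector"
      return 0
    fi
    i=$((i + 1))
    sleep "$delay"
  done
  echo "Pod did not reach Running after $attempts attempts" >&2
  return 1
}
```

For example, `wait_for_running app=llmstudio` checks every 10 seconds for up to 30 attempts.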
- Wait and re-run the previous command until the STATUS shows Running.
- The output should look similar to this:
View Logs
Run the following command to get the log output of the pod:
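A minimal sketch of fetching the logs with `kubectl logs`, selecting the pod by a hypothetical `app=llmstudio` label rather than by its generated name. The helper name and the default of 100 lines are illustrative.

```shell
# Print recent log lines from pods matching a label selector.
# Arguments: label selector (default app=llmstudio), number of lines (default 100).
show_logs() {
  kubectl logs -l "${1:-app=llmstudio}" --tail="${2:-100}"
}
```

To read the logs of one specific pod instead, pass its name from the pod status output to `kubectl logs`.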
You’ll see the application server startup:
And that’s it! You’ve deployed LLMstudio 🎉