List Existing Models
First, let’s verify that no models are currently loaded by running the list command:
Prepare Your Model
Before importing, ensure your model is stored in an S3 bucket. We recommend using a dedicated S3 bucket for storing models provided by Poolside.
Import the Models
Use the following command to import your chat model (malibu):
Ensure that your EKS pod role has sufficient permissions to read from your target S3 buckets.
<MODEL_NAME>: the unique name of your model (e.g., malibu_0703)
<NUMBER_OF_REPLICAS>: the number of replicas to create (e.g., 2)
<YOU_MODEL_BUCKET>: the S3 bucket where your model is stored
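The read permission mentioned above can be sketched as an IAM policy document. The snippet below is a minimal illustration, not Poolside's official policy: the helper name `read_only_bucket_policy` and the bucket name `my-poolside-models` are hypothetical, and this guide does not state whether the import needs any actions beyond `s3:ListBucket` and `s3:GetObject`.

```python
import json


def read_only_bucket_policy(bucket: str) -> dict:
    """Build an IAM policy document granting read access to one S3 bucket."""
    bucket_arn = f"arn:aws:s3:::{bucket}"
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Listing is granted on the bucket itself.
                "Effect": "Allow",
                "Action": ["s3:ListBucket"],
                "Resource": [bucket_arn],
            },
            {
                # Reading is granted on the objects inside the bucket.
                "Effect": "Allow",
                "Action": ["s3:GetObject"],
                "Resource": [f"{bucket_arn}/*"],
            },
        ],
    }


# Hypothetical bucket name: substitute the bucket that holds your model.
policy = read_only_bucket_policy("my-poolside-models")
print(json.dumps(policy, indent=2))
```

The printed JSON can be compared against the policy attached to your EKS pod role to check that both the bucket and its objects are covered.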
Authentication
You will be prompted to authenticate through your identity provider. Open the provided URL in a web browser and log in with valid credentials. Ensure you have the necessary permissions to access the S3 bucket and import models into Poolside. If you encounter any issues, check your AWS credentials and Poolside permissions.
Verification
After the import process is complete, run the list command again to confirm that your model has been successfully added:
Testing the API
After successfully loading the model, you can also test the API to confirm that everything is working properly. First, get the bearer token for your logged-in user by checking the Splash config file for the token with a ps prefix. Run:
Note: Remember to insert your own token as the value for the auth header.
Detailed documentation on the Poolside API can be found at http://<api.domain.com>/docs
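As an illustration of placing the token in the auth header, here is a minimal Python sketch using only the standard library. It assumes the API accepts the standard `Authorization: Bearer <token>` scheme. The helper `build_authorized_request`, the domain `api.example.com`, and the token value `ps-example-token` are placeholders for illustration, and the request is only built, not sent.

```python
import urllib.request


def build_authorized_request(
    base_url: str, token: str, path: str = "/docs"
) -> urllib.request.Request:
    """Return a GET request that carries the token as a bearer credential."""
    if not token:
        raise ValueError("token must not be empty")
    url = base_url.rstrip("/") + path
    return urllib.request.Request(url, headers={"Authorization": f"Bearer {token}"})


# Placeholder values: substitute your own API domain and your own token.
request = build_authorized_request("http://api.example.com", "ps-example-token")
print(request.full_url)
print(request.get_header("Authorization"))
```

To send the request, pass it to `urllib.request.urlopen(request)` once the real domain and token are in place.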