Deploying NudgeBee AI on AWS Bedrock (Custom Model)

Overview

This guide details the deployment of NudgeBee AI using AWS Bedrock's custom model hosting feature.

Prerequisites

  • AWS account with access to Bedrock and S3
  • Trained NudgeBee model packaged as a .tar.gz archive (e.g., nudgebee_model.tar.gz)
  • Model uploaded to an S3 bucket
  • IAM role granting Amazon Bedrock read access to the S3 bucket that holds the model artifacts

Step-by-Step Guide

Step 1: Upload Model to S3

As in the SageMaker guide, upload nudgebee_model.tar.gz to an S3 bucket.
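
For example, from the AWS CLI (the bucket name below is a placeholder):

# Copy the model archive to your S3 bucket
aws s3 cp nudgebee_model.tar.gz s3://<Your_Bucket>/nudgebee_model.tar.gz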

Step 2: Register the Model in Bedrock

  1. Go to Amazon Bedrock
  2. Click Custom Models > Create Custom Model
  3. Name the model (e.g., nudgebee-custom-llm)
  4. Enter the S3 path of the model artifacts uploaded in Step 1
  5. Select the IAM role from the prerequisites
  6. Click Create Model
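
If you prefer to script this step, the sketch below drives it from the CLI. It assumes the console flow above corresponds to Bedrock's Custom Model Import feature; the job name, account ID, role name, and bucket are placeholders:

# Register the model artifacts from S3 as a Bedrock custom model
aws bedrock create-model-import-job \
  --region <AWS_Region> \
  --job-name nudgebee-import-job \
  --imported-model-name nudgebee-custom-llm \
  --role-arn arn:aws:iam::<Account_ID>:role/<Bedrock_Access_Role> \
  --model-data-source '{"s3DataSource": {"s3Uri": "s3://<Your_Bucket>/nudgebee_model.tar.gz"}}'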

Step 3: Deploy the Model

  1. Go to Custom Models > Deploy Model
  2. Select model and choose instance type (e.g., ml.g5.2xlarge)
  3. Set autoscaling (optional)
  4. Click Deploy and wait until active
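
Deployment can take several minutes. A hedged way to confirm progress from the CLI, again assuming the Custom Model Import feature backs the console flow (job name as in the sketch above):

# Poll the import job until its status reports completed
aws bedrock get-model-import-job \
  --region <AWS_Region> \
  --job-identifier nudgebee-import-job

# List imported custom models to retrieve the model ID/ARN for the steps below
aws bedrock list-imported-models --region <AWS_Region>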

RAG and LLM Server Configuration

RAG Server (Bedrock)

EMBEDDINGS_PROVIDER=bedrock
EMBEDDINGS_PROVIDER_REGION=<AWS_Region>
EMBEDDINGS_MODEL_NAME=<Custom_Bedrock_Model_ID>

LLM Server (Bedrock)

LLM_PROVIDER=bedrock
LLM_PROVIDER_REGION=<AWS_Region>
LLM_MODEL_NAME=<Custom_Bedrock_Model_ID>
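
One way to supply these values is to export them in each server's environment before start-up; the sketch below uses an illustrative region and keeps the model ID as a placeholder:

# RAG server environment
export EMBEDDINGS_PROVIDER=bedrock
export EMBEDDINGS_PROVIDER_REGION=us-east-1   # illustrative region
export EMBEDDINGS_MODEL_NAME=<Custom_Bedrock_Model_ID>

# LLM server environment
export LLM_PROVIDER=bedrock
export LLM_PROVIDER_REGION=us-east-1          # illustrative region
export LLM_MODEL_NAME=<Custom_Bedrock_Model_ID>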

Testing

Use the AWS CLI to send a test prompt to the deployed model:

aws bedrock-runtime invoke-model \
  --region <AWS_Region> \
  --model-id <Custom_Bedrock_Model_ID> \
  --content-type application/json \
  --cli-binary-format raw-in-base64-out \
  --body '{"prompt": "Hello, how can I help you?"}' \
  output.json

Note: invoke-model writes the response to the required output file (output.json here), and --cli-binary-format raw-in-base64-out lets AWS CLI v2 accept the raw JSON body instead of base64.
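
The shape of the response body depends on the model's serving contract; a hedged way to inspect it (the generation field name is hypothetical):

# Print the raw response, then extract a hypothetical generation field
cat output.json
jq -r '.generation' output.json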