Deploys an endpoint for a foundation model from SageMaker JumpStart Foundation Models.
The module uses AWS Generative AI CDK Constructs.
jump-start-model-name
- model name from SageMaker JumpStart Foundation Modelsinstance-type
- inference container instance type
vpc-id
- VPC idsubnet-ids
- VPC subnet ids
EndpointArn
- endpoint ARN.RoleArn
- IAM role ARN.
Example manifest:
name: hf-mistral-endpoint
path: modules/fmops/sagemaker-jumpstart-fm-endpoint
targetAccount: primary
parameters:
- name: jump-start-model-name
value: HUGGINGFACE_LLM_MISTRAL_7B_2_1_0
- name: instance-type
value: inf1.xlarge
- name: vpc_id
valueFrom:
moduleMetadata:
group: networking
name: networking
key: VpcId
- name: subnet_ids
valueFrom:
moduleMetadata:
group: networking
name: networking
key: PrivateSubnetIds