
Commit 5ecd867

Author: Dinesh Sajwan
feat(construct): merge with main
2 parents d413f11 + 92ebd97, commit 5ecd867

122 files changed, +29515 -176 lines changed

.github/workflows/code-generation.yml

+4 -4
Generated file; diff not rendered by default.

.projen/tasks.json

+12
Generated file; diff not rendered by default.

.projenrc.ts

+12
@@ -189,11 +189,23 @@ packageJson?.patch(JsonPatch.add('/scripts/postpack', 'pinst --enable'));
 project.addTask('generate-models-containers', {
   description: 'Generate new list of models available from Jumpstart and DLC containers',
   steps: [
+    {
+      say: 'Synthesize project files',
+      spawn: 'default',
+    },
+    {
+      say: 'Compile',
+      spawn: 'compile',
+    },
     {
       say: 'Generate new list of models available from Jumpstart and DLC containers',
       cwd: project.srcdir+'/patterns/gen-ai/aws-model-deployment-sagemaker/code-generation',
       exec: 'npm run generate',
     },
+    {
+      say: 'Generate the new apidocs',
+      spawn: 'post-compile',
+    },
   ],
 });
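
Reading the hunk above, the task now synthesizes and compiles before running the generator, then regenerates the API docs afterwards. The following is only a minimal sketch of that ordering; the `Step` interface and `describeOrder` helper are hypothetical and local to this example, not projen's own types or API.

```typescript
// Hypothetical local type mirroring the fields used in the diff above.
// It is NOT projen's TaskStep type.
interface Step {
  say: string;
  spawn?: string; // name of another task to run
  exec?: string; // shell command to run
}

// Steps in the order they appear after this commit.
const steps: Step[] = [
  { say: "Synthesize project files", spawn: "default" },
  { say: "Compile", spawn: "compile" },
  {
    say: "Generate new list of models available from Jumpstart and DLC containers",
    exec: "npm run generate",
  },
  { say: "Generate the new apidocs", spawn: "post-compile" },
];

// Hypothetical helper: labels each step as a spawned task or a shell command.
function describeOrder(list: Step[]): string[] {
  return list.map((s) => (s.spawn ? `task:${s.spawn}` : `exec:${s.exec}`));
}

console.log(describeOrder(steps).join("\n"));
```

The point of the ordering is that the generator runs against freshly synthesized and compiled sources, and the API docs are rebuilt from whatever the generator produced.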

README.md

+6 -6
@@ -73,14 +73,14 @@ The following constructs are available in the library:
 |:-------------|:-------------|:-------------|
 | [Data ingestion pipeline](./src/patterns/gen-ai/aws-rag-appsync-stepfn-opensearch/) | Ingestion pipeline providing a RAG (retrieval augmented generation) source for storing documents in a knowledge base. | Amazon OpenSearch, AWS Step Functions, Amazon Bedrock, AWS AppSync, AWS Lambda |
 | [Question answering](./src/patterns/gen-ai/aws-qa-appsync-opensearch/) | Question answering with a large language model (Anthropic Claude V2.1) using a RAG (retrieval augmented generation) source and/or long context. | Amazon OpenSearch, AWS Lambda, Amazon Bedrock, AWS AppSync |
-| [Summarization](./src/patterns/gen-ai/aws-summarization-appsync-stepfn/) | Document summarization with a large language model (Anthropic Claude V2.1). | AWS Lambda, Amazon Bedrock, AWS AppSync and Amazon Elasticache for Redis. |
+| [Summarization](./src/patterns/gen-ai/aws-summarization-appsync-stepfn/) | Document summarization with a large language model (Anthropic Claude V2.1). | AWS Lambda, Amazon Bedrock, AWS AppSync and Amazon ElastiCache for Redis. |
 | [Lambda layer](./src/patterns/gen-ai/aws-langchain-common-layer/) | Python Lambda layer providing dependencies and utilities to develop generative AI applications on AWS. | AWS Lambda, Amazon Bedrock, Amazon SageMaker |
-| [SageMaker model deployment (Jumpstart)](./src/patterns/gen-ai/aws-model-deployment-sagemaker/README_jumpstart.md) | Deploy a foundation model from Amazon SageMaker JumpStart to an Amazon SageMaker endpoint. | Amazon SageMaker |
+| [SageMaker model deployment (JumpStart)](./src/patterns/gen-ai/aws-model-deployment-sagemaker/README_jumpstart.md) | Deploy a foundation model from Amazon SageMaker JumpStart to an Amazon SageMaker endpoint. | Amazon SageMaker |
 | [SageMaker model deployment (Hugging Face)](./src/patterns/gen-ai/aws-model-deployment-sagemaker/README_hugging_face.md) | Deploy a foundation model from Hugging Face to an Amazon SageMaker endpoint. | Amazon SageMaker |
 | [SageMaker model deployment (Custom)](./src/patterns/gen-ai/aws-model-deployment-sagemaker/README_custom_sagemaker_endpoint.md) | Deploy a foundation model from an S3 location to an Amazon SageMaker endpoint. | Amazon SageMaker |
-| [Amazon Bedrock](./src/cdk-lib/bedrock/README.md) | CDK L2 Constructs for Amazon Bedrock | Amazon Bedrock, Amazon OpenSearch Serverless, AWS Lambda |
-| [Amazon OpenSearch Serverless Vector Collection](./src/cdk-lib/opensearchserverless/README.md) | CDK L2 Constructs to create a vector collection | Amazon OpenSearch Vector Index |
-| [Amazon OpenSearch Vector Index](./src/cdk-lib/opensearch-vectorindex/README.md) | CDK L1 Custom Resource to create a vector index | Amazon OpenSearch Serverless, AWS Lambda |
+| [Amazon Bedrock](./src/cdk-lib/bedrock/README.md) | CDK L2 Constructs for Amazon Bedrock. | Amazon Bedrock, Amazon OpenSearch Serverless, AWS Lambda |
+| [Amazon OpenSearch Serverless Vector Collection](./src/cdk-lib/opensearchserverless/README.md) | CDK L2 Constructs to create a vector collection. | Amazon OpenSearch Vector Index |
+| [Amazon OpenSearch Vector Index](./src/cdk-lib/opensearch-vectorindex/README.md) | CDK L1 Custom Resource to create a vector index. | Amazon OpenSearch Serverless, AWS Lambda |

 # Sample Use Cases

@@ -94,7 +94,7 @@ The official samples repository https://github.com/aws-samples/generative-ai-cdk
 | [aws-cdk-stack-builder-tool](https://github.com/aws-samples/aws-cdk-stack-builder-tool) | AWS CDK Builder is a browser-based tool designed to streamline bootstrapping of Infrastructure as Code (IaC) projects using the AWS Cloud Development Kit (CDK). |
 | [CDK Live! Building generative AI applications and architectures leveraging AWS CDK Constructs!](https://www.youtube.com/watch?v=NI1F4Xxqyr8) | CDK Live! episode focused on building and deploying generative AI applications and architectures on AWS using the AWS Cloud Development Kit (CDK) and the AWS Generative AI CDK Constructs. |
 | [Announcing AWS Generative AI CDK Constructs!](https://aws.amazon.com/blogs/devops/announcing-generative-ai-cdk-constructs/) | Blog post announcing the release of the AWS Generative AI CDK Constructs. |
-| [aws-genai-llm-chatbot](https://github.com/aws-samples/aws-genai-llm-chatbot/tree/main) | Multi-Model and Multi-RAG Powered Chatbot Using AWS CDK on AWS allowing you to experiment with a variety of Large Language Models and Multimodal Language Models, settings and prompts in your own AWS account |
+| [aws-genai-llm-chatbot](https://github.com/aws-samples/aws-genai-llm-chatbot/tree/main) | Multi-Model and Multi-RAG Powered Chatbot Using AWS CDK on AWS allowing you to experiment with a variety of Large Language Models and Multimodal Language Models, settings and prompts in your own AWS account. |

 # Operational Metrics Collection

apidocs/classes/CustomSageMakerEndpoint.md

-7
@@ -34,7 +34,6 @@
 - [node](CustomSageMakerEndpoint.md#node)
 - [role](CustomSageMakerEndpoint.md#role)
 - [startupHealthCheckTimeoutInSeconds](CustomSageMakerEndpoint.md#startuphealthchecktimeoutinseconds)
-- [volumeSizeInGb](CustomSageMakerEndpoint.md#volumesizeingb)

 ### Methods

@@ -164,12 +163,6 @@ ___

 `Private` `Readonly` **startupHealthCheckTimeoutInSeconds**: `number`

-___
-
-### volumeSizeInGb
-
-`Optional` `Readonly` **volumeSizeInGb**: `number`
-
 ## Methods

 ### addToRolePolicy

apidocs/classes/DeepLearningContainerImage.md

+63
@@ -39,6 +39,8 @@ https://github.com/aws/deep-learning-containers/blob/master/available_images.md
 - [DJL\_INFERENCE\_0\_24\_0\_NEURONX\_SDK2\_14\_1](DeepLearningContainerImage.md#djl_inference_0_24_0_neuronx_sdk2_14_1)
 - [DJL\_INFERENCE\_0\_25\_0\_DEEPSPEED0\_11\_0\_CU118](DeepLearningContainerImage.md#djl_inference_0_25_0_deepspeed0_11_0_cu118)
 - [DJL\_INFERENCE\_0\_25\_0\_NEURONX\_SDK2\_15\_0](DeepLearningContainerImage.md#djl_inference_0_25_0_neuronx_sdk2_15_0)
+- [DJL\_INFERENCE\_0\_26\_0\_DEEPSPEED0\_12\_6\_CU121](DeepLearningContainerImage.md#djl_inference_0_26_0_deepspeed0_12_6_cu121)
+- [DJL\_INFERENCE\_0\_26\_0\_NEURONX\_SDK2\_16\_0](DeepLearningContainerImage.md#djl_inference_0_26_0_neuronx_sdk2_16_0)
 - [HUGGINGFACE\_PYTORCH\_INFERENCE\_1\_10\_2\_TRANSFORMERS4\_17\_0\_CPU\_PY38\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_inference_1_10_2_transformers4_17_0_cpu_py38_ubuntu20_04)
 - [HUGGINGFACE\_PYTORCH\_INFERENCE\_1\_10\_2\_TRANSFORMERS4\_17\_0\_GPU\_PY38\_CU113\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_inference_1_10_2_transformers4_17_0_gpu_py38_cu113_ubuntu20_04)
 - [HUGGINGFACE\_PYTORCH\_INFERENCE\_1\_13\_1\_TRANSFORMERS4\_26\_0\_CPU\_PY39\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_inference_1_13_1_transformers4_26_0_cpu_py39_ubuntu20_04)
@@ -57,13 +59,20 @@ https://github.com/aws/deep-learning-containers/blob/master/available_images.md
 - [HUGGINGFACE\_PYTORCH\_INFERENCE\_1\_9\_1\_TRANSFORMERS4\_12\_3\_GPU\_PY38\_CU111\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_inference_1_9_1_transformers4_12_3_gpu_py38_cu111_ubuntu20_04)
 - [HUGGINGFACE\_PYTORCH\_INFERENCE\_2\_0\_0\_TRANSFORMERS4\_28\_1\_CPU\_PY310\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_inference_2_0_0_transformers4_28_1_cpu_py310_ubuntu20_04)
 - [HUGGINGFACE\_PYTORCH\_INFERENCE\_2\_0\_0\_TRANSFORMERS4\_28\_1\_GPU\_PY310\_CU118\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_inference_2_0_0_transformers4_28_1_gpu_py310_cu118_ubuntu20_04)
+- [HUGGINGFACE\_PYTORCH\_INFERENCE\_2\_1\_0\_TRANSFORMERS4\_37\_0\_CPU\_PY310\_UBUNTU22\_04](DeepLearningContainerImage.md#huggingface_pytorch_inference_2_1_0_transformers4_37_0_cpu_py310_ubuntu22_04)
+- [HUGGINGFACE\_PYTORCH\_INFERENCE\_2\_1\_0\_TRANSFORMERS4\_37\_0\_GPU\_PY310\_CU118\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_inference_2_1_0_transformers4_37_0_gpu_py310_cu118_ubuntu20_04)
 - [HUGGINGFACE\_PYTORCH\_INFERENCE\_NEURONX\_1\_13\_0\_TRANSFORMERS4\_28\_1\_NEURONX\_PY38\_SDK2\_9\_1\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_inference_neuronx_1_13_0_transformers4_28_1_neuronx_py38_sdk2_9_1_ubuntu20_04)
 - [HUGGINGFACE\_PYTORCH\_INFERENCE\_NEURONX\_1\_13\_1\_TRANSFORMERS4\_34\_1\_NEURONX\_PY310\_SDK2\_15\_0\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_inference_neuronx_1_13_1_transformers4_34_1_neuronx_py310_sdk2_15_0_ubuntu20_04)
+- [HUGGINGFACE\_PYTORCH\_INFERENCE\_NEURONX\_1\_13\_1\_TRANSFORMERS4\_36\_2\_NEURONX\_PY310\_SDK2\_16\_1\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_inference_neuronx_1_13_1_transformers4_36_2_neuronx_py310_sdk2_16_1_ubuntu20_04)
 - [HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_0\_0\_TGI0\_6\_0\_GPU\_PY39\_CU118\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_tgi_inference_2_0_0_tgi0_6_0_gpu_py39_cu118_ubuntu20_04)
 - [HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_0\_0\_TGI0\_8\_2\_GPU\_PY39\_CU118\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_tgi_inference_2_0_0_tgi0_8_2_gpu_py39_cu118_ubuntu20_04)
 - [HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_0\_1\_TGI0\_9\_3\_GPU\_PY39\_CU118\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_tgi_inference_2_0_1_tgi0_9_3_gpu_py39_cu118_ubuntu20_04)
 - [HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_0\_1\_TGI1\_0\_3\_GPU\_PY39\_CU118\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_tgi_inference_2_0_1_tgi1_0_3_gpu_py39_cu118_ubuntu20_04)
 - [HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_0\_1\_TGI1\_1\_0\_GPU\_PY39\_CU118\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_tgi_inference_2_0_1_tgi1_1_0_gpu_py39_cu118_ubuntu20_04)
+- [HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_1\_1\_TGI1\_2\_0\_GPU\_PY310\_CU121\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_tgi_inference_2_1_1_tgi1_2_0_gpu_py310_cu121_ubuntu20_04)
+- [HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_1\_1\_TGI1\_3\_1\_GPU\_PY310\_CU121\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_tgi_inference_2_1_1_tgi1_3_1_gpu_py310_cu121_ubuntu20_04)
+- [HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_1\_1\_TGI1\_3\_3\_GPU\_PY310\_CU121\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_tgi_inference_2_1_1_tgi1_3_3_gpu_py310_cu121_ubuntu20_04)
+- [HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_1\_1\_TGI1\_4\_0\_GPU\_PY310\_CU121\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_pytorch_tgi_inference_2_1_1_tgi1_4_0_gpu_py310_cu121_ubuntu20_04)
 - [HUGGINGFACE\_TENSORFLOW\_INFERENCE\_2\_11\_0\_TRANSFORMERS4\_26\_0\_CPU\_PY39\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_tensorflow_inference_2_11_0_transformers4_26_0_cpu_py39_ubuntu20_04)
 - [HUGGINGFACE\_TENSORFLOW\_INFERENCE\_2\_11\_0\_TRANSFORMERS4\_26\_0\_GPU\_PY39\_CU112\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_tensorflow_inference_2_11_0_transformers4_26_0_gpu_py39_cu112_ubuntu20_04)
 - [HUGGINGFACE\_TENSORFLOW\_INFERENCE\_2\_11\_1\_TRANSFORMERS4\_26\_0\_CPU\_PY39\_UBUNTU20\_04](DeepLearningContainerImage.md#huggingface_tensorflow_inference_2_11_1_transformers4_26_0_cpu_py39_ubuntu20_04)
@@ -240,6 +249,18 @@ ___

 ___

+### DJL\_INFERENCE\_0\_26\_0\_DEEPSPEED0\_12\_6\_CU121
+
+`Static` `Readonly` **DJL\_INFERENCE\_0\_26\_0\_DEEPSPEED0\_12\_6\_CU121**: [`ContainerImage`](ContainerImage.md)
+
+___
+
+### DJL\_INFERENCE\_0\_26\_0\_NEURONX\_SDK2\_16\_0
+
+`Static` `Readonly` **DJL\_INFERENCE\_0\_26\_0\_NEURONX\_SDK2\_16\_0**: [`ContainerImage`](ContainerImage.md)
+
+___
+
 ### HUGGINGFACE\_PYTORCH\_INFERENCE\_1\_10\_2\_TRANSFORMERS4\_17\_0\_CPU\_PY38\_UBUNTU20\_04

 `Static` `Readonly` **HUGGINGFACE\_PYTORCH\_INFERENCE\_1\_10\_2\_TRANSFORMERS4\_17\_0\_CPU\_PY38\_UBUNTU20\_04**: [`ContainerImage`](ContainerImage.md)
@@ -348,6 +369,18 @@ ___

 ___

+### HUGGINGFACE\_PYTORCH\_INFERENCE\_2\_1\_0\_TRANSFORMERS4\_37\_0\_CPU\_PY310\_UBUNTU22\_04
+
+`Static` `Readonly` **HUGGINGFACE\_PYTORCH\_INFERENCE\_2\_1\_0\_TRANSFORMERS4\_37\_0\_CPU\_PY310\_UBUNTU22\_04**: [`ContainerImage`](ContainerImage.md)
+
+___
+
+### HUGGINGFACE\_PYTORCH\_INFERENCE\_2\_1\_0\_TRANSFORMERS4\_37\_0\_GPU\_PY310\_CU118\_UBUNTU20\_04
+
+`Static` `Readonly` **HUGGINGFACE\_PYTORCH\_INFERENCE\_2\_1\_0\_TRANSFORMERS4\_37\_0\_GPU\_PY310\_CU118\_UBUNTU20\_04**: [`ContainerImage`](ContainerImage.md)
+
+___
+
 ### HUGGINGFACE\_PYTORCH\_INFERENCE\_NEURONX\_1\_13\_0\_TRANSFORMERS4\_28\_1\_NEURONX\_PY38\_SDK2\_9\_1\_UBUNTU20\_04

 `Static` `Readonly` **HUGGINGFACE\_PYTORCH\_INFERENCE\_NEURONX\_1\_13\_0\_TRANSFORMERS4\_28\_1\_NEURONX\_PY38\_SDK2\_9\_1\_UBUNTU20\_04**: [`ContainerImage`](ContainerImage.md)
@@ -360,6 +393,12 @@ ___

 ___

+### HUGGINGFACE\_PYTORCH\_INFERENCE\_NEURONX\_1\_13\_1\_TRANSFORMERS4\_36\_2\_NEURONX\_PY310\_SDK2\_16\_1\_UBUNTU20\_04
+
+`Static` `Readonly` **HUGGINGFACE\_PYTORCH\_INFERENCE\_NEURONX\_1\_13\_1\_TRANSFORMERS4\_36\_2\_NEURONX\_PY310\_SDK2\_16\_1\_UBUNTU20\_04**: [`ContainerImage`](ContainerImage.md)
+
+___
+
 ### HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_0\_0\_TGI0\_6\_0\_GPU\_PY39\_CU118\_UBUNTU20\_04

 `Static` `Readonly` **HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_0\_0\_TGI0\_6\_0\_GPU\_PY39\_CU118\_UBUNTU20\_04**: [`ContainerImage`](ContainerImage.md)
@@ -390,6 +429,30 @@ ___

 ___

+### HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_1\_1\_TGI1\_2\_0\_GPU\_PY310\_CU121\_UBUNTU20\_04
+
+`Static` `Readonly` **HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_1\_1\_TGI1\_2\_0\_GPU\_PY310\_CU121\_UBUNTU20\_04**: [`ContainerImage`](ContainerImage.md)
+
+___
+
+### HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_1\_1\_TGI1\_3\_1\_GPU\_PY310\_CU121\_UBUNTU20\_04
+
+`Static` `Readonly` **HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_1\_1\_TGI1\_3\_1\_GPU\_PY310\_CU121\_UBUNTU20\_04**: [`ContainerImage`](ContainerImage.md)
+
+___
+
+### HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_1\_1\_TGI1\_3\_3\_GPU\_PY310\_CU121\_UBUNTU20\_04
+
+`Static` `Readonly` **HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_1\_1\_TGI1\_3\_3\_GPU\_PY310\_CU121\_UBUNTU20\_04**: [`ContainerImage`](ContainerImage.md)
+
+___
+
+### HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_1\_1\_TGI1\_4\_0\_GPU\_PY310\_CU121\_UBUNTU20\_04
+
+`Static` `Readonly` **HUGGINGFACE\_PYTORCH\_TGI\_INFERENCE\_2\_1\_1\_TGI1\_4\_0\_GPU\_PY310\_CU121\_UBUNTU20\_04**: [`ContainerImage`](ContainerImage.md)
+
+___
+
 ### HUGGINGFACE\_TENSORFLOW\_INFERENCE\_2\_11\_0\_TRANSFORMERS4\_26\_0\_CPU\_PY39\_UBUNTU20\_04

 `Static` `Readonly` **HUGGINGFACE\_TENSORFLOW\_INFERENCE\_2\_11\_0\_TRANSFORMERS4\_26\_0\_CPU\_PY39\_UBUNTU20\_04**: [`ContainerImage`](ContainerImage.md)
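
The constant names added in this file look like a repository name joined to an image tag, uppercased, with every non-alphanumeric run replaced by an underscore. The sketch below illustrates that apparent convention only. `toConstantName` is a hypothetical helper, not the generator shipped in the repository, and the repository and tag strings are assumed inputs inferred from the constant names themselves.

```typescript
// Hypothetical helper (not the repository's code generator): derives a
// constant name from a container repository name and an image tag.
function toConstantName(repository: string, tag: string): string {
  return `${repository}-${tag}`
    .toUpperCase() // constants in the API docs are upper case
    .replace(/[^A-Z0-9]+/g, "_"); // dots and dashes become underscores
}

// Assumed inputs, chosen to match constants listed in the diff above.
const djl = toConstantName("djl-inference", "0.26.0-deepspeed0.12.6-cu121");
const tgi = toConstantName(
  "huggingface-pytorch-tgi-inference",
  "2.1.1-tgi1.4.0-gpu-py310-cu121-ubuntu20.04",
);

console.log(djl);
console.log(tgi);
```

Under these assumptions the two calls yield `DJL_INFERENCE_0_26_0_DEEPSPEED0_12_6_CU121` and `HUGGINGFACE_PYTORCH_TGI_INFERENCE_2_1_1_TGI1_4_0_GPU_PY310_CU121_UBUNTU20_04`, which match two of the constants added by this commit.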
