Skip to content

processing job infra spin up takes 16x longer than the job itself #4358

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
jlonge4 opened this issue Jan 5, 2024 · 1 comment
Open

processing job infra spin up takes 16x longer than the job itself #4358

jlonge4 opened this issue Jan 5, 2024 · 1 comment
Assignees
Labels
component: processing Relates to the SageMaker Processing Platform type: feature request

Comments

@jlonge4
Copy link

jlonge4 commented Jan 5, 2024

Describe the feature you'd like
The feature I am proposing would be to implement some sort of warm pool (similar to estimator) or scheduled run to keep infra up to allow the processing job to take only as long as the script runtime. Requesting this due to my 30s script runtime taking 8min 17s total bc of infra allocation.

How would this feature be used? Please describe.
This feature would be used to cut down on processing time and reduce latency

Describe alternatives you've considered
Alternatives I have considered are sagemaker notebook jobs, and lambda container.

Additional context
The goal of my processing job would read in a file, process and index it (create vector embeddings and add to docstore) using the library of my choice (langchain, haystack, etc)

@mohanasudhan mohanasudhan added the component: processing Relates to the SageMaker Processing Platform label Jan 9, 2024
@lorenzwalthert
Copy link

Did you consider Local Mode? We use this feature for local prototyping and it works well except some limitations in the context of Pipeline and Experiment integration, e.g. #4114.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component: processing Relates to the SageMaker Processing Platform type: feature request
Projects
None yet
Development

No branches or pull requests

4 participants