Feature request: Calculate Log Sampling Per Request #6141

Closed
2 tasks done
leandrodamascena opened this issue Feb 21, 2025 · 2 comments · Fixed by #6142

@leandrodamascena (Contributor) commented:

Use case

Original discussion: aws-powertools/powertools-lambda-typescript#3278

Copying @dreamorosi's comment to add more context to this feature request:

Had a discussion about this with @am29d yesterday and I thought it'd be useful to bring here the outcomes of the discussion, primarily for future implementation & consideration.

The log sampling feature changes the log level of a Logger to debug for a percentage of requests. Customers can set a rate in the Logger constructor and this rate is used to determine whether or not to change the log level to debug. This is useful for customers who want to keep a less verbose log level most of the time, but have more logs emitted for a percentage of their requests.

As it stands, the feature doesn't exactly behave as described above, because the percentage (or ratio) is not calculated at the request level, but rather when the Logger class is instantiated, which usually happens during the INIT phase of the execution environment, i.e.:

from aws_lambda_powertools import Logger

logger = Logger(sampling_rate=0.5)   # whether or not the log level is switched to `debug` is decided here only

def handler(event, context):
    # ... your logic here
    pass

This means that all the requests served by the same environment inherit the sampling decision made when the environment was initialized, which in turn results in an observed sampling rate different from the desired one. The degree of this difference depends on how many environments are spun up and on the distribution of requests among them.

To explain what I mean by that, let's consider this example that has 3 environments/sandboxes and a number of requests distributed across them, and - for the sake of simplicity - a log sampling of 0.5 (aka 50%):

[Image: three execution environments, each applying the single sampling decision made at INIT time to all of its requests]

Assuming a truly random chance of 50%, one could end up in a situation like the above, which would result in a sample rate of ~85% rather than the expected 50%.

To get around this and instead get a rate closer to the desired 50%, customers can call the logger.refresh_sample_rate_calculation() method at the start or end of each request.

from aws_lambda_powertools import Logger

logger = Logger(sampling_rate=0.5)   # whether or not the log level is switched to `debug` is decided here only

def handler(event, context):
    # ... your logic here
    logger.refresh_sample_rate_calculation()  # re-evaluate the sampling decision for the next request

When called, this method essentially flips the coin again and decides whether the log level should be switched to debug or not. Because this is done at the request level, statistically speaking, the ratio of sampled requests should be much closer to the desired one:

[Image: the same three environments, with the sampling decision re-made on every request, yielding a rate close to the desired 50%]

With this in mind, we should consider easing this situation for customers by adding an optional flag to our class method decorator and Middy.js middleware so that when this flag is enabled, we'll call the logger.refresh_sample_rate_calculation() method for them at the end of each request, as proposed here.

The flag would be false by default to maintain backward compatibility, although in a future major version we could consider enabling it by default, since this would be a much more accurate behavior than the current one.

Obviously, as mentioned above, this would work only if we're able to wrap the handler, so customers who are not using either of the two mechanisms just mentioned would have to continue calling the logger.refresh_sample_rate_calculation() manually.
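The gap between environment-level and request-level sampling described above can be illustrated with a small standalone simulation. The simulate helper below is hypothetical and only mimics the coin-flip logic; it does not use Powertools:

```python
import random

def simulate(num_envs: int, requests_per_env: int, rate: float, per_request: bool) -> float:
    """Return the observed fraction of sampled requests.

    per_request=False mimics deciding once per environment at INIT time;
    per_request=True mimics refreshing the decision on every request.
    """
    sampled = 0
    for _ in range(num_envs):
        env_decision = random.random() < rate  # decided once when the environment starts
        for _ in range(requests_per_env):
            decision = (random.random() < rate) if per_request else env_decision
            sampled += decision
    return sampled / (num_envs * requests_per_env)

random.seed(42)
init_time = simulate(num_envs=3, requests_per_env=1000, rate=0.5, per_request=False)
per_req = simulate(num_envs=3, requests_per_env=1000, rate=0.5, per_request=True)
# With 3 environments, init_time can only ever be 0, 1/3, 2/3, or 1,
# while per_req lands close to the configured 0.5.
```

With INIT-time decisions the observed rate is quantized to multiples of 1/num_envs, which is why a small fleet of environments can drift far from the configured rate, exactly as in the ~85% example above.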

Solution/User Experience

We want to have this experience:

1 - Customers will continue to set sampling_rate at the constructor level.

2 - Customers using the @logger.inject_lambda_context decorator will observe the sampling rate being recalculated on every request, producing the expected result.

3 - Customers not using the decorator must call refresh_sample_rate_calculation manually.

4 - Customers not using the decorator or the refresh_sample_rate_calculation method will end up with unexpected sampling rates/logs.

5 - We need to update our documentation to make this behavior clearer.

Alternative solutions

Acknowledgment

@leandrodamascena leandrodamascena added feature-request feature request triage Pending triage from maintainers labels Feb 21, 2025
@leandrodamascena leandrodamascena added logger and removed triage Pending triage from maintainers labels Feb 21, 2025
@leandrodamascena leandrodamascena linked a pull request Feb 21, 2025 that will close this issue
@leandrodamascena leandrodamascena moved this from Backlog to Working on it in Powertools for AWS Lambda (Python) Feb 21, 2025
@leandrodamascena leandrodamascena self-assigned this Feb 21, 2025
@github-project-automation github-project-automation bot moved this from Working on it to Coming soon in Powertools for AWS Lambda (Python) Feb 27, 2025

⚠️COMMENT VISIBILITY WARNING⚠️

This issue is now closed. Please be mindful that future comments are hard for our team to see.

If you need more assistance, please either tag a team member or open a new issue that references this one.

If you wish to keep having a conversation with other community members under this issue feel free to do so.

@github-actions github-actions bot added the pending-release Fix or implementation already in dev waiting to be released label Feb 27, 2025
@leandrodamascena leandrodamascena moved this from Coming soon to Shipped in Powertools for AWS Lambda (Python) Mar 7, 2025

This is now released in version 3.9.0!

@github-actions github-actions bot removed the pending-release Fix or implementation already in dev waiting to be released label Mar 25, 2025