Flush lower level logs to console #3178
Conversation
Thanks a lot for your first contribution! Please check out our contributing guidelines and don't hesitate to ask whatever you need.

No acknowledgement section found. Please make sure you used the template to open a PR and didn't remove the acknowledgment section. Check the template here: https://github.com/aws-powertools/powertools-lambda-typescript/blob/develop/.github/PULL_REQUEST_TEMPLATE.md#acknowledgment

No related issues found. Please ensure there is an open issue related to this change to avoid significant delays or closure.
Just adding a comment here to get notifications about this PR! Whatever is decided here might make sense to implement in Python and .NET too. By the way, thanks for submitting the PR @zirkelc! Let's wait to hear from @dreamorosi.
Sorry, I actually wanted to open the PR in my forked repo to get a diff to share with @dreamorosi in #3142 (comment). Since it's already open, I'll leave it open so we can add comments in the code.
Hi @zirkelc, thank you for the PR. I have a deadline for this year's re:Invent set for Wednesday afternoon. I'm planning on reviewing your PR by Thursday morning at the latest.
Hey @dreamorosi, take your time. I just wanted to share it with you to get your opinion.
@@ -868,6 +876,22 @@ class Logger extends Utility implements LoggerInterface {
    extraInput: LogItemExtraInput
  ): void {
    if (logLevel >= this.logLevel) {
      // Only flush buffer when log level is higher than the configured log level
      if (logLevel > this.logLevel) {
        const xRayTraceId = this.envVarsService.getXrayTraceId() as string;
This can work in theory but in practice I don't see Powertools relying on this value to uniquely identify requests.
This value could be undefined in Lambda on rare occasions and would not exist at all outside of it, i.e. in testing environments or other compute platforms. It's true that we target primarily Lambda, so this is not an immediate concern when it comes to other platforms, but introducing a requirement on this environment variable means customers now have to set it in their tests or CI, which is why, for example, the Tracer utility disables itself when running outside of Lambda.
Overall, for this type of thing I would instead rely on something that follows a similar principle but is more robust, like AsyncLocalStorage. With this being a language feature, we wouldn't have to worry about where Logger is running, and context would be guaranteed at the process level.
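To make the suggestion concrete, here is a minimal sketch of what an AsyncLocalStorage-backed buffer could look like. This is an illustration, not Powertools code: `bufferLog`, `flushBuffer`, and `withLogBuffer` are hypothetical names, and a real implementation would live inside the Logger class.

```typescript
import { AsyncLocalStorage } from 'node:async_hooks';

// Hypothetical per-invocation buffer keyed by async context
// instead of the _X_AMZN_TRACE_ID environment variable.
const bufferStorage = new AsyncLocalStorage<string[]>();

function bufferLog(entry: string): void {
  // getStore() returns undefined outside of a run() scope, so this
  // degrades gracefully in tests or on other compute platforms.
  bufferStorage.getStore()?.push(entry);
}

function flushBuffer(): string[] {
  const buffered = bufferStorage.getStore() ?? [];
  const copy = [...buffered];
  buffered.length = 0; // clear the buffer after flushing
  return copy;
}

// A handler wrapper would enter the storage once per invocation:
function withLogBuffer<T>(fn: () => T): T {
  return bufferStorage.run([], fn);
}
```

Because the store is created per `run()` call, each invocation gets its own buffer without relying on any environment variable being set.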
// Add the log item to the context
const context = this.#context[xRayTraceId] ?? [];
context.push([
This is something I also touched upon in the original discussion, but for the sake of the review I'll include it here as well.
Having an unbounded buffer is a risk as you could run out of memory depending on how much and what you're logging.
For your specific use case this might not be a problem since you might know with relative certainty what you're logging and how much resources you're assigning to your Lambda functions, however from a library perspective this is a dangerous assumption to make.
If we were to implement this feature, we'd most likely want to set a max size for the buffer with a sensible default and an option to configure it. Depending on the performance impact of doing this, we might also want to consider calculating the size of each log and allowing customers to set a max size in KB/MB for the buffer rather than a number of buffered logs, since the latter might have less meaning. Alternatively, or additionally, we might also want to consider implementing a sliding window mechanism that deletes older logs as newer ones are added and the buffer reaches the limit.
The above is the main reason why we haven't implemented the feature, as it requires some design and discussion.
Yes, I agree! I think byte size and/or count limit should be implemented.
Since we store the stringified JSON (see comment on printLog/formatLog), we could easily calculate the UTF-8 byte size of each item and maintain a total byte size of the buffer. When a new log is added to the buffer and the total size exceeds the max allowed size, the buffer should drop the first N items until the total size is again below threshold.
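A rough sketch of that idea, assuming a standalone class for illustration (the class name, `maxBytes` parameter, and eviction policy are assumptions, not the library's API):

```typescript
// Size-bounded log buffer: tracks the total UTF-8 byte size of the
// stringified entries and evicts the oldest entries when a new one
// pushes the total over the configured limit.
class BoundedLogBuffer {
  private items: string[] = [];
  private totalBytes = 0;
  private readonly maxBytes: number;

  constructor(maxBytes: number) {
    this.maxBytes = maxBytes;
  }

  add(serialisedLog: string): void {
    this.items.push(serialisedLog);
    this.totalBytes += Buffer.byteLength(serialisedLog, 'utf8');
    // Sliding window: drop the oldest entries until back under the limit,
    // always keeping at least the newest entry.
    while (this.totalBytes > this.maxBytes && this.items.length > 1) {
      const evicted = this.items.shift() as string;
      this.totalBytes -= Buffer.byteLength(evicted, 'utf8');
    }
  }

  flush(): string[] {
    const out = this.items;
    this.items = [];
    this.totalBytes = 0;
    return out;
  }
}
```

Since the buffer already stores strings, `Buffer.byteLength` gives the exact serialised size without re-stringifying anything.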
      this.getJsonReplacer(),
      this.logIndentation
    )
    typeof log === 'string' ? log : this.formatLog(log)
If I understand this change correctly, we're bypassing the creation of an object, including all the attributes when passing a string message only.
If this is the case I don't think we would ever make this change, since the main value proposition of this logger is to emit JSON structured logs with keys, context, and all the other features.
I probably should have explained this change more thoroughly.
When saving logs into the buffer instead of emitting them to stdout, we cannot save the original object in the buffer because this object might be mutated later on in the Lambda function:
const user = { name: 'John' };
logger.debug('User', { user } );
user.name = 'Jane';
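The example above can be made concrete: buffering the serialised string freezes the value at log time, while buffering the object reference does not (plain `JSON.stringify`, nothing Powertools-specific is assumed here).

```typescript
const buffered: string[] = [];
const user = { name: 'John' };

// Buffering by reference: later mutations leak into the buffered log
const byReference = { user };

// Serialising at buffer time: the snapshot is frozen
buffered.push(JSON.stringify({ user }));

user.name = 'Jane';

console.log(JSON.stringify(byReference)); // {"user":{"name":"Jane"}}
console.log(buffered[0]);                 // {"user":{"name":"John"}}
```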
So I added a new function, formatLog, which is identical to printLog but doesn't emit to stdout:
powertools-lambda-typescript/packages/logger/src/Logger.ts
Lines 856 to 864 in e6d0db9
private formatLog(log: LogItem): string {
  log.prepareForPrint();
  return JSON.stringify(
    log.getAttributes(),
    this.getJsonReplacer(),
    this.logIndentation
  );
}
I use this function to serialise the log object into JSON and save it to the buffer:
powertools-lambda-typescript/packages/logger/src/Logger.ts
Lines 907 to 913 in e6d0db9
const context = this.#context[xRayTraceId] ?? [];
context.push([
  logLevel,
  this.formatLog(
    this.createAndPopulateLogItem(logLevel, input, extraInput)
  ),
]);
Then when we flush the log, since we already have a JSON string we only need to print it to stdout:
powertools-lambda-typescript/packages/logger/src/Logger.ts
Lines 884 to 887 in e6d0db9
if (this.#context[xRayTraceId]) {
  for (const contextItem of this.#context[xRayTraceId]) {
    this.printLog(...contextItem);
  }
I see - makes sense, I missed this.
Thank you for explaining it.
@dreamorosi do you want me to continue working on this, or would you like to start an RFC first? I think you also mentioned you were waiting for feedback from the Lambda team on something related to this. As a side note: in one of my projects, I implemented this feature manually by creating a “CustomLogger” class that extends the Logger class.
Thanks for bringing this up. Given that there are still some fundamental questions about the implementation, I think a small RFC would be better. This is also because we'll want to design the feature in a way that can be adopted by all versions of Powertools for AWS, and not just this one. Since I expect there will be some moderate discussion, and we'll have to run some tests that don't necessarily require an implementation, I think having this discussion in an RFC that we, and other versions, can then use as an implementation guide would be best.

If possible, and you're interested in championing this effort, I'd suggest opening a new discussion under RFCs and following the template, which will inform the subsequent discussion. After that, based on our process, once we reach an agreement we can start the implementation. Just to manage expectations: almost the entire team will be traveling in the US for re:Invent next week, and many (including me) are taking PTO the week afterwards. We'll do our best to engage with the RFC in a timely manner, but there will be some delays.

Regarding the discussion with the AWS Lambda team, that is still ongoing, but I don't expect to see any meaningful development soon. I think the most pragmatic way forward would be to implement this feature in a way that allows customers to buffer logs and then flush the buffer manually. Later on, if Lambda decides to make some hook available, we can improve on this by allowing the buffer to flush when certain conditions happen. If anything, having a design document / RFC and a clearly defined use case will help the discussion with that team, so I'd say let's move forward without waiting on this.
Yes, we can do this right away. If you could open an issue, we can try to land this change in the next release.
@dreamorosi sounds like a good plan forward! I'd be happy to start the RFC! Thank you for your support so far and have a good time at re:Invent! |
Thank you! FYI, the RFC template in discussions is now live; you should see it if you try to open a new discussion using the RFC category. For the time being, would it be OK to close this PR? We can always reopen it in case we want to resume working on it instead of starting a new branch.
Yes, of course, I wanted to close this PR anyway. I'll start the RFC in the next few days/weeks. |
#3142 (reply in thread)