Maintenance: avoid fixed-time waiting in our E2E testings #644

ijemmy · 2022-03-08T16:13:20Z

Bug description

Currently, our E2E tests are very flaky (fail about 40-50% of the time) due to fixed waiting time. This happens in both metrics and tracers E2E tests.

For example, tracer.test.ts waits for 2 minutes before finishing the beforeAll() function.

Sometimes, the trace come back later than two minutes. The test() blocks will fail.

Expected Behavior

The tests should not be flaky.

Current Behavior

The tests are flaky. If the traces/metrics appear after the fixed wait time.

Possible Solution

Implement retry The tests should never wait for fixed amount of time. Instead, it should poll for trace or metrics and retry until it gets expected number of traces. This could be implemented with promise-retry library. See example code in "Reference" section below
Introduce uuid However, we cannot simply count the number of traces or metrics because there might be another test running concurrently (e.g. one for Node14, another for Node12) . The traces/metrics found may be from another test running. Thus, we need to introduce a uuid in metadata and filter base on that. You can pass UUID via environment variable like in this e2e test in Logger

If you need refactoring, take note of Logger's e2e tests and try to structure it similar to that.

Steps to Reproduce

Run E2E tests a few times. You can use this script:

repeat 20 {jest --group=e2e/PACKAGE; sleep 0.5}

Environment

Powertools version used:0.7.0
Packaging format (Layers, npm): N/A
AWS Lambda function runtime: 12 and 14
Debugging logs: N/A

Related issues, RFCs

N/A

Reference

Here's an example implementation on fetching log stream with promise-retry library. It keep fetching until the log group has the atLeast number of log stream. You can adapt this with traces/metrics. (but need to check uuid)

export const fetchStreamsUntil = async (logGroupName: string, atLeast:number): Promise<string[]> => {
  const retryOption = {
    retries: 6,
    factor: 1.5,
    minTimeout: 1000, 
  }
  return promiseRetry(async (retry: any, attemptNumber: number) => {
    const streams = await fetchStreams(logGroupName)
    if(streams.length < atLeast){
      console.debug(`Found ${streams.length} log streams, retry until having at least ${atLeast} stream`);
      retry();
    }
    return streams;
  }, retryOption)
};

The text was updated successfully, but these errors were encountered:

ijemmy · 2022-03-08T16:16:19Z

Discussed with @AWSDB . He'll take this issue.

dreamorosi · 2022-03-08T16:50:33Z

@AWSDB could you leave a comment so I can assign the issue to you? (apologies for bothering you with this but GitHub doesn't let me add you if you haven't interacted with the issue first).

ghost · 2022-03-09T08:20:59Z

@dreamorosi That's strange. Anyway, here's my comment - happy to work on this issue!

github-actions · 2022-03-14T18:28:43Z

⚠️ COMMENT VISIBILITY WARNING ⚠️

Comments on closed issues are hard for our team to see.
If you need more assistance, please either tag a team member or open a new issue that references this one.
If you wish to keep having a conversation with other community members under this issue feel free to do so.

ijemmy added bug Something isn't working triage This item has not been triaged by a maintainer, please wait labels Mar 8, 2022

ijemmy assigned ijemmy and unassigned ijemmy Mar 8, 2022

saragerion added this to the production-ready-release milestone Mar 8, 2022

saragerion added the priority:high label Mar 8, 2022

dreamorosi assigned ghost Mar 9, 2022

ghost mentioned this issue Mar 11, 2022

fix(tracer, metrics): use polling instead of fixed wait in e2e tests #654

Merged

13 tasks

dreamorosi closed this as completed Mar 14, 2022

dreamorosi removed the triage This item has not been triaged by a maintainer, please wait label Oct 19, 2022

dreamorosi changed the title ~~Bug (metrics, tracer): avoid fixed-time waiting in our E2E testings~~ Maintenance: avoid fixed-time waiting in our E2E testings Nov 14, 2022

dreamorosi added automation This item relates to automation tests PRs that add or change tests completed This item is complete and has been merged/shipped and removed bug Something isn't working labels Nov 14, 2022

dreamorosi added this to AWS Lambda Powertools for TypeScript Nov 14, 2022

dreamorosi moved this to Shipped in AWS Lambda Powertools for TypeScript Nov 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Maintenance: avoid fixed-time waiting in our E2E testings #644

Maintenance: avoid fixed-time waiting in our E2E testings #644

ijemmy commented Mar 8, 2022 •

edited

Loading

ijemmy commented Mar 8, 2022

dreamorosi commented Mar 8, 2022

ghost commented Mar 9, 2022

github-actions bot commented Mar 14, 2022

Maintenance: avoid fixed-time waiting in our E2E testings #644

Maintenance: avoid fixed-time waiting in our E2E testings #644

Comments

ijemmy commented Mar 8, 2022 • edited Loading

Bug description

Expected Behavior

Current Behavior

Possible Solution

Steps to Reproduce

Environment

Related issues, RFCs

Reference

ijemmy commented Mar 8, 2022

dreamorosi commented Mar 8, 2022

ghost commented Mar 9, 2022

github-actions bot commented Mar 14, 2022

⚠️ COMMENT VISIBILITY WARNING ⚠️

ijemmy commented Mar 8, 2022 •

edited

Loading