Replace TimeoutCancellationException with TimeoutException #1374

qwwdfsad · 2019-07-24T12:00:52Z

In the current version of the library, withTimeout throws TimeoutCancellationException when the timeout is exceeded.

It can lead to very subtle errors, for example:

launch {
    val result = withTimeout(...) {
        // ... some computation ...
    }
    // process result
}

TimeoutCancellationException is a CancellationException and is therefore never reported.
But in the snippet above, a timeout is likely to be a programming error. If missing the deadline is expected, then withTimeoutOrNull should be used explicitly.
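
To make the "never reported" part concrete, here is a minimal sketch of the situation. The helper name runTimedOutLaunch and the standalone scope are assumptions made for illustration; only withTimeout, launch, and CoroutineExceptionHandler come from the library.

```kotlin
import kotlinx.coroutines.*

// Hypothetical helper: launches a coroutine that times out and returns
// (job.isCancelled, exception seen by the CoroutineExceptionHandler or null).
suspend fun runTimedOutLaunch(): Pair<Boolean, Throwable?> {
    var reported: Throwable? = null
    val handler = CoroutineExceptionHandler { _, e -> reported = e }
    // Standalone scope, so a real failure would reach the handler.
    val scope = CoroutineScope(Job() + handler)
    val job = scope.launch {
        withTimeout(50) { delay(1_000) } // throws TimeoutCancellationException
        println("not reached")
    }
    job.join()
    return job.isCancelled to reported
}

fun main(): Unit = runBlocking {
    val (cancelled, reported) = runTimedOutLaunch()
    println("cancelled=$cancelled, reported=$reported")
}
```

With the current behaviour this should print cancelled=true, reported=null: the coroutine ends as cancelled and the handler is never invoked, so nothing is logged.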

My proposal is to deprecate TimeoutCancellationException and replace it with a TimeoutException that is not a CancellationException.


artbez · 2020-10-15T15:13:15Z

Hi, @qwwdfsad !
My team and I have also faced this problem. Do you have an estimate of when this issue will be resolved?

qwwdfsad · 2020-10-16T08:41:39Z

Could you please elaborate on the problem you've faced?

artbez · 2020-10-16T14:27:50Z

Sure. We use withTimeout to impose deadlines on our inner functions. Any exception other than CancellationException is reported from our business-logic code, but TimeoutCancellationException is not. We want to handle such exceptions like any others and, as you said before, that would be possible if withTimeout threw TimeoutException instead of TimeoutCancellationException. Another case is when we use launch within coroutineScope: an exceeded deadline in one job doesn't cancel the other jobs. Of course, we can use our own custom launch that catches TimeoutCancellationException and throws TimeoutException, or we can use withTimeoutOrNull. But we'd like to know whether you plan to change the behavior of the withTimeout function.
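
The coroutineScope case described above can be sketched as follows. The function name siblingSurvivesTimeout and the delays are illustrative assumptions, not code from our project.

```kotlin
import kotlinx.coroutines.*

// Hypothetical demo: returns true if the second job ran to completion
// even though the first job exceeded its deadline.
suspend fun siblingSurvivesTimeout(): Boolean {
    var siblingCompleted = false
    coroutineScope {
        // Times out; the resulting TimeoutCancellationException only cancels this job.
        launch { withTimeout(50) { delay(1_000) } }
        // Keeps running, because a cancelled child does not cancel its parent.
        launch {
            delay(200)
            siblingCompleted = true
        }
    }
    return siblingCompleted
}

fun main(): Unit = runBlocking {
    println("sibling completed: ${siblingSurvivesTimeout()}")
}
```

This should print sibling completed: true, and coroutineScope itself returns normally. If withTimeout threw a non-cancellation exception, the first job would fail, the scope would cancel the sibling, and the exception would be rethrown to the caller.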

Thank you!

qwwdfsad · 2021-11-11T16:40:56Z

The preliminary design decision is to deprecate the withTimeout function with @LowPriorityInOverloadResolution and re-introduce it under the same name in the brand new package kotlinx.coroutines.time.

The change is currently blocked by an IDE bug: the deprecation replacement does not work when the replacement has the same name but lives in a different package.

dkhalanskyjb · 2022-02-09T10:29:31Z

This could be a good time to also avoid the behavior described in #1914 by checking at the end of the block whether the coroutine was cancelled and throwing in that case.

qwwdfsad · 2023-01-03T15:07:10Z

The same applies to cancellation that happens before withTimeout even starts (that is also observed in #1914).

Historically, it was on par with withContext behaviour. Though the latter actively evolved (#962, #785, then #791), while withTimeout has been left as is. Its behaviour is clearly lagging behind what is expected from it by default and is worth changing.

It seems like we already have three potential changes in the function semantics:

1. Exception type (the most important one: CE -> non-CE)
2. Non-atomic cancellation on completion (it will also help to get rid of all the hacks in our undispatchedResult and startUndispatchedOrReturnIgnoreTimeout)
3. Avoid entering the body of withTimeout if the timeout already kicked in before the coroutine starts
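
As a rough user-level illustration of the cancellation checks discussed above (not entering the block when the caller is already cancelled, and re-checking when the block completes), one could wrap the current function. The name withTimeoutStrict is hypothetical, and this sketch does not change the exception type or the library internals.

```kotlin
import kotlin.time.Duration
import kotlin.time.Duration.Companion.seconds
import kotlinx.coroutines.*

// Hypothetical wrapper, not a library API.
suspend fun <T> withTimeoutStrict(timeout: Duration, block: suspend CoroutineScope.() -> T): T {
    // Do not enter the block if the calling coroutine is already cancelled.
    currentCoroutineContext().ensureActive()
    val result = withTimeout(timeout, block)
    // Do not hand a result back to a caller that was cancelled in the meantime.
    currentCoroutineContext().ensureActive()
    return result
}

fun main(): Unit = runBlocking {
    var entered = false
    val job = launch {
        cancel() // cancel this coroutine before the timeout block starts
        withTimeoutStrict(1.seconds) { entered = true }
    }
    job.join()
    println("entered=$entered")
}
```

Here the first ensureActive() throws a CancellationException, so the block is never entered and the program should print entered=false.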

qwwdfsad · 2023-01-03T15:11:26Z

It may be that the behaviour change is too breaking, and thus we should come up with a better name instead of silently shadowing the original signature.
What concerns me is that timeouts are non-deterministic and are likely to happen under load or when things go south, meaning that users are likely to learn about this change from production crashes rather than from regular testing.

jQrgen · 2023-03-01T12:21:27Z

Relevant: https://betterprogramming.pub/the-silent-killer-thats-crashing-your-coroutines-9171d1e8f79b

wkornewald · 2023-03-04T03:39:53Z

Relevant: https://betterprogramming.pub/the-silent-killer-thats-crashing-your-coroutines-9171d1e8f79b

The article correctly describes some root causes of the problem, i.e. CancellationException being used in places where an error-indicating exception would be needed instead.

However the article's proposed solution is to treat all abortable cancellations as errors (i.e. where the job wasn't canceled). This can't be correct. Just because >90% of CancellationExceptions use Job cancellation doesn't mean you can turn all other cancellations into errors. Ignoring those abortable cases with "but you shouldn't be catching errors there anyway" or "So far, I haven’t found any major issues" is a shaky argument (and in our codebase it would quickly hit issues). The article's solution is a workaround that breaks fundamental "safe bubbling" behavior of cancellations where you don't want to trigger error handling (like AbortFlowException).

The real solution is to fix the incorrect usages of CancellationException or at least provide alternative APIs where breaking changes wouldn't be acceptable. The bubbling behavior should not be broken.

jQrgen · 2023-03-06T09:30:38Z

@wkornewald, thanks for the clarification.

joffrey-bion · 2023-03-09T06:18:02Z

Is there a similar plan to "fix" Deferred.await() / SendChannel.send() / ReceiveChannel.receive() in the same way? i.e. only throw CancellationException when the calling coroutine is cancelled, but NOT in situations where the producer/consumer on the other end is cancelled (in which case some other exception that's not a subtype of CancellationException would be more appropriate, in order to avoid cancelling the calling coroutine silently)
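
For reference, a small sketch of the Deferred.await() case I mean. The function name awaitCancelledProducer is made up for the example.

```kotlin
import kotlinx.coroutines.*

// Hypothetical demo: returns (await threw a CancellationException, caller is still active).
suspend fun awaitCancelledProducer(): Pair<Boolean, Boolean> = coroutineScope {
    val deferred = async<Int> { awaitCancellation() }
    deferred.cancel() // only the producer is cancelled, not the caller
    val threwCancellation = try {
        deferred.await()
        false
    } catch (e: CancellationException) {
        // Caught here only to observe it; normally it would propagate silently.
        true
    }
    threwCancellation to isActive
}

fun main(): Unit = runBlocking {
    val (threw, callerActive) = awaitCancelledProducer()
    println("await threw CancellationException=$threw, caller active=$callerActive")
}
```

Both values should be true: the caller receives a CancellationException although it was never cancelled itself.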

dkhalanskyjb · 2023-03-09T07:42:47Z

We are yet to decide what to do about this. See #3658

Legion2 · 2023-04-27T21:06:43Z

Also facing this inconsistency. As a workaround, I now use withTimeoutOrNull and throw an exception on a null value. It took me a long time to figure out why my code got silently cancelled instead of getting a real exception that explains the timeout.
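
A minimal sketch of that workaround, assuming a custom exception type. Both DeadlineExceededException and withTimeoutOrThrow are names invented for this example.

```kotlin
import kotlin.time.Duration
import kotlin.time.Duration.Companion.milliseconds
import kotlinx.coroutines.*

// Hypothetical exception type; deliberately not a CancellationException.
class DeadlineExceededException(message: String) : Exception(message)

// T is non-nullable so that a null result can only mean "timed out".
suspend fun <T : Any> withTimeoutOrThrow(
    timeout: Duration,
    block: suspend CoroutineScope.() -> T
): T = withTimeoutOrNull(timeout, block)
    ?: throw DeadlineExceededException("Timed out after $timeout")

fun main(): Unit = runBlocking {
    try {
        withTimeoutOrThrow(50.milliseconds) {
            delay(1_000)
            "done"
        }
    } catch (e: DeadlineExceededException) {
        println("caught: ${e.message}")
    }
}
```

The T : Any bound is a design choice: with a nullable T, a block that legitimately returns null would be indistinguishable from a timeout.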

masc3d · 2023-07-05T21:08:37Z

This will also happen when combining the flow operators timeout and merge. It is very confusing and difficult to track down in complex flows.

scherrsasrf · 2025-01-02T14:34:34Z

Just wasted 4 hours tracking down jobs that were cancelled on timeout but didn't throw an exception that could be logged. We are now using something like this:

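import kotlin.time.Duration
import kotlinx.coroutines.TimeoutCancellationException
import kotlinx.coroutines.withTimeout

// Not a CancellationException, so it is reported like any other failure.
class TimeoutException(cause: Exception) : Exception(cause)

// Rethrows the timeout as a regular exception instead of a silent cancellation.
suspend fun <T> withDeadline(timeout: Duration, block: suspend () -> T): T = try {
    withTimeout(timeout) { block() }
} catch (e: TimeoutCancellationException) {
    throw TimeoutException(e)
}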

kevincianfarini · 2025-01-23T15:43:38Z

I understand that the maintainers are working on how to properly fix this design flaw, but in the meantime, can we mark the regular withTimeout function as requiring opt-in? Acknowledging the design flaw in a highly visible way will help prevent future uses of withTimeout while the recommended solution is to use withTimeoutOrNull and throw a custom exception.

eygraber · 2025-01-28T05:24:14Z

I knew about this issue and still got bit by it.

qwwdfsad · 2025-02-18T17:28:50Z

Here is the discussion of a potential solution: #4356

qwwdfsad added the enhancement, design, and breaking change labels on Jul 24, 2019

qwwdfsad self-assigned this Jul 24, 2019

qwwdfsad mentioned this issue Nov 17, 2020

withTimeout never thrown TimeoutCancellationException #2394

Closed

dkhalanskyjb mentioned this issue Oct 1, 2021

Provided example test for withTimeout fails #1390

Closed

qwwdfsad mentioned this issue Aug 22, 2022

TimeoutCancellationException is thrown inconsistently between flatMapConcat and flatMapLatest #3392

Open

dkhalanskyjb mentioned this issue Oct 31, 2022

Make withTimeout throw not a CancellationException #3515

Closed

qwwdfsad mentioned this issue Jan 3, 2023

Coroutines, exception handling and withTimeout. Can't wrap my head around this combination #1914

Closed

amal mentioned this issue Jan 18, 2023

Code that uses withTimeout often fails and cannot be safely test covered using the kotlinx.coroutines.test #3588

Closed

dkhalanskyjb mentioned this issue Apr 14, 2023

TimeoutCancellationException inconsistently bubbled up #3716

Open

dkhalanskyjb mentioned this issue Jul 24, 2023

CancellationException can be thrown not only to indicate cancellation #3658

Open

kostya05983 pushed a commit to kinfra/kinfra-commons that referenced this issue Feb 17, 2024

make withDeadline throw TimeoutException

ffb2d4a

See Kotlin/kotlinx.coroutines#1374

dkhalanskyjb mentioned this issue Jan 23, 2025

No Issue #4338

Closed

kevincianfarini mentioned this issue Feb 18, 2025

Introduce migration path for long-standing issue of withTimeout #4356

Draft
