Add `failfast` option #133

nickrobinson251 · 2024-01-06T19:15:54Z

Close Feature request: Fail-fast option #132
Other test frameworks have a similar option
- Go go test has -failfast
- Rust cargo test has -no-fail-fast
- Python pytest has --exitfirst
- Ruby rspec has --fail-fast
- Javascript jest has bail (https://jestjs.io/docs/configuration#bail-number--boolean)
Also (TIL) the Test.jl stdlib has failfast since Julia v1.9 (JuliaLang/julia@88def1a)

Since for us "running tests" involves running test-items which themselves can run multiple tests, we can "fail fast" at two levels:

Stop runtests from running new testitems as soon as one returns as a failure/error (mark that whole run as a failure)
Stop @testitem from running the tests inside as soon as there is a failure/error (mark that test-item as a failure)

Originally this PR added just a failfast keyword to runtests: runtests(..., failfast=true) stops as soon as any testitem fails.

But i've extended it to add the ability for an @testitem to set failfast=true: @testitem "foo" failfast=true stops on the first test error/failure. This can be set for all test-items by a second new runtests keyword named testitem_failfast, (i.e. this can be used to set the default for all testitems).

This matches how @testitem "foo" timeout=60 corresponds to runtests(...; testitem_timeout=60)

failfast defaults to false. I've set testitem_failfast to default to the same value as given to failfast (if not set explicitly on a @testitem).

So the proposed behaviour is:

runtests(...) => neither runtests nor individual testitems stop early
runtests(...; failfast=true) => both stop early
runtests(...; testitem_failfast=true) => only testitems stop early
runtests(...; failfast=true, testitem_failfast=false) => only runtests stops early

API question

Would it be simpler to have the separate keywords operate independently?

The downside would be you have to set both to true to get "fail fastest", i.e. to both have individual testitems stop when they hit an error/failure, and to have no new testitems run once one has hit an error/failure you have to run with runtests(...; failfast=true, testitem_failfast=true)

Alternative (keywords operate independently):

runtests(...;) => neither runtests nor individual testitems stop early
runtests(...; failfast=true) => only runtests stops early
runtests(...; testitem_failfast=true) => only testitems stop early
runtests(...; failfast=true, testitem_failfast=true) => both stop early

Really the question is whether failfast=true should default to turning on both forms of early stopping (as currently proposed by this PR), or if it should mean only runtests fails fast?

I would quite like the seperation of the two... one keyword controls one, the other controls the other... BUT I think in practice it is more ergonomic for users to just pass failfast=true to get the "fastest" failures (hence proposing that) ...i'd love some feedback on this decision!

Drvi

I like the feature and I agree with the current API. 👍

I didn't manage to get through all of this today, will continue tomorrow.

test/integrationtests.jl

src/ReTestItems.jl

Drvi

This looks good to me!

It's a bit sad that we can't guarantee actual fast failing when nworkers > 1 since then the current test items scheduled on other test workers might take up to timeout seconds to finish. What if we pretend that the timeout is effectively zero and just kill any other workers to guarantee truly fast failure? The user likely doesn't care about all the test results when they run runtests with failfast... especially if we'd call the kwarg failfastandfurious:)

Drvi · 2024-09-17T11:59:02Z

src/ReTestItems.jl

@@ -278,7 +301,7 @@ end
 # By tracking and reusing test environments, we can avoid this issue.
 const TEST_ENVS = Dict{String, String}()

-function _runtests(ti_filter, paths, nworkers::Int, nworker_threads::String, worker_init_expr::Expr, test_end_expr::Expr, testitem_timeout::Int, retries::Int, memory_threshold::Real, verbose_results::Bool, debug::Int, report::Bool, logs::Symbol, timeout_profile_wait::Int, gc_between_testitems::Bool)
+function _runtests(ti_filter, paths, nworkers::Int, nworker_threads::String, worker_init_expr::Expr, test_end_expr::Expr, testitem_timeout::Int, retries::Int, memory_threshold::Real, verbose_results::Bool, debug::Int, report::Bool, logs::Symbol, timeout_profile_wait::Int, gc_between_testitems::Bool, failfast::Bool, testitem_failfast::Bool)


Not necessarily for this PR, but I think we should just bundle all these args into a Context struct

for sure! #186

nickrobinson251 · 2024-09-18T10:12:40Z

It's a bit sad that we can't guarantee actual fast failing when nworkers > 1 since then the current test items scheduled on other test workers might take up to timeout seconds to finish. What if we pretend that the timeout is effectively zero and just kill any other workers to guarantee truly fast failure?

Yeah, i wonder about that too 🤔 It was so long ago that i did this i can't remember why i didn't go kill the other workers... I'll need to look into it. I suspect/hope it was laziness/simplicity (i.e. this implementation is so simple, because we just have next_testitem check for a Bool, so there's no need to coordinate workers)

In fact, i wonder if it's even worse than "might take up to timeout seconds to finish", because of retries? in which case that could be pretty bad, and we'd have to just name this failfastish or failabitfaster

nickrobinson251 · 2024-09-18T10:17:37Z

Buuut, also i've no time to work on this at the minute, and it's a pain to keep rebasing and a bit of a shame not to have it at least in it's current form... so i think i might just update the documentation to call-out that we wait for testitems on others workers to finish in the current implementation, and say this may change in future releases to proactively cancel other running testitems to enable even faster failures, and merge what's here (i.e. let us land that improvement in a follow-up, non-breaking release) -- what do you think?

To make clear this may change in a non-breaking release

Drvi · 2024-09-18T12:17:48Z

Good point about the retries! I wonder if retries could be skipped relatively easily by checking first whether the run has been canceled. But I think it's fine to refine the behavior of this feature in the future 👍

nickrobinson251 force-pushed the npr-fail-fast branch 4 times, most recently from 8bd277b to 9051fd7 Compare January 12, 2024 00:55

nickrobinson251 requested review from Drvi and NHDaly January 12, 2024 00:56

nickrobinson251 changed the title ~~[WIP] Add failfast option~~ Add failfast option Jan 12, 2024

nickrobinson251 marked this pull request as ready for review January 12, 2024 01:18

Drvi reviewed Jan 23, 2024

View reviewed changes

test/integrationtests.jl Show resolved Hide resolved

src/ReTestItems.jl Outdated Show resolved Hide resolved

nickrobinson251 force-pushed the npr-fail-fast branch from fc2bae2 to 1adf0d0 Compare May 3, 2024 22:18

nickrobinson251 force-pushed the npr-fail-fast branch from 1adf0d0 to a5d08a4 Compare August 5, 2024 10:53

nickrobinson251 force-pushed the npr-fail-fast branch from 0d174a0 to 4c87b2f Compare August 15, 2024 10:42

nickrobinson251 force-pushed the npr-fail-fast branch from 4c87b2f to bcb747e Compare September 12, 2024 22:01

nickrobinson251 requested review from Drvi and NHDaly and removed request for NHDaly September 12, 2024 22:04

Drvi approved these changes Sep 17, 2024

View reviewed changes

nickrobinson251 added 11 commits September 18, 2024 12:54

Add failfast

5378395

support ENV var

57ed657

Improve logging for failfast mode

9f78d86

Add tests

4ee0922

Handle @testset failfast=true

439d3cb

Bump test timeout to avoid unexpected failures

685f805

Add testitem failfast

248051b

Switch to separate testitem_failfast keyword

9670e39

testitem failfast printing

2f08d6c

Fallback to testset timing if tests errored

a624770

Add testitem failfast tests

148b53e

nickrobinson251 added 4 commits September 18, 2024 12:55

Make testset name more distinctive

884661e

More docs

70900b5

Setting failfast sets testitem_failfast by default

aff563a

Remove racy cancellation check and switch to an atomic

Loading
Loading status checks…

962b128

nickrobinson251 force-pushed the npr-fail-fast branch from bcb747e to 962b128 Compare September 18, 2024 11:56

nickrobinson251 added 3 commits September 18, 2024 13:02

Document current limitation of failfast with multiple workers

b52d725

To make clear this may change in a non-breaking release

Bump version

Loading
Loading status checks…

475fd5d

fixup! Document current limitation of failfast with multiple workers

Loading
Loading status checks…

64441fb

nickrobinson251 mentioned this pull request Sep 18, 2024

faster failfast by cancelling all test-items running in parallel when one fails #189

Open

nickrobinson251 enabled auto-merge (squash) September 18, 2024 12:38

nickrobinson251 merged commit fc7845b into main Sep 18, 2024
7 checks passed

nickrobinson251 deleted the npr-fail-fast branch September 18, 2024 12:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add `failfast` option #133

Add `failfast` option #133

nickrobinson251 commented Jan 6, 2024 •

edited

Loading

Uh oh!

Drvi left a comment

Uh oh!

Uh oh!

Uh oh!

Drvi left a comment

Uh oh!

Drvi Sep 17, 2024

Uh oh!

nickrobinson251 Sep 18, 2024

Uh oh!

nickrobinson251 commented Sep 18, 2024

Uh oh!

nickrobinson251 commented Sep 18, 2024 •

edited

Loading

Uh oh!

Drvi commented Sep 18, 2024

Uh oh!

Uh oh!

Add failfast option #133

Add failfast option #133

Conversation

nickrobinson251 commented Jan 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

API question

Uh oh!

Drvi left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Drvi left a comment

Choose a reason for hiding this comment

Uh oh!

Drvi Sep 17, 2024

Choose a reason for hiding this comment

Uh oh!

nickrobinson251 Sep 18, 2024

Choose a reason for hiding this comment

Uh oh!

nickrobinson251 commented Sep 18, 2024

Uh oh!

nickrobinson251 commented Sep 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Drvi commented Sep 18, 2024

Uh oh!

Uh oh!

Add `failfast` option #133

Add `failfast` option #133

nickrobinson251 commented Jan 6, 2024 •

edited

Loading

nickrobinson251 commented Sep 18, 2024 •

edited

Loading