Intermittent E2E test failure: ERROR: 1 setup jobs failed. Check logs above for details #3796

wallrj · 2021-03-24T11:43:31Z

The E2E tests sometimes fail during setup with the message:

 ERROR: 1 setup jobs failed. Check logs above for details.

-- https://prow.build-infra.jetstack.net/view/gcs/jetstack-logs/pr-logs/pull/jetstack_cert-manager/3647/pull-cert-manager-e2e-v1-20/1359494849920765955#1:build-log.txt%3A668 (thanks @irbekrm )

The setup jobs are launched as background shell jobs:

https://github.com/jetstack/cert-manager/blob/7204284063702358164713379b07b73642ea9bec/devel/setup-e2e-deps.sh

But the logs of all the setup jobs are mixed up, so it's difficult to know which job failed and what went wrong.

Simplest solution would be to disable the parallelism and run the setup jobs in series.

Alternatively we might try and keep track of exactly which jobs failed and show the logs of those jobs on failure.

And that should allow us to find out which of the jobs keeps failing.

/kind flake
/cc @RinkiyaKeDad

The text was updated successfully, but these errors were encountered:

RinkiyaKeDad · 2021-03-24T14:08:10Z

/assign

RinkiyaKeDad · 2021-03-26T08:38:29Z

Hi @wallrj! I am relatively new to writing bash scripts so just wanted to confirm a few things:

Here wait $job returns a boolean value right? And if that is false then EXIT is incremented. So that would mean the simplest solution you were referring to would have this pseudo code:

for job in $(jobs -p); do
    if !(wait $job)
        echo "{logs}"
        break 
done

Like you mentioned,

Alternatively we might try and keep track of exactly which jobs failed and show the logs of those jobs on failure.

Would this be recommended over the first approach?

I can see that the benefit of this would be that we'll be able to see all the jobs that fail in a single test run.

But the downside would be that if we know that one job has failed, we know the entire test will fail so we would be using unnecessary resources running all other jobs.

The current message just says to check logs for details. Can you please point me to how I would get access to these logs in the shell script?

Thanks!

irbekrm · 2021-04-15T12:16:06Z

/priority important-soon

wallrj added good first issue area/testing kind/flake labels Mar 24, 2021

jetstack-bot assigned RinkiyaKeDad Mar 24, 2021

RinkiyaKeDad mentioned this issue Mar 28, 2021

feat: fix logs display for e2e test failures #3815

Merged

RinkiyaKeDad mentioned this issue Mar 31, 2021

restoring parallel setup behaviour for e2e tests #3825

Open

jetstack-bot added the priority/important-soon label Apr 15, 2021

jetstack / cert-manager

Intermittent E2E test failure: ERROR: 1 setup jobs failed. Check logs above for details #3796

Intermittent E2E test failure: ERROR: 1 setup jobs failed. Check logs above for details #3796

wallrj commented Mar 24, 2021

RinkiyaKeDad commented Mar 24, 2021

RinkiyaKeDad commented Mar 26, 2021

irbekrm commented Apr 15, 2021

jetstack / cert-manager

Intermittent E2E test failure: ERROR: 1 setup jobs failed. Check logs above for details #3796

Intermittent E2E test failure: ERROR: 1 setup jobs failed. Check logs above for details #3796

Comments

wallrj commented Mar 24, 2021

RinkiyaKeDad commented Mar 24, 2021

RinkiyaKeDad commented Mar 26, 2021

irbekrm commented Apr 15, 2021