Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replicate webhook pods for resiliency #3386

Closed
ImJasonH opened this issue Oct 14, 2020 · 2 comments
Closed

Replicate webhook pods for resiliency #3386

ImJasonH opened this issue Oct 14, 2020 · 2 comments

Comments

@ImJasonH
Copy link
Collaborator

@ImJasonH ImJasonH commented Oct 14, 2020

Currently we run one replica of the webhook deployment, which makes it a single point of failure in the event of a node outage. The job doesn't do anything that requires running as a single replica, so we should investigate running multiple replicas.

We can crib from Knative Serving, which configures:

/kind feature

@afrittoli
Copy link
Member

@afrittoli afrittoli commented Oct 14, 2020

Thanks for this.
I think the pod anti-affinity will also mean that we will have one pod in case of a single-node cluster, which would be perfect as I'd like to preserve the smallest possible footprint for dev / test scenarios.

@raballew
Copy link
Contributor

@raballew raballew commented Oct 14, 2020

Awesome idea! This is not only true for a node outage but also heavy load scenarios for the event listeners. In one of our production environments we faced situations, where the event listener got unresponsive because no auto scaling was in place and too many requests were send to it.

@raballew raballew mentioned this issue Oct 15, 2020
4 of 4 tasks complete
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

4 participants
You can’t perform that action at this time.