kv,server: do more verbose logging of graceful drain range lease transfer, once the majority of ranges leases are transferred over #65659
Comments
|
NB: the logging you're requesting is already available via the vmodule setting. |
|
That is my suggestion but ONLY once we get down to N=10 (not sure what N should be) or so range leases still held on the node. Do you think that make sense? I think there is no reason to not be very verbose once you are down to a small number of ranges. What do you think about that? |
|
I'm not opposed to it. |
|
Ack. I think how useful it would be depends on the scope of the issue we see on CC. I can get more info about scope without this change. I can post here when I have a sense of scope. |
|
@tbg has suggested that we can use a verbose condition, since draining could stall/encounter errors at a point where the number of remaining leases is greater than N. We can set verbose to true when the number of remaining leases does not decrease (or worse increases) with graceful drain calls. |
Is your feature request related to a problem? Please describe.
Here are some example logs:
Above logs show 1m passing without graceful drain finishing (without all range lease transfers moving over), leading to some impact during update.
More detailed logs might help with debugging. It's not clear what is going wrong from the above logs.
Describe the solution you'd like
I expect it would help debugging if we logged more verbose info once we got down to N=10 or so range leases still held on the node. We could log:
Describe alternatives you've considered
We could do nothing, relying on operators using vmodule to get the additional verbosity when needed.
Additional context
On CC, graceful drain doesn't always finish before SIGKILL time. We want to fix this.
Epic: CRDB-8501
The text was updated successfully, but these errors were encountered: