Create a plugin to integrate with systemd watchdog for daemon health#13188
Open
chilkaditya wants to merge 2 commits intocontainerd:mainfrom
Open
Create a plugin to integrate with systemd watchdog for daemon health#13188chilkaditya wants to merge 2 commits intocontainerd:mainfrom
chilkaditya wants to merge 2 commits intocontainerd:mainfrom
Conversation
Member
CI is failing because you did not sign-off your commit. Please add the appropriate Signed-off-by line to indicate your acceptance of the Developer Certificate of Origin. |
Member
48b2843 to
9b03453
Compare
Signed-off-by: chilkaditya <apurkahini@gmail.com>
9b03453 to
67b6f61
Compare
Signed-off-by: chilkaditya <apurkahini@gmail.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This PR adds a watchdog plugin to containerd that integrates with the systemd watchdog.
The plugin performs lightweight internal health checks and sends WATCHDOG=1 notifications only when the daemon is healthy. If containerd becomes unresponsive, notifications stop and systemd restarts the service.
Now here we are healthchecking for metadata store and content store because MetadataPlugin stores all persistent state: container records, image manifests, snapshot references, lease data everything. ContentPlugin stores the actual image layer blobs.
How this health check is done?
I have tested this in my setup by setting up WatchdogSec = 60s in container.service.d.
Fix - #10329