(RHEL-9322) cgroup: drastically simplify caching of cgroups members mask #295
base: rhel-8.10.0
Conversation
I think we shouldn't just cherry-pick the single commit from the lengthy series in systemd/systemd#10894 and hope for the best. I have a gut feeling this will introduce regressions in our handling of cgroups. At the very least we shouldn't be introducing dead code.
src/core/unit.h (Outdated)
CGroupMask cgroup_members_mask;
CGroupMask cgroup_realized_mask; /* In which hierarchies does this unit's cgroup exist? (only relevant on cgroupsv1) */
CGroupMask cgroup_enabled_mask; /* Which controllers are enabled (or more correctly: enabled for the children) for this unit's cgroup? (only relevant on cgroupsv2) */
CGroupMask cgroup_invalidated_mask; /* A mask specifiying controllers which shall be considered invalidated, and require re-realization */
cgroup_invalidated_mask isn't used anywhere. Also, the comment contains a typo ("specifiying").
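For reference, upstream this field feeds the cgroup realization logic roughly as sketched below (an approximation, not the verbatim upstream code, and the helper names are the upstream ones, which may not all exist on this branch): the unit records which controllers need re-realization and is queued, instead of being re-realized on the spot.

/* Rough sketch of how cgroup_invalidated_mask is consumed upstream (approximate). */
void unit_invalidate_cgroup(Unit *u, CGroupMask m) {
        assert(u);

        if (m == 0)
                return;

        if ((u->cgroup_invalidated_mask & m) == m) /* nothing new to invalidate */
                return;

        u->cgroup_invalidated_mask |= m;
        unit_add_to_cgroup_realize_queue(u);
}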
I attempted to backport the whole series at first, but abandoned that approach in the end. The PR depends on previous PRs; it would likely take several days to sort it all out and backport everything needed.
This way we can correctly ensure that when a unit that requires some controller goes away, we propagate the removal of it all the way up, so that the controller is turned off in all the parents too. (cherry picked from commit b8b6f32) Related: RHEL-9322
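The propagation amounts to invalidating the cached members mask for the unit and for every slice above it, so the next realization recomputes what the subtree still needs. A minimal sketch of that idea (assuming the cgroup_members_mask_valid flag introduced by the upstream series; not necessarily the verbatim cherry-picked code):

void unit_invalidate_cgroup_members_masks(Unit *u) {
        assert(u);

        /* Drop this unit's cached members mask ... */
        u->cgroup_members_mask_valid = false;

        /* ... and, recursively, that of every parent slice up to the root, so
         * that controllers no longer needed anywhere in the subtree can be
         * turned off in the parents on the next realization. */
        if (UNIT_ISSET(u->slice))
                unit_invalidate_cgroup_members_masks(UNIT_DEREF(u->slice));
}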
Force-pushed from 78e2717 to 83ada01
(cherry picked from commit 43738e0) Related: RHEL-9322
Force-pushed from 83ada01 to b719f06
The CI failures looked related, but to be sure they are not just flukes, @lnykryn force-pushed the feature branch to rerun the checks.
Unit tests seem to be failing because of #434, but I'm not sure about CentOS CI.
The CentOS CI failure is concerning. @dtardon, can you please have a look?
Oh well, the CentOS CI for this repo is currently FUBAR, since CentOS Stream 8 is no more. I'll need to figure something out to make it work again...
@mrc0mmand Maybe we could arrange a "Red Hat Developer Subscription for Teams", i.e. a self-service subscription, and then generate an activation key that we would store as a GH secret and use to enable all the needed repos in the UBI RHEL container image (e.g. CRB).
I mean, yeah, that would work for the container stuff (and I plan to actually do that), but this wouldn't work for CentOS CI, since, as the name suggests, it only supports CentOS. So I'll move the C8S job to C9S, which should work just fine. The environment will be different, but it should be enough for making sure we haven't royally screwed something up (we run the same tests internally on actual RHEL 8 anyway).
CentOS CI should be back (albeit with some compromises, see systemd/systemd-centos-ci@f415a75). I'll look into the container stuff next.
A quick workaround for the current C8S containers is in #440.
Previously we tried to be smart: when a new unit appeared and it only
added controllers to the cgroup mask we'd update the cached members mask
in all parents by ORing in the controller flags in their cached values.
Unfortunately this was quite broken, as we missed some conditions when
this cache had to be reset (for example, when a unit got unloaded),
moreover the optimization doesn't work when a controller is removed
anyway (as in that case there's no way around the parent iterating
through all the children to check whether any other, remaining child
unit still needs it).
Hence, let's simplify the logic substantially: instead of updating the
cache on the right events (which we didn't get right), let's simply
invalidate the cache, and generate it lazily when we encounter it later.
This should actually result in better behaviour as we don't have to
calculate the new members mask for a whole subtree whenever we have the
suspicion something changed, but can delay it to the point where we
actually need the members mask.
This allows us to simplify things quite a bit, which is good, since
validating this cache for correctness is hard enough.
Fixes: #9512
(cherry picked from commit 5af8805)
Resolves: #2096371
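The lazy regeneration described above would look roughly like the sketch below (assuming the cgroup_members_mask_valid flag from the upstream series and the usual slice-membership iteration via the UNIT_BEFORE dependencies; the actual cherry-picked code may differ in detail):

CGroupMask unit_get_members_mask(Unit *u) {
        assert(u);

        /* Return the union of controllers this unit's children require. */

        if (u->cgroup_members_mask_valid)
                return u->cgroup_members_mask; /* the cache is still valid, reuse it */

        /* The cache was invalidated at some point; recompute it now that it
         * is actually needed. */
        u->cgroup_members_mask = 0;

        if (u->type == UNIT_SLICE) {
                void *v;
                Unit *member;
                Iterator i;

                HASHMAP_FOREACH_KEY(v, member, u->dependencies[UNIT_BEFORE], i)
                        if (UNIT_DEREF(member->slice) == u)
                                u->cgroup_members_mask |= unit_get_subtree_mask(member);
        }

        u->cgroup_members_mask_valid = true;
        return u->cgroup_members_mask;
}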