
Conversation


@huww98 huww98 commented Oct 30, 2025

What type of PR is this?

/kind bug

What this PR does / why we need it:

When the controller starts, two sync() calls run simultaneously, one from HasSynced() and one from processNextWorkItem(). Each produces its own instance for the same topology segment and passes it to the callbacks.

This results in duplicated entries in the capacities map, leading to either:

  • Two CSIStorageCapacity objects get created for the same topology, or
  • The same CSIStorageCapacity object gets assigned to two keys in the capacities map. When one of them is updated, the other holds an outdated object, and all subsequent updates fail with a conflict.
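To illustrate the root cause, here is a minimal standalone sketch (not the sidecar's actual code) of why a pointer-keyed map picks up duplicates: two distinct *Segment allocations with identical content are two distinct keys, so both survive in the capacities map.

package main

import "fmt"

// Segment stands in for the sidecar's topology segment type; the point
// is only that the capacities map is keyed by pointer, not by value.
type Segment struct{ Zone string }

func main() {
	capacities := map[*Segment]string{}

	// Two concurrent sync() calls each allocate their own instance
	// for the same topology segment ...
	a := &Segment{Zone: "zone-1"}
	b := &Segment{Zone: "zone-1"}

	// ... so the map ends up with two entries for one topology.
	capacities[a] = "csisc-1"
	capacities[b] = "csisc-2"

	fmt.Println(len(capacities)) // prints 2
}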

Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:

Does this PR introduce a user-facing change?:

Fixed possibly duplicated CSIStorageCapacity objects and constantly failing update requests.

@k8s-ci-robot k8s-ci-robot added release-note Denotes a PR that will be considered when it comes time to generate release notes. kind/bug Categorizes issue or PR as related to a bug. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Oct 30, 2025
@k8s-ci-robot

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: huww98
Once this PR has been reviewed and has the lgtm label, please assign pohly for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Oct 30, 2025
@huww98 huww98 force-pushed the fix-duplicate-capacity branch from de81be0 to b80ae72 Compare October 30, 2025 14:34

huww98 commented Oct 30, 2025

/cc @pohly

@k8s-ci-robot k8s-ci-robot requested a review from pohly October 30, 2025 17:28

pohly commented Nov 24, 2025

When the controller starts, two sync() calls run simultaneously, one from HasSynced() and one from processNextWorkItem(). Each produces its own instance for the same topology segment and passes it to the callbacks.

New segments get produced in sync, right? So whenever two sync calls are executed in parallel, we have this problem. I agree that this is faulty. What isn't clear to me is the proposed solution.

Suppose there are two work queue items in the queue at a time when nt.hasSynced is still false. Both get processed in parallel. Don't we still have the problem?

Two solutions:

  • only run one worker
  • serialize the code which generates new segment pointers (not exactly sure though how long the mutex must be held for that; see the sketch below)
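A rough sketch of the second option, with hypothetical names (getOrCreateSegment, nt.segments) that do not exist in the sidecar; it only shows where the mutex would have to sit so that concurrent sync() calls cannot mint two pointers for one topology:

package topology

import "sync"

// Segment is a stand-in for the real topology segment type.
type Segment struct{ Label, Value string }

type nodeTopology struct {
	mutex    sync.Mutex
	segments map[string]*Segment // must be initialized by the constructor
}

// getOrCreateSegment is a hypothetical helper: all segment-pointer
// creation is funneled through one mutex, so a second concurrent caller
// gets the existing pointer back instead of allocating a duplicate.
func (nt *nodeTopology) getOrCreateSegment(key string, create func() *Segment) *Segment {
	nt.mutex.Lock()
	defer nt.mutex.Unlock()
	if seg, ok := nt.segments[key]; ok {
		return seg
	}
	seg := create()
	nt.segments[key] = seg
	return seg
}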


huww98 commented Nov 24, 2025

only run one worker

Yes, I think this controller is designed to only run one worker:

func (nt *nodeTopology) RunWorker(ctx context.Context) {
	klog.Info("Started node topology worker")
	defer klog.Info("Shutting node topology worker")
	for nt.processNextWorkItem(ctx) {
	}
}

It is not configurable, and we currently only start one worker goroutine.

if nt.upstreamSynced() {
	// Now that both informers are up-to-date,
	// trigger a sync to update the list of topology segments.
	nt.queue.Add("")

Nit: I think this is a common way to trigger a sync, but it is kind of difficult to debug with logging. Can the key be something more explicit, like "full-sync" or "reconcile-all"?

But if it is a common pattern among our sidecars, ignore this comment.

@huww98 huww98 force-pushed the fix-duplicate-capacity branch from b80ae72 to 118064d Compare November 25, 2025 06:18

pohly commented Nov 25, 2025

Yes, I think this controller is designed to only run one worker

RunWorker executes one worker, but could be invoked more than once. Where is it called?


huww98 commented Nov 25, 2025

Yes, I think this controller is designed to only run one worker

RunWorker executes one worker, but could be invoked more than once. Where is it called?

go topologyInformer.RunWorker(ctx)

Here, only once.


pohly commented Nov 25, 2025

It wasn't designed to be run only once; that's just how it is currently being used. But as that apparently is sufficient, the fix can be pretty simple:

  • document that RunWorker must only be called once
  • in RunWorker, block waiting for informer sync
  • populate one work queue item
  • run the for loop

Wouldn't that solve the problem without all of the complicated back-and-forth between the event handlers and the sync loop that this PR proposes?
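For concreteness, a rough sketch of those four steps, reusing the upstreamSynced helper quoted above; the wait.PollUntilContextCancel call and the one-second interval are assumptions for illustration, not the actual patch:

// RunWorker must only be called once: it owns the one-time initial sync.
func (nt *nodeTopology) RunWorker(ctx context.Context) {
	klog.Info("Started node topology worker")
	defer klog.Info("Shutting node topology worker")

	// Block until the upstream informers are up-to-date.
	if err := wait.PollUntilContextCancel(ctx, time.Second, true,
		func(context.Context) (bool, error) { return nt.upstreamSynced(), nil },
	); err != nil {
		return // context cancelled
	}

	// Exactly one work queue item triggers the initial full sync.
	nt.queue.Add("")

	for nt.processNextWorkItem(ctx) {
	}
}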


huww98 commented Nov 25, 2025

  • in RunWorker, block waiting for informer sync

Then we will lose the ability to sync partial data from the upstream controllers. Not sure if this is good. Given the current implementation, I think an incremental sync is not faster than a full sync, but this may delay the first topology being passed to the callbacks.

And together with this, I think we need to move go topologyInformer.RunWorker(ctx) to after we start the other informers, or we will poll the not-yet-started informers forever if we are not the leader.

Wouldn't that solve the problem without all of the complicated proposed back-and-forth between event handlers and sync loop?

We still need a hasSynced atomic.Bool to tell the outside that we have finished at least one loop (sketched below).
IMO, your proposal can be simpler, but not by much. I can give it a try.
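For reference, a minimal sketch of that flag, assuming it is set at the end of sync() (the placement is an assumption, not the final patch):

// New field on nodeTopology (import "sync/atomic"):
//     hasSynced atomic.Bool

// HasSynced reports whether at least one full sync() pass has completed.
// cache.WaitForCacheSync can poll this just like an informer's HasSynced.
func (nt *nodeTopology) HasSynced() bool {
	return nt.hasSynced.Load()
}

func (nt *nodeTopology) sync(ctx context.Context) {
	// ... existing logic: produce segments, invoke callbacks ...

	// Publish that one full pass has finished.
	nt.hasSynced.Store(true)
}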


pohly commented Nov 25, 2025

Then we will lose the ability to sync partial data from the upstream controllers. Not sure if this is good.

It's normal that controllers wait for a full cache sync before starting their work. It depends a bit on the controller whether it makes sense to start earlier.

We still need a hasSynced atomic.Bool to tell the outside that we have finished at least one loop.

Is someone checking that? I don't remember.


huww98 commented Nov 26, 2025

if !cache.WaitForCacheSync(ctx.Done(), c.topologyInformer.HasSynced, c.scInformer.Informer().HasSynced, c.cInformer.Informer().HasSynced) {

Checked here, via c.topologyInformer.HasSynced.

So it seems fine to only sync after the upstream has synced, because the controller is still waiting for the sync anyway.


huww98 commented Nov 26, 2025

@pohly Please take a look at #1450, which implements your proposed fix.
