fetch SAs from apiserver #242

modulitos · 2024-10-21T02:57:58Z

Issue #, if available:
#174

Description of changes:
Enhances the implementation introduced in #236 so that we can fetch missing service accounts from the apiserver. Retains the existing guarantees that we won't fetch multiple service accounts concurrently, minimizing load on the apiserver.

This feature is still in shadow mode (off by default).

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

kmala · 2024-11-18T05:01:43Z

pkg/cache/cache.go

 	}

+	go func() {
+		for req := range saFetchRequests {


This would mean we are making only one request at a time to apiserver right?

Wrapped this in a goroutine - thanks for the catch 💯

kmala · 2024-11-18T05:02:36Z

pkg/cache/cache.go

+	defer cancel()
+
+	klog.V(5).Infof("fetching SA: %s", req.CacheKey())
+	saList, err := getter.ServiceAccounts(req.Namespace).List(


why List? why can't we use Get?

I moved this to Get

modulitos · 2024-12-02T19:50:16Z

pkg/handler/handler.go

 	// Use the STS WebIdentity method if set
-	request := cache.Request{Namespace: pod.Namespace, Name: pod.Spec.ServiceAccountName, RequestNotification: true}
+	gracePeriodEnabled := m.saLookupGraceTime > 0
+	request := cache.Request{Namespace: pod.Namespace, Name: pod.Spec.ServiceAccountName, RequestNotification: gracePeriodEnabled}


I added this change to toggle RequestNotification only when the grace period is enabled.

Previously it was basically a no-op when the feature is disabled and RequestNotification is true, but now we're fetching from the API server when it's true. So we only want it set when the feature is enabled.

haoranleo · 2025-01-09T22:49:38Z

pkg/cache/notifications.go

+	if !found {
+		notifier = make(chan struct{})
+		n.handlers[req.CacheKey()] = notifier
+		n.fetchRequests <- &req


We control the APIServer request rate through the size of the channel but it has two downsides:

The APIServer request is actually not rate limited. There is no request handling rate limiter in the channel consumption and it is possible that there could be > 100 requests (for different namespace/name) sent in the same second given the channel consumer initiates a new go routine to submit a request to APIServer

If due to some reasons the channel consumer dies or the queue is full, the create function will hang until the channel has some capacity. It could unexpectedly delay pod creation for arbitrary time.

A better choice could be use a larger channel size to minimize the chance of channel write blocking, and implement a more robust channel consumer which limit the consumption rate. In case of extremely high volumes of requests queued in the channel and the API requests could not be sent in time, the result would be either be the cache is synced before grace period and pod is mutated, or cache is not synced and the pod is not mutated. But no prolonged delay to pod creation or excessive requests to the APIServer

haoranleo · 2025-01-10T18:51:24Z

Moved the changes to #252

modulitos requested a review from a team as a code owner October 21, 2024 02:57

modulitos force-pushed the notifications-enhancement branch 2 times, most recently from 5af4ac8 to 5020bec Compare October 21, 2024 04:00

modulitos force-pushed the notifications-enhancement branch 2 times, most recently from ed8a585 to 0178602 Compare November 14, 2024 22:26

kmala reviewed Nov 18, 2024

View reviewed changes

modulitos force-pushed the notifications-enhancement branch from a37069c to ccbb720 Compare November 18, 2024 19:45

modulitos added 3 commits November 20, 2024 15:29

fetch SAs from apiserver

450f651

cleanup metrics

c0b2e89

pr feedback updates

dba2b0d

modulitos force-pushed the notifications-enhancement branch from ccbb720 to 1b19f6f Compare November 26, 2024 01:31

modulitos added 2 commits November 26, 2024 14:12

add retries

ad9a20a

fix gracePeriodEnabled

9a8d1ad

modulitos force-pushed the notifications-enhancement branch from 2c19173 to 9a8d1ad Compare November 26, 2024 22:12

modulitos commented Dec 2, 2024

View reviewed changes

haoranleo reviewed Jan 9, 2025

View reviewed changes

haoranleo mentioned this pull request Jan 10, 2025

Fetch SA from apiserver #252

Merged

haoranleo closed this Jan 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fetch SAs from apiserver #242

fetch SAs from apiserver #242

Uh oh!

modulitos commented Oct 21, 2024 •

edited

Loading

Uh oh!

kmala Nov 18, 2024

Uh oh!

modulitos Nov 18, 2024

Uh oh!

kmala Nov 18, 2024

Uh oh!

modulitos Nov 18, 2024

Uh oh!

modulitos Dec 2, 2024

Uh oh!

haoranleo Jan 9, 2025

Uh oh!

haoranleo commented Jan 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fetch SAs from apiserver #242

fetch SAs from apiserver #242

Uh oh!

Conversation

modulitos commented Oct 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kmala Nov 18, 2024

Choose a reason for hiding this comment

Uh oh!

modulitos Nov 18, 2024

Choose a reason for hiding this comment

Uh oh!

kmala Nov 18, 2024

Choose a reason for hiding this comment

Uh oh!

modulitos Nov 18, 2024

Choose a reason for hiding this comment

Uh oh!

modulitos Dec 2, 2024

Choose a reason for hiding this comment

Uh oh!

haoranleo Jan 9, 2025

Choose a reason for hiding this comment

Uh oh!

haoranleo commented Jan 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

modulitos commented Oct 21, 2024 •

edited

Loading