Documenting APIPriorityAndFairness beta criteria #1632

yue9944882 · 2020-03-25T07:19:32Z

/sig api-machinery
/kind feature

This pull request is for discussing the criteria necessary for moving APIPriorityAndFairness toward beta.

deads2k · 2020-03-25T12:44:26Z

keps/sig-api-machinery/20190228-priority-and-fairness.md


+Beta:
+
+- Supports concurrency limiting upon long-running requests


I like the goal, but I also see a lot of value in having this on by default even without long running request handling.

@lavalamp would you hold beta on this?

I agree with David - it would be nice to have, but I don't think it should block beta. It would already be a huge win without it.

I won't block beta on this but I am looking for someone to add this. If possible I'd like it.

Watch requests (iff served by the cache) are relatively "cheap" in terms of apiserver performance, but there are certain cases where we need to police long-running requests to protect the apiserver. Added this to the non-blocking list.

deads2k · 2020-03-25T12:48:18Z

keps/sig-api-machinery/20190228-priority-and-fairness.md

 - Adequate documentation for the changes
 - Minimum viable test cases mentioned in Test Plan section

+Beta:


I'd like to see a discussion of:

1. documenting metrics for inspection (on their way to a v1 metrics API, but they don't have to be there).

2. recommended alerts for Prometheus, somewhere like here https://github.com/coreos/kube-prometheus/blob/master/manifests/prometheus-rules.yaml

3. eliminating rate limiting by some means. Something like #1629 (Standard API qps and burst in kubeconfig file) could do. (It's hard to prove otherwise.)

> documenting metrics for inspection (on their way to a v1 metrics API, but they don't have to be there).
> recommended alerts for Prometheus, somewhere like here https://github.com/coreos/kube-prometheus/blob/master/manifests/prometheus-rules.yaml

Is there a common repo for this Prometheus integration and the related docs? Cluster admins will definitely find these docs helpful if they want to understand the system better.

> eliminating rate limiting by some means. Something like #1629 could do. (It's hard to prove otherwise).

Added this to the blocking list. I need an approach to opt out of client-side rate limiting.

Regarding item 1, there is already some documentation at https://kubernetes.io/docs/concepts/cluster-administration/flow-control/#observability . Is something more being requested here?

wojtek-t · 2020-03-25T12:53:09Z

keps/sig-api-machinery/20190228-priority-and-fairness.md


+Beta:
+
+- Supports concurrency limiting upon long-running requests


I agree with David - it would be nice to have, but I don't think it should block beta. It would already be a huge win without it.

wojtek-t · 2020-03-25T12:54:14Z

keps/sig-api-machinery/20190228-priority-and-fairness.md

+- Allow constant concurrency shares in the priority-level API model
+- Automatically manages versions of mandatory/suggested configuration
+- Necessary e2e test
+


I would really like to have the LIST calls addressed though. Currently we treat "list all pods" as the same cost as "get single pod". I think this is much more important than addressing long-running requests.

+1, and I think that's relatively easy to do.

Treat a LIST as weight = # of records requested. (unpaginated list, treat as a request for 10k items or something sufficiently punitive.)

I'm not sure 10k is a good number; maybe we can introduce something a bit more dynamic here. We already expose a metric for the number of objects per type (it's populated by periodically sending a Count() request to etcd, IIRC once per minute). We can use those numbers to make the estimate reflect reality a bit better. The reason I don't fully like 10k is that it may be too small (by an order of magnitude) in large clusters. But these are implementation details and we should discuss them outside of this doc :)
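To make the two suggestions above concrete, here is a minimal sketch of a LIST cost rule. It is not the KEP's design: the function name `estimate_cost`, its parameters, and the `UNPAGINATED_FALLBACK` constant are hypothetical, and the per-type object count is assumed to come from somewhere like the existing object-count metric mentioned in this thread.

```python
from typing import Optional

# Hypothetical fallback for an unpaginated LIST when no object count is known.
# The thread suggests "10k items or something sufficiently punitive".
UNPAGINATED_FALLBACK = 10_000


def estimate_cost(verb: str,
                  limit: Optional[int] = None,
                  object_count: Optional[int] = None) -> int:
    """Return an illustrative cost (weight) for a single API request."""
    if verb != "list":
        # Single-object requests such as GET cost one unit.
        return 1
    if limit is not None and limit > 0:
        # Paginated LIST: weight = number of records requested.
        return limit
    if object_count is not None:
        # Unpaginated LIST: charge for every stored object of that type.
        return max(1, object_count)
    # Unpaginated LIST with no count available: use the punitive constant.
    return UNPAGINATED_FALLBACK
```

With these assumptions, `estimate_cost("list", limit=500)` charges 500 units, while an unpaginated list of a type with 150,000 stored objects charges 150,000 rather than the flat 10k.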

I think the current plan is to focus on visibility & robustness for beta, and add features afterwards.

We can add features sooner--if we can find people to do the work.

> Currently we treat "list all pods" as the same cost as "get single pod".

We've been discussing how to treat unpaginated LIST calls differently from the very start. We agreed it would be valuable for managing large clusters, but it didn't make it into alpha for simplicity. Because unpaginated requests occupy more service time in the apiserver, the current alpha version already applies a kind of linear penalty to unpaginated LIST calls. I think what @wojtek-t is proposing is a more significant, non-linear way to punish those improper LISTs (we could also punish uncached LISTs similarly).

(I just added this to the non-blocking list.)

I think there are two pieces of work here:

1. The mechanism for acting on the estimate of an API call's cost

2. The rules that produce the cost estimation

We should make the code separate enough that it's easy for different people to work on these different tasks. It's easy to have someone go off and spend a month optimizing 2 once 1 is in place.

Sure - (1) is much more important - we can work on tweaking (2) further on.
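The separation discussed above could look roughly like the following sketch. Everything here is hypothetical and for illustration only (the `CostLimiter` class, the `Request` shape, and both estimator functions); it is not the APF implementation. The point is that the mechanism only sees a number, so the rules can be swapped without touching it.

```python
from typing import Callable, Dict

Request = Dict[str, object]               # hypothetical shape, e.g. {"verb": "list"}
CostEstimator = Callable[[Request], int]  # the "rules" (piece 2)


class CostLimiter:
    """The "mechanism" (piece 1): admit requests while the summed cost fits."""

    def __init__(self, capacity: int, estimator: CostEstimator) -> None:
        self.capacity = capacity
        self.estimator = estimator  # rules are injected, not hard-coded
        self.in_flight = 0

    def try_admit(self, request: Request) -> int:
        """Return the cost charged if admitted, or 0 if the request must wait."""
        # Clamp to [1, capacity] so an oversized request can still run alone.
        cost = min(self.capacity, max(1, self.estimator(request)))
        if self.in_flight + cost > self.capacity:
            return 0
        self.in_flight += cost
        return cost

    def release(self, cost: int) -> None:
        """Give back the cost charged to a finished request."""
        self.in_flight = max(0, self.in_flight - cost)


def flat_cost(request: Request) -> int:
    """Rules, first version: every request costs the same."""
    return 1


def list_aware_cost(request: Request) -> int:
    """Rules, later version: a LIST is charged more than a single-object call."""
    return 100 if request.get("verb") == "list" else 1
```

Starting with `CostLimiter(capacity, flat_cost)` and later switching to `list_aware_cost` changes only the estimator argument, which is what lets different people work on the two pieces independently.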

lavalamp · 2020-04-07T16:08:48Z

keps/sig-api-machinery/20190228-priority-and-fairness.md

+
+- Blocking Items:
+ - Improving observability and robustness: finer metrics and adding debug endpoint dumping fine-grained states of the queues for priority-levels
+ - Opt-out client-side rate-limitting to prove the feature


Can you expand this line to match the verboseness of the previous line? People who didn't see the conversation here will likely not understand what this means.

expanded the line, PTAL

lavalamp · 2020-04-07T16:09:01Z

keps/sig-api-machinery/20190228-priority-and-fairness.md

+- Blocking Items:
+ - Improving observability and robustness: finer metrics and adding debug endpoint dumping fine-grained states of the queues for priority-levels
+ - Opt-out client-side rate-limitting to prove the feature
+ - Necessary e2e test


What specifically should the e2e test cover?

added two requirements for e2e tests

lavalamp · 2020-05-21T17:17:06Z

/lgtm
/approve

k8s-ci-robot · 2020-05-21T17:17:26Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: lavalamp, yue9944882

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~keps/sig-api-machinery/OWNERS~~ [lavalamp]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot requested review from MikeSpreitzer, deads2k and lavalamp March 25, 2020 07:19

yue9944882 changed the title ~~Documenting APIPriorityAndFairness beta critiria~~ Documenting APIPriorityAndFairness beta criteria Mar 25, 2020

yue9944882 force-pushed the apf-beta branch from 1d4254a to c07e669 Compare March 25, 2020 07:35

deads2k reviewed Mar 25, 2020

wojtek-t reviewed Mar 25, 2020

lavalamp reviewed Mar 25, 2020

yue9944882 force-pushed the apf-beta branch from c07e669 to 8e7d345 Compare April 7, 2020 06:35

k8s-ci-robot added the size/S label (denotes a PR that changes 10-29 lines, ignoring generated files) and removed the size/XS label (denotes a PR that changes 0-9 lines, ignoring generated files) Apr 7, 2020

yue9944882 force-pushed the apf-beta branch from 8e7d345 to 9c6ee99 Compare April 7, 2020 07:00

lavalamp reviewed Apr 7, 2020

yue9944882 mentioned this pull request May 18, 2020

Priority and Fairness for API Server Requests #1040

Closed

4 tasks

yue9944882 force-pushed the apf-beta branch from 9c6ee99 to c503b4c Compare May 21, 2020 14:22

k8s-ci-robot added the sig/architecture label (categorizes an issue or PR as relevant to SIG Architecture) May 21, 2020

APIPriorityAndFairness beta critiria

feb0133

yue9944882 force-pushed the apf-beta branch from c503b4c to feb0133 Compare May 21, 2020 14:28

k8s-ci-robot assigned lavalamp May 21, 2020

k8s-ci-robot added the lgtm label ("Looks good to me", indicates that a PR is ready to be merged) May 21, 2020

k8s-ci-robot added the approved label (indicates a PR has been approved by an approver from all required OWNERS files) May 21, 2020

k8s-ci-robot merged commit c2aaf7a into kubernetes:master May 21, 2020

k8s-ci-robot added this to the v1.19 milestone May 21, 2020

RomanBednar pushed a commit to RomanBednar/enhancements that referenced this pull request Jan 31, 2025

Merge pull request kubernetes#1632 from liouk/direct-external-oidc-pr…

060ebf1

…ovider authentication: direct external OIDC provider
