feat(translator): implement ratelimit costs #5035

mathetake · 2025-01-11T02:01:19Z

What type of PR is this?

The API implementation

What this PR does / why we need it:

This is the follow up on #4957 and implement the API.

Which issue(s) this PR fixes:
Fixes #4756

Release Notes: Yes

internal/gatewayapi/backendtrafficpolicy_test.go

codecov · 2025-01-11T02:11:30Z

Codecov Report

Attention: Patch coverage is 87.80488% with 10 lines in your changes missing coverage. Please review.

Project coverage is 66.85%. Comparing base (da987da) to head (758f5f5).
Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
internal/xds/translator/ratelimit.go	85.93%	7 Missing and 2 partials ⚠️
internal/xds/translator/httpfilters.go	0.00%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #5035      +/-   ##
==========================================
+ Coverage   66.81%   66.85%   +0.03%     
==========================================
  Files         211      211              
  Lines       32854    32928      +74     
==========================================
+ Hits        21952    22014      +62     
- Misses       9579     9587       +8     
- Partials     1323     1327       +4

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

zirain · 2025-01-11T02:12:17Z

I think we have e2e for costPerReq, able to add one for costPreResponse?

mathetake · 2025-01-11T02:14:45Z

can the e2e use the latest envoyproxy/ratelimit?

zirain · 2025-01-11T02:17:52Z

can the e2e use the latest envoyproxy/ratelimit?

IIRC, master branch always use the latest one.

mathetake · 2025-01-11T02:18:25Z

ok cool

arkodg · 2025-01-11T02:41:49Z

that was fast 😅

now that EG supports writing metadata via ext proc #5023 would be great if we can have an e2e where the ext proc inserts a fixed metadata that is used by the ratelimit API

mathetake · 2025-01-11T18:34:43Z

2025-01-11T18:29:53.4823172Z         < X-Ratelimit-Remaining: 20

2025-01-11T18:29:53.4856372Z         < X-Ratelimit-Remaining: 18

2025-01-11T18:29:53.4884601Z         < X-Ratelimit-Remaining: 16

2025-01-11T18:29:53.4919980Z         < X-Ratelimit-Remaining: 14

looks like applyOnStreamDone is working fine but the per-descriptor is not working (both request and response costing 1 regardless of config)

mathetake · 2025-01-11T18:36:54Z

from envoyproxy/envoy#37684:

added the per descriptor custom hits addend support (only for rate_limits in the RateLimitPerRoute for now).

maybe this is something to do with it

mathetake · 2025-01-11T18:55:16Z

ok @arkodg looks like @wbpcode introduced the newRateLimitPerRoute used in type_per_filter_config vs the legacy existing rate limit config on route virtual_host.

and only the new one currently supports per-descriptor hits_addend

I think we have to either add support per-descriptor hits_addend to the legacy route/virtual_host config or migrate EG to use the new RateLimitPerRoute, which i think is not suitable as it will break the existing EG cluster (requiring the newest Envoy version)

mathetake · 2025-01-11T18:56:04Z

i will take a look at the envoy code monday to see how hard to support them in the legacy config

mathetake · 2025-01-11T21:51:00Z

I think it only requires to do some small refactoring...

mathetake · 2025-01-12T14:51:07Z

envoyproxy/envoy#37972 this should fix the test

mathetake · 2025-01-12T18:10:08Z

the required change in envoy seems a bit trickier than i thought... (the change itself is small but the build constraints by Envoy mobile makes it impossible as-is which in turn requires quite a refactoring around formatter...)

arkodg · 2025-01-13T00:10:17Z

@mathetake can we copy the user defined metadata into the ratelimit hits addend metadata field using a filter instead, in a per route filter way ?

mathetake · 2025-01-13T00:14:13Z

not sure what you meant but one workaround here is to use typed_per_filter_config RateLimitConfig instead of route-embedded-legacy rate limit only when costs are configured (which allows us to assume its latest Envoy) and i think that should work. Having said that though, I found the workaround in the envoyside and waiting for @wbpcode to review

mathetake · 2025-01-13T02:56:44Z

ok tomorrow i will work on the workaround mentioned ^^

mathetake · 2025-01-13T18:37:26Z

ok passing now!!!!!

mathetake · 2025-01-13T18:41:11Z

test/e2e/tests/ratelimit.go

@@ -662,6 +669,68 @@ var RateLimitHeadersAndCIDRMatchTest = suite.ConformanceTest{
 	},
 }

+var UsageRateLimitTest = suite.ConformanceTest{


this is almost identical with the usecase with ai-gateway - except that the extproc sets the dynamic metadata on responseHeaders hook vs ai-gateway in responseBody hook. Either should work from Envoy pov as the response cost is applied when stream is closing

mathetake · 2025-01-13T19:18:53Z

come on CI queue

mathetake · 2025-01-13T20:46:52Z

ping @arkodg @zirain - passing now

internal/xds/translator/ratelimit.go

arkodg · 2025-01-13T23:14:40Z

internal/xds/translator/ratelimit.go

+// patchRouteWithRateLimitOnTypedFilterConfig builds rate limit actions and appends to the route via
+// the TypedPerFilterConfig field. This only happens when the response cost is specified which allows us to assume that
+// users are using Envoy >= v1.33.0.
+func patchRouteWithRateLimitOnTypedFilterConfig(route *routev3.Route, rateLimits []*routev3.RateLimit) error { //nolint:unparam


cc @zhaohuabing this introduces another design pattern to do per route filter config

yeah i was too lazy to do the right abstraction 😉

so anyways when we can set the floor Envoy version to v1.33, then we should be able to migrate to this typed_per_filter_config global rate limit unconditionally since that's the latest way (having support for per-descriptor-hits-addend) vs the current route-embedded config is legacy one

Yeah, typed_per_filter_config is the way to go - it aligns with the approach used by all other filters for per-route configurations.

We can address htis in a seperate PR later.

internal/xds/translator/ratelimit.go

arkodg · 2025-01-14T01:43:03Z

@arkodg like this ? 8a016ba

hey @mathetake you'll also need to run make testdata to generate the IR

mathetake · 2025-01-14T01:52:34Z

hmmm doesn't make any diff

mathetake · 2025-01-14T03:31:11Z

@arkodg passing all tests modulo unrelated flake so can we merge?

zhaohuabing · 2025-01-14T08:50:10Z

Look good. Fixed the conflicts then we can merge it.

mathetake · 2025-01-14T17:05:46Z

/retest

...rnal/gatewayapi/testdata/backendtrafficpolicy-with-ratelimit-invalid-distinct-invert.in.yaml

arkodg

LGTM thanks !

mathetake · 2025-01-14T18:15:35Z

come on

mathetake · 2025-01-14T18:15:38Z

/retest

mathetake · 2025-01-14T19:02:30Z

man the tests are too flaky 🤷 nothing to do with this PR

Signed-off-by: Takeshi Yoneda <[email protected]>

mathetake requested a review from a team as a code owner January 11, 2025 02:01

zirain reviewed Jan 11, 2025

View reviewed changes

internal/gatewayapi/backendtrafficpolicy_test.go Outdated Show resolved Hide resolved

mathetake requested a review from zirain January 11, 2025 02:08

mathetake force-pushed the translation branch 2 times, most recently from 543c44e to ad5bf27 Compare January 11, 2025 17:58

mathetake commented Jan 13, 2025

View reviewed changes

arkodg requested review from zhaohuabing and shawnh2 January 13, 2025 23:10

arkodg reviewed Jan 13, 2025

View reviewed changes

internal/xds/translator/ratelimit.go Outdated Show resolved Hide resolved

arkodg reviewed Jan 13, 2025

View reviewed changes

internal/xds/translator/ratelimit.go Show resolved Hide resolved

mathetake force-pushed the translation branch from 991feed to e1bf3da Compare January 14, 2025 16:06

arkodg reviewed Jan 14, 2025

View reviewed changes

...rnal/gatewayapi/testdata/backendtrafficpolicy-with-ratelimit-invalid-distinct-invert.in.yaml Outdated Show resolved Hide resolved

arkodg approved these changes Jan 14, 2025

View reviewed changes

zhaohuabing approved these changes Jan 15, 2025

View reviewed changes

mathetake added 12 commits January 15, 2025 08:53

feat(translator): implement ratelimit costs

d0735a2

Signed-off-by: Takeshi Yoneda <[email protected]>

fix

c601450

Signed-off-by: Takeshi Yoneda <[email protected]>

fix

32ec24e

Signed-off-by: Takeshi Yoneda <[email protected]>

fix

9d726f1

Signed-off-by: Takeshi Yoneda <[email protected]>

more

a978302

Signed-off-by: Takeshi Yoneda <[email protected]>

works now

8f7a56e

Signed-off-by: Takeshi Yoneda <[email protected]>

lint

59a4f4c

Signed-off-by: Takeshi Yoneda <[email protected]>

fixes comments

a0bb337

Signed-off-by: Takeshi Yoneda <[email protected]>

fixes comments

aba707f

Signed-off-by: Takeshi Yoneda <[email protected]>

adds the requested test

372cb12

Signed-off-by: Takeshi Yoneda <[email protected]>

more

eb7a8d2

Signed-off-by: Takeshi Yoneda <[email protected]>

gen

758f5f5

Signed-off-by: Takeshi Yoneda <[email protected]>

zirain force-pushed the translation branch from e4e2ff9 to 758f5f5 Compare January 15, 2025 00:53

arkodg merged commit 3e35b12 into envoyproxy:main Jan 15, 2025
17 checks passed

zhaohuabing mentioned this pull request Jan 15, 2025

chore: move ratelimit per-route config to typedPerFilterConfig #5072

Closed

mathetake deleted the translation branch January 15, 2025 21:19

zhaohuabing mentioned this pull request Jan 16, 2025

Refactor: move ratelimit per-route config to typedPerFilterConfig #5078

Open

mathetake mentioned this pull request Jan 16, 2025

api: RequestCost configurations envoyproxy/ai-gateway#103

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(translator): implement ratelimit costs #5035

feat(translator): implement ratelimit costs #5035

mathetake commented Jan 11, 2025

codecov bot commented Jan 11, 2025 •

edited

Loading

zirain commented Jan 11, 2025

mathetake commented Jan 11, 2025

zirain commented Jan 11, 2025

mathetake commented Jan 11, 2025

arkodg commented Jan 11, 2025

mathetake commented Jan 11, 2025 •

edited

Loading

mathetake commented Jan 11, 2025

mathetake commented Jan 11, 2025 •

edited

Loading

mathetake commented Jan 11, 2025

mathetake commented Jan 11, 2025

mathetake commented Jan 12, 2025

mathetake commented Jan 12, 2025

arkodg commented Jan 13, 2025 •

edited

Loading

mathetake commented Jan 13, 2025 •

edited

Loading

mathetake commented Jan 13, 2025

mathetake commented Jan 13, 2025

mathetake Jan 13, 2025

mathetake commented Jan 13, 2025 •

edited

Loading

mathetake commented Jan 13, 2025

arkodg Jan 13, 2025

mathetake Jan 14, 2025

mathetake Jan 14, 2025

zhaohuabing Jan 14, 2025 •

edited

Loading

arkodg commented Jan 14, 2025

mathetake commented Jan 14, 2025

mathetake commented Jan 14, 2025 •

edited

Loading

zhaohuabing commented Jan 14, 2025

mathetake commented Jan 14, 2025

arkodg left a comment

mathetake commented Jan 14, 2025

mathetake commented Jan 14, 2025

mathetake commented Jan 14, 2025

feat(translator): implement ratelimit costs #5035

feat(translator): implement ratelimit costs #5035

Conversation

mathetake commented Jan 11, 2025

codecov bot commented Jan 11, 2025 • edited Loading

Codecov Report

zirain commented Jan 11, 2025

mathetake commented Jan 11, 2025

zirain commented Jan 11, 2025

mathetake commented Jan 11, 2025

arkodg commented Jan 11, 2025

mathetake commented Jan 11, 2025 • edited Loading

mathetake commented Jan 11, 2025

mathetake commented Jan 11, 2025 • edited Loading

mathetake commented Jan 11, 2025

mathetake commented Jan 11, 2025

mathetake commented Jan 12, 2025

mathetake commented Jan 12, 2025

arkodg commented Jan 13, 2025 • edited Loading

mathetake commented Jan 13, 2025 • edited Loading

mathetake commented Jan 13, 2025

mathetake commented Jan 13, 2025

mathetake Jan 13, 2025

Choose a reason for hiding this comment

mathetake commented Jan 13, 2025 • edited Loading

mathetake commented Jan 13, 2025

arkodg Jan 13, 2025

Choose a reason for hiding this comment

mathetake Jan 14, 2025

Choose a reason for hiding this comment

mathetake Jan 14, 2025

Choose a reason for hiding this comment

zhaohuabing Jan 14, 2025 • edited Loading

Choose a reason for hiding this comment

arkodg commented Jan 14, 2025

mathetake commented Jan 14, 2025

mathetake commented Jan 14, 2025 • edited Loading

zhaohuabing commented Jan 14, 2025

mathetake commented Jan 14, 2025

arkodg left a comment

Choose a reason for hiding this comment

mathetake commented Jan 14, 2025

mathetake commented Jan 14, 2025

mathetake commented Jan 14, 2025

codecov bot commented Jan 11, 2025 •

edited

Loading

mathetake commented Jan 11, 2025 •

edited

Loading

mathetake commented Jan 11, 2025 •

edited

Loading

arkodg commented Jan 13, 2025 •

edited

Loading

mathetake commented Jan 13, 2025 •

edited

Loading

mathetake commented Jan 13, 2025 •

edited

Loading

zhaohuabing Jan 14, 2025 •

edited

Loading

mathetake commented Jan 14, 2025 •

edited

Loading