GitHub API rate limit #127

achimnol · 2025-01-20T14:52:16Z

After #116, I see now it performs retries while downloading PBS and other stuffs.
Unfortunately, there is another problem: we now hit the GitHub's API rate limit. 🫠

Would there be any better way to avoid it?

For instance,

Could I set GITHUB_TOKEN to authenticate the fetch requests?
How could I configure GitHub Actions cache to preserve the already fetched artifacts in previous workflow runs?

Example workflow run: https://github.com/lablup/backend.ai/actions/runs/12868611532

The text was updated successfully, but these errors were encountered:

jsirois · 2025-01-20T16:06:14Z

For auth, you can do like so:

lift/.github/workflows/ci.yml

Line 62 in 2bd9199

SCIENCE_AUTH_API_GITHUB_COM_BEARER: ${{ secrets.GITHUB_TOKEN }}

Also see: https://github.com/pex-tool/pex/blob/027781802e73d68fb26a75f6e2b7d8b65e3b160b/.github/workflows/ci.yml#L10-L12

The whole auth scheme is defined here:

lift/science/fetcher.py

Lines 78 to 127 in 2bd9199

    
           def _configure_auth(url: Url) -> httpx.Auth | tuple[str, str] | None: 
        
               if not url.info.hostname: 
        
                   return None 
        
               normalized_hostname = url.info.hostname.upper().replace(".", "_").replace("-", "_") 
        
               env_auth_prefix = f"SCIENCE_AUTH_{normalized_hostname}" 
        
               env_auth = {key: value for key, value in os.environ.items() if key.startswith(env_auth_prefix)} 
        
               def check_ambiguous_auth(auth_type: str) -> None: 
        
                   if env_auth: 
        
                       raise AmbiguousAuthError( 
        
                           f"{auth_type.capitalize()} auth was configured for {url} via env var but so was: " 
        
                           f"{", ".join(env_auth)}" 
        
                       ) 
        
               def get_username(auth_type: str) -> str | None: 
        
                   return env_auth.pop(f"{env_auth_prefix}_{auth_type.upper()}_USER", None) 
        
               def require_password(auth_type: str) -> str: 
        
                   env_var = f"{env_auth_prefix}_{auth_type.upper()}_PASS" 
        
                   passwd = env_auth.pop(env_var, None) 
        
                   if not passwd: 
        
                       raise InvalidAuthError( 
        
                           f"{auth_type.capitalize()} auth requires a password be configured via the " 
        
                           f"{env_var} env var." 
        
                       ) 
        
                   return passwd 
        
               if bearer := env_auth.pop(f"{env_auth_prefix}_BEARER", None): 
        
                   check_ambiguous_auth("bearer") 
        
                   return "Authorization", f"Bearer {bearer}" 
        
               if username := get_username("basic"): 
        
                   password = require_password("basic") 
        
                   check_ambiguous_auth("basic") 
        
                   return httpx.BasicAuth(username=username, password=password) 
        
               if username := get_username("digest"): 
        
                   password = require_password("digest") 
        
                   check_ambiguous_auth("digest") 
        
                   return httpx.DigestAuth(username=username, password=password) 
        
               try: 
        
                   return httpx.NetRCAuth(None) 
        
               except (FileNotFoundError, IsADirectoryError): 
        
                   pass 
        
               except NetrcParseError as e: 
        
                   logger.warning(f"Not using netrc for auth, netrc file is invalid: {e}") 
        
               return None

So you could similarly define basic auth or use ~/.netrc.

For caching, I won't teach GitHub actions, but science allows you to control the cache with --cache-dir: https://science.scie.app/cli.html#science

So you can use SCIENCE_CACHE_DIR.

jsirois · 2025-01-20T16:14:06Z

@achimnol I've marked this as an answered question, but please confirm this answer works for you.

achimnol · 2025-01-21T01:43:34Z

Thanks for the detailed answer.
I'll check out with my team and leave the result.

jsirois · 2025-01-30T14:31:27Z

@achimnol do you have any results to report?

achimnol · 2025-02-01T07:25:15Z

@achimnol do you have any results to report?

We made a temporary CI workflow to reproduce the issue before applying the GitHub API token, but we failed to reproduce it when we triggered the workflow manually several times afterwards. 😞

I'll update you once it happens again.

Yaminyam · 2025-02-19T08:00:14Z

@jsirois https://github.com/lablup/backend.ai/actions/runs/13407796118/job/37451008145
https://github.com/lablup/backend.ai/blob/3c920779cc6b5cdc008731d56da7f64e133f6935/.github/workflows/ci.yml#L342
I added the token to the scies build task as you instructed, but I'm having the same issue.

jsirois · 2025-02-19T16:31:51Z

@Yaminyam this is a problem - there should be at least 2 hits and there is just 1:

:; git grep SCIENCE_AUTH_API_GITHUB_COM_BEARER
.github/workflows/ci.yml:      SCIENCE_AUTH_API_GITHUB_COM_BEARER: ${{ secrets.GITHUB_TOKEN }}

1st rule of Pants club: Pants blocks env vars by default
2nd rule of Pants club: Pants is generally sneaky and does things behind your back. All is usually not as it seems.

So you need to leak the SCIENCE_AUTH_API_GITHUB_COM_BEARER env var through Pants to underlying processes somewhere in your config. I think you still use https://www.pantsbuild.org/stable/reference/subsystems/subprocess-environment for Python / Pex subprocesses to do this, but I have not hacked on or used Pants for several years now; so you'll want to make sure.

jsirois · 2025-03-06T16:40:12Z

@Yaminyam have you been able to configure Pants to let the SCIENCE_AUTH_API_GITHUB_COM_BEARER env var leak through to Pex processes?

cairijun · 2025-03-18T11:29:29Z

lift/science/fetcher.py

Line 108 in e4bf708

return "Authorization", f"Bearer {bearer}"

https://github.com/encode/httpx/blob/9e8ab40369bd3ec2cc8bff37ab79bf5769c8b00f/httpx/_client.py#L448

I think passing a tuple as auth means Basic Authentication in httpx, not Bearer.

jsirois · 2025-03-18T16:13:27Z

@cairijun indeed. That tuple should be routed to the request headers but it is not. Thanks for spotting this bug! I'll get out a release today with a fix.

Keep in mind, my observation above is still pertinent. Even with the fix, things should still fail due to rate limits unless the env var is punched through past Pants hermetic sandbox walls.

Fixes a-scie#127

jsirois · 2025-03-18T19:05:39Z

@cairijun thank you very much for looking at the code. That was an embarrasing bug. #148 is out for review and I should have a release in the next several hours. I'll in turn produce a Pex release that bumps its minimum science requirement to that release. I'll not when both releases are complete here and then close the issue.

Fixes #127

jsirois · 2025-03-18T20:13:46Z

OK, science 0.12.2 is released with the fix: https://github.com/a-scie/lift/releases/tag/v0.12.2

C.F.: a-scie/lift#127

jsirois · 2025-03-18T22:29:50Z

And now Pex 2.33.5 is released with a bump to science>=0.12.2: https://github.com/pex-tool/pex/releases/tag/v2.33.5

@achimnol, @Yaminyam and @cairijun please speak up if you are not able to upgrade to Pex 2.33.5 or newer and fix your rate limit issue after ensuring Pants is not blocking the SCIENCE_AUTH_API_GITHUB_COM_BEARER env var (it might be easiest to inspect a Pex scie build sandbox __run.sh script to confirm locally).

jsirois self-assigned this Jan 20, 2025

jsirois added question Further information is requested answered Indicates an answered question. labels Jan 20, 2025

jsirois added bug Something isn't working in progress Indicates the assignee is actively working on the item. labels Mar 18, 2025

jsirois added a commit to jsirois/lift that referenced this issue Mar 18, 2025

Fix plumbing of env var based Bearer authentication.

2b9cce6

Fixes a-scie#127

jsirois mentioned this issue Mar 18, 2025

Fix plumbing of env var based Bearer authentication. #148

Merged

jsirois closed this as completed in #148 Mar 18, 2025

jsirois added a commit that referenced this issue Mar 18, 2025

Fix plumbing of env var based Bearer authentication. (#148)

2c5d97e

Fixes #127

jsirois added a commit to jsirois/pex that referenced this issue Mar 18, 2025

Upgrade to science 0.12.2 to fix PBS rate limits.

b49c6fe

C.F.: a-scie/lift#127

jsirois mentioned this issue Mar 18, 2025

Upgrade to science 0.12.2 to fix PBS rate limits. pex-tool/pex#2720

Merged

jsirois added a commit to pex-tool/pex that referenced this issue Mar 18, 2025

Upgrade to science 0.12.2 to fix PBS rate limits. (#2720)

53b8827

C.F.: a-scie/lift#127

jsirois removed the in progress Indicates the assignee is actively working on the item. label Mar 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GitHub API rate limit #127

GitHub API rate limit #127

achimnol commented Jan 20, 2025 •

edited

Loading

jsirois commented Jan 20, 2025 •

edited

Loading

jsirois commented Jan 20, 2025

achimnol commented Jan 21, 2025

jsirois commented Jan 30, 2025

achimnol commented Feb 1, 2025

Yaminyam commented Feb 19, 2025

jsirois commented Feb 19, 2025 •

edited

Loading

jsirois commented Mar 6, 2025

cairijun commented Mar 18, 2025

jsirois commented Mar 18, 2025

jsirois commented Mar 18, 2025

jsirois commented Mar 18, 2025

jsirois commented Mar 18, 2025

GitHub API rate limit #127

GitHub API rate limit #127

Comments

achimnol commented Jan 20, 2025 • edited Loading

jsirois commented Jan 20, 2025 • edited Loading

jsirois commented Jan 20, 2025

achimnol commented Jan 21, 2025

jsirois commented Jan 30, 2025

achimnol commented Feb 1, 2025

Yaminyam commented Feb 19, 2025

jsirois commented Feb 19, 2025 • edited Loading

jsirois commented Mar 6, 2025

cairijun commented Mar 18, 2025

jsirois commented Mar 18, 2025

jsirois commented Mar 18, 2025

jsirois commented Mar 18, 2025

jsirois commented Mar 18, 2025

achimnol commented Jan 20, 2025 •

edited

Loading

jsirois commented Jan 20, 2025 •

edited

Loading

jsirois commented Feb 19, 2025 •

edited

Loading