Skip to content

fix(bloomberg): read Businessweek from section page; add green/crypto/pursuits feeds#1892

Open
gucasbrg wants to merge 1 commit into
jackwener:mainfrom
gucasbrg:fix/bloomberg-businessweek-feeds
Open

fix(bloomberg): read Businessweek from section page; add green/crypto/pursuits feeds#1892
gucasbrg wants to merge 1 commit into
jackwener:mainfrom
gucasbrg:fix/bloomberg-businessweek-feeds

Conversation

@gucasbrg

@gucasbrg gucasbrg commented Jun 8, 2026

Copy link
Copy Markdown
Contributor

Bloomberg has emptied the Businessweek RSS feed — feeds.bloomberg.com/businessweek/news.rss
now returns a maintained but item-less channel (HTTP 200, today's lastBuildDate, zero ),
so bloomberg businessweek always fails with NOT_FOUND. The Businessweek section page keeps
publishing, and like bloomberg news it ships its data as Next.js NEXT_DATA, so this reads
the section page in the browser and pulls stories from
props.pageProps.initialState.modulesById[*].items[] (same title/summary/link/mediaLinks columns).

Also in this change:

  • Add green / crypto / pursuits RSS feeds — Bloomberg publishes these and they return items
    (the existing markets/economics/tech/etc. feeds are unchanged).
  • fetchBloombergFeed: retry a transient empty / non-OK response a couple of times before
    surfacing NOT_FOUND. Some feeds (e.g. industries) intermittently return an empty body under
    load; a genuinely empty feed still ends in NOT_FOUND after the retries.

Verified: bloomberg businessweek extracts 55 stories with clean title/summary/link/media;
green/crypto/pursuits return live items; tsc --noEmit and the bloomberg unit tests pass.

…/pursuits feeds

Bloomberg has emptied the Businessweek RSS feed — feeds.bloomberg.com/businessweek/news.rss
now returns a maintained but item-less channel (HTTP 200, today's lastBuildDate, zero <item>),
so `bloomberg businessweek` always fails with NOT_FOUND. The Businessweek section page keeps
publishing, and like `bloomberg news` it ships its data as Next.js __NEXT_DATA__, so this reads
the section page in the browser and pulls stories from
props.pageProps.initialState.modulesById[*].items[] (same title/summary/link/mediaLinks columns).

Also in this change:
- Add green / crypto / pursuits RSS feeds — Bloomberg publishes these and they return items
  (the existing markets/economics/tech/etc. feeds are unchanged).
- fetchBloombergFeed: retry a transient empty / non-OK response a couple of times before
  surfacing NOT_FOUND. Some feeds (e.g. industries) intermittently return an empty body under
  load; a genuinely empty feed still ends in NOT_FOUND after the retries.

Verified: `bloomberg businessweek` extracts 55 stories with clean title/summary/link/media;
green/crypto/pursuits return live items; `tsc --noEmit` and the bloomberg unit tests pass.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant