Scrape any URL with rotating proxies, headless Chrome, and AI extraction. Returns HTML, Markdown, or structured AI-extracted data.
The first web scraping API available as a GitHub Marketplace action.
- name: Scrape a webpage
uses: scrapingant/scrape-action@v1
with:
api-key: ${{ secrets.SCRAPINGANT_API_KEY }}
url: 'https://example.com'Get the full rendered HTML of any page.
- name: Scrape HTML
id: scrape
uses: scrapingant/scrape-action@v1
with:
api-key: ${{ secrets.SCRAPINGANT_API_KEY }}
url: 'https://example.com'
output-type: 'html'
browser: 'false'
- name: Use result
run: echo "${{ steps.scrape.outputs.content }}" | head -20Convert any page to clean Markdown, ready to feed into an LLM.
- name: Get page as Markdown
id: markdown
uses: scrapingant/scrape-action@v1
with:
api-key: ${{ secrets.SCRAPINGANT_API_KEY }}
url: 'https://example.com'
output-type: 'markdown'
- name: Feed to AI
run: echo "${{ steps.markdown.outputs.content }}"Extract structured JSON data from any page using AI.
- name: Extract product data
id: extract
uses: scrapingant/scrape-action@v1
with:
api-key: ${{ secrets.SCRAPINGANT_API_KEY }}
url: 'https://www.amazon.com/dp/B0EXAMPLE'
output-type: 'extract'
extract-properties: 'product title, price, rating, availability'
- name: Use extracted JSON
run: echo "${{ steps.extract.outputs.content }}"- name: Scrape and save
uses: scrapingant/scrape-action@v1
with:
api-key: ${{ secrets.SCRAPINGANT_API_KEY }}
url: 'https://example.com'
output-type: 'markdown'
output-file: 'scraped-content.md'
- name: Commit scraped data
run: |
git add scraped-content.md
git commit -m "Update scraped data"
git push- Competitor price monitoring -- cron schedule scrapes product pages daily, saves to repo
- Content change detection -- scrape page, compare with previous version, alert on diff
- AI-powered data pipeline -- scrape, extract structured data, feed to LLM in next step
- SEO monitoring -- scrape your pages as Markdown, check for content issues
- Uptime/render check -- verify your deployed site renders correctly after deploy
| Input | Description | Required | Default |
|---|---|---|---|
api-key |
ScrapingAnt API key | Yes | |
url |
URL to scrape | Yes | |
output-type |
Response type: html, markdown, or extract |
No | html |
extract-properties |
Comma-separated fields to extract (for extract mode) |
No | |
browser |
Enable headless Chrome JS rendering | No | true |
proxy-type |
Proxy type: datacenter or residential |
No | datacenter |
proxy-country |
Two-letter country code for geo-targeting (e.g. us, uk, de) |
No | |
timeout |
Max request time in seconds (5-60) | No | 60 |
output-file |
File path to save output (e.g. output.html, data.json) |
No |
| Output | Description |
|---|---|
content |
Scraped content (HTML, Markdown, or JSON depending on output-type) |
status-code |
HTTP status code from ScrapingAnt API |
url |
Final URL after redirects (for markdown output-type) |
browser: true(default) costs 10 credits per requestbrowser: falsecosts 1 credit per request- Only successful responses are charged
Get a free API key with 10,000 credits/month at app.scrapingant.com/signup.
MIT