Releases: q-m/scrapy-webarchive
Releases · q-m/scrapy-webarchive
0.5.2
0.5.1
- Fix mismatch between crawling from local storage vs. S3
Full Changelog: 0.5.0...0.5.1
0.5.0
0.4.1
What's Changed
- Fix for getting spider name in different scrapy versions
Full Changelog: 0.4.0...0.4.1
0.4.0
What's Changed
- Update WARC version by @leewesleyv in #28
- Crawl source information per item/page by @leewesleyv in #31
- Add settings to run a spider against previously generated archives by @leewesleyv in #32
- Seperate WARC and CDXJ module by @leewesleyv in #33
Full Changelog: 0.3.0...0.4.0
0.3.0
- Change
_check_configuration_prerequisiteslogic inWaczExporter
Full Changelog: 0.2.0...0.3.0
v0.2.0 - Hotfixes
Initial Release
- Save web crawls in WACZ format (multiple storages supported; local and cloud).
- Crawl against WACZ format archives.