Releases: apify/crawlee-python
Releases · apify/crawlee-python
0.0.6
Adds
- BREAKING:
BasicCrawler.export_data
helper method which replacesBasicCrawler.export_to
Configuration.get_global_configuration
method- Automatic logging setup
- Context helper for logging (
context.log
)
Fixes
- Handling of relative URLs in
add_requests
- Graceful exit in
BasicCrawler.run
0.0.5
Adds
- Add explicit error messages for missing package extras during import
- Better browser abstraction:
BrowserController
- Wraps a single browser instance and maintains its state.BrowserPlugin
- Manages the browser automation framework, and basically acts as a factory for controllers.
- Browser rotation with a maximum number of pages opened per browser.
- Add emit persist state event to event manager
- Add batched request addition in
RequestQueue
- Add start requests option to
BasicCrawler
- Add storage-related helpers
get_data
,push_data
andexport_to
toBasicCrawler
andBasicContext
- Add
PlaywrightCrawler
's enqueue links helper
Fixes
- Fix type error in persist state of statistics
0.0.4
0.0.3
Another internal release, adding mainly session management and BeautifulSoupCrawler
.
Adds
HttpxClient
SessionPool
BeautifulSoupCrawler
BaseStorageClient
Storages
andMemoryStorageClient
were refactored
Was added in 0.0.2
EventManager
&LocalEventManager
Snapshotter
AutoscaledPool
MemoryStorageClient
Storages
BasicCrawler
&HttpCrawler