You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Every web harvester container must have a heritrix container. This is currently done by simple linking. However, this probably won't work well with docker-compose scale, as the 1:1 pairing won't occur.
Possible approaches to fixing:
Move Heritrix and web container into same container.
- big heritrix crawls -- and, perhaps, not deduping, or both -- are a bottleneck. there are surely tweets with attached media and linked content in my timeline collections that will be gone by the time heritrix catches up. adding a big list of accounts up front always takes a long while, so later incremental follow-ups take a while to process. this might be a result of running user timelines every hour.. will have to play with that. but a second heritrix process for recent/small batches might help.
Every web harvester container must have a heritrix container. This is currently done by simple linking. However, this probably won't work well with
docker-compose scale
, as the 1:1 pairing won't occur.Possible approaches to fixing:
The text was updated successfully, but these errors were encountered: