A tool to crawl a site and log any resources that return a 404. Results are presented with a searchable todo-style checklist.
- Install Node
- Clone repo

      git clone [email protected]:hudakdidit/site_crawler.git

- Install dependencies

      npm install

- Set up the config file: run `mv config-example.json config.json`, then update the `site` and `port` properties as necessary.
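
Before starting anything, it can help to sanity-check the edited config. The following is a minimal sketch, not part of this repo: `validateConfig` is a hypothetical helper, and it assumes `site` is a non-empty string and `port` is a number. The real `config-example.json` may use different types or contain more properties.

```javascript
// Hypothetical helper: checks only the two properties the setup step mentions.
// Assumption: `site` is a non-empty string and `port` is an integer.
function validateConfig(config) {
  const errors = [];
  if (typeof config.site !== 'string' || config.site.length === 0) {
    errors.push('site must be a non-empty string');
  }
  if (!Number.isInteger(config.port) || config.port < 1 || config.port > 65535) {
    errors.push('port must be an integer between 1 and 65535');
  }
  return errors;
}

// Example values only; in practice pass the parsed contents of config.json,
// e.g. JSON.parse(require('fs').readFileSync('config.json', 'utf8')).
console.log(validateConfig({ site: 'http://example.com', port: 3000 }));
console.log(validateConfig({ site: '', port: '3000' }));
```

An empty array means both properties passed the checks. Any strings in the array describe what to fix.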
Start webpack and the express web server:

    npm start

Start webpack, the express web server, and the web crawler:

    npm run dev-crawl

Start the express web server:

    npm run server

Start the crawler script:

    npm run crawl
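
For orientation, the kind of check a crawler like this performs can be sketched as below. This is an illustration only, not the crawler script's actual source: `findMissing` and `defaultGetStatus` are hypothetical names, and the default status lookup assumes Node 18 or later, where `fetch` is available globally.

```javascript
// Hypothetical sketch of a 404 check; not taken from this repo.
// Assumption: Node 18+ (global fetch). HEAD avoids downloading the body.
async function defaultGetStatus(url) {
  const res = await fetch(url, { method: 'HEAD' });
  return res.status;
}

// Returns the subset of `urls` whose status is 404.
// `getStatus` is injectable so the logic can run without network access.
async function findMissing(urls, getStatus = defaultGetStatus) {
  const missing = [];
  for (const url of urls) {
    const status = await getStatus(url);
    if (status === 404) missing.push(url);
  }
  return missing;
}

// Usage with a stubbed lookup; the URLs are placeholders.
const stubStatus = async (url) => (url.endsWith('/gone.png') ? 404 : 200);
findMissing(['http://example.com/', 'http://example.com/gone.png'], stubStatus)
  .then((missing) => console.log(missing));
```

The stub stands in for real requests, so the example logs only the URL that reported a 404. Dropping the second argument makes the function issue real HEAD requests instead.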