WebScraper uses the Integrity v8 engine to quickly scan a website, and can output extracted data (currently) as CSV or JSON. Plus download images to a folder.
Easy to scan a site – just enter the starting URL and press “Go”
Easy to export – choose the columns you want
Plenty of extraction options, including HTML elements with certain classes or IDs, regular expressions, or entire content in a number of
formats (html, plain text, markdown)
Since v4.1 can download to a folder all images discovered
Configuration of various limits on the crawl and the output file size
Adds option in simple setup and complex setup for scraping email addresses.
Adds field in Preferences for editing the regular expression that is used when scraping email addresses.
Note that web pages may obfuscate email addresses to prevent scraping. Even if the email address appears normally on the page, it may not
appear in the page’s source.
Compatibility: OS X 10.8 or later, 64-bit processor
DMG open password: minorpatch.com