Need one of those.
I know about Reffy. Any others?
Preferably one in Perl, so I can add a few lines to make it download the HTML source of page 1, follow all the HREF links on page 1, and then follow all the links found after that, sending the same HTTP referer on every request. I'd leave it to crawl millions of domains for a few weeks on Uni servers.
