04-26-2010, 05:47 PM
d-null
Join Date: Apr 2007
Location: NY
Posts: 13,724
This doesn't answer your question, but I thought it would be a good tip for people who don't want any of their sites on archive.org. It's an easy fix: just add the lines below to all of your robots.txt files and they will remove your sites from there.

Quote:
To remove your site from the Wayback Machine, place a robots.txt file at the top level of your site (e.g. www.yourdomain.com/robots.txt) and then submit your site below.

The robots.txt file will do two things:

It will remove all documents from your domain from the Wayback Machine.
It will tell us not to crawl your site in the future.

To exclude the Internet Archive's crawler (and remove documents from the Wayback Machine) while allowing all other robots to crawl your site, your robots.txt file should say:

User-agent: ia_archiver
Disallow: /
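
If you want to double-check that those two lines do what the quote says (block ia_archiver, leave every other robot alone), here is a rough sketch using Python's standard urllib.robotparser. The is_allowed helper, the sample URL, and the "Googlebot" agent name are just my own placeholders for illustration; this only checks how a standards-following parser reads the rules, not what archive.org actually does on its end.

```python
from urllib.robotparser import RobotFileParser

# The exact rules from the quoted archive.org instructions.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Hypothetical helper: True if user_agent may fetch url under robots_txt."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Placeholder URL; substitute a page from your own domain.
    url = "http://www.yourdomain.com/some/page.html"
    print("ia_archiver allowed:", is_allowed(ROBOTS_TXT, "ia_archiver", url))
    print("Googlebot allowed:", is_allowed(ROBOTS_TXT, "Googlebot", url))
```

The first check should come back blocked and the second allowed, since there is no "User-agent: *" rule restricting anyone else.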

Last edited by d-null; 04-26-2010 at 05:49 PM.