Welcome to the GoFuckYourself.com - Adult Webmaster Forum forums.

You are currently viewing our boards as a guest which gives you limited access to view most discussions and access our other features. By joining our free community you will have access to post topics, communicate privately with other members (PM), respond to polls, upload content and access many other special features. Registration is fast, simple and absolutely free so please, join our community today!

If you have any problems with the registration process or your account login, please contact us.

Post New Thread Reply

Register GFY Rules Calendar
Go Back   GoFuckYourself.com - Adult Webmaster Forum > >
Discuss what's fucking going on, and which programs are best and worst. One-time "program" announcements from "established" webmasters are allowed.

 
Thread Tools
Old 08-23-2004, 08:40 AM   #1
rowan
Too lazy to set a custom title
 
Join Date: Mar 2002
Location: Australia
Posts: 17,393
Huh? Google is indexing GFY?

# robots.txt file for http://www.gfyboard.com/

User-agent: *
Disallow: /




This says nothing is allowed to crawl this site, but there are plenty of pages indexed in Google...

Here's an example

http://www.google.com/search?hl=en&l...&btnG=Sear ch
rowan is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 08:41 AM   #2
rowan
Too lazy to set a custom title
 
Join Date: Mar 2002
Location: Australia
Posts: 17,393
http://www.google.com/search?q=%22Ne...e=off&filter=0
rowan is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 08:44 AM   #3
chupachups
Confirmed User
 
chupachups's Avatar
 
Join Date: Dec 2002
Location: Sweden/Spain you sum bitch!
Posts: 6,576
Google doesnt give a shit about robot rules...
chupachups is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 09:03 AM   #4
julioxp
Confirmed User
 
Join Date: Aug 2004
Location: wake me up, before you go go..
Posts: 160
Yeap, I have found some pages of known posts at GFY when do a search at google..

julio
__________________
SIG TOO BIG! Maximum 120x60 button and no more than 3 text lines of DEFAULT SIZE and COLOR. Unless your sig is for a GFY top banner sponsor, then you may use a 624x80 instead of a 120x60.
julioxp is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 09:40 AM   #5
beergood
Confirmed User
 
Join Date: Jun 2003
Location: United States
Posts: 2,918
I think that the robots.txt file for gfy only causes the robot not to index the index page.

http://www.google.com/search?hl=en&l...ckyourself.com


User-agent: *
Disallow: *

for the whole site
beergood is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 09:42 AM   #6
rowan
Too lazy to set a custom title
 
Join Date: Mar 2002
Location: Australia
Posts: 17,393
Quote:
Originally posted by beergood
I think that the robots.txt file for gfy only causes the robot not to index the index page.

http://www.google.com/search?hl=en&l...ckyourself.com


User-agent: *
Disallow: *

for the whole site
No, the robots.txt entry for GFY is correct. "Disallow: /" will exclude the entire site from crawling.
rowan is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 09:44 AM   #7
rowan
Too lazy to set a custom title
 
Join Date: Mar 2002
Location: Australia
Posts: 17,393
BTW Google has indexed the root, just not "gofuckyourself.com"

http://www.google.com/search?hl=en&l...&btnG= Search
rowan is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 09:45 AM   #8
beergood
Confirmed User
 
Join Date: Jun 2003
Location: United States
Posts: 2,918
Quote:
Originally posted by rowan
No, the robots.txt entry for GFY is correct. "Disallow: /" will exclude the entire site from crawling.

See I wasn't entirely sure because I never blocked their bot. You are absolutely right. I looked it up.
beergood is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 09:47 AM   #9
modF
Confirmed User
 
Join Date: Aug 2002
Posts: 1,888
I don't think it matters now that you need an account to read the board, those threads should start dropping off soon.
__________________

I do things
skype:themodF
modF is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 09:48 AM   #10
beergood
Confirmed User
 
Join Date: Jun 2003
Location: United States
Posts: 2,918
Its fairly neat that they still don't have a bot that can obey even the most simple robots.txt file.
beergood is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 09:51 AM   #11
rowan
Too lazy to set a custom title
 
Join Date: Mar 2002
Location: Australia
Posts: 17,393
The date on the GFY robots.txt is July 8th 2004, so it's possible that it didn't exist (or didn't have the exclude) the last time that Googlebot passed over it.
rowan is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 09:55 AM   #12
beergood
Confirmed User
 
Join Date: Jun 2003
Location: United States
Posts: 2,918
Quote:
Originally posted by rowan
The date on the GFY robots.txt is July 8th 2004, so it's possible that it didn't exist (or didn't have the exclude) the last time that Googlebot passed over it.
Yup. could be. Wonder what it was before.
__________________
icq: 320340263
beergood is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 09:59 AM   #13
chemicaleyes
UNSTOPPABLE
 
chemicaleyes's Avatar
 
Join Date: Aug 2003
Location: UK :: ICQ# 156068
Posts: 11,569
Quote:
Originally posted by rowan
The date on the GFY robots.txt is July 8th 2004, so it's possible that it didn't exist (or didn't have the exclude) the last time that Googlebot passed over it.
The robots file used to be up and google didn't index GFY, then the file disappeared months ago, google started indexing and now the file is back up. Google will stop indexing.. why is this a big deal for you anyway?
__________________
No way as way, No limitation as limitation. AmeriNOC formally PhatServers
chemicaleyes is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 10:03 AM   #14
beergood
Confirmed User
 
Join Date: Jun 2003
Location: United States
Posts: 2,918
Quote:
Originally posted by chemicaleyes
The robots file used to be up and google didn't index GFY, then the file disappeared months ago, google started indexing and now the file is back up. Google will stop indexing.. why is this a big deal for you anyway?

Just something to talk about.
__________________
icq: 320340263
beergood is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 08-23-2004, 10:14 AM   #15
Basic_man
Programming King Pin
 
Basic_man's Avatar
 
Industry Role:
Join Date: Oct 2003
Location: Montreal
Posts: 27,360
Quote:
Originally posted by Chupachups
Google doesnt give a shit about robot rules...
Looks like that..
__________________
UUGallery Builder - automated photo/video gallery plugin for Wordpress!
Stop looking! Checkout Naked Hosting, online since 1999 !
Basic_man is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Post New Thread Reply
Go Back   GoFuckYourself.com - Adult Webmaster Forum > >

Bookmarks



Advertising inquiries - marketing at gfy dot com

Contact Admin - Advertise - GFY Rules - Top

©2000-, AI Media Network Inc



Powered by vBulletin
Copyright © 2000- Jelsoft Enterprises Limited.