GoFuckYourself.com - Adult Webmaster Forum

GoFuckYourself.com - Adult Webmaster Forum (https://gfy.com/index.php)
-   Fucking Around & Business Discussion (https://gfy.com/forumdisplay.php?f=26)
-   -   removing google indexed wp login page? (https://gfy.com/showthread.php?t=776571)

d-null 10-14-2007 11:59 PM

removing google indexed wp login page?
 
I guess a proactive approach is best, but I noticed that a couple of my blogs have had the wp-admin login page indexed, as well as the lost password etc from that page


what is the best way to unindex those pages? I am worried that if I do a robots exclusion on those pages that my site will somehow be penalized by google for the broken link

doing a 301 is also a problem because I still want to use the wp-admin links myself

what is the best way to handle this problem, as the wp-admin login page is ranking higher than some of the other pages in my wp

Gothbox 10-15-2007 12:02 AM

If you have a webmaster account with Google there is an option inside your account area to have certain url's taken out of the index.

minusonebit 10-15-2007 12:02 AM

Am I the only one who has ever heard of Robots.txt? Google does follow it...

http://www.icwt.us/robots.txt

# Stop indexing forbidden directories User-agent: *

Disallow: /wp-content/
Disallow: /wp-admin/
Disallow: /wp-includes/

d-null 10-15-2007 12:04 AM

Quote:

Originally Posted by Gothbox (Post 13236353)
If you have a webmaster account with Google there is an option inside your account area to have certain url's taken out of the index.

what's the link for that? I checked webmaster tools and couldn't find anything there for it

d-null 10-15-2007 12:05 AM

Quote:

Originally Posted by minusonebit (Post 13236354)
Am I the only one who has ever heard of Robots.txt? Google does follow it...

I mention in my post the reason why robots.txt might not be the way to go in this situation... re-read the original post

minusonebit 10-15-2007 12:14 AM

Quote:

Originally Posted by jetjet (Post 13236360)
I mention in my post the reason why robots.txt might not be the way to go in this situation... re-read the original post

Google is not gonna penalize you for that. Can always put a meta tag into the login page itself if you are worried... no cache might not be a bad idea as well:

<META HTTP-EQUIV="CACHE-CONTROL" CONTENT="NO-CACHE">
<META HTTP-EQUIV="PRAGMA" CONTENT="NO-CACHE">

<META NAME="ROBOTS" CONTENT="NONE">

The following seems to be a bastardized hack:

<META NAME="ROBOTS" CONTENT="NOINDEX,NOFOLLOW">

And there is also:

<META NAME="GOOGLEBOT" CONTENT="NOARCHIVE">

d-null 10-15-2007 12:20 AM

thanks

reading your file, what is your reasoning for excluding rinkworks and whois.sc?

also archive.org?

V_RocKs 10-15-2007 12:26 AM

Penalized for having broken links... haha...

d-null 10-15-2007 12:27 AM

I guess I thought wrongly that google only would spider files that are linked from your index.html or index.php .....

so is it safe to say then that google may all of the sudden decide to spider and index any file or folder that is found in your root directory?

minusonebit 10-15-2007 12:28 AM

For all three, I dislike sites that take my content and use it in any way without asking first. Whois.sc has some digging features I don't want people using. Archive.org, I never gave them permission to rip off copies of my sites including graphics and all.

Diligent 10-15-2007 07:38 AM

jetjet:

No, in that regard you're not thinking very wrong...
All files in your directories that are NOT *linked to*, from *anywhere*, can *never* get indexed - those are invisible to spiders.
And BTW, it doesn't matter *where* from a file is linked to - if there's a link on a forum to some textfile on your server... that'll get it indexed too.
(As long as the origin is/gets indexed in the search engine.)

CyberHustler 10-15-2007 07:42 AM

remove the link to your login page.

TeenCat 10-15-2007 07:46 AM

just rename on your pages "/wp-admin/" to something like "/my_adm1n-5634Z8/" or anything what is not so normal and you have no more problems with indexed wp-admin link ;)


All times are GMT -7. The time now is 06:54 PM.

Powered by vBulletin® Version 3.8.8
Copyright ©2000 - 2025, vBulletin Solutions, Inc.
©2000-, AI Media Network Inc123