View Single Post
Old 01-06-2019, 09:50 AM  
babeterminal
Confirmed User
 
Industry Role:
Join Date: Jul 2010
Location: tits
Posts: 2,751
Tips For Google: part 3

The text that you have entered is too long (12356 characters). Please shorten it to 10000 characters long.

Instead of using the META Description Tag, Google uses a "snippet" to describe what the page is about. Snippets often don't tell you enough about the page. Ignoring the Description META Tag for determining the relevancy of a page is fine, but utilizing the META Description Tag may provide more meaningful information than their little snippet does. Most Web authors use the META Description Tag for what it was designed for and if all search engines did not include the META Description for ranking purposes, we would then have a more meaningful description. We have to admit that some authors use the META Keywords Tag inappropriately.

Google has no case sensitive searching. Using either lower or upper case returns the same results. From a Web developer's point of view this is great, but from a searcher's point of view this can be extremely frustrating.

Google ignores common words and characters such as "where" and "how", as well as certain single digits and single letters, because they tend to slow down your search without improving the results.

The concept of using stop words is great because if you searched on the term "apple" and a hundred billion URLs in Google's index contained that word, it would in my opinion be a worthless search. But Google's stop words are hard coded words and these stop words are included with every single search you perform including phrase searching. So if you go to Google and search on the phrase:

"this and that"

You would not be presented with any results. Any other full text search engine will perform this search because the phrase "this and that" is NOT the same as if you were to enter each keyword separately. This is now a "phrase" and should not be considered separate keywords. It is true that Google allows you to search this type of phrase. To do this a searcher would have to enter the following query instead:

"+this +and +that"

Google treats each word in your phrase as a separate word rather than as a group (phrase) of words like other major full text search engines do. And if you don't know Google's stop words list (nobody does), then it may take several searches before you've actually entered a search phrase that Google will accept and actually search every word in the phrase (even though it may not return a page with that phrase).

The search phrase "+this +and +that" returned about 183,000 URLs in Google which is NOT very much considering Google's index is larger than 1.6 billion URLs! And if you look closely at the results you will find quite a few of the returned results do NOT contain the phrase "this and that". That's because Google uses "proximity" calculations instead of returning the results the searcher asked for. Proximity in this case means that if Google can find all the words contained in this phrase somewhat close to each other in the document, then Google feels that it is a valid hit. Again, this is fine for the Web page author, but not usually good for the searcher and certainly not for the power user.

Stop words on other major search engines like Alta Vista are generated dynamically. Here's an example of how Alta Vista dynamically generates it's stop words:

Let's say it's day one for Alta Vista. They just launched their search engine with NO entries in the index. They receive a URL addition and that entire document contained the following:

Apple pie is good eating

Now you go to Alta Vista and search on the keyword "apple". Alta Vista will return NO results because the word "apple" is contained in 100% of all URLs indexed (only one at this point). Alta Vista considered "apple" to be a stop word if 60% or greater of the documents indexed contained that word (60% is a hypothetical number used for this example). But later another page was added to their index with the following text:

Cherry pie is good tasting

Now when a searcher searches on "apple" he will be presented with results. But if that searcher searched on the word "pie", Alta Vista would return NO results because "pie" was found in 100% of all URLs indexed. Now one more URL was added to their index and it contained the following:

Blueberry pie is an excellent desert

The words "Apple", "Cherry", "Blueberry", "an", "eating", "tasting" and "desert" would all be relevant search terms because they appear in less than 60% of all documents indexed. The words "pie", "is" and "good" would be considered stop words in this example because they are found in 60% or more of all indexed documents.

With Google being one of the largest full text search engine on the Web, it makes little sense that Google would hard code stop words. Dynamically generated stop words would be perfect for Google.

Of course Google as well as other major full text search engines will allow you to search using stop words simply by placing a "+" in front of that word.

Google's advanced link search is broken....

If you want to search out pages that link to a specific Web site (very important for Web site optimization and submission) you would enter the following search term:

link:Scrub The Web Search Engine - Submit URL - Seo Directory

That will return all pages that link to this domain. At the bottom of the search results Google gives you the opportunity to search for a specific term found on any of these pages that link to this domain. Click on the link (next to the search box at the bottom of the page), that reads "Search within results". Now type in your search terms. In this example we will use the search term "meta tag" and hit the "Search within results" button. This will then return zero results, but it should return hundreds or even thousands.

Google may or may NOT follow the Standard Robot Exclusion (ie, robots.txt). We have found that Google can visit a Web page on a site and follow that link to your site and index that page even if on your site you include that page in your robots.txt (or in the Robots META Tag).

We know of this problem because we have repeatedly requested Google remove pages from their index that we have requested robots NOT index in our robots.txt file. In this file (created way back in 1996) we explicitly tell ALL robots to NOT index anything found in our /cgi-bin/ directory:

# for use by all robots
User-agent: *
Disallow: /cgi-bin/

We do this because it makes no sense for a robot to visit our dynamically generated Web pages. We also do this so that a robot doesn't get "caught up in an infinite space" just as Google themselves realize can happen. A robot can run wild in circumstances like this and can actually bring down a server in no time. To show you what Google has indexed that we have repeatedly requested they remove (for several months now)

Pretty stupid isn't it.

Google's believes their PageRank rating system makes Google virtually immune to spam because even if a Web page makes it into Google's database by using unethical methods, there is very little chance it will receive a high PageRank rating. However we have shown you ways to manipulate Google's PageRank system by generating as many links to your site as possible. Links alone does not make a page become relevant to a search query, but it can boost your PageRank.
__________________
*SIG SPOT SEND MESSAGE IF INTERESTED*
babeterminal is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote