Quote:
Originally Posted by FlexxAeon
I understand that FULLY but as i said above, that is still INFORMATION that would need to be processed. are we saying now that a SE has eyeballs and is reading the page? or that all SE indexing is done by hand? 1's and 0's are still 1's and 0's.... am i wrong?
|
Ok, I wrote this just now for you:
See the highlighted html; that's a webpage:
This script is simple so don't include any javascript or comments and only use
css in a tag like this <div style="border:solid black 1px">.
This is to show you how a search engine sees your site.
This is the part of a search engine that will find the text on your site.
Quote:
#!/usr/bin/perl
print "content-type: text/html\n\n";
$webpage = <<EOF;
<html>
<head>
<title>don't you get it yet?</title>
</head>
<center>
<table width=100>
<tr>
<td align=center>
yeah!!!
</td>
</tr>
<table>
</center>
</body>
</html>
EOF
$safe = 0;
$x = 0;
while ($x >= 0) {
$loc = index($webpage, "\<");
if ($loc != -1) {
$loc2 = index($webpage, "\>", $loc);
if ($loc2 != -1) {
$mess = substr($webpage, $loc, $loc2-$loc+1);
$webpage =~ s/$mess/ /ig;
}
else {
$x=-1;
}
}
else {
$x=-1;
}
$safe++;
if ($safe > 200) {exit;}
if (-e "kill.txt") {exit;}
}
print "Here is the text for that page : <br><br> $webpage";
exit;
|
Now run that script in your cgi-bin and watch it return the text.
Note:
If you include <script>blah</script> or <style>blah</style> it will return "blah" as text since this
is just a simple example for you.