GoFuckYourself.com - Adult Webmaster Forum

GoFuckYourself.com - Adult Webmaster Forum (https://gfy.com/index.php)
-   Fucking Around & Business Discussion (https://gfy.com/forumdisplay.php?f=26)
-   -   Motherfucking stinking bitch of a URL - programmers check this link out. (https://gfy.com/showthread.php?t=223766)

AssFairy 01-21-2004 03:14 AM

Motherfucking stinking bitch of a URL - programmers check this link out.
 
I'm running my spider and indexing all the url's when suddenly the whole system freezes and eventually crashes, I managed to find the problem url and spent hours debugging my software looking for the fault.

Can you see what the problem was :Graucho

http://x-star.com1.ru/xstar.htm

AssFairy 01-21-2004 03:20 AM

Quote:

Originally posted by Mystery Man
it has .com in it
Nope my url parser eats it for breakfast

results

TLD = .ru
Domain Name = com1
Subdomains= x-star
Path= "/"
FileName= xstar
File Extention = .html

blackmonsters 01-21-2004 03:39 AM

Quote:

Originally posted by AssFairy


Nope my url parser eats it for breakfast

results

TLD = .ru
Domain Name = com1
Subdomains= x-star
Path= "/"
FileName= xstar
File Extention = .html

It ate the wrong file extension.

AssFairy 01-21-2004 03:44 AM

Quote:

Originally posted by blackmonsters


It ate the wrong file extension.

No that's just my bad typing, the url parser breaks down any url into all its individual component parts including port numbers user data bookmark tags cgi strings blah blah blah...

you'll never spot what is wrong with the url (as far as programming is concerned) :glugglug

rogerbo 01-21-2004 05:16 AM

I realy don't see the problem
com1.ru is a Domain and x-star the subdomain.
you can also call x-star.ru also a domain
all on the same host agava.ru
so where should be the problem or secret ?

AssFairy 01-21-2004 06:17 AM

Quote:

Originally posted by rogerbo
I realy don't see the problem
com1.ru is a Domain and x-star the subdomain.
you can also call x-star.ru also a domain
all on the same host agava.ru
so where should be the problem or secret ?

Well this had me stumped for a long time.

The domain name is "com1" and that just so happens to be used by computers as a communications port, when I was trying to read or write to a file called "com1" the data stream was being redirected to the bloody :BangBang: printer

(not exactly what happened but its close enough and a lot shorter to explain)

Just wondered if anybody would spot that :thumbsup

rogerbo 01-21-2004 07:23 AM

Hehe what kind of printer are you using ??
the com ports are Serial ports used for Modems, Mouses, Cammeras and so on on the older PCs.
Printer normaly do run Paralell over lpt ports and as today most is run through UBS ports there is not that much use for the com ports anyway LOL.

AssFairy 01-21-2004 08:01 AM

Quote:

Originally posted by rogerbo
Hehe what kind of printer are you using ??
the com ports are Serial ports used for Modems, Mouses, Cammeras and so on on the older PCs.
Printer normaly do run Paralell over lpt ports and as today most is run through UBS ports there is not that much use for the com ports anyway LOL.

Well I did say it wasn't exactly how it happened

Try renaming a text file on your pc to "com1.txt" and it won't let you because its an illegal file name.

In VB you can look for the existence of a file like this

LookForFile$ = DIR("somefile.txt")

if "somefile.txt" exists then LookForFile$ would be set to "somefile.txt" otherwise is would be set to an empty string .. LookForFile$ = ""

Now if you try the following

LookForFile$ = DIR("com1.txt")

LookForFile$ = "com1" even though there is no such file as "com1.txt"

My program would then start to open "com1" for read write access because it thought the file existed and as far as I can tell started streaming data to the port "com1" (I thought that was the printer port but its not its "lpt1")

Its just a stupid one off domain name, its never happened before and I've spiderd hundreds of thousands of pages.

nuclei 01-21-2004 09:50 AM

There is nothing wrong with the url at all, which you suggested there was.

There is only something wrong with your spider and the way it names files.


All times are GMT -7. The time now is 12:40 AM.

Powered by vBulletin® Version 3.8.8
Copyright ©2000 - 2025, vBulletin Solutions, Inc.
©2000-, AI Media Network Inc123