Motherfucking stinking bitch of a URL - programmers check this link out.
I was running my spider and indexing all the URLs when suddenly the whole system froze and eventually crashed. I managed to find the problem URL and then spent hours debugging my software looking for the fault.
Can you see what the problem was? :Graucho http://x-star.com1.ru/xstar.htm |
Quote:
Results: TLD = .ru, Domain Name = com1, Subdomains = x-star, Path = "/", FileName = xstar, File Extension = .html |
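For anyone following along, a breakdown like the one quoted above can be sketched in a few lines of Python. This is only a rough illustration, not the poster's spider: `split_url` is a made-up name, and treating the last host label as the TLD is a simplifying assumption that breaks on suffixes such as .co.uk.

```python
from urllib.parse import urlsplit
import posixpath


def split_url(url):
    """Split a URL into the parts listed in the quoted results (hypothetical helper)."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    labels = host.split(".")
    # Separate the directory part of the path from the last path segment.
    directory, leaf = posixpath.split(parts.path or "/")
    stem, ext = posixpath.splitext(leaf)
    return {
        "tld": "." + labels[-1],                       # naive: last label only
        "domain": labels[-2] if len(labels) > 1 else "",
        "subdomains": labels[:-2],
        "path": directory or "/",
        "filename": stem,
        "extension": ext,
    }


info = split_url("http://x-star.com1.ru/xstar.htm")
print(info["domain"], info["subdomains"], info["filename"])
```

Note that the domain label comes out as a bare "com1", which is exactly the string that ends up being used as a file name.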
Quote:
You'll never spot what is wrong with the URL (as far as programming is concerned) :glugglug |
I really don't see the problem.
com1.ru is a domain and x-star is the subdomain. You could also call x-star.ru a domain, all on the same host, agava.ru, so where is the problem or secret supposed to be? |
Quote:
The domain name is "com1", and that just so happens to be the name Windows uses for a communications (serial) port. When I was trying to read or write a file called "com1", the data stream was being redirected to the bloody :BangBang: printer (not exactly what happened, but it's close enough and a lot shorter to explain). Just wondered if anybody would spot that :thumbsup |
Hehe, what kind of printer are you using??
The COM ports are serial ports, used for modems, mice, cameras and so on on older PCs. Printers normally run in parallel over LPT ports, and as most things today run through USB ports, there is not much use for the COM ports anyway LOL. |
Quote:
Try renaming a text file on your PC to "com1.txt" and it won't let you, because it's an illegal file name.
In VB you can look for the existence of a file like this: LookForFile$ = DIR("somefile.txt"). If "somefile.txt" exists, LookForFile$ is set to "somefile.txt"; otherwise it is set to an empty string, LookForFile$ = "".
Now if you try LookForFile$ = DIR("com1.txt"), you get LookForFile$ = "com1", even though there is no such file as "com1.txt". My program would then open "com1" for read/write access because it thought the file existed and, as far as I can tell, started streaming data to the port "com1" (I thought that was the printer port, but it's not; that's "lpt1").
It's just a stupid one-off domain name. It's never happened before, and I've spidered hundreds of thousands of pages. |
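The check that would have caught this can be sketched roughly as follows. It is in Python rather than VB, `is_reserved_device_name` is a made-up name, and the list is the usual set of reserved DOS device names on Windows (CON, PRN, AUX, NUL, COM1 to COM9, LPT1 to LPT9), which stay reserved whatever extension follows them.

```python
import re

# Reserved DOS device names on Windows, matched case-insensitively.
_RESERVED = re.compile(r"^(CON|PRN|AUX|NUL|COM[1-9]|LPT[1-9])$", re.IGNORECASE)


def is_reserved_device_name(filename):
    """Return True if the part before the first dot is a reserved device name."""
    stem = filename.split(".", 1)[0].rstrip(" ")
    return bool(_RESERVED.match(stem))


for name in ("com1.txt", "COM1", "xstar.htm", "community.txt"):
    print(name, is_reserved_device_name(name))
```

Running a test like this before any existence check or open call means the program never asks the operating system about "com1" as if it were an ordinary file.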
There is nothing wrong with the URL at all, contrary to what you suggested.
There is only something wrong with your spider and the way it names files. |
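One possible way for a spider to name its files safely is sketched below, assuming it stores data under names derived from URL parts. `safe_filename` and the underscore prefix are illustrative choices, not anything from the original program, and the sketch does not deal with other illegal characters such as ":" or "?".

```python
# Reserved DOS device names on Windows; any of these before the first dot is unsafe.
RESERVED_NAMES = (
    {"CON", "PRN", "AUX", "NUL"}
    | {f"COM{i}" for i in range(1, 10)}
    | {f"LPT{i}" for i in range(1, 10)}
)


def safe_filename(name):
    """Prefix an underscore when the name would collide with a device name."""
    base = name.split(".", 1)[0].strip(" ").upper()
    if base in RESERVED_NAMES:
        return "_" + name
    return name


print(safe_filename("com1"))      # _com1
print(safe_filename("com1.txt"))  # _com1.txt
print(safe_filename("x-star"))    # x-star
```

Any mapping works as long as it is applied consistently on both write and read, so the spider finds its own files again later.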