How can I automatically extract only the domain names from a list of URLs?
I need some code, or just a method, to do that.
For example, URL list: http://www.fdh.com/fsdas.htm http://cxzb.net/bgy5.htm http://www.cvxzn.info/467.htm Extracted domain names: fdh.com cxzb.net cvxzn.info Thanks very much for any advice! |
Use Replace All in Notepad.
|
You need a domain scraper. When I have bulk lists of domains with bullshit like registration dates mixed in, I just toss the list into a domain scraper and it spits back the domains minus the junk. Mine is custom programmed and runs on my own computer, but there are server-side scripts that do the same thing. Just search around.
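For anyone who wants to roll their own, here is a minimal sketch of what such a scraper might look like in Python. It is not the custom tool mentioned above; the function name `scrape_domains` and the sample text are made up for illustration, and it assumes every URL in the list starts with http:// or https://.

```python
import re
from urllib.parse import urlparse

def scrape_domains(text):
    """Return the unique host names found in free-form text, in order of appearance."""
    domains = []
    # Pick out anything that looks like an http(s) URL and ignore the surrounding junk.
    for url in re.findall(r"https?://\S+", text):
        host = urlparse(url).hostname or ""
        # Drop a leading "www." so www.fdh.com is reported as fdh.com.
        if host.startswith("www."):
            host = host[4:]
        if host and host not in domains:
            domains.append(host)
    return domains

# Hypothetical messy input: URLs mixed with registration dates and other noise.
sample = "reg 2009-01-02 http://www.fdh.com/fsdas.htm junk http://cxzb.net/bgy5.htm http://www.cvxzn.info/467.htm"
print(scrape_domains(sample))
```

Note that this only strips "www."; other subdomains (say, a blog. prefix) would be kept as part of the host name.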
|
Doesn't Brujah have a site with a tool that sifts lists? I'm pretty sure he used to, if he doesn't still.
Sorry, I can't remember the name of it right now, but if you look him up on here you may find some info. |
|
Forgive me, I'm lazy and it's pretty early in the morning, but just get a decent HTML/text editor and do a find and replace with regular expressions turned on (even Dreamweaver has a regex feature).
Find: .*\.(([a-zA-Z0-9]|_|-)+\.(com|net|org|etc)).*
Replace: $1
I'm sure someone can beautify my ugly-ass regex above, because it's pretty ugly and hasn't been tested. |
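If you'd rather script the find and replace, here is a rough Python sketch of the same idea. As far as I can tell, the pattern as posted needs a dot in front of the domain, so a URL like http://cxzb.net/bgy5.htm (no www.) would slip through unchanged. The variant below is my own assumption, not the poster's regex: it makes the scheme and the www. optional, and the TLD list (com, net, org, info) is only an example that you would extend yourself. Python writes the backreference as `\1` where the editor uses `$1`.

```python
import re

# Assumed variant of the posted pattern: optional scheme, optional "www.",
# then one label plus a TLD from a short example list, then an optional path.
PATTERN = re.compile(
    r"^(?:https?://)?(?:www\.)?([a-zA-Z0-9_-]+\.(?:com|net|org|info))(?:/.*)?$"
)

def strip_to_domain(line):
    """Replace a whole URL line with just its domain; non-matching lines are left alone."""
    return PATTERN.sub(r"\1", line.strip())

urls = [
    "http://www.fdh.com/fsdas.htm",
    "http://cxzb.net/bgy5.htm",
    "http://www.cvxzn.info/467.htm",
]
for url in urls:
    print(strip_to_domain(url))
```

Lines that don't match (an unlisted TLD, or a deeper subdomain) come back untouched, just like a find and replace in an editor would leave them.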
try this code : 5%.yrt=43.9/todo(ou812) :thumbsup should work
|
Put your entire list in Notepad, type a tab at the bottom, and copy just that tab character.
Do a replace of http:// with http://TAB, then .com/ with .com/TAB, then .net/ with .net/TAB, then .info/ with .info/TAB. Then copy the page and paste it into Excel, and your domains will be in a separate column from the rest of the URL. |
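The same tab trick can be sketched in a few lines of Python, in case the list is too long to handle by hand. The function name `domain_column` and the default TLD tuple are just placeholders I picked; the code mirrors the replace steps above and then takes the second "column", the way Excel would split it.

```python
def domain_column(url, tlds=(".com", ".net", ".info")):
    """Mimic the Notepad/Excel trick: insert tabs around the domain, then take column 2."""
    # Step 1: put a tab right after the scheme.
    marked = url.replace("http://", "http://\t")
    # Step 2: put a tab right after each "TLD/" so the path lands in its own column.
    for tld in tlds:
        marked = marked.replace(tld + "/", tld + "/\t")
    columns = marked.split("\t")
    # The second column holds the domain plus a trailing slash; trim the slash.
    return columns[1].rstrip("/") if len(columns) > 1 else ""

for url in ["http://www.fdh.com/fsdas.htm", "http://cxzb.net/bgy5.htm", "http://www.cvxzn.info/467.htm"]:
    print(domain_column(url))
```

Like the manual version, this leaves any www. in place, so you'd need one more replace (www. with nothing) to get the bare domain.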