GoFuckYourself.com - Adult Webmaster Forum
#1 |
rockin tha trailerpark
Industry Role:
Join Date: May 2001
Location: ~Coastal~
Posts: 23,088
|
Need a link stripper
Are there any programs out there that can hit a URL and export a text list of every link on the page,
in this fashion: http://url1.com http://url2.com http://url3.com
|
#2 |
rockin tha trailerpark
Industry Role:
Join Date: May 2001
Location: ~Coastal~
Posts: 23,088
|
bump bitch!
|
#3 |
Confirmed User
Join Date: Nov 2004
Posts: 1,905
|
Yes, does anyone know? That could be really useful.
|
#4 | |
rockin tha trailerpark
Industry Role:
Join Date: May 2001
Location: ~Coastal~
Posts: 23,088
|
Quote:
|
|
#5 |
Registered User
Join Date: Feb 2005
Location: Minnesota
Posts: 19
|
I use Velocityscape's webscraper. It is rather spendy, but has a lot of bells and whistles to it. Its automation features are rather nice.
If you want free, and don't mind processing offline, then you can use this from the Perl handbook:
Code:
    #!/usr/bin/perl
    use strict;
    use warnings;
    use HTML::LinkExtor;

    my $FILENAME = 'file.html';    # page saved to disk beforehand
    my $base_url;                  # optionally set to the page's URL to make relative links absolute

    my $parser = HTML::LinkExtor->new(undef, $base_url);
    $parser->parse_file($FILENAME);
    my @links = $parser->links;

    foreach my $linkarray (@links) {
        my @element  = @$linkarray;
        my $elt_type = shift @element;    # element type

        # possibly test whether this is an element we're interested in
        while (@element) {
            # extract the next attribute and its value
            my ($attr_name, $attr_value) = splice(@element, 0, 2);
            if ($elt_type eq 'a' && $attr_name eq 'href') {
                print "ANCHOR: $attr_value\n";
            }
        }
    }
|
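If you'd rather hit the URL directly instead of saving the page first, here is a rough sketch of the same idea in Python (not Perl), using only the standard library. The names `LinkStripper`, `strip_links` and `strip_links_from_url` are made up for this example, and the sample HTML is invented. It only looks at `<a href="...">` tags, same as the Perl above, so treat it as a starting point rather than a finished tool.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class LinkStripper(HTMLParser):
    """Collects the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url=""):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # urljoin leaves absolute URLs alone and resolves relative ones.
                self.links.append(urljoin(self.base_url, value))


def strip_links(html, base_url=""):
    """Return every anchor link found in an HTML string, in page order."""
    parser = LinkStripper(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


def strip_links_from_url(url):
    """Fetch a live page and return its links (needs network access)."""
    with urlopen(url) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        html = response.read().decode(charset, errors="replace")
    return strip_links(html, base_url=url)


if __name__ == "__main__":
    # Invented sample page: one absolute link, one relative link.
    sample = '<a href="http://url1.com">one</a> <a href="/page2">two</a>'
    for link in strip_links(sample, base_url="http://url2.com/"):
        print(link)
```

Run as-is, it prints `http://url1.com` and `http://url2.com/page2`, one per line. For a live page you would call `strip_links_from_url("http://url1.com")` and redirect the output to a text file.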
#6 |
Confirmed User
Join Date: Nov 2004
Posts: 1,905
|
http://www.velocityscape.com/ nice software
Wouldn't a link stripper and a site scraper be two different items in a sense? |
#7 | |
Registered User
Join Date: Feb 2005
Location: Minnesota
Posts: 19
|
Quote:
It's probably overkill for what the original poster wants to accomplish. But, if you need to do this sort of thing a lot, and want to automate the process as much as possible, then it's an excellent tool. |
|