Welcome to the GoFuckYourself.com - Adult Webmaster Forum forums.

You are currently viewing our boards as a guest which gives you limited access to view most discussions and access our other features. By joining our free community you will have access to post topics, communicate privately with other members (PM), respond to polls, upload content and access many other special features. Registration is fast, simple and absolutely free so please, join our community today!

If you have any problems with the registration process or your account login, please contact us.

Post New Thread Reply

Register GFY Rules Calendar
Go Back   GoFuckYourself.com - Adult Webmaster Forum > >
Discuss what's fucking going on, and which programs are best and worst. One-time "program" announcements from "established" webmasters are allowed.

 
Thread Tools
Old 03-10-2006, 01:19 PM   #1
pr0
rockin tha trailerpark
 
pr0's Avatar
 
Industry Role:
Join Date: May 2001
Location: ~Coastal~
Posts: 23,088
Need a link stripper

Any programs out there that can hit a url, & export a text list of every link on the page

in this fashion


http://url1.com
http://url2.com
http://url3.com
pr0 is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 03-10-2006, 01:33 PM   #2
pr0
rockin tha trailerpark
 
pr0's Avatar
 
Industry Role:
Join Date: May 2001
Location: ~Coastal~
Posts: 23,088
bump bitch!
pr0 is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 03-10-2006, 01:43 PM   #3
Love Sex
Confirmed User
 
Join Date: Nov 2004
Posts: 1,905
yes anyone know, that could be really useful
Love Sex is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 03-10-2006, 01:49 PM   #4
pr0
rockin tha trailerpark
 
pr0's Avatar
 
Industry Role:
Join Date: May 2001
Location: ~Coastal~
Posts: 23,088
Quote:
Originally Posted by Love Sex
yes anyone know, that could be really useful
well help me keep this bumped...someones got one
pr0 is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 03-10-2006, 02:50 PM   #5
PolySix
Registered User
 
Join Date: Feb 2005
Location: Minnesota
Posts: 19
I use Velocityscape's webscraper. It is rather spendy, but has a lot of bells and whistles to it. It's automation features are rather nice.

If you want free, and don't mind processing offline, then you can use this from the perl handbook:


Code:
#!/usr/bin/perl
  
use HTML::LinkExtor;
  
my $FILENAME = 'file.html';
  
$parser = HTML::LinkExtor->new(undef, $base_url);
$parser->parse_file($FILENAME);
@links = $parser->links;
foreach $linkarray (@links) {
    my @element = @$linkarray;
    my $elt_type = shift @element; # element type
  
    # possibly test whether this is an element we're interested in
    while (@element) {
        # extract the next attribute and its value
        my ($attr_name, $attr_value) = splice(@element, 0, 2);
        if ($elt_type eq 'a' && $attr_name eq 'href') {
            print "ANCHOR: $attr_value\n" 
        }
    }
}
PolySix is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 03-10-2006, 04:14 PM   #6
Love Sex
Confirmed User
 
Join Date: Nov 2004
Posts: 1,905
http://www.velocityscape.com/ nice software
Wouldnt a link stripper and a site scrapper be two diffent items in a sense?
Love Sex is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 03-10-2006, 10:50 PM   #7
PolySix
Registered User
 
Join Date: Feb 2005
Location: Minnesota
Posts: 19
Quote:
Originally Posted by Love Sex
www velocityscape com nice software
Wouldnt a link stripper and a site scrapper be two diffent items in a sense?
Yes, they would be considered two different items. I guess a more accurate way to say it would be that, link stripping (link extracting), is one basic example of the many things that can be done with a product like webscraper.

It's probably overkill for what the original poster wants to accomplish. But, if you need to do this sort of thing a lot, and want to automate the process as much as possible, then it's an excellent tool.
PolySix is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Post New Thread Reply
Go Back   GoFuckYourself.com - Adult Webmaster Forum > >

Bookmarks



Advertising inquiries - marketing at gfy dot com

Contact Admin - Advertise - GFY Rules - Top

©2000-, AI Media Network Inc



Powered by vBulletin
Copyright © 2000- Jelsoft Enterprises Limited.