Welcome to the GoFuckYourself.com - Adult Webmaster Forum forums.

You are currently viewing our boards as a guest which gives you limited access to view most discussions and access our other features. By joining our free community you will have access to post topics, communicate privately with other members (PM), respond to polls, upload content and access many other special features. Registration is fast, simple and absolutely free so please, join our community today!

If you have any problems with the registration process or your account login, please contact us.

Post New Thread Reply

Register GFY Rules Calendar
Go Back   GoFuckYourself.com - Adult Webmaster Forum > >
Discuss what's fucking going on, and which programs are best and worst. One-time "program" announcements from "established" webmasters are allowed.

 
Thread Tools
Old 06-19-2005, 01:41 PM   #1
Rhesus
Confirmed User
 
Join Date: Aug 2004
Posts: 2,009
I need an extraction tool...

I need a tool that extracts search strings (in this case domains >> searching for .com/.net/.org and the letters that precede it) from a text file. Does anyone have any suggestions? Thanks :-)
Rhesus is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 01:52 PM   #2
xl11
Confirmed User
 
Industry Role:
Join Date: Jul 2003
Posts: 712
PHP + regular expressions
xl11 is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 01:56 PM   #3
Rhesus
Confirmed User
 
Join Date: Aug 2004
Posts: 2,009
Quote:
Originally Posted by xl11
PHP + regular expressions
If only I knew php ;-)
Rhesus is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 01:59 PM   #4
woj
<&(©¿©)&>
 
woj's Avatar
 
Industry Role:
Join Date: Jul 2002
Location: Chicago
Posts: 47,882
if you are looking to invest $$ for this tool, hit me up on icq: 33375924
__________________
Custom Software Development, email: woj#at#wojfun#.#com to discuss details or skype: wojl2000 or gchat: wojfun or telegram: wojl2000
Affiliate program tools: Hosted Galleries Manager Banner Manager Video Manager
Wordpress Affiliate Plugin Pic/Movie of the Day Fansign Generator Zip Manager
woj is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 02:00 PM   #5
chadglni
Confirmed User
 
Join Date: Dec 2002
Location: PEI, Canada
Posts: 6,924
Quote:
Originally Posted by woj
if you are looking to invest $$ for this tool, hit me up on icq: 33375924
lol
chadglni is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 02:02 PM   #6
fusionx
Confirmed User
 
Industry Role:
Join Date: Nov 2003
Location: Olongapo City, Philippines
Posts: 4,618
http://www.google.com/search?sourcei...&q=url+extract
fusionx is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 02:17 PM   #7
Rhesus
Confirmed User
 
Join Date: Aug 2004
Posts: 2,009
Quote:
Originally Posted by woj
if you are looking to invest $$ for this tool, hit me up on icq: 33375924
$5 max lmk
Rhesus is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 02:20 PM   #8
Rhesus
Confirmed User
 
Join Date: Aug 2004
Posts: 2,009
Quote:
Originally Posted by fusionx
Most of these extractors search for http, I only want domains, not URLs
Rhesus is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 02:25 PM   #9
woj
<&(©¿©)&>
 
woj's Avatar
 
Industry Role:
Join Date: Jul 2002
Location: Chicago
Posts: 47,882
Quote:
Originally Posted by Rhesus
$5 max lmk
hardly worth it, if you can do more, hit me up
__________________
Custom Software Development, email: woj#at#wojfun#.#com to discuss details or skype: wojl2000 or gchat: wojfun or telegram: wojl2000
Affiliate program tools: Hosted Galleries Manager Banner Manager Video Manager
Wordpress Affiliate Plugin Pic/Movie of the Day Fansign Generator Zip Manager
woj is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 03:25 PM   #10
Antonio
Too lazy to set a custom title
 
Antonio's Avatar
 
Join Date: Oct 2001
Location: Spartaaaaaaaaa
Posts: 14,136
Code:
<HTML>
<HEAD>
<SCRIPT LANGUAGE="JAVASCRIPT" TYPE="text/javascript">
<!-- Begin

// Created and Copyrighted by Benjamin Leow
// Please do not remove the copyright notice and the advertisement. Thank you.
// Please go to http://www.surf7.net for latest version and more freeware

function copy() {
textRange = document.extractor.output.createTextRange();
textRange.execCommand("RemoveFormat");
textRange.execCommand("Copy");
}

function paste() {
textRange = document.extractor.input.createTextRange();
textRange.execCommand("RemoveFormat");
textRange.execCommand("Paste");
}

function help(){

var imgwid = 450;
var imghgt = 360;

content = ('<html><head><title>Email Extractor Lite : Help</title>');
content += ('<STYLE TYPE="text/css">');
content += ('BODY,td,th,ul,p       { font: normal normal normal 8pt/1em Verdana; color: #000; 

}');
content += ('</STYLE>');
content += ('</head><body onload="window.focus();">');
content += ('<B>Quick and dirty</B>');
content += ('<OL>');
content += ('<LI>Copy all text from any webpages, documents, files, etc...');
content += ('<LI>Paste it into <B>Input Window</B>.');
content += ('<LI>Click "<I>Extract</I>" button.');
content += ('<LI>Copy the result from <B>Output Window</B> to somewhere and save it.');
content += ('<LI>Click "<I>Reset</I>" button to start all over again.');
content += ('</OL>');
content += ('<P><B>More Controls</B>');
content += ('<OL>');
content += ('<LI>Click "<I>Paste Input</I>" link to paste any text you copied elsewhere into 

<B>Input Window</B>.');
content += ('<LI>Click "<I>Copy Output</I>" link to copy whatever text inside <B>Output 

Window</B>.');
content += ('<LI>Choose different separator from the dropdown menu or specify your own. Default 

is comma.');
content += ('<LI>You can group a number of emails together. Each group is separated by a new 

line. Please enter number only.');
content += ('<LI>Check "<I>Sort Alphabetically</I>" checkbox to arrange extracted emails well... 

alphabetically.');
content += ('<LI>You can extract or exclude emails containing certain string (text). Useful if 

you only want to get email from a particular domain.');
content += ('<LI>You can choose to extract web addresses instead of email addresses.');
content += ('</OL>');
content += ('<DIV ALIGN="CENTER"><INPUT TYPE="button" VALUE="Close" 

onClick="javascript:window.close();"></DIV>');
content += ('</body></html>');

var winl = (screen.width - imgwid) / 2;
var wint = (screen.height - imghgt) / 2;
helpwindow = window.open('','help','width=' + imgwid + ',height=' + imghgt + 

',resizable=0,scrollbars=0,top=' + wint + ',left=' + winl + 

',toolbar=0,location=0,directories=0,status=0,menubar=0,copyhistory=0');
helpwindow.document.write(content);
helpwindow.document.close();
}

function checksep(value){
if (value) document.extractor.sep.value = "other";
}

function numonly(value){
if (isNaN(value)) {
	window.alert("Please enter a number or else \nleave blank for no grouping.");
	document.extractor.groupby.focus();
}
}

function findEmail() {
var email = "none";
var a = 0;
var ingroup = 0;
var separator = document.extractor.sep.value;
var string = document.extractor.string.value;
var groupby = Math.round(document.extractor.groupby.value);
var address_type = document.extractor.address_type.value;
var input = document.extractor.input.value;

if (document.extractor.lowcase.checked) {
	var input = input.toLowerCase();
}

if (separator hahahaha "new") separator = "\n";
if (separator hahahaha "other") separator = document.extractor.othersep.value;

if (address_type hahahaha "web") {
	rawemail = input.match(/([a-zA-Z0-9._-]+\.com+)/gi);
} else {
	rawemail = input.match(/([a-zA-Z0-9._-]+@[a-zA-Z0-9._-]+\.[a-zA-Z0-9._-]+)/gi);
}

var norepeat = new Array();
var filtermail = new Array();
if (rawemail) {
	if (string){
		x = 0;
		for (var y=0; y<rawemail.length; y++) {
			if (document.extractor.filter_type.value hahahaha 1) {
				if (rawemail[y].search(string) >= 0) {
					filtermail[x] = rawemail[y];
					x++;
				}
			} else {
				if (rawemail[y].search(string) < 0) {
					filtermail[x] = rawemail[y];
					x++;
				}
			}
		}
		rawemail = filtermail;
	}

	for (var i=0; i<rawemail.length; i++) {
		var repeat = 0;
		
		// Check for repeated emails routine
		for (var j=i+1; j<rawemail.length; j++) {
			if (rawemail[i] hahahaha rawemail[j]) {
				repeat++;
			}
		}
		
		// Create new array for non-repeated emails
		if (repeat hahahaha 0) {
			norepeat[a] = rawemail[i];
			a++;
		}
	}
	if (document.extractor.sort.checked) norepeat = norepeat.sort(); // Sort the array
	email = "";
	// Join emails together with separator
	for (var k = 0; k < norepeat.length; k++) {
		if (ingroup != 0) email += separator;
		email += norepeat[k];
		ingroup++;
		
		// Group emails if a number is specified in form. Each group will be separate by 

new line.
		if (groupby) {
			if (ingroup hahahaha groupby) {
				email += '\n\n';
				ingroup = 0;
			}
		}
	}
}

// Return array length
var count = norepeat.length;

// Print results
document.extractor.count.value = count;
document.extractor.output.value = email;
}
//  End -->
</SCRIPT>

<STYLE TYPE="text/css">
BODY                  { background:#FFF }
BODY,td,th,ul,p       { font: normal normal normal 8pt/1em Verdana; color: #000; }
textarea,input,select { font: normal normal normal 8pt/1em Verdana; color: #000; background:#FFF}
A:link, A:visited     { text-decoration: none; color: #059; }
A:active, A:hover     { text-decoration: underline; color: #D14; }
fieldset              { padding-left: 10px; padding-bottom: 10px; }
.bordercolor          { background:#666 }
.maincolor            { background:#CCC }
.button               { background:#CCC }
.titlebarcolor        { background:#007 }
.titlefont            { font: normal normal bold 9pt/1em Arial; color: #FFF; }
.copyrightfont        { font: normal normal normal 7.5pt/1.5em Verdana; color: #666; }
</STYLE>

<TITLE>Email Extractor Lite 1.7</TITLE>
</HEAD>
<BODY>

<DIV ALIGN="CENTER">
<FORM NAME="extractor">
<TABLE CLASS="bordercolor" CELLPADDING=1 CELLSPACING=0 BORDER=0><TR><TD>
<TABLE CLASS="maincolor" CELLPADDING=4 CELLSPACING=0 BORDER=0>
<TR CLASS="titlebarcolor" VALIGN="MIDDLE"> 
<TD><FONT CLASS="titlefont">Email Extractor Lite 1.7</FONT></TD>
<TD ALIGN="RIGHT" NOWRAP></TD>
</TR>
<TR>
<TD VALIGN="TOP" ALIGN="CENTER" WIDTH="50%">
<B>Input Window</B><BR>
<TEXTAREA NAME="input" rows=8 cols=50></TEXTAREA>
</TD>
<TD VALIGN="TOP" ALIGN="CENTER" WIDTH="50%">
<B>Output Window</B><BR>
<TEXTAREA NAME="output" rows=8 cols=50 readonly></TEXTAREA>
</TD></TR>
<TR>
<TD VALIGN="TOP" ALIGN="CENTER">

<SCRIPT LANGUAGE="JavaScript" TYPE="text/javascript">
<!--
if ((navigator.appNamehahahaha"Microsoft Internet Explorer")&&(parseInt(navigator.appVersion)>=4)) 

document.write('<A HREF="#" onClick="paste();">Paste Input</A>');
else document.write('Paste Input');
// -->
</SCRIPT>

</TD>
<TD VALIGN="TOP" ALIGN="CENTER">

<SCRIPT LANGUAGE="JavaScript" TYPE="text/javascript">
<!--
if ((navigator.appNamehahahaha"Microsoft Internet Explorer")&&(parseInt(navigator.appVersion)>=4)) 

document.write('<A HREF="#" onClick="copy();">Copy Output</A>');
else document.write('Copy Output');
// -->
</SCRIPT>

</TD></TR>
<TR>
<TD VALIGN="TOP" ALIGN="LEFT" COLSPAN=2>
<fieldset title="Output Option">
<legend align="left"><B>Output Option</B></legend>
<BR>
Separator:
<SELECT NAME="sep">
<OPTION VALUE=", " SELECTED>Comma</OPTION>
<OPTION VALUE="|">Pipe</OPTION>
<OPTION VALUE=" : ">Colon</OPTION>
<OPTION VALUE="new">New Line</OPTION>
<OPTION VALUE="other">Other</OPTION>
</SELECT>
<INPUT TYPE="TEXT" NAME="othersep" SIZE=3 onBlur="checksep(this.value);">
&nbsp;&nbsp;
Group: <INPUT TYPE="TEXT" SIZE=3 NAME="groupby" onBlur="numonly(this.value);"> Addresses
&nbsp;&nbsp;
<LABEL FOR="sortbox"><INPUT TYPE="CHECKBOX" NAME="sort" id="sortbox">Sort Alphabetically</LABEL>
&nbsp;&nbsp;
<LABEL FOR="casebox"><INPUT TYPE="CHECKBOX" NAME="lowcase" id="casebox" CHECKED>To 

Lowercase?</LABEL>
</fieldset>
<BR>
<fieldset title="Filter Option">
<legend align="left"><B>Filter Option</B></legend>
<BR>
<SELECT NAME="filter_type">
<OPTION VALUE=1 SELECTED>Only</OPTION>
<OPTION VALUE=0>Do not</OPTION>
</SELECT>
extract address containing this string: <INPUT TYPE="TEXT" SIZE=20 NAME="string">
<BR>
<BR>
Type of address to extract: 
<SELECT NAME="address_type">
<OPTION VALUE="email" SELECTED>Email</OPTION>
<OPTION VALUE="web">Web</OPTION>
</SELECT>
</fieldset>
</TD></TR>
<TR>
<TD VALIGN="TOP" ALIGN="LEFT">
<INPUT TYPE="BUTTON" CLASS="button" VALUE="Extract" onClick="findEmail();"> 
<INPUT TYPE="RESET" CLASS="button" VALUE="Reset">&nbsp;&nbsp;&nbsp;
<A HREF="#" onClick="help();"><I>Need help?</I></A>
</TD>
<TD VALIGN="TOP" ALIGN="RIGHT" NOWRAP>
Counter: <INPUT NAME="count" SIZE=5 READONLY>
</TD></TR>
</TABLE>
</TD></TR></TABLE>
</FORM>
<A HREF="http://www.surf7.net/" TARGET="_blank"><IMG SRC="http://www.surf7.net/images/banner.gif" 

WIDTH=468 HEIGHT=60 ALT="Surf7.net: Web Hosting, Domain Name, Web Design, Freewares, Reseller" 

BORDER=0></A>
<BR><BR>
<FONT CLASS="copyrightfont">&copy; 2002 - 2005 <A HREF="http://www.surf7.net/">Surf7.net</A> - 

All Right Reserved.</FONT>
</DIV>

</BODY>
</HTML>
save as html file, choose ->Type of address to extract -> web (not email) and it'll extract com domains from a txt files
you'll have to modify it a bit if uou want to extract .net, .biz etc
Antonio is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 03:27 PM   #11
$5 submissions
I help you SUCCEED
 
$5 submissions's Avatar
 
Industry Role:
Join Date: Nov 2003
Location: The Pearl of the Orient Seas
Posts: 32,195
A php script should be enough. HOWEVER, if you need some ANALYSIS or judgment calls made with your extract process, give http://www.webmasterlabor.com we specialize in CHEAP manual data labor.
$5 submissions is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 03:39 PM   #12
a1ka1ine
Confirmed User
 
Industry Role:
Join Date: Apr 2002
Posts: 3,387
i was gonna recommend this from just reading your subject title:

a1ka1ine is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 03:45 PM   #13
Rhesus
Confirmed User
 
Join Date: Aug 2004
Posts: 2,009
Quote:
Originally Posted by Antonio
save as html file, choose ->Type of address to extract -> web (not email) and it'll extract com domains from a txt files
you'll have to modify it a bit if uou want to extract .net, .biz etc
Thanks, but for some reason this doesn't work...
Rhesus is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 03:50 PM   #14
Rhesus
Confirmed User
 
Join Date: Aug 2004
Posts: 2,009
Quote:
Originally Posted by Rhesus
Thanks, but for some reason this doesn't work...
Again, this tool only searches for http blabla, not domains
Rhesus is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 04:05 PM   #15
Tat2Jr
Confirmed User
 
Tat2Jr's Avatar
 
Join Date: Feb 2001
Location: Sunny California
Posts: 4,882
You ^%$*%% damn programmers!

I was hoping this was a photoshop thread and I was going to learn about a new extraction tool to cut out backgrounds.
__________________
NICHE MONEY >> Ass WorshipPantiesSolo TeenPantyhose
Serving up exclusive fetish sites since 1997!
Tat2Jr is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 06-19-2005, 04:11 PM   #16
woj
<&(©¿©)&>
 
woj's Avatar
 
Industry Role:
Join Date: Jul 2002
Location: Chicago
Posts: 47,882
Quote:
Originally Posted by Tat2Jr
You ^%$*%% damn programmers!

I was hoping this was a photoshop thread and I was going to learn about a new extraction tool to cut out backgrounds.
__________________
Custom Software Development, email: woj#at#wojfun#.#com to discuss details or skype: wojl2000 or gchat: wojfun or telegram: wojl2000
Affiliate program tools: Hosted Galleries Manager Banner Manager Video Manager
Wordpress Affiliate Plugin Pic/Movie of the Day Fansign Generator Zip Manager
woj is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Post New Thread Reply
Go Back   GoFuckYourself.com - Adult Webmaster Forum > >

Bookmarks



Advertising inquiries - marketing at gfy dot com

Contact Admin - Advertise - GFY Rules - Top

©2000-, AI Media Network Inc



Powered by vBulletin
Copyright © 2000- Jelsoft Enterprises Limited.