![]() |
![]() |
![]() |
||||
Welcome to the GoFuckYourself.com - Adult Webmaster Forum forums. You are currently viewing our boards as a guest which gives you limited access to view most discussions and access our other features. By joining our free community you will have access to post topics, communicate privately with other members (PM), respond to polls, upload content and access many other special features. Registration is fast, simple and absolutely free so please, join our community today! If you have any problems with the registration process or your account login, please contact us. |
![]() ![]() |
|
Discuss what's fucking going on, and which programs are best and worst. One-time "program" announcements from "established" webmasters are allowed. |
|
Thread Tools |
![]() |
#1 |
Confirmed User
Join Date: Aug 2004
Posts: 2,009
|
I need an extraction tool...
I need a tool that extracts search strings (in this case domains >> searching for .com/.net/.org and the letters that precede it) from a text file. Does anyone have any suggestions? Thanks :-)
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#2 |
Confirmed User
Industry Role:
Join Date: Jul 2003
Posts: 712
|
PHP + regular expressions
![]() |
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#3 | |
Confirmed User
Join Date: Aug 2004
Posts: 2,009
|
Quote:
|
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#4 |
<&(©¿©)&>
Industry Role:
Join Date: Jul 2002
Location: Chicago
Posts: 47,882
|
if you are looking to invest $$ for this tool, hit me up on icq: 33375924
__________________
Custom Software Development, email: woj#at#wojfun#.#com to discuss details or skype: wojl2000 or gchat: wojfun or telegram: wojl2000 Affiliate program tools: Hosted Galleries Manager Banner Manager Video Manager ![]() Wordpress Affiliate Plugin Pic/Movie of the Day Fansign Generator Zip Manager |
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#5 | |
Confirmed User
Join Date: Dec 2002
Location: PEI, Canada
Posts: 6,924
|
Quote:
![]() |
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#6 |
Confirmed User
Industry Role:
Join Date: Nov 2003
Location: Olongapo City, Philippines
Posts: 4,618
|
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#7 | |
Confirmed User
Join Date: Aug 2004
Posts: 2,009
|
Quote:
|
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#8 | |
Confirmed User
Join Date: Aug 2004
Posts: 2,009
|
Quote:
|
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#9 | |
<&(©¿©)&>
Industry Role:
Join Date: Jul 2002
Location: Chicago
Posts: 47,882
|
Quote:
![]()
__________________
Custom Software Development, email: woj#at#wojfun#.#com to discuss details or skype: wojl2000 or gchat: wojfun or telegram: wojl2000 Affiliate program tools: Hosted Galleries Manager Banner Manager Video Manager ![]() Wordpress Affiliate Plugin Pic/Movie of the Day Fansign Generator Zip Manager |
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#10 |
Too lazy to set a custom title
Join Date: Oct 2001
Location: Spartaaaaaaaaa
Posts: 14,136
|
Code:
<HTML> <HEAD> <SCRIPT LANGUAGE="JAVASCRIPT" TYPE="text/javascript"> <!-- Begin // Created and Copyrighted by Benjamin Leow // Please do not remove the copyright notice and the advertisement. Thank you. // Please go to http://www.surf7.net for latest version and more freeware function copy() { textRange = document.extractor.output.createTextRange(); textRange.execCommand("RemoveFormat"); textRange.execCommand("Copy"); } function paste() { textRange = document.extractor.input.createTextRange(); textRange.execCommand("RemoveFormat"); textRange.execCommand("Paste"); } function help(){ var imgwid = 450; var imghgt = 360; content = ('<html><head><title>Email Extractor Lite : Help</title>'); content += ('<STYLE TYPE="text/css">'); content += ('BODY,td,th,ul,p { font: normal normal normal 8pt/1em Verdana; color: #000; }'); content += ('</STYLE>'); content += ('</head><body onload="window.focus();">'); content += ('<B>Quick and dirty</B>'); content += ('<OL>'); content += ('<LI>Copy all text from any webpages, documents, files, etc...'); content += ('<LI>Paste it into <B>Input Window</B>.'); content += ('<LI>Click "<I>Extract</I>" button.'); content += ('<LI>Copy the result from <B>Output Window</B> to somewhere and save it.'); content += ('<LI>Click "<I>Reset</I>" button to start all over again.'); content += ('</OL>'); content += ('<P><B>More Controls</B>'); content += ('<OL>'); content += ('<LI>Click "<I>Paste Input</I>" link to paste any text you copied elsewhere into <B>Input Window</B>.'); content += ('<LI>Click "<I>Copy Output</I>" link to copy whatever text inside <B>Output Window</B>.'); content += ('<LI>Choose different separator from the dropdown menu or specify your own. Default is comma.'); content += ('<LI>You can group a number of emails together. Each group is separated by a new line. Please enter number only.'); content += ('<LI>Check "<I>Sort Alphabetically</I>" checkbox to arrange extracted emails well... alphabetically.'); content += ('<LI>You can extract or exclude emails containing certain string (text). Useful if you only want to get email from a particular domain.'); content += ('<LI>You can choose to extract web addresses instead of email addresses.'); content += ('</OL>'); content += ('<DIV ALIGN="CENTER"><INPUT TYPE="button" VALUE="Close" onClick="javascript:window.close();"></DIV>'); content += ('</body></html>'); var winl = (screen.width - imgwid) / 2; var wint = (screen.height - imghgt) / 2; helpwindow = window.open('','help','width=' + imgwid + ',height=' + imghgt + ',resizable=0,scrollbars=0,top=' + wint + ',left=' + winl + ',toolbar=0,location=0,directories=0,status=0,menubar=0,copyhistory=0'); helpwindow.document.write(content); helpwindow.document.close(); } function checksep(value){ if (value) document.extractor.sep.value = "other"; } function numonly(value){ if (isNaN(value)) { window.alert("Please enter a number or else \nleave blank for no grouping."); document.extractor.groupby.focus(); } } function findEmail() { var email = "none"; var a = 0; var ingroup = 0; var separator = document.extractor.sep.value; var string = document.extractor.string.value; var groupby = Math.round(document.extractor.groupby.value); var address_type = document.extractor.address_type.value; var input = document.extractor.input.value; if (document.extractor.lowcase.checked) { var input = input.toLowerCase(); } if (separator hahahaha "new") separator = "\n"; if (separator hahahaha "other") separator = document.extractor.othersep.value; if (address_type hahahaha "web") { rawemail = input.match(/([a-zA-Z0-9._-]+\.com+)/gi); } else { rawemail = input.match(/([a-zA-Z0-9._-]+@[a-zA-Z0-9._-]+\.[a-zA-Z0-9._-]+)/gi); } var norepeat = new Array(); var filtermail = new Array(); if (rawemail) { if (string){ x = 0; for (var y=0; y<rawemail.length; y++) { if (document.extractor.filter_type.value hahahaha 1) { if (rawemail[y].search(string) >= 0) { filtermail[x] = rawemail[y]; x++; } } else { if (rawemail[y].search(string) < 0) { filtermail[x] = rawemail[y]; x++; } } } rawemail = filtermail; } for (var i=0; i<rawemail.length; i++) { var repeat = 0; // Check for repeated emails routine for (var j=i+1; j<rawemail.length; j++) { if (rawemail[i] hahahaha rawemail[j]) { repeat++; } } // Create new array for non-repeated emails if (repeat hahahaha 0) { norepeat[a] = rawemail[i]; a++; } } if (document.extractor.sort.checked) norepeat = norepeat.sort(); // Sort the array email = ""; // Join emails together with separator for (var k = 0; k < norepeat.length; k++) { if (ingroup != 0) email += separator; email += norepeat[k]; ingroup++; // Group emails if a number is specified in form. Each group will be separate by new line. if (groupby) { if (ingroup hahahaha groupby) { email += '\n\n'; ingroup = 0; } } } } // Return array length var count = norepeat.length; // Print results document.extractor.count.value = count; document.extractor.output.value = email; } // End --> </SCRIPT> <STYLE TYPE="text/css"> BODY { background:#FFF } BODY,td,th,ul,p { font: normal normal normal 8pt/1em Verdana; color: #000; } textarea,input,select { font: normal normal normal 8pt/1em Verdana; color: #000; background:#FFF} A:link, A:visited { text-decoration: none; color: #059; } A:active, A:hover { text-decoration: underline; color: #D14; } fieldset { padding-left: 10px; padding-bottom: 10px; } .bordercolor { background:#666 } .maincolor { background:#CCC } .button { background:#CCC } .titlebarcolor { background:#007 } .titlefont { font: normal normal bold 9pt/1em Arial; color: #FFF; } .copyrightfont { font: normal normal normal 7.5pt/1.5em Verdana; color: #666; } </STYLE> <TITLE>Email Extractor Lite 1.7</TITLE> </HEAD> <BODY> <DIV ALIGN="CENTER"> <FORM NAME="extractor"> <TABLE CLASS="bordercolor" CELLPADDING=1 CELLSPACING=0 BORDER=0><TR><TD> <TABLE CLASS="maincolor" CELLPADDING=4 CELLSPACING=0 BORDER=0> <TR CLASS="titlebarcolor" VALIGN="MIDDLE"> <TD><FONT CLASS="titlefont">Email Extractor Lite 1.7</FONT></TD> <TD ALIGN="RIGHT" NOWRAP></TD> </TR> <TR> <TD VALIGN="TOP" ALIGN="CENTER" WIDTH="50%"> <B>Input Window</B><BR> <TEXTAREA NAME="input" rows=8 cols=50></TEXTAREA> </TD> <TD VALIGN="TOP" ALIGN="CENTER" WIDTH="50%"> <B>Output Window</B><BR> <TEXTAREA NAME="output" rows=8 cols=50 readonly></TEXTAREA> </TD></TR> <TR> <TD VALIGN="TOP" ALIGN="CENTER"> <SCRIPT LANGUAGE="JavaScript" TYPE="text/javascript"> <!-- if ((navigator.appNamehahahaha"Microsoft Internet Explorer")&&(parseInt(navigator.appVersion)>=4)) document.write('<A HREF="#" onClick="paste();">Paste Input</A>'); else document.write('Paste Input'); // --> </SCRIPT> </TD> <TD VALIGN="TOP" ALIGN="CENTER"> <SCRIPT LANGUAGE="JavaScript" TYPE="text/javascript"> <!-- if ((navigator.appNamehahahaha"Microsoft Internet Explorer")&&(parseInt(navigator.appVersion)>=4)) document.write('<A HREF="#" onClick="copy();">Copy Output</A>'); else document.write('Copy Output'); // --> </SCRIPT> </TD></TR> <TR> <TD VALIGN="TOP" ALIGN="LEFT" COLSPAN=2> <fieldset title="Output Option"> <legend align="left"><B>Output Option</B></legend> <BR> Separator: <SELECT NAME="sep"> <OPTION VALUE=", " SELECTED>Comma</OPTION> <OPTION VALUE="|">Pipe</OPTION> <OPTION VALUE=" : ">Colon</OPTION> <OPTION VALUE="new">New Line</OPTION> <OPTION VALUE="other">Other</OPTION> </SELECT> <INPUT TYPE="TEXT" NAME="othersep" SIZE=3 onBlur="checksep(this.value);"> Group: <INPUT TYPE="TEXT" SIZE=3 NAME="groupby" onBlur="numonly(this.value);"> Addresses <LABEL FOR="sortbox"><INPUT TYPE="CHECKBOX" NAME="sort" id="sortbox">Sort Alphabetically</LABEL> <LABEL FOR="casebox"><INPUT TYPE="CHECKBOX" NAME="lowcase" id="casebox" CHECKED>To Lowercase?</LABEL> </fieldset> <BR> <fieldset title="Filter Option"> <legend align="left"><B>Filter Option</B></legend> <BR> <SELECT NAME="filter_type"> <OPTION VALUE=1 SELECTED>Only</OPTION> <OPTION VALUE=0>Do not</OPTION> </SELECT> extract address containing this string: <INPUT TYPE="TEXT" SIZE=20 NAME="string"> <BR> <BR> Type of address to extract: <SELECT NAME="address_type"> <OPTION VALUE="email" SELECTED>Email</OPTION> <OPTION VALUE="web">Web</OPTION> </SELECT> </fieldset> </TD></TR> <TR> <TD VALIGN="TOP" ALIGN="LEFT"> <INPUT TYPE="BUTTON" CLASS="button" VALUE="Extract" onClick="findEmail();"> <INPUT TYPE="RESET" CLASS="button" VALUE="Reset"> <A HREF="#" onClick="help();"><I>Need help?</I></A> </TD> <TD VALIGN="TOP" ALIGN="RIGHT" NOWRAP> Counter: <INPUT NAME="count" SIZE=5 READONLY> </TD></TR> </TABLE> </TD></TR></TABLE> </FORM> <A HREF="http://www.surf7.net/" TARGET="_blank"><IMG SRC="http://www.surf7.net/images/banner.gif" WIDTH=468 HEIGHT=60 ALT="Surf7.net: Web Hosting, Domain Name, Web Design, Freewares, Reseller" BORDER=0></A> <BR><BR> <FONT CLASS="copyrightfont">© 2002 - 2005 <A HREF="http://www.surf7.net/">Surf7.net</A> - All Right Reserved.</FONT> </DIV> </BODY> </HTML> you'll have to modify it a bit if uou want to extract .net, .biz etc |
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#11 |
I help you SUCCEED
Industry Role:
Join Date: Nov 2003
Location: The Pearl of the Orient Seas
Posts: 32,195
|
A php script should be enough. HOWEVER, if you need some ANALYSIS or judgment calls made with your extract process, give http://www.webmasterlabor.com we specialize in CHEAP manual data labor.
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#12 |
Confirmed User
Industry Role:
Join Date: Apr 2002
Posts: 3,387
|
i was gonna recommend this from just reading your subject title:
![]() |
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#13 | |
Confirmed User
Join Date: Aug 2004
Posts: 2,009
|
Quote:
|
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#14 | |
Confirmed User
Join Date: Aug 2004
Posts: 2,009
|
Quote:
|
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#15 |
Confirmed User
Join Date: Feb 2001
Location: Sunny California
Posts: 4,882
|
You ^%$*%% damn programmers!
I was hoping this was a photoshop thread and I was going to learn about a new extraction tool to cut out backgrounds. ![]()
__________________
NICHE MONEY >> Ass Worship • Panties • Solo Teen • Pantyhose Serving up exclusive fetish sites since 1997! |
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#16 | |
<&(©¿©)&>
Industry Role:
Join Date: Jul 2002
Location: Chicago
Posts: 47,882
|
Quote:
![]()
__________________
Custom Software Development, email: woj#at#wojfun#.#com to discuss details or skype: wojl2000 or gchat: wojfun or telegram: wojl2000 Affiliate program tools: Hosted Galleries Manager Banner Manager Video Manager ![]() Wordpress Affiliate Plugin Pic/Movie of the Day Fansign Generator Zip Manager |
|
![]() |
![]() ![]() ![]() ![]() ![]() |