![]() |
![]() |
![]() |
||||
Welcome to the GoFuckYourself.com - Adult Webmaster Forum forums. You are currently viewing our boards as a guest which gives you limited access to view most discussions and access our other features. By joining our free community you will have access to post topics, communicate privately with other members (PM), respond to polls, upload content and access many other special features. Registration is fast, simple and absolutely free so please, join our community today! If you have any problems with the registration process or your account login, please contact us. |
![]() ![]() |
|
Discuss what's fucking going on, and which programs are best and worst. One-time "program" announcements from "established" webmasters are allowed. |
|
Thread Tools |
![]() |
#1 |
Confirmed User
Join Date: Aug 2002
Location: on the internet
Posts: 3,783
|
Combining 100's of HTML pages into 1. Ideas?
Just looking to see if anyone has any creative ideas....
I have a directory with about 200 standard HTML pages that I am trying to append into one BIG page. Any thoughts on how I can do this quickly and easily?
__________________
<table cellspacing="0" cellpadding="3" border="1" bgcolor="#008000"><tr><td><font size=3>Gone</font></td></tr></table> |
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#2 |
Confirmed User
Join Date: Jun 2002
Location: Lightspeed Sorority
Posts: 103
|
The quick and dirty method is to use the unix "cat" command...
ex/ cat *.htm >output.htm This will take all contents from all .htm files and put them in output.htm A simple perl script could do something similar and you could format the output so you don't have a billion extra HTML tags.
__________________
<a href="http://www.lightspeedcash.com"> Make money at the speed of light!</a><br> - Wouldn't it be cool to own a retarded monkey? |
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#3 |
Confirmed User
Join Date: Jul 2002
Location: Los Angeles, CA
Posts: 446
|
Print all the pages out and tape them together from top to bottom, forming one long page. Then buy text recognition software and a scanner that supports it. Buy a feeder that will slide the now 100-page long paper through your scanner, slow enough to recognize the text.
Then take the text that has been scanned and convert it into html (you'll need to remember what was bold and everything, and use those tags). And as an added bonus, you'll have a 100 foot long "rope" if you will, of taped paper. Fold it up carefully and keep it on your nightstand. In case of a fire, drop it out your window and climb to safety.
__________________
Make money with your exit/404 traffic - hit up ICQ 26910698 <br> "My eyes! The goggles do nothing!" |
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#4 | |
Confirmed User
Join Date: Aug 2002
Location: on the internet
Posts: 3,783
|
Quote:
I'll just jump into your mama's gash. Im sure that'll be big enough to catch me. Booo.... Mama Jokes.
__________________
<table cellspacing="0" cellpadding="3" border="1" bgcolor="#008000"><tr><td><font size=3>Gone</font></td></tr></table> |
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#5 | |
Confirmed User
Industry Role:
Join Date: Jun 2005
Location: Hell
Posts: 1,626
|
Quote:
![]()
__________________
WHO THE FUCK ARE YOU? |
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#6 |
Too lazy to set a custom title
Join Date: Jan 2002
Location: Holland
Posts: 9,870
|
under windows you can use copy *.html+*.html output.html
( command mode ) Then with a text editor remove the html / html tags
__________________
Don't let greediness blur your vision | You gotta let some shit slide icq - 441-456-888 |
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#7 | |
Confirmed User
Join Date: Jul 2002
Location: Los Angeles, CA
Posts: 446
|
Quote:
![]()
__________________
Make money with your exit/404 traffic - hit up ICQ 26910698 <br> "My eyes! The goggles do nothing!" |
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#8 | |
Confirmed User
Join Date: Aug 2002
Location: on the internet
Posts: 3,783
|
Quote:
__________________
<table cellspacing="0" cellpadding="3" border="1" bgcolor="#008000"><tr><td><font size=3>Gone</font></td></tr></table> |
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#9 |
Confirmed User
Join Date: Mar 2002
Posts: 323
|
The "rope" could also be used to hang yourself.
__________________
Pffffffftttttttth. I'm done. |
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#10 | |
Confirmed User
Join Date: Jul 2002
Location: Los Angeles, CA
Posts: 446
|
Quote:
![]()
__________________
Make money with your exit/404 traffic - hit up ICQ 26910698 <br> "My eyes! The goggles do nothing!" |
|
![]() |
![]() ![]() ![]() ![]() ![]() |
![]() |
#11 | |
Confirmed User
Join Date: Jul 2002
Posts: 1,721
|
Quote:
if you have cygwin installed, you could use the cygwin version of sed to pull the body text out of each file pipe, and map over each directory list element. or you could use sed to grab the body elements of "cat *.html" as suggested and pipe the result to out.html.
__________________
the sound of one hand googlewhacking |
|
![]() |
![]() ![]() ![]() ![]() ![]() |