![]() |
Combining 100's of HTML pages into 1. Ideas?
Just looking to see if anyone has any creative ideas....
I have a directory with about 200 standard HTML pages that I am trying to append into one BIG page. Any thoughts on how I can do this quickly and easily? |
The quick and dirty method is to use the unix "cat" command...
ex/ cat *.htm >output.htm This will take all contents from all .htm files and put them in output.htm A simple perl script could do something similar and you could format the output so you don't have a billion extra HTML tags. |
Print all the pages out and tape them together from top to bottom, forming one long page. Then buy text recognition software and a scanner that supports it. Buy a feeder that will slide the now 100-page long paper through your scanner, slow enough to recognize the text.
Then take the text that has been scanned and convert it into html (you'll need to remember what was bold and everything, and use those tags). And as an added bonus, you'll have a 100 foot long "rope" if you will, of taped paper. Fold it up carefully and keep it on your nightstand. In case of a fire, drop it out your window and climb to safety. |
Quote:
I'll just jump into your mama's gash. Im sure that'll be big enough to catch me. Booo.... Mama Jokes. |
Quote:
|
under windows you can use copy *.html+*.html output.html
( command mode ) Then with a text editor remove the html / html tags |
Quote:
|
Quote:
|
The "rope" could also be used to hang yourself.
|
Quote:
|
Quote:
if you have cygwin installed, you could use the cygwin version of sed to pull the body text out of each file pipe, and map over each directory list element. or you could use sed to grab the body elements of "cat *.html" as suggested and pipe the result to out.html. |
All times are GMT -7. The time now is 08:46 AM. |
Powered by vBulletin® Version 3.8.8
Copyright ©2000 - 2025, vBulletin Solutions, Inc.
©2000-, AI Media Network Inc123