Although I don't post on ATS as frequently as I did a few years ago, I'm still continuing to archive and share UFO material. I've now helped make
over 400 sets of different UFO magazines/newsletters freely available online as searchable PDFs.
Anyway, I'm currently working on preserving and sharing a searchable archive of posts to the "Reality Uncovered" forum (with the blessing of one of
the former owners of that forum, Ryan Dube).
Many pages of that forum (which used to be at realityuncovered.net) are preserved in the Wayback Machine archive, but that archive is not easily
shared. I want to make pages of posts to that UFO forum available in one or two easily searched formats (including as PDF files, as I did with a
Rendlesham UFO forum a few years ago).
After installing Ruby (a first for me...), I've used the free Wayback Machine Downloader available on Github at the link below to download over 7000
archived files from the realityuncovered.net website (most of those files being html files for individual pages of posts to that UFO forum):
github.com...
The Wayback Machine Downloader downloaded those files with filenames such as "viewtopic.php%3ff%3d19%26t%3d2165%26start%3d30".
Windows identified the period after "viewtopic" as indicating the rest of the file name is the file extension, but I have used the free Bulk Rename
Tool at the link below to:
(1) rename all the files that didn't have a proper file extension so that the file extension is appended to the file name (by replacing .php with
%2Ephp) and
(2) then added a new file extension of "html" for those files.
www.bulkrenameutility.co.uk...
Most of the files now open as html files and the text can be read. I could convert these to PDFs now and I'd get a considerable portion of what I
wanted.
But I'd prefer to make some of the internal links work, e.g. when reading on page 1 of a thread, I'd like the link to page 2 to work.
The internal links in the html files downloaded by the Wayback Machine Downloader generally don't work - I think partly because some specify webpages
in absolute terms (rather than relative ones) and also partly - I think - because they include characters such as ? and =, whereas the downloaded
files include the escaped codes for those characters.
I've tried using Notepad++ "Find in Files" function to find and replace some of the parts of the links in all the html files with alternatives and
managed to get a few of the links working.
Trial and error is - however - a bit time consuming and frustrating (not least because I'm sure there's a more elegant/simpler way to get this to
work)
So, I know (or at least think...) that getting the internal links to work could eventually be done by using that find and replace method.
But I'm going to go nuts doing this on my own when I'm now really familiar with any of this technical stuff.
I'd welcome some input from those with a bit more technical knowledge than me.
Any thoughts?