If it's a private server, and not as huge as my imagination tells me, I would write a script that goes through it file by file, opens each one, and checks whether it contains an <html> tag. If it does, fetch the title; if not, just use the file name.
Then grab a portion of the content from a random position in the file.
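
Something along these lines might do it. This is only a rough sketch in Python: the names `describe_file` and `index_tree`, the 200-character snippet length, and the regex-based title lookup are all my own assumptions, and a real HTML parser would be more robust on messy pages.

```python
import os
import random
import re

# Naive title matcher; assumed to be good enough for simple pages.
TITLE_RE = re.compile(r"<title[^>]*>(.*?)</title>", re.IGNORECASE | re.DOTALL)


def describe_file(path, snippet_len=200, rng=random):
    """Return (label, snippet) for a single file.

    label is the HTML title if the file looks like HTML and has a
    non-empty <title>, otherwise the bare file name.
    """
    # errors="replace" keeps binary or oddly encoded files from raising.
    with open(path, "r", encoding="utf-8", errors="replace") as fh:
        text = fh.read()

    label = os.path.basename(path)
    if "<html" in text.lower():
        match = TITLE_RE.search(text)
        if match and match.group(1).strip():
            # Collapse runs of whitespace inside the title.
            label = " ".join(match.group(1).split())

    # Pick a random window of snippet_len characters, or the whole
    # text if the file is shorter than that.
    if len(text) > snippet_len:
        start = rng.randrange(len(text) - snippet_len + 1)
    else:
        start = 0
    return label, text[start:start + snippet_len]


def index_tree(root, snippet_len=200):
    """Walk root and yield (path, label, snippet) for every readable file."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                label, snippet = describe_file(path, snippet_len)
            except OSError:
                continue  # unreadable file: skip it
            yield path, label, snippet
```

You would loop over `index_tree("/path/to/docroot")` (placeholder path) and store or print each tuple. Note that it reads every file fully into memory, which is fine for a small private server but not for a huge one.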