On Wed, Jun 23, 2010 at 9:22 AM, Easwar Hariharan meindian523@gmail.comwrote:
On Wed, Jun 23, 2010 at 7:45 AM, Kussh Singh kussh.singh@gmail.comwrote:
Besides htmldoc is there any other way to convert all html documents in a folder recursively to pdf files?
I need to convert html to pdf in some automatic way (script based, gui etc) --typically websites downloaded for offline reading have html documents. These documents are sometimes many pages long. Since there is no way to remember how much of the html file was read before stopping reading for the day, i need to convert the html etc files to pdf--so that the reading done upto the particular day for the particular pdf file can be noted.
I have used html2text (the package is available in the Ubuntu repos, at least), but can't find a text to pdf converter. May be you could mark the extent of the file you have read by making a line with whatever character you choose (@, _, -, etc) in a text file?
Forgot to mention that html2text is CLI, man page is good, and I've done batch processing with it (just wrapped it in a bash loop)