We’ve gone from visions of the paperless office to printing like never before to the current situation: a decline in printing. At least that’s my guess. And presumably it’s thanks to email, PDAs, SMS, bigger screens, and better collaborative tools such as document annotation and wikis. Yet the decline in printing will be gradual and long. Paper still has many benefits: a superior reading experience for many, legal relevance, and a safeguard against failures of electronic storage.
So as long as we expect printing to be around for a while, let’s make printing the web a worthwhile experience. At present, there’s HTML, which is difficult to print. And there are PDFs, which are designed for printing but are difficult to read on screen. I’m not going to be Mr. SaveTheWorld and propose an uber-format … I’m just going to suggest a few incremental improvements that would make life easier…
Browsers should render PDF more gracefully. Since PDF actually achieves, or bypasses, the above, it could be a useful format for distribution. However, browsers just don’t handle it very well, even when Acrobat is embedded into the browser.
I have just the one big suggestion here: stop dealing with PDFs using plugins, and instead render them as HTML. Google has been converting PDFs quite effectively for about five years now, and many tools do too. I’m sure I’m not alone when I click on the Google HTML version rather than the PDF version after performing a search. If Google can convert every PDF in the universe, the browser should be able to do it for a single document.
PDFs, with their discrete pages, are very difficult to browse up and down. The font size rarely has anything to do with the browser’s normal HTML size. All the browser tools you’ve come to know and love are either gone or mutated. Want to find some text in a PDF document? You’ll have to do it the Acrobat way, not the browser way. And you can forget about all your browser-specific add-ons, like language translation and bookmarklets. They’d be just as useful on PDF content, but it’s not happening.
So the solution is simple: browsers should be able to treat PDFs as HTML. The Acrobat plugin can still be used for printing, so the PDF document could actually provide the best of both worlds. But for reading in a browser, HTML wins every time. And as the iText (Java PDF framework) FAQ notes, it’s perfectly within Adobe’s conditions to create PDF tools. In any event, if Google can put converted PDFs on the web, what’s to stop a browser from doing likewise?