Friday, May 07, 2010

Transforming old interbase 6.0 docs from pdf to html using htmltopdf from ubuntu

wget http://www.ibphoenix.com/downloads/60LangRef.zip
unzip 60LangRef.zip
Archive:  60LangRef.zip
  inflating: LANGREF.PDF  

simple transformation with weird chars for quotes is done this way

pdftohtml LANGREF.PDF
then there is a LANGREF.html file

So i had to add the encoding type to be UTF-8
pdftohtml -noframes -enc UTF-8 LANGREF.PDF

you can see the results on this page
http://firebirdsql.ro/images/LANGREF.html


the interesting thing is -xml output
pdftohtml -noframes -xml LANGREF.PDF

with LANGREF.xml file as output

So in theory we can have simple interbase 6.0/ pre firebird 1.0 docbook document with language reference

No comments: