I’m still mad there’s no straightforward way to convert a PDF into semantic HTML. There’s plenty of tools to convert it into HTML that looks the same with pages and such, but I just want the content.
Would it work to convert it to a simpler intermediate format like rtf or txt, and then convert into html? Why html anyway, Isn’t epub more appropriate?
I’m still mad there’s no straightforward way to convert a PDF into semantic HTML. There’s plenty of tools to convert it into HTML that looks the same with pages and such, but I just want the content.
Sounds like a good opportunity for a crowdfunded start up.
Would it work to convert it to a simpler intermediate format like rtf or txt, and then convert into html? Why html anyway, Isn’t epub more appropriate?
I just hate two column paginated lay outs. Give me pageless single column text.