Is there a field in which PDF files specify their encoding?

A quick look at the PDF specification suggests that a single PDF file can use several different encodings internally; have a look at page 86. So a PDF library with some kind of low-level access should be able to tell you the encoding used for a given string. But if you just want the text and don't care about the internal encodings, I would suggest letting the library take care of the conversions for you.
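To give a rough idea of what "per-string encoding" can look like, here is a minimal sketch of how a PDF "text string" (the kind used for metadata such as the document title) might be decoded by hand. It assumes the usual convention that a leading byte order mark signals UTF-16BE (or UTF-8 in PDF 2.0) and that anything else is PDFDocEncoding. The function name `decode_pdf_text_string` is made up for this example, and Latin-1 is only an approximation of PDFDocEncoding, so treat it as an illustration and not as what any particular library does. Text drawn on a page is a different matter, since it depends on the encoding of the font in use.

```python
import codecs


def decode_pdf_text_string(raw: bytes) -> str:
    """Decode the raw bytes of a PDF text string (hypothetical helper)."""
    # A UTF-16BE byte order mark (FE FF) marks a Unicode text string.
    if raw.startswith(codecs.BOM_UTF16_BE):
        return raw[len(codecs.BOM_UTF16_BE):].decode("utf-16-be")
    # PDF 2.0 also allows UTF-8, marked by the UTF-8 BOM (EF BB BF).
    if raw.startswith(codecs.BOM_UTF8):
        return raw[len(codecs.BOM_UTF8):].decode("utf-8")
    # Otherwise the string is PDFDocEncoding. Python has no codec for it,
    # so this sketch ASSUMES Latin-1 is close enough; the two agree for
    # printable ASCII but differ for some other byte values.
    return raw.decode("latin-1")


print(decode_pdf_text_string(b"\xfe\xff\x00H\x00i"))
print(decode_pdf_text_string(b"Plain title"))
```

In practice a library will do this detection for you, which is why letting it handle the conversion is usually the easier route.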
