Imported pdf is not readable

Author
Bernard
User
  • Total Posts : 0
  • Reward points: 0
  • Joined: 2020/11/04 04:24:26
  • Status: offline
2020/11/04 05:27:17 (permalink)

Hello
An imported PDF in French is unreadable because special characters are inserted between the words.
Here is an example:

Bien{#1e1} donc cela explique une partie de
l{#1ef} histoire{#1e4} J{#1ef} ai expliqué un peu comment les
distributions de probabilités continues devraient être interprétées
{#20b} c{#1ef} est-à-dire que l{#1ef} aire sous la courbe est
l{#1ef} élément clé{#20c}

This happened with only one specific PDF file.
The others can be read normally.

Do you have a suggestion?
Is there something I have to change in the settings of VoiceAloud? Or should I convert this PDF?

Thank you

    Admin
    Administrator
    • Total Posts : 275
    • Reward points: 0
    • Joined: 2010/11/22 00:00:00
    • Location: USA
    • Status: offline
    Re: Imported pdf is not readable 2020/11/07 05:15:01 (permalink)
    This happens when the font-to-Unicode table in a PDF file is incorrect or has missing entries. You can verify this by opening the file in Adobe Acrobat Reader (Adobe is the company that invented and defines the PDF file format), copying some text, and pasting it into any other app. You will see similar problems in the pasted text.
    Please open this file using the "Open" button (folder icon) at the top of the @Voice screen. When the "PDF Text Import Settings" screen appears, enable the OCR option. This way the incorrect font-to-Unicode table is not used at all; instead, the letters are recognized from the page images. It will take much longer, but should give you the desired result.
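
For anyone who wants to check extracted text for this problem before deciding on OCR, here is a minimal Python sketch. It is not part of @Voice. It assumes the unmapped glyphs show up as hex codes in braces, like `{#1ef}` in the sample above, and the function names `find_unmapped` and `apply_guesses` are made up for illustration. Any replacement you supply (for example, that `1ef` stands for an apostrophe) is only a guess from context.

```python
import re
from collections import Counter

# Placeholder shape inferred from the sample in this thread ({#1ef}, {#20b}, ...).
# This is an assumption, not a documented format.
PLACEHOLDER = re.compile(r"\{#([0-9a-fA-F]+)\}")


def find_unmapped(text):
    """Count each placeholder code that appears in the extracted text."""
    return Counter(m.group(1).lower() for m in PLACEHOLDER.finditer(text))


def apply_guesses(text, guesses):
    """Replace placeholders whose code is in `guesses`; leave the rest untouched.

    `guesses` maps a hex code (lowercase, without braces) to the character
    you believe it stands for. Spacing around the token is not adjusted.
    """
    def substitute(match):
        code = match.group(1).lower()
        return guesses.get(code, match.group(0))

    return PLACEHOLDER.sub(substitute, text)


if __name__ == "__main__":
    sample = "l{#1ef} histoire{#1e4} J{#1ef} ai"
    print(find_unmapped(sample))
    # The apostrophe mapping below is a guess based on the French context.
    print(apply_guesses(sample, {"1ef": "'"}))
```

If `find_unmapped` reports many codes, the font table is probably broken for that file and OCR is likely the more reliable route; patching by hand only makes sense when a few codes are involved.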
     
    Greg