Text extraction from Arabic PDF General Discussions Forum


6:38 am
New Member
Forum Posts: 0

Hi, I am a student of Middle Eastern languages. My task is this: I have a few PDF files with Arabic text and want to rearrange the layout in InDesign (I only have the PDF, no other format). Copying and pasting text from the PDF switches the order of certain character combinations and displaces diacritics, and exporting text from Acrobat to Word, RTF, etc. does not yield any usable results.


Would it help me if I had CS 5.5 ME? I have seen on other forums that others trying to do the same (pasting Arabic text from PDF to InDesign) have similar problems, but I haven't found a solution so far.


Or is there maybe a plugin or tool that could help me with this? I already have the IndicPlus plugin from Word-Tools, which works fine to arrange the text flow of Arabic text in InDesign, but I still have this encoding problem.


Fixing those errors manually seems like a nightmare!




3:37 pm
Forum Posts: 201

How well copying Arabic text works depends on the source of the PDF, among other things.

Very often the encoding is not correct.
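
One common form of this encoding problem is that the copied text contains Arabic presentation-form code points (positional glyph variants and ligatures) instead of the base letters, with diacritics in an inconsistent order. As a rough sketch, and only under the assumption that this is what has happened in a given PDF, Unicode NFKC normalization from Python's standard library folds presentation forms back to base letters and puts combining marks into canonical order. The function names here are made up for illustration. This will not repair text that was stored in reversed (visual) order, nor a PDF whose fonts map glyphs to the wrong characters.

```python
import unicodedata


def has_presentation_forms(text: str) -> bool:
    """Heuristic check: does the text contain code points from the
    Arabic Presentation Forms-A (U+FB50..U+FDFF) or
    Forms-B (U+FE70..U+FEFC) ranges?"""
    return any(
        "\uFB50" <= ch <= "\uFDFF" or "\uFE70" <= ch <= "\uFEFC"
        for ch in text
    )


def normalize_extracted_arabic(text: str) -> str:
    """Apply NFKC normalization: presentation forms are replaced by
    their base letters, and combining marks (diacritics) attached to
    the same letter are put into Unicode canonical order."""
    return unicodedata.normalize("NFKC", text)


if __name__ == "__main__":
    # U+FEFB is the isolated lam-alef ligature as a single code point.
    pasted = "\uFEFB"
    cleaned = normalize_extracted_arabic(pasted)
    print(has_presentation_forms(pasted))   # presentation form present
    print([hex(ord(ch)) for ch in cleaned])  # base letters after folding
```

If `has_presentation_forms` returns False for the pasted text and the order is still wrong, the cause is probably something else (for example visual-order storage), and normalization alone is unlikely to help.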
