How to segment 15th to 17th century Arabic manuscripts for CAT use
Thread poster: Haytham Abulela

Haytham Abulela  Identity Verified
Canada
Local time: 14:04
Member (2008)
Arabic to English
+ ...
Mar 11

I have been translating Arabic alchemical manuscripts into English for few years, and wanted to use CAT tools to compile a translation memory and glossary, to use my previous translations in case similar text occurs. Various manuscripts contain quotations or relatively long allegories, which might be repeated in other manuscripts, though not verbatim. The challenge I have is that such manuscripts have little punctuation which makes segmentation hard. If I try to force my own interpretation of se... See more
I have been translating Arabic alchemical manuscripts into English for few years, and wanted to use CAT tools to compile a translation memory and glossary, to use my previous translations in case similar text occurs. Various manuscripts contain quotations or relatively long allegories, which might be repeated in other manuscripts, though not verbatim. The challenge I have is that such manuscripts have little punctuation which makes segmentation hard. If I try to force my own interpretation of segment length, I will have either big chunks of text, or small segments that risk nullifying the benefits of translation memory if segmentation was done differently for similar texts. In addition to this, subject specific glossaries are rare and I have to consult several sources and thesauri to find equivalents, and as such I wish to compile my own glossary to ensure quality and consistency.

You may check this printed book from the Asiatic Society of Bengal which compiles three of Ibn Umail's treatises, which demonstrates how an Arabic manuscript of the same period looks like: pahar.in/mountains/Journals/Asiatic%20Society%20of%20Bengal%201788-1921/Memoirs%20of%20Asiatic%20Society%20of%20Bengal/1933%20Memoirs%20of%20Asiatic%20Society%20of%20Bengal%20Vol%2012%20s.pdf It is a 228 page PDF file, but a quick look at pages 7-17 should be enough.
Collapse


 

Suzanne Chabot
Local time: 16:04
Arabic to French
Slip segments Sep 16

Hello

I am translating from Arabic into French and I have the same problems.

The solution I found is to make the i'rab of the sentence in my brain, and when I understand that the 'wa' is only a way to connect two sentences, I split the paragraph at this point, using the 'Split segment' functionality in SDL Trados Studio. I know many CAT tools enable you to do so.

This cannot be done automatically, since the 'wa' in Arabic has many meanings, but also, it is
... See more
Hello

I am translating from Arabic into French and I have the same problems.

The solution I found is to make the i'rab of the sentence in my brain, and when I understand that the 'wa' is only a way to connect two sentences, I split the paragraph at this point, using the 'Split segment' functionality in SDL Trados Studio. I know many CAT tools enable you to do so.

This cannot be done automatically, since the 'wa' in Arabic has many meanings, but also, it is often attached to the first word of the sentence.

Concerning the term recognition, I did not find any other solution than to add each term with all the different 'damîr' attached to it and the broken plural forms and so on... These CAT tools are so bad to recognize the Arabic terms. Hope they will fix this problem one day. The problem become worse when there are 'harakat' attached to a word, which is very frequent when you translate classical Arabic documents.

Wish you all the best. Baraka Allahu fik.
Collapse


 


There is no moderator assigned specifically to this forum.
To report site rules violations or get help, please contact site staff »


How to segment 15th to 17th century Arabic manuscripts for CAT use

Advanced search






Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »
SDL Trados Studio 2019 Freelance
The leading translation software used by over 250,000 translators.

SDL Trados Studio 2019 has evolved to bring translators a brand new experience. Designed with user experience at its core, Studio 2019 transforms how new users get up and running and helps experienced users make the most of the powerful features.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search