Translating with too many tags
Thread poster: Srini Venkataraman

Srini Venkataraman
United States
Local time: 08:15
Member (2012)
Tamil to English
+ ...
SITE LOCALIZER
Jan 24, 2019

Got a word doc for translation. 890 words.
When loaded in WF pro 5, surprised to see around 240 tags.each of the 65 segments were so “full” of tags.
For starters have removed the hyperlinks, before loading.
Do I translate without bothering about tags and then final doc to be compared with original, and modify the bold,italics,underlines,color font, bullets and paragraph numbers and so on. Is there a different way to handle? Thanks
P.s. this is a device manual, hence
... See more
Got a word doc for translation. 890 words.
When loaded in WF pro 5, surprised to see around 240 tags.each of the 65 segments were so “full” of tags.
For starters have removed the hyperlinks, before loading.
Do I translate without bothering about tags and then final doc to be compared with original, and modify the bold,italics,underlines,color font, bullets and paragraph numbers and so on. Is there a different way to handle? Thanks
P.s. this is a device manual, hence the too many tags.

[Edited at 2019-01-24 22:41 GMT]
Collapse


 

Vadim Kadyrov  Identity Verified
Ukraine
Local time: 16:15
Member (2011)
English to Russian
+ ...
Usually you have too many tags when you have an OCRed pdf Jan 25, 2019

I assume that your source document was once a .pdf, and then it was OCRed by someone who sent you the .docx for translation. Sometimes, when you OCR something, you end up having too many tags (sometimes a single word would be split into chunks of 1 or 2 or 3 letters by tags). It is very annoying, yes.

And it is very dangerous to stop paying attention to/remove tags arbitrarily in your document. CAT tools just can`t distinguish between types/'importance' of tags. For example, TRADOS
... See more
I assume that your source document was once a .pdf, and then it was OCRed by someone who sent you the .docx for translation. Sometimes, when you OCR something, you end up having too many tags (sometimes a single word would be split into chunks of 1 or 2 or 3 letters by tags). It is very annoying, yes.

And it is very dangerous to stop paying attention to/remove tags arbitrarily in your document. CAT tools just can`t distinguish between types/'importance' of tags. For example, TRADOS will always yield an error when any tag is removed (the app assumes that tags there are not accidental, which is not always the case).

In other words, you can ask your client to send you a pdf, and the OCR it manually using, say, ABBYY FineReader. Or, you can copy and paste the text from the .docx you now have into a new .docx, replicating the formatting of the source .docx.
Collapse


B D Finch
 

B D Finch  Identity Verified
France
Local time: 15:15
Member (2006)
French to English
+ ...
Sort it out in the source document Jan 25, 2019

I agree with Vadim's comment. The other way to deal with this would be to save your source file under another name (e.g. [source file-name_WKG.docx]. Then look at this and the original one side by side. If you see that the format changes are unjustified and just a result of either careless formatting or OCR, then change the whole of your WKG doc to a single font and pick a header font that you use throughout. Also, make sure that double carriage returns are swapped for paragraph signs etc. Scrol... See more
I agree with Vadim's comment. The other way to deal with this would be to save your source file under another name (e.g. [source file-name_WKG.docx]. Then look at this and the original one side by side. If you see that the format changes are unjustified and just a result of either careless formatting or OCR, then change the whole of your WKG doc to a single font and pick a header font that you use throughout. Also, make sure that double carriage returns are swapped for paragraph signs etc. Scroll through to compare the two files and check that your new formatting hasn't missed anything or corrupted anything and make any necessary adjustments.

Whichever method you use, you need to make the client aware of the problem and ensure you are paid for the extra time.
Collapse


 

Srini Venkataraman
United States
Local time: 08:15
Member (2012)
Tamil to English
+ ...
TOPIC STARTER
SITE LOCALIZER
Renaming the doc Jan 25, 2019

For loading in WF I always make a copy of the source doc .so when I remove hyperlinks etc I will still have untampered source doc for verification at any time.

 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Translating with too many tags

Advanced search


Translation news related to Wordfast





WordFinder Unlimited
For clarity and excellence

WordFinder is the leading dictionary service that gives you the words you want anywhere, anytime. Access 260+ dictionaries from the world's leading dictionary publishers in virtually any device. Find the right word anywhere, anytime - online or offline.

More info »
SDL Trados Business Manager Lite
Create customer quotes and invoices from within SDL Trados Studio

SDL Trados Business Manager Lite helps to simplify and speed up some of the daily tasks, such as invoicing and reporting, associated with running your freelance translation business.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search