Merge segment should always be possible
Thread poster: Heinrich Pesch

Heinrich Pesch  Identity Verified
Finland
Local time: 01:01
Member (2003)
Finnish to German
+ ...
Nov 6, 2013

I wonder if you agree with me. When you use any CAT tool and see a sentence that has been chopped into senseless pieces, you need to merge the pieces back into a meaningful sentence. But most of the time the merge function is grayed out, or the software says "segments belong to different paragraphs" or something like that.

I recognize that merging will cause the software to lose matches the next time the document is prepared for translation. But not merging, and translating those senseless pieces one by one, will introduce false matches into the TM. For instance, German verbs often come at the end of the sentence, while in English and many other languages they come much earlier.

So my wish to Santa this year goes to all designers of translation software: Let us merge, let us merge, please!


 

Gyula Erdész
Hungary
Local time: 00:01
Member (2005)
English to Hungarian
+ ...
Do not blame the CAT, blame the lazy PMs/DTP specialists Nov 6, 2013

Dear Heinrich,

I believe this is not a flaw of the CAT tools. You usually do not want to merge segments across paragraphs, as they belong to different logical units of the text. The cause of the problem is almost always poor preparation of the source text.


Regards,

Gyula


 

Erik Freitag  Identity Verified
Germany
Local time: 00:01
Member (2006)
Dutch to German
+ ...
@Gyula: strong disagree! Nov 6, 2013

Gyula Erdész wrote:

Dear Heinrich,

I believe this is not a flaw of the CAT tools. You usually do not want to merge segments across paragraphs, as they belong to different logical units of the text. The cause of the problem is almost always poor preparation of the source text.


Regards,

Gyula


I strongly disagree with you, Gyula. Of course the cause of the problem is a flawed source text, but that is no excuse for CAT tools not offering this kind of basic functionality. I know what I'm doing when I want to merge segments, and I don't need the software developers to tell me that these paragraph marks shouldn't be there in the first place. In real life, they ARE there, and it costs me a lot of time to prepare the source documents before translating. And most of the time I'll miss a couple of instances, which means starting all over again: open and edit the source file, then pretranslate in the CAT tool again, splitting or merging segments the same way I did during the earlier rounds of translation.

In other words: Heinrich is spot on!


 

Gyula Erdész
Hungary
Local time: 00:01
Member (2005)
English to Hungarian
+ ...
Different point of view Nov 6, 2013

Thank you for your feedback. Interesting point of view, indeed.

If I receive a source file with tag soup and/or bad segmentation, I immediately warn the PM. It is not my job to split or merge the segments in the source text. If it is needed, I can do it, but I hate spending extra time in front of the computer because of other people's negligence.

On the other hand, it is much easier to eliminate unnecessary hard returns, paragraph breaks, etc. while preparing the source text than in any CAT tool. And if you merge the segments and your colleague does not do the same next time, it causes a mess in the TM.

Silly example, but if you buy a loaf of bread and find a handful of raisins in it, do you start to sort them out or do you complain to the baker?


Regards,

Gyula


 

Philippe Etienne  Identity Verified
Spain
Local time: 00:01
Member
English to French
Many people don't care one way or the other Nov 6, 2013

I would think that everybody has already seen a customer TM riddled with bits of meaningless sentences because translators either don't know, don't care, cannot because of the CAT tool, or are explicitly not allowed to rework segmenting.

Heinrich Pesch wrote:
I recognize that merging will cause the software to lose matches the next time the document is prepared for translation.

This happens only if merging is not applied systematically. A comma added or margins moved in a brochure update may also lead to many missed 100% matches, because the hard breaks at the end of each line in the layout break sentences in other places during the DTP>CAT process.
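
A minimal sketch of this effect, assuming a hypothetical tool that treats every hard line break as a segment boundary (the function names and sample text are illustrative, not taken from any real CAT tool). The same two sentences are wrapped at different places in two layouts; segmenting per line yields no exact matches, while joining the lines first yields identical segments:

```python
import re

def segment_per_line(text):
    """Naive segmentation: every hard line break ends a segment."""
    return [line.strip() for line in text.splitlines() if line.strip()]

def segment_after_merging(text):
    """Join hard-wrapped lines first, then split at sentence-ending punctuation."""
    merged = " ".join(line.strip() for line in text.splitlines() if line.strip())
    return [s for s in re.split(r"(?<=[.!?])\s+", merged) if s]

def exact_matches(tm_segments, new_segments):
    """Return the new segments that already exist verbatim in the TM."""
    tm = set(tm_segments)
    return [s for s in new_segments if s in tm]

# Same wording, but the layout update moved the hard breaks.
old_layout = "Press the red button\nto stop the machine.\nThen open the cover."
new_layout = "Press the red\nbutton to stop the\nmachine. Then open\nthe cover."

# Per-line segments differ between layouts, so nothing matches.
print(exact_matches(segment_per_line(old_layout), segment_per_line(new_layout)))

# After merging, both layouts give the same two sentences.
print(exact_matches(segment_after_merging(old_layout), segment_after_merging(new_layout)))
```

The sentence splitter here is deliberately crude (it breaks on any full stop followed by whitespace), so it only illustrates the principle that merging must be applied consistently for the matches to survive.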

So yes, all CAT tools should have a joining/merging feature, whatever tags or characters sit between the two segments. As far as I'm concerned, I always try to create meaningful segments in TMs, including by resegmenting the source text after reformatting it properly. Why? More often than not, this extra work turns to my advantage: it reveals repetitions and matches where there were none, besides building a "good" TM.

But Gyula certainly has a point: it shouldn't be the translator's job to prepare a file suitable for CAT use. If I do it on large files, it's only because I am usually rewarded financially with commensurate TM results. The agency gains a cleaner TM, and I save net time while translating.

By the way, I seem to have noticed in MemoQ 6.2 that you cannot join/split segments in a file when a view including that file exists, even if it is not open. But overall, I find MemoQ fairly flexible in terms of splitting/joining segments.

Heinrich Pesch wrote:
But not merging, and translating those senseless pieces one by one, will introduce false matches into the TM.

Yes, especially on multiword table headers that use hard breaks instead of proper formatting. And then some people think that not paying for 100% matches is smart.


 

Erik Freitag  Identity Verified
Germany
Local time: 00:01
Member (2006)
Dutch to German
+ ...
Another silly example Nov 6, 2013

Gyula Erdész wrote:

Silly example, but if you buy a loaf of bread and find a handful of raisins in it, do you start to sort them out or do you complain to the baker?


Let me answer with an equally silly example: if you want a set of different screwdrivers (flat, Phillips, Pozidriv, etc.) because you're working on an old piece of furniture where all of these are used, how would you react if the manufacturer offered only one of them, saying that the type he offers is the best and other screws shouldn't be used anyway? ;-)

[Edited at 2013-11-06 13:57 GMT]


 

