Segmentation of initials
Thread poster: Tom Fennell

Tom Fennell
United States
Local time: 16:53
Member (2010)
Russian to English
+ ...
Oct 17, 2010

Hi folks,

Russian names are often given in this form:

Ivanov I.I.

(Last name first initial.[no space] patronymic initial.).

Trados Studio (and earlier versions) very often (although not always!) see a segment boundary and end the segment after the second initial.

It can take much to to properly merge the segments manually.

Any suggestions on how to suppress this?

Thanks,

Tom


Direct link Reply with quote
 

Tom Fennell
United States
Local time: 16:53
Member (2010)
Russian to English
+ ...
TOPIC STARTER
This is that much of a stickler? Oct 17, 2010

I understand that "advanced segmentation" is a mysterious art indeed,

Such as the "full stop rule" which reads in full like this:

\.+[\p{Pe}\p{Pf}\p{Po}"-[\u002C\u003A\u003B\u055D\u060C\u061B\u0703\u0704\u0705\u0706\u0707\u0708\u0709\u07F8\u1363\u1364\u1365\u1366\u1802\u1804\u1808\u204F\u205D\u3001\uA60D\uFE10\uFE11\uFE13\uFE14\uFE50\uFE51\uFE54\uFE55\uFF0C\uFF1A\uFF1B\uFF64]]*

but no comments....come on folks!


Direct link Reply with quote
 

Tom Fennell
United States
Local time: 16:53
Member (2010)
Russian to English
+ ...
TOPIC STARTER
Help sez Oct 17, 2010

" Add a regular expression that identifies the pattern of text that occurs immediately before a segment break.

Rules created in the Basic view are converted to regular expressions in the Advanced view."

Looks like I need to really study the old regular expressions......


Direct link Reply with quote
 

SDL Community  Identity Verified
United Kingdom
Local time: 23:53
English
Can you give an example of where Studio does break this? Oct 17, 2010

Hi Thomas,

I thought I'd play a little with this, but all my examples refuse to break. For example, if I do this;

My name is Ivanov I.I. and I don’t want a break here.

The segment doesn't break. What examples do you have to work with?

Regards

Paul


Direct link Reply with quote
 

ahmadwadan.com  Identity Verified
Kuwait
Local time: 00:53
English to Arabic
+ ...
It is ok with me Oct 18, 2010

I tried "My name is Ivanov I.I. and I don’t want a break here." and Trados recognized it as a full translation unit without breaking it at your initials.

Direct link Reply with quote
 

Tom Fennell
United States
Local time: 16:53
Member (2010)
Russian to English
+ ...
TOPIC STARTER
I am of course breaking with with Russian Oct 19, 2010

And the damning this is, sometimes it breaks and sometimes not not always.

Seems to depend on the word form which follows.

It will take me 30 minutes to give you some good examples - best of all I can send an sdlxliff file and a TM so you can see the problem "live."

But I have such a ridiculously large number of words to translate today that I'm embarrassed to tell you (until its done)....I'll be back a bit later in the week.

This is an important problem, but not an urgent on.

It slows down projects considerably (and hinders pre-translation), but does not "stop the show"

More later...Tom


Direct link Reply with quote
 

Tom Fennell
United States
Local time: 16:53
Member (2010)
Russian to English
+ ...
TOPIC STARTER
Need to send file... Oct 22, 2010

SDL Support wrote:

Hi Thomas,

I thought I'd play a little with this, but all my examples refuse to break. For example, if I do this;

My name is Ivanov I.I. and I don’t want a break here.

The segment doesn't break. What examples do you have to work with?

Regards

Paul


Whew - 57000 words in 1 week!!!

In the last job I did (25000 words), I was heavily dependent on my TM, and the segmentation problems caused many hours, perhaps days of extra work.

I'll forward this to the SDL affiliate in Russia.....they should have a direct interest in it, and/or may have encountered it before...

[Edited at 2010-10-22 21:23 GMT]


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Segmentation of initials

Advanced search







WordFinder Unlimited
For clarity and excellence

WordFinder is the leading dictionary service that gives you the words you want anywhere, anytime. Access 260+ dictionaries from the world's leading dictionary publishers in virtually any device. Find the right word anywhere, anytime - online or offline.

More info »
CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use SDL Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search