ProZ.com global directory of translation services
Thread poster: Jessicaliu
Help: Setup segmentation rule for Chinese
Jessicaliu
China
Local time: 03:43
Jan 18, 2011

Hi! I find there is no segmentation rule for Chinese in OmegaT, so my source text is segmented by paragraph.

I tried to set up my own segmentation rule as follows.

Intention: break the segment after "。" (Chinese period)
Before: 。
After:

But it's not working. All the sentences still stick together. Could anyone tell me what to do? Many thanks.
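
For illustration, here is a minimal sketch of what a break rule with a "Before" pattern of 。 and an empty "After" pattern is meant to do. It uses Python's `re` module rather than OmegaT's own Java regex engine, and `split_after` is a hypothetical helper, not part of OmegaT. For a plain literal like 。 the two engines are assumed to behave the same way.

```python
import re

def split_after(text, before, after=""):
    """Break `text` wherever `before` matches just left of a position
    and `after` (if given) matches just right of it.
    Hypothetical helper that only imitates a Before/After break rule."""
    pattern = f"(?:{before})"
    if after:
        pattern += f"(?={after})"  # lookahead: must follow, but is not consumed
    pieces, start = [], 0
    for match in re.finditer(pattern, text):
        end = match.end()
        if end > start:
            pieces.append(text[start:end])
            start = end
    if start < len(text):
        pieces.append(text[start:])  # trailing text without a final period
    return pieces

source = "今天天气很好。我们去公园。"
print(split_after(source, before="\u3002"))  # \u3002 is the Chinese period
```

If the pattern itself splits the sample text correctly, as it does here, the rule's regular expression is probably not what prevents segmentation.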


 

Didier Briel  Identity Verified
France
Local time: 21:43
Member (2007)
English to French
Did you check Break/Exception? Jan 18, 2011


Jessicaliu wrote:
Hi! I find there is no segmentation rule for Chinese in OmegaT, so my source text is segmented by paragraph.

I tried to set up my own segmentation rule as follows.

Intention: break the segment after "。" (Chinese period)
Before: 。
After:

But it's not working. All the sentences still stick together. Could anyone tell me what to do? Many thanks.

Is the Break/Exception box in your rule checked?

Is your project set to sentence segmentation (check in Project Properties)?

Didier


 
Jessicaliu
China
Local time: 03:43
TOPIC STARTER
box checked (break) Jan 19, 2011

Thank you for your reply. I checked the box.

The version I use is 2.2.0_2.

I also tried several source texts and moved the rule up to the first position in the list. But it seems that OmegaT does not recognize my segmentation rule.


 

Didier Briel  Identity Verified
France
Local time: 21:43
Member (2007)
English to French
A simple test: set your source language to Japanese Jan 19, 2011


Jessicaliu wrote:

Thank you for your reply. I checked the box.

The version I use is 2.2.0_2.

I also tried several source texts and moved the rule up to the first position in the list. But it seems that OmegaT does not recognize my segmentation rule.

You can do a simple test: set your source language to Japanese (just temporarily for the test), as the end of sentence character is the same.

If it works, then there's something wrong in your rule (for instance, the end of sentence character is not the right one).

If it doesn't work, then there's another issue.

For instance, you did not answer my other question:
Is your project set to sentence segmentation (check in Project Properties)?
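
One way to rule out the "wrong end-of-sentence character" case is to look at the code point of the character typed into the rule. The sketch below uses only Python's standard `unicodedata` module. The list of look-alike characters is an illustrative assumption and is not taken from OmegaT.

```python
import unicodedata

def describe(ch):
    """Return the code point and official Unicode name of one character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"

# The Chinese/Japanese period and a few characters that are easy to confuse with it.
candidates = ["\u3002", "\uff61", "\uff0e", "."]
for ch in candidates:
    print(repr(ch), describe(ch))
```

The period used in Chinese and Japanese text is U+3002 IDEOGRAPHIC FULL STOP. A rule written with one of the other characters would not match it.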

Didier


 
Jessicaliu
China
Local time: 03:43
TOPIC STARTER
It works. Thank you a lot. Jan 19, 2011

Thank you, Didier.

I checked the sentence-level segmenting.

But your reply reminded me that I had forgotten to change the "language pattern" from the default to a specific Chinese language code that OmegaT is able to recognize. I changed it to ZH-HK, and it works!
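
A rough sketch of why the language pattern matters. It assumes the pattern is treated as a regular expression that must match the whole source language code, ignoring case. That is an assumption about OmegaT's behaviour and is not stated in this thread. `pattern_applies` is a hypothetical helper written in Python, not an OmegaT function.

```python
import re

def pattern_applies(language_pattern, source_language):
    """True if the rule set's language pattern matches the entire
    source language code (assumed: whole-string, case-insensitive)."""
    return re.fullmatch(language_pattern, source_language, re.IGNORECASE) is not None

source_language = "ZH-HK"
for pattern in ("ZH-HK", "ZH.*", "JA.*"):
    print(pattern, pattern_applies(pattern, source_language))
```

Under that assumption, a rule set whose pattern does not match the project's source language is never consulted, no matter how correct its individual rules are. A broader pattern such as ZH.* would also cover other Chinese variants.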


 

Didier Briel  Identity Verified
France
Local time: 21:43
Member (2007)
English to French
Glad it works Jan 19, 2011


Jessicaliu wrote:
I checked the sentence-level segmenting.

But your reply reminded me that I had forgotten to change the "language pattern" from the default to a specific Chinese language code that OmegaT is able to recognize. I changed it to ZH-HK, and it works!

Thank you for the feedback.
It might help another user find the same issue in the future.

Didier

