Help with filter RegEx needed
Thread poster: Iris Kleinophorst

Iris Kleinophorst  Identity Verified
Germany
Local time: 09:36
Chinese to German
Mar 8

Hi,
I am translating an English-Chinese mixed text and would like to filter all segments that include Chinese characters as well as English words (= letters of the Roman alphabet).

I have the RegEx for displaying Chinese characters: [\u4e00-\u9fa5]
How would I have to set up a filter RegEx that combines Chinese characters with Roman alphabet letters, regardless of their order or amount? Can anybody help me or give me a hint on how to do it? I have checked the reference handbook, but could not find a viable solution.

TIA
Iris


 

Jason_P
South Korea
Local time: 16:06
English to Korean
post examples Mar 8

If you post some examples, you will have a better chance of getting a proper answer.
The more examples you give, the better the RegExp that can be built (by other, smarter people, not me).

[Edited at 2018-03-08 15:15 GMT]


 

Hans Lenting  Identity Verified
Netherlands
Member (2006)
German to Dutch
Double filter action? Mar 9

How about filtering on Chinese characters first and then—while keeping the result of this filter action—filtering on either letters or whole words with Roman letters?

 

Karen_E
United Kingdom
Local time: 08:36
Thumbs up to double filter suggestion Mar 9

I agree with Hans - I would apply one filter first, then go back to the "Create filter" option, make your adjustments, then click on "More" and tick the box to apply the new filter "To the result of the current filter". That way, it will create a double layer of filtering and show you the segments that meet both criteria.
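For anyone who wants to see the logic outside the tool: a double filter is simply two searches chained, where the second pass only looks at what survived the first. Below is a minimal Python sketch of that idea. The sample segments and the function name `double_filter` are made up for illustration, and Python's regex flavor may differ in details from the one your CAT tool uses.

```python
import re

# First filter: any Chinese character (the range from the original question).
CHINESE = re.compile(r"[\u4e00-\u9fa5]")
# Second filter: any letter of the Roman alphabet, upper or lower case.
LATIN = re.compile(r"[A-Za-z]")


def double_filter(segments):
    """Keep segments that pass the first filter, then filter that result again."""
    first_pass = [s for s in segments if CHINESE.search(s)]
    return [s for s in first_pass if LATIN.search(s)]


# Hypothetical sample segments, not taken from a real project.
segments = [
    "Hello world",
    "你好世界",
    "Click 确定 to continue",
    "版本 2.0",
    "请按 OK",
]

mixed = double_filter(segments)
print(mixed)
```

Only the segments containing both Chinese characters and Roman letters remain, regardless of which comes first or how many of each there are.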

 

Iris Kleinophorst  Identity Verified
Germany
Local time: 09:36
Chinese to German
TOPIC STARTER
DUH-Why didn't I think of the double filter? Mar 10

Hi Hans, hi Karen,

thanks, the double filter option works perfectly!

Funnily enough, I've used double filters in this project before, for example filtering all segments NOT containing Chinese characters and then filtering all untranslated segments within these. However, I did not think of simply entering different segment content for the second filter.

Solved my problem, thanks!
Iris


 
Or operator Jun 1

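You can also create a single expression with an OR operator, for example English letters or Chinese characters: [a-z]|[\u4e00-\u9fa5]
Note that this matches every segment that contains either an English letter or a Chinese character, so on its own it does not narrow the result down to segments containing both. Also, [a-z] covers lowercase letters only unless the filter is case-insensitive.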
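To compare how an OR expression behaves with one that requires both scripts, here is a small Python sketch. The second pattern is my own assumption of how "both, in either order" could be written with an OR between the two possible orders; it is not from the tool's handbook, and whether your tool's regex engine accepts it exactly as written is untested. The sample segments are invented.

```python
import re

# OR of the two character classes: matches if EITHER kind of character occurs.
EITHER = re.compile(r"[a-z]|[\u4e00-\u9fa5]")

# OR of the two possible orders: Roman letter ... Chinese, or Chinese ... Roman letter.
# This only matches when BOTH kinds of character occur in the segment.
BOTH = re.compile(
    r"[A-Za-z].*[\u4e00-\u9fa5]|[\u4e00-\u9fa5].*[A-Za-z]",
    re.DOTALL,  # let "." also span line breaks inside a segment
)

# Hypothetical sample segments.
segments = ["hello world", "你好世界", "click 确定 to continue", "12345"]

either_hits = [s for s in segments if EITHER.search(s)]
both_hits = [s for s in segments if BOTH.search(s)]

print("either:", either_hits)
print("both:  ", both_hits)
```

The first pattern keeps the purely English and purely Chinese segments as well, while the second keeps only the mixed one, which is what the double filter achieves in two steps.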

 

