Scanning source documents for possible non-translatables
Thread poster: Hans Lenting

Hans Lenting  Identity Verified
Netherlands
Member (2006)
German to Dutch
Apr 13

In this thread: https://www.proz.com/forum/cat_tools_technical_help/324470-tm_anonymization_gdpr.html

I wrote:

Create a set of regular expressions to filter the source segments directly after import of the source document. Scan the filtered segments for non-translatables.


With the option Extract reg. exp. results you can collect candidates for non-translatables. Once you have collected them, you can tag them as regular expressions and save them in a glossary for non-translatables. These won't be sent to the MT system (when the Edit > Preferences > Mask non-translatable fragments option is selected).

lvxuymxdhd1gnkmaxtke.gif
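
For readers who want to try the idea outside the CAT tool, here is a minimal Python sketch of the scanning step: run a set of regular expressions over the source segments and collect every match as a candidate. The three patterns and the function name extract_candidates are my own assumptions for illustration, not the expressions used in the screenshot; adjust them to your documents.

```python
import re
from collections import Counter

# Hypothetical example patterns for typical non-translatables.
CANDIDATE_PATTERNS = [
    r"\b[A-Z]{2,}\d+[A-Z0-9-]*\b",        # product codes such as "XK200-B"
    r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b",  # e-mail addresses
    r"https?://\S+",                      # URLs (may include trailing punctuation)
]


def extract_candidates(segments, patterns=CANDIDATE_PATTERNS):
    """Return a Counter mapping each matched string to its number of occurrences."""
    counts = Counter()
    for segment in segments:
        for pattern in patterns:
            for match in re.finditer(pattern, segment):
                counts[match.group(0)] += 1
    return counts


if __name__ == "__main__":
    source_segments = [
        "Bestellen Sie den XK200-B unter https://example.com/shop",
        "Der XK200-B ist lieferbar.",
        "Fragen an info@example.com senden.",
    ]
    # Most frequent candidates first, ready for manual review.
    for candidate, count in extract_candidates(source_segments).most_common():
        print(count, candidate)
```

The list still needs a human check: a pattern that is too greedy will also catch ordinary words that do need translating.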

[Edited at 2018-04-14 09:56 GMT]


 

Hans Lenting  Identity Verified
Netherlands
Member (2006)
German to Dutch
TOPIC STARTER
Creating a glossary from the found candidates Apr 15

Here's a script that converts the contents of the Reg. exp. results pane into regular expressions that can be used in a glossary for regular expressions. You'll need the (free) version of BBEdit (https://www.barebones.com/products/bbedit/).

uaob6xqzjkh6zhbvluhs.png

The tagged items (real regular expressions now):

sdrdpyjcgxwvsf9hecd0.png
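
The BBEdit script itself is only visible in the screenshot, so what follows is not that script. It is a rough Python sketch of the same kind of conversion, assuming the candidates are available as plain strings: each candidate is escaped so that it matches literally, and digit runs can optionally be generalised. The function name to_glossary_lines, the generalise_digits switch and the tag prefix are all assumptions; use whatever marker your CAT tool expects for regular expressions.

```python
import re


def to_glossary_lines(candidates, tag="", generalise_digits=False):
    """Turn literal candidates into regex glossary lines.

    tag is a placeholder prefix for the tool-specific regex marker (assumption).
    """
    lines = []
    for candidate in sorted(set(candidates)):   # de-duplicate, stable order
        pattern = re.escape(candidate)          # make metacharacters literal
        if generalise_digits:
            # Replace each run of digits with \d+ so "XK200-B" also covers "XK31-B".
            pattern = re.sub(r"\d+", lambda m: r"\d+", pattern)
        lines.append(f"{tag}{pattern}")
    return lines


if __name__ == "__main__":
    found = ["XK200-B", "XK200-B", "info@example.com"]
    for line in to_glossary_lines(found, generalise_digits=True):
        print(line)
```

Generalising digits widens the match on purpose, so review the resulting expressions before saving them in the glossary.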

Once all entries in the glossary have been tagged, you create a glossary definition:

lajvi3ikkg2ucrisrucf.png

The non-translatables from the glossaries will be recognised:

fmi9vzlw9avyostudpze.png

Due to a glitch, not all non-translatables will be displayed in the glossary pane (they will be recognised, though):

eeey0hzfybwc6ei9d75n.png

Masked non-translatables won't be sent to the MT systems:

fbcrhezkhhi5f1bluhim.png
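
To illustrate what masking means in general, here is a small Python sketch: every fragment that matches one of the glossary expressions is swapped for a numbered placeholder before the text leaves your machine, and restored afterwards. This is not how the CAT tool implements it internally (I don't know that); the __NT0__ placeholder format and the names mask and unmask are assumptions, and an MT engine may alter placeholders, so check the round trip.

```python
import re


def mask(segment, patterns):
    """Replace every match of the given regexes with a numbered placeholder."""
    if not patterns:
        return segment, []
    combined = re.compile("|".join(f"(?:{p})" for p in patterns))
    protected = []

    def _replace(match):
        protected.append(match.group(0))
        return f"__NT{len(protected) - 1}__"

    return combined.sub(_replace, segment), protected


def unmask(text, protected):
    """Put the original fragments back in place of the placeholders."""
    return re.sub(r"__NT(\d+)__", lambda m: protected[int(m.group(1))], text)


if __name__ == "__main__":
    masked, protected = mask("Der XK200-B kostet 12 EUR.", [r"XK\d+-B"])
    print(masked)                       # only this string would go to the MT system
    translated = masked.replace("Der", "De").replace("kostet", "kost")  # stand-in for MT
    print(unmask(translated, protected))
```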

[Edited at 2018-04-15 08:43 GMT]


 

