Passolo SRX segmenter bugs
Thread poster: Evzen
Evzen
English to Czech
Oct 19, 2014

Posted this to Passolo User Forum, but it seems rather quiet, so I'm reposting here in a hope that it gets noticed by relevant people...

I tried to set up a simple Passolo project with HTML and Java properties files (with sentence level segmentation) and German as source language.
And Passolo created a zillion of totally wrong entries segmented after German abbreviations like z.B. or d.h. and others .

After some research I found the following bugs in SRX segmenter add-in:
  1. New/edited segmentation rules are not stored. This is because Passolo.srx file is stored in Passolo System folder, which is not writable for non-admin users...
    As far as I can see, this serious bug is still present even in Passolo 2015
    It was reported back in 2010 already: http://www.passolo.com/forum/viewtopic.php?t=817

  2. SRX language-specific rules don't work out-of-the-box. With the default languagepattern="[Dd][Ee].*" definition Passolo does NOT recognize German language.
    Only after adding languagepattern="[Gg][Ee][Rr].*" mapping Passolo finally started using the German rules. It looks like a bug - Passolo should use the ISO language abbreviation for matching the language pattern, not the language name (like German (Germany)).
    The help file states that Passolo uses only the "Default" language rules... hmmm, so why would the SRX file contain the other languages definition then?!
    To me it looks like no one in Passolo/Trados/SDL really understood how the languagepattern should be actually configured, so they added this weird sentence to the help file to cover their backs... :-\

  3. The set of segmentation rules provided in default SRX file is totally insufficient and results in a lot of wrongly segmented text.
    Can't you guys simply take the ruiles, abbreviations list, etc. from Studio?! One would expect that products of the same company work the same way...

  4. The default segmentation rules exceptions definitions are rather incorrect - they all start with "\s", so abbreviations inside parentheses (like "blabla (z.B. bleble)") are not recognized and result in segment break.

  5. It would make sense that the Passolo.srx file written by Passolo keeps the XML structured/indented formatting like the original default file, so that it's still comfortably editable in e.g. Notepad.
    Currently Passolo writes it is a single huuuuge line which makes it pretty unusable in plaintext editors

In the end I had to drop Passolo and move the project to Studio...
I found pretty alarming that such expensive tool like Passolo has such essential functional bugs... which apparently exist for a very long time


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:

Moderator(s) of this forum
Myron Netchypor[Call to this topic]

You can also contact site staff by submitting a support request »

Passolo SRX segmenter bugs

Advanced search






Across v6.3
Translation Toolkit and Sales Potential under One Roof

Apart from features that enable you to translate more efficiently, the new Across Translator Edition v6.3 comprises your crossMarket membership. The new online network for Across users assists you in exploring new sales potential and generating revenue.

More info »
SDL MultiTerm 2017
Guarantee a unified, consistent and high-quality translation with terminology software by the industry leaders.

SDL MultiTerm 2017 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2017 you can automatically create term lists from your existing documentation to save time.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search