Skip words starting with ' when extracing terminology (Multiterm Extract)
Thread poster: Mathieu Jacquet

Mathieu Jacquet  Identity Verified
France
Local time: 16:42
English to French
Mar 24, 2009

Hi all,

is there any way to skip terms starting with a single quote when extracting terminology? We do have a lot in French (virtually any word starting with a vowel and following a definite article) and the result is a significant number of "doublons" in the result pane of Multiterm Extract.

For instance:

1. 'usine
2. usine

I was thinking of some setting in an exclusion file, but am stuck on how to achieve that.

Any help appreciated!

Mathieu.


 

ViktoriaG  Identity Verified
Canada
Local time: 10:42
English to French
+ ...
Manual method Mar 24, 2009

Off the top of my head, I can think of one thing you could do.

If you are extracting bilingual terminology (you are not yet at a stage where you fill in fields other than source term and target term), you can take the initial term extraction output and export it to Excel. Then, you can sort the source terms (also select target terms, naturally) alphabetically. Once you are done with that, all of the words that start with an apostrophe will appear next to each other - they will be easy to delete with a few clicks of the mouse. Then, you just need to import them back into MultiTerm and process whatever you still need to process.

In time, some of our colleagues may post easier methods, but if you're in a hurry, I am almost convinced that this will fix it.

All the best!


 

Carmen Moreton
Spain
Local time: 16:42
French to Spanish
+ ...
Create a filter Mar 25, 2009

I don't use MT Extract to much time ago, but I remember that you can create a filter and maybe there you can skip the terms with a " ' ".
Hope this helps...


 

Mathieu Jacquet  Identity Verified
France
Local time: 16:42
English to French
TOPIC STARTER
Sorting out available in Multiter Extract Mar 25, 2009

@Viktoria

Thank you for your help Viktoria. There is actually a way of sorting out terms alphabetically in the result pane of Multiterm Extract (by clicking the language flag). It is then easy to select and delete them all. The problem is that some words appear only preceded by " ' " (no occurence of the term without " ' " in the TM). I thnk i'll have to review the all to be sure i do not forget one.

Yours,
Mathieu.


 

Attila Piróth  Identity Verified
France
Local time: 16:42
Member
English to Hungarian
+ ...
Excel solution revisited Mar 25, 2009

Mathieu Jacquet wrote:

@Viktoria

Thank you for your help Viktoria. There is actually a way of sorting out terms alphabetically in the result pane of Multiterm Extract (by clicking the language flag). It is then easy to select and delete them all. The problem is that some words appear only preceded by " ' " (no occurence of the term without " ' " in the TM). I thnk i'll have to review the all to be sure i do not forget one.

Yours,
Mathieu.


Hi Matthieu,

It can be done quite easily in Excel.

You perform the steps suggested by Victoria. You sort the entries in alphabetical order, but you do not delete anything yet. You select that part of the list which contains the words starting with an apostrophe, and you replace the apostrophe by nothing. (Note: if there are other apostrophes, it may be a problem. But I expect this to be a minor problem, which can be certainly solved by counting the number of apostrophes, for example).

Then you re-order the entries, so if there are any duplicates, they will show up in adjacent rows.

Assuming that the ordered list is in column d, you add an If function to the cells in column e: if d_i = d_(i-1) then the cell e_i is left empty, otherwise the value of e_i is copied from d_i. Check Excel's help for the If function; it is quite straightforward.

In column e, you will have some empty cells. By re-ordering alphabetically column e, you will get the final list.

Kind regards,
Attila


 

Mathieu Jacquet  Identity Verified
France
Local time: 16:42
English to French
TOPIC STARTER
Clever Mar 25, 2009

Thank you Attila, that sounds like a clever solution.

I hope to thank you soon in live (next powwow?) in Toulouse!

Mathieu.


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Skip words starting with ' when extracing terminology (Multiterm Extract)

Advanced search







SDL MultiTerm 2019
Guarantee a unified, consistent and high-quality translation with terminology software by the industry leaders.

SDL MultiTerm 2019 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2019 you can automatically create term lists from your existing documentation to save time.

More info »
BaccS – Business Accounting Software
Modern desktop project management for freelance translators

BaccS makes it easy for translators to manage their projects, schedule tasks, create invoices, and view highly customizable reports. User-friendly, ProZ.com integration, community-driven development – a few reasons BaccS is trusted by translators!

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search