Skip words starting with ' when extracing terminology (Multiterm Extract)
Thread poster: Mathieu Jacquet

Mathieu Jacquet  Identity Verified
France
Local time: 12:43
English to French
Mar 24, 2009

Hi all,

is there any way to skip terms starting with a single quote when extracting terminology? We do have a lot in French (virtually any word starting with a vowel and following a definite article) and the result is a significant number of "doublons" in the result pane of Multiterm Extract.

For instance:

1. 'usine
2. usine

I was thinking of some setting in an exclusion file, but am stuck on how to achieve that.

Any help appreciated!

Mathieu.


Direct link Reply with quote
 

ViktoriaG  Identity Verified
Canada
Local time: 06:43
English to French
+ ...
Manual method Mar 24, 2009

Off the top of my head, I can think of one thing you could do.

If you are extracting bilingual terminology (you are not yet at a stage where you fill in fields other than source term and target term), you can take the initial term extraction output and export it to Excel. Then, you can sort the source terms (also select target terms, naturally) alphabetically. Once you are done with that, all of the words that start with an apostrophe will appear next to each other - they will be easy to delete with a few clicks of the mouse. Then, you just need to import them back into MultiTerm and process whatever you still need to process.

In time, some of our colleagues may post easier methods, but if you're in a hurry, I am almost convinced that this will fix it.

All the best!


Direct link Reply with quote
 

Carmen Moreton
Spain
Local time: 12:43
French to Spanish
+ ...
Create a filter Mar 25, 2009

I don't use MT Extract to much time ago, but I remember that you can create a filter and maybe there you can skip the terms with a " ' ".
Hope this helps...


Direct link Reply with quote
 

Mathieu Jacquet  Identity Verified
France
Local time: 12:43
English to French
TOPIC STARTER
Sorting out available in Multiter Extract Mar 25, 2009

@Viktoria

Thank you for your help Viktoria. There is actually a way of sorting out terms alphabetically in the result pane of Multiterm Extract (by clicking the language flag). It is then easy to select and delete them all. The problem is that some words appear only preceded by " ' " (no occurence of the term without " ' " in the TM). I thnk i'll have to review the all to be sure i do not forget one.

Yours,
Mathieu.


Direct link Reply with quote
 

Attila Piróth  Identity Verified
France
Local time: 12:43
Member
English to Hungarian
+ ...
Excel solution revisited Mar 25, 2009

Mathieu Jacquet wrote:

@Viktoria

Thank you for your help Viktoria. There is actually a way of sorting out terms alphabetically in the result pane of Multiterm Extract (by clicking the language flag). It is then easy to select and delete them all. The problem is that some words appear only preceded by " ' " (no occurence of the term without " ' " in the TM). I thnk i'll have to review the all to be sure i do not forget one.

Yours,
Mathieu.


Hi Matthieu,

It can be done quite easily in Excel.

You perform the steps suggested by Victoria. You sort the entries in alphabetical order, but you do not delete anything yet. You select that part of the list which contains the words starting with an apostrophe, and you replace the apostrophe by nothing. (Note: if there are other apostrophes, it may be a problem. But I expect this to be a minor problem, which can be certainly solved by counting the number of apostrophes, for example).

Then you re-order the entries, so if there are any duplicates, they will show up in adjacent rows.

Assuming that the ordered list is in column d, you add an If function to the cells in column e: if d_i = d_(i-1) then the cell e_i is left empty, otherwise the value of e_i is copied from d_i. Check Excel's help for the If function; it is quite straightforward.

In column e, you will have some empty cells. By re-ordering alphabetically column e, you will get the final list.

Kind regards,
Attila


Direct link Reply with quote
 

Mathieu Jacquet  Identity Verified
France
Local time: 12:43
English to French
TOPIC STARTER
Clever Mar 25, 2009

Thank you Attila, that sounds like a clever solution.

I hope to thank you soon in live (next powwow?) in Toulouse!

Mathieu.


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Skip words starting with ' when extracing terminology (Multiterm Extract)

Advanced search







CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use SDL Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

More info »
Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums