Samuel Murray Netherlands Local time: 08:50 Member (2006) English to Afrikaans + ...
Jun 17, 2010
G'day everyone
I have 25 TBX files with a lot of useful stuff in them. I'd like to convert it to CSV so that I can merge it all, sort it, and remove duplicate entries, to come up with a final, useful glossary. Does anyone know of a tool that can convert TBX to CSV (preferably in bulk)? Or... a tool that can merge TBX files?
Thanks
Samuel
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Spiros Doikas Greece Local time: 09:50 Member (2002) English to Greek + ...
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Pablo Bouvier Spain Local time: 08:50 Member German to Spanish + ...
Converting TBX 2 CSV
Jun 18, 2010
Samuel Murray wrote:
G'day everyone
I have 25 TBX files with a lot of useful stuff in them. I'd like to convert it to CSV so that I can merge it all, sort it, and remove duplicate entries, to come up with a final, useful glossary. Does anyone know of a tool that can convert TBX to CSV (preferably in bulk)? Or... a tool that can merge TBX files?
Thanks
Samuel
Hi, Samuel:
You did not though to use APSIC Xbench? So far I know, in the latest editions you may import a glossary in TBX format and then save it as tabbed text file. But, I have not checked...
Good luck!
Pablo B.
[Editado a las 2010-06-18 04:50 GMT]
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Piotr Bienkowski Poland Local time: 08:50 Member (2005) English to Polish + ...
This project only offers a tbxCHECK tool, which does not imply any ability to import or export.
Samuel, I can't make any promises, but if you're interested in extracting just the source and translation from these files, I might be able to write an XSLT transformation for you, if you send me just one TBX file. E-mail me at syntax[{@}]neo.pl
Regards,
Piotr
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
FarkasAndras Hungary Local time: 08:50 English to Hungarian + ...
Xbench
Jun 18, 2010
Xbench should also be able to do this, provided that you only need the source and target terms from the glossaries, no other fields. If you already have it installed, give it a shot. (Xbench can import pretty much anything, including TBX, and can and export in tab delimited format which is trivial to convert to csv or whatever you want to).
I think could also throw together a converter script pretty easily if it came to it, I'd just need to modify the script I already have for MultiTerm exports, which are also XML.
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Samuel Murray Netherlands Local time: 08:50 Member (2006) English to Afrikaans + ...
TOPIC STARTER
On APSIC Xbench
Jun 18, 2010
Pablo Bouvier wrote:
You did not though to use APSIC Xbench? So far I know, in the latest editions you may import a glossary in TBX format and then save it as tabbed text file.
I've just tried it, and it works, if you hack the TBX file a bit.
XBench does not accept TBX files with single quotes. You have to change all single quotes to double quotes first. In other words, you have to change <martif type='TBX' xml:lang='en'> into >martif type="TBX" xml:lang="en"> etc.
To import a TBX file, simply create a few project, and you'll be prompted to add glossary files. The file extension that XBench expects TBX to have is "xml", but it does allow you to select "All files". To export the tab-delimited file, go Tools > Export Items.
Unfortunately you can export only one TBX file at a time. And *unfortunately* Xbench does not retain the term definitions in the exported file.
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Samuel Murray Netherlands Local time: 08:50 Member (2006) English to Afrikaans + ...
TOPIC STARTER
On Across
Jun 18, 2010
Michael J.W. Beijer wrote:
Have you tried playing around with Across (Personal Edition)? Imports: CSV, Langenscheidt, MultiTerm, Star Transit, TBX
Exports: CSV, TBX
Luckily I do have Across installed on my computer. But... it tells me my TBX files are an invalid format, with no clue as to what may be so invalid about it. At least Xbench gave me a traceable error message that enabled me to find out what the program wanted. Across just says "invalid format".
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Pablo Bouvier Spain Local time: 08:50 Member German to Spanish + ...
Converting TBX 2 CSV
Jun 18, 2010
Samuel Murray wrote:
Pablo Bouvier wrote:
You did not though to use APSIC Xbench? So far I know, in the latest editions you may import a glossary in TBX format and then save it as tabbed text file.
I've just tried it, and it works, if you hack the TBX file a bit.
XBench does not accept TBX files with single quotes. You have to change all single quotes to double quotes first. In other words, you have to change into >martif type="TBX" xml:lang="en"> etc.
To import a TBX file, simply create a few project, and you'll be prompted to add glossary files. The file extension that XBench expects TBX to have is "xml", but it does allow you to select "All files". To export the tab-delimited file, go Tools > Export Items.
Unfortunately you can export only one TBX file at a time. And *unfortunately* Xbench does not retain the term definitions in the exported file.
May be it is worth to write a line to Apsic on this behalf. I did it at the very beginning for an issue related to the Apsic Comparator utility and in the next version it had been fixed.
In my experience, they listen to you even if you are not related in any manner with them, and are kind enough to send a short answer. If I had to assign them a rating, this would be: effective...
Many thanks for the feedback about Xbench & Across.
[Editado a las 2010-06-18 17:38 GMT]
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
I haven't had to do it but if you have MultiTerm you might try importing into MultiTerm and then exporting from it using a tool called "Excelling MultiTerm" (http://www.experttools.at/index_new.php?mmid=6)
It's a third-party tool which claims to be able to make good CSV exports from MultiTerm keeping the columns in the right place.
Daniel
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Samuel Murray Netherlands Local time: 08:50 Member (2006) English to Afrikaans + ...
TOPIC STARTER
On APSIC Xbench II
Jun 21, 2010
Samuel Murray wrote:
Pablo Bouvier wrote:
You did not though to use APSIC Xbench? So far I know, in the latest editions you may import a glossary in TBX format and then save it as tabbed text file.
I've just tried it, and it works, if you hack the TBX file a bit.
Sergei of Xbench contacted me privately about the problem, and fixed it in the latest development version. Also, the fixed version of Xbench contains "tbx" in the file-open list, and (I'm not sure if this was so in the previous version) one can import many TBX files and then export the whole lot in one go into a single file, if you want.
1. In Xbench, go Project > Properties.
2. Click Add and select TBX.
3. Click Next, select your TBX files, and OK.
4. Click Next twice, then select your languages, and click OK.
5. Click OK to close the Project Properties dialog.
Then:
1. In Xbench, go Tools > Export Items.
2. Select "All items", choose "Tabbed Text file", and browse for a target file name.
3. Click OK.
You can export to tabbed text or to TMX. You can import (or "add", rather) a whole lot of file types, including glossary and TM formats.
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
FarkasAndras Hungary Local time: 08:50 English to Hungarian + ...
Thanks
Jun 21, 2010
Samuel Murray wrote:
Samuel Murray wrote:
Pablo Bouvier wrote:
You did not though to use APSIC Xbench? So far I know, in the latest editions you may import a glossary in TBX format and then save it as tabbed text file.
I've just tried it, and it works, if you hack the TBX file a bit.
Sergei of Xbench contacted me privately about the problem, and fixed it in the latest development version. Also, the fixed version of Xbench contains "tbx" in the file-open list, and (I'm not sure if this was so in the previous version) one can import many TBX files and then export the whole lot in one go into a single file, if you want.
1. In Xbench, go Project > Properties.
2. Click Add and select TBX.
3. Click Next, select your TBX files, and OK.
4. Click Next twice, then select your languages, and click OK.
5. Click OK to close the Project Properties dialog.
Then:
1. In Xbench, go Tools > Export Items.
2. Select "All items", choose "Tabbed Text file", and browse for a target file name.
3. Click OK.
You can export to tabbed text or to TMX. You can import (or "add", rather) a whole lot of file types, including glossary and TM formats.
... for reporting back. Much appreciated. Too many people around here ask for help and then disappear without doing the decent thing and letting everyone (esp. people with the same problem who find the thread later) know which suggested solution worked for them - and exactly how.
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Pablo Bouvier Spain Local time: 08:50 Member German to Spanish + ...
Betterr late than never...
Jul 19, 2010
Samuel Murray wrote:
Samuel Murray wrote:
Pablo Bouvier wrote:
You did not though to use APSIC Xbench? So far I know, in the latest editions you may import a glossary in TBX format and then save it as tabbed text file.
I've just tried it, and it works, if you hack the TBX file a bit.
Sergei of Xbench contacted me privately about the problem, and fixed it in the latest development version. Also, the fixed version of Xbench contains "tbx" in the file-open list, and (I'm not sure if this was so in the previous version) one can import many TBX files and then export the whole lot in one go into a single file, if you want.
1. In Xbench, go Project > Properties.
2. Click Add and select TBX.
3. Click Next, select your TBX files, and OK.
4. Click Next twice, then select your languages, and click OK.
5. Click OK to close the Project Properties dialog.
Then:
1. In Xbench, go Tools > Export Items.
2. Select "All items", choose "Tabbed Text file", and browse for a target file name.
3. Click OK.
You can export to tabbed text or to TMX. You can import (or "add", rather) a whole lot of file types, including glossary and TM formats.
Great!! Samuel, thanks a lot for the feedback, even late.
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
I just downloaded the Microsoft Glossary in tbx and was going crazy over how to open it. So I gave it a shot and opened it in Excel. It was awkward the first time, but the important thing is it worked, and after saving as Excel file it's converted well. I don't know if it's supposed to look like that -a bit odd- but it's functional.
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
elishevah Israel Local time: 09:50 English to Hebrew + ...
Opened in Excel
Oct 6, 2011
Hi. I just opened a MS glossary with TBX extension in MS-Excel 2003. No problems.
Elisheva Hershler
Technical Translator
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
A fully featured online CAT tool and TMS, with no installation required, and a simple, intuitive interface. Maximize linguistic assets by sharing in real time as you collaborate with colleagues. Make use of next generation, cloud-based translation technol
Start and finish your translations faster than ever with Fluency Translation Suite 2011. TMs, Terminology, and Online Resources are all fully integrated and only a click away. Download a free trial today!