text editor with Unicode support : looking for recommendations
Thread poster: Angel Llacuna

Angel Llacuna  Identity Verified
Spain
Local time: 15:37
English to Spanish
May 23

I am looking for recommendations for a free text editor with full support of the Unicode character encoding and running in Windows 7

 

Darius Sciuka  Identity Verified
Lithuania
Local time: 16:37
Member (2016)
English to Lithuanian
+ ...
Notepad++ May 23

https://notepad-plus-plus.org/

 

Angel Llacuna  Identity Verified
Spain
Local time: 15:37
English to Spanish
TOPIC STARTER
thank you Darius May 24

I think Notepad ++ will be appropriate for my purposes

 

Samuel Murray  Identity Verified
Netherlands
Local time: 15:37
Member (2006)
English to Afrikaans
+ ...
Babelpad May 24

Angel Llacuna wrote:
I am looking for recommendations for a free text editor with full support of the Unicode character encoding and running in Windows 7.


Babelpad excels at Unicode, but it isn't a very powerful text editor otherwise.

Compare Babelpad to Notepad++ by pasting [👓] into it. The character [👓] (codepoint U+1F453) has been part of Unicode since 2010 already, and Notepad++ still can't display it, but Babelpad can.

[Edited at 2018-05-24 06:25 GMT]


 

esperantisto  Identity Verified
Local time: 16:37
Member (2006)
English to Russian
+ ...
Some more suggestions May 24

AkelPad
UniRed
jEdit


 

Rolf Keller
Germany
Local time: 15:37
English to German
A modern alternative May 24

Visual Studio Code by Microsoft

 

Angel Llacuna  Identity Verified
Spain
Local time: 15:37
English to Spanish
TOPIC STARTER
I intend to use the text editor ... May 24

to import Terminology into a new TermBase for memoQ. I need that memoQ treats the file as Unicode so the accented vowels of the Spanish language can be rendered properly in memoQ

 

esperantisto  Identity Verified
Local time: 16:37
Member (2006)
English to Russian
+ ...
The question is… May 24

What is the source of your terminology? MemoQ does accept comma- or tab-delimited text files in virtually any encoding, the default being UTF-8. Thus, any of the suggested text editors would be fine, as well as a bunch of other text editors. However, you may find it more convenient to use a spreadsheet editor such as OpenOffice Calc or Microsoft Excel.

Here is my StarBasic macro that exports a Calc spreadsheet to a glossary (I created it for OmegaT, but memoQ should be happy with the resulting file too):

Code:
Sub ExportToGlossary

Dim oDoc as Object
Dim glossaryFileProperties(3) as new com.sun.star.beans.PropertyValue
Dim sURL as String
Dim iLen as Integer
Dim isSpreadSheet as Boolean

oDoc = ThisComponent
If oDoc.getLocation() = "" Then Exit Sub

isSpreadSheet = oDoc.supportsService("com.sun.star.sheet.SpreadsheetDocument")

If isSpreadSheet Then
sURL = oDoc.getLocation()
iLen = len(sURL)

If lcase(right(sURL,4))=".ods" Then
sURL = left(sURL, iLen - 4)
Else
If lcase(right(sURL,4))=".xls" Then
sURL = left(sURL, iLen - 4)
Else
If lcase(right(sURL,5))=".xlsx" Then
sURL = left(sURL, iLen - 5)
End If
End If
End If

sURL = sURL + ".txt"

glossaryFileProperties(0).Name = "FilterName" ' setting properties of exported file such as tab as field delimiter, nothing as text delimiter, UTF-8 as encoding
glossaryFileProperties(0).Value = "Text - txt - csv (StarCalc)"
glossaryFileProperties(1).Name = "FilterOptions"
glossaryFileProperties(1).Value = "9,0,76,1,,0,false,true,false"
glossaryFileProperties(2).Name = "Overwrite"
glossaryFileProperties(2).Value = True

oDoc.storeToURL(sURL, glossaryFileProperties())
End If
End Sub



 

Angel Llacuna  Identity Verified
Spain
Local time: 15:37
English to Spanish
TOPIC STARTER
the source for the terminology is a ... May 24

plain text file where the source and target are separated with a tab character,
one row for each entry. Finally, I was able to do it creating a new empty file in Texpad using UTF-8 as format. Then I pasted the terminology into the file and created the TermBase. Now all the characters appear well rendered.

I have yet to create a regular expression to find and replace quickly the redundant tabs contained on the terminology, that is, all the tabs on each row other than the first tab

[Edited at 2018-05-24 12:25 GMT]


 

Angel Llacuna  Identity Verified
Spain
Local time: 15:37
English to Spanish
TOPIC STARTER
error May 24

remove this

[Edited at 2018-05-24 12:27 GMT]


 

Samuel Murray  Identity Verified
Netherlands
Local time: 15:37
Member (2006)
English to Afrikaans
+ ...
@Angel May 24

Angel Llacuna wrote:
I need to import terminology into a new termbase for memoQ. I need that memoQ treat the file as Unicode so the accented vowels of the Spanish language can be rendered properly in memoQ.


Oh, okay. Well, Akelpad would work (Notepad++ probably as well). It's important that you save the text file as either "UTF-16 LE" (not BE) or "UTF-8 with BOM" (byte order mark), otherwise MemoQ might think your file is ANSI or similar.


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

text editor with Unicode support : looking for recommendations

Advanced search






CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use SDL Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

More info »
BaccS – Business Accounting Software
Modern desktop project management for freelance translators

BaccS makes it easy for translators to manage their projects, schedule tasks, create invoices, and view highly customizable reports. User-friendly, ProZ.com integration, community-driven development – a few reasons BaccS is trusted by translators!

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search