Export Chinese Text

BaptisteC · August 2, 2018, 12:04pm

Hi,

I have to translate Chinese text from a DWG file.

I was thinking of using a Python script in Rhino to export each text in my model to a CSV file, open that file in MS Excel, translate every text, and finally import the translated text back into the original model by replacing the text objects’ contents.

I can build the main workflow, but I am having difficulties with the encoding of Chinese characters.

Message: 'ascii' codec can't encode character '\u5BA4' in position 0: ordinal not in range(128)

Is there someone who could help with this?

Thanks

jesterking · August 2, 2018, 12:10pm

What is the code that gives this error? Preferably both the string construction and the string usage (probably the writing to the text file?).

BaptisteC · August 2, 2018, 12:59pm

Here is my code:

# -*- coding: utf-8 -*-
import rhinoscriptsyntax as rs

# extract texts in the model
textList = []
for object in rs.AllObjects():
    if rs.IsText(object):
        textList.append(rs.TextObjectText(object))

# create a filename variable
filename = "C:\\Users\\XXXX\\Desktop\\Exportation.csv"

file = open(filename, 'w')

#create and write a header for the CSV file
headerList=["Index","中国","English","Français"]
header = ";".join(headerList)+"\n"
file.write(header)

#create and write a line in CSV file for every text in the model
lineList=[]
i=0
for text in textList:
    line = [i,text]
    i+=1

for line in lineList:
    fileLine = ";".join(line)+"\n"
    file.write(fileLine)

file.close()

And a Rhino 6 file for testing:

Chinese text exportation.3dm (39.6 KB)

Thank you for taking a look.

jesterking · August 2, 2018, 2:20pm

Python is extremely finicky with non-ASCII strings. You’ll have to explicitly encode all strings to UTF-8 for it to work properly, including your header:

# -*- coding: utf-8 -*-
import rhinoscriptsyntax as rs

# extract texts in the model
textList = []
for o in rs.AllObjects():
    if rs.IsText(o):
        # explicitly encode to utf-8
        s = rs.TextObjectText(o).encode('utf-8')
        textList.append(s)

# create a filename variable
filename = "C:\\Users\\XXX\\Desktop\\Exportation.csv"

file = open(filename, 'w')

#create and write a header for the CSV file
headerList=[u'Index',u'中国',u'English',u'Français']
# explicitly encode to utf-8
headerList = [i.encode('utf-8') for i in headerList]
header = u"{}\n".format(u';'.join(headerList))
file.write(header)

#create and write a line in CSV file for every text in the model
lineList=[]
i=0
for text in textList:
    line = [str(i),text]
    i+=1
    lineList.append(line)

for line in lineList:
    fileLine = u';'.join(line)+u'\n'
    file.write(fileLine)

file.close()

utf8_export.py (917 Bytes)

From your 3dm, this produces:

Exportation.csv.txt (158 Bytes) (Don’t forget to remove the .txt bit)
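
An alternative that avoids the manual .encode() calls can be sketched as follows: keep every string as unicode and let io.open do the encoding when writing. This assumes the io module behaves in Rhino's IronPython as it does in standard Python 2.7. The helper name write_rows is hypothetical and not part of the script above, and the rows would come from rs.TextObjectText() in Rhino.

```python
# -*- coding: utf-8 -*-
import io


def write_rows(path, header, rows, encoding='utf-8'):
    """Write a header and one 'index;text' line per row.

    All inputs are expected to be unicode strings; io.open encodes
    them on write, so no explicit .encode() calls are needed.
    """
    with io.open(path, 'w', encoding=encoding) as f:
        f.write(u';'.join(header) + u'\n')
        for index, text in enumerate(rows):
            # u'{}'.format(index) keeps the index as unicode as well
            f.write(u';'.join([u'{}'.format(index), text]) + u'\n')
```

It could be called as write_rows(filename, [u'Index', u'中国', u'English', u'Français'], textList), with textList holding the unencoded texts. Two caveats: a text that itself contains ';' or a line break would break the layout of this simple format, and if Excel shows garbled characters, passing encoding='utf-8-sig' (which prepends a byte order mark) may help, though whether that codec is available in Rhino's IronPython is untested here.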

BaptisteC · August 2, 2018, 4:16pm

Thank you very much, it does the job perfectly!

jesterking · August 2, 2018, 5:34pm

Enjoy
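
For the last step of the workflow described in the first post, reading the translated file back, a minimal sketch might look like this. It assumes the file was saved from Excel as UTF-8 with ';' as separator, that the translation sits in the third column ("English" in the header used in this thread), and that the 'utf-8-sig' codec (which accepts files with or without a byte order mark) is available in Rhino's IronPython. The function name read_translations is hypothetical.

```python
# -*- coding: utf-8 -*-
import io


def read_translations(path, column=2, encoding='utf-8-sig'):
    """Return {index: translated text} from a ';'-separated file.

    column is the 0-based position of the translated text.
    Rows with an empty translation are skipped.
    """
    translations = {}
    with io.open(path, 'r', encoding=encoding) as f:
        next(f, None)  # skip the header line
        for line in f:
            fields = line.rstrip(u'\r\n').split(u';')
            if len(fields) > column and fields[column]:
                translations[int(fields[0])] = fields[column]
    return translations
```

The returned indices only map back to the right text objects if the objects are enumerated in the same order as during export. Writing each object's id into the CSV instead of a running index would avoid relying on that.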
