既然还没有人跳进来,这就是我的镜头。在写入控制台时,Python会设置stdout的编码,但在写入文件时,Python不会设置。该脚本显示了问题
import sys
msg = {'text':u'\2026'}
sys.stderr.write('default encoding: %s\n' % sys.stdout.encoding)
print msg['text']
$ python bad.py>/tmp/xxx
default encoding: None
Traceback (most recent call last):
File "fix.py", line 5, in <module>
print msg['text']
UnicodeEncodeError: 'ascii' codec can't encode character u'\x82' in position 0: ordinal not in range(128)
添加编码
import sys
msg = {'text':u'\2026'}
sys.stderr.write('default encoding: %s\n' % sys.stdout.encoding)
encoding = sys.stdout.encoding or 'utf-8'
print msg['text'].encode(encoding)
问题解决了
$ python good.py >/tmp/xxx
default encoding: None
$ cat /tmp/xxx
6