nmbug-status: Hardcode UTF-8 instead of using the user's locale

David [1] and Tomi [2] both feel that the user's choice of LANG is not
explicit enough to have such a strong effect on nmbug-status.  For
example, cron jobs usually default to LANG=C, and that is going to
give you ASCII output:

  $ LANG=C python -c 'import locale; print(locale.getpreferredencoding())'
  ANSI_X3.4-1968

Trying to print Unicode author names (and other strings) in that
encoding would crash nmbug-status with a UnicodeEncodeError.  To avoid
that, this patch hardcodes UTF-8, which can handle generic Unicode,
and is the preferred encoding (regardless of LANG settings) for
everyone who has chimed in on the list so far.  I'd prefer trusting
LANG, but in the absence of any users that prefer non-UTF-8 encodings
I'm fine with this approach.

While we could achieve the same effect on the output content by
dropping the previous patch (nmbug-status: Encode output using the
user's locale), Tomi also wanted UTF-8 hardcoded as the config-file
encoding [2].  Keeping the output encoding patch and then adding this
to hardcode both the config-file and output encodings at once seems
the easiest route, now that fd29d3f (nmbug-status: Decode Popen output
using the user's locale, 2014-02-10) has landed in master.

[1]: id="877g8z4v4x.fsf@zancas.localnet"
     http://article.gmane.org/gmane.mail.notmuch.general/17202
[2]: id="m2vbwj79lu.fsf@guru.guru-group.fi"
     http://article.gmane.org/gmane.mail.notmuch.general/17209
This commit is contained in:
W. Trevor King 2014-02-14 08:48:55 -08:00 committed by David Bremner
parent ffed8f2866
commit 320d4a856e

View file

@ -13,7 +13,6 @@ import codecs
import collections import collections
import datetime import datetime
import email.utils import email.utils
import locale
try: # Python 3 try: # Python 3
from urllib.parse import quote from urllib.parse import quote
except ImportError: # Python 2 except ImportError: # Python 2
@ -27,7 +26,7 @@ import subprocess
import xml.sax.saxutils import xml.sax.saxutils
_ENCODING = locale.getpreferredencoding() or sys.getdefaultencoding() _ENCODING = 'UTF-8'
_PAGES = {} _PAGES = {}