<<< Date Index >>>     <<< Thread Index >>>

Re: charset fixing at display time



On Wed, Dec 31, 2003 at 05:27:56PM -0200, Carlos Laviola wrote:
>   [23 lines, 170 words, 954 characters]  Top characters: etoarnis
> 
> On Tue, Dec 30, 2003 at 05:55:09PM -0500, Jeff Abrahamson wrote:
> > Some email arrive with eight bit characters that are not interpreted
> > correctly.  Mostly, these are punctuation: octal 222 for single quote,
> > octal 223 and 224 for double quotes, octal 226 for dash.
> > 
> > One way to handle this is to have my text/plain viewer make the
> > modifications.  This is what I am currently doing.
> > 
> > (Another way is to get the senders to reform their ways and do proper
> > charset tagging instead of none or us-ascii.)
> > 
> > Is there a better way?
> 
> You should be using en_US as your locale.  If by running 'locale' (the
> actual command) either LANG and/or LC_ALL are set to 'C' or 'POSIX',
> you're not going to see any accents on mutt.
> 
> On Debian, you should dpkg-reconfigure locales, tell it to generate
> en_US (or en_GB, or whatever) and use it.  Or wrestle with en_US.UTF-8
> like me, but it's hell on earth these days...

I see accents ok.  I get a lot of mail en français, and that cedilla
shows up just fine, along with all the other accents I expect.

It's when "dog" shows up as \223dog\224 and dog's shows up as dog\222s
that I am annoyed and driven to say

    | perl -pwe "s/\222/'/g;" | perl -pwe 's/[\223\224]/"/g; s/\226/--/g;'

BTW,

    jeff@asterix:jeff $ locale
    LANG=en_US
    LC_CTYPE="en_US"
    LC_NUMERIC="en_US"
    LC_TIME="en_US"
    LC_COLLATE="en_US"
    LC_MONETARY="en_US"
    LC_MESSAGES="en_US"
    LC_PAPER="en_US"
    LC_NAME="en_US"
    LC_ADDRESS="en_US"
    LC_TELEPHONE="en_US"
    LC_MEASUREMENT="en_US"
    LC_IDENTIFICATION="en_US"
    LC_ALL=
    jeff@asterix:jeff $

-- 
 Jeff

 Jeff Abrahamson  <http://www.purple.com/jeff/>
 GPG fingerprint: 1A1A BA95 D082 A558 A276  63C6 16BF 8C4C 0D1D AE4B

Attachment: pgpchSK75KXIt.pgp
Description: PGP signature