Re: freebsd, mutt, unicode



On 1/29/06, Alain Bench <veronatif@xxxxxxx> wrote:
>  On Sunday, January 29, 2006 at 0:56:42 +0000, mal content wrote:
>
> > $ export LANG=en_US.UTF-8 ...both the Latin1 e acute and the attached
> > character are shown as '?' and '??' respectively. On a Latin-1
> > terminal (xterm, in this case) the same thing occurs.
>
>     Mysterious: Russian chars are octalized as unprintables, while latin
> ones are ?-masked as unconvertables...
>
>
> > the command :set charset in mutt always gives 'us-ascii'
>
>     This might explain the ?-masking. But wait: You said previously that
> "locale charmap" was correctly shouting UTF-8, on a system which
> +HAVE_LANGINFO_CODESET, and still $charset doesn't automagically get
> UTF-8 by default? Hum... I assume ":set &charset ?charset" also gives
> "us-ascii", right? So for now, put that in muttrc:
>
> | set charset="`locale charmap`"      # workaround for CODESET problem

(set)
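
To illustrate why a us-ascii $charset could give '?' for one character and
'??' for another, here is a minimal sketch in Python. It is not mutt's code;
the helper name mask_for_ascii is made up for this example, and the Cyrillic
letter is only a stand-in for the attached character. It shows that masking
each non-ASCII byte separately turns a one-byte Latin-1 character into one
'?' and a two-byte UTF-8 character into two.

```python
def mask_for_ascii(raw: bytes) -> str:
    """Decode raw bytes as us-ascii, showing every non-ASCII byte as '?'."""
    # errors="replace" substitutes U+FFFD for each undecodable byte;
    # map that to '?' to mimic what a terminal user would see.
    return raw.decode("ascii", errors="replace").replace("\ufffd", "?")


# Latin-1 e acute: a single byte, 0xE9.
e_acute_latin1 = "\u00e9".encode("iso-8859-1")
# Cyrillic small letter de in UTF-8: two bytes, 0xD0 0xB4 (stand-in character).
cyrillic_utf8 = "\u0434".encode("utf-8")

if __name__ == "__main__":
    print(len(e_acute_latin1), mask_for_ascii(e_acute_latin1))
    print(len(cyrillic_utf8), mask_for_ascii(cyrillic_utf8))
```

Whether mutt masks per byte in exactly this way is an assumption; the sketch
only shows that the observed pattern is consistent with it.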

>
>     But I guess we're back to octalization? Let's check better one of
> those 8 bits locale: What gives the little tester in sig, with xterm and
> LANG=en_GB.ISO8859-1

The little tester gives this on an xterm (I'm using koi fonts):

---
[Latin1/9] If there's no real copyrightsymbol at the end of this sentence,
then your terminal/terminalemulator/font is not ISO8859-1/15 ready: (c)

- Current environment settings:
  LANG        = "en_GB.ISO8859-1"
  LC_CTYPE    = "en_GB.UTF-8"
  LC_NUMERIC  = "en_GB.UTF-8"
  LC_TIME     = "en_GB.UTF-8"
  LC_COLLATE  = "en_GB.UTF-8"
  LC_MONETARY = "en_GB.UTF-8"
  LC_MESSAGES = "en_GB.UTF-8"

- Implicitly setting all locale categories with LANG succeeded.
  Note: setlocale() returned "en_GB.UTF-8"
  This might be a (system dependent) composite value.

  Testing LC_CTYPE with isprint():
  # # # # # # # # # # # # # # # #
  # # # # # # # # # # # # # # # #
    ! " # $ % & ' ( ) * + , - . /
  0 1 2 3 4 5 6 7 8 9 : ; < = > ?
  @ A B C D E F G H I J K L M N O
  P Q R S T U V W X Y Z [ \ ] ^ _
  ` a b c d e f g h i j k l m n o
  p q r s t u v w x y z { | } ~ #
  # # # # # # # # # # # # # # # #
  # # # # # # # # # # # # # # # #
    ¡ ¢ £ ¤ ¥ ¦ § ¨ (c) ª « ¬ ­ (r) ¯
  ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
  À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
  Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
  à á â ã ä å æ ç è é ê ë ì í î ï
  ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ

- Testing LC_MESSAGES with perror(), but it's a libc message.
  Error message for "EAGAIN": Resource temporarily unavailable

- Setting LC_CTYPE to "en_GB.UTF-8" succeeded.
- Setting LC_NUMERIC to "en_GB.UTF-8" succeeded.
- Setting LC_TIME to "en_GB.UTF-8" succeeded.
- Setting LC_COLLATE to "en_GB.UTF-8" succeeded.
- Setting LC_MONETARY to "en_GB.UTF-8" succeeded.
- Setting LC_MESSAGES to "en_GB.UTF-8" succeeded.
---
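
One thing stands out in the output above: LANG is en_GB.ISO8859-1, yet every
LC_* category and setlocale() report en_GB.UTF-8. That would be consistent
with LC_ALL (or the individual LC_* variables) still being set in that xterm,
though the output itself does not confirm it. A minimal sketch of the POSIX
lookup order, in Python; the function name effective_locale and the sample
environment are hypothetical:

```python
def effective_locale(env: dict, category: str) -> str:
    """Resolve one locale category: LC_ALL, then LC_<category>, then LANG."""
    for name in ("LC_ALL", category, "LANG"):
        value = env.get(name)
        if value:  # unset and empty both count as "not set"
            return value
    # The real fallback is implementation-defined; "C" is a common choice.
    return "C"


if __name__ == "__main__":
    # Hypothetical environment: LANG changed, but an older LC_ALL left behind.
    env = {"LANG": "en_GB.ISO8859-1", "LC_ALL": "en_GB.UTF-8"}
    print(effective_locale(env, "LC_CTYPE"))
```

If that is what happened, LC_ALL and the LC_* variables would have to be
unset before the LANG=en_GB.ISO8859-1 test says anything about the 8-bit
locale.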

On urxvt, with the standard UTF-8 locale (LANG=en_GB.UTF-8):

---
[Latin1/9] If there's no real copyrightsymbol at the end of this sentence,
then your terminal/terminalemulator/font is not ISO8859-1/15 ready: (c)

- Current environment settings:
  LANG        = "en_GB.UTF-8"
  LC_CTYPE    = "en_GB.UTF-8"
  LC_NUMERIC  = "en_GB.UTF-8"
  LC_TIME     = "en_GB.UTF-8"
  LC_COLLATE  = "en_GB.UTF-8"
  LC_MONETARY = "en_GB.UTF-8"
  LC_MESSAGES = "en_GB.UTF-8"

- Implicitly setting all locale categories with LANG succeeded.

  Testing LC_CTYPE with isprint():
  # # # # # # # # # # # # # # # #
  # # # # # # # # # # # # # # # #
    ! " # $ % & ' ( ) * + , - . /
  0 1 2 3 4 5 6 7 8 9 : ; < = > ?
  @ A B C D E F G H I J K L M N O
  P Q R S T U V W X Y Z [ \ ] ^ _
  ` a b c d e f g h i j k l m n o
  p q r s t u v w x y z { | } ~ #
  # # # # # # # # # # # # # # # #
  # # # # # # # # # # # # # # # #
    ¡ ¢ £ ¤ ¥ ¦ § ¨ (c) ª « ¬ ­ (r) ¯
  ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
  À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
  Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
  à á â ã ä å æ ç è é ê ë ì í î ï
  ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ

- Testing LC_MESSAGES with perror(), but it's a libc message.

- Setting LC_CTYPE to "en_GB.UTF-8" succeeded.
- Setting LC_NUMERIC to "en_GB.UTF-8" succeeded.
- Setting LC_TIME to "en_GB.UTF-8" succeeded.
- Setting LC_COLLATE to "en_GB.UTF-8" succeeded.
- Setting LC_MONETARY to "en_GB.UTF-8" succeeded.
- Setting LC_MESSAGES to "en_GB.UTF-8" succeeded.
---

> > System: FreeBSD 6.0-RELEASE (i386) [using ncurses 5.2] [using libiconv
> > 1.9]
>
>     Anyone else on FreeBSD 6.0 has success or failure?

a1