
[Mutt] #3472: Lines over 1024 bytes in message body are broken, possibly also breaking multi-byte charsets



#3472: Lines over 1024 bytes in message body are broken, possibly also breaking
multi-byte charsets
---------------------------------------------------+------------------------
 Reporter:  herbelin                               |       Owner:  mutt-dev
     Type:  defect                                 |      Status:  new     
 Priority:  major                                  |   Milestone:          
Component:  mutt                                   |     Version:  1.5.20  
 Keywords:  multibyte characters, 1024-long lines  |  
---------------------------------------------------+------------------------
 When viewing a mail body, lines longer than 1024 bytes are broken (a line
 break is inserted). Usually this is harmless, but if the break falls
 inside a multi-byte character (e.g. a non-ASCII UTF-8 character that
 straddles the 1024th byte), the character is split into two parts and its
 meaning is lost. This may also fool the charset detection when replying
 and garble the whole quoted message.
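
 To illustrate the failure mode described above, here is a minimal Python
 sketch. It is not taken from the mutt source: the function names
 split_fixed and split_utf8_safe are hypothetical, and the second function
 only shows one way a byte-limited split could respect UTF-8 character
 boundaries, assuming the input is valid UTF-8.

```python
def split_fixed(data: bytes, limit: int) -> list:
    """Cut data into chunks of at most `limit` bytes, ignoring character boundaries."""
    return [data[i:i + limit] for i in range(0, len(data), limit)]


def split_utf8_safe(data: bytes, limit: int) -> list:
    """Cut data into chunks of at most `limit` bytes without splitting a UTF-8 sequence."""
    chunks = []
    start = 0
    while start < len(data):
        end = min(start + limit, len(data))
        # UTF-8 continuation bytes have the form 0b10xxxxxx. If the byte at the
        # cut position is one, move the cut back to the start of that character.
        while start < end < len(data) and (data[end] & 0xC0) == 0x80:
            end -= 1
        if end == start:
            # Malformed input or a limit smaller than one character: hard cut.
            end = min(start + limit, len(data))
        chunks.append(data[start:end])
        start = end
    return chunks


# "abc" followed by U+00E9 (e with acute accent), which is b"\xc3\xa9" in UTF-8.
sample = "abc\u00e9".encode("utf-8")

naive = split_fixed(sample, 4)      # [b"abc\xc3", b"\xa9"]: the character is torn apart
safe = split_utf8_safe(sample, 4)   # [b"abc", b"\xc3\xa9"]: the character stays whole
print(naive)
print(safe)
```

 With a limit of 1024 instead of 4, the naive variant reproduces the split
 reported in this ticket whenever a multi-byte character straddles the
 1024th byte.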

 An artificial but typical example of a line that fools mutt when it
 appears in the body of a mail is the following 1025-byte line:
 
 "é" followed by 1021 "a" characters and a final "é" (2 + 1021 + 2 = 1025 bytes in UTF-8)

 If the line is encoded in UTF-8 in the body of the mail, one sees "\303
 \251" instead of the last "é" (U+00E9), and when replying, the first "é"
 becomes "Ã©", as if mutt had decided (because of the stray \303 and \251
 bytes) that the mail was not in UTF-8 but in an ISO Latin charset.
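
 The byte arithmetic behind this example can be checked with a short Python
 sketch. It only models the symptoms reported here. It assumes the cut
 happens right after the 1024th byte (the ticket does not identify where
 mutt actually breaks the line), and the helper name octal is hypothetical.

```python
E_ACUTE = "\u00e9".encode("utf-8")        # U+00E9 (e with acute accent) is b"\xc3\xa9"
line = E_ACUTE + b"a" * 1021 + E_ACUTE    # 2 + 1021 + 2 = 1025 bytes

# Assumed cut position: immediately after the 1024th byte.
head, tail = line[:1024], line[1024:]


def octal(data: bytes) -> str:
    """Render bytes as backslash-octal escapes, the notation used in this report."""
    return " ".join("\\%03o" % b for b in data)


print(len(line))                          # 1025
print(octal(head[-1:]), octal(tail))      # \303 \251

# The first fragment now ends in an incomplete sequence, so it is not valid UTF-8.
try:
    head.decode("utf-8")
    head_is_valid_utf8 = True
except UnicodeDecodeError:
    head_is_valid_utf8 = False
print(head_is_valid_utf8)                 # False

# Reading the two UTF-8 bytes as ISO-8859-1 yields two characters,
# U+00C3 (A with tilde) and U+00A9 (copyright sign).
misread = E_ACUTE.decode("latin-1")
print(misread == "\u00c3\u00a9")          # True
```

 The last check matches the "Ã©" seen in the quoted reply: it is what the
 two bytes of "é" look like once they are treated as ISO-8859-1 rather than
 UTF-8.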

 I had a quick look at the mutt source but could not figure out which of
 its numerous 1024-byte buffers is the faulty one.

-- 
Ticket URL: <http://dev.mutt.org/trac/ticket/3472>
Mutt <http://www.mutt.org/>
The Mutt mail user agent