[Mutt] #3472: Lines over 1024 in message body are broken, possibly also breaking multi-byte charsets
#3472: Lines over 1024 in message body are broken, possibly also breaking
multi-byte charsets
---------------------------------------------------+------------------------
Reporter: herbelin | Owner: mutt-dev
Type: defect | Status: new
Priority: major | Milestone:
Component: mutt | Version: 1.5.20
Keywords: multibyte characters, 1024-long lines |
---------------------------------------------------+------------------------
When viewing a mail body, any line longer than 1024 bytes is broken (a
line break is inserted). This is usually harmless, but if the 1024th byte
falls inside a multi-byte character (e.g. a non-ASCII UTF-8 character),
the character is split in two and its meaning is lost. This may also
confuse charset detection when replying, garbling the whole quoted
message.
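The splitting described above can be avoided by never cutting in front of
a UTF-8 continuation byte. The following is only a minimal sketch of that
idea, not mutt's code: the function name utf8_safe_cut and its parameters
are hypothetical, the input is assumed to be valid UTF-8, and the limit is
left to the caller since the faulty buffer has not been identified.

```c
#include <stddef.h>

/* Hypothetical helper, not part of mutt.
 * Return the largest cut position <= limit that does not fall inside a
 * UTF-8 sequence. Assumes buf holds valid UTF-8. */
size_t utf8_safe_cut(const unsigned char *buf, size_t len, size_t limit)
{
    size_t cut;

    if (len <= limit)
        return len;               /* the whole line fits, no split needed */

    cut = limit;
    /* Bytes of the form 10xxxxxx continue a character begun earlier, so
     * cutting in front of one would split that character: step back. */
    while (cut > 0 && (buf[cut] & 0xC0) == 0x80)
        cut--;

    /* No boundary found (malformed input, or a limit shorter than one
     * character): fall back to a hard cut at the limit. */
    return cut > 0 ? cut : limit;
}
```

With a line whose bytes 1022 and 1023 (counting from 0) hold a two-byte
character, a limit of 1023 yields a cut at 1022, so the character moves
whole to the next chunk.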
An artificial but representative example of a line that triggers the
problem when it appears in a mail body is the following 1025-byte line:
"ÃaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaÃ"
If the line is encoded in UTF-8 in the body of the mail, one sees "\303
\251" (the two bytes of UTF-8 "é", displayed separately) instead of the
last non-ASCII character, and on reply the first one becomes "Ã©", as if
mutt had decided (because of the stray \303 and \251) that the mail was
not in UTF-8 but in ISO Latin-1.
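The reply symptom is what one would expect if the UTF-8 bytes were taken
for ISO-8859-1 and then converted to UTF-8 again. The sketch below only
illustrates that arithmetic; latin1_to_utf8 is a hypothetical helper, not a
mutt function, and whether mutt actually takes this path is an assumption.

```c
#include <stddef.h>

/* Hypothetical helper, not part of mutt.
 * Re-encode n bytes, taken as ISO-8859-1, into UTF-8.
 * out must have room for 2 * n bytes; returns the number written. */
size_t latin1_to_utf8(const unsigned char *in, size_t n, unsigned char *out)
{
    size_t i, j = 0;

    for (i = 0; i < n; i++) {
        if (in[i] < 0x80) {
            out[j++] = in[i];     /* ASCII is identical in both charsets */
        } else {
            /* U+0080..U+00FF always encode as two UTF-8 bytes. */
            out[j++] = (unsigned char)(0xC0 | (in[i] >> 6));
            out[j++] = (unsigned char)(0x80 | (in[i] & 0x3F));
        }
    }
    return j;
}
```

Feeding it the two bytes \303 \251 (UTF-8 "é") produces the four bytes
\303 \203 \302 \251, which a UTF-8 terminal displays as "Ã©".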
I had a quick look at mutt's code but could not determine which of its
numerous 1024-byte buffers is the faulty one.
--
Ticket URL: <http://dev.mutt.org/trac/ticket/3472>
Mutt <http://www.mutt.org/>
The Mutt mail user agent