<<< Date Index >>>     <<< Thread Index >>>

Re: [Mutt] #3472: Lines over 1024 in message body are broken, possibly also broking multi-byte charsets



#3472: Lines over 1024 in message body are broken, possibly also broking multi-
byte charsets
---------------------------------------------------+------------------------
 Reporter:  herbelin                               |       Owner:  mutt-dev
     Type:  defect                                 |      Status:  new     
 Priority:  major                                  |   Milestone:          
Component:  mutt                                   |     Version:  1.5.20  
 Keywords:  multibyte characters, 1024-long lines  |  
---------------------------------------------------+------------------------

Comment(by herbelin):

 {{{
 On Tue, Nov 23, 2010 at 04:48:08PM +0100, Mutt wrote:

 It seems that the bug tracker indents lines by 2 spaces. So if you're
 talking about the mail received from fleas and whose comment is
 enclosed between "{{{" and "}}}" then the "Ã" is no longer broken (but
 still appearing on a different line than the "aaaa..." while it should
 not).

 Let me try again with another line that takes into account the
 automatic 2-spaces indentation of fleas@xxxxxxxx:

 Â This is single line of 1025 characters ending with a closing chevron
 that are preceded by two 2-bytes utf-8 characters... So, the sequence of
 dots in this line should end into "......ÃÃ" with two e with acute accent
 and the closing chevron on the same line. With my version of mutt
 (1.5.20), the "ÃÃ" are damaged and the closing chevron comes on the next
 line. With an extra indentation of 2 spaces as done in messages received
 from fleat@xxxxxxxx then it is the first "Ã" which is shown as "\303"
 followed by line break, followed by "\251", the second à and the closing
 chevron. Without the extra indentation of 2 spaces, it is the second "Ã"
 which is damaged. Then, how does the end of this line? Does it correctly
 show a sequence of dots ended by "ÃÃ" and a closing chevron?
 
......................................................................................................................................................................................................................................ÃÃ
 Â

 Actually, there is even a line that freezes my instance of mutt! It is
 the long line below (assuming an automatic extra indentation of
 two-characters).

 Â This is single line of 1025 characters ending with a closing chevron
 that are preceded by two 2-bytes utf-8 characters... So, the sequence of
 dots in this line should end into "......ÃÃ" with two e with acute accent
 and the closing chevron on the same line. With my version of mutt
 (1.5.20), the "ÃÃ" are damaged and the closing chevron comes on the next
 line. With an extra indentation of 2 spaces as done in messages received
 from fleat@xxxxxxxx then it is the first "Ã" which is shown as "\303"
 followed by line break, followed by "\251", the second à and the closing
 chevron. Without the extra indentation of 2 spaces, it is the second "Ã"
 which is damaged. Then, how does the end of this line? Does it correctly
 show a sequence of dots ended by "ÃÃ" and a closing chevron?
 
....................................................................................................................................................................................................................................ÃÃ
 Â


 I removed the "set charset", indeed it is not necessary, but I still
 have lines longer than 1024 bytes cut into pieces of 1024 byte blocks
 and multi-bytes characters still damaged if overlapping a 1024-byte
 block and the next one.

 Thanks in advance for your answer.

 Hugo
 }}}

-- 
Ticket URL: <http://dev.mutt.org/trac/ticket/3472#comment:>
Mutt <http://www.mutt.org/>
The Mutt mail user agent