Re: html2text + microsoft schemas
- To: mutt-users@xxxxxxxx
- Subject: Re: html2text + microsoft schemas
- From: Kyle Wheeler <kyle-mutt@xxxxxxxxxxxxxx>
- Date: Mon, 8 Jun 2009 14:40:08 -0500
- Comment: DomainKeys? See http://domainkeys.sourceforge.net/
- Dkim-signature: v=1; a=rsa-sha1; c=relaxed; d=memoryhole.net; h=date	:from:to:subject:message-id:references:mime-version:content-type	:in-reply-to; s=default; bh=EqodYm4fxLOZPVSBQqboQeh3Cfg=; b=PCKH	jzyR1r5b5Ts3ri3YoHBsuet8QcUk0B+uUVXuknoWG7mp6GdBpXdWm3Q+LST8glNI	wnk8TxVTO7thWYTFSmTKkIg4xHv4SMR0500sxaXgvjoSuGue4nWu0dSugYSqn78U	I3Ydug7YvOe5/pPGevgjYrYxRryOSn7JxqP/AiI=
- Domainkey-signature: a=rsa-sha1; q=dns; c=nofws;  s=default; d=memoryhole.net;  b=QvpxM2P5ZfI0wL0mjh5b9hQw8o6i5r4dL59oSbenzs7fHVei/90t4sO3VWGT0k6qwRxzA7ZEeCDiCMYg/6aV4fAV3KftEgcYvNosZoDVAxmO671zz4NtTuGUn0AbqD+ezl4NkPQ8CT2txFvqqdLgD6YAFxsUzvI3wW85QiH6Amg=;  h=Received:Received:Date:From:To:Subject:Message-ID:Mail-Followup-To:References:MIME-Version:Content-Type:Content-Disposition:In-Reply-To:OpenPGP:User-Agent;
- In-reply-to: <e107b4ff0906081150j22cf02dfla579a4f0b31d177@xxxxxxxxxxxxxx>
- List-post: <mailto:mutt-users@mutt.org>
- List-unsubscribe: send mail to majordomo@mutt.org, body only "unsubscribe mutt-users"
- Mail-followup-to: mutt-users@xxxxxxxx
- Openpgp: id=CA8E235E; url=http://www.memoryhole.net/~kyle/kyle-pgp.asc;	preference=signencrypt
- References: <e107b4ff0906081150j22cf02dfla579a4f0b31d177@xxxxxxxxxxxxxx>
- Sender: owner-mutt-users@xxxxxxxx
- User-agent: Mutt/1.5.19 (2009-05-29)
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256
On Monday, June  8 at 02:50 PM, quoth James:
>I'm using html2text to view 90% of the messages that hit my Inbox.
>Unfortunately html2text shows a massive blurb of information at the
>top of every message sent from Microsoft Outlook, as seen below:
>
>-----
>
>="http://schemas.microsoft.com/office/excel/2003/xml"
>xmlns:ppda="http://www.passport.com/NameSpace.xsd"
>xmlns:ois="http://schemas.microsoft.com/sharepoint/soap/ois/"
>xmlns:dir="http://schemas.microsoft.com/sharepoint/soap/directory/"
>xmlns:ds="http://www.w3.org/2000/09/xmldsig#" xmlns:dsp="http://
>*snip*
>
>-----
>
>I understand that html2text is simply unable to read this junk since
>it's not a standard, but is there some way to repress the output?
Two suggestions. The first, to do literally what you want, would be to 
create a small wrapper script (e.g. ~/.html2textwrapper.sh) that runs 
html2text and runs sed (or something similar, like perl) on the output 
to remove those xmlns lines. Second... I've never seen lines like 
that, and I use w3m to render most of my html-based email into text. 
You may find it simpler to simply use something other than html2text.
I suppose it's also possible to get html2text to deal with them by 
asking the html2text developers... I have no experience with the 
program so I don't know if there's a built-in way of dealing with that 
stuff.
~Kyle
- -- 
The longer I live the more I see that I am never wrong about anything, 
and that all the pains that I have so humbly taken to verify my 
notions have only wasted my time.
                                                 -- George Bernard Shaw
-----BEGIN PGP SIGNATURE-----
Comment: Thank you for using encryption!
iQIcBAEBCAAGBQJKLWkXAAoJECuveozR/AWeJ48P/0voTm4lDAL5uytKGWMtwgxP
YyHYhE+r41tzqUuzaX2ywzzwZDpSAqY+gL/G6ApOuYcLqLM7mtGtiIqz007Yig4+
v5QspgyZ9NM5gTBb6Haa66+6xzbLaXnSEn5u2eh49jq/8d8rtza/AOscoFhBdFZR
/J396pA1NZ7PQpiLiv3xtjX/cj9DwyMO+lvuLEuZO6PNctX6IM88v9hqMINclu+Y
iu3ikrQaz3HT678Zv3Vc0sC6ccbSou6lo3ppQtdfF6DF6CVgHM7Tz5n3GxOvBOu8
m192wIZ5qec8gFp6xWOurxwwCfwD/8/6nXkFzZuf3nNflpi13V/3qILt4bcSgLAZ
JW/xuX1uECQdC/BSQKEPIpEW7pZUjAsCmK0FhbYKk7GQ29II6nk1l5TL68E/Cm1B
rz7vAxRsf6izJDc2iGyxAprzFVPaJhtequuJaTrHO9mts0nUluWxg4/AGqDW/cTd
9Ya3PZBhGoq+2ivp8YN1p+bkWaUViRkmejwCmFqob4BiPkwzDyGocogXCS/++o6t
WO6NGIUZUVlKyw4TWAuMkyzEReNUsnGUOtUbxGWA9+g1TM6dhflSFzObu5s2+DIN
+9cq0Dju/6CZaHW+1o+Pl/liwneIHhC/lgXGsHb3lVvkO+LqgzFuWF0F678kupMR
GiLKZcGIS8GL22N5P76s
=BRDT
-----END PGP SIGNATURE-----