<<< Date Index >>>     <<< Thread Index >>>

Re: html2text + microsoft schemas



-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

On Monday, June  8 at 02:50 PM, quoth James:
>I'm using html2text to view 90% of the messages that hit my Inbox.
>Unfortunately html2text shows a massive blurb of information at the
>top of every message sent from Microsoft Outlook, as seen below:
>
>-----
>
>="http://schemas.microsoft.com/office/excel/2003/xml";
>xmlns:ppda="http://www.passport.com/NameSpace.xsd";
>xmlns:ois="http://schemas.microsoft.com/sharepoint/soap/ois/";
>xmlns:dir="http://schemas.microsoft.com/sharepoint/soap/directory/";
>xmlns:ds="http://www.w3.org/2000/09/xmldsig#"; xmlns:dsp="http://
>*snip*
>
>-----
>
>I understand that html2text is simply unable to read this junk since
>it's not a standard, but is there some way to repress the output?

Two suggestions. The first, to do literally what you want, would be to 
create a small wrapper script (e.g. ~/.html2textwrapper.sh) that runs 
html2text and runs sed (or something similar, like perl) on the output 
to remove those xmlns lines. Second... I've never seen lines like 
that, and I use w3m to render most of my html-based email into text. 
You may find it simpler to simply use something other than html2text.

I suppose it's also possible to get html2text to deal with them by 
asking the html2text developers... I have no experience with the 
program so I don't know if there's a built-in way of dealing with that 
stuff.

~Kyle
- -- 
The longer I live the more I see that I am never wrong about anything, 
and that all the pains that I have so humbly taken to verify my 
notions have only wasted my time.
                                                 -- George Bernard Shaw
-----BEGIN PGP SIGNATURE-----
Comment: Thank you for using encryption!

iQIcBAEBCAAGBQJKLWkXAAoJECuveozR/AWeJ48P/0voTm4lDAL5uytKGWMtwgxP
YyHYhE+r41tzqUuzaX2ywzzwZDpSAqY+gL/G6ApOuYcLqLM7mtGtiIqz007Yig4+
v5QspgyZ9NM5gTBb6Haa66+6xzbLaXnSEn5u2eh49jq/8d8rtza/AOscoFhBdFZR
/J396pA1NZ7PQpiLiv3xtjX/cj9DwyMO+lvuLEuZO6PNctX6IM88v9hqMINclu+Y
iu3ikrQaz3HT678Zv3Vc0sC6ccbSou6lo3ppQtdfF6DF6CVgHM7Tz5n3GxOvBOu8
m192wIZ5qec8gFp6xWOurxwwCfwD/8/6nXkFzZuf3nNflpi13V/3qILt4bcSgLAZ
JW/xuX1uECQdC/BSQKEPIpEW7pZUjAsCmK0FhbYKk7GQ29II6nk1l5TL68E/Cm1B
rz7vAxRsf6izJDc2iGyxAprzFVPaJhtequuJaTrHO9mts0nUluWxg4/AGqDW/cTd
9Ya3PZBhGoq+2ivp8YN1p+bkWaUViRkmejwCmFqob4BiPkwzDyGocogXCS/++o6t
WO6NGIUZUVlKyw4TWAuMkyzEReNUsnGUOtUbxGWA9+g1TM6dhflSFzObu5s2+DIN
+9cq0Dju/6CZaHW+1o+Pl/liwneIHhC/lgXGsHb3lVvkO+LqgzFuWF0F678kupMR
GiLKZcGIS8GL22N5P76s
=BRDT
-----END PGP SIGNATURE-----