Re: extract_url.pl - version 1.3



    On Tue, May 20, 2008 at 07:14:21PM -0500, Kyle Wheeler wrote:

    >The original reason for this script was because urlview doesn't 
    >correctly handle format=flowed email or any other email encodings, so 
    >URLs are often mishandled or simply broken. This script handles all 
    >known encodings *correctly* (when fed the raw email). It can be used 
    >either as a standalone script (which requires the Curses::UI perl 
    >module) or as a pre-filter for urlview.

Ahh, now this is what I like to hear.

I have a few questions:

  1. What is meant by "format=flowed email" ?
  2. What are the "known encodings" ?

I often see broken links in the body of my emails and I don't know why.
For example, this link is meant to look like:

   http://odinr.dcb.defence.gov.au/uhtbin/cgisirsi/MhOktEUDHs/DSTOE/242330010/60/54/X

But I will always see it like this in mutt:

   http://odinr.dcb.defence.gov.au/uhtbin/cgisirsi/MhOktEUDHs/DSTOE/2423300
   10/60/54/X

When I look at the raw spool file (independent of mutt) I see:

   <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
   <HTML>
   <HEAD>
   <META HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; =
   charset=3Dus-ascii">
   <META NAME=3D"Generator" CONTENT=3D"MS Exchange Server version =
   6.5.7652.24">
   <TITLE>Link to catalogue</TITLE>
   </HEAD>
   <BODY>
   <!-- Converted from text/rtf format -->
   <BR>

   <P><A =
   HREF=3D"http://odinr.dcb.defence.gov.au/uhtbin/cgisirsi/MhOktEUDHs/DSTOE/=
   242330010/60/54/X"><U><FONT COLOR=3D"#0000FF" SIZE=3D2 =
   FACE=3D"Arial">http://odinr.dcb.defence.gov.au/uhtbin/cgisirsi/MhOktEUDHs=
   /DSTOE/242330010/60/54/X</FONT></U></A>
   </P>

Would your script deal with this annoying problem (which I still don't
understand)? If it would ... I am going to use it permanently :)
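
A note on the raw excerpt above: the `=3D` sequences and the lone `=` at
the end of lines are what quoted-printable transfer encoding (RFC 2045)
looks like. `=3D` is an escaped `=`, and a trailing `=` is a soft line
break that a decoder removes, so the two halves of the URL join up again.
The following is only a minimal Python sketch of that decoding step using
the standard `quopri` module. It is not how extract_url.pl works (that is
a Perl script); the names `raw`, `decoded` and `urls` are illustrative,
and the HREF regex is an assumption that fits this one sample rather than
a general HTML parser.

```python
import quopri
import re

# Fragment of the raw (quoted-printable) message body, copied from the
# spool excerpt above with the display indentation removed.
raw = (
    b'<P><A =\n'
    b'HREF=3D"http://odinr.dcb.defence.gov.au/uhtbin/cgisirsi/MhOktEUDHs/DSTOE/=\n'
    b'242330010/60/54/X"><U><FONT COLOR=3D"#0000FF" SIZE=3D2 =\n'
    b'FACE=3D"Arial">http://odinr.dcb.defence.gov.au/uhtbin/cgisirsi/MhOktEUDHs=\n'
    b'/DSTOE/242330010/60/54/X</FONT></U></A>\n'
)

# Decoding turns "=3D" back into "=" and drops each "=" + newline soft break.
decoded = quopri.decodestring(raw).decode("us-ascii")

# Assumption: a simple regex is enough to pull the HREF out of this sample.
urls = re.findall(r'HREF="([^"]+)"', decoded)
print(urls[0])
```

Once the body is decoded, the HREF value comes out as one unbroken URL,
which is why a tool needs the raw email and has to decode it before it
looks for links.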

 -aW
