
Re: WebVulnCrawl searching excluded directories for hackable web servers



Hi Michael, thanks for noticing my project.  You've pretty much made
my point about why I'm doing this: robots.txt shouldn't be used as an
ACL.  That's exactly why I'm doing this crawling.  As I've stated on
my blog, I'm trying to find out how widespread an issue this is and
how robots.txt is being misused.
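
To illustrate why robots.txt can't serve as an ACL: anything listed
in it is readable by any client that asks for it.  Here is a rough
sketch of the idea in Python (example.com is just a placeholder, and
this is not my actual crawler code):

    # Rough sketch: fetch a site's robots.txt and print the paths it
    # asks crawlers to skip.  Anyone on the internet can do the same,
    # which is why listing "secret" directories there hides nothing.
    import urllib.request

    def disallowed_paths(host):
        url = "http://%s/robots.txt" % host
        with urllib.request.urlopen(url) as resp:
            text = resp.read().decode("utf-8", errors="replace")
        paths = []
        for line in text.splitlines():
            line = line.split("#", 1)[0].strip()   # drop comments
            if line.lower().startswith("disallow:"):
                paths.append(line.split(":", 1)[1].strip())
        return paths

    if __name__ == "__main__":
        for path in disallowed_paths("example.com"):
            print(path)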

In both of your posts you've claimed to have evidence of port
scanning, and in this most recent post you add that I'm hitting .org
and .gov sites; both claims are false.  Since you've made no attempt
to contact me directly about what I'm doing, or to verify that I am
in fact doing either of these things, I'd like to ask you to share
that evidence so we can determine what is going on.  The data you've
linked to shows mostly port 80 access, plus some port 9999 access
that I suspect comes from a redirect; a redirect is probably also why
you believe I'm intentionally hitting .org sites.  Nothing in that
data verifies your claims.  If you have other data to back up these
allegations, please share it.

Dennis Brown
http://webvulncrawl.blogspot.com

On 3/29/06, Michael Scheidell <scheidell@xxxxxxxxxx> wrote:
> Just a quick followup and clarification:
>
> > -----Original Message-----
> > From: Michael Scheidell
> > Sent: Wednesday, March 15, 2006 8:38 AM
> > To: bugtraq@xxxxxxxxxxxxxxxxx
> > Subject: WebVulnCrawl searching excluded directories for
> > hackable web servers
> >
> >
> > What he is doing is a violation of the RFC governing
> > robots.txt (yes, hackers do that also)
>
> There was an Internet-Draft for robots.txt proposed in 1996, but it
> was never adopted as an RFC.
>
> >
> > The robots.txt file is NOT AN ACCESS CONTROL LIST and SHOULD
> > NOT BE USED TO 'HIDE' DIRECTORIES.  ALL DIRECTORIES SHOULD BE
> > PROTECTED AGAINST DIRECTORY LISTING.
>
> Someone mentioned that sometimes you do want directory listings.
> In that case, turn off directory listing just for the directories
> you don't want listed.
> (I don't know why you would put them in robots.txt instead.)
>
> The WebVulnCrawl blog stated he was only hitting .com sites.
> I have evidence he has moved on to .org sites and, in fact, has hit
> a US government site as well.
> I would hope that US government IT security folks know not to use
> robots.txt as an ACL, but the web folks aren't always security folks
> (web applications themselves are sometimes prone to SQL injection,
> XSS attacks, and PHP coding errors), and since there is a large gap
> between applications and web development, the chances of
> accidentally gathering information that should not be gathered are
> huge.
>
> Every security person should review the robots.txt file on their own
> web site for what it may be revealing.
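
A quick, rough way to do that review, sketched in Python (hypothetical
host and example paths; the "Index of" check is only a heuristic for
Apache-style auto-index pages): request each directory your robots.txt
disallows and see what the server sends back.

    # Rough sketch: probe paths your robots.txt disallows and flag
    # responses that look like an auto-generated directory index.
    # HOST and PATHS below are placeholders, not real targets.
    import urllib.request

    HOST = "example.com"
    PATHS = ["/admin/", "/backup/", "/logs/"]   # e.g. from robots.txt

    for path in PATHS:
        url = "http://%s%s" % (HOST, path)
        try:
            with urllib.request.urlopen(url) as resp:
                body = resp.read().decode("utf-8", errors="replace")
        except Exception as exc:
            print("%s -> %s" % (path, exc))
            continue
        verdict = "directory listing?" if "Index of" in body else "ok"
        print("%s -> %s" % (path, verdict))
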
>
> >
> > Further, dshield shows them portscanning the net also,
> > looking for unpublished information on unpublished servers.
> http://www.dshield.org/ipinfo.php?ip=216.179.125.69&Submit=Submit
>
> So does mynetwatchman:
>
> http://www.mynetwatchman.com/LID.asp?IID=178401366
>
> --
> Michael Scheidell, CTO
> 561-999-5000, ext 1131
> SECNAP Network Security Corporation
>