Skip to content

Biting the hand that feeds IT

The Register ®

Software:


Related Whitepapers

Comments on ‘Tackling Apache zombies’

Myths and legends laid to rest

Published Tuesday 1st August 2006 09:54 GMT

« Back to article page

Not very accurate 

By Anonymous Coward
Posted Wednesday 2nd August 2006 20:22 GMT

Perhaps the author should read and understand RFC2396 and then rewrite the last section of his article. In that reading and understanding he might want to notice that the RFC is about URIs not about URLs, that section 6 says nothing at all about case sensitivity being required in any component of a URI (it does point out that it can't be required in certain components, but that doesn't mean it is required in the components not mentioned), and that section 2.4.2 leads straight to URIs in the http scheme that differ in case in teh file component of the URL and are required (by the escape rules) to identify the same resource (for example http://abd.com/a%2bc locates the same resource as http://abd.com/a%2Bc despite the case difference between b and B). His claim that the RFC unambiguously defines URLs to be case sensitive in the file part is pure balderdash. Maybe he could read appendix G, which says quite clearly that this RFC doesn't define any URL schems?

Sophistry? 

By David Norfolk
Posted Friday 4th August 2006 09:51 GMT

Interesting comment, but (I think) at a different level to this article. Not everything of value appears in a textbook and sometimes the point of a piece is to encourage people to delve more deeply.

Anyway, I asked Nick about your point and here is what he said:

This comment is sophistry.

A URL is defined in the RFC as a string of octets. Representing them as ASCII letters is merely a convenience for humans. But 'A' and 'a' are decimal 65 and 97 respectively. In the absence of an equivalence relation, they are unambiguously different.

Of course %2B == %2b, or decimal 43 / ASCII '+'. That's URLescaping, and is a separate issue to the one I was talking about (strictly speaking, unescaped character representations of URLs). I was talking about the representation that may relate to the filesystem, or be affected by mod_speling, mod_rewrite, etc.

I was also trying (evidently failing) to simplify the issue by not talking about components that are explicitly case insensitive (hostnames), or some intermediate situation (protocols - defined to be lowercase, but implementations should error-correct for uppercase).

--

(from Nick Kew)

whitepaper title

Gartner Paper: US Data Centers

U.S. enterprise data centers face considerable space and energy constraints over the next few years. Download this free independent report to read more..
whitepaper title

The Perfect (Virtual) Marriage

Get consistent virtual machine storage savings of 50% (often as high as 90%) with virtually no performance impact with NetApp deduplication..

The MSDN Developer Zone

Top 20 storiesAll The Week’s HeadlinesArchiveSearch