Hello,
On 04/08/2007, Tim Seifert wrote in Digest Number 1935:
>> The head of the document includes:
>>
>> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
>> "http://www.w3.org/TR/html4/loose.dtd">
>
> It doesn't look typographically incorrect, here's the one from the HTML
> specifications:
>
> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
> "http://www.w3.org/TR/html4/loose.dtd">
Which is what I used.
>
> But we'd need to see your actual page, to be sure. Is it just the
> articles one that you mention below, or other pages on the site. The
All of them, but I've changed them as of this morning.
> page checked out fine, when I tried it with the W3C validator, using my
> browser to send a copy of the file. Likewise, it checked out fine with
> the WDG one being given the address directly. But putting the articles
> address directly into the W3C got the tentative assessment. That
> suggests server or validator issues.
Which browser did you use to send the file? Ibrowse? And you sent it how? By
upload, or be referring it by URI?
I'm asking because with either method using IBrowse to either upload or
direct the validator to the page, the page does NOT pass validation.
Which then causes me to speculate if IBrowse does contain a glitch after
all, given that you do not find a problem with the validation? However, the
same problem occurs with AWeb, so I'm inclined to think it's not an IBrowse
problem.
>
> I seem to recall reading problems with validation when the DOCTYPE
> wasn't split in two lines, as above. But that was a bug in the
> validator, it's not an important matter. Likewise, I can remember
> hearding grumblings about its handling of CHARSET information in the
> META statements. It was years ago that I heard those comments.
>
>> <HTML>
>> <HEAD>
>> <META HTTP-EQUIV="Content-Type" CONTENT="text/html;CHARSET=iso-8859-1">
>
> Try it with a space after text/html; before the CHARSET. Though I
> noticed that your file is like that when I tested it. But it really
> should be done with HTTP headers, not as part of the file.
>
>> <META NAME="AUTHOR" CONTENT="KRONSFELD CEMETERY">
>> <META NAME="revisit-after" CONTENT="15-days">
See other messages posted re. combinations of spacing and case. It's the
CHARSET word that causes it to choke if if it's uppercase. It must be
lowercase. And I think that is a bug.
You say it should be done with HTTP headers, but what should be the
construction of an http header? Are you saying and http header this can be
part of the html document? I don't find any information in the html
specification on how to construct and http header. Isn't sending an http
header the server's responsibility?
Well, I supposed this is actually some kind of http header,
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
[Note to users: please don't copy the line above, because I've deliberately
inserted non-breaking spaces for illustration sake]
If you would be so kind as to offer an example of a correct http header,
that would be appreciated 
>> Perhaps it's the server delivering the pages that is causing the problem.
>
> [tim
bigblack ~]$ lynx --head
> http://www.mts.net/~ejunrau/kronsfeld/articles.html
>
> HTTP/1.1 200 OK
> Server: Sun-ONE-Web-Server/6.1
> Date: Sat, 04 Aug 2007 06:57:11 GMT
> Content-length: 3591
> Content-type: text/html
> Last-modified: Fri, 03 Aug 2007 17:07:17 GMT
> Accept-ranges: bytes
> Connection: close
>
> Looks okay, albeit it doesn't have the charset information that you're
> providing with the META statement. This is where it should really be.
That suggests that your server is also case sensitive when it comes to that
line in question. I've uploaded new versions of the pages. Try it again and
see what happens.
> If you can't configure the server, you might want to see if it accepts
I don't see how I can configure the server, but I could bring this to the
attention of the ISP's tech support and see what they say. Note that the
ISP's server delivering the pages for personal web pages such as at
http://www.mts.net/~ejunrau/ is case sensitive. I belive it's some kind of
unix-based server. for example "InDeX.htMl" is not the same as
"index.html". Apparently their "business" server is or was something to do
with a Windows 2000 server when I last had occasion to ask about it.
> you putting .htaccess files on the server. That might allow you to do
> something like the following to set the HTTP headers correctly:
>
> AddDefaultCharset iso-8859-1
>
> That's actually for the Apache server, but the technique is sometimes
> copied by other servers. Of course, be certain that you are using that
> charset.
>
> If you serve plain text files anywhere on the server, you really NEED to
> be setting the charset in the HTTP headers. Only a HTML file can use a
To echo my question above, how is that done?
> META statement to do what should be done by the HTTP headers. Without
> information about the charset there's some common, and sometimes
> problematical, assumptions: Unidentified by MIME text is treated as
> ASCII. The old default for unidentified webserved text was iso-8859-1.
> A new assumed default is UTF-8 (which is compatible with ASCII, but not
> iso-8859-1).
That is indeed what the validation server is doing, falling back to UTF-8
since it can't find the charset without the "fixed" syntax.
>> So I changed the code to read,
>>
>> <META http-equiv="content-type" content="text/html;charset=iso-8859-1">
>>
>> instead of,
>>
>> <META HTTP-EQUIV="Content-Type" CONTENT="text/html;CHARSET=iso-8859-1">
>>
>> and it passes validation. I didn't take the time to test out if only the
>> CHARSET parameter needed to be lowercased -- perhaps Bonnie tested that
>> -- but I thought for good measure I'd do the rest of the line also.
>>
>> I don't think this is correct validator behavior. Since when is html case
>> sensitive? Granted that CHARSET in the way it is used there is a
>> parameter and not a tag, IMHO, but I still don't think that justifies
>> that behavior.
>
> It's not HTML that's case sensitive, so to speak, in this regard, but
> the parser. XHTML is case sensitive. I don't recall whether HTTP
> header names are, though do recall reading that the parameters aren't
> (e.g. ISO-8859-1).
>
> This is a HTTP "Content-Type" header as received from a server (below).
> Note the case of the entire thing (header name, and header contents).
>
> Content-Type: text/html; charset=iso-