SyntaxHighlighter

Thursday, February 28, 2008

What do URLs "mean": tei-c.org

Does anybody besides me find this bizarre:

http://www.tei-c.org/index.xml

gets you:


<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">


http://www.tei-c.org/index.xml?style=raw

gets you:


<?xml version="1.0" encoding="UTF-8"?>
<?oxygen RNGSchema="http://www.tei-c.org/cms/system/modules/org.tei.www/_common/schemas/teilite.rnc" type="compact"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0" rend="home">


http://www.tei-c.org/index.xml?style=printable

gets you:


<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">


Why not:

http://www.tei-c.org/index.html
http://www.tei-c.org/index.xml
http://www.tei-c.org/index-printable.html


or some such (in which filename extensions actually bear some relationship to what the user gets, rather than the hidden underlying format or application/file hierarchy on the server) -- and thus throughout the whole website.

3 comments:

Gabriel Bodard said...

I don't know, but I'll pass this on to the Council, someone among whom no doubt has an opinion. :)

Unknown said...

And here's a good answer: quoting Sebastian Rahtz and Syd Bauman:

By [this] argument, all the .php and .asp pages in the world are wrong too. It's actually quite common for the suffix to reflect the source, not the output. (SR)

And Syd:

On the web it has been commonplace for years now to have in the URL [an extension referring to the type of data in] the source file that you ask the server for, rather than the type of data that the server returns in the URL (some of that information is stored elsewhere in the HTTP header stuff).

Syd adds that the combining names and types of data together are bad practice anyway, but what can you do?

Unknown said...

I would not disagree that unhealthy urls like this are commonplace; the TEI website is not alone or worst.

Syd's last point is actually the point. Apropos which there's an established meme on sane/cool ur[i|l]s.

What can one do? Do something better!

Drop extraneous or ambiguous extensions entirely then:

http://www.tei-c.org/Guidelines
http://www.tei-c.org/Guidelines.tei
http://www.tei-c.org/Guidelines-printable

I should say, I applaud the tei website design for making the three avatars of each resource separately addressable (instead of switched by javascript madness or some such), and discoverable by clicking through links.