Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Netscape Restores RSS DTD, Until July

Posted by CmdrTaco on Wed Jan 17, 2007 10:25 AM
from the that's-kinda-lame dept.
Randall Bennett writes "RSS 0.91's DTD has been restored to it's rightful location on my.netscape.com, but it'll only stay there till July 1st, 2007. Then, Netscape will remove the DTD, which is loaded four million times each day. Devs, start your caching engines."

Related Stories

[+] Ask Slashdot: Is Dedicated Hosting for Critical DTDs Necessary? 140 comments
pcause asks: "Recently there was a glitch, when someone at Netscape took down a page that had an important DTD (for RSS), used by many applications and services. This got me thinking that many or all of the important DTDs that software and commerce depend on are hosted at various commercial entities. Is this a sane way to build an XML based Internet infrastructure? Companies come and go all of the time; this means that the storage and availability of those DTDs is in constant jeopardy. It strikes me that we need an infrastructure akin to the root server structure to hold the key DTDs that are used throughout the industry. What organization would be the likely custodian of such data, and what would be the best way to insure such an infrastructure stays funded?"
This discussion has been archived. No new comments can be posted.
Display Options Threshold:
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • Redirect (Score:3, Insightful)

    And they can't set up a redirect to the new hosting location?
    • Re:Redirect by Aladrin (Score:1) Wednesday January 17 2007, @10:42AM
    • Re:Redirect by Otter (Score:3) Wednesday January 17 2007, @10:44AM
    • Re:Redirect (Score:4, Funny)

      by AndroidCat (229562) on Wednesday January 17 2007, @10:45AM (#17646984)
      (http://home.primus.ca/~ronsharp/tororg.html)

      HTTP/1.1 301 Moved Permanently
      Content-Type: text/html
      Location: http://127.0.0.1/
      [ Parent ]
    • Re:Redirect (Score:5, Insightful)

      by werewolf1031 (869837) on Wednesday January 17 2007, @10:45AM (#17646986)
      (http://werewolf.darkicon.com/)
      And they can't set up a redirect to the new hosting location?
      What in the world would be the point? That would merely duplicate the problem to a different location. As was clearly stated in the article by Mr. Finke, four-million hits every day is a crapload of bandwidth wasted re-downloading a file that will never change. The RSS 0.91 spec is finished, complete, and yes, for all intents and purposes, written in stone. Stop looking at it every damned day. It will not change. Ever. It's truly stupid for client-side software to be accessing it over the Internet to read its forever-static contents. That's like checking the writings of a dead poet every day to see if anything's changed.

      And any dev who codes his app to check a file like this every day instead of caching it client-side should be smacked oh-my-god-so-frickin-hard.
      [ Parent ]
      • Re:Redirect (Score:5, Funny)

        by AndroidCat (229562) on Wednesday January 17 2007, @10:59AM (#17647172)
        (http://home.primus.ca/~ronsharp/tororg.html)
        And any dev who codes his app to check a file like this...
        They might not even know that they're doing it if they're using Microsoft's Swiss Army Chainsaw XMLHTTP COM object and set the flags wrong.
        [ Parent ]
        • Re:Redirect by Frosty Piss (Score:1) Wednesday January 17 2007, @11:17AM
          • Re:Redirect by BobNET (Score:2) Wednesday January 17 2007, @11:24AM
            • 1 reply beneath your current threshold.
          • Re:Redirect by AndroidCat (Score:3) Wednesday January 17 2007, @11:29AM
        • 2 replies beneath your current threshold.
      • Re:Redirect by gbjbaanb (Score:2) Wednesday January 17 2007, @12:03PM
      • Re:Redirect by naChoZ (Score:3) Wednesday January 17 2007, @12:21PM
      • Re:Redirect by doom (Score:3) Wednesday January 17 2007, @02:07PM
    • URIs by Anonymous Coward (Score:1) Wednesday January 17 2007, @11:33AM
      • Re:URIs (Score:5, Informative)

        by Schraegstrichpunkt (931443) on Wednesday January 17 2007, @02:27PM (#17650624)
        (http://www.faqs.org/rfcs/rfc3675.html)
        No. This is the perfect example of why a URI is not necessarily supposed to be treated as a URL. http://my.netscape.com/publish/formats/rss-0.91.dt d is just a unique identifier for the RSS DTD. It used to also be hosted there as a convenience, but your software isn't supposed to rely on that.
        [ Parent ]
    • Re:Redirect by Albanach (Score:3) Wednesday January 17 2007, @12:41PM
    • Or mitigate with cache headers by Kelson (Score:3) Wednesday January 17 2007, @02:08PM
  • Not enough time!!!! (Score:5, Funny)

    by Anonymous Coward on Wednesday January 17 2007, @10:32AM (#17646770)
    Developers who made the mistake to use that external resource in their code most likely don't have the brain resources to adapt until July.

    (This is not a troll. Resignation and bitterness, maybe. But not a troll.)
    • Re:Not enough time!!!! (Score:5, Funny)

      by Anonymous Coward on Wednesday January 17 2007, @10:39AM (#17646900)
      That is kind of like declaring PI to be a volatile double variable, in case it changes in real time...
      [ Parent ]
      • pi meter by shani (Score:2) Wednesday January 17 2007, @11:58AM
        • Re:pi meter by nuzak (Score:2) Wednesday January 17 2007, @12:38PM
        • Re:pi meter by Anonymous Coward (Score:2) Wednesday January 17 2007, @01:05PM
          • Re:pi meter by alienmole (Score:3) Wednesday January 17 2007, @06:38PM
        • Re:pi meter by T-Ranger (Score:2) Wednesday January 17 2007, @05:19PM
      • 2 replies beneath your current threshold.
    • Re:Not enough time!!!! by Pollardito (Score:2) Wednesday January 17 2007, @12:02PM
  • Why can't we just move it? (Score:4, Interesting)

    by Anonymous Coward on Wednesday January 17 2007, @10:34AM (#17646806)
    Developers should take the opportunity to move to Atom. In the mean time we could use something as simple as round-robin DNS to share the load or have Mozilla, Google or the internet archive host it. It's a historical document and should reside at a permanent URI.
    • Re:Why can't we just move it? by Antique Geekmeister (Score:2) Wednesday January 17 2007, @03:57PM
    • Re:Why can't we just move it? by Anonymous Coward (Score:1) Wednesday January 17 2007, @10:49AM
    • Re:Why can't we just move it? by metamatic (Score:3) Wednesday January 17 2007, @12:08PM
      • URLs, URIs and URNs 101 (Score:5, Informative)

        by metamatic (202216) on Wednesday January 17 2007, @01:26PM (#17649716)
        (http://www.pobox.com/~meta/ | Last Journal: Sunday February 29 2004, @09:19AM)
        URLs are a subset of URIs. A URL defines a location where a resource can be accessed. A URI may merely be the name of a resource, i.e. a URN.

        For example, globally unique IDs in Atom feeds are often URNs, and hence URIs; but URNs aren't URLs, and you shouldn't need or want to try to connect to something just because it's used as a globally unique identifier in an Atom feed and looks a bit like a URL.

        This is relevant because many Internet specifications use URNs (or in the case of HTML, FPIs) as spec identifiers. For instance, XML namespace identifiers are URIs; and while some of them happen to be URLs too, the XML namespace recommendation [w3.org] says:

        The namespace name, to serve its intended purpose, should have the characteristics of uniqueness and persistence. It is not a goal that it be directly usable for retrieval of a schema (if any exists).

        In the case of RSS 0.91, Netscape wrote the spec, and they used a URL and told people to connect to it to fetch the necessary information to parse the file. They could have used a URN, but I'm guessing they wanted to keep their options open as far as changing the spec on the fly.

        (Of course, Dave Winer has a different approach to changing RSS specs on the fly...)
        [ Parent ]
      • 1 reply beneath your current threshold.
    • 1 reply beneath your current threshold.
  • CmdrTaco (Score:5, Funny)

    by MagicM (85041) on Wednesday January 17 2007, @10:40AM (#17646906)
    Netscape Restores RSS DTD, Until July - from the that's-kinda-lame dept.
    Two Stargate SG1 Films Announced - from the good-for-them dept.
    Linux: x86 Linux Flash Player 9 is Final - from the i-still-hate-flash dept.

    Looks like somebody is having a case of the mondays.

    (On Wednesday.)
  • I don't get it (Score:5, Interesting)

    by Thansal (999464) on Wednesday January 17 2007, @10:40AM (#17646910)
    I admit, I am not familiar enough with RSS. However this is a 2.3KB file that is not supposed to change. Why would developers NOT hardcode it into their RSS tools?
    • by 140Mandak262Jamuna (970587) on Wednesday January 17 2007, @10:51AM (#17647046)
      (Last Journal: Wednesday October 31, @08:33AM)
      No one ever writes a new XML (and most other Web2.0) application from the scratch. They all take an app they are familiar with and modify it to do new things. And some of the initial boot-strap processes are never looked into. If it works, dont mess with it attitude is pervasive. So someone long ago may be in a galaxy far away wrote an application that replicated and mutated by developers and others took it and did more mutations and it propagated. One side effect of this and similar cut&paste code development tactics is that bugs, security holes, inefficient algorithms, brain dead implementations also propagate.

      Richard Dawkins asks this very fundamental question, why reproduce (sexually or asexually) using seeds and embryos? Why not propagate by cuttings and cloning? It happens in nature. Many fern like plants do it. Bananas have been reproducing by new shoots. Then he discusses how harmful mutations too propagage and how going back to the basics and recreating the embryo selects the beneficial mutations and puts a check on deletrious mutations. Books The Selfish Gene, Climbing the Mount Improbable.

      [ Parent ]
    • Re:I don't get it (Score:5, Informative)

      by jrumney (197329) on Wednesday January 17 2007, @10:58AM (#17647138)
      (http://jasonrumney.net/)

      Developers use off the shelf XML parsers, which generally take care of validation for you. Netscape created this problem themselves when they stated in the spec for RSS 0.91 that well-formedness was not enough, RSS 0.91 feeds should be validated against the DTD. They then specified that document authors must use a PUBLIC doctype specifier, so the option of using a SYSTEM one (where the DTD is looked up in a local catalog) is not an option.

      [ Parent ]
    • Re:I don't get it by rholliday (Score:3) Wednesday January 17 2007, @12:46PM
  • Let's be Evil (Score:2, Interesting)

    by hackershandbook (963811) on Wednesday January 17 2007, @10:52AM (#17647056)
    .. and I thought it was only Microsoft and Google that tried to "break the web" on purpose ....
  • "Caching" not the answer (Score:5, Informative)

    by KrisWithAK (32865) on Wednesday January 17 2007, @11:01AM (#17647202)
    As I replied for the previous Netscape RSS DTD article http://slashdot.org/comments.pl?sid=216818&cid=176 03480 [slashdot.org], caching DTDs from the network is not the answer if there is the possibility they will not be there in the future:

    The proper thing to do is for your application to use an XML catalog for resolving entities/URIs and bundle the DTD files with the application. There is a good article at http://xml.apache.org/commons/components/resolver/ resolver-article.html [apache.org] that helped me out. In addition, if you are using Eclipse with the web tools platform, you can customize the catalog so it resolves DTDs and entities locally. See http://wiki.eclipse.org/index.php/Using_the_XML_Ca talog [eclipse.org].
  • Technical vs. Emotional (Score:5, Interesting)

    by mmurphy000 (556983) on Wednesday January 17 2007, @11:03AM (#17647226)

    (I tried posting this as a reply to the blog posting, but I'm not getting the confirmation email, so I'll post it here)

    From a purely technical standpoint, I agree with your assertion that, for well-baked files like RSS DTDs, clients should not be relying on a file hosted by an arbitrary service.

    That being said, please understand that the emotional message you're sending is: "Don't rely on Netscape".

    Why?

    Back when RSS was first starting out, Netscape's documentation said to use Netscape URLs for the RSS DTDs. Witness this page [archive.org], published by Netscape, from late 2000:

    Now, a shade over six years later, Netscape is saying "Oh, yeah, what we told you to do? Never mind. We're not supporting it any more."

    If Netscape/AOL was shutting its doors, that'd be one thing. If the service in question was obviously onerous, that too would be understandable. Or, if Netscape told people "For the love of all that is holy, don't use our URLs for your DTD needs!" from the get-go (based on that document, you didn't), any such reliance would be our own fault.

    But, because AOL does not want to serve up two static files, each of which is smaller than the "Netscape Reports" graphic on the netscape.com home page, Netscape is abandoning a service they told people to use.

    So what are we to think about Netscape's current services and their long-term usability?

  • dumb (Score:1, Funny)

    by Anonymous Coward on Wednesday January 17 2007, @11:05AM (#17647258)
    "should feed readers be relying on the availability of a static document on a third-party Web server (and thus a connection to the Internet)?"

    Yeah, feed readers don't need the internet at all! What WERE you guys thinking?
    • Re:dumb by AndroidCat (Score:1) Thursday January 18 2007, @09:19AM
  • by gr8_phk (621180) on Wednesday January 17 2007, @11:06AM (#17647272)
    I never understood why web pages need references to these external things (or do they?). Why embed into a page a pointer to a document that you don't have direct control over? My own dumb pages do this as well since I switched from plain HTML to using CSS and SVG, but I don't have the time to figure out why it's in there or if it's needed. I just pasted it in like the examples I found. Now if I thought my web page was really important, I'd look into this a bit more...
  • by MillionthMonkey (240664) on Wednesday January 17 2007, @11:25AM (#17647558)
    (Last Journal: Wednesday January 31 2007, @02:25AM)
    You have five months to update your apps to use RSS DTD version 0.92!
  • Better than alternative (Score:1, Insightful)

    by Anonymous Coward on Wednesday January 17 2007, @11:48AM (#17648024)
    It seems to me that having the ability to track the src and dest address of every website viewed (nearly) would be a huge financial gain to companies willing to sell that information. Netscape (read AOL) never really struck me as a "feel good, do good" company and I am surprised that they would not try to profit off of this. I distinctly remember thinking this as motive back when they declared everyone must use their DTD in the first place.
  • First woodpecker... (Score:3, Insightful)

    by kabdib (81955) on Wednesday January 17 2007, @12:09PM (#17648386)
    (http://www.dadhacker.com/)
    This is why whenever I hear the words "architecture" and "web" in the same sentence that I snicker. Unpolite, but OMFG who designed this junk?

    Oh, right. Nobody, really. It's amazing it works at all (... and sometimes it doesn't!)

    Djikstra's quip, "If programmers build houses they way they built programs, the first woodpecker to come along would topple civilization" was and remains insightful.
  • by Omnifarious (11933) * on Wednesday January 17 2007, @12:09PM (#17648392)
    (http://www.omnifarious.org/~hopper/ | Last Journal: Tuesday October 02, @12:21PM)

    The web needs some scheme for content based addressing. Like the urn:sha1 scheme used in gnutella. This (and some sort of reasonable caching scheme) would do a lot to alleviate problems like this. It could also help a lot with the Slashdot effect.

  • by untoldone (839361) on Wednesday January 17 2007, @12:15PM (#17648504)
    (http://blog.bluenodes.com/)
    With my very limited understanding of the W3C and such, the the netscape blog post seems very anti-W3C ... as Daniel Glazman commented on the blog post

    "cool URIs never change". This sentence has been on W3C's site for ages, constantly repeated by W3C staff in Web conferences.
    However, it's not like the author of the post didn't have valid reasoning. That IS a lot of bandwidth, why should netscape be made to foot the bill when they don't get anything in return? No one sees any of their pretty ads when the users' machines just fetch the DTD -- netscape gets nothing off of hosting it ... beyond the ability to feel good about themselves because they keep RSS 0.91 stable for everyone else. I didn't even know netscape hosted this until I read the blog post ... Not having much experience with it, this seems like the exact same issue I had with the entirety of how semantic web stuff works: Lots of machine readable documents must be fetched from many different sources in order for one service to work. W3C says that this should work cause documents don't go away, but generally this is not the case -- companies go bankrupt, services change ownership (and domain). I feel like someone needs to work out (and use in practice) a system of distributing static documents like this consistently over lots of places so when one source dies or changes location -- the services that rely on it don't just stop.
  • by gamer4Life (803857) on Wednesday January 17 2007, @01:08PM (#17649392)
    Ever since they took DevEdge offline without warning, I've been weary to rely on them as a resource. This is just another step toward insignificance.
  • by Animats (122034) on Wednesday January 17 2007, @01:16PM (#17649518)
    (http://www.animats.com)

    Don't get it from archive.org. The Internet Archive isn't really set up to have a huge number of quick retrievals of the same tiny item. There's no front-end cache farm, and response will be slow. (This was a problem after they started archiving Greatful Dead fan recordings. The Deadheads, many of whom did too many drugs in the 1960s, would stream the same audio, over and over and over. The music archive had to be moved to a completely different system.)

    Try to get this hosted by "w3c.org", which hosts other DTDs and seems to do a good job.

  • whoops (Score:1)

    by mastershake_phd (1050150) on Wednesday January 17 2007, @10:32AM (#17646760)
    (http://freedomsforums.com/)
    The unavailability of this file had the effect of causing certain feed readers - Microsoft's Live.com RSS gadget, for one - to refuse to display RSS 0.91 feeds

    Cant they sue Microsoft for stealing bandwidth, and bad design?
    [ Parent ]
  • Re:hmm (Score:1)

    by mastershake_phd (1050150) on Wednesday January 17 2007, @10:38AM (#17646884)
    (http://freedomsforums.com/)
    You can't sue Microsoft for bad design!

    They're still in business, aren't they?


    I know, i was trying to be funny.....hey so where you.
    [ Parent ]
  • Probably (Score:2, Funny)

    by Anonymous Coward on Wednesday January 17 2007, @10:43AM (#17646956)
    As that would give Google another way to track your every online move.
    [ Parent ]
  • by flyingfsck (986395) on Wednesday January 17 2007, @10:55AM (#17647104)
    You forgot 'there' ==> "their" and "forget" ==> "forgot"...
    [ Parent ]
  • Re:Spelling issue (Score:3, Informative)

    by JasonKChapman (842766) on Wednesday January 17 2007, @11:32AM (#17647680)
    (http://www.jasonkchapman.com/)
    What's so fucking hard about spelling "its" correctly?

    An old Jedi mind trick:

    Its apostrophe is missing, because it's been moved over here.

    [ Parent ]
  • Re:Its (Score:1, Informative)

    by Anonymous Coward on Wednesday January 17 2007, @11:49AM (#17648048)

    I think you should subscribe to this newsgroup [google.com].

    [ Parent ]
  • Re:OR (Score:1)

    by Simetrical (1047518) <Simetrical+sd@gmail.com> on Wednesday January 17 2007, @10:20PM (#17658262)

    If everyone overlooked petty grammar mistakes then grammar would cease to exist. Not overnight, but over time.

    What you said is wrong, and what you meant is trivial. Grammar is a product of the human brain, almost all learned unconsciously. I'm sure anyone here who's looked at AI language can attest that the important parts of grammar do not boil down to simple rules that are taught in school, or indeed could possibly be taught in school. Anyone who's used state-of-the-art translators like Google's should be able to infer that too. If everyone started talking like hicks (or some other low-class group) tomorrow, we'd all be able to understand each other perfectly. The only "grammar" that would cease to exist would be the frequently bizarre and definitely unnecessary rules by up by a handful of generations of prescriptivists starting in the 1800s, and frankly, I think we could do without that kind of grammar.

    None of that is to say that it doesn't annoy me when people don't put in the effort to use standard spellings, or even that I don't think that people should be corrected on points of grammar (because being able to speak and write in high-prestige standard dialects is a socially useful tool whether I like it or not). But to say that the sky will fall if people don't correct each other's grammar, or that speakers of nonstandard dialects are "wrong", is absurd.

    Incidentally, I would highly recommend The Language Instinct by Steven Pinker. Very compelling book.

    [ Parent ]
    • 1 reply beneath your current threshold.
  • 9 replies beneath your current threshold.