Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Dark Corners of the OpenXML Standard

Posted by CowboyNeal on Thu Jan 04, 2007 11:54 PM
from the dared-to-comply dept.
Standard Disclaimer writes "Most here on Slashdot know that Microsoft released its OpenXML specification to counter ODF and to help preserve its market position, but most people probably aren't aware of all the interesting legacy code the OpenXML specification has brought to light. This article by Rob Weir details many of the crazy legacy features in the dark corners of OpenXML. As it concludes after analyzing specification requirements like suppressTopSpacingWP, 'so not only must an interoperable OOXML implementation first acquire and reverse-engineer a 14-year old version of Microsoft Word, it must also do the same thing with a 16-year old version of WordPerfect.'"
+ -
story
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • by creimer (824291) on Thursday January 04 2007, @11:58PM (#17469738) Homepage
    Until it supports WordStar [wikipedia.org] documents.
    • Bah! (Score:3, Funny)

      by mrchaotica (681592) *

      The strings command supports all legacy document formats! What more could you possibly need? Besides, formatting is overrated anyway...

  • Length (Score:4, Funny)

    by jcnnghm (538570) on Friday January 05 2007, @12:08AM (#17469792)
    I don't know why anyone would complain, the spec is only 6,000 pages long.
    • Size (Score:5, Funny)

      by Kadin2048 (468275) <slashdot@kadin.xoxy@net> on Friday January 05 2007, @12:30AM (#17469924) Homepage Journal
      I don't know why anyone would complain, the spec is only 6,000 pages long.

      And the best part is, these [umn.edu] are the pages it uses... (I mean, why else do those specs cost so much?)
    • MS areslow learners (Score:4, Interesting)

      by WebCowboy (196209) on Friday January 05 2007, @02:14AM (#17470484)
      ...but they do learn....slowly...eventually.

      Their "open" XML format for office docs is a prime example of this.

      I think Steve Jobs was the one who first said "Microsoft just doesn't get it". Microsoft was probably the very first third-party software developer for the Mac and this was Jobs' reaction to Microsoft's first Mac applications (I think a port of Multiplan--which was re-incarnated into Excel IIRC, and MSBasic). They really WERE "tasteless", ugly and took almost no advantage of the revolutionary GUI interface--their DOSness really showed through--I think in the case of Multiplan the mouse could be used only to jump the cursor to a certain cell and that was it--the rest was all like in DOS.

      MS Windows is another example--Microsoft didn't "get it" well enough until the third major release. Now MS is SLOWLY "getting it" with the beneficial characteristics of XML standards. Microsoft's early XML efforts are like Windows 1.0--there is some very rudmentary understanding of the mechanics but not the philosophy of XML, and I wonder if this is why SOAP ended up NOT so simple (given Microsofties were involved in its creation and seemed to be trying to make it a DCOM-in-XML-but-dumber thing). Microsoft's "Version1" XML might look like this:

      <Soap:Envelope>
      <Soap:Body>
      <wsWriteLegacyData>
        <encodedBinaryData>
      SDFgkdfkljSDFJLDFSJKLkjdfbks df jklsdfklj;hk/jkjnb.kndf
      jk.sdfjkldfsddfsdfkkjsdfh kvbkjnkjkjksdfkjsdfkeuieru903
      oijooeoefvkmefmklef lmkseflkvfeklmlmermklemleflmdvldflk
      </encodedBina ryData>
      </wsWriteLegacyData>
      </Soap:Body>
      </Soa p:Envelope>
      "See? We're using XML and SOAP! We're hip! We're cooool! You can't say we don't play by the rules now!"

      Of course, this is an obtuse, opaque and obsfucated way to use XML andtotally NOT in the spirit of interoperability and openness. I won't even go into the nifty XML tools MS has made...nifty to use but they've done a lot to obliterate the S out of SOAP in their crazy output.

      OOXML (Opaque and Obsfucated XML) standard is "version 2.0"--they're doing their best to eliminate ambiguity but now we've gone over to hyper-specificity, and the standard is being shared a bit better...problem is that they don't fully describe the interpretation of the standard elements so as to keep its advantage. All they've done is taken every formatting option and mapped it to an XML element--it is monolithic and completely non-extensible. But hey, at least its publicly available and doesn't involve weirdness like encoded-binary-blobs.

      In a few years MS will reach version 3.0 of "getting" XML...
        • Re: (Score:3, Funny)

          by chthon (580889)

          Yes, but modular programming is anti-thetical to Microsoft's way of doing things.

  • by JoshJ (1009085) on Friday January 05 2007, @12:09AM (#17469800) Journal
    This is why the Microsoft Office XML (let's not kid ourself, this is far from "open") format should not become an ISO standard.
    • Re: (Score:3, Interesting)

      Totally agree. I wonder how it managed to get approved by ECMA? IIRC only IBM didn't agree to its approval; all other parties (whoever they are) agreed. I don't understand what they felt was good about this "standard" especially given that ODF had already been approved.
      • Re: (Score:3, Informative)

        by mwvdlee (775178)
        Nobody takes ECMA seriously anyway.

        You probably know that JavaScript has been standardized as "EcmaScript" by ECMA; everybody just ignores that standard.
    • by _|()|\| (159991) on Friday January 05 2007, @01:51AM (#17470366)
      Prior to reading this article, I was ambivalent about Office XML. The push to standardize Office's "DNA sequence" seemed disingenuous, but at least the format was described in detail. Now I see that the table-sagging 6,000 pages is just the tip of the iceberg: this "standard" effectively includes, by reference, the source code for every prior version of Office, to which only Microsoft has access.
    • Disadvantages of ISO (Score:5, Interesting)

      by BillGatesLoveChild (1046184) on Friday January 05 2007, @02:08AM (#17470444) Journal

      Once it is ratified as an ISO Standard, the standard is locked up and anyone that does want to a copy has to buy it from ISO. These are copyrighted. They're not cheap; thousands of dollars. Out of the reach of the average hobbyist, and not listed anywhere on the Internet. That 6,000 page draft will vanish into the mists of time.

      Larger Companies can afford this, but garage companies and hobbyists definitely can't. So what's the chance of an open source or even small upstart challenging Microsoft's Documentonopoly? Zero.

      Want another example? ISO country codes. The country codes (e.g. .us, .jp) are actually ISO, and ISO ended up backing off on a demand for royalties for this(!) But if you want state codes (e.g. California, Kantou), well, forget it unless you want to buy them off ISO. http://www.alvestrand.no/pipermail/ietf-languages/ 2003-September/001472.html [alvestrand.no]

      ISO aren't the only ones guility of doing this. IEEE do it as well. Want the latest simulation standard? Then get out your checkbook: http://standards.ieee.org/catalog/olis/compsim.htm l [ieee.org]

      ISO and the IEEE are enemies of openness. Microsoft is taking a page out of their gamebook.

      ISO or IEEE certification is a *bad* thing.

  • by Anonymous Coward on Friday January 05 2007, @12:09AM (#17469804)
    The power of legacy systems is at once both Microsoft's greatest strength and greatest weakness. Nobody in OSS is going to have the patience to rebuild the same level of backwards compatibility needed to displace them but the code must be an absolute tarpit of accumulated cruft and security holes that's incredibly difficult for them to keep going.
      • by Erris (531066) on Friday January 05 2007, @04:31AM (#17471080) Homepage Journal

        The hitch here is that *not* having them means tons and tons of reverse engineering, and that's only after tracking down every release of every version of every MS Office ever.

        The real hitch, as the article hints, is that the releases are contradictory. For instance, the Mac version of small caps is different from others. This is part of the reason Word is so bloated and does not preserve printing type setting from one machine to the next.

        Ten years ago, a state agency I was working for was forced to move from Word Perfect to Word. Hundreds, if not thousands, of documents were painstakingly converted from one format to the other. The typesetting, which they had never had a problem with previously, was easily broken by moves from one machine to the other or by changing printers. That is the kind of thing that no program can account for - it was broken from then and can not be created correctly today. It's also probably the reason for all of the nebulous "guidance" sections that don't tell you anything other than to look at, and presumably measure, old printed examples. Not even M$ knows what it was really doing in the field. As I saw at the time, no two were alike.

        Of course, the time to get things right is not in your XML it's when you import the document. The author tells us this in so many words. The XML should be general enough to encompass any kind of typesetting. It is the importing program's task to figure out what the old format wanted things to look like. As the author points out, the spec does not do anything other create something impossible to follow. It's not going to magically make things look right no matter how hard they wish it would.

  • Basically (Score:5, Insightful)

    by DrYak (748999) on Friday January 05 2007, @12:11AM (#17469810) Homepage
    ODF is the former SXW format that was taken and transformed into a standard by a committee comprising several Office software makers. It's suppose to describe the normal features that anyone should expect from any Word processing application, be it OpenOffice.org, KWord, AbiWord, Corel Word Perfect, etc. all this in a perfectly neutral way. It was designed with a function in mind (storing word processing documents in an open and interoperable way). Its benefits are comparable to the standardisation of HTML.

    OpenXML is Microsoft trying to translate its proprietary DOC file inside a XML container (because it's a big buzzword) and propose it as a standart to ECMA (because everyone is speaking about ODF being an ISO standard). It describes not only what is to be expected from a word processor, but also all MS-Word specific microsoftism. It was designed with a specific software in mind (and partly derives from the internal functionning of MS-Word). It's only a small improvement over the previous MS XML format (which had a lot of informations hidden in a binary blob).

    The good thing for Microsoft, is that they can pretend this limitation is "Not-a-bug-but-a-feature", and brag around that there are a lot of stuffs that MS-Word couldn't store inside an ODF and only OpenXML can carry.

    Microsoft's plan :
    1. Embrace
    2. Extend <- They are here
    3. Extinguish
    • by Anonymous Coward on Friday January 05 2007, @12:34AM (#17469946)

      ODF spec page count: 722 [iso.org].

      OpenXML spec page count: 6000 [regdeveloper.co.uk]!!
    • Re: (Score:3, Insightful)

      ODF is a nice idea in theory, but really, it's a similar situation (OpenOffice.Org internal dataformat jammed into a standard, so designed with OO.o in mind by necessity) just with more OSS-positive karma associated. There's nothing wrong with saving in a file format that matches your internal representation, in fact, it's a darn good idea (see .ABW for AbiWord, .DOC for Word, .WPD for WordPerfect I would also wager is the same idea). However, interoperability seems to work best when taken from the ground
      • Re:Basically (Score:4, Insightful)

        by blincoln (592401) on Friday January 05 2007, @01:45AM (#17470338) Homepage Journal
        There's nothing wrong with saving in a file format that matches your internal representation, in fact, it's a darn good idea (see .ABW for AbiWord, .DOC for Word, .WPD for WordPerfect I would also wager is the same idea).

        I would argue that when it's taken to the extreme of Office prior to 2007, it *is* a bad thing. AFAIK, the old Word format is more or less a (very) partial RAM dump (which is why you can often find all sorts of interesting stuff in Word files that the authors think they've deleted). That makes for faster dev times, but because the load and save functions don't really "understand" the content of the file, IMO the developers made things a lot harder for themselves in the big picture. I imagine reproducing issues in testing is a particular nightmare.
      • Re:Basically (Score:5, Interesting)

        by Nicopa (87617) <nick@[ ]oco.com.ar ['rel' in gap]> on Friday January 05 2007, @01:46AM (#17470342)
        No. ODF has several real, factual, benefits. It might have been originated in a single product but... it reuses existing standard technologies (SVG, CSS...). It has properly designed XML tags that act as "markup", in OpenDocument xml tags act as container for chunks of data. ODF tries to separate content from style.

        And about your RTF suggestion... can I draw diagrams with RTF? Can I have a ToC? Can I do complex styling? Can I have a "galery" of styles? Can I include images? No. RTF is not a solution.
        • Re: (Score:3, Interesting)

          Actually, I think for most of the things you suggest, you can do them - I know AbiWord supports them at least. (images, complex styles, TOC) RTF's really not the old dog it seems to be - keep in mind that for copy/paste of any sort of rich text to work in any sensible manner on Windows, one _must_ support RTF well.
      • Re:Basically (Score:4, Informative)

        by iluvcapra (782887) on Friday January 05 2007, @02:15AM (#17470486) Homepage

        After having written some tools on OS X that do stuff with RTF:

        RTF is well documented and you can make an RTF document on all manner of platforms (I've done it in Ruby and Cocoa), but many platforms have extended RTF in their own way in order to support special features. OS X has added a few special methods to RTF files to support Mac OS X typography, and I've noticed that different versions of Word handle document attributes (like headers and page numbers) in different ways.

        RTF is great if you want to make up something quick that is ONLY formatted text, but readers have all manner of different ways of interpreting the exact appearance of tables, page layouts and margins, and there doesn't seem to be any manageable common mechanism for including images or other documents, something Word and OO.org excel(pun) at. Even HTML seems to be better at this.

        I use RTF output in a few little in-house tools I have, so people can get the text+attributes they create and open them in a text editor of their choice for touching-up and delivery. When my tools have to create something that is supposed to be finished, they make PDFs.

        RTF is great for interoperability, but I never expect an RTF file to contain a "finished product," unless the recipient expects quality on par with a Selectric. It is merely a relatively-open serialization format for strings with attributes.

      • by Geof (153857) on Friday January 05 2007, @03:03AM (#17470690) Homepage

        There's nothing wrong with saving in a file format that matches your internal representation, in fact, it's a darn good idea (see .ABW for AbiWord, .DOC for Word, .WPD for WordPerfect I would also wager is the same idea).

        Documents are worth far more than software, and they outlive the applications used to create them. See the comment [robweir.com] to the original article - reading documents after 5, 20, 30, 100 years or more is not optional. You can pay the price of developing an independent format now, or you can pay the price of reverse engineering over and over again every time you change your internal representation.

        Repeated implementation limits future change and innovation. It's expensive: it likely costs more even for Microsoft. But they can afford it; their competitors may not be able to. Plus, Microsoft already has their first implementation.

        interoperability seems to work best when taken from the ground up - when working with another application's data structure of any complexity, you simply can't do a lossless roundtrip without losing before you've started.

        Perhaps so. But compare that cost to the cost I've just outlined. It is in the best interest of users and software developers (maybe even of Microsoft) to bite the bullet now, do the conversion once, and develop a clean format for the future.

        Maybe you have in mind an argument you're not making, but I don't see any sufficient basis for your broad contention that using a file format based on an internal representation is a "darn good idea". In specific cases, yes (e.g. where the cost of development time or effort are the most important factors). In general, I very much doubt it. That successful applications in the past have taken that approach is weak evidence. They were developed when the up-front cost of development in a time of rapid innovation, the loss of customer lock-in, and a lack of open-format competition where good business reasons for making such a choice - even if it was inferior technically, increased cost in the long term, and was bad for consumers. In today's climate of slower innovation, competition from open formats, and customers who are running into their own long-term interests, the situation is different.

        Which is not to say Microsoft's apparent attempt to set the rules of the game and throw sand in the gears of change is not in their interests, or that it will be unsuccessful.

      • Re: (Score:3, Insightful)

        by AuMatar (183847)
        No, saving the internal representation as a file is an utterly fucktarded idea. You see, an internal representation is made to make the most sense for your implementation of various features. It changes frequently, sometimes with every patch. It has performance hacks, redundancy, etc. A file format is supposed to be a representation of the data in an easy to parse format, so it can be loaded by applications.

        So what happens when you use the internal representation as the file format? Well, you have a fi
      • by NickFortune (613926) on Friday January 05 2007, @09:15AM (#17472688) Homepage
        ODF is a nice idea in theory, but really, it's a similar situation (OpenOffice.Org internal dataformat jammed into a standard, so designed with OO.o in mind by necessity)
        The ODF format must necessarily describe the structure and layout of an office document. There's no need for it to reflect the internal data structures of any specific application, except to the extent that they too describe office documents.

        OOXML includes data elements that should be part of internal import routines rather than being enshrined in the document format, and it includes elements that are not specified except by reference to applications for which no public specs exist. This is the problem, not the fact that OOXML is derived from MS Office file formats.

        RTF. It may not get press attention, but it's actually a fairly well-documented standard, has been working as an interchange format for years, and yet is designed with enough expandability that it's still useful with the kinds of documents produced today. It's a true de-facto standard.
        Well, I was a big fan of RTF at one time. But a few years back I found that documents with any kind of formatting more complex than paragraph+justification+font just wasn't working between MS Office and back. I don't know if this was because the format couldn't cope, or because of faulty implementations. In either case, it led me to give up on RTF.

        In any event, to be a replacement, RTF would need to work for spreadsheets and presentations at a minimum - something I don't think there's a lot of support for in the current RTF specification. We'd also lose the benefits of an XML based format, which given the amount of work on the seamless integration of XML documents into databases, web services and other data management applications means losing a lot of functionality.

        for those who really want interoperability, RTF is the way to go with today's software
        Interoperability is only part of the problem. We also want a spec that can be fully and freely implemented by anyone, which isn't under the control of any single vendor.We want a format to which we can entrust documents, knowing that in twenty years time there will be an application capable of reading them.

        an unnecessary dichotomy is drawn between OpenXML and ODF with regard to their design goals - both are repurposed native formats for a single application.
        I don't know what you mean by native in this case, but the repurposing of OOXML isn't the problem. It's one of size and obfuscation, and as TFA points out specification by reference to closed formats and the behaviour of extinct proprietary software. These are non trivial problems with OOXML which are not (to the best of knowledge) found in ODF.

        There's nothing wrong with ODF. Re-creating it based on the non-XML RTF would be a waste of time and effort.

  • by junglee_iitk (651040) on Friday January 05 2007, @12:18AM (#17469854)
    You want to hire a new programmer and you have the perfect candidate in mind, your old college roommate, Guillaume Portes. Unfortunately you can't just go out and offer him the job. That would get you in trouble with your corporate HR policies which require that you first create a job description, advertise the position, interview and rate candidates and choose the most qualified person. So much paperwork! But you really want Guillaume and only Guillaume.

    So what can you do?

    The solution is simple. Create a job description that is written specifically to your friend's background and skills. The more specific and longer you make the job description, the fewer candidates will be eligible. Ideally you would write a job description that no one else in the world except Guillaume could possibly match. Don't describe the job requirements. Describe the person you want. That's the trick.

    So you end up with something like this:

    * 5 years experience with Java, J2EE and web development, PHP, XSLT
    * Fluency in French and Corsican
    * Experience with the Llama farming industry
    * Mole on left shoulder
    * Sister named Bridgette

    Although this technique may be familiar, in practice it is usually not taken this extreme. Corporate policies, employment law and common sense usually prevent one from making entirely irrational hiring decisions or discriminating against other applicants for things unrelated to the legitimate requirements of the job.

    But evidently in the realm of standards there are no practical limits to the application of the above technique. It is quite possible to write a standard that allows only a single implementation. By focusing entirely on the capabilities of a single application and documenting it in infuriatingly useless detail, you can easily create a "Standard of One".

    Of course, this begs the question of what is essential and what is not. This really needs to be determined by domain analysis, requirements gathering and consensus building. Let's just say that anyone who says that a single existing implementation is all one needs to look at is missing the point. The art of specification is to generalize and simplify. Generalizing allows you to do more with less, meeting more needs with few constraints.

    Let's take a simplified example. You are writing a specification for a file format for a very simple drawing program, ShapeMaster 2007. It can draw circles and squares, and they can have solid or dashed lines. That's all it does. Let's consider two different ways of specifying a file format for ShapeMaster.

    In the first case, we'll simply dump out what ShapeMaster does in the most literal way possible. Since it allows only two possible shapes and only two possible line styles, and we're not considering any other use, the file format will look like this:

    <document>
    <shape iscircle="true" isdotted="false"/>
    <shape iscircle="false" isdotted="true"/>
    </document>

    Although this format is very specific and very accurate, it lacks generality, extensibility and flexibility. Although it may be useful for ShapeMaster 2007, it will hardly be useful for anyone else, unless they merely want to create data for ShapeMaster 2007. It is not a portable, cross-application, open format. It is a narrowly-defined, single application format. It may be in XML. It may be reviewed by a standards committee. But it is by its nature, closed and inflexible.

    How could this have been done in a way which works for ShapeMaster 2007 but also is more flexible, extensible and considerate of the needs of different applications? One possibility is to generalize and simplify:

    <document>
    <shape type="circle" lineStyle="solid"/>
    <shape type="square" lineStyle="dotted"/>
    </document>

  • by Bob54321 (911744) on Friday January 05 2007, @12:29AM (#17469916)
    I thought most people considered themselves lucky if there documents could open in successive versions of Office. Why would anyone want to implement support for really old versions if Microsoft does not do it themselves?
  • My favorite quote (Score:4, Insightful)

    by IvyKing (732111) on Friday January 05 2007, @12:50AM (#17470032)
    From TFA


    This is not a specification; this is a DNA sequence.


    Outrageously funny and to the point.

  • by PurifyYourMind (776223) on Friday January 05 2007, @01:48AM (#17470350) Homepage
    ...14- and 16-year-olds is illegal.
  • by frisket (149522) <peter AT silmaril DOT ie> on Friday January 05 2007, @04:43AM (#17471144) Homepage
    It's instructive to observe the panic-ridden frenzy with which Microsoft have approached the business of using XML as a file format. The marketing influence is all too plain to see, with the result that they feel an inner compulsion to preserve the appearance of the document at all costs, sacrificing all logic and common-sense to do it.

    OOo did the same, but with greater elegance and less haste because they were ahead of the field. Corel screwed it up with WordPerfect by keeping their stylesheet format proprietary so that transfer between WP document code and XML was made as hard as possible (a Class A blunder, given that their XML editor is actually quite good). AbiWord makes a good job of saving DocBook XML, but it's not trying to pretend it's reimportable; it screws up LaTeX formidably, though, by trying to pretend that it absolutely has to preserve line-length and font-size, which is evidence of the same neurotic attitude as Microsoft.

    The problem in all cases is not that the assorted authors and coders don't understand XML (although some of them clearly failed that test too), but that they don't understand documents. This is particularly true at Microsoft, where leaders such as Jean Paoli have been proselytizing XML for years. They still think a document is a jumble of letters; they have no idea of structure, and the DOM is simply laughable as a non-model of a document. Microsoft's particular problem with XML is that they came to it too late, and viewed it as a way of storing data, not text...indeed to this day many XML users, trained with Microsoft blinkers on, are unaware that XML can be used for normal text documents.

    With this level of ignorance surrounding Microsoft, it's hardly unexpected that they should blunder so badly.

        • Re: (Score:3, Insightful)

          by Brandybuck (704397)
          Things that are illegal for a monopoly are perfectly legit for a non-monopoly. It's a crazy law, but that's how it works. Microsoft broke no federal laws to *gain* their monopoly.
          • by Aadain2001 (684036) on Friday January 05 2007, @12:36AM (#17469950) Journal
            But they broke plenty of laws to keep their monopoly :) And while their actions during their rise to the top may not have been illegal, they could easily be called 'strong-armed'.
              • by hachete (473378) on Friday January 05 2007, @06:50AM (#17471726) Homepage Journal
                Yes, they got into trouble for bundling but it misses the point every time. The secret sauce that Microsoft uses is to strong-arm the OEMs into bundling windows with PCs, espeicially for consumers. I'm also thinking that the Windows Tax is levied even if you buy Linux on a Dell. This is the lynch-pin of Microsoft domination, without it all their other strategies whither on the vine. Without bundling of windows with new pcs, the bundling of IE (and all the other sofware), the resistance against inter-operability, the mysterious file formats etc wither on the vine. I've been disappointed that *none of the investigations I've read about have gone after the OEM-Microsoft link. Break that, and you'll have a free-market again.

                I think the Office XML format style is a play straight out of IBM's hand-book: make the standard complex and incomprehensible, and the little players - that's you - will find it hard to compete. In a way, that's a good sign: Microsoft is now lumbering into middle-age, hoist on their own evermore complex petard.

                The other thing about middle-age is that every little technological step away from their established base-line is treated as a revolution. In reality, it's no such thing, just a small stepping stone to shouting "pesky kids. Get off my lawn." Or maybe they've reached that stage already.
                • Re: (Score:3, Interesting)

                  by Gr8Apes (679165)
                  They did. It was "resolved" by being disallowed. There is no per machine "MS tax" anymore. That fell victim in teh IBM suit, I believe. (So many suits, so long ago, so many beers.... ahh - that explains it!)
          • by edwdig (47888) on Friday January 05 2007, @02:26AM (#17470532) Homepage
            Microsoft broke no laws getting DOS onto every PC. They happened to be in the right place at the right time, and the market fell onto them. But from there, Microsoft bended and broke the law every chance they got to ensure that there never was any competition.

            Also don't forget that although MS's purchase of DOS was perfectly legal, it was ethically horrible. They arrived at a handshake agreement to license the code from Seattle Computer Company. While the MS paperwork was being finalized by the lawyers, SCC then made arrangements to finance other business ventures using the MS money. MS then presented them a contract to buy the code rather than license it, and told SCC to take it or leave it. As SCC had already committed to the other deals, they had no choice but to take MS's offer. Sure, no one held a gun to the head of the SCC executives forcing them to take the deal, however, they didn't have any other reasonable alternatives. MS's behavior was legal, but certainly not ethical.
          • Re: (Score:3, Insightful)

            by tyme (6621)
            Brandybuck [slashdot.org] wrote:

            Things that are illegal for a monopoly are perfectly legit for a non-monopoly. It's a crazy law, but that's how it works. Microsoft broke no federal laws to *gain* their monopoly.

            Unfortunately, you are wrong on almost all counts:

            1. Section 1 [cornell.edu] of the Sherman Act [cornell.edu] (Restraints of Trade) applies to everyone, not just to monopolists. If Microsoft engaged in any restrains of trade, even before they were a monopoly (which doesn't have to mean 100% market share, as so many people seem to believe, i
          • Re: (Score:3, Insightful)

            Things that are illegal for a monopoly are perfectly legit for a non-monopoly. It's a crazy law, but that's how it works.

            I think your logic is more than a little broken. Monopolies have a great deal of power that other's don't have. They can undermine capitalism in a market and destroy innovation in entire industries. They can spread causing that damage to other markets. Think of it like this, people piloting airplanes aren't allowed to drink or step outside for a cigar, while those behaviors are perfect

            • by TrekkieGod (627867) on Friday January 05 2007, @02:28AM (#17470544) Homepage Journal

              If you get to the point where you build up a company that can even consider garnering the term "monopoly", then get back to us...At that point, maybe, just maybe, you may come to thinking that you you earned what you got, and the government has no right to tell you how to run your business...

              Yeah. Because the person best suited to decide what a company should or should not be allowed to do are the people who own the company. Of course you're going to want to be completely unrestricted to mow down your competitors using whatever advantages you have if you are in a position to do so. What you're missing is that no one should be allowed to use unfair practices to do it. Some people think we should idolize the free market as some sort of religion. We don't like free market economy because it was given to us by the gods. We like it because it tends to result in better products and lower prices. That ceases to be true when you have a monopoly in the mix.

              That being said, I'm not really informed about any Microsoft specifics, so I'm not going to argue in favor or against any "federal laws" as it applies to them (or failed to apply to them). However, suggesting that only people who have built a company that holds a monopoly should be able to decide what is fair regulation isn't rational. It may even be that the current federal laws regarding monopolies may be unfair and in need of reform, but the fact remains that the existence of a set of laws to regulate businesses is necessary.

            • by ArsenneLupin (766289) on Friday January 05 2007, @03:41AM (#17470850)

              If you get to the point where you build up a company that can even consider garnering the term "monopoly", then get back to us. Until then, you have no idea what you're talking about, especially when quoting arbitrary and esoteric "federal laws". Call me nuts, but if you ever got to that point, you'd might even get a crazy idea in your head that those "federal laws" that you are so damned proud of, are about as fair and just as our drug laws. At that point, maybe, just maybe, you may come to thinking that you you earned what you got, and the government has no right to tell you how to run your business that you started in your teens, and proceeded to build to make it one of the most successful companies in the history of capitalism.

              Until you get to that point, I suggest that you those "federal laws" out your ass, Mr. Ashcroft.
              I agree 100% with you. However, for fairness' sake, we should then abolish all those unjust business-hampering federal laws, including copyright and patent law.

              Oh, and also those so-called "computer misuse" laws. Indeed, if I want to set up a consultancy where I propose to convert customers ASP scripts to PHP I should be allowed to demo to my prospective customers in great graphical detail why ASP is so insecure, even if I don't yet have an existing business relationship. Why should I tolerate that the government tells me how I may and may not recruit new customers?

              Anything less would be one-sided and unfair.

        • Re: (Score:3, Insightful)

          by Al Dimond (792444)
          No, actually, I think you'd find it takes the skill of many people, good timing, and luck to be successful in business, even if you could break very many laws. Creating and sustaining a business for many years is hard. Not very many businesses make it.
    • Are you kidding? This is not a format specification. And it reflects badly on Microsoft and the engineers that authored this document: either they are too stupid to know that this is not a specification, or they are taking everybody else to be fools.
    • by animaal (183055) on Friday January 05 2007, @03:32AM (#17470820)

      What's the deal with you people? I have seen engineers take apart the most difficult situations. You have the format in your hands. It's ugly and crappy, go figure. Just get it done and stop bitching. Why is everyone so lazy?
      Jeff, is that you? Haven't seen you much since you became a project manager. Congrats on getting the MBA!
    • Re: (Score:3, Insightful)

      by gaijin99 (143693)
      You're missing the point. By defining their "standard" in this manner they can now say "Application X doesn't implement OOXML", naturally by "implement OOXML" they mean "fully implement OOXML" so that if even the most obscure and bizarre tag is not supported that's that. At that point they can either demand that application X not claim on its packaging that it supports their "standard", they might have one of those cute little "OOXML Compatiable" seals and refuse to let anyone who doesn't fully support th
    • by Askmum (1038780) on Friday January 05 2007, @03:02AM (#17470680)
      You seem to be missing the point.

      You do not need these features to begin with in a new format that is inherently incompatible with an old format. You don't want to say "now I'm going to do WP style linespacing and my linespacing is 1".
      If you want to convert a WP document to an XML document, the conversion program should know that the linespacing in WP is 0.9 times the linspacing in XML document (or what it really may be)and will then use linespacing=0.9 in the XML document. This is not a task of the new wordprocessor or its specification.

      By adding this so-called "backward compatibility" to your specification, you make the spec overly difficult and in fact you make the conversion program in the new application when this is absolutely not necessary.
      And on top of that, you require that the programmer who uses this spec should have knowledge of all these old versions and is able to program them without error. And as the application will grow because of these unnecessary features, the number of bugs will also rise. So this is not a blueprint for a good application, this is a blueprint for a very buggy implementation of a wordprocessor.
        • Re: (Score:3, Insightful)

          by cduffy (652)

          By "updating" these pages by not supporting the old format you will breaking the layout of some documents that need a consistent layout.

          You're missing his point: When converting the file to OOXML, one can and should add generic tags indicating the specific (broken) behavior which should emulated (such as "scale small caps by this percentage point") rather than just specifying a generic "Do What I Mean" marker without any useful guidance on how rendering of documents containing this marker should be implemen

    • by tepples (727027) <<moc.thgienip> <ta> <6002hsals>> on Friday January 05 2007, @01:52AM (#17470370) Homepage Journal

      OOXML is just as open as ODF

      The behavior of years-old proprietary word processing software is included by reference into OOXML. How is any spec that includes by reference the behavior of proprietary software exactly "open"? True, implementors could produce a partial implementation of the spec that degrades away the legacy baggage (more or less) gracefully, but some standards' patent licensors forbid implementors to publish a partial implementation. I don't know if this applies to OOXML's license.

    • Who's Scared? (Score:4, Informative)

      by Erris (531066) on Friday January 05 2007, @04:56AM (#17471200) Homepage Journal

      ... immediately render billions of existing MSO documents obsolete if you could get govt to mandate ODF exclusively. And the bonus is that such govt mandate would render any and all features not supported by ODF (i.e. not supported by OO.o) irrelevant.

      Eh? Isn't that why M$ made this supposedly "open" format? Because governments were tired of paying through the nose for secret formats that broke between versions? The purpose of an archive is to read it later. Governments and companies have already moved to pdf for archives. They are going to move their working documents to reasonable formats next.

      But MS opened their own format, thus leveling the playing field so that you must again compete on features ...

      You must not have read the 6000 page spec, which includes lots of sections like this:

      Guidance: To faithfully replicate this behavior, applications must imitate the behavior of that application, which involves many possible behaviors and cannot be faithfully placed into narrative for this Office Open XML Standard. If applications wish to match this behavior, they must utilize and duplicate the output of those applications. It is recommended that applications not intentionally replicate this behavior as it was deprecated due to issues with its output, and is maintained only for compatibility with existing documents from that application. end guidance

      That's neither open, nor a standard.

      Microsoft is hoping people believe what you say, but everyone knows better. Shit like OOXML this only proves that they have not changed. It's just another, more elaborate and more expensive lie. Even the name, by using "OO" is intentionally confusing. The New Office is everything the old Office was and always will be. Vista and Office 2007 are non starters.

    • Re:Unfair (Score:4, Insightful)

      by DavidTC (10147) <{moc.xobreven} {ta} {vidav.vidav.hgfdls}> on Friday January 05 2007, @02:53AM (#17470642) Homepage

      Tools that don't care about legacy support are unaffected by this; they can just pick the closest modern option to whatever the legacy flag calls for on input, and not output documents that use them.

      And thus tools, legally, are not OOXML, and won't qualify for purchasing by companies that specify OOXML. Which is the entire point.

      There's a difference between 'We need to make sure that old documents can be converted correctly.', and 'We will literally convert old documents into a new representation that contains all their weirdness, and we won't explain how to implement said weirdness in the standard.'.

      What Microsoft has produced is not even a standard. Standards must specify everything, or reference other standards that specify everything. They can't reference applications.

      If Microsoft wants to keep secret how to turn Office 95 documents into OOXML, fine. Producing a standard doesn't mean you have to explain how to convert things into that standard.

      It does, however, mean you have to explain exactly what should happen if mwSmallCaps is true, to the pixel. You can't just pawn it off on the unexplained hypothetical behavior of some other application.

    • Re: (Score:3, Insightful)

      by Askmum (1038780)
      So they did it wrong.

      You need to let a conversion program worry about converting Word 2006 documents to XML documents. You need to let the maker of Word 2006 worry about making this conversion program. This can be in the form of a "save as XML" option, but also an external program.
      You can not say "oh, this is an old feature, let's put it in the spec and let's let the programmer that uses this spec worry about it because we can't be bothered to convert it or don't know how to convert it".
      Sorry, but XML s