Please create an account to participate in the Slashdot moderation system

 



Forgot your password?
typodupeerror
×
Microsoft Programming IT Technology

Microsoft Word Document ML Schemas Published 439

Lars Munch writes "On Monday the 17th November the xml schemas for the Word Document ML along with documentation, was uploaded to the Infostructurebase (ISB). With the Word Document ML specification anybody can generate, view and process Microsoft word documents on any format." (Here are the legal terms under which the schemas can be used.) "The Word Document ML is based on the W3C specification eXtensible Markup Language (XML), there by providing documents that are easy to integrate into a large variety of systems. The Danish Government Infostructurebase is the first schema repository to make the schemas accessible to the public. The Microsoft Office Document ML schemas and documentation can now be downloaded from the ISB Repository." There are more links on this page.
This discussion has been archived. No new comments can be posted.

Microsoft Word Document ML Schemas Published

Comments Filter:
  • by RobertB-DC ( 622190 ) * on Monday November 17, 2003 @12:18PM (#7493323) Homepage Journal
    I was struck by Microsoft's about-face on proprietary data formats when I attended their "Microsoft Office System Launch" (details here [officesystemlaunch.com]) earlier this month.

    On the "Development" track, I was hoping to get some information on interfacing Office tools as objects in an existing (very large) VB application. Well, I didn't get that, but I did get to see how Microsoft is using XML to cut off one of Open Source software's big draws: open file formats. As mentioned, one of the big selling points was that you no longer have to install an app like Word on your server. You can instead use any XML-generating program to create fully compliant Word/Excel/Whatever files.

    So if the PHB [dilbert.com] was almost talked into Open Source by the security issues of installing a virus portal like Word on a trusted system behind the firewall, Microsoft just cut your legs off.

    An interesting case of "If you can't beat 'em, join 'em, *then* beat 'em."

    By the way, I bailed out of the "Development" track at lunch. The presentation didn't get into code at all... it was just a demo of how new features in Word will now allow anyone to create XML Schemas and "Solutions" (groups of schemae), and thereby call themselves a "programmer". Just what we need, another way to quickly generate bloated, write-only code.
  • At long last (Score:1, Interesting)

    by Xarius ( 691264 ) on Monday November 17, 2003 @12:19PM (#7493328) Homepage
    Finally. As a 100% Linux user, who has to use Microsoft products at university it has been a pain doing my work at home and transferring it over (without losing any details at all)...

    *smiles*
  • by Anonymous Coward on Monday November 17, 2003 @12:20PM (#7493340)

    From http://www.microsoft.com/mscorp/ip/format/xmlpaten tlicense.asp [microsoft.com]:

    "...You are not licensed to sublicense or transfer your rights..."

    That whole page is worth reading, but doesn't this phrase in particular damage the ability to make use of the information in open source code, whether GPL or BSD?

    The page also says:

    "...If you distribute, license or sell a Licensed Implementation, this license is conditioned upon you requiring that the following notice be prominently displayed in all copies and derivative works of your source code and in copies of the documentation and licenses associated with your Licensed Implementation:
    'This product may incorporate intellectual property owned by Microsoft Corporation. The terms and conditions upon which Microsoft is licensing such intellectual property may be found at http://msdn.microsoft.com/library/en-us/odcXMLRef/ html/odcXMLRefLegalNotice.asp?frame=true [microsoft.com].'...

    Unfortunately, the page they ask you to link to doesn't actually exist...

  • Not so fast (Score:2, Interesting)

    by OMG ( 669971 ) on Monday November 17, 2003 @12:23PM (#7493379)
    Wait a second ... I think the XML-format document types are only available for corporate versions of MS office. If that is true there still will be a lot of propiertary binary-only .DOCuments around in the future.

    Nice tactics: MS now tells everybody "we use open standards" (as they already do) but the users keep saving files in closed formats.
  • Possible solution (Score:5, Interesting)

    by infolib ( 618234 ) on Monday November 17, 2003 @12:23PM (#7493382)
    This is a real problem. However I think it may perhaps be circumvented by having a MSOfficeOpenOffice converter under a BSD-like license. The combination of the BSD'd plugin and eg. OpenOffice might however infringe patents if they were too closely integrated. Murky legal waters. Ugh :-(
  • by Bananenrepublik ( 49759 ) on Monday November 17, 2003 @12:26PM (#7493415)
    You have to display the following text in any derived work:

    "This product may incorporate intellectual property owned by Microsoft Corporation. The terms and conditions upon which Microsoft is licensing such intellectual property may be found at http://msdn.microsoft.com/library/en-us/odcXMLRef/ html/odcXMLRefLegalNotice.asp?frame=true [microsoft.com]."

    Now try the link ...
  • hell has frozen over (Score:2, Interesting)

    by bug ( 8519 ) on Monday November 17, 2003 @12:27PM (#7493416)
    Here's a blurb from the sister license granting use of their software patents related to the XML formats:

    By including the above notice in a Licensed Implementation, you will be deemed to have accepted the terms and conditions of this license. You are not licensed to distribute a Licensed Implementation under license terms and conditions that prohibit the terms and conditions of this license.

    A bit close to the GPL in some respects, hmm?

    I wonder, could these licenses get the OSI good housekeeping seal of approval?
  • by h4rm0ny ( 722443 ) * on Monday November 17, 2003 @12:28PM (#7493426) Journal
    Can someone clarify for me what this part means...

    Microsoft reserves the right to terminate this license grant if you sue Microsoft or any of Microsoft's affiliates for patent infringement over claims relating to reading or writing of files that comply with the Office Schemas


    I'm assuming it's actually fairly innocent but just how wide a scope does it have under the word 'relating' ?

    Finally, what are the legal constraints on M$ changing or withdrawing this licence at a later date? Presumably they are no more limiting than those on the GPL, but then I've never worried about Linus or RMS withdrawing rights from Linux, wheras with M$...

    ITIAL's (I Think I'm A Lawyer) out there who can explain?
  • by corebreech ( 469871 ) on Monday November 17, 2003 @12:31PM (#7493455) Journal
    Here's the part of the patent license I don't think I understand completely:

    By including the above notice in a Licensed Implementation, you will be deemed to have accepted the terms and conditions of this license. You are not licensed to distribute a Licensed Implementation under license terms and conditions that prohibit the terms and conditions of this license.


    You are not licensed to sublicense or transfer your rights.


    IANAL, but I think this says no open source implementation is possible, doesn't it?
  • Interesting links (Score:3, Interesting)

    by infolib ( 618234 ) on Monday November 17, 2003 @12:31PM (#7493462)
    This press release [www.oio.dk] from danish govt. agency Open public Information Online (OIO) has more info.

    Read the patent license [microsoft.com] for yourself. (The license for the schemas themselves is basically BSD)

    Also this (danish) Computerworld article [computerworld.dk] quoted MS EMEA boss Patrick de Smedt calling Interoperability a "holy grail", an "advantage to the ordinary consumer" and Competition "a very important part of our strategy." The quotes have now been removed again (why??)
  • Comment removed (Score:5, Interesting)

    by account_deleted ( 4530225 ) on Monday November 17, 2003 @12:32PM (#7493478)
    Comment removed based on user account deletion
  • by mAIsE ( 548 ) on Monday November 17, 2003 @12:33PM (#7493486) Homepage
    I will byte....

    If person A understands the spec and XML well, writes a spec for person B, C and D to implement.

    Person B, C and D go off and write filters without direct knowledge of micro$oft XML.

    Person A judges the results without touching any code and picks a winner.

    I believe this is fully legal and there is nothing MickeySoft can do to stop it.
  • Not true (Score:5, Interesting)

    by nodwick ( 716348 ) on Monday November 17, 2003 @12:36PM (#7493510)
    You omit the relevant parts of the patent license [microsoft.com]:
    Except as provided below, Microsoft hereby grants you a royalty-free license under Microsoft's Necessary Claims to make, use, sell, offer to sell, import, and otherwise distribute Licensed Implementations solely for the purpose of reading and writing files that comply with the Microsoft specifications for the Office Schemas. [...] If you distribute, license or sell a Licensed Implementation, this license is conditioned upon you requiring that the following notice be prominently displayed in all copies and derivative works of your source code and in copies of the documentation and licenses associated with your Licensed Implementation:

    "This product may incorporate intellectual property owned by Microsoft Corporation. The terms and conditions upon which Microsoft is licensing such intellectual property may be found at http://msdn.microsoft.com/library/en-us/odcXMLRef/ html/odcXMLRefLegalNotice.asp?frame=true."

    You are not licensed to distribute a Licensed Implementation under license terms and conditions that prohibit the terms and conditions of this license.

    The license explicitly allows you to sell/offer/distribute an implementation of their standard. The rest appears to be a bunch of legalese saying that you can't transfer your distribution rights to other people; it's not saying that you can't transfer your distribution. Since anyone else who feels like modifying your GPL'd code is allowed to sell/offer/distribute Microsoft's XML standard too under their license, I fail to see why this is hostile to the GPL license. The GPL itself only requires that a patent license be publicly available, not that the rights themselves have to be transfered to the users. Since the Microsoft license lets anyone use implementations royalty-free, it shouldn't be a problem.
  • by dmelchio ( 27732 ) on Monday November 17, 2003 @12:40PM (#7493547) Homepage
    The format for macros and some other things is not specified (at least not enough to recreate them). The format is still not portable for advanced features. Hopefully Microsoft isn't pushing this as an "open" format, because it isn't really open if it still has blackboxes in it. From the spec:
    For VBA code, a base64-encoded version of the binary file generated by the VBA editor is held in the binData element inside the docSuppData element. The binData element has a name attribute whose value must be set to "editdata.mso". The docSuppData element is a top-level element under the wordDocument root element, and follows the styles element in a document created by Word.
  • by Anml4ixoye ( 264762 ) * on Monday November 17, 2003 @12:45PM (#7493583) Homepage
    Now do you or your friends have anything to hide...

    And if you do, do you really trust Microsoft to keep it secret?

  • by cabalamat2 ( 227849 ) on Monday November 17, 2003 @12:51PM (#7493633) Homepage Journal

    I think you are making 2 mistakes here:

    (1) You say: Open Source != GNU Public License..
    There's no such thing as the "GNU Public License"; you probably mean the GNU General Public License.

    (2) Microsoft's license says: "You are not licensed to sublicense or transfer your rights". This means if you write a program using Microsoft's license, and license your preogram under the BSDL, then someone using your program isn't licensed to modify it. I would imagine MS have done this deliberately to sabotage open source / free software implementations of their XML schemas.

  • by drakaan ( 688386 ) on Monday November 17, 2003 @12:52PM (#7493640) Homepage Journal
    Simple...you state in the advertising materials for your product that it can open the same documents that a software suite with a name similar to Orifice made by a company with a name similar to Muckrosaft can...example documentation follows:

    Errfice 1.0 can open lots and lots of files...in addition to native StarOffice/OpenOffice document compatibility, there is another company that has a name like "Muckrosaft" that makes an office suite called "Orifice" with which this software is compatible (can you guess which company it is?)

    You can:

    • Open/Edit/Save "Muckrosaft Orifice" word-proccessing, spreadsheet, and presentation files...

    etc... ;)

  • by pyite69 ( 463042 ) on Monday November 17, 2003 @12:56PM (#7493675)

    Wrong! The specifications are one thing, but once
    you make code that uses it, you will have to
    follow Microsoft's licensing scheme, since they
    claim that their XML Office Schema is patented.
    Here is the link to that:

    http://www.microsoft.com/mscorp/ip/format/xmlpat en tlicense.asp

    The link on this page that references the actual
    details gives a 404, but I would bet that it only
    allows their schema to be used by closed source
    products. You can bet that GPL or BSD code will
    simply never be able to receive a license. The
    minute there is a Linux distro that can handle
    MS Office products with Open Source/Free Software,
    there will be a patent violation suit filed.

    Please correct me if I'm wrong, because I hope
    that I am.
  • Yes. (Score:2, Interesting)

    by Bill, Shooter of Bul ( 629286 ) on Monday November 17, 2003 @12:58PM (#7493684) Journal
    In general, I find that to be true. There is BSD freedom, and GPL freedom. Two different views on the world. One benifts the person writting the code, the other benifits those that do not. Its a shame there couldn't be something between the two.
  • by deadmonk ( 568008 ) on Monday November 17, 2003 @01:25PM (#7493925) Homepage Journal
    One of the things that's interesting to me is how much Microsoft generates that never gets used. I'll believe that they're using 'open formats' when that's the default file format for saving Microsoft Office documents.
    As it stands, they could make it an import/export option, and relegate it to the level of CSV for spreadsheets.

    Sure, it's there, but it's little more than a checkbox they can trumpet..
  • by Darth Daver ( 193621 ) on Monday November 17, 2003 @01:41PM (#7494077)
    That is not accurate based upon the benchmarks I have seen. Virtual PC has ~10-15% more overhead than VMWare. I have used VMWare for years. It is an excellent product. The ESX version has the best performance of any x86 VM technology I have seen including User Mode Linux (UML). VMWare also has features that put Virtual PC to shame. I also run VMWare under Linux because I believe in having a concrete foundation instead of a glass one.
  • Namespaces... (Score:3, Interesting)

    by Jack William Bell ( 84469 ) on Monday November 17, 2003 @01:46PM (#7494122) Homepage Journal
    Couldn't you extend the file formats the 'Namespace Module' way? This has several advantages: First off, you aren't changing their spec, only adding a new namespace for a particular need -- and now you namespace modules are the proper and accepted way, in XML, to add functionality to a schema you don't control!
  • by Anonymous Coward on Monday November 17, 2003 @02:14PM (#7494441)
    Once again it's good to be Swedish (tm). Under applicable Swedish law (overruling any EU-directive) there's no entry in patent-regulatory law that data structures may be patented or copyrighted. Several courtcases (one included myself) has proven that you cannot uphold a patent on a datastructure; only on 'active code'. XML is a bit of both, but I'd bet my savings they'd rule it to be a container (ie data structure) which under current law is free for all , published or reverse-engineered (yes, I have the courts ruling on this exact thing ;).

    As soon as we adopt the EU-standing on things, this'll be changed obviously.
  • by arevos ( 659374 ) on Monday November 17, 2003 @04:33PM (#7495757) Homepage
    The major reason why MS Office is so popular is because all it's formats are closed. A company cannot switch to a competitor. The vast majority of Microsoft products are either inbuilt, or lock the user with formats or compatability. Microsoft's great software realisation is that you don't need to make great products if you make sure that people can't use anything different.

    Whilst they already may have a very large market share, if there's suddenly 100% open specs, then Word's market share will shrink. People can quite easily undercut the cost. Whilst some PHBs might go for MS, if another word processor offers 100% compatability for a tenth the price... Well, money speaks loudly.

    Which is why I doubt Microsoft's intentions. Office isn't really that good a product. It crashes, for one. Look at, say, Warcraft III. A vastly more complex piece of software that is incredibly more stable. Whilst word processors may take a lot of coding, if the standards were truly open, then MS would have to compete on a level playing field. And I can't think of a single software program, on a level playing field, they've produced that is better than the competition.

    Compare Apache to IIS. Compare ASP to PHP. Compare IE to Firebird. None of Microsoft's software is outstanding in any technical way. When not bundled in with the OS, or locked in with compatibility issues, MS loses out.
  • Still Need to Know (Score:3, Interesting)

    by 4of12 ( 97621 ) on Monday November 17, 2003 @04:47PM (#7495866) Homepage Journal

    Whether these schemata are sufficient for someone besides MS to get a suitable XML document to render on the screen or the printed page in exactly the same fashion that MS does?

    The reason I ask is that earlier complaints about Word not being an open documented format were directed to an RTF specification at Microsoft.

    But the specification was insufficient for anyone who wanted to know how a Word document would be rendered - for that there were hidden rules in Word's codebase, rules that would change over time, or from platform to platfrom (ask anyone on a Mac).

  • by 3seas ( 184403 ) on Monday November 17, 2003 @05:50PM (#7496572) Homepage Journal
    (Forwarded from Patents list)

    -------- Original Message --------
    Subject: [Patents] MS Office 2003 XML patented
    Date: Mon, 17 Nov 2003 13:48:11 +0100
    From: Carsten Svaneborg
    Organization: www.mpipks-dresden.mpg.de
    To: patents@aful.org

    Hi! Just came across the following:

    http://www.microsoft.com/mscorp/ip/format/xmlpat en tlicense.asp
    Office 2003 XML Reference Schema Patent License

    Microsoft may have patents and/or patent applications that are necessary for
    you to license in order to make, sell, or distribute software programs that
    read or write files that comply with the Microsoft specifications for the
    Office Schemas.

    So usage of MS Word XML files requires a patentlicense.

    :

    You are not licensed to distribute a Licensed Implementation under license
    terms and conditions that prohibit the terms and conditions of this
    license. You are not licensed to sublicense or transfer your rights.

    The licence is royalty free, but GPL 7 requires the right to sublicence
    patent rights to the people who obtain a GPL program from you.

    so in other words Microsoft is using patents to prevent GPLed programs from
    accessing the XML format that MS Word will be using.

    This is very good timing, and goes to show how important it is to ensure
    that the software patent directive has articles that protects
    interoperativity from consituting patentinfringemet.

    --
    Mvh. Carsten Svaneborg
    http://www.softwarepatenter.dk
  • by russotto ( 537200 ) on Monday November 17, 2003 @06:10PM (#7496769) Journal
    There's a couple issues here:

    1) The clause forbidding you from modifying and making derivatives of the specification. Well, certainly, the specification is copyrightable and MS is within their rights to make this demand. Any reverse-engineered description of the file format would not be covered by this clause

    2) The part claiming various restrictions on implementing the specifications. This one's just plain strange. MS doesn't say they've patented the format. Nor do they say that they haven't. They simply suggest that they _might_ have. And if you want to be covered if they have, you've got to accept their terms. Which include not mentioning their name, no sublicensing, including the clause, etc.

    IF they have a valid patent, they can enforce this. They can enforce it even if you never looked at the specification. Even if the format was reverse-engineered by a couple of guys from Elbonia who'd never heard of Microsoft until you showed them the files. Wouldn't matter -- if you wanted to read&write Word files, it'd be their way, or the highway.

    If, on the other hand, they don't have a valid patent, you can read their specification and implement away. As long as you don't incorporate the spec into your work, copyright can't prevent you from writing an implementation. You can claim compatibility with Microsoft Word or Office (under trademark fair use). You don't have to include any verbiage of theirs. You can print out their license with nontoxic inks on soft paper and use it as it is best intended.

    So which is it? Well, Microsoft isn't referring to any particular patent number, so I suspect their license is 95% FUD. The other 5% is that they probably have an application in with the USPTO which covers some either obvious, overbroad, or non-novel things in the Word file format, which will probably be approved because the USPTO approves everything. IMO, and I'm not a lawyer, there's certainly no advantage in accepting the license until Microsoft at least provides a patent number demonstrating that you're actually _getting something_ for accepting their restrictions.

UNIX is hot. It's more than hot. It's steaming. It's quicksilver lightning with a laserbeam kicker. -- Michael Jay Tucker

Working...