Slashdot Log In
Microsoft Word Document ML Schemas Published
Posted by
timothy
on Mon Nov 17, 2003 11:17 AM
from the danish-means-more-than-a-snack dept.
from the danish-means-more-than-a-snack dept.
Lars Munch writes "On Monday the 17th November the xml schemas for the Word Document ML along with documentation, was uploaded to the Infostructurebase (ISB). With the Word Document ML specification anybody can generate, view and process Microsoft word documents on any format." (Here are the legal terms under which the schemas can be used.) "The Word Document ML is based on the W3C specification eXtensible Markup Language (XML), there by providing documents that are easy to integrate into a large variety of systems. The Danish Government Infostructurebase is the first schema repository to make the schemas accessible to the public. The Microsoft Office Document ML schemas and documentation can now be downloaded from the ISB Repository." There are more links on this page.
This discussion has been archived.
No new comments can be posted.
Microsoft Word Document ML Schemas Published
|
Log In/Create an Account
| Top
| 439 comments
| Search Discussion
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Out-Open-Sourcing Open Source (Score:5, Interesting)
(http://www.dixie-chicks.com/ | Last Journal: Tuesday July 24, @05:17PM)
On the "Development" track, I was hoping to get some information on interfacing Office tools as objects in an existing (very large) VB application. Well, I didn't get that, but I did get to see how Microsoft is using XML to cut off one of Open Source software's big draws: open file formats. As mentioned, one of the big selling points was that you no longer have to install an app like Word on your server. You can instead use any XML-generating program to create fully compliant Word/Excel/Whatever files.
So if the PHB [dilbert.com] was almost talked into Open Source by the security issues of installing a virus portal like Word on a trusted system behind the firewall, Microsoft just cut your legs off.
An interesting case of "If you can't beat 'em, join 'em, *then* beat 'em."
By the way, I bailed out of the "Development" track at lunch. The presentation didn't get into code at all... it was just a demo of how new features in Word will now allow anyone to create XML Schemas and "Solutions" (groups of schemae), and thereby call themselves a "programmer". Just what we need, another way to quickly generate bloated, write-only code.
Re:Out-Open-Sourcing Open Source (Score:5, Insightful)
(http://photo.net/photos/swillden | Last Journal: Wednesday July 19 2006, @01:42PM)
Microsoft isn't going to give up the golden strength of a file format lock-in any time soon, even if they let companies use custom indexing tools on their store of documents (which is really what this whole XML business is about).
Unless I'm missing something, I think this does break the lock-in, in large part. With a published, standardized format, non-Microsoft tools can implement support for it, and users can expect it to work reliably. Openoffice.org, for example, can probably support the new MS format simply by adding a pair of XSLT stylesheets (though they may want to take a different approach for performance).
This means that users of non-MS tools will be able to create documents, confident that MS Office users will be able to read them. There are still limitations going the other way, but that still means that non-MS tools only have to write import filters for the old Office formats, halving the work, and that is really won't be an issue in the business world, where Office Pro is the norm anyway.
I think think this move will prove painful for MS, but probably less painful than sticking with completely closed formats, given the way they've been getting beat up about it.
Re:Out-Open-Sourcing Open Source (Score:5, Informative)
(http://slashdot.org/~superyooser/journal/73607 | Last Journal: Wednesday June 20, @01:12PM)
Re:Out-Open-Sourcing Open Source (Score:4, Informative)
(http://slashdot.org/~superyooser/journal/73607 | Last Journal: Wednesday June 20, @01:12PM)
Re:Out-Open-Sourcing Open Source (Score:5, Interesting)
(http://www.nightlifemagazine.ca/ | Last Journal: Thursday March 24 2005, @12:46PM)
I think the issue here is that they have already won [essential.org].
Since their Office suite has %95+ of the Windows and Macintosh market share, why not open the specs up?
This leads to other apps copying the MS specs and PHB's to conclude that since all documents are now MS Office documents, why not buy the brand most compatible with the format? I.E. Microsoft.
--
Tired of spammers? Kill them all [si20.com]! Let the irony of this sig sort 'em out.
Re:Out-Open-Sourcing Open Source (Score:4, Insightful)
(Last Journal: Thursday February 05 2004, @11:30PM)
why not buy the brand most compatible with the format?
That's an easy one to answer. You've got 300-400 machines that require an office application suite, but you've got a small budget. Complete compatilibility is not much of an issue if you can save (400*$600) $180,000 - $240,000, yet run a "mostly compatible" suite. Now, with the opening of the format, that "mostly compatible" becomes "compatible."
Then there's the whole issue of MS Licensing 6.0 (as if it's a whole other application itself).
Re:Out-Open-Sourcing Open Source (Score:5, Informative)
(http://www.tanningbeds.org/ | Last Journal: Sunday November 05 2006, @07:23AM)
Did any of you read the actual Microsoft patent statement? It says you must obtain a license if you USE the information in a seperate application for compatability. Quoting them:
"There is a separate patent license available to parties interested in implementing software programs that can read and write files that conform to the Specification."
Technically, anyone that looks at it, and uses it to put compatability for Open Office, are infringing on their patent. And now that the spec is in the open, its very easy for microsoft to say "we opened it up, and they infringed, this is why we dont like open source". This also means, that if you DON'T look at it, and instead do manage to reverse engineer it, it is likely that a judge will believe MS that you are lying and instead just read their "open" standard.
Its open, as long as you don't use it.
Re:Out-Open-Sourcing Open Source (Score:5, Informative)
Just thought you would like to know, the plural of schema is schemata.
Mr. Language Person
Re:Out-Open-Sourcing Open Source (Score:5, Insightful)
I am still skeptical that Microsoft has truly made this open. Excuse me, but I don't just blindly accept what Microsoft says at face value. Microsoft has a serious credibility problem from lying about so much for so long. Even if Microsoft has finally caught up to the Open Source community regarding the openness of file formats, that helps OpenOffice and its users. It would make me feel even better about NOT spending hundreds of dollars on an office suite every few years.
Microsoft just cut our legs off over security issues? Do you think opening a Word file format just magically makes all of their security issues go away?
I saw some other Microsoft cheerleader congratulate Microsoft for "leapfrogging" Linux by finally providing a decent (remains to be seen) shell, but this person did not explain how this infant shell surpassed bash, pdksh, or zsh. Just because someone makes some wildly unsubstantiated claim about Microsoft's superiority does not make it true. Why should I believe this is anything more than PR and spin? I'm not convinced they have joined us, let alone beat us, at anything. Honestly, please explain your rationale.
Re:Out-Open-Sourcing Open Source (Score:4, Funny)
(Last Journal: Wednesday January 05 2005, @01:10PM)
No right to create modifications or derivatives of this Specification is granted herein.
I just had a flashback to when I was a kid and my Dad was giving me the old "do as I say, not as I do" lecture...
IS this relevant? Re:Out-Open-Sourcing Open Source (Score:5, Interesting)
(http://threeseas.net/ | Last Journal: Friday January 18 2002, @01:44PM)
-------- Original Message --------
Subject: [Patents] MS Office 2003 XML patented
Date: Mon, 17 Nov 2003 13:48:11 +0100
From: Carsten Svaneborg
Organization: www.mpipks-dresden.mpg.de
To: patents@aful.org
Hi! Just came across the following:
http://www.microsoft.com/mscorp/ip/format/xmlpa
Office 2003 XML Reference Schema Patent License
Microsoft may have patents and/or patent applications that are necessary for
you to license in order to make, sell, or distribute software programs that
read or write files that comply with the Microsoft specifications for the
Office Schemas.
So usage of MS Word XML files requires a patentlicense.
:
You are not licensed to distribute a Licensed Implementation under license
terms and conditions that prohibit the terms and conditions of this
license. You are not licensed to sublicense or transfer your rights.
The licence is royalty free, but GPL 7 requires the right to sublicence
patent rights to the people who obtain a GPL program from you.
so in other words Microsoft is using patents to prevent GPLed programs from
accessing the XML format that MS Word will be using.
This is very good timing, and goes to show how important it is to ensure
that the software patent directive has articles that protects
interoperativity from consituting patentinfringemet.
--
Mvh. Carsten Svaneborg
http://www.softwarepatenter.dk
The patent license terms seem reasonable... (Score:4, Informative)
(http://tomcopeland.blogs.com/)
Re:The patent license terms seem reasonable... (Score:5, Insightful)
(http://uma-thurman.zanlius.com/ | Last Journal: Monday February 09 2004, @01:26PM)
So, Microsoft will be free to continue changing their format with each new release, breaking all the open source programs for a time, causing time and trouble for users to upgrade.
We don't like Word formats because they change frequently, and they are developed in a direction that suits Microsoft. How does this change anything?
Re:The patent license terms seem reasonable... (Score:5, Insightful)
(http://www.livejournal.com/users/maxomai)
No. This is worse than before (Score:5, Insightful)
(http://fsfeurope.org/about/oriordan/index.en.html)
This schema is patented. Patents are an exclusive right to use an idea. Now if you use their format without upholding their conditions, you're a criminal, even if you figured out the format yourself.
By publishing the format, they can cast doubt on anyone that does reverse engineer it. "I bet you read the spec on line".
Also, being able to view the format isn't much use. It's XML, but that doesn't mean it will be meaningful cleartext. They can simply uuencode a big block of binary data, stick it between two tags, and it's valid XML.
Learn from the past. Microsoft are not here to do us favours.
I call bullshit (Score:5, Informative)
(http://jasonrumney.net/)
Microsoft knows full well that an XML schema cannot be patented. The patent nonsense is a way to scare off open source developers. They may hold patents on some algorithms they've used to implement this in MS Office, but we don't have to use those same algorithms to read those documents with an XML schema capable parser and do whatever we like with them.
Unfortunately, not bullshit (Score:5, Informative)
(http://fsfeurope.org/about/oriordan/index.en.html)
Some might say: "But that's a binary format."
Doesn't matter. Microsofts Office-xml format has plenty of binary data. They uuencode it so that it's official XML, but it's still encrypted or command content, not cleartext.
What if Microsoft embedded an ASF video in the word format?
They'd have to uuencode it first, then stick it in. Would this suddenly make the ASF format non-patented? no. And once parts of a format are patented, you can't recreate the whole format without negotiating a patent deal with the holder.
Yes, the law is an ass. No, you can't circumvent it with clever words.
Re:No. This is worse than before (Score:5, Informative)
Re:The patent license terms seem reasonable... (Score:5, Insightful)
(http://tomcopeland.blogs.com/)
> derivatives of the schema without permission
Hm. I guess I'm not sure what would be gained by doing that - i.e., changing the spec and republishing it. Why would that be a good thing to do, even if you could?
> Microsoft will be free to continue
> changing their format with each new
> release, breaking all the open source
> programs for a time
Right... but couldn't the same be said of any API? I mean, if the Apache plugin API [apache.org] changes, I'll need to rewrite my mod_foo module to use the new API.
Re:The patent license terms seem reasonable... (Score:5, Insightful)
(http://uma-thurman.zanlius.com/ | Last Journal: Monday February 09 2004, @01:26PM)
1) All specifications are incomplete. The requirements that it addresses today are not static, and in 10 years there will be new requirements.
2) Microsoft will change their XML schema.
3) Historically, Microsoft has done things that are in the interest of Microsoft. Everyone else must follow along.
4) Therefore, the changes that Microsoft will make the the XML schema have a high liklihood of being advantageous to Microsoft.
When Microsoft keeps all the real control of the format, it turns any open source developer into a sharecropper. We're going to be plowing a field that we don't own, and the price we pay is going to entrench the Microsoft format even further.
The patent license terms are "404 not found" (Score:5, Interesting)
Now try the link
Re:The patent license terms are "404 not found" (Score:4, Insightful)
(http://www.touset.org/)
But can the code be GPL'd? (Score:4, Interesting)
(Last Journal: Monday November 01 2004, @04:55AM)
IANAL, but I think this says no open source implementation is possible, doesn't it?
Re:But can the code be GPL'd? (Score:5, Interesting)
(http://www.cabalamat.org/weblog/current.html | Last Journal: Saturday June 28 2003, @11:29AM)
I think you are making 2 mistakes here:
(1) You say: Open Source != GNU Public License..
There's no such thing as the "GNU Public License"; you probably mean the GNU General Public License.
(2) Microsoft's license says: "You are not licensed to sublicense or transfer your rights". This means if you write a program using Microsoft's license, and license your preogram under the BSDL, then someone using your program isn't licensed to modify it. I would imagine MS have done this deliberately to sabotage open source / free software implementations of their XML schemas.
Free as in... BOW BEFORE YOUR MASTER (Score:3, Insightful)
Subject: [Patents] MS Office 2003 XML patented
Date: Mon, 17 Nov 2003 13:48:11 +0100
From: Carsten Svaneborg
Organization: www.mpipks-dresden.mpg.de
To: patents@aful.org
Hi! Just came across the following:
http://www.microsoft.com/mscorp/ip/format/xmlpa
Office 2003 XML Reference Schema Patent License
Microsoft may have patents and/or patent applications that are necessary for
you to license in order to make, sell, or distribute software programs that
read or write files that comply with the Microsoft specifications for the
Office Schemas.
So usage of MS Word XML files requires a patentlicense:
You are not licensed to distribute a Licensed Implementation under license
terms and conditions that prohibit the terms and conditions of this
license. You are not licensed to sublicense or transfer your rights.
The licence is royalty free, but GPL 7 requires the right to sublicence
patent rights to the people who obtain a GPL program from you.
so in other words Microsoft is using patents to prevent GPLed programs from
accessing the XML format that MS Word will be using.
This is very good timing, and goes to show how important it is to ensure
that the software patent directive has articles that protects
interoperativity from consituting patentinfringemet.
Possible solution (Score:5, Interesting)
Re:Free as in... BOW BEFORE YOUR MASTER (Score:4, Informative)
(http://www.nightlifemagazine.ca/ | Last Journal: Thursday March 24 2005, @12:46PM)
Anything uploaded as specs to "Infostructurebase" can not be, by their own mission statement, lock-in proprietary technology.
Check out their overview [isb.oio.dk].
--
Tired of spammers? Kill them all [si20.com]! Let the irony of this sig sort 'em out.
Could the problem possibly be in the GPL?! (Score:5, Insightful)
Microsoft is allowing you to license the patent free of charge but not to sublicense it. The GPL requires that you be allowed to sublicense patents applicable to GPLed software. And that's somehow Microsoft's fault?
GPL does not require sublicensing (Score:5, Informative)
(http://dominia.org/djao/)
Not true. Section 7 of the GPL requires that patent rights be publicly available, but it does not require that you personally sublicense those patent rights.
Specifically, GPL section 7 says:
Since the Microsoft patent license does permit royalty-free redistribution, it does not contradict the GPL in this regard (although it may have other incompatibilities; I have not looked at the whole thing thoroughly yet).Not true (Score:5, Interesting)
At long last (Score:1, Interesting)
(http://www.gentoo.org/)
*smiles*
What does this mean ... (Score:1)
(http://lanalot.com/)
Open Source Implications? (Score:3, Interesting)
From http://www.microsoft.com/mscorp/ip/format/xmlpaten tlicense.asp [microsoft.com]:
That whole page is worth reading, but doesn't this phrase in particular damage the ability to make use of the information in open source code, whether GPL or BSD?
The page also says:
Unfortunately, the page they ask you to link to doesn't actually exist...
Re:Open Source Implications? (Score:4, Insightful)
(Last Journal: Saturday May 01 2004, @04:37AM)
Any 'open' standard that imposes conditions on its use is not actually open at all. The owner can decide at any time to change the license, and this in itself should be enough reason to avoid this XML interface.
I believe these XML standards are what is technically called a "honeypot".
Of course, I may be paranoid, this may indeed be a munificent gesture by Microsoft who have realized that their XML schemas will serve the global community, add value to their products, and encourage a new generation of Office extension applications that will halt the trickle/rush/avalanche of Linux conversions.
Indeed.