First my admission: my subscription to Brian Jonesâ Blog, Open XML Formats, is treated with a cursory glance out of a general lack of interest. Every time I read it, the Microsoft elevator music keeps droning in my ears. So it is quite possible that I overlooked any mention of an Open XML initiative that matches the scale and scope of Bob Staytonâs DocBook XSL: The Complete Guide, DocBookXslStylesheets and DocBookToXhtml at the DocBook wiki.
My years long journey to hack together my software products, CleanHTML for Word 2000/2002 and CleanXHTML for Word 2003 influences my view that Microsoft âmakes it difficultâ to transform its âopenâ XML data into open formats like XHTML. And I can hear Brian Jones sigh with a noise that says, âWhat more do these people want?â After all Word 2007 âallowsâ you to:
- Save a document as HTML.
- Post an entire Word Document to a Blog using the popular Blog APIs.
But when I look at word 2007 this is whatâs not seen in my view:
- The ability to highlight a portion of a document and save that portion as HTML, XHTML, DocBookâand any other XML format that can be plugged into some kind of file conversion architecture.
- The ability to highlight a portion of a document and send that portion as HTML, XHTML, DocBookâand any other XML/REST/SOAP format that can be plugged into some kind of Web services architecture.
A single document of prose can be used like a free-form database supporting multiple targets. What we have now to address âmy problemâ (since supposedly Iâm the only guy in the world who has this problem), is Visual Studio Tools for Office. You might be able to guess that Iâve been working with this tool extensively and somehow Iâm still slouching here writing these words. And Brian Jones sighs again and he might produce a list of Word features that have been around for years like:
- The Apply transform option in the File > Save As⊠> Save as type: XML Document (*.xml) dialog.
- The Schema Library dialog with a view to Schema Solutions.
- The Templates and Add-ins dialog with the XML Expansion Packs tab.
Look at all these wonderful things Microsoft is âgivingâ to me. But somehow I still canât right click on a highlighted Range of text and find a Send as XML command. Why put this feature into a Word Processor? âNo oneâ will understand itâright?
Since Microsoft clearly has not âinnovatedâ here, why canât they in the very least copy what DocBook culture has produced and provide a CodePlex.com-like archive/compilation of all the XSL transforms we would need to freely move in and out of âOpenâ XML formats? Okay, Brian is getting winded hereâhere is his last list before he loses all patience with me (fictionally speaking):
- View revisions to a WordprocessingML document in a browser: âIt explains the usage of ASP.NET application in converting a WordprocessingML document with simple textual content and comments into HTML using XSLT.â
- MOSS: Converting documents with embedded images to web pages: âThis article presents the full source code to a simple version of such a tool. You should be able to just copy and paste the code fragments below into a single file that you hook up to your likingâŠâ
Like all previous flirting with this issue, these articles are treated as âexotic edge casesâ as long as there is no central repository like that produced by DocBook people.

