210 likes | 361 Vues
Fukiat Julnual [ http://www.narisa.com/blog/fuju ]. Overview of Open XML & Accessing Open XML data from Java. Fukiat Julnual. @Microsoft Thailand Platform Strategy Manager ITPro community @ narisa.com ( fuju )
E N D
Fukiat Julnual [ http://www.narisa.com/blog/fuju ] Overview of Open XML &Accessing Open XML data from Java
Fukiat Julnual • @Microsoft Thailand • Platform Strategy Manager • ITPro community • @narisa.com ( fuju ) • Voluntary Consultant (VC) in JEE , Oracle , LAMP and Microsoft technology Stacks • Participate in Blog / Discussion Forums
ECMA Office Open XML File Formats docx dotx docm dotm pptx potx pptm potm xlsx xltx xlsm xltm
Basic Components of the New Formats • Package – ZIP Container • Part – The “files” inside the ZIP • Most parts are XML • Binary files can be included • Each XML part is a discreet, compressed component • Content Types – Each part has a content type that is enforced on open • Relationships – Any part that references another part or plays a certain role in the application must do so via a relationship
How does Microsoft support its customers? Products Community Access Standards Deliberate delivery of interoperability in Microsoft products and technologies Listen to, and work with customers, partners, and competitors to build bridges and coexist Make MS technology assets available to others Participate in standards bodies and support standards in products to foster interop
Interoperability for DocumentsThe role of XML-based document formats in the Microsoft® Office system Business Process Efficiency Improved Data Access Business requirements Company standards Customer details, Costs Web service formatting intelligent receipt Efficient capture Validated information Query & extract Text Document Fragment Image Document Assembly Searching & Repurposing Content Auto-creating a polished document data XML structure Data and presentation Finding Previous RFP Content formatting Document Archival Archived & consumed long into the future without vendor-specific clients or applications
XML-based formats enable you to do things you couldn’t before • Better value from existing infrastructure • Information security • Regulatory and process compliance • Information integration • Retention, discovery and content management • Documents as digital assets – what are you worth? • Legacy documents & archives • Lifecycle cost vs. Implementation cost Documents that manage themselves Managing documents with systems
Different scenarios of XML-based formats • Past • preservation and archiving • Present • fidelity versus interchange • Future • document engineering and systems integration
Ecma Office Open XML • Specifications published by Ecma International TC-45. • Freely available for download and implementation • TC-45 is comprised of many companies, chaired by Microsoft – Apple, Toshiba, Novell, Statoil, and others • Microsoft offers the Open Specification Promise to alleviate IP-related concerns for the Open XML formats
Adoption in major Office suites • 2007 Microsoft Office system - Default Save Format is Open XML (+ free updates for Office 2000, XP, 2003) – Dec 2006/Jan 2007 • Open Office – Novell announcement of support of Open XML in Open Office – Novell edition • Corel announcement of support of Open XML - Availability mid 2007
Office XP, 2003 will open, edit and save new Office formats Will recognize new Word, Excel and PowerPoint file format extensions Enables users to Open XML Formats across multiple versions Windows 2000 SP4 and later can convert between binary and Open XML Formats Office 2007 users can change the default file format if desired Current .doc, .xls, .ppt file formats will be supported in 2007 Office system Default file format can be set by users during deployment or after Advanced policy controls for enabling and disabling the use of specific formats File Format CompatibilityEnsuring Free Document Exchange With Prior Office Releases Related Blog about Microsoft Office Compatibility Pack : http://www.narisa.com/blog/fuju/index.php?showentry=924
What difference from Apache POI Solution ? “ Apache POI - Java API To Access Microsoft Format Files The POI project consists of APIs for manipulating various file formats based upon Microsoft's OLE 2 Compound Document format using pure Java. In short, you can read and write MS Excel files using Java. Soon, you'll be able to read and write Word files using Java. POI is your Java Excel solution as well as your Java Word solution. However, we have a complete API for porting other OLE 2 Compound Document formats and welcome others to participate. OLE 2 Compound Document Format based files include most Microsoft Office files such as XLS and DOC as well as MFC serialization API based file formats. “ (Source : http://poi.apache.org/ )
Open XML Interoperability * Also includes abstractions for OPC concepts (Open Packaging Convention)
OpenXmlDeveloper.org • Formed by 40 companies to share developer information about the Office Open XML file formats • Articles with full source code for C#, VB, Java, XSLT • Forums for posting technical questions
NetBeans and the samples of Java and Open XML file format from [ http://openxmldeveloper.org/articles/OpenXMLandJava.aspx ] Demo
Fukiat Julnual http://www.narisa.com/blog/fuju
www.microsoft.com/office/preview www.OpenXMLDeveloper.org www.ecma-international.org msdn.microsoft.com/office/xml www.microsoft.com/technet/prodtechnol/office For more information