Review: Davisor Offisor 1.5.1

by Drew Falkman

Summary

Sometimes it's the most mundane, seemingly basic tasks that end up taking the most time and effort. I've found this to be particularly true of content management, especially when it comes to getting Microsoft Word documents to work on Web sites. Davisor Offisor is a Java library that helps developers convert Word documents into a format that is easier to work with: eXtensible Markup Language (XML). In this review, I take a look at Offisor to see whether it can help with that kind of development.

Introduction

As Internet technology has evolved, so too have document formats: we now have PDF, a pretty solid HTML standard and XML. In theory, as more end-users move towards universal document formats, content management should get easier. Unfortunately, in most circumstances all of these newer formats require special tools or technical understanding. And let's be realistic: most people still use Microsoft Word. Anyone who has worked with Word documents, or even with the HTML/XML output that Word produces, knows that this is not an easy format to work with. Tools like Macromedia Dreamweaver MX even have special processes to, as Dreamweaver calls it, "Clean up Word HTML". Microsoft seems to be addressing this issue by adding significant XML support in Office 2003, but many users are still using Word 2002, 2000, 97 and earlier, or don't have the understanding (or the inclination to obtain it) necessary to work with XML. It is in this arena that Davisor Offisor can help.

How Offisor Works

One of the nice things about Offisor is that it doesn't require any proprietary plug-ins or libraries, such as you might expect when working with Microsoft formats. Offisor will work in any native Java application, on Windows, Linux or whatever. The only requirement is a SAX (1 or 2) compatible XML parser. In version 1.5.1, Offisor handles two basic types of files: standard Word docs (versions 6, 95, 97 and 2000, and, though undocumented, I had luck with 2002) and "real-world" HTML files. The real-world HTML parser is a nice addition to the package, as it will parse looser and sloppier (as Davisor calls it, "almost-but-not-quite compliant") HTML into XML, allowing developers to use XML as a single storage format for any HTML and Word documents that are imported into an application.

Using Offisor is straightforward, to say the least. Two primary classes are used to parse documents: com.davisor.ms.doc.DocParser and com.davisor.xml.html.HTMLParser. As you have probably surmised, these process Word docs and HTML documents, respectively. The examples included with Offisor are quite handy and provide a good look at how to use the API to transform documents. Additionally, the API is quite comprehensive, and a number of core classes include utilities, interfaces and exceptions that you can use when coding with Offisor.
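To give a feel for what SAX-style processing looks like, here is a minimal sketch of a handler that collects the text content of a document. It does not call Offisor at all: this review does not show the method signatures of DocParser or HTMLParser, so the JDK's built-in SAX parser stands in for them, and the class name SaxTextCollector and the sample XML are hypothetical. The sketch assumes that Offisor's output can be consumed by an ordinary SAX handler like this one.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper: gathers all character data reported by a SAX parser.
final class SaxTextCollector extends DefaultHandler {
    private final StringBuilder text = new StringBuilder();

    // Called by the parser for each run of character data in the document.
    @Override
    public void characters(char[] ch, int start, int length) {
        text.append(ch, start, length);
    }

    // Parses the given XML string with the JDK's SAX parser (a stand-in
    // for an Offisor parser) and returns the concatenated text content.
    static String extractText(String xml) throws Exception {
        SAXParser parser = SAXParserFactory.newInstance().newSAXParser();
        SaxTextCollector handler = new SaxTextCollector();
        parser.parse(new InputSource(new StringReader(xml)), handler);
        return handler.text.toString();
    }

    public static void main(String[] args) throws Exception {
        // Made-up sample input; the real Offisor output format is described
        // in the documentation that ships with the product.
        String xml = "<document><p>Hello, </p><p>Word</p></document>";
        System.out.println(extractText(xml));
    }
}
```

A handler like this keeps the application code independent of where the events come from, which is the main appeal of a tool that only asks for a SAX-compatible parser.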

Setup, Installation and Documentation

Setting up Offisor on my computer was a simple task. The zip I downloaded included a WAR file, which I deployed on my JRun 4 server. Everything worked on the first try! The download also includes the examples and a good bit of documentation: the Offisor user's manual, the API docs, a guide for obfuscating Offisor code (for developers who want to include this code in a larger software package), information about the output XML format, some sample transformation style documents and a version history. Frankly, this was more than I expected from a relatively simple tool (from an implementation standpoint, at least).
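Once a document is in XML, a stylesheet can turn it into whatever a site needs. The sketch below applies an XSLT stylesheet with the standard JAXP API that ships with the JDK. It is only an illustration under stated assumptions: the element names document and para, the stylesheet itself and the class name XmlToHtml are all hypothetical, and they do not reflect Offisor's actual output format or the sample transformation documents in the download.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

// Hypothetical helper: converts a made-up XML document format to HTML.
final class XmlToHtml {
    // Assumed stylesheet: wraps the document in a div and maps each
    // hypothetical <para> element to an HTML <p> element.
    static final String STYLESHEET =
        "<xsl:stylesheet version='1.0' "
        + "xmlns:xsl='http://www.w3.org/1999/XSL/Transform'>"
        + "<xsl:output method='html' indent='no'/>"
        + "<xsl:template match='/document'>"
        + "<div><xsl:apply-templates select='para'/></div>"
        + "</xsl:template>"
        + "<xsl:template match='para'>"
        + "<p><xsl:value-of select='.'/></p>"
        + "</xsl:template>"
        + "</xsl:stylesheet>";

    // Runs the stylesheet over the given XML string and returns the result.
    static String transform(String xml) throws Exception {
        Transformer transformer = TransformerFactory.newInstance()
            .newTransformer(new StreamSource(new StringReader(STYLESHEET)));
        StringWriter out = new StringWriter();
        transformer.transform(
            new StreamSource(new StringReader(xml)), new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        String xml = "<document><para>First</para><para>Second</para></document>";
        System.out.println(transform(xml));
    }
}
```

Keeping the stylesheet separate from the parsing step means the same stored XML could be rendered in different ways without converting the original Word or HTML file again.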
