Features and System Requirements
Import formats
- RTF documents conforming to the RTF
specification 1.8 (including style info)
- Word binary documents (using WordLink; only when running on Windows platform with Word installed)
- any well-formed XML document (no style info)
Processing features
- run Upcast Processing Language (UPL) code on documents
- transform documents using XSLT (XSLT 1: Xalan, XSLT 2: Saxon-B)
- integrate external tools via commandline call
- create complex, nested groupings using UPL and the Grouper module
- create markup based on regular expressions, even across element boundaries
- create markup based on style information
- validate against XML DTD, XML Schema and Relax NG
- in-place transformations, attach custom data to any node
- list detection heuristics, supporting nested lists
- create sub-processing pipelines
- perform detailed logging, handle processing errors
- sophisticated handling of embedded and referenced images
Export formats
- XML (upCast DTD)
- XML (internal tree format)
- RTF (Version 1.6)
- CSS
Interfaces
- GUI based on Swing
- Commandline
- Java API
System requirements
- Java 7 runtime environment or later
- 1400 MB of free RAM (recommended)
- environment compatible and running with Xerces 2.11.0 or later (Xerces 2.11.0 is delivered with upCast binary)