Type checking XML parser with Xerces; Condense stylesheet strips whitespace
Bob DuCharme has written a tutorial explaining how to get XML Schema-based
type checking working with the Xerces Java parser.
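
As a rough illustration of what schema-based type checking looks like from Java, here is a minimal sketch. It is not taken from the tutorial: it uses the standard JAXP validation API (javax.xml.validation) rather than any Xerces-specific setup, and the schema, the quantity element, and the class and method names are all hypothetical.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.XMLConstants;
import javax.xml.transform.stream.StreamSource;
import javax.xml.validation.Schema;
import javax.xml.validation.SchemaFactory;
import javax.xml.validation.Validator;
import org.xml.sax.SAXException;

public class TypeCheckSketch {
    // Hypothetical schema: one <quantity> element whose content must be an xs:integer.
    static final String XSD =
        "<xs:schema xmlns:xs='http://www.w3.org/2001/XMLSchema'>"
      + "<xs:element name='quantity' type='xs:integer'/>"
      + "</xs:schema>";

    // Returns true if the document validates against XSD, false if the
    // validator reports a schema (or well-formedness) error.
    public static boolean isValid(String xml) throws IOException {
        try {
            SchemaFactory factory =
                SchemaFactory.newInstance(XMLConstants.W3C_XML_SCHEMA_NS_URI);
            Schema schema = factory.newSchema(new StreamSource(new StringReader(XSD)));
            Validator validator = schema.newValidator();
            // With no ErrorHandler set, validation errors are thrown as SAXException.
            validator.validate(new StreamSource(new StringReader(xml)));
            return true;
        } catch (SAXException e) {
            return false;
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(isValid("<quantity>42</quantity>"));
        System.out.println(isValid("<quantity>forty-two</quantity>"));
    }
}
```

The first document should be accepted and the second rejected, because "forty-two" is not a valid xs:integer. That is the kind of check a DTD cannot express.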
Condense.xsl, by Mike Brown, is an XSLT stylesheet that strips whitespace
and comments. Because the XSLT/XPath data model does not expose the DOCTYPE
declaration or entity references to an XSL processor, applying the
stylesheet to a document also expands the entity references and removes
the DOCTYPE, so it may not be appropriate for all situations.
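
The side effect described above is not specific to Condense.xsl. The sketch below demonstrates it with a plain identity transform standing in for the real stylesheet, run through the JDK's built-in XSLT processor. The sample document, the entity name, and the class and method names are hypothetical.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

public class EntityExpansionSketch {
    // Identity transform: copies every node the XSLT data model exposes.
    // This is a stand-in for illustration, not Condense.xsl itself.
    static final String IDENTITY_XSL =
        "<xsl:stylesheet version='1.0'"
      + " xmlns:xsl='http://www.w3.org/1999/XSL/Transform'>"
      + "<xsl:template match='@*|node()'>"
      + "<xsl:copy><xsl:apply-templates select='@*|node()'/></xsl:copy>"
      + "</xsl:template>"
      + "</xsl:stylesheet>";

    // Hypothetical source with an internal DTD subset and one entity reference.
    static final String SOURCE =
        "<!DOCTYPE note [<!ENTITY who 'world'>]>"
      + "<note>Hello, &who;!</note>";

    public static String transform(String xml) throws Exception {
        Transformer transformer = TransformerFactory.newInstance()
            .newTransformer(new StreamSource(new StringReader(IDENTITY_XSL)));
        transformer.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
        StringWriter out = new StringWriter();
        transformer.transform(new StreamSource(new StringReader(xml)),
                              new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        // The parser resolves &who; before the processor sees the tree, and
        // the DOCTYPE is not part of that tree, so neither survives.
        System.out.println(transform(SOURCE));
    }
}
```

Even though the stylesheet copies everything it can see, the result should contain the text "Hello, world!" with no DOCTYPE declaration and no &who; reference. A document that relies on its DTD or on unexpanded entities would therefore be altered by any such transformation.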
The author invites others to benchmark the processing times for lengthy
stylesheets before and after removal of comments and extraneous whitespace,
and to send him the results or post them to the XSL list. He writes,
"Depending on the length of the source document, the initial parse time and
the time required for the XSL processor to create and prepare to use the
stylesheet tree before processing can be reduced by tens or hundreds of
precious milliseconds." However, he concedes that his initial benchmarks had
a wide margin of error.