xmlhack: XML developer news on the move
Front page | About | Contribute

TeXML: an XML vocabulary for TeX
[11:10, 13 Apr 2004 UTC | Oleg Paraschenko]

TeXML is an XML vocabulary for TeX. The processor transforms TeXML markup into the TeX markup, escaping special and out-of-encoding characters. Intended audience is developers who convert documents from XML to TeX.

Example of TeXML to TeX translation
TeXML TeX
<cmd name="documentclass">
  <opt>12pt</opt>
  <parm>letter</parm>
</cmd>
\documentclass[12pt]{letter}

One of the main benefits of TeXML usage is automatic translation of the TeX special symbols.

Example of translation of special TeX symbols
TeXML TeX
<TeXML>\section{No&#xa0;break}</TeXML>
$\backslash$section\{No~break\}

The TeXML processor supports different output encodings and escapes out-of-encoding chars automatically.

Example of translation of non-ASCII characters
TeXML
<TeXML>&#x422;&#x435;&#x425;</TeXML>
TeX in ASCII encoding
\cyrchar\CYRT \cyrchar\cyre \cyrchar\CYRH 
TeX in Russian encoding
TeX

Rationalities and objectives of TeXML were presented by Douglas Lovell at the Tex Users Group meeting in 1999. His paper is available online on the TUG site: "TeXML: Typesetting XML with TeX" (PDF, 180 Kb). He also wrote TeXMLatte, a Java implementation of TeXML by IBM alphaWorks, but now his realization "has been retired".

The new TeXML processor is written in Python and is published under GPL. Software can be downloaded from the SourceForge project files area.

Front Page | About | Contribute


Copyright © xmlhack 1999-2001.
info@xmlhack.com.
A Useful Production.