TeX4ht

from Wikipedia, the free encyclopedia

TeX4ht is a powerful and very largely configurable converter from Eitan M. Gurari (* 1947; † 2009), which can convert TeX and LaTeX documents into HTML and a number of XML formats. In particular, it is also used to convert (La) TeX documents into word processing formats .

The program is published as free software under the LaTeX Project Public License (LPPL).

history

Developed in the 1990s, TeX4ht was originally only able to convert (La) TeX to HTML . This made it possible to provide scientific documents that had been prepared for print edition with the help of (La) TeX for publication on the World Wide Web for display in the web browser . In particular, the possibility of inserting references as hypertext in a document is very widely supported.

In the course of time, more and more XML- based formats were added. Currently XHTML , MathML , OpenDocument , DocBook and TEI are supported; JavaHelp can also be generated.

TeX4ht is now preconfigured in all TeX distributions.

After the death of Eitan M. Gurari, the program is continued by Radhakrishnan CV and Karl Berry.

Working method

In contrast to other converters such as LaTeX2html or TtH in particular , no attempt is made to convert the syntax of the respective markup languages . TeX4ht does not parse the (La) TeX document itself, but leaves it to (La) TeX. A DVI file is generated, which TeX4ht then further converts.

This procedure offers the advantage that basically every compiling (La) TeX document can be processed by TeX4ht. Extensions through additional packages or user-specific additions (own commands and environments, extensions for reference management with BibTeX ) are therefore basically no problem during the conversion.

Mathematical formulas and other characters that cannot be displayed as text in the target format are converted into graphics and integrated.

Using ODT , the native text format of OpenOffice.org and LibreOffice Writer , LaTeX documents can easily be converted into the doc format of Microsoft Word .

literature

  • Translating LaTeX to HTML using TeX4ht, in: Michel Goossens, Sebastian Rahtz, Eitan M. Gurari, Ross Moore, Robert S. Sutor. The LaTeX Web Companion. Integrating TeX, HTML, and XML. 1999. 8th printing January 2006. pp. 155-194.
  • Eitan Gurari, HTML Production (PDF file; 171 kB), TUGBoat 25 (2004), No. 1, pp. 39-47.

Web links

Individual evidence

  1. ^ Karl Berry, posting in the texhax mailing list , July 17, 2009; id. Posting on the mailing list texhax , November 7th 2009.
  2. The LaTeX Web Companion , pp. 169f.