XML Support in APR and Apache

Recently the subject of bundling or non-bundling of expat within APR and Apache HTTPD (the web server) re-emerged on the dev list.  I’ve always been against bundling: it’s a third-party library and should be a dependency.  We’ve moved gradually towards that, but current practice includes bundling it in an optional dependencies package.

APR’s use of expat is in practice pretty limited and straightforward: the core does nothing very demanding with XML.  And in practice, when applications such as Apache Modules need to work with XML, expat is often too limiting.  So modules need to introduce an alternative XML library.  The most usual choice is libxml2 as in, for example, mod_proxy_html, mod_transform, and mod_security.

Libxml2 is not just a much bigger and more powerful library than expat, it’s also very nearly a drop-in replacement.  In particular, it provides a compatible SAX API.  So if we could use it in place of libxml2 in APR we have a win-win for web servers (and other applications) involving libxml2: replace expat in APR, and load just the one XML library instead of two.  At the same time, we don’t want to impose libxml2 as a dependency on APR applications that have no need for it.

So this week I’ve finally got around to rewriting APR’s XML module to decouple the parser and use either expat or libxml2.  The choice of XML parser is now available at compile time.  While libxml2 support should be considered experimental for the time being, it should become the preferred option for users of applications requiring it, potentially simplifying your configuration and reducing your  footprint.

For the time being, anyone interested will need to download APR from trunk.

Posted on March 25, 2011, in apache, xml. Bookmark the permalink. 1 Comment.

Leave a comment