2014-11-09-displaying-bibtex-in-hakyll

Posted on November 9, 2014

Displaying bibTeX in Hakyll

My personal site has been powered by Hakyll for a while. I like being able to rsync static html to my Raspberry Pi server. It gives me fewer security issues to worry about. Being an academic, one of the reasons I have a website is to list my publications for the world. Not that the world is too interested, but it helps when someone Google’s your name. Since I started using Hakyll I’ve had to duplicate my publications data, once for use with LaTeX (the least worse system for producing a print document) and once as my publications.html on this site. The computer scientist in my hates this as it violates the Don’t Repeat Yourself principle. So, how can I get Hakyll to generate my publications.html from my BibTeX data? Read on for a solution.

The core idea here is to generate IO [Item String] so that I can use Hakyll’s unsafeCompiler to generate a Compiler [Item String] which can then list each of the items as a bibliographic entry. It’d be a bit smarter to generate [Item Bibentry] where Bibentry is some complex bibliography record, but for the moment this suffices. In this simple solution I use Parsec to parse the BibTeX file:

Yeah, error handling leaves a lot to be desired.

Given the IO [Entry.T] we need to turn them into HTML strings. For this we’ll employ the awesome Pandoc to change any LaTeX formatted strings into HTML structure.

The only complexity here is that, by default, Pandoc will generate the string \textbf{foo} as <p><strong>foo</strong></p> whereas I’d prefer <strong>foo</strong>. Hence I have to do the topDown walking of the Pandoc document structure converting any Para into a Plain. After that, it’s a case of pulling out each bibliographic entry and htmlizing it:

As we now have something suitable for unsafeCompiler we can then use Hakyll to generate a HTML file as we normally would, where the content of templates/publications.html is a simple <ul>.

A full solution fills in the blanks of the above description. As my Blog (and the rest of my life) is stored in git, I’ve made it available on github.